TY - UNPB
T1 - Assessing the importance of feature selection in Landslide Susceptibility for Belluno province (Veneto Region, NE Italy)
AU - Meena, S.R.
AU - Puliero, Silvia
AU - Bhuyan, Kushanav
AU - Floris, Mario
AU - Catani, Filippo
PY - 2021/11/18
Y1 - 2021/11/18
N2 - In the domain of landslide risk science, landslide susceptibility mapping (LSM) is very important as it helps spatially identify potential landslide-prone regions. This study used a statistical ensemble model (Frequency Ratio and Evidence Belief Function) and two machine learning (ML) models (Random Forest and XG-Boost) for LSM in the Belluno province (Veneto Region, NE Italy). The study investigated the importance of the conditioning factors in predicting landslide occurrences using the mentioned models. In this paper, we evaluated the importance of the conditioning factors (features) in the overall prediction capabilities of the statistical and ML algorithms. By the trial-and-error method, we eliminated the least "important" features by using a common threshold. Conclusively, we found that removing the least "important" features does not impact the overall accuracy of the LSM for all three models. Based on the results of our study, the most commonly available features, for example, the topographic features, contributes to comparable results after removing the least "important" ones. This confirms that the requirement for the important factor maps can be assessed based on the physiography of the region. Based on the analysis of the three models, it was observed that most commonly available feature data can be useful for carrying out LSM at regional scale, eliminating the least available ones in most of the use cases due to data scarcity. Identifying LSMs at regional scale has implications for understanding landslide phenomena in the region and post-event relief measures, planning disaster risk reduction, mitigation, and evaluating potentially affected areas.
AB - In the domain of landslide risk science, landslide susceptibility mapping (LSM) is very important as it helps spatially identify potential landslide-prone regions. This study used a statistical ensemble model (Frequency Ratio and Evidence Belief Function) and two machine learning (ML) models (Random Forest and XG-Boost) for LSM in the Belluno province (Veneto Region, NE Italy). The study investigated the importance of the conditioning factors in predicting landslide occurrences using the mentioned models. In this paper, we evaluated the importance of the conditioning factors (features) in the overall prediction capabilities of the statistical and ML algorithms. By the trial-and-error method, we eliminated the least "important" features by using a common threshold. Conclusively, we found that removing the least "important" features does not impact the overall accuracy of the LSM for all three models. Based on the results of our study, the most commonly available features, for example, the topographic features, contributes to comparable results after removing the least "important" ones. This confirms that the requirement for the important factor maps can be assessed based on the physiography of the region. Based on the analysis of the three models, it was observed that most commonly available feature data can be useful for carrying out LSM at regional scale, eliminating the least available ones in most of the use cases due to data scarcity. Identifying LSMs at regional scale has implications for understanding landslide phenomena in the region and post-event relief measures, planning disaster risk reduction, mitigation, and evaluating potentially affected areas.
KW - ITC-GOLD
U2 - 10.5194/nhess-2021-299
DO - 10.5194/nhess-2021-299
M3 - Preprint
BT - Assessing the importance of feature selection in Landslide Susceptibility for Belluno province (Veneto Region, NE Italy)
PB - Copernicus
ER -