{"title":"An improved buffer-controlled sampling strategy for landslide susceptibility assessment considering the spatial heterogeneity of conditioning factors","authors":"Lei-Lei Liu, Hao Xiao, Yi-Li Zhang, Can Yang","doi":"10.1007/s10064-024-04008-x","DOIUrl":null,"url":null,"abstract":"<div><p>The selection of landslide and non-landslide samples significantly influences the performance of machine learning (ML)-based landslide susceptibility assessment (LSA). The commonly used buffer-controlled sampling (BCS) strategy for selecting non-landslide samples overlooks the spatial heterogeneity of the geological environment and lacks a standardized method for determining buffer radius. As a result, the sampling process introduces significant uncertainty to ML models. This paper proposes an improved BCS strategy that incorporates the spatial heterogeneity of conditioning factors to address this issue. The proposed strategy generates a buffer zone for each landslide by merging all neighboring areas with the same attributes as the landslide and then calculates the average equivalent radius of those zones for comparative analysis. The random forest (RF) and the support vector machine (SVM) models are employed to predict the landslide susceptibility of Taojiang County, China, using both the improved and the traditional BCS strategy. Furthermore, the impact of different buffer radii on the model prediction is thoroughly investigated to provide guidance for the selection of buffer radius. The results demonstrate that a buffer radius of less than 3,000 m is optimal in Taojiang County. Compared with the traditional RF and SVM model, the corresponding improved models exhibit superior performance, with higher AUC values and increased peak frequency ratios in areas of very high susceptibility. These findings confirm the effectiveness of the proposed strategy, offering valuable guidance for buffer radius selection and improving the ML-based LSA.</p></div>","PeriodicalId":500,"journal":{"name":"Bulletin of Engineering Geology and the Environment","volume":"83 12","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of Engineering Geology and the Environment","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10064-024-04008-x","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
The selection of landslide and non-landslide samples significantly influences the performance of machine learning (ML)-based landslide susceptibility assessment (LSA). The commonly used buffer-controlled sampling (BCS) strategy for selecting non-landslide samples overlooks the spatial heterogeneity of the geological environment and lacks a standardized method for determining buffer radius. As a result, the sampling process introduces significant uncertainty to ML models. This paper proposes an improved BCS strategy that incorporates the spatial heterogeneity of conditioning factors to address this issue. The proposed strategy generates a buffer zone for each landslide by merging all neighboring areas with the same attributes as the landslide and then calculates the average equivalent radius of those zones for comparative analysis. The random forest (RF) and the support vector machine (SVM) models are employed to predict the landslide susceptibility of Taojiang County, China, using both the improved and the traditional BCS strategy. Furthermore, the impact of different buffer radii on the model prediction is thoroughly investigated to provide guidance for the selection of buffer radius. The results demonstrate that a buffer radius of less than 3,000 m is optimal in Taojiang County. Compared with the traditional RF and SVM model, the corresponding improved models exhibit superior performance, with higher AUC values and increased peak frequency ratios in areas of very high susceptibility. These findings confirm the effectiveness of the proposed strategy, offering valuable guidance for buffer radius selection and improving the ML-based LSA.
期刊介绍:
Engineering geology is defined in the statutes of the IAEG as the science devoted to the investigation, study and solution of engineering and environmental problems which may arise as the result of the interaction between geology and the works or activities of man, as well as of the prediction of and development of measures for the prevention or remediation of geological hazards. Engineering geology embraces:
• the applications/implications of the geomorphology, structural geology, and hydrogeological conditions of geological formations;
• the characterisation of the mineralogical, physico-geomechanical, chemical and hydraulic properties of all earth materials involved in construction, resource recovery and environmental change;
• the assessment of the mechanical and hydrological behaviour of soil and rock masses;
• the prediction of changes to the above properties with time;
• the determination of the parameters to be considered in the stability analysis of engineering works and earth masses.