Prediction of slope failure probability based on machine learning with genetic-ADASYN algorithm

IF 6.9 1区 工程技术 Q1 ENGINEERING, GEOLOGICAL Engineering Geology Pub Date : 2024-12-26 DOI:10.1016/j.enggeo.2024.107885
Kai Zhong, Xiaohui Tan, Shanwei Liu, Zhitang Lu, Xiaoliang Hou, Qiao Wang
{"title":"Prediction of slope failure probability based on machine learning with genetic-ADASYN algorithm","authors":"Kai Zhong,&nbsp;Xiaohui Tan,&nbsp;Shanwei Liu,&nbsp;Zhitang Lu,&nbsp;Xiaoliang Hou,&nbsp;Qiao Wang","doi":"10.1016/j.enggeo.2024.107885","DOIUrl":null,"url":null,"abstract":"<div><div>In geotechnical engineering, precise probabilistic assessment of slope stability is crucial for risk management and the safe operation of engineering projects. To perform probabilistic assessments of slopes accurately and efficiently, six machine learning (ML) algorithms, including Logistic Regression (LR), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), Extremely Randomized Tree (ERT), and Gradient Boosting Tree (GBT), are adopted to establish surrogate models for the relationship between a slope's safety factor and soil's shear parameters. Latin Hypercube Sampling (LHS) is employed to generate training samples for constructing surrogate models using ML algorithms. Adaptive Synthetic Sampling (ADASYN) is used to balance the number of samples of failure and safety classes by generating synthetic samples for the failure sample set, and a Genetic algorithm (GA) is used to optimize the hyper-parameters of ML and ADASYN algorithms to improve the accuracy of the surrogate models. Two criteria are proposed to measure the accuracy of surrogate models built using ML with Genetic-ADASYN algorithms, and a formula is presented to estimate the optimal number of samples for the training set. Based on the surrogate models, reliability indices and failure probabilities of slopes can be readily estimated using the Monte Carlo Simulation Method (MCSM). Case studies of five slopes with different complexities are adopted to illustrate the proposed method for the probabilistic analysis of slope stability and to compare the accuracy of surrogate models built using different ML algorithms. Results show that the Genetic-ADASYN algorithm can significantly improve the accuracy of surrogate models built using ML algorithms, and among the six ML algorithms, GBT is the best ML algorithm due to its generalizability and accuracy in slope stability prediction problems. The research findings can provide a reference for risk assessment of slope engineering and help to improve the accuracy and efficiency of probabilistic slope stability analysis.</div></div>","PeriodicalId":11567,"journal":{"name":"Engineering Geology","volume":"346 ","pages":"Article 107885"},"PeriodicalIF":6.9000,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Geology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S001379522400485X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, GEOLOGICAL","Score":null,"Total":0}
引用次数: 0

Abstract

In geotechnical engineering, precise probabilistic assessment of slope stability is crucial for risk management and the safe operation of engineering projects. To perform probabilistic assessments of slopes accurately and efficiently, six machine learning (ML) algorithms, including Logistic Regression (LR), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), Extremely Randomized Tree (ERT), and Gradient Boosting Tree (GBT), are adopted to establish surrogate models for the relationship between a slope's safety factor and soil's shear parameters. Latin Hypercube Sampling (LHS) is employed to generate training samples for constructing surrogate models using ML algorithms. Adaptive Synthetic Sampling (ADASYN) is used to balance the number of samples of failure and safety classes by generating synthetic samples for the failure sample set, and a Genetic algorithm (GA) is used to optimize the hyper-parameters of ML and ADASYN algorithms to improve the accuracy of the surrogate models. Two criteria are proposed to measure the accuracy of surrogate models built using ML with Genetic-ADASYN algorithms, and a formula is presented to estimate the optimal number of samples for the training set. Based on the surrogate models, reliability indices and failure probabilities of slopes can be readily estimated using the Monte Carlo Simulation Method (MCSM). Case studies of five slopes with different complexities are adopted to illustrate the proposed method for the probabilistic analysis of slope stability and to compare the accuracy of surrogate models built using different ML algorithms. Results show that the Genetic-ADASYN algorithm can significantly improve the accuracy of surrogate models built using ML algorithms, and among the six ML algorithms, GBT is the best ML algorithm due to its generalizability and accuracy in slope stability prediction problems. The research findings can provide a reference for risk assessment of slope engineering and help to improve the accuracy and efficiency of probabilistic slope stability analysis.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于遗传- adasyn算法的机器学习边坡破坏概率预测
在岩土工程中,边坡稳定性的精确概率评估对于风险管理和工程项目的安全运行至关重要。为了准确有效地对边坡进行概率评估,采用Logistic回归(LR)、支持向量机(SVM)、决策树(DT)、随机森林(RF)、极度随机树(ERT)和梯度提升树(GBT)等6种机器学习(ML)算法,建立边坡安全系数与土体剪切参数关系的代理模型。采用拉丁超立方体采样(LHS)生成训练样本,用于使用ML算法构建代理模型。采用自适应合成采样(Adaptive Synthetic Sampling, ADASYN)方法,通过生成故障样本集的合成样本来平衡故障和安全类别的样本数量,并采用遗传算法(Genetic algorithm, GA)对ML和ADASYN算法的超参数进行优化,提高代理模型的准确性。提出了两个标准来衡量使用ML与Genetic-ADASYN算法建立的代理模型的准确性,并提出了一个公式来估计训练集的最佳样本数量。在此基础上,利用蒙特卡罗模拟方法(MCSM)可以方便地估计边坡的可靠度指标和失效概率。以5个不同复杂程度的边坡为例,对所提出的边坡稳定性概率分析方法进行了说明,并比较了采用不同ML算法建立的代理模型的准确性。结果表明,遗传- adasyn算法可以显著提高ML算法建立的代理模型的准确性,在6种ML算法中,GBT算法在边坡稳定性预测问题上具有通用性和准确性,是最好的ML算法。研究结果可为边坡工程风险评估提供参考,有助于提高概率边坡稳定性分析的准确性和效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Engineering Geology
Engineering Geology 地学-地球科学综合
CiteScore
13.70
自引率
12.20%
发文量
327
审稿时长
5.6 months
期刊介绍: Engineering Geology, an international interdisciplinary journal, serves as a bridge between earth sciences and engineering, focusing on geological and geotechnical engineering. It welcomes studies with relevance to engineering, environmental concerns, and safety, catering to engineering geologists with backgrounds in geology or civil/mining engineering. Topics include applied geomorphology, structural geology, geophysics, geochemistry, environmental geology, hydrogeology, land use planning, natural hazards, remote sensing, soil and rock mechanics, and applied geotechnical engineering. The journal provides a platform for research at the intersection of geology and engineering disciplines.
期刊最新文献
Editorial Board Multi-temporal landslide inventory mapping after wildfire and implications for post-fire debris flow activity Editorial Board Rill erosion in post-seismic watershed – A non-negligible transporting way of fluvial sediment 3D high-density ambient noise imaging of the Nankou-Sunhe buried active fault in Beijing
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1