基于堆积集合学习和夏普利加法解释的岩石抗压强度预测模型

IF 3.7 2区 工程技术 Q3 ENGINEERING, ENVIRONMENTAL Bulletin of Engineering Geology and the Environment Pub Date : 2024-10-14 DOI:10.1007/s10064-024-03896-3
Luyuan Wu, Jianhui Li, Jianwei Zhang, Zifa Wang, Jingbo Tong, Fei Ding, Meng Li, Yi Feng, Hui Li
{"title":"基于堆积集合学习和夏普利加法解释的岩石抗压强度预测模型","authors":"Luyuan Wu,&nbsp;Jianhui Li,&nbsp;Jianwei Zhang,&nbsp;Zifa Wang,&nbsp;Jingbo Tong,&nbsp;Fei Ding,&nbsp;Meng Li,&nbsp;Yi Feng,&nbsp;Hui Li","doi":"10.1007/s10064-024-03896-3","DOIUrl":null,"url":null,"abstract":"<div><p>Accurately predicting the compressive strength of rock (RCS) is crucial for the construction and maintenance of rock engineering. However, RCS prediction based on single machine learning (ML) algorithms often face issues such as parameter sensitivity and inadequate generalization. To address these challenges, a new (RCS) prediction model based on a stacking ensemble learning method was proposed. This method combines multiple ML algorithms to achieve more accurate and stable prediction results. Firstly, 442 sets of rock mechanics experimental data were collected to form the prediction dataset, and data preprocessing techniques, including missing value imputation and normalization, were applied for data cleaning and standardization. Secondly, nine classic ML algorithms were used to establish RCS prediction models, and the optimal configurations were determined using k-fold cross-validation and Bayesian optimization. The selected base learners were LightGBM, Random Forest, and XGBoost, and the meta-learners were Ridge, Lasso, and Linear Regression. Finally, the models were verified using the testset, and the comparison showed that the proposed stacking models were better than all single models. Notably, the Stacking-LR model exhibited the best predictive accuracy(R<sup><b>2</b></sup>=0.946, MAE=5.59, MAPE=9.94<b>%</b>). Furthermore, the Shapley Additive exPlanations (SHAP) method was introduced to analyze the impact and dependencies of input features on the prediction results. It was found that both Young’s modulus and confining pressure are the most critical parameters influencing RCS and exert a positive impact on the prediction results. This finding is consistent with domain expert knowledge, enhances the model’s interpretability, and provides robust support for the predicted results.</p></div>","PeriodicalId":500,"journal":{"name":"Bulletin of Engineering Geology and the Environment","volume":"83 11","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction model for the compressive strength of rock based on stacking ensemble learning and shapley additive explanations\",\"authors\":\"Luyuan Wu,&nbsp;Jianhui Li,&nbsp;Jianwei Zhang,&nbsp;Zifa Wang,&nbsp;Jingbo Tong,&nbsp;Fei Ding,&nbsp;Meng Li,&nbsp;Yi Feng,&nbsp;Hui Li\",\"doi\":\"10.1007/s10064-024-03896-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Accurately predicting the compressive strength of rock (RCS) is crucial for the construction and maintenance of rock engineering. However, RCS prediction based on single machine learning (ML) algorithms often face issues such as parameter sensitivity and inadequate generalization. To address these challenges, a new (RCS) prediction model based on a stacking ensemble learning method was proposed. This method combines multiple ML algorithms to achieve more accurate and stable prediction results. Firstly, 442 sets of rock mechanics experimental data were collected to form the prediction dataset, and data preprocessing techniques, including missing value imputation and normalization, were applied for data cleaning and standardization. Secondly, nine classic ML algorithms were used to establish RCS prediction models, and the optimal configurations were determined using k-fold cross-validation and Bayesian optimization. The selected base learners were LightGBM, Random Forest, and XGBoost, and the meta-learners were Ridge, Lasso, and Linear Regression. Finally, the models were verified using the testset, and the comparison showed that the proposed stacking models were better than all single models. Notably, the Stacking-LR model exhibited the best predictive accuracy(R<sup><b>2</b></sup>=0.946, MAE=5.59, MAPE=9.94<b>%</b>). Furthermore, the Shapley Additive exPlanations (SHAP) method was introduced to analyze the impact and dependencies of input features on the prediction results. It was found that both Young’s modulus and confining pressure are the most critical parameters influencing RCS and exert a positive impact on the prediction results. This finding is consistent with domain expert knowledge, enhances the model’s interpretability, and provides robust support for the predicted results.</p></div>\",\"PeriodicalId\":500,\"journal\":{\"name\":\"Bulletin of Engineering Geology and the Environment\",\"volume\":\"83 11\",\"pages\":\"\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bulletin of Engineering Geology and the Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10064-024-03896-3\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of Engineering Geology and the Environment","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10064-024-03896-3","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

准确预测岩石抗压强度(RCS)对于岩石工程的施工和维护至关重要。然而,基于单一机器学习(ML)算法的 RCS 预测往往面临参数敏感性和泛化不足等问题。为了应对这些挑战,我们提出了一种基于堆叠集合学习方法的新型(RCS)预测模型。该方法结合了多种 ML 算法,以获得更准确、更稳定的预测结果。首先,收集了 442 组岩石力学实验数据组成预测数据集,并采用缺失值估算和归一化等数据预处理技术进行数据清理和标准化。其次,采用九种经典的 ML 算法建立 RCS 预测模型,并通过 k 倍交叉验证和贝叶斯优化确定最佳配置。选定的基础学习器为 LightGBM、Random Forest 和 XGBoost,元学习器为 Ridge、Lasso 和线性回归。最后,使用测试集对这些模型进行了验证,比较结果表明,所提出的堆叠模型优于所有单一模型。值得注意的是,Stacking-LR 模型的预测准确率最高(R2=0.946,MAE=5.59,MAPE=9.94%)。此外,还引入了 Shapley Additive exPlanations(SHAP)方法来分析输入特征对预测结果的影响和依赖性。结果发现,杨氏模量和约束压力是影响 RCS 的最关键参数,对预测结果有积极影响。这一发现与领域专家的知识一致,增强了模型的可解释性,并为预测结果提供了强有力的支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Prediction model for the compressive strength of rock based on stacking ensemble learning and shapley additive explanations

Accurately predicting the compressive strength of rock (RCS) is crucial for the construction and maintenance of rock engineering. However, RCS prediction based on single machine learning (ML) algorithms often face issues such as parameter sensitivity and inadequate generalization. To address these challenges, a new (RCS) prediction model based on a stacking ensemble learning method was proposed. This method combines multiple ML algorithms to achieve more accurate and stable prediction results. Firstly, 442 sets of rock mechanics experimental data were collected to form the prediction dataset, and data preprocessing techniques, including missing value imputation and normalization, were applied for data cleaning and standardization. Secondly, nine classic ML algorithms were used to establish RCS prediction models, and the optimal configurations were determined using k-fold cross-validation and Bayesian optimization. The selected base learners were LightGBM, Random Forest, and XGBoost, and the meta-learners were Ridge, Lasso, and Linear Regression. Finally, the models were verified using the testset, and the comparison showed that the proposed stacking models were better than all single models. Notably, the Stacking-LR model exhibited the best predictive accuracy(R2=0.946, MAE=5.59, MAPE=9.94%). Furthermore, the Shapley Additive exPlanations (SHAP) method was introduced to analyze the impact and dependencies of input features on the prediction results. It was found that both Young’s modulus and confining pressure are the most critical parameters influencing RCS and exert a positive impact on the prediction results. This finding is consistent with domain expert knowledge, enhances the model’s interpretability, and provides robust support for the predicted results.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Bulletin of Engineering Geology and the Environment
Bulletin of Engineering Geology and the Environment 工程技术-地球科学综合
CiteScore
7.10
自引率
11.90%
发文量
445
审稿时长
4.1 months
期刊介绍: Engineering geology is defined in the statutes of the IAEG as the science devoted to the investigation, study and solution of engineering and environmental problems which may arise as the result of the interaction between geology and the works or activities of man, as well as of the prediction of and development of measures for the prevention or remediation of geological hazards. Engineering geology embraces: • the applications/implications of the geomorphology, structural geology, and hydrogeological conditions of geological formations; • the characterisation of the mineralogical, physico-geomechanical, chemical and hydraulic properties of all earth materials involved in construction, resource recovery and environmental change; • the assessment of the mechanical and hydrological behaviour of soil and rock masses; • the prediction of changes to the above properties with time; • the determination of the parameters to be considered in the stability analysis of engineering works and earth masses.
期刊最新文献
Creep mechanism of landslide formation in rock with bedding and weak layers in Zezhou, Shanxi, China Effects of initial water and salt content on permeability and microstructure of sodic-saline loessal soils Experimental investigation and fractional elastoplastic damage constitutive modelling of gray sandstone under loading disturbance Probabilistic landslide-generated impulse waves estimation in mountain reservoirs, a case study A novel data-driven hybrid intelligent prediction model for reservoir landslide displacement
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1