Luyuan Wu, Jianhui Li, Jianwei Zhang, Zifa Wang, Jingbo Tong, Fei Ding, Meng Li, Yi Feng, Hui Li
{"title":"基于堆积集合学习和夏普利加法解释的岩石抗压强度预测模型","authors":"Luyuan Wu, Jianhui Li, Jianwei Zhang, Zifa Wang, Jingbo Tong, Fei Ding, Meng Li, Yi Feng, Hui Li","doi":"10.1007/s10064-024-03896-3","DOIUrl":null,"url":null,"abstract":"<div><p>Accurately predicting the compressive strength of rock (RCS) is crucial for the construction and maintenance of rock engineering. However, RCS prediction based on single machine learning (ML) algorithms often face issues such as parameter sensitivity and inadequate generalization. To address these challenges, a new (RCS) prediction model based on a stacking ensemble learning method was proposed. This method combines multiple ML algorithms to achieve more accurate and stable prediction results. Firstly, 442 sets of rock mechanics experimental data were collected to form the prediction dataset, and data preprocessing techniques, including missing value imputation and normalization, were applied for data cleaning and standardization. Secondly, nine classic ML algorithms were used to establish RCS prediction models, and the optimal configurations were determined using k-fold cross-validation and Bayesian optimization. The selected base learners were LightGBM, Random Forest, and XGBoost, and the meta-learners were Ridge, Lasso, and Linear Regression. Finally, the models were verified using the testset, and the comparison showed that the proposed stacking models were better than all single models. Notably, the Stacking-LR model exhibited the best predictive accuracy(R<sup><b>2</b></sup>=0.946, MAE=5.59, MAPE=9.94<b>%</b>). Furthermore, the Shapley Additive exPlanations (SHAP) method was introduced to analyze the impact and dependencies of input features on the prediction results. It was found that both Young’s modulus and confining pressure are the most critical parameters influencing RCS and exert a positive impact on the prediction results. This finding is consistent with domain expert knowledge, enhances the model’s interpretability, and provides robust support for the predicted results.</p></div>","PeriodicalId":500,"journal":{"name":"Bulletin of Engineering Geology and the Environment","volume":"83 11","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction model for the compressive strength of rock based on stacking ensemble learning and shapley additive explanations\",\"authors\":\"Luyuan Wu, Jianhui Li, Jianwei Zhang, Zifa Wang, Jingbo Tong, Fei Ding, Meng Li, Yi Feng, Hui Li\",\"doi\":\"10.1007/s10064-024-03896-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Accurately predicting the compressive strength of rock (RCS) is crucial for the construction and maintenance of rock engineering. However, RCS prediction based on single machine learning (ML) algorithms often face issues such as parameter sensitivity and inadequate generalization. To address these challenges, a new (RCS) prediction model based on a stacking ensemble learning method was proposed. This method combines multiple ML algorithms to achieve more accurate and stable prediction results. Firstly, 442 sets of rock mechanics experimental data were collected to form the prediction dataset, and data preprocessing techniques, including missing value imputation and normalization, were applied for data cleaning and standardization. Secondly, nine classic ML algorithms were used to establish RCS prediction models, and the optimal configurations were determined using k-fold cross-validation and Bayesian optimization. The selected base learners were LightGBM, Random Forest, and XGBoost, and the meta-learners were Ridge, Lasso, and Linear Regression. Finally, the models were verified using the testset, and the comparison showed that the proposed stacking models were better than all single models. Notably, the Stacking-LR model exhibited the best predictive accuracy(R<sup><b>2</b></sup>=0.946, MAE=5.59, MAPE=9.94<b>%</b>). Furthermore, the Shapley Additive exPlanations (SHAP) method was introduced to analyze the impact and dependencies of input features on the prediction results. It was found that both Young’s modulus and confining pressure are the most critical parameters influencing RCS and exert a positive impact on the prediction results. This finding is consistent with domain expert knowledge, enhances the model’s interpretability, and provides robust support for the predicted results.</p></div>\",\"PeriodicalId\":500,\"journal\":{\"name\":\"Bulletin of Engineering Geology and the Environment\",\"volume\":\"83 11\",\"pages\":\"\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bulletin of Engineering Geology and the Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10064-024-03896-3\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of Engineering Geology and the Environment","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10064-024-03896-3","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
摘要
准确预测岩石抗压强度(RCS)对于岩石工程的施工和维护至关重要。然而,基于单一机器学习(ML)算法的 RCS 预测往往面临参数敏感性和泛化不足等问题。为了应对这些挑战,我们提出了一种基于堆叠集合学习方法的新型(RCS)预测模型。该方法结合了多种 ML 算法,以获得更准确、更稳定的预测结果。首先,收集了 442 组岩石力学实验数据组成预测数据集,并采用缺失值估算和归一化等数据预处理技术进行数据清理和标准化。其次,采用九种经典的 ML 算法建立 RCS 预测模型,并通过 k 倍交叉验证和贝叶斯优化确定最佳配置。选定的基础学习器为 LightGBM、Random Forest 和 XGBoost,元学习器为 Ridge、Lasso 和线性回归。最后,使用测试集对这些模型进行了验证,比较结果表明,所提出的堆叠模型优于所有单一模型。值得注意的是,Stacking-LR 模型的预测准确率最高(R2=0.946,MAE=5.59,MAPE=9.94%)。此外,还引入了 Shapley Additive exPlanations(SHAP)方法来分析输入特征对预测结果的影响和依赖性。结果发现,杨氏模量和约束压力是影响 RCS 的最关键参数,对预测结果有积极影响。这一发现与领域专家的知识一致,增强了模型的可解释性,并为预测结果提供了强有力的支持。
Prediction model for the compressive strength of rock based on stacking ensemble learning and shapley additive explanations
Accurately predicting the compressive strength of rock (RCS) is crucial for the construction and maintenance of rock engineering. However, RCS prediction based on single machine learning (ML) algorithms often face issues such as parameter sensitivity and inadequate generalization. To address these challenges, a new (RCS) prediction model based on a stacking ensemble learning method was proposed. This method combines multiple ML algorithms to achieve more accurate and stable prediction results. Firstly, 442 sets of rock mechanics experimental data were collected to form the prediction dataset, and data preprocessing techniques, including missing value imputation and normalization, were applied for data cleaning and standardization. Secondly, nine classic ML algorithms were used to establish RCS prediction models, and the optimal configurations were determined using k-fold cross-validation and Bayesian optimization. The selected base learners were LightGBM, Random Forest, and XGBoost, and the meta-learners were Ridge, Lasso, and Linear Regression. Finally, the models were verified using the testset, and the comparison showed that the proposed stacking models were better than all single models. Notably, the Stacking-LR model exhibited the best predictive accuracy(R2=0.946, MAE=5.59, MAPE=9.94%). Furthermore, the Shapley Additive exPlanations (SHAP) method was introduced to analyze the impact and dependencies of input features on the prediction results. It was found that both Young’s modulus and confining pressure are the most critical parameters influencing RCS and exert a positive impact on the prediction results. This finding is consistent with domain expert knowledge, enhances the model’s interpretability, and provides robust support for the predicted results.
期刊介绍:
Engineering geology is defined in the statutes of the IAEG as the science devoted to the investigation, study and solution of engineering and environmental problems which may arise as the result of the interaction between geology and the works or activities of man, as well as of the prediction of and development of measures for the prevention or remediation of geological hazards. Engineering geology embraces:
• the applications/implications of the geomorphology, structural geology, and hydrogeological conditions of geological formations;
• the characterisation of the mineralogical, physico-geomechanical, chemical and hydraulic properties of all earth materials involved in construction, resource recovery and environmental change;
• the assessment of the mechanical and hydrological behaviour of soil and rock masses;
• the prediction of changes to the above properties with time;
• the determination of the parameters to be considered in the stability analysis of engineering works and earth masses.