{"title":"Optimized hybrid XGBoost-CatBoost model for enhanced prediction of concrete strength and reliability analysis using Monte Carlo simulations","authors":"Tuan Nguyen-Sy","doi":"10.1016/j.asoc.2024.112490","DOIUrl":null,"url":null,"abstract":"<div><div>Building on our previous work demonstrating the exceptional potential of the Extreme Gradient Boosting model (XGBoost) for predicting the uniaxial compressive strength of concrete, this study introduces several significant advancements. First, we develop a novel optimized hybrid model that synergistically combines XGBoost, CatBoost (one of the most advanced tree-boosting models), and the Optuna algorithm to achieve unprecedented prediction accuracy. Second, we apply this hybrid model in Monte Carlo simulations to conduct a pioneering reliability analysis of concrete strength, capturing the effects of input uncertainty. Third, we propose an innovative technique for estimating tree leaf values, which fundamentally improves prediction accuracy. Our optimized hybrid model delivers outstanding performance, as evidenced by a five-fold cross-validation showing a coefficient of determination (R²) of 0.953, a root mean squared error (RMSE) of 3.603 MPa, and a mean absolute error (MAE) of 2.261 MPa—metrics that surpass the best results reported in the existing literature. Additionally, our Monte Carlo simulations reveal a substantial error range of 10–20 MPa for a ±5 % variation in input features, underscoring the critical impact of input uncertainty on prediction reliability. Furthermore, our new leaf value estimation technique significantly outperforms traditional averaging methods, offering a transformative improvement in model accuracy. These findings are crucial for broadening the scope of machine learning applications in civil engineering and other engineering disciplines.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"167 ","pages":"Article 112490"},"PeriodicalIF":7.2000,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S156849462401264X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Building on our previous work demonstrating the exceptional potential of the Extreme Gradient Boosting model (XGBoost) for predicting the uniaxial compressive strength of concrete, this study introduces several significant advancements. First, we develop a novel optimized hybrid model that synergistically combines XGBoost, CatBoost (one of the most advanced tree-boosting models), and the Optuna algorithm to achieve unprecedented prediction accuracy. Second, we apply this hybrid model in Monte Carlo simulations to conduct a pioneering reliability analysis of concrete strength, capturing the effects of input uncertainty. Third, we propose an innovative technique for estimating tree leaf values, which fundamentally improves prediction accuracy. Our optimized hybrid model delivers outstanding performance, as evidenced by a five-fold cross-validation showing a coefficient of determination (R²) of 0.953, a root mean squared error (RMSE) of 3.603 MPa, and a mean absolute error (MAE) of 2.261 MPa—metrics that surpass the best results reported in the existing literature. Additionally, our Monte Carlo simulations reveal a substantial error range of 10–20 MPa for a ±5 % variation in input features, underscoring the critical impact of input uncertainty on prediction reliability. Furthermore, our new leaf value estimation technique significantly outperforms traditional averaging methods, offering a transformative improvement in model accuracy. These findings are crucial for broadening the scope of machine learning applications in civil engineering and other engineering disciplines.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.