{"title":"A novel stacking ensemble learner for predicting residual strength of corroded pipelines","authors":"Qiankun Wang, Hongfang Lu","doi":"10.1038/s41529-024-00508-z","DOIUrl":null,"url":null,"abstract":"Accurately assessing the residual strength of corroded oil and gas pipelines is crucial for ensuring their safe and stable operation. Machine learning techniques have shown promise in addressing this challenge due to their ability to handle complex, non-linear relationships in data. Unlike previous studies that primarily focused on enhancing prediction accuracy through the optimization of single models, this work shifts the emphasis to a different approach: stacking ensemble learning. This study applies a stacking model composed of seven base learners and three meta-learners to predict the residual strength of pipelines using a dataset of 453 instances. Automated hyperparameter tuning libraries are utilized to search for optimal hyperparameters. By evaluating various combinations of base learners and meta-learners, the optimal stacking configuration was determined. The results demonstrate that the stacking model, using k-nearest neighbors as the meta-learner alongside seven base learners, delivers the best predictive performance, with a coefficient of determination of 0.959. Compared to individual models, the stacking model also significantly improves generalization performance. However, the stacking model’s effectiveness on low-strength pipelines is limited due to the small sample size. Furthermore, incorporating original features into the second-layer model did not significantly enhance performance, likely because the first-layer model had already extracted most of the critical features. Given the marginal contribution of model optimization to prediction accuracy, this work offers a novel perspective for improving model performance. The findings have important practical implications for the integrity assessment of corroded pipelines.","PeriodicalId":19270,"journal":{"name":"npj Materials Degradation","volume":" ","pages":"1-10"},"PeriodicalIF":6.6000,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s41529-024-00508-z.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"npj Materials Degradation","FirstCategoryId":"88","ListUrlMain":"https://www.nature.com/articles/s41529-024-00508-z","RegionNum":2,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately assessing the residual strength of corroded oil and gas pipelines is crucial for ensuring their safe and stable operation. Machine learning techniques have shown promise in addressing this challenge due to their ability to handle complex, non-linear relationships in data. Unlike previous studies that primarily focused on enhancing prediction accuracy through the optimization of single models, this work shifts the emphasis to a different approach: stacking ensemble learning. This study applies a stacking model composed of seven base learners and three meta-learners to predict the residual strength of pipelines using a dataset of 453 instances. Automated hyperparameter tuning libraries are utilized to search for optimal hyperparameters. By evaluating various combinations of base learners and meta-learners, the optimal stacking configuration was determined. The results demonstrate that the stacking model, using k-nearest neighbors as the meta-learner alongside seven base learners, delivers the best predictive performance, with a coefficient of determination of 0.959. Compared to individual models, the stacking model also significantly improves generalization performance. However, the stacking model’s effectiveness on low-strength pipelines is limited due to the small sample size. Furthermore, incorporating original features into the second-layer model did not significantly enhance performance, likely because the first-layer model had already extracted most of the critical features. Given the marginal contribution of model optimization to prediction accuracy, this work offers a novel perspective for improving model performance. The findings have important practical implications for the integrity assessment of corroded pipelines.
期刊介绍:
npj Materials Degradation considers basic and applied research that explores all aspects of the degradation of metallic and non-metallic materials. The journal broadly defines ‘materials degradation’ as a reduction in the ability of a material to perform its task in-service as a result of environmental exposure.
The journal covers a broad range of topics including but not limited to:
-Degradation of metals, glasses, minerals, polymers, ceramics, cements and composites in natural and engineered environments, as a result of various stimuli
-Computational and experimental studies of degradation mechanisms and kinetics
-Characterization of degradation by traditional and emerging techniques
-New approaches and technologies for enhancing resistance to degradation
-Inspection and monitoring techniques for materials in-service, such as sensing technologies