{"title":"Machine learning-based outlier detection for pipeline in-line inspection data","authors":"Muhammad Hussain, Tieling Zhang","doi":"10.1016/j.ress.2024.110553","DOIUrl":null,"url":null,"abstract":"<div><div>Pipeline companies are facing challenges in maintaining the integrity and reliability of their pipelines. They are working towards predictive maintenance using machine learning-based approaches to predicting anomalies. Training machine learning models requires sufficient data. Data quality is therefore becoming important because inaccurate data will lead to an inaccurate or wrong decision on pipeline condition assessment and the following management. This research paper intends to address the data quality issues of pipeline inspection data such as in-line inspection (ILI) data using machine learning models. Different machine learning models developed by random forest regression, linear regression, and nearest neighbors’ methods were tested to detect outliers in the ILI data. In this paper, the ILI data collected from an oil pipeline over a period of 22 years was applied to testing and analysis. To verify the outlier detection results of machine learning models, we used statistical analysis including Z-score method to check and find if there are any gaps in the analysis. It verifies that all these methods show almost the same or very similar results for the detection of the outliers. Hence, this study presents a robust method for the field applications in the pipeline industry.</div></div>","PeriodicalId":54500,"journal":{"name":"Reliability Engineering & System Safety","volume":null,"pages":null},"PeriodicalIF":9.4000,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reliability Engineering & System Safety","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0951832024006252","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0
Abstract
Pipeline companies are facing challenges in maintaining the integrity and reliability of their pipelines. They are working towards predictive maintenance using machine learning-based approaches to predicting anomalies. Training machine learning models requires sufficient data. Data quality is therefore becoming important because inaccurate data will lead to an inaccurate or wrong decision on pipeline condition assessment and the following management. This research paper intends to address the data quality issues of pipeline inspection data such as in-line inspection (ILI) data using machine learning models. Different machine learning models developed by random forest regression, linear regression, and nearest neighbors’ methods were tested to detect outliers in the ILI data. In this paper, the ILI data collected from an oil pipeline over a period of 22 years was applied to testing and analysis. To verify the outlier detection results of machine learning models, we used statistical analysis including Z-score method to check and find if there are any gaps in the analysis. It verifies that all these methods show almost the same or very similar results for the detection of the outliers. Hence, this study presents a robust method for the field applications in the pipeline industry.
期刊介绍:
Elsevier publishes Reliability Engineering & System Safety in association with the European Safety and Reliability Association and the Safety Engineering and Risk Analysis Division. The international journal is devoted to developing and applying methods to enhance the safety and reliability of complex technological systems, like nuclear power plants, chemical plants, hazardous waste facilities, space systems, offshore and maritime systems, transportation systems, constructed infrastructure, and manufacturing plants. The journal normally publishes only articles that involve the analysis of substantive problems related to the reliability of complex systems or present techniques and/or theoretical results that have a discernable relationship to the solution of such problems. An important aim is to balance academic material and practical applications.