Matthew Hayslep, Edward Keedwell, R. Farmani, Joshua Pocock
{"title":"An explainable machine learning approach to the prediction of pipe failure using minimum night flow","authors":"Matthew Hayslep, Edward Keedwell, R. Farmani, Joshua Pocock","doi":"10.2166/hydro.2024.204","DOIUrl":null,"url":null,"abstract":"\n \n Both minimum night flow (MNF) and pipe failures are common ways of understanding leakage within water distribution networks (WDNs). This article takes a data-driven approach and applies linear models, random forests, and neural networks to MNF and pipe failure prediction. First, models are trained to estimate the historic average MNF for over 800 real-world DMAs from the UK. Features for this problem are constructed from pipe records which detail the length, diameter, volume, age, material, and number of customer connections of each pipe. The results show that 65% of the variation in historic average MNF can be explained using these factors alone. Second, a novel method is proposed to deconstruct the models' predictions into a leakage contribution score (LCS), estimating how each individual pipe in a DMA has contributed to the MNF. In order to validate this novel approach, the LCS values are used to classify pipes based on historic pipe failure and are compared against models directly trained for this. The results show that the LCS performs well at this task, achieving an AUC of 0.71. In addition, it is shown that both LCS and directly trained models agree in many cases on an example real-world DMA.","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":"67 29","pages":""},"PeriodicalIF":4.7000,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.2166/hydro.2024.204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0
Abstract
Both minimum night flow (MNF) and pipe failures are common ways of understanding leakage within water distribution networks (WDNs). This article takes a data-driven approach and applies linear models, random forests, and neural networks to MNF and pipe failure prediction. First, models are trained to estimate the historic average MNF for over 800 real-world DMAs from the UK. Features for this problem are constructed from pipe records which detail the length, diameter, volume, age, material, and number of customer connections of each pipe. The results show that 65% of the variation in historic average MNF can be explained using these factors alone. Second, a novel method is proposed to deconstruct the models' predictions into a leakage contribution score (LCS), estimating how each individual pipe in a DMA has contributed to the MNF. In order to validate this novel approach, the LCS values are used to classify pipes based on historic pipe failure and are compared against models directly trained for this. The results show that the LCS performs well at this task, achieving an AUC of 0.71. In addition, it is shown that both LCS and directly trained models agree in many cases on an example real-world DMA.
期刊介绍:
ACS Applied Bio Materials is an interdisciplinary journal publishing original research covering all aspects of biomaterials and biointerfaces including and beyond the traditional biosensing, biomedical and therapeutic applications.
The journal is devoted to reports of new and original experimental and theoretical research of an applied nature that integrates knowledge in the areas of materials, engineering, physics, bioscience, and chemistry into important bio applications. The journal is specifically interested in work that addresses the relationship between structure and function and assesses the stability and degradation of materials under relevant environmental and biological conditions.