Xiao Zhou , Yacan Man , Shuming Liu , Juan Zhang , Rui Yuan , Wei Wang , Kuizu Su
{"title":"Leveraging multi-level correlations for imputing monitoring data in water supply systems using graph signal sampling theory","authors":"Xiao Zhou , Yacan Man , Shuming Liu , Juan Zhang , Rui Yuan , Wei Wang , Kuizu Su","doi":"10.1016/j.wroa.2024.100274","DOIUrl":null,"url":null,"abstract":"<div><div>Data missing and anomalies in monitoring equipment have become critical barriers to developing intelligent Water Supply Systems (WSS). The valid data preceding and after the missing segments can be utilized to impute missing values. However, traditional imputation methods, such as linear interpolation and prediction-based methods, have limited capacity to use data relationships or can only utilize information before the missing values. Therefore, existing methods still need to work on efficiently and conveniently achieving high-accuracy imputation. According to the continuity and periodicity of WSS data, missing values often exhibit multi-level correlations with valid data. This paper innovatively employs graph structures to analyze the multi-level correlations at different timestamps and applies graph signal sampling algorithms to extract low-frequency features for imputation. A novel Graph-based Data Imputation (GDI) method has been developed, which leverages multi-level correlations to propagate information and completes imputation tasks without requiring complex feature engineering and pre-training processes. Results indicate that GDI outperforms Holt-Winters, Support Vector Regression, and Gated Recurrent Unit in the task of imputing continuous missing data. It can still achieve <span><math><mrow><msup><mrow><mi>R</mi></mrow><mn>2</mn></msup><mo>></mo><mn>0.8</mn></mrow></math></span> even when the proportion of missing values reaches 80 %. These results demonstrate that GDI ensures a more streamlined and efficient imputation with high robustness and accuracy.</div></div>","PeriodicalId":52198,"journal":{"name":"Water Research X","volume":"25 ","pages":"Article 100274"},"PeriodicalIF":7.2000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Research X","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589914724000641","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Data missing and anomalies in monitoring equipment have become critical barriers to developing intelligent Water Supply Systems (WSS). The valid data preceding and after the missing segments can be utilized to impute missing values. However, traditional imputation methods, such as linear interpolation and prediction-based methods, have limited capacity to use data relationships or can only utilize information before the missing values. Therefore, existing methods still need to work on efficiently and conveniently achieving high-accuracy imputation. According to the continuity and periodicity of WSS data, missing values often exhibit multi-level correlations with valid data. This paper innovatively employs graph structures to analyze the multi-level correlations at different timestamps and applies graph signal sampling algorithms to extract low-frequency features for imputation. A novel Graph-based Data Imputation (GDI) method has been developed, which leverages multi-level correlations to propagate information and completes imputation tasks without requiring complex feature engineering and pre-training processes. Results indicate that GDI outperforms Holt-Winters, Support Vector Regression, and Gated Recurrent Unit in the task of imputing continuous missing data. It can still achieve even when the proportion of missing values reaches 80 %. These results demonstrate that GDI ensures a more streamlined and efficient imputation with high robustness and accuracy.
Water Research XEnvironmental Science-Water Science and Technology
CiteScore
12.30
自引率
1.30%
发文量
19
期刊介绍:
Water Research X is a sister journal of Water Research, which follows a Gold Open Access model. It focuses on publishing concise, letter-style research papers, visionary perspectives and editorials, as well as mini-reviews on emerging topics. The Journal invites contributions from researchers worldwide on various aspects of the science and technology related to the human impact on the water cycle, water quality, and its global management.