Elucidating and forecasting the organochlorine pesticides in suspended particulate matter by a two-stage decomposition based interpretable deep learning approach
{"title":"Elucidating and forecasting the organochlorine pesticides in suspended particulate matter by a two-stage decomposition based interpretable deep learning approach","authors":"","doi":"10.1016/j.watres.2024.122315","DOIUrl":null,"url":null,"abstract":"<div><p>Accurately predicting the concentration of organochlorine pesticides (OCPs) presents a challenge due to their complex sources and environmental behaviors. In this study, we introduced a novel and advanced model that combined the power of three distinct techniques: Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN), Variational Mode Decomposition (VMD), and a deep learning network of Long Short-Term Memory (LSTM). The objective is to characterize the variation in OCPs concentrations with high precision. Results show that the hybrid two-stage decomposition coupled models achieved an average symmetric mean absolute percentage error (SMAPE) of 23.24 % in the empirical analysis of typical surface water. It exhibited higher predictive power than the given individual benchmark models, which yielded an average SMAPE of 40.88 %, and single decomposition coupled models with an average SMAPE of 29.80 %. The proposed CEEMDAN-VMD-LSTM model, with an average SMAPE of 13.55 %, consistently outperformed the other models, yielding an average SMAPE of 33.53 %. A comparative analysis with shallow neural network methods demonstrated the advantages of the LSTM algorithm when coupled with secondary decomposition techniques for processing time series datasets. Furthermore, the interpretable analysis derived by the SHAP approach revealed that precipitation followed by the total phosphorus had strong effects on the predicted concentration of OCPs in the given water. The data presented herein shows the effectiveness of decomposition technique-based deep learning algorithms in capturing the dynamic characteristics of pollutants in surface water.</p></div>","PeriodicalId":443,"journal":{"name":"Water Research","volume":null,"pages":null},"PeriodicalIF":11.4000,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Research","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0043135424012144","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately predicting the concentration of organochlorine pesticides (OCPs) presents a challenge due to their complex sources and environmental behaviors. In this study, we introduced a novel and advanced model that combined the power of three distinct techniques: Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN), Variational Mode Decomposition (VMD), and a deep learning network of Long Short-Term Memory (LSTM). The objective is to characterize the variation in OCPs concentrations with high precision. Results show that the hybrid two-stage decomposition coupled models achieved an average symmetric mean absolute percentage error (SMAPE) of 23.24 % in the empirical analysis of typical surface water. It exhibited higher predictive power than the given individual benchmark models, which yielded an average SMAPE of 40.88 %, and single decomposition coupled models with an average SMAPE of 29.80 %. The proposed CEEMDAN-VMD-LSTM model, with an average SMAPE of 13.55 %, consistently outperformed the other models, yielding an average SMAPE of 33.53 %. A comparative analysis with shallow neural network methods demonstrated the advantages of the LSTM algorithm when coupled with secondary decomposition techniques for processing time series datasets. Furthermore, the interpretable analysis derived by the SHAP approach revealed that precipitation followed by the total phosphorus had strong effects on the predicted concentration of OCPs in the given water. The data presented herein shows the effectiveness of decomposition technique-based deep learning algorithms in capturing the dynamic characteristics of pollutants in surface water.
期刊介绍:
Water Research, along with its open access companion journal Water Research X, serves as a platform for publishing original research papers covering various aspects of the science and technology related to the anthropogenic water cycle, water quality, and its management worldwide. The audience targeted by the journal comprises biologists, chemical engineers, chemists, civil engineers, environmental engineers, limnologists, and microbiologists. The scope of the journal include:
•Treatment processes for water and wastewaters (municipal, agricultural, industrial, and on-site treatment), including resource recovery and residuals management;
•Urban hydrology including sewer systems, stormwater management, and green infrastructure;
•Drinking water treatment and distribution;
•Potable and non-potable water reuse;
•Sanitation, public health, and risk assessment;
•Anaerobic digestion, solid and hazardous waste management, including source characterization and the effects and control of leachates and gaseous emissions;
•Contaminants (chemical, microbial, anthropogenic particles such as nanoparticles or microplastics) and related water quality sensing, monitoring, fate, and assessment;
•Anthropogenic impacts on inland, tidal, coastal and urban waters, focusing on surface and ground waters, and point and non-point sources of pollution;
•Environmental restoration, linked to surface water, groundwater and groundwater remediation;
•Analysis of the interfaces between sediments and water, and between water and atmosphere, focusing specifically on anthropogenic impacts;
•Mathematical modelling, systems analysis, machine learning, and beneficial use of big data related to the anthropogenic water cycle;
•Socio-economic, policy, and regulations studies.