基于可解释的机器学习模型估算水质指数。

IF 2.5 4区 环境科学与生态学 Q3 ENGINEERING, ENVIRONMENTAL Water Science and Technology Pub Date : 2024-03-01 DOI:10.2166/wst.2024.068
Shiwei Yang, Ruifeng Liang, Junguang Chen, Yuanming Wang, Kefeng Li
{"title":"基于可解释的机器学习模型估算水质指数。","authors":"Shiwei Yang, Ruifeng Liang, Junguang Chen, Yuanming Wang, Kefeng Li","doi":"10.2166/wst.2024.068","DOIUrl":null,"url":null,"abstract":"<p><p>The water quality index (WQI) is an important tool for evaluating the water quality status of lakes. In this study, we used the WQI to evaluate the spatial water quality characteristics of Dianchi Lake. However, the WQI calculation is time-consuming, and machine learning models exhibit significant advantages in terms of timeliness and nonlinear data fitting. We used a machine learning model with optimized parameters to predict the WQI, and the light gradient boosting machine achieved good predictive performance. The machine learning model trained based on the entire Dianchi Lake water quality data achieved coefficient of determination (R<sup>2</sup>), mean square error, and mean absolute error values of 0.989, 0.228, and 0.298, respectively. In addition, we used the Shapley additive explanations (SHAP) method to interpret and analyse the machine learning model and identified the main water quality parameter that affects the WQI of Dianchi Lake as NH<sub>4</sub><sup>+</sup>-N. Within the entire range of Dianchi Lake, the SHAP values of NH<sub>4</sub><sup>+</sup>-N varied from -9 to 3. Thus, in future water environmental governance, it is necessary to focus on NH<sub>4</sub><sup>+</sup>-N changes. These results can provide a reference for the treatment of lake water environments.</p>","PeriodicalId":23653,"journal":{"name":"Water Science and Technology","volume":null,"pages":null},"PeriodicalIF":2.5000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/wst_2024_068/pdf/","citationCount":"0","resultStr":"{\"title\":\"Estimating the water quality index based on interpretable machine learning models.\",\"authors\":\"Shiwei Yang, Ruifeng Liang, Junguang Chen, Yuanming Wang, Kefeng Li\",\"doi\":\"10.2166/wst.2024.068\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The water quality index (WQI) is an important tool for evaluating the water quality status of lakes. In this study, we used the WQI to evaluate the spatial water quality characteristics of Dianchi Lake. However, the WQI calculation is time-consuming, and machine learning models exhibit significant advantages in terms of timeliness and nonlinear data fitting. We used a machine learning model with optimized parameters to predict the WQI, and the light gradient boosting machine achieved good predictive performance. The machine learning model trained based on the entire Dianchi Lake water quality data achieved coefficient of determination (R<sup>2</sup>), mean square error, and mean absolute error values of 0.989, 0.228, and 0.298, respectively. In addition, we used the Shapley additive explanations (SHAP) method to interpret and analyse the machine learning model and identified the main water quality parameter that affects the WQI of Dianchi Lake as NH<sub>4</sub><sup>+</sup>-N. Within the entire range of Dianchi Lake, the SHAP values of NH<sub>4</sub><sup>+</sup>-N varied from -9 to 3. Thus, in future water environmental governance, it is necessary to focus on NH<sub>4</sub><sup>+</sup>-N changes. These results can provide a reference for the treatment of lake water environments.</p>\",\"PeriodicalId\":23653,\"journal\":{\"name\":\"Water Science and Technology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2024-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/wst_2024_068/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Water Science and Technology\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.2166/wst.2024.068\",\"RegionNum\":4,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Science and Technology","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.2166/wst.2024.068","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

水质指数(WQI)是评价湖泊水质状况的重要工具。在本研究中,我们使用 WQI 评价滇池的空间水质特征。然而,WQI 计算耗时较长,而机器学习模型在时效性和非线性数据拟合方面具有显著优势。我们采用了参数优化的机器学习模型来预测 WQI,其中光梯度提升机取得了良好的预测性能。基于整个滇池水质数据训练的机器学习模型的判定系数(R2)、均方误差和平均绝对误差值分别为 0.989、0.228 和 0.298。此外,我们还利用夏普利加解法(SHAP)对机器学习模型进行了解释和分析,确定了影响滇池水质指数的主要水质参数为 NH4+-N。在整个滇池范围内,NH4+-N的SHAP值从-9到3不等,因此在未来的水环境治理中,有必要关注NH4+-N的变化。这些结果可为湖泊水环境治理提供参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Estimating the water quality index based on interpretable machine learning models.

The water quality index (WQI) is an important tool for evaluating the water quality status of lakes. In this study, we used the WQI to evaluate the spatial water quality characteristics of Dianchi Lake. However, the WQI calculation is time-consuming, and machine learning models exhibit significant advantages in terms of timeliness and nonlinear data fitting. We used a machine learning model with optimized parameters to predict the WQI, and the light gradient boosting machine achieved good predictive performance. The machine learning model trained based on the entire Dianchi Lake water quality data achieved coefficient of determination (R2), mean square error, and mean absolute error values of 0.989, 0.228, and 0.298, respectively. In addition, we used the Shapley additive explanations (SHAP) method to interpret and analyse the machine learning model and identified the main water quality parameter that affects the WQI of Dianchi Lake as NH4+-N. Within the entire range of Dianchi Lake, the SHAP values of NH4+-N varied from -9 to 3. Thus, in future water environmental governance, it is necessary to focus on NH4+-N changes. These results can provide a reference for the treatment of lake water environments.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Water Science and Technology
Water Science and Technology 环境科学-工程:环境
CiteScore
4.90
自引率
3.70%
发文量
366
审稿时长
4.4 months
期刊介绍: Water Science and Technology publishes peer-reviewed papers on all aspects of the science and technology of water and wastewater. Papers are selected by a rigorous peer review procedure with the aim of rapid and wide dissemination of research results, development and application of new techniques, and related managerial and policy issues. Scientists, engineers, consultants, managers and policy-makers will find this journal essential as a permanent record of progress of research activities and their practical applications.
期刊最新文献
Sewage sludge management and enhanced energy recovery using anaerobic digestion: an insight. Spatial differences of dissolved organic matter composition and humification in an artificial lake. Wetland systems for water pollution control. Activated persulfate for efficient bisphenol A degradation via nitrogen-doped Fe/Mn bimetallic biochar. Assessment of water quality in wells and springs across various districts of Taza City, Morocco.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1