利用多种机器学习算法优化水质指标模型及其适用性

IF 7 2区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES Ecological Indicators Pub Date : 2025-03-01 Epub Date: 2025-03-04 DOI:10.1016/j.ecolind.2025.113299
Fei Ding , Shilong Hao , Wenjie Zhang , Mingcen Jiang , Liangyao Chen , Haobin Yuan , Nan Wang , Wenpan Li , Xin Xie
{"title":"利用多种机器学习算法优化水质指标模型及其适用性","authors":"Fei Ding ,&nbsp;Shilong Hao ,&nbsp;Wenjie Zhang ,&nbsp;Mingcen Jiang ,&nbsp;Liangyao Chen ,&nbsp;Haobin Yuan ,&nbsp;Nan Wang ,&nbsp;Wenpan Li ,&nbsp;Xin Xie","doi":"10.1016/j.ecolind.2025.113299","DOIUrl":null,"url":null,"abstract":"<div><div>Water quality assessment model and spatiotemporal heterogeneity pose challenges to the uncertainty of water quality assessment. To improve the accuracy of the water quality index (WQI) model, multiple machine learning algorithms (CatBoost, SVM, LR, XGBoost, LightGBM) and entropy weight method (EWM) were introduced to determine the objective weight. Six combined weights were determined by game theory combining objective and subjective weights (AHP). Three aggregation functions were established, including a new function proposed based on the sigmoid function and two existing functions. Based on the six combined weights and three aggregation functions, eighteen WQI models were developed. To reduce the influence of spatiotemporal heterogeneity, the assessment models for the different water quality characteristics were proposed respectively. To validate the performance of improved model, the monthly water quality monitoring data of 16 sampling sites in Chaohu Lake during 2016–2020 was used. Among them, totally 10 water quality indicators were selected, including TN, TP, etc. The results showed high accuracy and reliability of the improved WQI assessment models. The model improved by CatBoost and EWM had low uncertainty (0.559–0.903) than SVM and LR (0.576–1.034). The sensitivity of the models improved by six combined weights is ranked as W<sub>AE</sub> &gt; W<sub>AC</sub> &gt; W<sub>AS</sub> &gt; W<sub>AX</sub> &gt; W<sub>ALGB</sub> &gt; W<sub>AL</sub>. The uncertainty of the models improved by the three aggregation functions were ranked as SGM &gt; SWM &gt; WQM and the sensitivity were ranked as WQM &gt; SWM &gt; SGM. Compared with WQM and SWM, SGM could reflect the water quality spatiotemporal heterogeneity more accurately. The WQM<sub>AE</sub>, SGM<sub>AC</sub> and SWM<sub>AC</sub> models were recommended for assessing water bodies with good quality, poor quality and heterogeneity respectively. Chaohu Lake was mainly Class II and Class III water. East had better water quality than the west. Water quality in summer and autumn was better than in spring and winter. This study can provide theoretical support for related water quality assessment work.</div></div>","PeriodicalId":11459,"journal":{"name":"Ecological Indicators","volume":"172 ","pages":"Article 113299"},"PeriodicalIF":7.0000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using multiple machine learning algorithms to optimize the water quality index model and their applicability\",\"authors\":\"Fei Ding ,&nbsp;Shilong Hao ,&nbsp;Wenjie Zhang ,&nbsp;Mingcen Jiang ,&nbsp;Liangyao Chen ,&nbsp;Haobin Yuan ,&nbsp;Nan Wang ,&nbsp;Wenpan Li ,&nbsp;Xin Xie\",\"doi\":\"10.1016/j.ecolind.2025.113299\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Water quality assessment model and spatiotemporal heterogeneity pose challenges to the uncertainty of water quality assessment. To improve the accuracy of the water quality index (WQI) model, multiple machine learning algorithms (CatBoost, SVM, LR, XGBoost, LightGBM) and entropy weight method (EWM) were introduced to determine the objective weight. Six combined weights were determined by game theory combining objective and subjective weights (AHP). Three aggregation functions were established, including a new function proposed based on the sigmoid function and two existing functions. Based on the six combined weights and three aggregation functions, eighteen WQI models were developed. To reduce the influence of spatiotemporal heterogeneity, the assessment models for the different water quality characteristics were proposed respectively. To validate the performance of improved model, the monthly water quality monitoring data of 16 sampling sites in Chaohu Lake during 2016–2020 was used. Among them, totally 10 water quality indicators were selected, including TN, TP, etc. The results showed high accuracy and reliability of the improved WQI assessment models. The model improved by CatBoost and EWM had low uncertainty (0.559–0.903) than SVM and LR (0.576–1.034). The sensitivity of the models improved by six combined weights is ranked as W<sub>AE</sub> &gt; W<sub>AC</sub> &gt; W<sub>AS</sub> &gt; W<sub>AX</sub> &gt; W<sub>ALGB</sub> &gt; W<sub>AL</sub>. The uncertainty of the models improved by the three aggregation functions were ranked as SGM &gt; SWM &gt; WQM and the sensitivity were ranked as WQM &gt; SWM &gt; SGM. Compared with WQM and SWM, SGM could reflect the water quality spatiotemporal heterogeneity more accurately. The WQM<sub>AE</sub>, SGM<sub>AC</sub> and SWM<sub>AC</sub> models were recommended for assessing water bodies with good quality, poor quality and heterogeneity respectively. Chaohu Lake was mainly Class II and Class III water. East had better water quality than the west. Water quality in summer and autumn was better than in spring and winter. This study can provide theoretical support for related water quality assessment work.</div></div>\",\"PeriodicalId\":11459,\"journal\":{\"name\":\"Ecological Indicators\",\"volume\":\"172 \",\"pages\":\"Article 113299\"},\"PeriodicalIF\":7.0000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Indicators\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1470160X25002304\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/4 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Indicators","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1470160X25002304","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

水质评价模型和时空异质性对水质评价的不确定性提出了挑战。为了提高水质指数(WQI)模型的准确性,引入多种机器学习算法(CatBoost、SVM、LR、XGBoost、LightGBM)和熵权法(EWM)确定客观权重。采用客观权重与主观权重相结合的博弈论(AHP)确定了6个组合权重。建立了三个聚合函数,包括一个基于sigmoid函数提出的新函数和两个现有函数。基于6个组合权值和3个聚合函数,建立了18个WQI模型。为降低时空异质性的影响,提出了不同水质特征的评价模型。为了验证改进模型的有效性,利用2016-2020年巢湖16个采样点的月度水质监测数据。其中,共选取了TN、TP等10个水质指标。结果表明,改进的WQI评价模型具有较高的准确性和可靠性。与SVM和LR(0.576-1.034)相比,CatBoost和EWM改进的模型的不确定性(0.559-0.903)较低。6个组合权重提高的模型灵敏度为WAE >;WAC祝辞是比;蜡比;WALGB祝辞细胞膜。三种聚合函数改善的模型的不确定性分别为:SGM >;SWM祝辞WQM和灵敏度排序为WQM >;SWM祝辞新。与WQM和SWM相比,SGM能更准确地反映水质时空异质性。推荐WQMAE、SGMAC和SWMAC模型分别用于水质好、水质差和异质性水体的评价。巢湖以二类和三类水为主。东部水质好于西部。夏季和秋季水质好于春季和冬季。本研究可为相关水质评价工作提供理论支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Using multiple machine learning algorithms to optimize the water quality index model and their applicability
Water quality assessment model and spatiotemporal heterogeneity pose challenges to the uncertainty of water quality assessment. To improve the accuracy of the water quality index (WQI) model, multiple machine learning algorithms (CatBoost, SVM, LR, XGBoost, LightGBM) and entropy weight method (EWM) were introduced to determine the objective weight. Six combined weights were determined by game theory combining objective and subjective weights (AHP). Three aggregation functions were established, including a new function proposed based on the sigmoid function and two existing functions. Based on the six combined weights and three aggregation functions, eighteen WQI models were developed. To reduce the influence of spatiotemporal heterogeneity, the assessment models for the different water quality characteristics were proposed respectively. To validate the performance of improved model, the monthly water quality monitoring data of 16 sampling sites in Chaohu Lake during 2016–2020 was used. Among them, totally 10 water quality indicators were selected, including TN, TP, etc. The results showed high accuracy and reliability of the improved WQI assessment models. The model improved by CatBoost and EWM had low uncertainty (0.559–0.903) than SVM and LR (0.576–1.034). The sensitivity of the models improved by six combined weights is ranked as WAE > WAC > WAS > WAX > WALGB > WAL. The uncertainty of the models improved by the three aggregation functions were ranked as SGM > SWM > WQM and the sensitivity were ranked as WQM > SWM > SGM. Compared with WQM and SWM, SGM could reflect the water quality spatiotemporal heterogeneity more accurately. The WQMAE, SGMAC and SWMAC models were recommended for assessing water bodies with good quality, poor quality and heterogeneity respectively. Chaohu Lake was mainly Class II and Class III water. East had better water quality than the west. Water quality in summer and autumn was better than in spring and winter. This study can provide theoretical support for related water quality assessment work.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Ecological Indicators
Ecological Indicators 环境科学-环境科学
CiteScore
11.80
自引率
8.70%
发文量
1163
审稿时长
78 days
期刊介绍: The ultimate aim of Ecological Indicators is to integrate the monitoring and assessment of ecological and environmental indicators with management practices. The journal provides a forum for the discussion of the applied scientific development and review of traditional indicator approaches as well as for theoretical, modelling and quantitative applications such as index development. Research into the following areas will be published. • All aspects of ecological and environmental indicators and indices. • New indicators, and new approaches and methods for indicator development, testing and use. • Development and modelling of indices, e.g. application of indicator suites across multiple scales and resources. • Analysis and research of resource, system- and scale-specific indicators. • Methods for integration of social and other valuation metrics for the production of scientifically rigorous and politically-relevant assessments using indicator-based monitoring and assessment programs. • How research indicators can be transformed into direct application for management purposes. • Broader assessment objectives and methods, e.g. biodiversity, biological integrity, and sustainability, through the use of indicators. • Resource-specific indicators such as landscape, agroecosystems, forests, wetlands, etc.
期刊最新文献
Overstorey features provide stronger indicators of fine shrub fuel load than biophysical variables across tree species in northern Spain Dynamic assessment of dual droughts (SED-EED) in the East River basin under global change: A hybrid SWAT-SWSDI-copula framework based on coupled blue-green water analysis Rare bacteria with low absolute abundance as sensitive bioindicators of riverine pollution: A comparative study across river water, sediment, and riparian soils From plants to vertebrates: Vegetation-derived ecological indicator values reveal habitat drivers of a micro-endemic alpine salamander Evaluating multi-species indicators as tools for reserve-scale species distribution and abundance modelling
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1