Improving groundwater quality predictions in semi-arid regions using ensemble learning models

IF 5.8 3区 环境科学与生态学 0 ENVIRONMENTAL SCIENCES Environmental Science and Pollution Research Pub Date : 2025-01-04 DOI:10.1007/s11356-024-35874-3
Maedeh Mahmoudi, Amin Mahdavi-Meymand, Ammar AlDallal, Mohammad Zounemat-Kermani
{"title":"Improving groundwater quality predictions in semi-arid regions using ensemble learning models","authors":"Maedeh Mahmoudi,&nbsp;Amin Mahdavi-Meymand,&nbsp;Ammar AlDallal,&nbsp;Mohammad Zounemat-Kermani","doi":"10.1007/s11356-024-35874-3","DOIUrl":null,"url":null,"abstract":"<div><p>Groundwater resources constitute one of the primary sources of freshwater in semi-arid and arid climates. Monitoring the groundwater quality is an essential component of environmental management. In this study, a comprehensive comparison was conducted to analyze the performance of nine ensembles and regular machine learning (ML) methods in predicting two water quality parameters including total dissolved solids (TDS) and pH, in an area with semi-arid climate conditions. The study area under consideration is an aquifer located in the Sirjan plain, Kerman, Iran. The developed models include standard multilayer perceptron neural network (MLPNN), classification and regression trees (CART), Chi-square automatic interaction detection (CHAID), and their ensemble versions in bagging (BG) and boosting (BT) ensemble structures. The analysis revealed that standard MLs yield comparable results in predicting TDS. The MLPNN, exhibiting a standard root mean square error (SRMSE) of 0.085, demonstrated superior accuracy in predicting TDS when contrasted with CART and CHAID models. Predicting pH poses a greater challenge for the models. Ensemble techniques significantly enhanced the accuracy of regular models. On average, the bagging and boosting techniques resulted in a 22.68% improvement in the accuracy of regular models, which represents a statistically significant enhancement. The boosting method, with an average SRMSE of 0.0602, is more accurate than bagging. Based on the results, the CHAID-BT with SRMSE of 0.0790 and CHAID-BG with SRMSE of 0.0330 are ranked the most accurate models for predicting TDS and pH, respectively. The performance of ensemble techniques in predicting TDS is more remarkable. In practical implementation, ensemble techniques can be considered an alternative method with high accuracy for sustainable water resources management in semi-arid regions, helping to address water shortages, climate change, water pollution, etc.</p></div>","PeriodicalId":545,"journal":{"name":"Environmental Science and Pollution Research","volume":"32 4","pages":"1985 - 2006"},"PeriodicalIF":5.8000,"publicationDate":"2025-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Science and Pollution Research","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s11356-024-35874-3","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Groundwater resources constitute one of the primary sources of freshwater in semi-arid and arid climates. Monitoring the groundwater quality is an essential component of environmental management. In this study, a comprehensive comparison was conducted to analyze the performance of nine ensembles and regular machine learning (ML) methods in predicting two water quality parameters including total dissolved solids (TDS) and pH, in an area with semi-arid climate conditions. The study area under consideration is an aquifer located in the Sirjan plain, Kerman, Iran. The developed models include standard multilayer perceptron neural network (MLPNN), classification and regression trees (CART), Chi-square automatic interaction detection (CHAID), and their ensemble versions in bagging (BG) and boosting (BT) ensemble structures. The analysis revealed that standard MLs yield comparable results in predicting TDS. The MLPNN, exhibiting a standard root mean square error (SRMSE) of 0.085, demonstrated superior accuracy in predicting TDS when contrasted with CART and CHAID models. Predicting pH poses a greater challenge for the models. Ensemble techniques significantly enhanced the accuracy of regular models. On average, the bagging and boosting techniques resulted in a 22.68% improvement in the accuracy of regular models, which represents a statistically significant enhancement. The boosting method, with an average SRMSE of 0.0602, is more accurate than bagging. Based on the results, the CHAID-BT with SRMSE of 0.0790 and CHAID-BG with SRMSE of 0.0330 are ranked the most accurate models for predicting TDS and pH, respectively. The performance of ensemble techniques in predicting TDS is more remarkable. In practical implementation, ensemble techniques can be considered an alternative method with high accuracy for sustainable water resources management in semi-arid regions, helping to address water shortages, climate change, water pollution, etc.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用集合学习模型改进半干旱区地下水质量预测。
地下水资源是半干旱和干旱气候下淡水的主要来源之一。地下水水质监测是环境管理的重要组成部分。在本研究中,对九种集成方法和常规机器学习(ML)方法在半干旱气候条件下预测总溶解固形物(TDS)和pH两项水质参数的性能进行了全面比较。考虑中的研究区域是位于伊朗克尔曼锡尔詹平原的含水层。开发的模型包括标准多层感知器神经网络(MLPNN)、分类与回归树(CART)、卡方自动交互检测(CHAID),以及它们在套袋(BG)和提升(BT)集成结构中的集成版本。分析显示,标准MLs在预测TDS方面的结果可比较。与CART和CHAID模型相比,MLPNN的标准均方根误差(SRMSE)为0.085,在预测TDS方面表现出更高的准确性。预测pH值对模型提出了更大的挑战。集成技术显著提高了常规模型的准确性。平均而言,bagging和boosting技术使常规模型的准确率提高了22.68%,这在统计学上是一个显著的提高。助推法的平均SRMSE为0.0602,比套袋法更准确。结果表明,CHAID-BT (SRMSE为0.0790)和CHAID-BG (SRMSE为0.0330)分别是预测TDS和pH最准确的模型。集成技术在预测TDS方面的表现更为显著。在实际实施中,集成技术可被视为半干旱区可持续水资源管理的一种高精度替代方法,有助于解决水资源短缺、气候变化、水污染等问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
8.70
自引率
17.20%
发文量
6549
审稿时长
3.8 months
期刊介绍: Environmental Science and Pollution Research (ESPR) serves the international community in all areas of Environmental Science and related subjects with emphasis on chemical compounds. This includes: - Terrestrial Biology and Ecology - Aquatic Biology and Ecology - Atmospheric Chemistry - Environmental Microbiology/Biobased Energy Sources - Phytoremediation and Ecosystem Restoration - Environmental Analyses and Monitoring - Assessment of Risks and Interactions of Pollutants in the Environment - Conservation Biology and Sustainable Agriculture - Impact of Chemicals/Pollutants on Human and Animal Health It reports from a broad interdisciplinary outlook.
期刊最新文献
Climate anomalies reshape abundance distribution and biological parametric of the endemic longtail shad (Tenualosa macrura) in the Siak River Estuary, Indonesia. Leaching behavior of strontium in cementitious materials immobilizing boric acid enriched in B-10 isotope. Retraction Note: A new paradigm of water, food, and energy nexus. Functional synergistic effects of graphene nanoribbons and surfactant stabilizers on inhibition of growth of biofilm-forming and biofilm non-forming bacteria. Bilgewater management in marine vessels: a systematic literature review of marine vessel bilgewater and treatment options.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1