Assessment of groundwater chemistry to predict arsenic contamination from a canal commanded area: applications of different machine learning models.

IF 3.2 3区 环境科学与生态学 Q3 ENGINEERING, ENVIRONMENTAL Environmental Geochemistry and Health Pub Date : 2025-01-09 DOI:10.1007/s10653-024-02334-3
Fazila Younas, Muhammad Fahad Sardar, Zahid Ullah, Jawad Ali, Xiaona Yu, Pengcheng Zhu, Weihua Guo, Khalid Mashay Al-Anazi, Mohammad Abul Farah, Zhaojie Cui
{"title":"Assessment of groundwater chemistry to predict arsenic contamination from a canal commanded area: applications of different machine learning models.","authors":"Fazila Younas, Muhammad Fahad Sardar, Zahid Ullah, Jawad Ali, Xiaona Yu, Pengcheng Zhu, Weihua Guo, Khalid Mashay Al-Anazi, Mohammad Abul Farah, Zhaojie Cui","doi":"10.1007/s10653-024-02334-3","DOIUrl":null,"url":null,"abstract":"<p><p>Groundwater arsenic (As), contamination is a significant issue worldwide including China and Pakistan, particularly in canal command areas. In this study, 131 groundwater samples were collected, and three machine learning models [Random Forest (RF), Logistic Regression (LR), and Artificial Neural Network (ANN)] were employed to predict As concentration. Descriptive statistics helped to conclude that all of the samples were inside the permitted limit of WHO for pH, Ca, Mg, Turbidity, Cl, K, Na, SO<sub>4</sub>, NO<sub>3</sub>, F and beyond limit of WHO for EC, HCO<sub>3</sub>, TDS, and As. RF suggested a median drop in Gini node impurity across all tree divisions. This predicted As contamination in samples due to presence of TDS, EC, HCO<sub>3</sub><sup>-</sup> and turbidity in upper end of graph which expressed significance of these factors in contaminating water with Arsenic. Moreover, these factors were found positively correlated with Ar contamination. LR model expressed about best fitness of model. ANN classified large data set into two classes i.e. (1) Inside limit of WHO and (2) and outside limit of WHO. Total dissolved solids (TDS), turbidity, sodium (Na) and electrical conductivity (EC) were positively correlated with Ar (Arsenic concentration) in the collected samples. pH and K were negatively associated with Arsenic concentration of the observed samples. Confusion matrices and ROC-AUC scores evaluated that RF, model outperforming than LR, and ANN, in accuracy and sensitivity. Key variables influencing As concentration in the groundwater resources of the study area were identified, such parameters include TDS, chloride (Cl), bicarbonate (HCO<sub>3</sub><sup>-</sup>) and turbidity. The study provided the complete profile of the 131 water samples which can be used to make strategies for the minimization of ground Water contamination for Rohri canal command area. Moreover, the steps can be taken to control the discussed parameters inside the WHO limit.</p>","PeriodicalId":11759,"journal":{"name":"Environmental Geochemistry and Health","volume":"47 2","pages":"46"},"PeriodicalIF":3.2000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Geochemistry and Health","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s10653-024-02334-3","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Groundwater arsenic (As), contamination is a significant issue worldwide including China and Pakistan, particularly in canal command areas. In this study, 131 groundwater samples were collected, and three machine learning models [Random Forest (RF), Logistic Regression (LR), and Artificial Neural Network (ANN)] were employed to predict As concentration. Descriptive statistics helped to conclude that all of the samples were inside the permitted limit of WHO for pH, Ca, Mg, Turbidity, Cl, K, Na, SO4, NO3, F and beyond limit of WHO for EC, HCO3, TDS, and As. RF suggested a median drop in Gini node impurity across all tree divisions. This predicted As contamination in samples due to presence of TDS, EC, HCO3- and turbidity in upper end of graph which expressed significance of these factors in contaminating water with Arsenic. Moreover, these factors were found positively correlated with Ar contamination. LR model expressed about best fitness of model. ANN classified large data set into two classes i.e. (1) Inside limit of WHO and (2) and outside limit of WHO. Total dissolved solids (TDS), turbidity, sodium (Na) and electrical conductivity (EC) were positively correlated with Ar (Arsenic concentration) in the collected samples. pH and K were negatively associated with Arsenic concentration of the observed samples. Confusion matrices and ROC-AUC scores evaluated that RF, model outperforming than LR, and ANN, in accuracy and sensitivity. Key variables influencing As concentration in the groundwater resources of the study area were identified, such parameters include TDS, chloride (Cl), bicarbonate (HCO3-) and turbidity. The study provided the complete profile of the 131 water samples which can be used to make strategies for the minimization of ground Water contamination for Rohri canal command area. Moreover, the steps can be taken to control the discussed parameters inside the WHO limit.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
评估地下水化学以预测运河指挥区域的砷污染:不同机器学习模型的应用。
地下水砷污染是包括中国和巴基斯坦在内的世界范围内的重大问题,特别是在运河指挥区域。本研究采集了131份地下水样本,采用随机森林(Random Forest, RF)、Logistic回归(Logistic Regression, LR)和人工神经网络(Artificial Neural Network, ANN) 3种机器学习模型对As浓度进行预测。描述性统计表明,所有样品的pH、Ca、Mg、浊度、Cl、K、Na、SO4、NO3、F均在WHO的允许范围内,EC、HCO3、TDS和As均超出WHO的允许范围。RF表明,基尼节点杂质中位数在所有树分区中都有所下降。在图的上端,由于TDS, EC, HCO3-和浊度的存在,这预测了样品中砷的污染,这表明了这些因素在砷污染水中的重要性。这些因素与砷污染呈正相关。LR模型表示了模型的最佳适应度。人工神经网络将大数据集分为(1)WHO的内限值和(2)WHO的外限值两类。样品中总溶解固形物(TDS)、浊度、钠(Na)和电导率(EC)与砷(Ar)浓度呈正相关。pH和K值与砷浓度呈负相关。混淆矩阵和ROC-AUC评分评估了RF、模型在准确性和灵敏度方面优于LR和ANN。确定了影响研究区地下水As浓度的关键变量,包括TDS、氯化物(Cl)、碳酸氢盐(HCO3-)和浊度。该研究提供了131个水样的完整剖面,可用于制定最大限度地减少罗赫里运河指挥区内地下水污染的策略。此外,可以采取步骤将所讨论的参数控制在世卫组织的限制范围内。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Environmental Geochemistry and Health
Environmental Geochemistry and Health 环境科学-工程:环境
CiteScore
8.00
自引率
4.80%
发文量
279
审稿时长
4.2 months
期刊介绍: Environmental Geochemistry and Health publishes original research papers and review papers across the broad field of environmental geochemistry. Environmental geochemistry and health establishes and explains links between the natural or disturbed chemical composition of the earth’s surface and the health of plants, animals and people. Beneficial elements regulate or promote enzymatic and hormonal activity whereas other elements may be toxic. Bedrock geochemistry controls the composition of soil and hence that of water and vegetation. Environmental issues, such as pollution, arising from the extraction and use of mineral resources, are discussed. The effects of contaminants introduced into the earth’s geochemical systems are examined. Geochemical surveys of soil, water and plants show how major and trace elements are distributed geographically. Associated epidemiological studies reveal the possibility of causal links between the natural or disturbed geochemical environment and disease. Experimental research illuminates the nature or consequences of natural or disturbed geochemical processes. The journal particularly welcomes novel research linking environmental geochemistry and health issues on such topics as: heavy metals (including mercury), persistent organic pollutants (POPs), and mixed chemicals emitted through human activities, such as uncontrolled recycling of electronic-waste; waste recycling; surface-atmospheric interaction processes (natural and anthropogenic emissions, vertical transport, deposition, and physical-chemical interaction) of gases and aerosols; phytoremediation/restoration of contaminated sites; food contamination and safety; environmental effects of medicines; effects and toxicity of mixed pollutants; speciation of heavy metals/metalloids; effects of mining; disturbed geochemistry from human behavior, natural or man-made hazards; particle and nanoparticle toxicology; risk and the vulnerability of populations, etc.
期刊最新文献
Correction: Polychlorinated biphenyls induced toxicities upon cell lines and stem cells: a review. Balancing application of plant growth-promoting bacteria and biochar in promoting selenium biofortification and remediating combined heavy metal pollution in paddy soil. Synergistic effects of indigenous bacterial consortia on heavy metal tolerance and reduction. Pedogeochemical mobility of metals from fluorescent lamp waste and human health risk assessment. Soil heavy metals assessment of the Zhoukou riparian zone base of Shaying river basin, China: spatial distribution, source analysis and ecological risk.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1