A combination of multivariate statistics and machine learning techniques in groundwater characterization and quality forecasting

Mahamuda Abu , Rabiu Musah , Musah Saeed Zango
{"title":"A combination of multivariate statistics and machine learning techniques in groundwater characterization and quality forecasting","authors":"Mahamuda Abu ,&nbsp;Rabiu Musah ,&nbsp;Musah Saeed Zango","doi":"10.1016/j.geogeo.2024.100261","DOIUrl":null,"url":null,"abstract":"<div><p>Globally, the quality of groundwater has proven to have been affected by some natural and human activities in recent years. To ensure there is good drinking water (Sustainable Development Goal 6.3, there is a need to elucidate the groundwater quality status of the area of interest. The groundwater in the northwestern parts of Ghana is not yet well characterized. Hence, this study employed a multi-method approach of hydrochemistry, water quality index (WQI), multivariate statistics, and machine models: multiple linear regression (MLR), decision tree regression (DTR), random forest regression (RFR), and artificial neural network (ANN), are combined in the characterization and prediction of the water quality in the area. They are robust in providing conclusions on groundwater assessment that can be relied upon for decision-making processes regarding groundwater usage and monitoring. Except for NO<sub>3</sub><sup>−</sup> and TDS exceeding their standard levels in 22 and 2 locations, respectively, the other physicochemical parameters are within acceptable limits. The groundwater is generally good for domestic usage based on the WQI, with 79.2% of excellent to good waters. The groundwater evolved from Na-type, Cl-type, and Cl(SO<sub>4</sub>)-Ca(Mg) facies. Agricultural activities are the main source of human impact on the groundwater. Silicate mineral dissolution and ion exchange processes are the natural processes that affect groundwater mineralization, with mineral dissolution being the dominant process. Based on the performance metrics: MAE, MSE and RMSE of the ML methods considered in the WQI forecasting, the order of performance of the models is ANN &gt; RFR &gt; DTR &gt; MLR, with the following respective R<sup>2</sup> values 0.9974, 0.9193, 0.8966 and 0.8886.</p></div>","PeriodicalId":100582,"journal":{"name":"Geosystems and Geoenvironment","volume":"3 2","pages":"Article 100261"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772883824000116/pdfft?md5=d0b76476aa0d5c1b90a9c0c03f0f8e88&pid=1-s2.0-S2772883824000116-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geosystems and Geoenvironment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772883824000116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Globally, the quality of groundwater has proven to have been affected by some natural and human activities in recent years. To ensure there is good drinking water (Sustainable Development Goal 6.3, there is a need to elucidate the groundwater quality status of the area of interest. The groundwater in the northwestern parts of Ghana is not yet well characterized. Hence, this study employed a multi-method approach of hydrochemistry, water quality index (WQI), multivariate statistics, and machine models: multiple linear regression (MLR), decision tree regression (DTR), random forest regression (RFR), and artificial neural network (ANN), are combined in the characterization and prediction of the water quality in the area. They are robust in providing conclusions on groundwater assessment that can be relied upon for decision-making processes regarding groundwater usage and monitoring. Except for NO3 and TDS exceeding their standard levels in 22 and 2 locations, respectively, the other physicochemical parameters are within acceptable limits. The groundwater is generally good for domestic usage based on the WQI, with 79.2% of excellent to good waters. The groundwater evolved from Na-type, Cl-type, and Cl(SO4)-Ca(Mg) facies. Agricultural activities are the main source of human impact on the groundwater. Silicate mineral dissolution and ion exchange processes are the natural processes that affect groundwater mineralization, with mineral dissolution being the dominant process. Based on the performance metrics: MAE, MSE and RMSE of the ML methods considered in the WQI forecasting, the order of performance of the models is ANN > RFR > DTR > MLR, with the following respective R2 values 0.9974, 0.9193, 0.8966 and 0.8886.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在地下水特征描述和水质预测中结合使用多元统计和机器学习技术
近年来,全球范围内的地下水质量已被证明受到一些自然和人类活动的影响。为确保有良好的饮用水(可持续发展目标 6.3),有必要阐明相关地区的地下水质量状况。加纳西北部地区的地下水特征尚不明确。因此,本研究采用了水化学、水质指数 (WQI)、多元统计和机器模型(多元线性回归 (MLR)、决策树回归 (DTR)、随机森林回归 (RFR) 和人工神经网络 (ANN))等多种方法来描述和预测该地区的水质。这些方法都能提供可靠的地下水评估结论,可作为地下水使用和监测决策过程的依据。除 22 个地点的 NO3- 和 2 个地点的 TDS 超过标准水平外,其他理化参数均在可接受范围内。根据水质指数(WQI),79.2%的水域为优至良,地下水总体上适合家庭使用。地下水由 Na 型、Cl 型和 Cl(SO4)-Ca(Mg) 层演化而来。农业活动是人类影响地下水的主要来源。硅酸盐矿物溶解和离子交换过程是影响地下水矿化的自然过程,其中矿物溶解是最主要的过程。根据性能指标根据水质指数预测中考虑的 ML 方法的 MAE、MSE 和 RMSE,模型的性能顺序为 ANN > RFR > DTR > MLR,R2 值分别为 0.9974、0.9193、0.8966 和 0.8886。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.70
自引率
0.00%
发文量
0
期刊最新文献
Olivine composition of calcite-carbonatite from Sevvattur carbonatite alkaline complex, Dharmapuri Rift Zone, Southern Granulite Terrain, India Composition of olivines and spinel group minerals in aillikites from the Bushkanay dyke, South Siberian Craton: Insights into alkaline melt sources and evolution Petrology of ijolite xenoliths entrained in a nephelinite dyke from the Kamthai area, Late Cretaceous polychronous Sarnu-Dandali alkaline complex, North-West India: Evidence for recurrent magmatic pulses and magma mixing Petrogenetic and geochemical constraints on ca. 1.89–1.88 Ga Bastanar mafic dyke swarm, Bastar craton, India: Insights into MORB- and OIB-type contributions and interactions with metasomatized subcontinental lithospheric mantle Nd isotope systematics of Late Paleozoic granitoids from the Western Transbaikalia (Russia): Petrological consequences and plume model testing
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1