A simple and effective approach to investigate the dominant contaminant sources and accuracy in water quality estimation through Monte Carlo simulation, Gaussian Mixture Models (GMMs), and GIS machine learning methods
{"title":"A simple and effective approach to investigate the dominant contaminant sources and accuracy in water quality estimation through Monte Carlo simulation, Gaussian Mixture Models (GMMs), and GIS machine learning methods","authors":"Mobarok Hossain","doi":"10.1016/j.ecolind.2025.113188","DOIUrl":null,"url":null,"abstract":"<div><div>Water quality variables, linked with natural and human activities, significantly impact groundwater quality. Assessing these links and impacts is a complex method, but it can be simplified using innovative approaches like Monte Carlo simulation and Gaussian Mixture Models (GMMs). This approach was applied by examining data on water quality parameters for the years 2009 and 2021 in varied hydrogeology in West Bengal, India. Geospatial distribution of water quality was analyzed using 479 samples in 2009 and 734 samples in 2021 through a recently suggested Water Pollution Index (WPI). The WPI uncovered a deterioration in the quality of water as the percentage of samples categorized as “excellent” decreased from 30.5 % to 28 %, while “polluted” samples increased from 44 % to 45 %. Statistical analyses, including analysis of variance (ANOVA) for yearly comparisons and Friedman tests for variable group comparisons, showed significant differences (p < 0.0001) in both analyses. Sensitivity analysis identified total iron as the predominant factor influencing groundwater quality in both years, along with significant contributions from fluoride, electrical conductivity (EC), total dissolved solids (TDS), magnesium, and nitrate. Uncertainty analysis revealed a shift in the WPI distribution centre from 0.45 to 0.9 in 2009 to 0.5–1.1 in 2021, with shorter probability tails suggesting more consistent water quality measurements in 2021 compared to 2009. Cluster-wise source apportionment for groundwater contaminants using machine learning-based Gaussian Mixture Models (GMMs) suggested that various factors influenced the groundwater quality in the study region. This study highlights the potentiality of applying the Monte Carlo simulation to simply find out the drivers of groundwater pollution as well as linking the GMMs with geospatial distribution, which could be a simple, innovative and best solution to take targeted actions to address the issue and manage water quality effectively.</div></div>","PeriodicalId":11459,"journal":{"name":"Ecological Indicators","volume":"171 ","pages":"Article 113188"},"PeriodicalIF":7.0000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Indicators","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1470160X25001177","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Water quality variables, linked with natural and human activities, significantly impact groundwater quality. Assessing these links and impacts is a complex method, but it can be simplified using innovative approaches like Monte Carlo simulation and Gaussian Mixture Models (GMMs). This approach was applied by examining data on water quality parameters for the years 2009 and 2021 in varied hydrogeology in West Bengal, India. Geospatial distribution of water quality was analyzed using 479 samples in 2009 and 734 samples in 2021 through a recently suggested Water Pollution Index (WPI). The WPI uncovered a deterioration in the quality of water as the percentage of samples categorized as “excellent” decreased from 30.5 % to 28 %, while “polluted” samples increased from 44 % to 45 %. Statistical analyses, including analysis of variance (ANOVA) for yearly comparisons and Friedman tests for variable group comparisons, showed significant differences (p < 0.0001) in both analyses. Sensitivity analysis identified total iron as the predominant factor influencing groundwater quality in both years, along with significant contributions from fluoride, electrical conductivity (EC), total dissolved solids (TDS), magnesium, and nitrate. Uncertainty analysis revealed a shift in the WPI distribution centre from 0.45 to 0.9 in 2009 to 0.5–1.1 in 2021, with shorter probability tails suggesting more consistent water quality measurements in 2021 compared to 2009. Cluster-wise source apportionment for groundwater contaminants using machine learning-based Gaussian Mixture Models (GMMs) suggested that various factors influenced the groundwater quality in the study region. This study highlights the potentiality of applying the Monte Carlo simulation to simply find out the drivers of groundwater pollution as well as linking the GMMs with geospatial distribution, which could be a simple, innovative and best solution to take targeted actions to address the issue and manage water quality effectively.
期刊介绍:
The ultimate aim of Ecological Indicators is to integrate the monitoring and assessment of ecological and environmental indicators with management practices. The journal provides a forum for the discussion of the applied scientific development and review of traditional indicator approaches as well as for theoretical, modelling and quantitative applications such as index development. Research into the following areas will be published.
• All aspects of ecological and environmental indicators and indices.
• New indicators, and new approaches and methods for indicator development, testing and use.
• Development and modelling of indices, e.g. application of indicator suites across multiple scales and resources.
• Analysis and research of resource, system- and scale-specific indicators.
• Methods for integration of social and other valuation metrics for the production of scientifically rigorous and politically-relevant assessments using indicator-based monitoring and assessment programs.
• How research indicators can be transformed into direct application for management purposes.
• Broader assessment objectives and methods, e.g. biodiversity, biological integrity, and sustainability, through the use of indicators.
• Resource-specific indicators such as landscape, agroecosystems, forests, wetlands, etc.