{"title":"Analysis and prediction of groundwater quality using machine learning algorithm for irrigation purposes","authors":"Hemant Raheja, Arun Goel, Mahesh Pal","doi":"10.1007/s12665-025-12173-9","DOIUrl":null,"url":null,"abstract":"<div><p>This study investigates the prediction of key irrigation water quality parameters—Sodium Adsorption Ratio (SAR), Magnesium Adsorption Ratio (MAR), Percent Sodium (%Na), Permeability Index (PI), and Kelly’s Index (KI) using hydrochemical data as input features. Groundwater quality data from 272 samples collected in the central western part of Haryana, India, were analyzed. To improve predictive accuracy, four machine learning models were employed: Random Forest (RF), Support Vector Regression (SVR), M5P, and Linear Regression (LR). Additionally, Principal Component Analysis (PCA) was conducted to identify underlying correlations between water quality parameters and to assess the influence of natural processes and anthropogenic activities, such as rock weathering and improper irrigation practices. The results indicated that PI and Magnesium Hazard (MH) values exceeded permissible limits for irrigation, while SAR, %Na, and KI were within acceptable ranges. The SVR and M5P models outperformed RF and LR in predictive accuracy, as supported by uncertainty analysis, which showed lower uncertainty for the former models. The findings highlight the potential of machine learning models, particularly SVR and M5P, to support decision-makers in managing irrigation water quality, offering a robust tool for sustainable water resource management in agricultural locations.</p></div>","PeriodicalId":542,"journal":{"name":"Environmental Earth Sciences","volume":"84 7","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Earth Sciences","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s12665-025-12173-9","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
This study investigates the prediction of key irrigation water quality parameters—Sodium Adsorption Ratio (SAR), Magnesium Adsorption Ratio (MAR), Percent Sodium (%Na), Permeability Index (PI), and Kelly’s Index (KI) using hydrochemical data as input features. Groundwater quality data from 272 samples collected in the central western part of Haryana, India, were analyzed. To improve predictive accuracy, four machine learning models were employed: Random Forest (RF), Support Vector Regression (SVR), M5P, and Linear Regression (LR). Additionally, Principal Component Analysis (PCA) was conducted to identify underlying correlations between water quality parameters and to assess the influence of natural processes and anthropogenic activities, such as rock weathering and improper irrigation practices. The results indicated that PI and Magnesium Hazard (MH) values exceeded permissible limits for irrigation, while SAR, %Na, and KI were within acceptable ranges. The SVR and M5P models outperformed RF and LR in predictive accuracy, as supported by uncertainty analysis, which showed lower uncertainty for the former models. The findings highlight the potential of machine learning models, particularly SVR and M5P, to support decision-makers in managing irrigation water quality, offering a robust tool for sustainable water resource management in agricultural locations.
期刊介绍:
Environmental Earth Sciences is an international multidisciplinary journal concerned with all aspects of interaction between humans, natural resources, ecosystems, special climates or unique geographic zones, and the earth:
Water and soil contamination caused by waste management and disposal practices
Environmental problems associated with transportation by land, air, or water
Geological processes that may impact biosystems or humans
Man-made or naturally occurring geological or hydrological hazards
Environmental problems associated with the recovery of materials from the earth
Environmental problems caused by extraction of minerals, coal, and ores, as well as oil and gas, water and alternative energy sources
Environmental impacts of exploration and recultivation – Environmental impacts of hazardous materials
Management of environmental data and information in data banks and information systems
Dissemination of knowledge on techniques, methods, approaches and experiences to improve and remediate the environment
In pursuit of these topics, the geoscientific disciplines are invited to contribute their knowledge and experience. Major disciplines include: hydrogeology, hydrochemistry, geochemistry, geophysics, engineering geology, remediation science, natural resources management, environmental climatology and biota, environmental geography, soil science and geomicrobiology.