Geostatistical analyses empowered with gradient boosting and extra trees classifier algorithms in the prediction of groundwater quality and geology-lithology attributes over YSR district, India
{"title":"Geostatistical analyses empowered with gradient boosting and extra trees classifier algorithms in the prediction of groundwater quality and geology-lithology attributes over YSR district, India","authors":"Mogaraju Jagadish Kumar","doi":"10.1504/ijhst.2023.134621","DOIUrl":null,"url":null,"abstract":"Machine learning classifiers are integrated with the geostatistical analyses through interpolation techniques to predict groundwater quality and geology-lithology. Ordinary kriging is used to obtain the optimal interpolation model using RMSSE values. The data extracted from the surface maps were passed onto ML algorithms, resulting in prediction accuracies of 99% for groundwater quality and 96% in predicting the geology-lithology features. There was certain overfitting in the prediction and it was probably due to several classes of geology-lithology variables and limited data available for analysis. The interpolation methods might affect the model performance by producing overfitting and underfitting results. It is to report that the gradient boosting classifier yielded relatively high prediction accuracies in predicting groundwater quality when two classes were used. The extra trees classifier returned better results in predicting geology-lithology features when multiple classes were used in this study.","PeriodicalId":0,"journal":{"name":"","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/ijhst.2023.134621","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Machine learning classifiers are integrated with the geostatistical analyses through interpolation techniques to predict groundwater quality and geology-lithology. Ordinary kriging is used to obtain the optimal interpolation model using RMSSE values. The data extracted from the surface maps were passed onto ML algorithms, resulting in prediction accuracies of 99% for groundwater quality and 96% in predicting the geology-lithology features. There was certain overfitting in the prediction and it was probably due to several classes of geology-lithology variables and limited data available for analysis. The interpolation methods might affect the model performance by producing overfitting and underfitting results. It is to report that the gradient boosting classifier yielded relatively high prediction accuracies in predicting groundwater quality when two classes were used. The extra trees classifier returned better results in predicting geology-lithology features when multiple classes were used in this study.