Groundwater quality model through the combined use of principal component analysis (PCA) and multiple linear regression (MLR) Case study: Maturín’s aquifers, Monagas, Venezuela
{"title":"Groundwater quality model through the combined use of principal component analysis (PCA) and multiple linear regression (MLR) Case study: Maturín’s aquifers, Monagas, Venezuela","authors":"","doi":"10.26461/20.02","DOIUrl":null,"url":null,"abstract":"A set of data collected during 2016 was studied in eight wells in the city of Maturin, Monagas, Venezuela, in an attempt to evaluate and determine the contributions of the source that affect water quality. An accurate multiple linear regression (MRL) technique was used as an advanced tool for modeling and forecasting groundwater quality. Likewise, principal component analysis (PCA) was used to simplify and understand the complex relationship between water and quality parameters. Six main components responsible for 86.57% of the total variation were found. It was found that the main source of contamination of Maturin aquifers was the residual discharges with high values of fecal coliforms. An advanced receiver model was applied in order to identify the main sources of contamination. The result showed that the use of PCA as inputs improved the prediction of the MRL model by reducing its complexity and eliminating the collinearity of data, where the value of R2 in this study was 0.99, indicating that 99% of the variability of the water quality index (WQI) is explained by the eleven independent variables used in the model.","PeriodicalId":30552,"journal":{"name":"Innotec","volume":"14 9 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Innotec","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26461/20.02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A set of data collected during 2016 was studied in eight wells in the city of Maturin, Monagas, Venezuela, in an attempt to evaluate and determine the contributions of the source that affect water quality. An accurate multiple linear regression (MRL) technique was used as an advanced tool for modeling and forecasting groundwater quality. Likewise, principal component analysis (PCA) was used to simplify and understand the complex relationship between water and quality parameters. Six main components responsible for 86.57% of the total variation were found. It was found that the main source of contamination of Maturin aquifers was the residual discharges with high values of fecal coliforms. An advanced receiver model was applied in order to identify the main sources of contamination. The result showed that the use of PCA as inputs improved the prediction of the MRL model by reducing its complexity and eliminating the collinearity of data, where the value of R2 in this study was 0.99, indicating that 99% of the variability of the water quality index (WQI) is explained by the eleven independent variables used in the model.