Hirak Mazumdar, M. P. Murphy, Shilpa Bhatkande, H. Emerson, D. Kaplan, Hardik A. Gohel
{"title":"预测地下水污染的优化机器学习模型","authors":"Hirak Mazumdar, M. P. Murphy, Shilpa Bhatkande, H. Emerson, D. Kaplan, Hardik A. Gohel","doi":"10.1109/MetroCon56047.2022.9971133","DOIUrl":null,"url":null,"abstract":"The use of physical models to predict groundwater contaminant movement remains technically challenging due to the complexity of the phenomena, the heterogeneity of key parameters in nature, and the presence of poorly defined interactive and feedback processes. New approaches to address these challenges are needed. In this study, we evaluate various Artificial Intelligence (AI)-based approaches to understand hexavalent chromium (Cr(VI)) plumes located on the U.S. Department of Energy’s (DOE) Hanford Site in Richland, WA. The groundwater monitoring dataset used in this study included data from the 100 Area along the Columbia River and included data collected between 2010 to 2019. This study investigates the most prominent contaminant, Cr(VI), with the Extreme Gradient Boosting (XGBoost) machine learning model. The XGBoost model was compared with an optimized version using an Empirical Bayes Search Cross-Validation technique for better prediction. The optimized XGBoost model yielded an R2 value of 0.99 on the training set and 0.85 on the testing set, whereas X G B Boost without optimization yielded a value of 0.83 on the training set and 0.73 on the testing set. This paper provides an overview of a computational method for groundwater contamination modeling that shows promise for improving current remediation efforts.","PeriodicalId":292881,"journal":{"name":"2022 IEEE MetroCon","volume":"41 20","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Optimized Machine Learning Model for Predicting Groundwater Contamination\",\"authors\":\"Hirak Mazumdar, M. P. Murphy, Shilpa Bhatkande, H. Emerson, D. Kaplan, Hardik A. Gohel\",\"doi\":\"10.1109/MetroCon56047.2022.9971133\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of physical models to predict groundwater contaminant movement remains technically challenging due to the complexity of the phenomena, the heterogeneity of key parameters in nature, and the presence of poorly defined interactive and feedback processes. New approaches to address these challenges are needed. In this study, we evaluate various Artificial Intelligence (AI)-based approaches to understand hexavalent chromium (Cr(VI)) plumes located on the U.S. Department of Energy’s (DOE) Hanford Site in Richland, WA. The groundwater monitoring dataset used in this study included data from the 100 Area along the Columbia River and included data collected between 2010 to 2019. This study investigates the most prominent contaminant, Cr(VI), with the Extreme Gradient Boosting (XGBoost) machine learning model. The XGBoost model was compared with an optimized version using an Empirical Bayes Search Cross-Validation technique for better prediction. The optimized XGBoost model yielded an R2 value of 0.99 on the training set and 0.85 on the testing set, whereas X G B Boost without optimization yielded a value of 0.83 on the training set and 0.73 on the testing set. This paper provides an overview of a computational method for groundwater contamination modeling that shows promise for improving current remediation efforts.\",\"PeriodicalId\":292881,\"journal\":{\"name\":\"2022 IEEE MetroCon\",\"volume\":\"41 20\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE MetroCon\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MetroCon56047.2022.9971133\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE MetroCon","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MetroCon56047.2022.9971133","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Optimized Machine Learning Model for Predicting Groundwater Contamination
The use of physical models to predict groundwater contaminant movement remains technically challenging due to the complexity of the phenomena, the heterogeneity of key parameters in nature, and the presence of poorly defined interactive and feedback processes. New approaches to address these challenges are needed. In this study, we evaluate various Artificial Intelligence (AI)-based approaches to understand hexavalent chromium (Cr(VI)) plumes located on the U.S. Department of Energy’s (DOE) Hanford Site in Richland, WA. The groundwater monitoring dataset used in this study included data from the 100 Area along the Columbia River and included data collected between 2010 to 2019. This study investigates the most prominent contaminant, Cr(VI), with the Extreme Gradient Boosting (XGBoost) machine learning model. The XGBoost model was compared with an optimized version using an Empirical Bayes Search Cross-Validation technique for better prediction. The optimized XGBoost model yielded an R2 value of 0.99 on the training set and 0.85 on the testing set, whereas X G B Boost without optimization yielded a value of 0.83 on the training set and 0.73 on the testing set. This paper provides an overview of a computational method for groundwater contamination modeling that shows promise for improving current remediation efforts.