An ensemble optimizer with a stacking ensemble surrogate model for identification of groundwater contamination source

IF 3.5 3区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES Journal of contaminant hydrology Pub Date : 2024-09-24 DOI:10.1016/j.jconhyd.2024.104437
Liuzhi Zhu , Wenxi Lu , Chengming Luo , Yaning Xu , Zibo Wang
{"title":"An ensemble optimizer with a stacking ensemble surrogate model for identification of groundwater contamination source","authors":"Liuzhi Zhu ,&nbsp;Wenxi Lu ,&nbsp;Chengming Luo ,&nbsp;Yaning Xu ,&nbsp;Zibo Wang","doi":"10.1016/j.jconhyd.2024.104437","DOIUrl":null,"url":null,"abstract":"<div><div>The application of the simulation-optimization method for groundwater contamination source identification (GCSI) encounters two main challenges: the substantial time cost of calling the simulation model, and the limitations on the accuracy of identification results due to the complexity, nonlinearity, and ill-posed nature of the inverse problem. To address these issues, we have innovatively developed an inversion framework based on ensemble learning strategies. This framework comprises a stacking ensemble model (SEM), which integrates three distinct machine learning models (Extremely Randomized Trees, Adaptive Boosting, and Bidirectional Gated Recurrent Unit), and an ensemble optimizer (<em>E</em>-GKSEEFO), which combines two newly proposed swarm intelligence optimizers (Genghis Khan Shark Optimizer and Electric Eel Foraging Optimizer). Specifically, the SEM serves as a surrogate model for the groundwater numerical simulation model. Compared to the original simulation model, it significantly reduces time cost while maintaining accuracy. The <em>E</em>-GKSEEFO, functioning as the search strategy for the optimization model, greatly enhances the accuracy of the optimization results. We have verified the performance of the SEM-<em>E</em>-GKSEEFO ensemble inversion framework through two hypothetical scenarios derived from an actual coal gangue pile. The results are as follows. (1) The SEM exhibits improved fitting performance compared to single machine learning models when dealing with high-dimensional nonlinear data from GCSI. (2) The <em>E</em>-GKSEEFO achieves significantly higher accuracy in the identification results of GCSI than individual optimizers. These findings affirm the effectiveness and superiority of the proposed SEM-<em>E</em>-GKSEEFO ensemble inversion framework.</div></div>","PeriodicalId":15530,"journal":{"name":"Journal of contaminant hydrology","volume":"267 ","pages":"Article 104437"},"PeriodicalIF":3.5000,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of contaminant hydrology","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169772224001414","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

The application of the simulation-optimization method for groundwater contamination source identification (GCSI) encounters two main challenges: the substantial time cost of calling the simulation model, and the limitations on the accuracy of identification results due to the complexity, nonlinearity, and ill-posed nature of the inverse problem. To address these issues, we have innovatively developed an inversion framework based on ensemble learning strategies. This framework comprises a stacking ensemble model (SEM), which integrates three distinct machine learning models (Extremely Randomized Trees, Adaptive Boosting, and Bidirectional Gated Recurrent Unit), and an ensemble optimizer (E-GKSEEFO), which combines two newly proposed swarm intelligence optimizers (Genghis Khan Shark Optimizer and Electric Eel Foraging Optimizer). Specifically, the SEM serves as a surrogate model for the groundwater numerical simulation model. Compared to the original simulation model, it significantly reduces time cost while maintaining accuracy. The E-GKSEEFO, functioning as the search strategy for the optimization model, greatly enhances the accuracy of the optimization results. We have verified the performance of the SEM-E-GKSEEFO ensemble inversion framework through two hypothetical scenarios derived from an actual coal gangue pile. The results are as follows. (1) The SEM exhibits improved fitting performance compared to single machine learning models when dealing with high-dimensional nonlinear data from GCSI. (2) The E-GKSEEFO achieves significantly higher accuracy in the identification results of GCSI than individual optimizers. These findings affirm the effectiveness and superiority of the proposed SEM-E-GKSEEFO ensemble inversion framework.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用堆叠集合代用模型识别地下水污染源的集合优化器
模拟优化法在地下水污染源识别(GCSI)中的应用遇到了两个主要挑战:调用模拟模型的时间成本很高,以及由于反演问题的复杂性、非线性和多拟性,识别结果的准确性受到限制。为了解决这些问题,我们创新性地开发了一种基于集合学习策略的反演框架。该框架由一个堆叠集合模型(SEM)和一个集合优化器(E-GKSEEFO)组成,前者集成了三个不同的机器学习模型(极随机树、自适应提升和双向门控循环单元),后者则结合了两个新提出的蜂群智能优化器(成吉思汗鲨鱼优化器和电鳗觅食优化器)。具体而言,SEM 可作为地下水数值模拟模型的替代模型。与原始模拟模型相比,它在保持精度的同时大大减少了时间成本。E-GKSEEFO 作为优化模型的搜索策略,大大提高了优化结果的准确性。我们通过两个源自实际煤矸石堆的假设场景验证了 SEM-E-GKSEEFO 集合反演框架的性能。结果如下(1) 在处理来自 GCSI 的高维非线性数据时,与单一机器学习模型相比,SEM 的拟合性能有所提高。(2) E-GKSEEFO 对 GCSI 识别结果的准确性明显高于单个优化器。这些发现肯定了所提出的 SEM-E-GKSEEFO 集合反演框架的有效性和优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of contaminant hydrology
Journal of contaminant hydrology 环境科学-地球科学综合
CiteScore
6.80
自引率
2.80%
发文量
129
审稿时长
68 days
期刊介绍: The Journal of Contaminant Hydrology is an international journal publishing scientific articles pertaining to the contamination of subsurface water resources. Emphasis is placed on investigations of the physical, chemical, and biological processes influencing the behavior and fate of organic and inorganic contaminants in the unsaturated (vadose) and saturated (groundwater) zones, as well as at groundwater-surface water interfaces. The ecological impacts of contaminants transported both from and to aquifers are of interest. Articles on contamination of surface water only, without a link to groundwater, are out of the scope. Broad latitude is allowed in identifying contaminants of interest, and include legacy and emerging pollutants, nutrients, nanoparticles, pathogenic microorganisms (e.g., bacteria, viruses, protozoa), microplastics, and various constituents associated with energy production (e.g., methane, carbon dioxide, hydrogen sulfide). The journal''s scope embraces a wide range of topics including: experimental investigations of contaminant sorption, diffusion, transformation, volatilization and transport in the surface and subsurface; characterization of soil and aquifer properties only as they influence contaminant behavior; development and testing of mathematical models of contaminant behaviour; innovative techniques for restoration of contaminated sites; development of new tools or techniques for monitoring the extent of soil and groundwater contamination; transformation of contaminants in the hyporheic zone; effects of contaminants traversing the hyporheic zone on surface water and groundwater ecosystems; subsurface carbon sequestration and/or turnover; and migration of fluids associated with energy production into groundwater.
期刊最新文献
Sorption behavior of oxytetracycline on microplastics and the influence of environmental factors in groundwater: Experimental investigation and molecular dynamics simulation. Optimizing coastal groundwater quality predictions: A novel data mining framework with cross-validation, bootstrapping, and entropy analysis. Comparison of adsorption capacity of 4-Nonylphenol on conventional and biodegradable microplastics aged under natural water. First evidence of microplastics in the Quilca-Vítor-Chili river basin, Arequipa region, Peru. Characterization and risk assessment of microplastics pollution in Mohamaya Lake, Bangladesh.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1