Machine learning surrogates for efficient hydrologic modeling: Insights from stochastic simulations of managed aquifer recharge

Timothy Dai, Kate Maher, Zach Perzan
arXiv - PHYS - Geophysics, published 2024-07-30
DOI: arxiv-2407.20902 (https://doi.org/arxiv-2407.20902)

Abstract

Process-based hydrologic models are invaluable tools for understanding the terrestrial water cycle and addressing modern water resources problems. However, many hydrologic models are computationally expensive and, depending on the resolution and scale, simulations can take on the order of hours to days to complete. While techniques such as uncertainty quantification and optimization have become valuable tools for supporting management decisions, these analyses typically require hundreds of model simulations, which are too computationally expensive to perform with a process-based hydrologic model. To address this gap, we propose a hybrid modeling workflow in which a process-based model is used to generate an initial set of simulations and a machine learning (ML) surrogate model is then trained to perform the remaining simulations required for downstream analysis. As a case study, we apply this workflow to simulations of variably saturated groundwater flow at a prospective managed aquifer recharge (MAR) site. We compare the accuracy and computational efficiency of several ML architectures, including deep convolutional networks, recurrent neural networks, vision transformers, and networks with Fourier transforms. Our results demonstrate that ML surrogate models can achieve under 10% mean absolute percentage error and yield order-of-magnitude runtime savings over process-based models. We also offer practical recommendations for training hydrologic surrogate models, including implementing data normalization to improve accuracy, using a normalized loss function to improve training stability, and downsampling input features to decrease memory requirements.
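
The three training recommendations and the error metric named in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulations, so everything below is an assumption made for illustration: z-score scaling stands in for "data normalization", a mean squared error divided by the target variance stands in for the "normalized loss function", and block averaging stands in for "downsampling input features". All function names are hypothetical, and the code uses only the Python standard library rather than any ML framework.

```python
from statistics import fmean, pstdev


def zscore(values):
    """Scale values to zero mean and unit variance (assumed normalization).

    Returns the scaled values plus the mean and standard deviation so that
    surrogate outputs can be mapped back to physical units.
    """
    mu = fmean(values)
    sigma = pstdev(values) or 1.0  # avoid dividing by zero for constant fields
    return [(v - mu) / sigma for v in values], mu, sigma


def normalized_mse(pred, target):
    """MSE divided by the target variance (assumed form of a normalized loss).

    A value of 1.0 means the prediction is no better than the target mean.
    """
    mse = fmean([(p - t) ** 2 for p, t in zip(pred, target)])
    variance = pstdev(target) ** 2 or 1.0
    return mse / variance


def mape(pred, target):
    """Mean absolute percentage error in percent; targets must be nonzero."""
    return 100.0 * fmean([abs((p - t) / t) for p, t in zip(pred, target)])


def downsample(grid, factor):
    """Block-average a 2D grid (list of rows) by an integer factor.

    Rows or columns that do not fill a complete block are dropped.
    """
    rows, cols = len(grid), len(grid[0])
    return [
        [
            fmean([grid[r + dr][c + dc]
                   for dr in range(factor)
                   for dc in range(factor)])
            for c in range(0, cols - factor + 1, factor)
        ]
        for r in range(0, rows - factor + 1, factor)
    ]


if __name__ == "__main__":
    # Hypothetical 4x4 input feature map, reduced to 2x2 to save memory.
    field = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
    print(downsample(field, 2))

    # Hypothetical target and surrogate prediction in the same units.
    target = [100.0, 200.0, 400.0]
    pred = [110.0, 190.0, 380.0]
    print(normalized_mse(pred, target), mape(pred, target))
```

Dividing the loss by the target variance makes its scale independent of the units of the simulated variable, which is one plausible reason a normalized loss would stabilize training across variables of very different magnitude.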