When ancient numerical demons meet physics-informed machine learning: adjoint-based gradients for implicit differentiable modeling

IF 5.7 1区 地球科学 Q1 GEOSCIENCES, MULTIDISCIPLINARY Hydrology and Earth System Sciences Pub Date : 2024-07-15 DOI:10.5194/hess-28-3051-2024
Yalan Song, W. Knoben, Martyn P. Clark, D. Feng, K. Lawson, Kamlesh Sawadekar, Chaopeng Shen
{"title":"When ancient numerical demons meet physics-informed machine learning: adjoint-based gradients for implicit differentiable modeling","authors":"Yalan Song, W. Knoben, Martyn P. Clark, D. Feng, K. Lawson, Kamlesh Sawadekar, Chaopeng Shen","doi":"10.5194/hess-28-3051-2024","DOIUrl":null,"url":null,"abstract":"Abstract. Recent advances in differentiable modeling, a genre of physics-informed machine learning that trains neural networks (NNs) together with process-based equations, have shown promise in enhancing hydrological models' accuracy, interpretability, and knowledge-discovery potential. Current differentiable models are efficient for NN-based parameter regionalization, but the simple explicit numerical schemes paired with sequential calculations (operator splitting) can incur numerical errors whose impacts on models' representation power and learned parameters are not clear. Implicit schemes, however, cannot rely on automatic differentiation to calculate gradients due to potential issues of gradient vanishing and memory demand. Here we propose a “discretize-then-optimize” adjoint method to enable differentiable implicit numerical schemes for the first time for large-scale hydrological modeling. The adjoint model demonstrates comprehensively improved performance, with Kling–Gupta efficiency coefficients, peak-flow and low-flow metrics, and evapotranspiration that moderately surpass the already-competitive explicit model. Therefore, the previous sequential-calculation approach had a detrimental impact on the model's ability to represent hydrological dynamics. Furthermore, with a structural update that describes capillary rise, the adjoint model can better describe baseflow in arid regions and also produce low flows that outperform even pure machine learning methods such as long short-term memory networks. The adjoint model rectified some parameter distortions but did not alter spatial parameter distributions, demonstrating the robustness of regionalized parameterization. Despite higher computational expenses and modest improvements, the adjoint model's success removes the barrier for complex implicit schemes to enrich differentiable modeling in hydrology.\n","PeriodicalId":13143,"journal":{"name":"Hydrology and Earth System Sciences","volume":null,"pages":null},"PeriodicalIF":5.7000,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Hydrology and Earth System Sciences","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.5194/hess-28-3051-2024","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract. Recent advances in differentiable modeling, a genre of physics-informed machine learning that trains neural networks (NNs) together with process-based equations, have shown promise in enhancing hydrological models' accuracy, interpretability, and knowledge-discovery potential. Current differentiable models are efficient for NN-based parameter regionalization, but the simple explicit numerical schemes paired with sequential calculations (operator splitting) can incur numerical errors whose impacts on models' representation power and learned parameters are not clear. Implicit schemes, however, cannot rely on automatic differentiation to calculate gradients due to potential issues of gradient vanishing and memory demand. Here we propose a “discretize-then-optimize” adjoint method to enable differentiable implicit numerical schemes for the first time for large-scale hydrological modeling. The adjoint model demonstrates comprehensively improved performance, with Kling–Gupta efficiency coefficients, peak-flow and low-flow metrics, and evapotranspiration that moderately surpass the already-competitive explicit model. Therefore, the previous sequential-calculation approach had a detrimental impact on the model's ability to represent hydrological dynamics. Furthermore, with a structural update that describes capillary rise, the adjoint model can better describe baseflow in arid regions and also produce low flows that outperform even pure machine learning methods such as long short-term memory networks. The adjoint model rectified some parameter distortions but did not alter spatial parameter distributions, demonstrating the robustness of regionalized parameterization. Despite higher computational expenses and modest improvements, the adjoint model's success removes the barrier for complex implicit schemes to enrich differentiable modeling in hydrology.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
当古老的数字恶魔遇到物理信息机器学习:基于邻接梯度的隐式可微建模
摘要可微分建模是一种物理信息机器学习流派,它将神经网络(NN)与基于过程的方程一起进行训练,可微分建模的最新进展表明,可微分建模有望提高水文模型的精度、可解释性和知识发现潜力。目前的可微分模型对于基于神经网络的参数区域化非常有效,但简单的显式数值方案与顺序计算(算子拆分)搭配会产生数值误差,而这些误差对模型的表示力和学习参数的影响尚不清楚。然而,由于梯度消失和内存需求等潜在问题,隐式方案无法依靠自动微分来计算梯度。在此,我们提出了一种 "离散化-优化 "邻接法,首次将可微分隐式数值方案用于大规模水文建模。该辅助模型的性能得到了全面提升,其克林-古普塔效率系数、峰值流量和低流量指标以及蒸散量都适度超过了已有竞争力的显式模型。因此,之前的顺序计算方法对模型表现水文动态的能力产生了不利影响。此外,通过描述毛细管上升的结构更新,该模型可以更好地描述干旱地区的基流,其产生的低流量甚至优于纯机器学习方法(如长短期记忆网络)。邻接模型纠正了一些参数失真,但没有改变空间参数分布,证明了区域化参数化的稳健性。尽管计算成本较高,改进幅度也不大,但邻接模型的成功消除了复杂隐式方案的障碍,丰富了水文学中的可微分建模。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Hydrology and Earth System Sciences
Hydrology and Earth System Sciences 地学-地球科学综合
CiteScore
10.10
自引率
7.90%
发文量
273
审稿时长
15 months
期刊介绍: Hydrology and Earth System Sciences (HESS) is a not-for-profit international two-stage open-access journal for the publication of original research in hydrology. HESS encourages and supports fundamental and applied research that advances the understanding of hydrological systems, their role in providing water for ecosystems and society, and the role of the water cycle in the functioning of the Earth system. A multi-disciplinary approach is encouraged that broadens the hydrological perspective and the advancement of hydrological science through integration with other cognate sciences and cross-fertilization across disciplinary boundaries.
期刊最新文献
Bimodal hydrographs in a semi-humid forested watershed: characteristics and occurrence conditions A decomposition approach to evaluating the local performance of global streamflow reanalysis Improving runoff simulation in the Western United States with Noah-MP and variable infiltration capacity Regional patterns and drivers of modelled water flows along environmental, functional, and stand structure gradients in Spanish forests When ancient numerical demons meet physics-informed machine learning: adjoint-based gradients for implicit differentiable modeling
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1