Advances in Land Surface Model-based Forecasting: A comparative study of LSTM, Gradient Boosting, and Feedforward Neural Network Models as prognostic state emulators

Marieke Wesselkamp, Matthew Chantry, Ewan Pinnington, Margarita Choulga, Souhail Boussetta, Maria Kalweit, Joschka Boedecker, Carsten F. Dormann, Florian Pappenberger, Gianpaolo Balsamo
{"title":"Advances in Land Surface Model-based Forecasting: A comparative study of LSTM, Gradient Boosting, and Feedforward Neural Network Models as prognostic state emulators","authors":"Marieke Wesselkamp, Matthew Chantry, Ewan Pinnington, Margarita Choulga, Souhail Boussetta, Maria Kalweit, Joschka Boedecker, Carsten F. Dormann, Florian Pappenberger, Gianpaolo Balsamo","doi":"arxiv-2407.16463","DOIUrl":null,"url":null,"abstract":"Most useful weather prediction for the public is near the surface. The\nprocesses that are most relevant for near-surface weather prediction are also\nthose that are most interactive and exhibit positive feedback or have key role\nin energy partitioning. Land surface models (LSMs) consider these processes\ntogether with surface heterogeneity and forecast water, carbon and energy\nfluxes, and coupled with an atmospheric model provide boundary and initial\nconditions. This numerical parametrization of atmospheric boundaries being\ncomputationally expensive, statistical surrogate models are increasingly used\nto accelerated progress in experimental research. We evaluated the efficiency\nof three surrogate models in speeding up experimental research by simulating\nland surface processes, which are integral to forecasting water, carbon, and\nenergy fluxes in coupled atmospheric models. Specifically, we compared the\nperformance of a Long-Short Term Memory (LSTM) encoder-decoder network, extreme\ngradient boosting, and a feed-forward neural network within a physics-informed\nmulti-objective framework. This framework emulates key states of the ECMWF's\nIntegrated Forecasting System (IFS) land surface scheme, ECLand, across\ncontinental and global scales. Our findings indicate that while all models on\naverage demonstrate high accuracy over the forecast period, the LSTM network\nexcels in continental long-range predictions when carefully tuned, the XGB\nscores consistently high across tasks and the MLP provides an excellent\nimplementation-time-accuracy trade-off. The runtime reduction achieved by the\nemulators in comparison to the full numerical models are significant, offering\na faster, yet reliable alternative for conducting numerical experiments on land\nsurfaces.","PeriodicalId":501166,"journal":{"name":"arXiv - PHYS - Atmospheric and Oceanic Physics","volume":"50 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Atmospheric and Oceanic Physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.16463","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Most useful weather prediction for the public is near the surface. The processes that are most relevant for near-surface weather prediction are also those that are most interactive and exhibit positive feedback or have key role in energy partitioning. Land surface models (LSMs) consider these processes together with surface heterogeneity and forecast water, carbon and energy fluxes, and coupled with an atmospheric model provide boundary and initial conditions. This numerical parametrization of atmospheric boundaries being computationally expensive, statistical surrogate models are increasingly used to accelerated progress in experimental research. We evaluated the efficiency of three surrogate models in speeding up experimental research by simulating land surface processes, which are integral to forecasting water, carbon, and energy fluxes in coupled atmospheric models. Specifically, we compared the performance of a Long-Short Term Memory (LSTM) encoder-decoder network, extreme gradient boosting, and a feed-forward neural network within a physics-informed multi-objective framework. This framework emulates key states of the ECMWF's Integrated Forecasting System (IFS) land surface scheme, ECLand, across continental and global scales. Our findings indicate that while all models on average demonstrate high accuracy over the forecast period, the LSTM network excels in continental long-range predictions when carefully tuned, the XGB scores consistently high across tasks and the MLP provides an excellent implementation-time-accuracy trade-off. The runtime reduction achieved by the emulators in comparison to the full numerical models are significant, offering a faster, yet reliable alternative for conducting numerical experiments on land surfaces.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于地表模型的预报研究进展:将 LSTM、梯度提升和前馈神经网络模型作为预报状态模拟器的比较研究
对公众最有用的天气预报是近地表天气预报。与近地表天气预报最相关的过程也是那些互动性最强、表现出正反馈或在能量分配中起关键作用的过程。地表模式(LSM)将这些过程与地表异质性一起考虑,并预测水、碳和能量流,同时与大气模式相结合,提供边界和初始条件。由于大气边界的数值参数化计算成本高昂,因此越来越多地使用统计代用模型来加速实验研究的进展。我们评估了三种代用模式通过模拟陆地表面过程在加速实验研究方面的效率,陆地表面过程是预测耦合大气模式中水、碳和能量通量不可或缺的部分。具体来说,我们比较了长短期记忆(LSTM)编码器-解码器网络、极梯度提升和前馈神经网络在物理信息多目标框架内的性能。该框架模拟了 ECMWF 的综合预报系统(IFS)陆地表面方案 ECLand 在大陆和全球尺度上的关键状态。我们的研究结果表明,虽然所有模型在预测期内平均都表现出很高的准确性,但 LSTM 网络在经过精心调整后在大陆长程预测中表现出色,XGB 在所有任务中始终保持高分,而 MLP 则在实施时间与准确性之间实现了很好的权衡。与完整的数值模型相比,这些计算器的运行时间显著缩短,为开展地表数值实验提供了更快、更可靠的替代方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Harnessing AI data-driven global weather models for climate attribution: An analysis of the 2017 Oroville Dam extreme atmospheric river Super Resolution On Global Weather Forecasts Can Transfer Learning be Used to Identify Tropical State-Dependent Bias Relevant to Midlatitude Subseasonal Predictability? Using Generative Models to Produce Realistic Populations of the United Kingdom Windstorms Integrated nowcasting of convective precipitation with Transformer-based models using multi-source data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1