Improved machine learning estimation of surface turbulent flux using interpretable model selection and adaptive ensemble algorithms over the Horqin Sandy Land area

IF 4.4 2区 地球科学 Q1 METEOROLOGY & ATMOSPHERIC SCIENCES Atmospheric Research Pub Date : 2025-04-15 Epub Date: 2025-01-26 DOI:10.1016/j.atmosres.2025.107952
Jing Zhao , Yiyi Guo , Hongsheng Zhang , Yihua Lin , Feng Liu , Zhenhai Guo
{"title":"Improved machine learning estimation of surface turbulent flux using interpretable model selection and adaptive ensemble algorithms over the Horqin Sandy Land area","authors":"Jing Zhao ,&nbsp;Yiyi Guo ,&nbsp;Hongsheng Zhang ,&nbsp;Yihua Lin ,&nbsp;Feng Liu ,&nbsp;Zhenhai Guo","doi":"10.1016/j.atmosres.2025.107952","DOIUrl":null,"url":null,"abstract":"<div><div>The turbulent exchanges between the land surface and atmosphere, crucial for global climate change and atmospheric circulation, are typically represented through bulk formulae based on Monin-Obukhov similarity theory (MOST), using simple regression as a function of the non-dimensional stability parameter derived from limited field experiments, which leaves large uncertainties. Recently, machine learning is anticipated as an alternative or complement to bulk algorithms, leveraging its ability to detect nonlinear relationships in large datasets without constraints from the similarity relationships and self-correlations prescribed in MOST. However, there are still unresolved problems and gaps, even though common models like random forest and neural networks can be directly applied. This study proposes a hybrid approach for improved estimation of surface turbulent flux, consisting of meta-learner estimation, interpretable model selection, and adaptive model integration. Motivated by understanding how different machine learning algorithms perform as surface-layer flux estimators and further exploring how to utilize results from multiple meta-learners for better estimations, the method starts with eight different machine learning algorithms. Then, a combination of Elastic Net and Shapley Additive Explanations is developed as an interpretable model selection module, followed by an adaptive model integration using AdaBoost and extreme learning machine. Experiments at the continuous observation station in the Horqin Sandy Land area, Inner Mongolia, China, demonstrate that the proposed system delivers reliable and stable performance, significantly reducing estimation bias of three scaling parameters, with root mean square error reductions of 43.16 %–56.97 % compared to MOST, and outperforming the best single machine learning model with additional error reductions of 4.24 %–7.90 %.</div></div>","PeriodicalId":8600,"journal":{"name":"Atmospheric Research","volume":"316 ","pages":"Article 107952"},"PeriodicalIF":4.4000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmospheric Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169809525000444","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/26 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

The turbulent exchanges between the land surface and atmosphere, crucial for global climate change and atmospheric circulation, are typically represented through bulk formulae based on Monin-Obukhov similarity theory (MOST), using simple regression as a function of the non-dimensional stability parameter derived from limited field experiments, which leaves large uncertainties. Recently, machine learning is anticipated as an alternative or complement to bulk algorithms, leveraging its ability to detect nonlinear relationships in large datasets without constraints from the similarity relationships and self-correlations prescribed in MOST. However, there are still unresolved problems and gaps, even though common models like random forest and neural networks can be directly applied. This study proposes a hybrid approach for improved estimation of surface turbulent flux, consisting of meta-learner estimation, interpretable model selection, and adaptive model integration. Motivated by understanding how different machine learning algorithms perform as surface-layer flux estimators and further exploring how to utilize results from multiple meta-learners for better estimations, the method starts with eight different machine learning algorithms. Then, a combination of Elastic Net and Shapley Additive Explanations is developed as an interpretable model selection module, followed by an adaptive model integration using AdaBoost and extreme learning machine. Experiments at the continuous observation station in the Horqin Sandy Land area, Inner Mongolia, China, demonstrate that the proposed system delivers reliable and stable performance, significantly reducing estimation bias of three scaling parameters, with root mean square error reductions of 43.16 %–56.97 % compared to MOST, and outperforming the best single machine learning model with additional error reductions of 4.24 %–7.90 %.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于可解释模型选择和自适应集成算法的科尔沁沙地地表湍流通量改进机器学习估计
地表和大气之间的湍流交换对全球气候变化和大气环流至关重要,通常通过基于Monin-Obukhov相似理论(MOST)的大量公式来表示,使用简单回归作为有限现场试验得出的无量纲稳定性参数的函数,这留下了很大的不确定性。最近,机器学习有望作为批量算法的替代或补充,利用其在大型数据集中检测非线性关系的能力,而不受MOST中规定的相似关系和自相关性的约束。然而,即使随机森林和神经网络等常见模型可以直接应用,仍然存在未解决的问题和差距。本研究提出了一种基于元学习者估计、可解释模型选择和自适应模型集成的混合方法来改进地表湍流通量的估计。通过了解不同的机器学习算法如何作为表层通量估计器,并进一步探索如何利用多个元学习器的结果进行更好的估计,该方法从八种不同的机器学习算法开始。然后,结合Elastic Net和Shapley Additive Explanations开发可解释模型选择模块,然后使用AdaBoost和极限学习机进行自适应模型集成。在内蒙古科尔沁沙地地区连续观测站进行的实验表明,该系统具有可靠稳定的性能,显著降低了三个尺度参数的估计偏差,与MOST相比,均方根误差降低了43.16% ~ 56.97%,优于最佳的单一机器学习模型,误差降低了4.24% ~ 7.90%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Atmospheric Research
Atmospheric Research 地学-气象与大气科学
CiteScore
9.40
自引率
10.90%
发文量
460
审稿时长
47 days
期刊介绍: The journal publishes scientific papers (research papers, review articles, letters and notes) dealing with the part of the atmosphere where meteorological events occur. Attention is given to all processes extending from the earth surface to the tropopause, but special emphasis continues to be devoted to the physics of clouds, mesoscale meteorology and air pollution, i.e. atmospheric aerosols; microphysical processes; cloud dynamics and thermodynamics; numerical simulation, climatology, climate change and weather modification.
期刊最新文献
Characteristics of summer typhoon activities over China during 2020–2022 and their relationships with the Triple La Niña events The June 2024 middle east compound heatwave: Dynamical drivers and AI-weather forecast models' evaluation ENSO-like anomalies alter the diurnal pattern of seasonal rainfall in the Garúa-dominated highlands of the Galápagos archipelago Improving regional precipitation simulation over South China: a comparative evaluation of NEX-GDDP-CMIP6 and CMIP6 models Summer diurnal variation of precipitation and regional precipitation events over the steep eastern slope of the Tibetan Plateau
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1