Multi-Model Machine Learning Approach Accurately Predicts Lake Dissolved Oxygen With Multiple Environmental Inputs

IF 2.9 3区 地球科学 Q2 ASTRONOMY & ASTROPHYSICS Earth and Space Science Pub Date : 2024-07-18 DOI:10.1029/2023EA003473
Shuqi Lin, Donald C. Pierson, Robert Ladwig, Benjamin M. Kraemer, Fenjuan R. S. Hu
{"title":"Multi-Model Machine Learning Approach Accurately Predicts Lake Dissolved Oxygen With Multiple Environmental Inputs","authors":"Shuqi Lin,&nbsp;Donald C. Pierson,&nbsp;Robert Ladwig,&nbsp;Benjamin M. Kraemer,&nbsp;Fenjuan R. S. Hu","doi":"10.1029/2023EA003473","DOIUrl":null,"url":null,"abstract":"<p>As a key water quality parameter, dissolved oxygen (DO) concentration, and particularly changes in bottom water DO is fundamental for understanding the biogeochemical processes in lake ecosystems. Based on two machine learning (ML) models, Gradient Boost Regressor (GBR) and long-short-term-memory (LSTM) network, this study developed three ML model approaches: direct GBR; direct LSTM; and a 2-step mixed ML model workflow combining both GBR and LSTM. They were used to simulate multi-year surface and bottom DO concentrations in five lakes. All approaches were trained with readily available environmental data as predictors. Indices of lake thermal structure and mixing provided by a one-dimensional (1-D) hydrodynamic model were also included as predictors in the ML models. The advantages of each ML approach were not consistent for all the tested lakes, but the best one of them was defined that can estimate DO concentration with coefficient of determination (<i>R</i><sup>2</sup>) up to 0.6–0.7 in each lake. All three approaches have normalized mean absolute error (NMAE) under 0.15. In a polymictic lake, the 2-step mixed model workflow showed better representation of bottom DO concentrations, with a highest true positive rate (TPR) of hypolimnetic hypoxia detection of over 90%, while the other workflows resulted in, TPRs are around 50%. In most of the tested lakes, the predicted surface DO concentrations and variables indicating stratified conditions (i.e., Wedderburn number and the temperature difference between surface and bottom water) are essential for simulating bottom DO. The ML approaches showed promising results and could be used to support short- and long-term water management plans.</p>","PeriodicalId":54286,"journal":{"name":"Earth and Space Science","volume":null,"pages":null},"PeriodicalIF":2.9000,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1029/2023EA003473","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth and Space Science","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1029/2023EA003473","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

As a key water quality parameter, dissolved oxygen (DO) concentration, and particularly changes in bottom water DO is fundamental for understanding the biogeochemical processes in lake ecosystems. Based on two machine learning (ML) models, Gradient Boost Regressor (GBR) and long-short-term-memory (LSTM) network, this study developed three ML model approaches: direct GBR; direct LSTM; and a 2-step mixed ML model workflow combining both GBR and LSTM. They were used to simulate multi-year surface and bottom DO concentrations in five lakes. All approaches were trained with readily available environmental data as predictors. Indices of lake thermal structure and mixing provided by a one-dimensional (1-D) hydrodynamic model were also included as predictors in the ML models. The advantages of each ML approach were not consistent for all the tested lakes, but the best one of them was defined that can estimate DO concentration with coefficient of determination (R2) up to 0.6–0.7 in each lake. All three approaches have normalized mean absolute error (NMAE) under 0.15. In a polymictic lake, the 2-step mixed model workflow showed better representation of bottom DO concentrations, with a highest true positive rate (TPR) of hypolimnetic hypoxia detection of over 90%, while the other workflows resulted in, TPRs are around 50%. In most of the tested lakes, the predicted surface DO concentrations and variables indicating stratified conditions (i.e., Wedderburn number and the temperature difference between surface and bottom water) are essential for simulating bottom DO. The ML approaches showed promising results and could be used to support short- and long-term water management plans.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多模型机器学习方法利用多种环境输入准确预测湖泊溶解氧
溶解氧(DO)浓度,尤其是底层水溶解氧的变化是了解湖泊生态系统生物地球化学过程的关键水质参数。基于梯度提升回归模型(GBR)和长短期记忆网络(LSTM)这两种机器学习(ML)模型,本研究开发了三种 ML 模型方法:直接 GBR;直接 LSTM;以及结合 GBR 和 LSTM 的两步混合 ML 模型工作流。这些方法被用于模拟五个湖泊的多年表层和底层溶解氧浓度。所有方法都使用现成的环境数据作为预测因子进行训练。由一维(1-D)水动力模型提供的湖泊热结构和混合指数也作为预测因子纳入了 ML 模型。在所有测试的湖泊中,每种 ML 方法的优势并不一致,但其中最好的一种方法可以估算出每个湖泊的溶解氧浓度,其判定系数(R2)可达 0.6-0.7。所有三种方法的归一化平均绝对误差(NMAE)均小于 0.15。在一个多水体湖泊中,两步混合模型工作流程能更好地反映湖底溶解氧浓度,下沉缺氧检测的最高真阳性率(TPR)超过 90%,而其他工作流程的真阳性率约为 50%。在大多数测试湖泊中,预测的表层溶解氧浓度和表明分层条件的变量(即 Wedderburn 数和表层与底层水之间的温差)对于模拟底层溶解氧至关重要。ML 方法显示出良好的效果,可用于支持短期和长期的水管理计划。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Earth and Space Science
Earth and Space Science Earth and Planetary Sciences-General Earth and Planetary Sciences
CiteScore
5.50
自引率
3.20%
发文量
285
审稿时长
19 weeks
期刊介绍: Marking AGU’s second new open access journal in the last 12 months, Earth and Space Science is the only journal that reflects the expansive range of science represented by AGU’s 62,000 members, including all of the Earth, planetary, and space sciences, and related fields in environmental science, geoengineering, space engineering, and biogeochemistry.
期刊最新文献
Near Real-Time In Situ Monitoring of Nearshore Ocean Currents Using Distributed Acoustic Sensing on Submarine Fiber-Optic Cable Updated OMI Glyoxal Column Measurements Using Collection 4 Level 1B Radiances A Novel Surface Energy Balance Method for Thermal Inertia Studies of Terrestrial Analogs On the Love Numbers of an Andrade Planet The Benefits of Future Quantum Accelerometers for Satellite Gravimetry
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1