Enhanced prediction of river dissolved oxygen through feature- and model-based transfer learning.

IF 8 2区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES Journal of Environmental Management Pub Date : 2024-11-20 DOI:10.1016/j.jenvman.2024.123310
Xinlin Chen, Wei Sun, Tao Jiang, Hong Ju
{"title":"Enhanced prediction of river dissolved oxygen through feature- and model-based transfer learning.","authors":"Xinlin Chen, Wei Sun, Tao Jiang, Hong Ju","doi":"10.1016/j.jenvman.2024.123310","DOIUrl":null,"url":null,"abstract":"<p><p>Water quality monitoring data from various points within the same basin often show non-uniformity. A key scientific question is how to extract relevant knowledge from data-rich sites (source domains) and leverage the possible inter-site consistency of water quality to compensate for the limitations of data-poor sites (target domains). Transfer learning (TL) methods can improve the applicability of water quality predictions for data-poor sites but their comparison and combination have not been fully explored. This study employs feature-based (Transfer Component Analysis, TCA) and model-based (pretraining and fine-tuning) transfer learning, to assist in constructing Long Short-Term Memory (LSTM) models for forecasting the dissolved oxygen (DO) levels in the West Channel of Guangzhou, southern coastal China. The LSTM models at Yagang and Shimen stations were constructed as the basic and baseline models for source and target domains, respectively. By comparing and selecting different transfer learning strategies, the best single-type TL strategy emerged as a multi-sequence LSTM model without TCA but with the fully connected layer frozen after pretraining. It achieved increases in validation Nash efficiency coefficient (NSE) of 5.2%, 10.8%, and 46.2% for predicting DO over the next 3 days, respectively, compared to the baseline LSTM model at Shimen station. The best combined TL strategy involved using TCA and freezing the second fully connected layer in a multi-sequence LSTM model. It improved upon the baseline LSTM model with a validation NSE increase of 5.3%, 21.4%, and 48.7% over the next three days, respectively. This study demonstrates that combining feature- and model-based transfer learning methods can yield better DO prediction performance in data-poor rivers than using a single-type transfer learning method.</p>","PeriodicalId":356,"journal":{"name":"Journal of Environmental Management","volume":"372 ","pages":"123310"},"PeriodicalIF":8.0000,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Environmental Management","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.jenvman.2024.123310","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Water quality monitoring data from various points within the same basin often show non-uniformity. A key scientific question is how to extract relevant knowledge from data-rich sites (source domains) and leverage the possible inter-site consistency of water quality to compensate for the limitations of data-poor sites (target domains). Transfer learning (TL) methods can improve the applicability of water quality predictions for data-poor sites but their comparison and combination have not been fully explored. This study employs feature-based (Transfer Component Analysis, TCA) and model-based (pretraining and fine-tuning) transfer learning, to assist in constructing Long Short-Term Memory (LSTM) models for forecasting the dissolved oxygen (DO) levels in the West Channel of Guangzhou, southern coastal China. The LSTM models at Yagang and Shimen stations were constructed as the basic and baseline models for source and target domains, respectively. By comparing and selecting different transfer learning strategies, the best single-type TL strategy emerged as a multi-sequence LSTM model without TCA but with the fully connected layer frozen after pretraining. It achieved increases in validation Nash efficiency coefficient (NSE) of 5.2%, 10.8%, and 46.2% for predicting DO over the next 3 days, respectively, compared to the baseline LSTM model at Shimen station. The best combined TL strategy involved using TCA and freezing the second fully connected layer in a multi-sequence LSTM model. It improved upon the baseline LSTM model with a validation NSE increase of 5.3%, 21.4%, and 48.7% over the next three days, respectively. This study demonstrates that combining feature- and model-based transfer learning methods can yield better DO prediction performance in data-poor rivers than using a single-type transfer learning method.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过基于特征和模型的迁移学习增强河流溶解氧的预测。
同一流域内不同点的水质监测数据往往不一致。一个关键的科学问题是,如何从数据丰富的站点(源域)提取相关知识,并利用站点间可能存在的水质一致性来弥补数据贫乏站点(目标域)的局限性。迁移学习(TL)方法可以提高数据贫乏站点的水质预测适用性,但它们之间的比较和组合尚未得到充分探索。本研究采用了基于特征(迁移成分分析,TCA)和基于模型(预训练和微调)的迁移学习,帮助构建长短期记忆(LSTM)模型,用于预测中国南部沿海广州西航道的溶解氧(DO)水平。亚岗站和石门站的 LSTM 模型分别作为源域和目标域的基本模型和基线模型。通过比较和选择不同的迁移学习策略,最佳的单一类型迁移学习策略是不带 TCA 的多序列 LSTM 模型,但在预训练后冻结了全连接层。与石门站的基线 LSTM 模型相比,该模型预测未来 3 天溶解氧的验证纳什效率系数(NSE)分别提高了 5.2%、10.8% 和 46.2%。最佳的 TL 组合策略包括在多序列 LSTM 模型中使用 TCA 和冻结第二个全连接层。与基线 LSTM 模型相比,在接下来的三天中,该模型的验证 NSE 分别提高了 5.3%、21.4% 和 48.7%。这项研究表明,与使用单一类型的迁移学习方法相比,结合基于特征和基于模型的迁移学习方法可以在数据匮乏的河流中获得更好的溶解氧预测性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Environmental Management
Journal of Environmental Management 环境科学-环境科学
CiteScore
13.70
自引率
5.70%
发文量
2477
审稿时长
84 days
期刊介绍: The Journal of Environmental Management is a journal for the publication of peer reviewed, original research for all aspects of management and the managed use of the environment, both natural and man-made.Critical review articles are also welcome; submission of these is strongly encouraged.
期刊最新文献
The farmgate phosphorus balance as a measure to achieve river and lake water quality targets. A conceptual framework to inform conservation status assessments of non-charismatic species. A mouse in the spotlight: Response capacity to artificial light at night in a rodent pest species, the southern multimammate mouse (Mastomys coucha). Application of advance oxidation processes for elimination of carbamazepine residues in soils. Changes in soil inorganic carbon following vegetation restoration in the cropland on the Loess Plateau in China: A meta-analysis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1