Enhanced prediction of river dissolved oxygen through feature- and model-based transfer learning.

IF 8 2区环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES Journal of Environmental Management Pub Date : 2024-11-20 DOI:10.1016/j.jenvman.2024.123310

Xinlin Chen, Wei Sun, Tao Jiang, Hong Ju

{"title":"Enhanced prediction of river dissolved oxygen through feature- and model-based transfer learning.","authors":"Xinlin Chen, Wei Sun, Tao Jiang, Hong Ju","doi":"10.1016/j.jenvman.2024.123310","DOIUrl":null,"url":null,"abstract":"<p><p>Water quality monitoring data from various points within the same basin often show non-uniformity. A key scientific question is how to extract relevant knowledge from data-rich sites (source domains) and leverage the possible inter-site consistency of water quality to compensate for the limitations of data-poor sites (target domains). Transfer learning (TL) methods can improve the applicability of water quality predictions for data-poor sites but their comparison and combination have not been fully explored. This study employs feature-based (Transfer Component Analysis, TCA) and model-based (pretraining and fine-tuning) transfer learning, to assist in constructing Long Short-Term Memory (LSTM) models for forecasting the dissolved oxygen (DO) levels in the West Channel of Guangzhou, southern coastal China. The LSTM models at Yagang and Shimen stations were constructed as the basic and baseline models for source and target domains, respectively. By comparing and selecting different transfer learning strategies, the best single-type TL strategy emerged as a multi-sequence LSTM model without TCA but with the fully connected layer frozen after pretraining. It achieved increases in validation Nash efficiency coefficient (NSE) of 5.2%, 10.8%, and 46.2% for predicting DO over the next 3 days, respectively, compared to the baseline LSTM model at Shimen station. The best combined TL strategy involved using TCA and freezing the second fully connected layer in a multi-sequence LSTM model. It improved upon the baseline LSTM model with a validation NSE increase of 5.3%, 21.4%, and 48.7% over the next three days, respectively. This study demonstrates that combining feature- and model-based transfer learning methods can yield better DO prediction performance in data-poor rivers than using a single-type transfer learning method.</p>","PeriodicalId":356,"journal":{"name":"Journal of Environmental Management","volume":"372 ","pages":"123310"},"PeriodicalIF":8.0000,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Environmental Management","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.jenvman.2024.123310","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

Water quality monitoring data from various points within the same basin often show non-uniformity. A key scientific question is how to extract relevant knowledge from data-rich sites (source domains) and leverage the possible inter-site consistency of water quality to compensate for the limitations of data-poor sites (target domains). Transfer learning (TL) methods can improve the applicability of water quality predictions for data-poor sites but their comparison and combination have not been fully explored. This study employs feature-based (Transfer Component Analysis, TCA) and model-based (pretraining and fine-tuning) transfer learning, to assist in constructing Long Short-Term Memory (LSTM) models for forecasting the dissolved oxygen (DO) levels in the West Channel of Guangzhou, southern coastal China. The LSTM models at Yagang and Shimen stations were constructed as the basic and baseline models for source and target domains, respectively. By comparing and selecting different transfer learning strategies, the best single-type TL strategy emerged as a multi-sequence LSTM model without TCA but with the fully connected layer frozen after pretraining. It achieved increases in validation Nash efficiency coefficient (NSE) of 5.2%, 10.8%, and 46.2% for predicting DO over the next 3 days, respectively, compared to the baseline LSTM model at Shimen station. The best combined TL strategy involved using TCA and freezing the second fully connected layer in a multi-sequence LSTM model. It improved upon the baseline LSTM model with a validation NSE increase of 5.3%, 21.4%, and 48.7% over the next three days, respectively. This study demonstrates that combining feature- and model-based transfer learning methods can yield better DO prediction performance in data-poor rivers than using a single-type transfer learning method.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过基于特征和模型的迁移学习增强河流溶解氧的预测。

同一流域内不同点的水质监测数据往往不一致。一个关键的科学问题是，如何从数据丰富的站点（源域）提取相关知识，并利用站点间可能存在的水质一致性来弥补数据贫乏站点（目标域）的局限性。迁移学习（TL）方法可以提高数据贫乏站点的水质预测适用性，但它们之间的比较和组合尚未得到充分探索。本研究采用了基于特征（迁移成分分析，TCA）和基于模型（预训练和微调）的迁移学习，帮助构建长短期记忆（LSTM）模型，用于预测中国南部沿海广州西航道的溶解氧（DO）水平。亚岗站和石门站的 LSTM 模型分别作为源域和目标域的基本模型和基线模型。通过比较和选择不同的迁移学习策略，最佳的单一类型迁移学习策略是不带 TCA 的多序列 LSTM 模型，但在预训练后冻结了全连接层。与石门站的基线 LSTM 模型相比，该模型预测未来 3 天溶解氧的验证纳什效率系数（NSE）分别提高了 5.2%、10.8% 和 46.2%。最佳的 TL 组合策略包括在多序列 LSTM 模型中使用 TCA 和冻结第二个全连接层。与基线 LSTM 模型相比，在接下来的三天中，该模型的验证 NSE 分别提高了 5.3%、21.4% 和 48.7%。这项研究表明，与使用单一类型的迁移学习方法相比，结合基于特征和基于模型的迁移学习方法可以在数据匮乏的河流中获得更好的溶解氧预测性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Environmental Management 环境科学-环境科学

CiteScore

13.70

自引率

5.70%

发文量

2477

审稿时长

84 days

期刊介绍： The Journal of Environmental Management is a journal for the publication of peer reviewed, original research for all aspects of management and the managed use of the environment, both natural and man-made.Critical review articles are also welcome; submission of these is strongly encouraged.