Evaluating deep learning architecture and data assimilation for improving water temperature forecasts at unmonitored locations

IF 2.6 Q2 WATER RESOURCES Frontiers in Water Pub Date : 2023-06-23 DOI:10.3389/frwa.2023.1184992

J. Zwart, Jeremy Diaz, Scott D. Hamshaw, S. Oliver, Jesse C. Ross, Margaux Sleckman, A. Appling, Hayley Corson-Dosch, X. Jia, J. Read, J. Sadler, Theodore Thompson, David W. Watkins, Elaheh White

{"title":"Evaluating deep learning architecture and data assimilation for improving water temperature forecasts at unmonitored locations","authors":"J. Zwart, Jeremy Diaz, Scott D. Hamshaw, S. Oliver, Jesse C. Ross, Margaux Sleckman, A. Appling, Hayley Corson-Dosch, X. Jia, J. Read, J. Sadler, Theodore Thompson, David W. Watkins, Elaheh White","doi":"10.3389/frwa.2023.1184992","DOIUrl":null,"url":null,"abstract":"Deep learning (DL) models are increasingly used to forecast water quality variables for use in decision making. Ingesting recent observations of the forecasted variable has been shown to greatly increase model performance at monitored locations; however, observations are not collected at all locations, and methods are not yet well developed for DL models for optimally ingesting recent observations from other sites to inform focal sites. In this paper, we evaluate two different DL model structures, a long short-term memory neural network (LSTM) and a recurrent graph convolutional neural network (RGCN), both with and without data assimilation for forecasting daily maximum stream temperature 7 days into the future at monitored and unmonitored locations in a 70-segment stream network. All our DL models performed well when forecasting stream temperature as the root mean squared error (RMSE) across all models ranged from 2.03 to 2.11°C for 1-day lead times in the validation period, with substantially better performance at gaged locations (RMSE = 1.45–1.52°C) compared to ungaged locations (RMSE = 3.18–3.27°C). Forecast uncertainty characterization was near-perfect for gaged locations but all DL models were overconfident (i.e., uncertainty bounds too narrow) for ungaged locations. Our results show that the RGCN with data assimilation performed best for ungaged locations and especially at higher temperatures (>18°C) which is important for management decisions in our study location. This indicates that the networked model structure and data assimilation techniques may help borrow information from nearby monitored sites to improve forecasts at unmonitored locations. Results from this study can help guide DL modeling decisions when forecasting other important environmental variables.","PeriodicalId":33801,"journal":{"name":"Frontiers in Water","volume":" ","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2023-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Water","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frwa.2023.1184992","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"WATER RESOURCES","Score":null,"Total":0}

引用次数: 0

Abstract

Deep learning (DL) models are increasingly used to forecast water quality variables for use in decision making. Ingesting recent observations of the forecasted variable has been shown to greatly increase model performance at monitored locations; however, observations are not collected at all locations, and methods are not yet well developed for DL models for optimally ingesting recent observations from other sites to inform focal sites. In this paper, we evaluate two different DL model structures, a long short-term memory neural network (LSTM) and a recurrent graph convolutional neural network (RGCN), both with and without data assimilation for forecasting daily maximum stream temperature 7 days into the future at monitored and unmonitored locations in a 70-segment stream network. All our DL models performed well when forecasting stream temperature as the root mean squared error (RMSE) across all models ranged from 2.03 to 2.11°C for 1-day lead times in the validation period, with substantially better performance at gaged locations (RMSE = 1.45–1.52°C) compared to ungaged locations (RMSE = 3.18–3.27°C). Forecast uncertainty characterization was near-perfect for gaged locations but all DL models were overconfident (i.e., uncertainty bounds too narrow) for ungaged locations. Our results show that the RGCN with data assimilation performed best for ungaged locations and especially at higher temperatures (>18°C) which is important for management decisions in our study location. This indicates that the networked model structure and data assimilation techniques may help borrow information from nearby monitored sites to improve forecasts at unmonitored locations. Results from this study can help guide DL modeling decisions when forecasting other important environmental variables.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

评估深度学习架构和数据同化，以改进未监测地点的水温预测

深度学习（DL）模型越来越多地用于预测水质变量，以用于决策。采用预测变量的最新观测结果已被证明可以大大提高监测位置的模型性能；然而，并不是在所有地点都收集到观测结果，DL模型也没有很好地开发出方法，以最佳地吸收来自其他地点的最新观测结果，从而为焦点地点提供信息。在本文中，我们评估了两种不同的DL模型结构，即长短期记忆神经网络（LSTM）和递归图卷积神经网络（RGCN），无论是否进行数据同化，都可以预测70段流网络中监测和未监测位置未来7天的日最高流温。我们所有的DL模型在预测流温度时都表现良好，因为在验证期内的1天交付周期内，所有模型的均方根误差（RMSE）在2.03至2.11°C之间，与未充气位置（RMSE=3.18–3.27°C）相比，在充气位置（均方根误差=1.45–1.52°C）的性能明显更好。对于充气位置，预测不确定性特征几乎是完美的，但对于未充气位置，所有DL模型都过于自信（即不确定性界限太窄）。我们的研究结果表明，具有数据同化的RGCN在未加标签的地点表现最好，尤其是在更高的温度（>18°C）下，这对我们研究地点的管理决策很重要。这表明，网络化的模型结构和数据同化技术可能有助于从附近的监测地点借用信息，以改进未监测地点的预测。这项研究的结果有助于在预测其他重要环境变量时指导DL建模决策。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊