LSTM-based data integration to improve snow water equivalent prediction and diagnose error sources

IF 2.9 3区地球科学 Q2 METEOROLOGY & ATMOSPHERIC SCIENCES Journal of Hydrometeorology Pub Date : 2023-10-30 DOI:10.1175/jhm-d-22-0220.1

Yalan Song, Wen-Ping Tsai, Jonah Gluck, Alan Rhoades, Colin Zarzycki, Rachel McCrary, Kathryn Lawson, Chaopeng Shen

{"title":"LSTM-based data integration to improve snow water equivalent prediction and diagnose error sources","authors":"Yalan Song, Wen-Ping Tsai, Jonah Gluck, Alan Rhoades, Colin Zarzycki, Rachel McCrary, Kathryn Lawson, Chaopeng Shen","doi":"10.1175/jhm-d-22-0220.1","DOIUrl":null,"url":null,"abstract":"Abstract Accurate prediction of snow water equivalent (SWE) can be valuable for water resource managers. Recently, deep learning methods such as long short-term memory (LSTM) have exhibited high accuracy in simulating hydrologic variables and can integrate lagged observations to improve prediction, but their benefits were not clear for SWE simulations. Here we tested an LSTM network with data integration (DI) for SWE in the western US to integrate 30-day-lagged or 7-day-lagged observations of either SWE or satellite-observed snow cover fraction (SCF) to improve future predictions. SCF proved beneficial only for shallow-snow sites during snowmelt, while lagged SWE integration significantly improved prediction accuracy for both shallow-and deep-snow sites. The median Nash-Sutcliffe model efficiency coefficient (NSE) in temporal testing improved from 0.92 to 0.97 with 30-day-lagged SWE integration, and root-mean-square error (RMSE) and the difference between estimated and observed peak SWE values ( d max ) were reduced by 41% and 57%, respectively. DI effectively mitigated accumulated model and forcing errors which would otherwise be persistent. Moreover, by applying DI to different observations (30-day-lagged, 7-day-lagged), we revealed the spatial distribution of errors with different persistent lengths. For example, integrating 30-day-lagged SWE was ineffective for ephemeral snow sites in the southwestern US, but significantly reduced monthly-scale biases for regions with stable seasonal snowpack such as high elevation sites in California. These biases are likely attributable to large interannual variability in snowfall or site-specific snow redistribution patterns that can accumulate to impactful levels over time for non-ephemeral sites. These results set up benchmark levels and provide guidance for future model improvement strategies.","PeriodicalId":15962,"journal":{"name":"Journal of Hydrometeorology","volume":"31 6","pages":"0"},"PeriodicalIF":2.9000,"publicationDate":"2023-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hydrometeorology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1175/jhm-d-22-0220.1","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract Accurate prediction of snow water equivalent (SWE) can be valuable for water resource managers. Recently, deep learning methods such as long short-term memory (LSTM) have exhibited high accuracy in simulating hydrologic variables and can integrate lagged observations to improve prediction, but their benefits were not clear for SWE simulations. Here we tested an LSTM network with data integration (DI) for SWE in the western US to integrate 30-day-lagged or 7-day-lagged observations of either SWE or satellite-observed snow cover fraction (SCF) to improve future predictions. SCF proved beneficial only for shallow-snow sites during snowmelt, while lagged SWE integration significantly improved prediction accuracy for both shallow-and deep-snow sites. The median Nash-Sutcliffe model efficiency coefficient (NSE) in temporal testing improved from 0.92 to 0.97 with 30-day-lagged SWE integration, and root-mean-square error (RMSE) and the difference between estimated and observed peak SWE values ( d max ) were reduced by 41% and 57%, respectively. DI effectively mitigated accumulated model and forcing errors which would otherwise be persistent. Moreover, by applying DI to different observations (30-day-lagged, 7-day-lagged), we revealed the spatial distribution of errors with different persistent lengths. For example, integrating 30-day-lagged SWE was ineffective for ephemeral snow sites in the southwestern US, but significantly reduced monthly-scale biases for regions with stable seasonal snowpack such as high elevation sites in California. These biases are likely attributable to large interannual variability in snowfall or site-specific snow redistribution patterns that can accumulate to impactful levels over time for non-ephemeral sites. These results set up benchmark levels and provide guidance for future model improvement strategies.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于lstm的数据集成改进雪水当量预报和诊断误差源

摘要雪水当量(SWE)的准确预测对水资源管理具有重要意义。近年来，长短期记忆(LSTM)等深度学习方法在模拟水文变量方面表现出较高的准确性，并且可以整合滞后观测来改进预测，但它们对SWE模拟的好处尚不清楚。在这里，我们测试了一个具有数据集成(DI)的LSTM网络，用于美国西部的SWE，以整合30天或7天滞后的SWE或卫星观测的积雪覆盖率(SCF)观测数据，以改进未来的预测。在融雪过程中，SCF仅对浅雪点有利，而滞后的SWE集成显著提高了浅雪点和深雪点的预测精度。在时间检验中，纳什-苏特克里夫模型效率系数(NSE)中位数从0.92提高到0.97，平均误差(RMSE)和估计与观测的峰值SWE值(d max)之间的差值分别降低了41%和57%。DI有效地减轻了累积的模型和强制误差，否则这些误差将持续存在。此外，通过对不同观测值(滞后30天、滞后7天)应用DI，我们揭示了不同持续长度误差的空间分布。例如，整合30天滞后的SWE对美国西南部的短暂积雪地点无效，但对季节性积雪稳定的地区(如加利福尼亚州的高海拔地区)，显著降低了月尺度偏差。这些偏差可能是由于降雪的年际变化较大，或特定地点的雪再分配模式随着时间的推移可能累积到有影响的水平。这些结果建立了基准水平，并为未来的模型改进策略提供了指导。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Hydrometeorology 地学-气象与大气科学

CiteScore

7.40

自引率

5.30%

发文量

116

审稿时长

4-8 weeks

期刊介绍： The Journal of Hydrometeorology (JHM) (ISSN: 1525-755X; eISSN: 1525-7541) publishes research on modeling, observing, and forecasting processes related to fluxes and storage of water and energy, including interactions with the boundary layer and lower atmosphere, and processes related to precipitation, radiation, and other meteorological inputs.