A compound approach for ten-day runoff prediction by coupling wavelet denoising, attention mechanism, and LSTM based on GPU parallel acceleration technology

IF 2.7 4区地球科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Earth Science Informatics Pub Date : 2024-01-10 DOI:10.1007/s12145-023-01212-3

Yi-yang Wang, Wen-chuan Wang, Dong-mei Xu, Yan-wei Zhao, Hong-fei Zang

{"title":"A compound approach for ten-day runoff prediction by coupling wavelet denoising, attention mechanism, and LSTM based on GPU parallel acceleration technology","authors":"Yi-yang Wang, Wen-chuan Wang, Dong-mei Xu, Yan-wei Zhao, Hong-fei Zang","doi":"10.1007/s12145-023-01212-3","DOIUrl":null,"url":null,"abstract":"<p>Deep learning models have a high application value in runoff forecasting, but their prediction mechanism is difficult to interpret and their computational cost is high when dealing with complex hydrological relationships, limiting their feasibility in hydrological process mechanism analysis. To address these concerns, the paper first introduces an attention mechanism (AM) for building a long short-term memory network (LSTM) model with AM in the hidden layer (AM-LSTM). The AM-LSTM model employs attention layers to improve information extraction from hidden layers, resulting in a more accurate representation of the relationships between runoff-related elements. Furthermore, in the hidden layers of the AM-LSTM model, interpretable spatiotemporal attention units are established, which not only improves the model's prediction accuracy but also provides interpretability to the forecasting process. Furthermore, parallelization techniques are used in the paper to address the issue of model runtime cost. Simultaneously, to address the accuracy degradation caused by parallelization, the paper employs wavelet denoising (WD) techniques and builds the WD-AM-LSTM model. This accomplishment enables the runoff forecasting model to predict runoff in real time and with high accuracy. Based on validation using ten-day runoff data from the Huanren Reservoir in the Hun River's middle reaches, the results show that, with two layers and an eight-batch size, the AM-LSTM model outperforms the LSTM model in capturing spatiotemporal runoff features. During the model testing phase, the AM-LSTM model improves the MAE, RMSE, and NSE performance metrics by 8.46%, 13%, and 3.82%, respectively. The WD-AM-LSTM model effectively mitigates the noise impact caused by data parallelization under the conditions of two layers and a batch size of 512, achieving the same level of prediction performance while reducing computational cost by 92.01%. By incorporating attention mechanisms and wavelet denoising techniques, this study obtains high-speed and accurate predictions with interpretable results. It expands the deep learning models' applicability in ten-day runoff forecasting work.</p>","PeriodicalId":49318,"journal":{"name":"Earth Science Informatics","volume":"98 1","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2024-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth Science Informatics","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s12145-023-01212-3","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Deep learning models have a high application value in runoff forecasting, but their prediction mechanism is difficult to interpret and their computational cost is high when dealing with complex hydrological relationships, limiting their feasibility in hydrological process mechanism analysis. To address these concerns, the paper first introduces an attention mechanism (AM) for building a long short-term memory network (LSTM) model with AM in the hidden layer (AM-LSTM). The AM-LSTM model employs attention layers to improve information extraction from hidden layers, resulting in a more accurate representation of the relationships between runoff-related elements. Furthermore, in the hidden layers of the AM-LSTM model, interpretable spatiotemporal attention units are established, which not only improves the model's prediction accuracy but also provides interpretability to the forecasting process. Furthermore, parallelization techniques are used in the paper to address the issue of model runtime cost. Simultaneously, to address the accuracy degradation caused by parallelization, the paper employs wavelet denoising (WD) techniques and builds the WD-AM-LSTM model. This accomplishment enables the runoff forecasting model to predict runoff in real time and with high accuracy. Based on validation using ten-day runoff data from the Huanren Reservoir in the Hun River's middle reaches, the results show that, with two layers and an eight-batch size, the AM-LSTM model outperforms the LSTM model in capturing spatiotemporal runoff features. During the model testing phase, the AM-LSTM model improves the MAE, RMSE, and NSE performance metrics by 8.46%, 13%, and 3.82%, respectively. The WD-AM-LSTM model effectively mitigates the noise impact caused by data parallelization under the conditions of two layers and a batch size of 512, achieving the same level of prediction performance while reducing computational cost by 92.01%. By incorporating attention mechanisms and wavelet denoising techniques, this study obtains high-speed and accurate predictions with interpretable results. It expands the deep learning models' applicability in ten-day runoff forecasting work.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于 GPU 并行加速技术，将小波去噪、注意力机制和 LSTM 相结合的旬径流预测复合方法

深度学习模型在径流预报中具有较高的应用价值，但其预测机理难以解释，在处理复杂水文关系时计算成本较高，限制了其在水文过程机理分析中的可行性。为了解决这些问题，本文首先介绍了一种注意力机制（AM），用于建立一个在隐层中带有 AM 的长短期记忆网络（LSTM）模型（AM-LSTM）。AM-LSTM 模型采用注意力层来改进隐藏层的信息提取，从而更准确地表示与径流相关的元素之间的关系。此外，在 AM-LSTM 模型的隐藏层中，建立了可解释的时空注意单元，这不仅提高了模型的预测精度，还为预测过程提供了可解释性。此外，本文还采用了并行化技术来解决模型运行成本问题。同时，为了解决并行化带来的精度下降问题，本文采用了小波去噪（WD）技术，并建立了 WD-AM-LSTM 模型。这一成果使得径流预报模型能够实时、高精度地预测径流。基于浑河中游桓仁水库十天径流数据的验证结果表明，在两层和八批次规模下，AM-LSTM 模型在捕捉径流时空特征方面优于 LSTM 模型。在模型测试阶段，AM-LSTM 模型的 MAE、RMSE 和 NSE 性能指标分别提高了 8.46%、13% 和 3.82%。WD-AM-LSTM 模型在两层和批量大小为 512 的条件下，有效地减轻了数据并行化带来的噪声影响，在达到相同预测性能水平的同时降低了 92.01% 的计算成本。通过结合注意力机制和小波去噪技术，本研究获得了高速、准确、可解释的预测结果。它拓展了深度学习模型在旬径流预报工作中的适用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Earth Science Informatics COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-GEOSCIENCES, MULTIDISCIPLINARY

CiteScore

4.60

自引率

3.60%

发文量

157

审稿时长

4.3 months

期刊介绍： The Earth Science Informatics [ESIN] journal aims at rapid publication of high-quality, current, cutting-edge, and provocative scientific work in the area of Earth Science Informatics as it relates to Earth systems science and space science. This includes articles on the application of formal and computational methods, computational Earth science, spatial and temporal analyses, and all aspects of computer applications to the acquisition, storage, processing, interchange, and visualization of data and information about the materials, properties, processes, features, and phenomena that occur at all scales and locations in the Earth system’s five components (atmosphere, hydrosphere, geosphere, biosphere, cryosphere) and in space (see "About this journal" for more detail). The quarterly journal publishes research, methodology, and software articles, as well as editorials, comments, and book and software reviews. Review articles of relevant findings, topics, and methodologies are also considered.