A compound approach for ten-day runoff prediction by coupling wavelet denoising, attention mechanism, and LSTM based on GPU parallel acceleration technology
{"title":"A compound approach for ten-day runoff prediction by coupling wavelet denoising, attention mechanism, and LSTM based on GPU parallel acceleration technology","authors":"Yi-yang Wang, Wen-chuan Wang, Dong-mei Xu, Yan-wei Zhao, Hong-fei Zang","doi":"10.1007/s12145-023-01212-3","DOIUrl":null,"url":null,"abstract":"<p>Deep learning models have a high application value in runoff forecasting, but their prediction mechanism is difficult to interpret and their computational cost is high when dealing with complex hydrological relationships, limiting their feasibility in hydrological process mechanism analysis. To address these concerns, the paper first introduces an attention mechanism (AM) for building a long short-term memory network (LSTM) model with AM in the hidden layer (AM-LSTM). The AM-LSTM model employs attention layers to improve information extraction from hidden layers, resulting in a more accurate representation of the relationships between runoff-related elements. Furthermore, in the hidden layers of the AM-LSTM model, interpretable spatiotemporal attention units are established, which not only improves the model's prediction accuracy but also provides interpretability to the forecasting process. Furthermore, parallelization techniques are used in the paper to address the issue of model runtime cost. Simultaneously, to address the accuracy degradation caused by parallelization, the paper employs wavelet denoising (WD) techniques and builds the WD-AM-LSTM model. This accomplishment enables the runoff forecasting model to predict runoff in real time and with high accuracy. Based on validation using ten-day runoff data from the Huanren Reservoir in the Hun River's middle reaches, the results show that, with two layers and an eight-batch size, the AM-LSTM model outperforms the LSTM model in capturing spatiotemporal runoff features. During the model testing phase, the AM-LSTM model improves the MAE, RMSE, and NSE performance metrics by 8.46%, 13%, and 3.82%, respectively. The WD-AM-LSTM model effectively mitigates the noise impact caused by data parallelization under the conditions of two layers and a batch size of 512, achieving the same level of prediction performance while reducing computational cost by 92.01%. By incorporating attention mechanisms and wavelet denoising techniques, this study obtains high-speed and accurate predictions with interpretable results. It expands the deep learning models' applicability in ten-day runoff forecasting work.</p>","PeriodicalId":49318,"journal":{"name":"Earth Science Informatics","volume":"98 1","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2024-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth Science Informatics","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s12145-023-01212-3","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Deep learning models have a high application value in runoff forecasting, but their prediction mechanism is difficult to interpret and their computational cost is high when dealing with complex hydrological relationships, limiting their feasibility in hydrological process mechanism analysis. To address these concerns, the paper first introduces an attention mechanism (AM) for building a long short-term memory network (LSTM) model with AM in the hidden layer (AM-LSTM). The AM-LSTM model employs attention layers to improve information extraction from hidden layers, resulting in a more accurate representation of the relationships between runoff-related elements. Furthermore, in the hidden layers of the AM-LSTM model, interpretable spatiotemporal attention units are established, which not only improves the model's prediction accuracy but also provides interpretability to the forecasting process. Furthermore, parallelization techniques are used in the paper to address the issue of model runtime cost. Simultaneously, to address the accuracy degradation caused by parallelization, the paper employs wavelet denoising (WD) techniques and builds the WD-AM-LSTM model. This accomplishment enables the runoff forecasting model to predict runoff in real time and with high accuracy. Based on validation using ten-day runoff data from the Huanren Reservoir in the Hun River's middle reaches, the results show that, with two layers and an eight-batch size, the AM-LSTM model outperforms the LSTM model in capturing spatiotemporal runoff features. During the model testing phase, the AM-LSTM model improves the MAE, RMSE, and NSE performance metrics by 8.46%, 13%, and 3.82%, respectively. The WD-AM-LSTM model effectively mitigates the noise impact caused by data parallelization under the conditions of two layers and a batch size of 512, achieving the same level of prediction performance while reducing computational cost by 92.01%. By incorporating attention mechanisms and wavelet denoising techniques, this study obtains high-speed and accurate predictions with interpretable results. It expands the deep learning models' applicability in ten-day runoff forecasting work.
期刊介绍:
The Earth Science Informatics [ESIN] journal aims at rapid publication of high-quality, current, cutting-edge, and provocative scientific work in the area of Earth Science Informatics as it relates to Earth systems science and space science. This includes articles on the application of formal and computational methods, computational Earth science, spatial and temporal analyses, and all aspects of computer applications to the acquisition, storage, processing, interchange, and visualization of data and information about the materials, properties, processes, features, and phenomena that occur at all scales and locations in the Earth system’s five components (atmosphere, hydrosphere, geosphere, biosphere, cryosphere) and in space (see "About this journal" for more detail). The quarterly journal publishes research, methodology, and software articles, as well as editorials, comments, and book and software reviews. Review articles of relevant findings, topics, and methodologies are also considered.