Mohammad Reza M. Behbahani, Maryam Mazarei, Amvrossios C. Bagtzoglou
{"title":"通过评估新的小波预处理技术改进趋势变化条件下基于深度学习的流量预报","authors":"Mohammad Reza M. Behbahani, Maryam Mazarei, Amvrossios C. Bagtzoglou","doi":"10.1007/s00477-024-02788-y","DOIUrl":null,"url":null,"abstract":"<p>Accurate machine learning streamflow prediction often requires coupling data-driven models with preprocessing techniques. This study aims to improve the performance of deep learning (DL) models, including long short-term memory, recurrent neural network (RNN), and gated recurrent unit (GRU) by incorporating maximal overlap discrete wavelet entropy transform (MODWET) techniques for streamflow forecasting. The merit of MODWET over maximal overlap discrete wavelet transform (MODWT) is that MODWET utilizes Entropy to determine the optimal decomposition level and suitable wavelet function, which was an unaddressed problem in wavelet-based decomposition models. Suitable decomposition level prevents providing unnecessary information or missing essential information. In this study we show that a unique decomposition level and wavelet filter is not suitable for any dataset. The research focuses on monthly streamflow data from three case studies in the CAMEL dataset in the USA. The accuracy of the models is evaluated using statistical measures such as Nash–Sutcliffe efficiency (NSE), root-mean-squared error, percent bias, and correlation coefficient (r). To determine the optimal model, a Taylor diagram is utilized. The results demonstrate the effectiveness of coupling MODWET with DL models in flood forecasting. Furthermore, genetic programming (GP) and partial correlation index (PCI) are employed for predictor selection. Hybrid models, namely MODWET-GP-GRU (NSE of 0.83), MODWET-GP-RNN (NSE of 0.95), and MODWET-PCI-GRU (NSE of 0.95), outperform simple DL models in terms of NSE and Taylor diagram evaluation. This study emphasizes the potential of hybrid models that combine DL algorithms with the recently proposed MODWET technique for streamflow prediction.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"38 1","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving deep learning-based streamflow forecasting under trend varying conditions through evaluation of new wavelet preprocessing technique\",\"authors\":\"Mohammad Reza M. Behbahani, Maryam Mazarei, Amvrossios C. Bagtzoglou\",\"doi\":\"10.1007/s00477-024-02788-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Accurate machine learning streamflow prediction often requires coupling data-driven models with preprocessing techniques. This study aims to improve the performance of deep learning (DL) models, including long short-term memory, recurrent neural network (RNN), and gated recurrent unit (GRU) by incorporating maximal overlap discrete wavelet entropy transform (MODWET) techniques for streamflow forecasting. The merit of MODWET over maximal overlap discrete wavelet transform (MODWT) is that MODWET utilizes Entropy to determine the optimal decomposition level and suitable wavelet function, which was an unaddressed problem in wavelet-based decomposition models. Suitable decomposition level prevents providing unnecessary information or missing essential information. In this study we show that a unique decomposition level and wavelet filter is not suitable for any dataset. The research focuses on monthly streamflow data from three case studies in the CAMEL dataset in the USA. The accuracy of the models is evaluated using statistical measures such as Nash–Sutcliffe efficiency (NSE), root-mean-squared error, percent bias, and correlation coefficient (r). To determine the optimal model, a Taylor diagram is utilized. The results demonstrate the effectiveness of coupling MODWET with DL models in flood forecasting. Furthermore, genetic programming (GP) and partial correlation index (PCI) are employed for predictor selection. Hybrid models, namely MODWET-GP-GRU (NSE of 0.83), MODWET-GP-RNN (NSE of 0.95), and MODWET-PCI-GRU (NSE of 0.95), outperform simple DL models in terms of NSE and Taylor diagram evaluation. This study emphasizes the potential of hybrid models that combine DL algorithms with the recently proposed MODWET technique for streamflow prediction.</p>\",\"PeriodicalId\":21987,\"journal\":{\"name\":\"Stochastic Environmental Research and Risk Assessment\",\"volume\":\"38 1\",\"pages\":\"\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stochastic Environmental Research and Risk Assessment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1007/s00477-024-02788-y\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stochastic Environmental Research and Risk Assessment","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s00477-024-02788-y","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Improving deep learning-based streamflow forecasting under trend varying conditions through evaluation of new wavelet preprocessing technique
Accurate machine learning streamflow prediction often requires coupling data-driven models with preprocessing techniques. This study aims to improve the performance of deep learning (DL) models, including long short-term memory, recurrent neural network (RNN), and gated recurrent unit (GRU) by incorporating maximal overlap discrete wavelet entropy transform (MODWET) techniques for streamflow forecasting. The merit of MODWET over maximal overlap discrete wavelet transform (MODWT) is that MODWET utilizes Entropy to determine the optimal decomposition level and suitable wavelet function, which was an unaddressed problem in wavelet-based decomposition models. Suitable decomposition level prevents providing unnecessary information or missing essential information. In this study we show that a unique decomposition level and wavelet filter is not suitable for any dataset. The research focuses on monthly streamflow data from three case studies in the CAMEL dataset in the USA. The accuracy of the models is evaluated using statistical measures such as Nash–Sutcliffe efficiency (NSE), root-mean-squared error, percent bias, and correlation coefficient (r). To determine the optimal model, a Taylor diagram is utilized. The results demonstrate the effectiveness of coupling MODWET with DL models in flood forecasting. Furthermore, genetic programming (GP) and partial correlation index (PCI) are employed for predictor selection. Hybrid models, namely MODWET-GP-GRU (NSE of 0.83), MODWET-GP-RNN (NSE of 0.95), and MODWET-PCI-GRU (NSE of 0.95), outperform simple DL models in terms of NSE and Taylor diagram evaluation. This study emphasizes the potential of hybrid models that combine DL algorithms with the recently proposed MODWET technique for streamflow prediction.
期刊介绍:
Stochastic Environmental Research and Risk Assessment (SERRA) will publish research papers, reviews and technical notes on stochastic and probabilistic approaches to environmental sciences and engineering, including interactions of earth and atmospheric environments with people and ecosystems. The basic idea is to bring together research papers on stochastic modelling in various fields of environmental sciences and to provide an interdisciplinary forum for the exchange of ideas, for communicating on issues that cut across disciplinary barriers, and for the dissemination of stochastic techniques used in different fields to the community of interested researchers. Original contributions will be considered dealing with modelling (theoretical and computational), measurements and instrumentation in one or more of the following topical areas:
- Spatiotemporal analysis and mapping of natural processes.
- Enviroinformatics.
- Environmental risk assessment, reliability analysis and decision making.
- Surface and subsurface hydrology and hydraulics.
- Multiphase porous media domains and contaminant transport modelling.
- Hazardous waste site characterization.
- Stochastic turbulence and random hydrodynamic fields.
- Chaotic and fractal systems.
- Random waves and seafloor morphology.
- Stochastic atmospheric and climate processes.
- Air pollution and quality assessment research.
- Modern geostatistics.
- Mechanisms of pollutant formation, emission, exposure and absorption.
- Physical, chemical and biological analysis of human exposure from single and multiple media and routes; control and protection.
- Bioinformatics.
- Probabilistic methods in ecology and population biology.
- Epidemiological investigations.
- Models using stochastic differential equations stochastic or partial differential equations.
- Hazardous waste site characterization.