Applying Data-Driven Modeling for Streamflow Prediction in Semi-Arid Watersheds: A Comparative Evaluation of Machine Learning and Deep Learning Methodologies
Metin Sarıgöl, Okan Mert Katipoğlu, Hüseyin Yildirim Dalkilic
{"title":"Applying Data-Driven Modeling for Streamflow Prediction in Semi-Arid Watersheds: A Comparative Evaluation of Machine Learning and Deep Learning Methodologies","authors":"Metin Sarıgöl, Okan Mert Katipoğlu, Hüseyin Yildirim Dalkilic","doi":"10.1007/s00024-024-03607-9","DOIUrl":null,"url":null,"abstract":"<div><p>Modeling monthly stream flows most accurately is of vital importance for water resource management, agricultural irrigation and efficient hydroelectric energy production, especially in semi-arid areas. Soft computing approaches have recently taken an important place in estimating streamflow time series. The potential of various data-driven approaches to predict streamflow in challenging climate conditions was evaluated. The study used machine and deep learning algorithms to model average monthly stream flows in two stream gauging stations in semi-arid region of the Konya closed basin where agriculture is at the forefront, accurate and reliable estimation of the stream flows is the basis of the study. For this, the performances of emotional neural network algorithm (EmNN), long-short term memory (LSTM), Elman neural network (ENN), nonlinear autoregressive exogenous model (NARX), recurrent neural network (RNN), group method of data handling (GMDH) were compared. The study’s unique contribution lies in its comprehensive comparison of these diverse algorithms, including newer approaches like EmNN, in the specific context of semi-arid hydrology. Partial autocorrelation analysis was applied to select input combinations, and lagged values exceeding 95% confidence limits were presented to the models as the most essential features. Artificial intelligence (AI) models use lagged stream flows to predict the streamflow time series. Statistical parameters, scatter diagrams and a time series approach are used to compare model performance. The GMDH model produced the following test results for 1604 no station: KGE: 0.656, R<sup>2</sup>: 0.608, NSE: 0.343, RMSE: 27.021, MAE: 3.834, MAPE: 0.662, MBE: −0.217, BF: 0.972. Similarly, for 1623 no station, the GMDH model yielded the following test results: KGE: 0.770, R<sup>2</sup>: 0.615, NSE: 0.531, RMSE: 0.006, MAE: 0.047, MAPE: 0.217, MBE: −0.012, BF: 0.956. In addition, the EmNN algorithm was the approach with second prediction accuracy. The findings of the study are important resources for optimizing the selection of AI models for streamflow prediction in semi-regional areas. The study also provides critical information for policymakers and decision-makers in similar climate zones worldwide for water resource management.</p></div>","PeriodicalId":21078,"journal":{"name":"pure and applied geophysics","volume":"181 12","pages":"3561 - 3589"},"PeriodicalIF":1.9000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"pure and applied geophysics","FirstCategoryId":"89","ListUrlMain":"https://link.springer.com/article/10.1007/s00024-024-03607-9","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
Modeling monthly stream flows most accurately is of vital importance for water resource management, agricultural irrigation and efficient hydroelectric energy production, especially in semi-arid areas. Soft computing approaches have recently taken an important place in estimating streamflow time series. The potential of various data-driven approaches to predict streamflow in challenging climate conditions was evaluated. The study used machine and deep learning algorithms to model average monthly stream flows in two stream gauging stations in semi-arid region of the Konya closed basin where agriculture is at the forefront, accurate and reliable estimation of the stream flows is the basis of the study. For this, the performances of emotional neural network algorithm (EmNN), long-short term memory (LSTM), Elman neural network (ENN), nonlinear autoregressive exogenous model (NARX), recurrent neural network (RNN), group method of data handling (GMDH) were compared. The study’s unique contribution lies in its comprehensive comparison of these diverse algorithms, including newer approaches like EmNN, in the specific context of semi-arid hydrology. Partial autocorrelation analysis was applied to select input combinations, and lagged values exceeding 95% confidence limits were presented to the models as the most essential features. Artificial intelligence (AI) models use lagged stream flows to predict the streamflow time series. Statistical parameters, scatter diagrams and a time series approach are used to compare model performance. The GMDH model produced the following test results for 1604 no station: KGE: 0.656, R2: 0.608, NSE: 0.343, RMSE: 27.021, MAE: 3.834, MAPE: 0.662, MBE: −0.217, BF: 0.972. Similarly, for 1623 no station, the GMDH model yielded the following test results: KGE: 0.770, R2: 0.615, NSE: 0.531, RMSE: 0.006, MAE: 0.047, MAPE: 0.217, MBE: −0.012, BF: 0.956. In addition, the EmNN algorithm was the approach with second prediction accuracy. The findings of the study are important resources for optimizing the selection of AI models for streamflow prediction in semi-regional areas. The study also provides critical information for policymakers and decision-makers in similar climate zones worldwide for water resource management.
期刊介绍:
pure and applied geophysics (pageoph), a continuation of the journal "Geofisica pura e applicata", publishes original scientific contributions in the fields of solid Earth, atmospheric and oceanic sciences. Regular and special issues feature thought-provoking reports on active areas of current research and state-of-the-art surveys.
Long running journal, founded in 1939 as Geofisica pura e applicata
Publishes peer-reviewed original scientific contributions and state-of-the-art surveys in solid earth and atmospheric sciences
Features thought-provoking reports on active areas of current research and is a major source for publications on tsunami research
Coverage extends to research topics in oceanic sciences
See Instructions for Authors on the right hand side.