ML-Based Streamflow Prediction in the Upper Colorado River Basin Using Climate Variables Time Series Data

IF 3.1 Q2 WATER RESOURCES Hydrology Pub Date : 2023-01-19 DOI:10.3390/hydrology10020029

Pouya Hosseinzadeh, A. Nassar, S. F. Boubrahimi, S. M. Hamdi

{"title":"ML-Based Streamflow Prediction in the Upper Colorado River Basin Using Climate Variables Time Series Data","authors":"Pouya Hosseinzadeh, A. Nassar, S. F. Boubrahimi, S. M. Hamdi","doi":"10.3390/hydrology10020029","DOIUrl":null,"url":null,"abstract":"Streamflow prediction plays a vital role in water resources planning in order to understand the dramatic change of climatic and hydrologic variables over different time scales. In this study, we used machine learning (ML)-based prediction models, including Random Forest Regression (RFR), Long Short-Term Memory (LSTM), Seasonal Auto- Regressive Integrated Moving Average (SARIMA), and Facebook Prophet (PROPHET) to predict 24 months ahead of natural streamflow at the Lees Ferry site located at the bottom part of the Upper Colorado River Basin (UCRB) of the US. Firstly, we used only historic streamflow data to predict 24 months ahead. Secondly, we considered meteorological components such as temperature and precipitation as additional features. We tested the models on a monthly test dataset spanning 6 years, where 24-month predictions were repeated 50 times to ensure the consistency of the results. Moreover, we performed a sensitivity analysis to identify our best-performing model. Later, we analyzed the effects of considering different span window sizes on the quality of predictions made by our best model. Finally, we applied our best-performing model, RFR, on two more rivers in different states in the UCRB to test the model’s generalizability. We evaluated the performance of the predictive models using multiple evaluation measures. The predictions in multivariate time-series models were found to be more accurate, with RMSE less than 0.84 mm per month, R-squared more than 0.8, and MAPE less than 0.25. Therefore, we conclude that the temperature and precipitation of the UCRB increases the accuracy of the predictions. Ultimately, we found that multivariate RFR performs the best among four models and is generalizable to other rivers in the UCRB.","PeriodicalId":37372,"journal":{"name":"Hydrology","volume":" ","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2023-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Hydrology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/hydrology10020029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"WATER RESOURCES","Score":null,"Total":0}

引用次数: 5

Abstract

Streamflow prediction plays a vital role in water resources planning in order to understand the dramatic change of climatic and hydrologic variables over different time scales. In this study, we used machine learning (ML)-based prediction models, including Random Forest Regression (RFR), Long Short-Term Memory (LSTM), Seasonal Auto- Regressive Integrated Moving Average (SARIMA), and Facebook Prophet (PROPHET) to predict 24 months ahead of natural streamflow at the Lees Ferry site located at the bottom part of the Upper Colorado River Basin (UCRB) of the US. Firstly, we used only historic streamflow data to predict 24 months ahead. Secondly, we considered meteorological components such as temperature and precipitation as additional features. We tested the models on a monthly test dataset spanning 6 years, where 24-month predictions were repeated 50 times to ensure the consistency of the results. Moreover, we performed a sensitivity analysis to identify our best-performing model. Later, we analyzed the effects of considering different span window sizes on the quality of predictions made by our best model. Finally, we applied our best-performing model, RFR, on two more rivers in different states in the UCRB to test the model’s generalizability. We evaluated the performance of the predictive models using multiple evaluation measures. The predictions in multivariate time-series models were found to be more accurate, with RMSE less than 0.84 mm per month, R-squared more than 0.8, and MAPE less than 0.25. Therefore, we conclude that the temperature and precipitation of the UCRB increases the accuracy of the predictions. Ultimately, we found that multivariate RFR performs the best among four models and is generalizable to other rivers in the UCRB.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于ml的上科罗拉多河流域气候变量时间序列流量预测

径流预测在水资源规划中发挥着至关重要的作用，以了解不同时间尺度上气候和水文变量的急剧变化。在这项研究中，我们使用基于机器学习（ML）的预测模型，包括随机森林回归（RFR）、长短期记忆（LSTM）、季节性自回归综合移动平均值（SARIMA）和Facebook Prophet来预测位于美国科罗拉多河上游流域（UCRB）底部的Lees Ferry站点的自然流量提前24个月。首先，我们仅使用历史流量数据来预测未来24个月。其次，我们将温度和降水等气象成分视为附加特征。我们在6年的月度测试数据集上测试了这些模型，其中24个月的预测重复了50次，以确保结果的一致性。此外，我们进行了敏感性分析，以确定我们表现最好的模型。随后，我们分析了考虑不同跨度窗口大小对最佳模型预测质量的影响。最后，我们将我们的最佳性能模型RFR应用于UCRB中不同状态的另外两条河流，以测试该模型的可推广性。我们使用多种评估措施来评估预测模型的性能。发现多变量时间序列模型中的预测更准确，RMSE每月小于0.84 mm，R平方大于0.8，MAPE小于0.25。因此，我们得出结论，UCRB的温度和降水量提高了预测的准确性。最终，我们发现多元RFR在四个模型中表现最好，并且可推广到UCRB中的其他河流。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Hydrology Earth and Planetary Sciences-Earth-Surface Processes

CiteScore

4.90

自引率

21.90%

发文量

192

审稿时长

6 weeks

期刊介绍： Journal of Hydrology publishes original research papers and comprehensive reviews in all the subfields of the hydrological sciences, including water based management and policy issues that impact on economics and society. These comprise, but are not limited to the physical, chemical, biogeochemical, stochastic and systems aspects of surface and groundwater hydrology, hydrometeorology, hydrogeology and hydrogeophysics. Relevant topics incorporating the insights and methodologies of disciplines such as climatology, water resource systems, ecohydrology, geomorphology, soil science, instrumentation and remote sensing, data and information sciences, civil and environmental engineering are within scope. Social science perspectives on hydrological problems such as resource and ecological economics, sociology, psychology and behavioural science, management and policy analysis are also invited. Multi-and interdisciplinary analyses of hydrological problems are within scope. The science published in the Journal of Hydrology is relevant to catchment scales rather than exclusively to a local scale or site. Studies focused on urban hydrological issues are included.