ML-Based Streamflow Prediction in the Upper Colorado River Basin Using Climate Variables Time Series Data

IF 3.1 Q2 WATER RESOURCES Hydrology Pub Date : 2023-01-19 DOI:10.3390/hydrology10020029
Pouya Hosseinzadeh, A. Nassar, S. F. Boubrahimi, S. M. Hamdi
{"title":"ML-Based Streamflow Prediction in the Upper Colorado River Basin Using Climate Variables Time Series Data","authors":"Pouya Hosseinzadeh, A. Nassar, S. F. Boubrahimi, S. M. Hamdi","doi":"10.3390/hydrology10020029","DOIUrl":null,"url":null,"abstract":"Streamflow prediction plays a vital role in water resources planning in order to understand the dramatic change of climatic and hydrologic variables over different time scales. In this study, we used machine learning (ML)-based prediction models, including Random Forest Regression (RFR), Long Short-Term Memory (LSTM), Seasonal Auto- Regressive Integrated Moving Average (SARIMA), and Facebook Prophet (PROPHET) to predict 24 months ahead of natural streamflow at the Lees Ferry site located at the bottom part of the Upper Colorado River Basin (UCRB) of the US. Firstly, we used only historic streamflow data to predict 24 months ahead. Secondly, we considered meteorological components such as temperature and precipitation as additional features. We tested the models on a monthly test dataset spanning 6 years, where 24-month predictions were repeated 50 times to ensure the consistency of the results. Moreover, we performed a sensitivity analysis to identify our best-performing model. Later, we analyzed the effects of considering different span window sizes on the quality of predictions made by our best model. Finally, we applied our best-performing model, RFR, on two more rivers in different states in the UCRB to test the model’s generalizability. We evaluated the performance of the predictive models using multiple evaluation measures. The predictions in multivariate time-series models were found to be more accurate, with RMSE less than 0.84 mm per month, R-squared more than 0.8, and MAPE less than 0.25. Therefore, we conclude that the temperature and precipitation of the UCRB increases the accuracy of the predictions. Ultimately, we found that multivariate RFR performs the best among four models and is generalizable to other rivers in the UCRB.","PeriodicalId":37372,"journal":{"name":"Hydrology","volume":" ","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2023-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Hydrology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/hydrology10020029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"WATER RESOURCES","Score":null,"Total":0}
引用次数: 5

Abstract

Streamflow prediction plays a vital role in water resources planning in order to understand the dramatic change of climatic and hydrologic variables over different time scales. In this study, we used machine learning (ML)-based prediction models, including Random Forest Regression (RFR), Long Short-Term Memory (LSTM), Seasonal Auto- Regressive Integrated Moving Average (SARIMA), and Facebook Prophet (PROPHET) to predict 24 months ahead of natural streamflow at the Lees Ferry site located at the bottom part of the Upper Colorado River Basin (UCRB) of the US. Firstly, we used only historic streamflow data to predict 24 months ahead. Secondly, we considered meteorological components such as temperature and precipitation as additional features. We tested the models on a monthly test dataset spanning 6 years, where 24-month predictions were repeated 50 times to ensure the consistency of the results. Moreover, we performed a sensitivity analysis to identify our best-performing model. Later, we analyzed the effects of considering different span window sizes on the quality of predictions made by our best model. Finally, we applied our best-performing model, RFR, on two more rivers in different states in the UCRB to test the model’s generalizability. We evaluated the performance of the predictive models using multiple evaluation measures. The predictions in multivariate time-series models were found to be more accurate, with RMSE less than 0.84 mm per month, R-squared more than 0.8, and MAPE less than 0.25. Therefore, we conclude that the temperature and precipitation of the UCRB increases the accuracy of the predictions. Ultimately, we found that multivariate RFR performs the best among four models and is generalizable to other rivers in the UCRB.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于ml的上科罗拉多河流域气候变量时间序列流量预测
径流预测在水资源规划中发挥着至关重要的作用,以了解不同时间尺度上气候和水文变量的急剧变化。在这项研究中,我们使用基于机器学习(ML)的预测模型,包括随机森林回归(RFR)、长短期记忆(LSTM)、季节性自回归综合移动平均值(SARIMA)和Facebook Prophet来预测位于美国科罗拉多河上游流域(UCRB)底部的Lees Ferry站点的自然流量提前24个月。首先,我们仅使用历史流量数据来预测未来24个月。其次,我们将温度和降水等气象成分视为附加特征。我们在6年的月度测试数据集上测试了这些模型,其中24个月的预测重复了50次,以确保结果的一致性。此外,我们进行了敏感性分析,以确定我们表现最好的模型。随后,我们分析了考虑不同跨度窗口大小对最佳模型预测质量的影响。最后,我们将我们的最佳性能模型RFR应用于UCRB中不同状态的另外两条河流,以测试该模型的可推广性。我们使用多种评估措施来评估预测模型的性能。发现多变量时间序列模型中的预测更准确,RMSE每月小于0.84 mm,R平方大于0.8,MAPE小于0.25。因此,我们得出结论,UCRB的温度和降水量提高了预测的准确性。最终,我们发现多元RFR在四个模型中表现最好,并且可推广到UCRB中的其他河流。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Hydrology
Hydrology Earth and Planetary Sciences-Earth-Surface Processes
CiteScore
4.90
自引率
21.90%
发文量
192
审稿时长
6 weeks
期刊介绍: Journal of Hydrology publishes original research papers and comprehensive reviews in all the subfields of the hydrological sciences, including water based management and policy issues that impact on economics and society. These comprise, but are not limited to the physical, chemical, biogeochemical, stochastic and systems aspects of surface and groundwater hydrology, hydrometeorology, hydrogeology and hydrogeophysics. Relevant topics incorporating the insights and methodologies of disciplines such as climatology, water resource systems, ecohydrology, geomorphology, soil science, instrumentation and remote sensing, data and information sciences, civil and environmental engineering are within scope. Social science perspectives on hydrological problems such as resource and ecological economics, sociology, psychology and behavioural science, management and policy analysis are also invited. Multi-and interdisciplinary analyses of hydrological problems are within scope. The science published in the Journal of Hydrology is relevant to catchment scales rather than exclusively to a local scale or site. Studies focused on urban hydrological issues are included.
期刊最新文献
An Integrated Framework to Assess the Environmental and Economic Impact of Fertilizer Restrictions in a Nitrate-Contaminated Aquifer Assessing the Performance of a Citizen Science Based Water Quality Monitoring Program for Nitrates Using Test Strips Implemented in the Medjerda Hydrosystem in Northern Tunisia Analysis of Anomalies Due to the ENSO and Long-Term Changes in Extreme Precipitation Indices Using Data from Ground Stations Human Activities Increased Microplastics Contamination in the Himalaya Mountains Spatial Estimation of Snow Water Equivalent for Glaciers and Seasonal Snow in Iceland Using Remote Sensing Snow Cover and Albedo
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1