Assessing influential rainfall–runoff variables to simulate daily streamflow using random forest
F. Vilaseca, A. Castro, C. Chreties, A. Gorgoglione
Hydrological Sciences Journal-Journal Des Sciences Hydrologiques, published 2023-07-04
DOI: 10.1080/02626667.2023.2232356 (https://doi.org/10.1080/02626667.2023.2232356)
Citations: 0
Abstract
This work aims to improve feature selection for data-driven rainfall–runoff models by assessing the significance of each input variable in the learning process and analysing it from a physical point of view. For this purpose, a set of 14 experiments was carried out in two watersheds of the Santa Lucía Chico basin, Uruguay. In each watershed, a random forest model was trained and tested for daily discharge prediction using different input variables. A feature importance analysis was carried out for each model using a non-model-biased method (Shapley additive explanations, SHAP). Results showed that the most relevant variables were discharges lagged by one and two days, along with seven-day accumulated rainfall, which is interpreted as a proxy for the soil moisture condition of the watershed. Temperature was also relevant and was proven to represent the effect of the whole set of climatic variables (relative humidity, solar radiation, wind speed).
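The input variables the abstract singles out (discharge lagged by one and two days, plus seven-day accumulated rainfall) can be illustrated with a minimal feature-construction sketch. This is not the authors' code: the function name `build_features`, the column names, and the choice to sum rainfall over the seven days before the target day (excluding the target day itself) are all assumptions made for illustration, since the abstract does not specify them.

```python
from typing import Dict, List, Sequence


def build_features(
    discharge: Sequence[float],
    rainfall: Sequence[float],
    lags: Sequence[int] = (1, 2),
    window: int = 7,
) -> List[Dict[str, float]]:
    """Build one feature row per day that has a complete history.

    Hypothetical helper: names and windowing convention are assumptions,
    not taken from the paper.
    """
    if len(discharge) != len(rainfall):
        raise ValueError("discharge and rainfall must have the same length")

    # The first usable day needs every lag and a full rainfall window behind it.
    start = max(max(lags), window)
    rows: List[Dict[str, float]] = []
    for t in range(start, len(discharge)):
        # Lagged discharge: the value observed k days before day t.
        row = {f"q_lag{k}": discharge[t - k] for k in lags}
        # Accumulated rainfall over the `window` days preceding day t
        # (assumption: day t itself is excluded from the sum).
        row[f"p_sum{window}"] = sum(rainfall[t - window:t])
        # Target: discharge on day t.
        row["target_q"] = discharge[t]
        rows.append(row)
    return rows


if __name__ == "__main__":
    # Synthetic series, for illustration only.
    q = [float(i) for i in range(10)]
    p = [1.0] * 10
    for r in build_features(q, p):
        print(r)
```

Rows built this way could then be passed to a random forest regressor and a SHAP-style importance analysis, which is the workflow the abstract describes; those steps are left out here because they depend on third-party libraries and on settings the abstract does not give.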
About the journal:
Hydrological Sciences Journal is an international journal focused on hydrology and the relationship of water to atmospheric processes and climate.
Hydrological Sciences Journal is the official journal of the International Association of Hydrological Sciences (IAHS).
Hydrological Sciences Journal aims to provide a forum for original papers and for the exchange of information and views on significant developments in hydrology worldwide on subjects including:
Hydrological cycle and processes
Surface water
Groundwater
Water resource systems and management
Geographical factors
Earth and atmospheric processes
Hydrological extremes and their impact
Hydrological Sciences Journal offers a variety of formats for paper submission, including original articles, scientific notes, discussions, and rapid communications.