Sahar Safari, Mohammad Sadegh Sadeghian, Hooman Hajikandi, S. Sajad Mehdizadeh
{"title":"Identifying homogeneous hydrological zones for flood prediction using multivariable statistical methods and machine learning","authors":"Sahar Safari, Mohammad Sadegh Sadeghian, Hooman Hajikandi, S. Sajad Mehdizadeh","doi":"10.1007/s13201-024-02316-x","DOIUrl":null,"url":null,"abstract":"<div><p>One method for estimating floods in areas lacking statistical data is the use of regional frequency analysis based on machine learning. In this study, statistical and clustering-based approaches were evaluated for flood estimation in the Karkheh watershed. The hydrological homogeneity of the obtained zones was then assessed using linear moments and heterogeneity adjustment methods proposed by Hosking and Wallis. Then, the ZDIST statistic was used to calculate the three-parameter distributions for stations within each hydrologically homogeneous cluster. These parameters were computed using linear moments, and floods with different return periods at each station were estimated using regional relationships. The results indicated the creation of two clusters in this area, with five stations in cluster one and 11 stations in cluster two. The statistical homogeneity values for clusters one and two were calculated as 0.33 and 0.17, respectively, indicating the homogeneity of each region. Generalized Pearson type III and generalized extreme value distributions were selected as the best regional distributions for clusters 1 and 2, respectively. The results also showed that floods could be estimated for return periods of 2, 5, 25 years, and more. The highest estimated flood is predicted at the Jelugir-e Majin station, where the flood with a 2-year return period reaches 1034 m<sup>3</sup> s<sup>−1</sup>. This increases to 5360 m<sup>3</sup> s<sup>−1</sup> for a 100-year return period. The approach presented in this study is recommended for similar regions lacking complete information.</p></div>","PeriodicalId":8374,"journal":{"name":"Applied Water Science","volume":"14 12","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s13201-024-02316-x.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Water Science","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s13201-024-02316-x","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"WATER RESOURCES","Score":null,"Total":0}
引用次数: 0
Abstract
One method for estimating floods in areas lacking statistical data is the use of regional frequency analysis based on machine learning. In this study, statistical and clustering-based approaches were evaluated for flood estimation in the Karkheh watershed. The hydrological homogeneity of the obtained zones was then assessed using linear moments and heterogeneity adjustment methods proposed by Hosking and Wallis. Then, the ZDIST statistic was used to calculate the three-parameter distributions for stations within each hydrologically homogeneous cluster. These parameters were computed using linear moments, and floods with different return periods at each station were estimated using regional relationships. The results indicated the creation of two clusters in this area, with five stations in cluster one and 11 stations in cluster two. The statistical homogeneity values for clusters one and two were calculated as 0.33 and 0.17, respectively, indicating the homogeneity of each region. Generalized Pearson type III and generalized extreme value distributions were selected as the best regional distributions for clusters 1 and 2, respectively. The results also showed that floods could be estimated for return periods of 2, 5, 25 years, and more. The highest estimated flood is predicted at the Jelugir-e Majin station, where the flood with a 2-year return period reaches 1034 m3 s−1. This increases to 5360 m3 s−1 for a 100-year return period. The approach presented in this study is recommended for similar regions lacking complete information.