{"title":"使用新型混合SVR-IWOA和机器学习模型增强了锋利边缘宽度收缩的流量系数预测","authors":"Ehsan Afaridegan , Reza Fatahi-Alkouhi , Paymaneh Azarm , Nosratollah Amanian","doi":"10.1016/j.jhydrol.2025.133103","DOIUrl":null,"url":null,"abstract":"<div><div>Sharp-Edged Width Constrictions (SEWC) are hydraulic structures designed to measure flow in open channels. Accurate prediction of the discharge coefficient (<em>C<sub>d</sub></em>) in SEWC is crucial for determining water discharge in these channels. This information plays a key role in effective water resource management, supporting decision-making regarding the allocation and conservation of water for agricultural, industrial, and municipal purposes. This study introduces a novel hybrid machine learning model, combining Support Vector Regression (SVR) with the Improved Whale Optimization Algorithm (IWOA). Additionally, advanced machine learning models such as NGBoost, AutoInt, and TabNet were employed to predict <em>C<sub>d</sub></em> in SEWC. The SVR-IWOA model offers automatic hyperparameter tuning, significantly enhancing prediction accuracy in complex flow conditions. To develop these models, a dataset consisting of 156 laboratory data points from SEWC experiments was utilized, with 75 % of the data allocated for training and 25 % for testing. The Isolation Forest (IF) algorithm was applied to detect and remove outliers, leading to the exclusion of 5.1 % of the original dataset. Dimensional analysis identified critical factors influencing <em>C<sub>d</sub></em>, including the ratio of upstream depth to opening width (<em>h</em>/<em>b</em>) and the constriction ratio (<em>β</em> = <em>b</em>/<em>B</em>, where <em>B</em> is the channel width). The validity of these dimensionless parameters was confirmed using ANOVA and SHAP analyses, which highlighted <em>β</em> as the most influential factor affecting <em>C<sub>d</sub></em>. Model performance was rigorously evaluated using multiple metrics, including the coefficient of determination (<em>R</em><sup>2</sup>), Root Mean Squared Error (RMSE), Scatter Index (SI), Weighted Mean Absolute Percentage Error (WMAPE), and symmetric Mean Absolute Percentage Error (sMAPE). Comparative evaluations were conducted using Taylor Diagrams, Residual Error Curves (REC), and the Performance Index (PI). In the training stage, NGBoost demonstrated superior performance with a PI of 4994 and a normalized Root Mean Squared Error (<em>E′</em>) of 0.00021, followed by SVR-IWOA with a PI of 2490 and <em>E′</em> of 0.00035. During the testing stage, the SVR-IWOA model maintained strong performance, achieving a PI of 1986 and a low <em>E′</em> of 0.00046, while TabNet closely followed with a PI of 1986 and <em>E′</em> of 0.00047. In terms of <em>R</em><sup>2</sup> values, the models ranked as follows during testing: SVR-IWOA and TabNet tied for first with <em>R</em><sup>2</sup> = 0.993, followed by NGBoost (<em>R</em><sup>2</sup> = 0.992) and AutoInt (<em>R</em><sup>2</sup> = 0.973). These findings highlight the effectiveness of the proposed SVR-IWOA model in accurately predicting <em>C<sub>d</sub></em> and its strong generalization capabilities, positioning it as a robust tool for hydraulic applications.</div></div>","PeriodicalId":362,"journal":{"name":"Journal of Hydrology","volume":"657 ","pages":"Article 133103"},"PeriodicalIF":6.3000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhanced prediction of discharge coefficient in sharp-edged width constrictions using a novel hybrid SVR-IWOA and machine learning models\",\"authors\":\"Ehsan Afaridegan , Reza Fatahi-Alkouhi , Paymaneh Azarm , Nosratollah Amanian\",\"doi\":\"10.1016/j.jhydrol.2025.133103\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Sharp-Edged Width Constrictions (SEWC) are hydraulic structures designed to measure flow in open channels. Accurate prediction of the discharge coefficient (<em>C<sub>d</sub></em>) in SEWC is crucial for determining water discharge in these channels. This information plays a key role in effective water resource management, supporting decision-making regarding the allocation and conservation of water for agricultural, industrial, and municipal purposes. This study introduces a novel hybrid machine learning model, combining Support Vector Regression (SVR) with the Improved Whale Optimization Algorithm (IWOA). Additionally, advanced machine learning models such as NGBoost, AutoInt, and TabNet were employed to predict <em>C<sub>d</sub></em> in SEWC. The SVR-IWOA model offers automatic hyperparameter tuning, significantly enhancing prediction accuracy in complex flow conditions. To develop these models, a dataset consisting of 156 laboratory data points from SEWC experiments was utilized, with 75 % of the data allocated for training and 25 % for testing. The Isolation Forest (IF) algorithm was applied to detect and remove outliers, leading to the exclusion of 5.1 % of the original dataset. Dimensional analysis identified critical factors influencing <em>C<sub>d</sub></em>, including the ratio of upstream depth to opening width (<em>h</em>/<em>b</em>) and the constriction ratio (<em>β</em> = <em>b</em>/<em>B</em>, where <em>B</em> is the channel width). The validity of these dimensionless parameters was confirmed using ANOVA and SHAP analyses, which highlighted <em>β</em> as the most influential factor affecting <em>C<sub>d</sub></em>. Model performance was rigorously evaluated using multiple metrics, including the coefficient of determination (<em>R</em><sup>2</sup>), Root Mean Squared Error (RMSE), Scatter Index (SI), Weighted Mean Absolute Percentage Error (WMAPE), and symmetric Mean Absolute Percentage Error (sMAPE). Comparative evaluations were conducted using Taylor Diagrams, Residual Error Curves (REC), and the Performance Index (PI). In the training stage, NGBoost demonstrated superior performance with a PI of 4994 and a normalized Root Mean Squared Error (<em>E′</em>) of 0.00021, followed by SVR-IWOA with a PI of 2490 and <em>E′</em> of 0.00035. During the testing stage, the SVR-IWOA model maintained strong performance, achieving a PI of 1986 and a low <em>E′</em> of 0.00046, while TabNet closely followed with a PI of 1986 and <em>E′</em> of 0.00047. In terms of <em>R</em><sup>2</sup> values, the models ranked as follows during testing: SVR-IWOA and TabNet tied for first with <em>R</em><sup>2</sup> = 0.993, followed by NGBoost (<em>R</em><sup>2</sup> = 0.992) and AutoInt (<em>R</em><sup>2</sup> = 0.973). These findings highlight the effectiveness of the proposed SVR-IWOA model in accurately predicting <em>C<sub>d</sub></em> and its strong generalization capabilities, positioning it as a robust tool for hydraulic applications.</div></div>\",\"PeriodicalId\":362,\"journal\":{\"name\":\"Journal of Hydrology\",\"volume\":\"657 \",\"pages\":\"Article 133103\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Hydrology\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S002216942500441X\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/15 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hydrology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S002216942500441X","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/15 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Enhanced prediction of discharge coefficient in sharp-edged width constrictions using a novel hybrid SVR-IWOA and machine learning models
Sharp-Edged Width Constrictions (SEWC) are hydraulic structures designed to measure flow in open channels. Accurate prediction of the discharge coefficient (Cd) in SEWC is crucial for determining water discharge in these channels. This information plays a key role in effective water resource management, supporting decision-making regarding the allocation and conservation of water for agricultural, industrial, and municipal purposes. This study introduces a novel hybrid machine learning model, combining Support Vector Regression (SVR) with the Improved Whale Optimization Algorithm (IWOA). Additionally, advanced machine learning models such as NGBoost, AutoInt, and TabNet were employed to predict Cd in SEWC. The SVR-IWOA model offers automatic hyperparameter tuning, significantly enhancing prediction accuracy in complex flow conditions. To develop these models, a dataset consisting of 156 laboratory data points from SEWC experiments was utilized, with 75 % of the data allocated for training and 25 % for testing. The Isolation Forest (IF) algorithm was applied to detect and remove outliers, leading to the exclusion of 5.1 % of the original dataset. Dimensional analysis identified critical factors influencing Cd, including the ratio of upstream depth to opening width (h/b) and the constriction ratio (β = b/B, where B is the channel width). The validity of these dimensionless parameters was confirmed using ANOVA and SHAP analyses, which highlighted β as the most influential factor affecting Cd. Model performance was rigorously evaluated using multiple metrics, including the coefficient of determination (R2), Root Mean Squared Error (RMSE), Scatter Index (SI), Weighted Mean Absolute Percentage Error (WMAPE), and symmetric Mean Absolute Percentage Error (sMAPE). Comparative evaluations were conducted using Taylor Diagrams, Residual Error Curves (REC), and the Performance Index (PI). In the training stage, NGBoost demonstrated superior performance with a PI of 4994 and a normalized Root Mean Squared Error (E′) of 0.00021, followed by SVR-IWOA with a PI of 2490 and E′ of 0.00035. During the testing stage, the SVR-IWOA model maintained strong performance, achieving a PI of 1986 and a low E′ of 0.00046, while TabNet closely followed with a PI of 1986 and E′ of 0.00047. In terms of R2 values, the models ranked as follows during testing: SVR-IWOA and TabNet tied for first with R2 = 0.993, followed by NGBoost (R2 = 0.992) and AutoInt (R2 = 0.973). These findings highlight the effectiveness of the proposed SVR-IWOA model in accurately predicting Cd and its strong generalization capabilities, positioning it as a robust tool for hydraulic applications.
期刊介绍:
The Journal of Hydrology publishes original research papers and comprehensive reviews in all the subfields of the hydrological sciences including water based management and policy issues that impact on economics and society. These comprise, but are not limited to the physical, chemical, biogeochemical, stochastic and systems aspects of surface and groundwater hydrology, hydrometeorology and hydrogeology. Relevant topics incorporating the insights and methodologies of disciplines such as climatology, water resource systems, hydraulics, agrohydrology, geomorphology, soil science, instrumentation and remote sensing, civil and environmental engineering are included. Social science perspectives on hydrological problems such as resource and ecological economics, environmental sociology, psychology and behavioural science, management and policy analysis are also invited. Multi-and interdisciplinary analyses of hydrological problems are within scope. The science published in the Journal of Hydrology is relevant to catchment scales rather than exclusively to a local scale or site.