Shravan Kumar S. M., Manish Pandey, N. V. Umamahesh
{"title":"Predicting total upland sediment yield using regression and machine learning models for improved land management and water conservation","authors":"Shravan Kumar S. M., Manish Pandey, N. V. Umamahesh","doi":"10.2166/hydro.2024.159","DOIUrl":null,"url":null,"abstract":"\n In this study, various regression models were utilized to predict total sediment yield in tons, while their performance was evaluated for accuracy and reliability. The dataset utilized contains numerous predictors that have been standardized and processed through principal component analysis to improve model performance. Models evaluated here include linear regression, normalized linear regression, PCA, PCC with generalized ridge regression, kernel ridge regression, multivariate regression, lasso regression approaches such as CA-ANN or ANN, and more. Results suggest that the artificial neural network (ANN) model achieved the lowest mean squared error (MSE), 113.641; this suggests superior predictive capability compared to other models. Although environmental data were complex and relationships complex, an ANN model showed less error, followed closely by CA-ANN with an MSE of 124.83. Traditional models such as linear or lasso regression revealed larger errors with negative squared values that indicated poor fits to data. This exhaustive analysis not only showcases the power of advanced machine-learning techniques in environmental modeling but also stresses the significance of selecting models based on data characteristics and specific environmental phenomena studied. Furthermore, its insights could assist environmental planners and advocates with better prediction and management of soil erosion and sediment transport for planning purposes and conservation efforts.","PeriodicalId":507813,"journal":{"name":"Journal of Hydroinformatics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hydroinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2166/hydro.2024.159","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this study, various regression models were utilized to predict total sediment yield in tons, while their performance was evaluated for accuracy and reliability. The dataset utilized contains numerous predictors that have been standardized and processed through principal component analysis to improve model performance. Models evaluated here include linear regression, normalized linear regression, PCA, PCC with generalized ridge regression, kernel ridge regression, multivariate regression, lasso regression approaches such as CA-ANN or ANN, and more. Results suggest that the artificial neural network (ANN) model achieved the lowest mean squared error (MSE), 113.641; this suggests superior predictive capability compared to other models. Although environmental data were complex and relationships complex, an ANN model showed less error, followed closely by CA-ANN with an MSE of 124.83. Traditional models such as linear or lasso regression revealed larger errors with negative squared values that indicated poor fits to data. This exhaustive analysis not only showcases the power of advanced machine-learning techniques in environmental modeling but also stresses the significance of selecting models based on data characteristics and specific environmental phenomena studied. Furthermore, its insights could assist environmental planners and advocates with better prediction and management of soil erosion and sediment transport for planning purposes and conservation efforts.