{"title":"A novel sample-enhancement framework for machine learning-based urban flood susceptibility assessment","authors":"Huabing Huang, Changpeng Wang, Zhiwen Tao, Jiayin Zhan","doi":"10.1016/j.envsoft.2024.106314","DOIUrl":null,"url":null,"abstract":"<div><div>The commonly used random sampling method in machine learning-based flood susceptibility studies has two major issues: a default invalid assumption of spatial homogeneity and an inadequate number of non-flood samples. To address these issues, this study proposed a novel sample-enhancement framework to improve the quality of training samples on both flood and non-flood sides. Three one-way enhancements (two flood and one non-flood) and two joint enhancements were designed. The enhancements were evaluated against random sampling using four mainstream machine learning algorithms (ANN, RF, SVM, and XGBoost) across two heterogeneous urban regions in Guangzhou, China. The highest performances are achieved by the joint enhancements, which are followed by one-way enhancements and random sampling (no enhancement). Another important conclusion is that one-way enhancements exhibit divergent yet complementary effects. Flood enhancements primarily affect susceptibility distribution (mean value and standard deviation), while non-flood enhancements mainly influence binary classification performance (AUC).</div></div>","PeriodicalId":310,"journal":{"name":"Environmental Modelling & Software","volume":"185 ","pages":"Article 106314"},"PeriodicalIF":4.8000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Modelling & Software","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S136481522400375X","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
The commonly used random sampling method in machine learning-based flood susceptibility studies has two major issues: a default invalid assumption of spatial homogeneity and an inadequate number of non-flood samples. To address these issues, this study proposed a novel sample-enhancement framework to improve the quality of training samples on both flood and non-flood sides. Three one-way enhancements (two flood and one non-flood) and two joint enhancements were designed. The enhancements were evaluated against random sampling using four mainstream machine learning algorithms (ANN, RF, SVM, and XGBoost) across two heterogeneous urban regions in Guangzhou, China. The highest performances are achieved by the joint enhancements, which are followed by one-way enhancements and random sampling (no enhancement). Another important conclusion is that one-way enhancements exhibit divergent yet complementary effects. Flood enhancements primarily affect susceptibility distribution (mean value and standard deviation), while non-flood enhancements mainly influence binary classification performance (AUC).
期刊介绍:
Environmental Modelling & Software publishes contributions, in the form of research articles, reviews and short communications, on recent advances in environmental modelling and/or software. The aim is to improve our capacity to represent, understand, predict or manage the behaviour of environmental systems at all practical scales, and to communicate those improvements to a wide scientific and professional audience.