Integration of multi-temporal SAR data and robust machine learning models for improvement of flood susceptibility assessment in the southwest coast of India
{"title":"Integration of multi-temporal SAR data and robust machine learning models for improvement of flood susceptibility assessment in the southwest coast of India","authors":"Pankaj Prasad , Sourav Mandal , Sahil Sandeep Naik , Victor Joseph Loveson , Simanku Borah , Priyankar Chandra , Karthik Sudheer","doi":"10.1016/j.acags.2024.100189","DOIUrl":null,"url":null,"abstract":"<div><p>The flood hazards in the southwest coastal region of India in 2018 and 2020 resulted in numerous casualties and the displacement of over a million people from their homes. In order to mitigate the loss of life and resources caused by recurrent major and minor flood events, it is imperative to develop a comprehensive spatial flood zonation map of the entire area. Therefore, the main aim of the present study is to prepare a flood susceptible map of the southwest coastal region of India using synthetic-aperture radar (SAR) data and robust machine learning algorithms. Accurate flood and non-flood locations have been identified from the multi-temporal Sentinel-1 images. These flood locations are correlated with sixteen flood conditioning geo-environmental variables. The Boruta algorithm has been applied to determine the importance of each flood conditioning parameter. Six efficient machine learning models, namely support vector machine (SVM), k-nearest neighbors (KNN), artificial neural network (ANN), random forest (RF), partial least squares (PLS) and penalized discriminant analysis (PDA) have been applied to delineate the flood susceptible areas of the study region. The performance of the models has been evaluated using several statistical criteria, including area under curve (AUC), overall accuracy, specificity, sensitivity and kappa index. The results have revealed that all models have performed more than 90% of AUC due to the high precision of radar data. However, the RF and SVM models have outperformed other models in terms of all statistical parameters. The findings have identified approximately 13% of the study region as highly vulnerable to flood hazards, emphasizing the need for proper planning and management in these areas.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"24 ","pages":"Article 100189"},"PeriodicalIF":2.6000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197424000363/pdfft?md5=c335020c63eb9eda70216e7662e23b2d&pid=1-s2.0-S2590197424000363-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Computing and Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590197424000363","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
The flood hazards in the southwest coastal region of India in 2018 and 2020 resulted in numerous casualties and the displacement of over a million people from their homes. In order to mitigate the loss of life and resources caused by recurrent major and minor flood events, it is imperative to develop a comprehensive spatial flood zonation map of the entire area. Therefore, the main aim of the present study is to prepare a flood susceptible map of the southwest coastal region of India using synthetic-aperture radar (SAR) data and robust machine learning algorithms. Accurate flood and non-flood locations have been identified from the multi-temporal Sentinel-1 images. These flood locations are correlated with sixteen flood conditioning geo-environmental variables. The Boruta algorithm has been applied to determine the importance of each flood conditioning parameter. Six efficient machine learning models, namely support vector machine (SVM), k-nearest neighbors (KNN), artificial neural network (ANN), random forest (RF), partial least squares (PLS) and penalized discriminant analysis (PDA) have been applied to delineate the flood susceptible areas of the study region. The performance of the models has been evaluated using several statistical criteria, including area under curve (AUC), overall accuracy, specificity, sensitivity and kappa index. The results have revealed that all models have performed more than 90% of AUC due to the high precision of radar data. However, the RF and SVM models have outperformed other models in terms of all statistical parameters. The findings have identified approximately 13% of the study region as highly vulnerable to flood hazards, emphasizing the need for proper planning and management in these areas.