{"title":"DEVELOPING A FLOOD FORECASTING SYSTEM WITH MACHINE LEARNING AND APPLYING TO GEOGRAPHIC INFORMATION SYSTEM","authors":"Jirayu Pungching, Sitang Pilailar","doi":"10.21163/gt_2023.181.01","DOIUrl":null,"url":null,"abstract":": Floods are natural disasters that can damage lives, property, and the economy. Therefore, it is necessary to have a reliable and accurate flood forecasting system to provide early warning in time. Although several Mathematical models have been developed and used to forecast floods continuously for decades, most require up-to-date and specific physical data, including a high experience user, to provide and interpret the result. It is an obstacle for use in remote areas with incomplete information and a lack of specialists. This study, therefore, developed a real-time flood forecasting system with Machine Learning by applying a 2-variable sliding window technique to restructure the data, which can solve the problem of data limitation. Thung Song District Nakhon Si Thammarat Province was selected to test this newly developed model. By importing the water level data of two water level observed stations, SWR025 at the upstream and NKO001 at Thung Song Municipality, into five machine learning algorithms (Linear Regression, Support Vector Machine, K-Nearest Neighbor, Decision Tree, and Random Forest) for forecasting the water level every 30 minutes for the next 5 hours. Their performance was compared by the MSE, MAE, and R 2, which ranged from 0.006-0.013, 0.044-0.063, and 0.518-0.750, respectively. The Random Forest was the most efficient algorithm for the 3-hour forecast with an efficiency value of MSE 0.006, MAE 0.044, and R 2 0.75. The developed ML flood forecasting model was validated by the flood data in November 2021 and showed good agreement. Then, the extent of the inundation area was evaluated by the mathematical model. Next, the water depth and surface elevation were transformed and applied to GIS. Finally, the flood risk areas on Google Maps under that specific rainfall are promptly notified to the people three hours before the flood occurs.","PeriodicalId":45100,"journal":{"name":"Geographia Technica","volume":" ","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2022-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geographia Technica","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21163/gt_2023.181.01","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
: Floods are natural disasters that can damage lives, property, and the economy. Therefore, it is necessary to have a reliable and accurate flood forecasting system to provide early warning in time. Although several Mathematical models have been developed and used to forecast floods continuously for decades, most require up-to-date and specific physical data, including a high experience user, to provide and interpret the result. It is an obstacle for use in remote areas with incomplete information and a lack of specialists. This study, therefore, developed a real-time flood forecasting system with Machine Learning by applying a 2-variable sliding window technique to restructure the data, which can solve the problem of data limitation. Thung Song District Nakhon Si Thammarat Province was selected to test this newly developed model. By importing the water level data of two water level observed stations, SWR025 at the upstream and NKO001 at Thung Song Municipality, into five machine learning algorithms (Linear Regression, Support Vector Machine, K-Nearest Neighbor, Decision Tree, and Random Forest) for forecasting the water level every 30 minutes for the next 5 hours. Their performance was compared by the MSE, MAE, and R 2, which ranged from 0.006-0.013, 0.044-0.063, and 0.518-0.750, respectively. The Random Forest was the most efficient algorithm for the 3-hour forecast with an efficiency value of MSE 0.006, MAE 0.044, and R 2 0.75. The developed ML flood forecasting model was validated by the flood data in November 2021 and showed good agreement. Then, the extent of the inundation area was evaluated by the mathematical model. Next, the water depth and surface elevation were transformed and applied to GIS. Finally, the flood risk areas on Google Maps under that specific rainfall are promptly notified to the people three hours before the flood occurs.
期刊介绍:
Geographia Technica is a journal devoted to the publication of all papers on all aspects of the use of technical and quantitative methods in geographical research. It aims at presenting its readers with the latest developments in G.I.S technology, mathematical methods applicable to any field of geography, territorial micro-scalar and laboratory experiments, and the latest developments induced by the measurement techniques to the geographical research. Geographia Technica is dedicated to all those who understand that nowadays every field of geography can only be described by specific numerical values, variables both oftime and space which require the sort of numerical analysis only possible with the aid of technical and quantitative methods offered by powerful computers and dedicated software. Our understanding of Geographia Technica expands the concept of technical methods applied to geography to its broadest sense and for that, papers of different interests such as: G.l.S, Spatial Analysis, Remote Sensing, Cartography or Geostatistics as well as papers which, by promoting the above mentioned directions bring a technical approach in the fields of hydrology, climatology, geomorphology, human geography territorial planning are more than welcomed provided they are of sufficient wide interest and relevance.