P. K. Putra, D. B. Sencaki, G. P. Dinanta, F. Alhasanah, R. Ramadhan
{"title":"Flood Monitoring with Information Extraction Approach from Social Media Data","authors":"P. K. Putra, D. B. Sencaki, G. P. Dinanta, F. Alhasanah, R. Ramadhan","doi":"10.1109/AGERS51788.2020.9452770","DOIUrl":null,"url":null,"abstract":"Flood natural disasters that often occur in Jakarta have a bad impact on many sectors. Countermeasures, fast action, and monitoring need to be done to minimize the impact that occurs. Social Media is a technology platform that can provide flood-related data that can be used as primary data or complementary data for monitoring systems. This study focuses on using social media data to be used as flood monitoring data. The analysis used is an analysis with a natural language processing approach. The classification algorithm method used in this study is naive Bayes, random forest, support vector machine, logistic regression, and conditional random field. Location information extraction methods used are Standford NER and Geocoding. This research produces three models. The first model is the classification model used to classify relevant data with an f1- score evaluation value of 82.5%. The second model is the NER model which is used to extract location entities from sentences with an f1-score evaluation value of 73%. The last one is the locator of geocoding with a success rate of 75% for identifying roads. This research also produces a simple dashboard that can be used as a visualization tool.","PeriodicalId":125663,"journal":{"name":"2020 IEEE Asia-Pacific Conference on Geoscience, Electronics and Remote Sensing Technology (AGERS)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Asia-Pacific Conference on Geoscience, Electronics and Remote Sensing Technology (AGERS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AGERS51788.2020.9452770","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Flood natural disasters that often occur in Jakarta have a bad impact on many sectors. Countermeasures, fast action, and monitoring need to be done to minimize the impact that occurs. Social Media is a technology platform that can provide flood-related data that can be used as primary data or complementary data for monitoring systems. This study focuses on using social media data to be used as flood monitoring data. The analysis used is an analysis with a natural language processing approach. The classification algorithm method used in this study is naive Bayes, random forest, support vector machine, logistic regression, and conditional random field. Location information extraction methods used are Standford NER and Geocoding. This research produces three models. The first model is the classification model used to classify relevant data with an f1- score evaluation value of 82.5%. The second model is the NER model which is used to extract location entities from sentences with an f1-score evaluation value of 73%. The last one is the locator of geocoding with a success rate of 75% for identifying roads. This research also produces a simple dashboard that can be used as a visualization tool.