{"title":"Twitter Buzzer Detection for Indonesian Presidential Election","authors":"Andi Suciati, A. Wibisono, P. Mursanto","doi":"10.1109/ICICoS48119.2019.8982529","DOIUrl":null,"url":null,"abstract":"The campaign that was done in social media has high correlation to the supporters who disseminating the information deliberately, which called as buzzer. However, data that were generated by buzzer accounts can be considered as noise and need to be removed. In this research we performed task for detecting the buzzer accounts in Twitter by observing the impact of features we used which we selected based on their Mutual Information scores. We examined the performance of four machine learning algorithms which are Ada Boost (AB), Gradient Boosting (GB), Extreme Gradient Boosting (XGB), and Histogram-based Gradient Boosting (HGB). The algorithms were evaluated using 10 folds cross validation and the results show that the best accuracy and precision achieved by AB which are 62.3% and 61.3% respectively with 25 features while the recall attained by XGB (67.9%) which the score same with its recall result with 20 features.","PeriodicalId":105407,"journal":{"name":"2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICoS48119.2019.8982529","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
The campaign that was done in social media has high correlation to the supporters who disseminating the information deliberately, which called as buzzer. However, data that were generated by buzzer accounts can be considered as noise and need to be removed. In this research we performed task for detecting the buzzer accounts in Twitter by observing the impact of features we used which we selected based on their Mutual Information scores. We examined the performance of four machine learning algorithms which are Ada Boost (AB), Gradient Boosting (GB), Extreme Gradient Boosting (XGB), and Histogram-based Gradient Boosting (HGB). The algorithms were evaluated using 10 folds cross validation and the results show that the best accuracy and precision achieved by AB which are 62.3% and 61.3% respectively with 25 features while the recall attained by XGB (67.9%) which the score same with its recall result with 20 features.