Jefferson Viana Fonseca Abreu, C. Ralha, J. Gondim
{"title":"Twitter Bot Detection with Reduced Feature Set","authors":"Jefferson Viana Fonseca Abreu, C. Ralha, J. Gondim","doi":"10.1109/ISI49825.2020.9280525","DOIUrl":null,"url":null,"abstract":"Online social networks provide a novel channel to allow interaction between human beings. Its success has attracted interest in attacking and exploiting them through a wide range of unethical activities, such as malicious actions to manipulate users. One of the methods to carry out these abuses is the use of bots on Twitter. Recent examples of bots influencing public opinion in the election process demonstrate their potential harm to the democratic world. Such malicious behavior needs to be checked and its effects should be diminished. Recently, machine learning (ML) classifiers to distinguish between real and bot accounts have proven advances. Thus, in this work four ML algorithms were tested using a public dataset and a few expressive features based on simple user profile counters for the classification of bots on Twitter. We measured their performance compared to one state–of–the–art bot detection work. The classifier accuracy was considered homogeneous with a mean of 0.8549 and 0.1889 of standard deviation. Besides, all multiclass classifiers obtained AUCs greater than 0.9 indicating a practical benefit for bot detection on Twitter.","PeriodicalId":90875,"journal":{"name":"ISI ... : ... IEEE Intelligence and Security Informatics. IEEE International Conference on Intelligence and Security Informatics","volume":"8 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISI ... : ... IEEE Intelligence and Security Informatics. IEEE International Conference on Intelligence and Security Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISI49825.2020.9280525","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
Online social networks provide a novel channel to allow interaction between human beings. Its success has attracted interest in attacking and exploiting them through a wide range of unethical activities, such as malicious actions to manipulate users. One of the methods to carry out these abuses is the use of bots on Twitter. Recent examples of bots influencing public opinion in the election process demonstrate their potential harm to the democratic world. Such malicious behavior needs to be checked and its effects should be diminished. Recently, machine learning (ML) classifiers to distinguish between real and bot accounts have proven advances. Thus, in this work four ML algorithms were tested using a public dataset and a few expressive features based on simple user profile counters for the classification of bots on Twitter. We measured their performance compared to one state–of–the–art bot detection work. The classifier accuracy was considered homogeneous with a mean of 0.8549 and 0.1889 of standard deviation. Besides, all multiclass classifiers obtained AUCs greater than 0.9 indicating a practical benefit for bot detection on Twitter.