减少特征集的Twitter Bot检测

Jefferson Viana Fonseca Abreu, C. Ralha, J. Gondim
{"title":"减少特征集的Twitter Bot检测","authors":"Jefferson Viana Fonseca Abreu, C. Ralha, J. Gondim","doi":"10.1109/ISI49825.2020.9280525","DOIUrl":null,"url":null,"abstract":"Online social networks provide a novel channel to allow interaction between human beings. Its success has attracted interest in attacking and exploiting them through a wide range of unethical activities, such as malicious actions to manipulate users. One of the methods to carry out these abuses is the use of bots on Twitter. Recent examples of bots influencing public opinion in the election process demonstrate their potential harm to the democratic world. Such malicious behavior needs to be checked and its effects should be diminished. Recently, machine learning (ML) classifiers to distinguish between real and bot accounts have proven advances. Thus, in this work four ML algorithms were tested using a public dataset and a few expressive features based on simple user profile counters for the classification of bots on Twitter. We measured their performance compared to one state–of–the–art bot detection work. The classifier accuracy was considered homogeneous with a mean of 0.8549 and 0.1889 of standard deviation. Besides, all multiclass classifiers obtained AUCs greater than 0.9 indicating a practical benefit for bot detection on Twitter.","PeriodicalId":90875,"journal":{"name":"ISI ... : ... IEEE Intelligence and Security Informatics. IEEE International Conference on Intelligence and Security Informatics","volume":"8 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Twitter Bot Detection with Reduced Feature Set\",\"authors\":\"Jefferson Viana Fonseca Abreu, C. Ralha, J. Gondim\",\"doi\":\"10.1109/ISI49825.2020.9280525\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Online social networks provide a novel channel to allow interaction between human beings. Its success has attracted interest in attacking and exploiting them through a wide range of unethical activities, such as malicious actions to manipulate users. One of the methods to carry out these abuses is the use of bots on Twitter. Recent examples of bots influencing public opinion in the election process demonstrate their potential harm to the democratic world. Such malicious behavior needs to be checked and its effects should be diminished. Recently, machine learning (ML) classifiers to distinguish between real and bot accounts have proven advances. Thus, in this work four ML algorithms were tested using a public dataset and a few expressive features based on simple user profile counters for the classification of bots on Twitter. We measured their performance compared to one state–of–the–art bot detection work. The classifier accuracy was considered homogeneous with a mean of 0.8549 and 0.1889 of standard deviation. Besides, all multiclass classifiers obtained AUCs greater than 0.9 indicating a practical benefit for bot detection on Twitter.\",\"PeriodicalId\":90875,\"journal\":{\"name\":\"ISI ... : ... IEEE Intelligence and Security Informatics. IEEE International Conference on Intelligence and Security Informatics\",\"volume\":\"8 1\",\"pages\":\"1-6\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ISI ... : ... IEEE Intelligence and Security Informatics. IEEE International Conference on Intelligence and Security Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISI49825.2020.9280525\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISI ... : ... IEEE Intelligence and Security Informatics. IEEE International Conference on Intelligence and Security Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISI49825.2020.9280525","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

摘要

在线社交网络为人与人之间的互动提供了一个新的渠道。它的成功吸引了人们的兴趣,通过各种不道德的活动来攻击和利用它们,比如恶意操作用户。实施这些滥用的方法之一是在推特上使用机器人。最近机器人在选举过程中影响公众舆论的例子表明,它们对民主世界的潜在危害。这种恶意行为需要加以制止,其影响应该减少。最近,机器学习(ML)分类器在区分真实账户和机器人账户方面取得了长足的进步。因此,在这项工作中,使用一个公共数据集和一些基于简单用户配置文件计数器的表达特征来测试四种机器学习算法,用于Twitter上的机器人分类。我们将它们的性能与最先进的机器人检测工作进行了比较。分类器的准确率被认为是均匀的,平均为0.8549,标准差为0.1889。此外,所有多类分类器获得的auc都大于0.9,这表明Twitter上的bot检测具有实际优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Twitter Bot Detection with Reduced Feature Set
Online social networks provide a novel channel to allow interaction between human beings. Its success has attracted interest in attacking and exploiting them through a wide range of unethical activities, such as malicious actions to manipulate users. One of the methods to carry out these abuses is the use of bots on Twitter. Recent examples of bots influencing public opinion in the election process demonstrate their potential harm to the democratic world. Such malicious behavior needs to be checked and its effects should be diminished. Recently, machine learning (ML) classifiers to distinguish between real and bot accounts have proven advances. Thus, in this work four ML algorithms were tested using a public dataset and a few expressive features based on simple user profile counters for the classification of bots on Twitter. We measured their performance compared to one state–of–the–art bot detection work. The classifier accuracy was considered homogeneous with a mean of 0.8549 and 0.1889 of standard deviation. Besides, all multiclass classifiers obtained AUCs greater than 0.9 indicating a practical benefit for bot detection on Twitter.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Open Practices of Early Career Researchers: A Qualitative Study on Research and Teaching Behavior Design and Development of an Emoji Sentiment Lexicon Does the General Public Share Research on Twitter? A Case Study on the Online Conversation about the Search for a Nuclear Repository in Germany Vom Datenkatalog zum Wissensgraph - Forschungsdaten im konzeptuellen Modell von FRBR Omission of Information: Identifying Political Slant via an Analysis of Co-occurring Entities
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1