Spam profile detection in social networks based on public features

Ala’ M. Al-Zoubi, Ja'far Alqatawna, Hossam Paris
{"title":"Spam profile detection in social networks based on public features","authors":"Ala’ M. Al-Zoubi, Ja'far Alqatawna, Hossam Paris","doi":"10.1109/IACS.2017.7921959","DOIUrl":null,"url":null,"abstract":"In the context of Online Social Networks, Spam profiles are not just a source of unwanted ads, but a serious security threat used by online criminals and terrorists for various malicious purposes. Recently, such criminals were able to steal a number of accounts that belong to NatWest bank's customers. Their attack vector was based on spam tweets posted by a Twitter account which looked very close to NatWest customer support account and leaded users to a link of a phishing site. In this study, we investigate the nature of spam profiles in Twitter with a goal to improve social spam detection. Based on a set of publicly available features, we develop spam profiles detection models. At this stage, a dataset of 82 Twitter's profiles are collected and analyzed. With feature engineering, we investigate ten binary and simple features that can be used to classify spam profiles. Moreover, a feature selection process is utilized to identify the most influencing features in the process of detecting spam profiles. For feature selection, two methods are used ReliefF and Information Gain. While for classification, four classification algorithms are applied and compared: Decision Trees, Multilayer Perceptron, k-Nearest neighbors and Naive Bayes. Preliminary experiments in this work show that the promising detection rates can be obtained using such features regardless of the language of the tweets.","PeriodicalId":180504,"journal":{"name":"2017 8th International Conference on Information and Communication Systems (ICICS)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"59","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 8th International Conference on Information and Communication Systems (ICICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IACS.2017.7921959","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 59

Abstract

In the context of Online Social Networks, Spam profiles are not just a source of unwanted ads, but a serious security threat used by online criminals and terrorists for various malicious purposes. Recently, such criminals were able to steal a number of accounts that belong to NatWest bank's customers. Their attack vector was based on spam tweets posted by a Twitter account which looked very close to NatWest customer support account and leaded users to a link of a phishing site. In this study, we investigate the nature of spam profiles in Twitter with a goal to improve social spam detection. Based on a set of publicly available features, we develop spam profiles detection models. At this stage, a dataset of 82 Twitter's profiles are collected and analyzed. With feature engineering, we investigate ten binary and simple features that can be used to classify spam profiles. Moreover, a feature selection process is utilized to identify the most influencing features in the process of detecting spam profiles. For feature selection, two methods are used ReliefF and Information Gain. While for classification, four classification algorithms are applied and compared: Decision Trees, Multilayer Perceptron, k-Nearest neighbors and Naive Bayes. Preliminary experiments in this work show that the promising detection rates can be obtained using such features regardless of the language of the tweets.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于公共特征的社交网络垃圾信息检测
在在线社交网络的背景下,垃圾资料不仅是不受欢迎的广告来源,而且是网络犯罪分子和恐怖分子用于各种恶意目的的严重安全威胁。最近,这类犯罪分子成功窃取了NatWest银行客户的多个账户。他们的攻击媒介是基于一个Twitter帐户发布的垃圾推文,该帐户看起来与NatWest客户支持帐户非常接近,并将用户引导到一个网络钓鱼网站的链接。在这项研究中,我们调查了Twitter中垃圾邮件配置文件的性质,目的是提高社交垃圾邮件检测。基于一组公开可用的特性,我们开发了垃圾邮件配置文件检测模型。在这个阶段,收集并分析了82个Twitter个人资料的数据集。通过特征工程,我们研究了10个可用于分类垃圾邮件配置文件的二进制和简单特征。此外,利用特征选择过程来识别检测垃圾邮件配置文件过程中最具影响力的特征。在特征选择方面,采用了ReliefF和Information Gain两种方法。在分类方面,采用了四种分类算法:决策树、多层感知器、k近邻和朴素贝叶斯。本工作的初步实验表明,无论推文的语言如何,使用这些特征都可以获得有希望的检测率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Using behaviour-driven development with hardware-software co-design for autonomous load management Efficient 3D placement of a UAV using particle swarm optimization A brain friendly tool to facilitate research-teaching nexus: Mind maps HidroMORE 2: An optimized and parallel version of HidroMORE Comparative analysis of MCDM methods for product aspect ranking: TOPSIS and VIKOR
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1