Spam profile detection in social networks based on public features

2017 8th International Conference on Information and Communication Systems (ICICS) Pub Date : 2017-04-01 DOI:10.1109/IACS.2017.7921959

Ala’ M. Al-Zoubi, Ja'far Alqatawna, Hossam Paris

{"title":"Spam profile detection in social networks based on public features","authors":"Ala’ M. Al-Zoubi, Ja'far Alqatawna, Hossam Paris","doi":"10.1109/IACS.2017.7921959","DOIUrl":null,"url":null,"abstract":"In the context of Online Social Networks, Spam profiles are not just a source of unwanted ads, but a serious security threat used by online criminals and terrorists for various malicious purposes. Recently, such criminals were able to steal a number of accounts that belong to NatWest bank's customers. Their attack vector was based on spam tweets posted by a Twitter account which looked very close to NatWest customer support account and leaded users to a link of a phishing site. In this study, we investigate the nature of spam profiles in Twitter with a goal to improve social spam detection. Based on a set of publicly available features, we develop spam profiles detection models. At this stage, a dataset of 82 Twitter's profiles are collected and analyzed. With feature engineering, we investigate ten binary and simple features that can be used to classify spam profiles. Moreover, a feature selection process is utilized to identify the most influencing features in the process of detecting spam profiles. For feature selection, two methods are used ReliefF and Information Gain. While for classification, four classification algorithms are applied and compared: Decision Trees, Multilayer Perceptron, k-Nearest neighbors and Naive Bayes. Preliminary experiments in this work show that the promising detection rates can be obtained using such features regardless of the language of the tweets.","PeriodicalId":180504,"journal":{"name":"2017 8th International Conference on Information and Communication Systems (ICICS)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"59","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 8th International Conference on Information and Communication Systems (ICICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IACS.2017.7921959","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 59

Abstract

In the context of Online Social Networks, Spam profiles are not just a source of unwanted ads, but a serious security threat used by online criminals and terrorists for various malicious purposes. Recently, such criminals were able to steal a number of accounts that belong to NatWest bank's customers. Their attack vector was based on spam tweets posted by a Twitter account which looked very close to NatWest customer support account and leaded users to a link of a phishing site. In this study, we investigate the nature of spam profiles in Twitter with a goal to improve social spam detection. Based on a set of publicly available features, we develop spam profiles detection models. At this stage, a dataset of 82 Twitter's profiles are collected and analyzed. With feature engineering, we investigate ten binary and simple features that can be used to classify spam profiles. Moreover, a feature selection process is utilized to identify the most influencing features in the process of detecting spam profiles. For feature selection, two methods are used ReliefF and Information Gain. While for classification, four classification algorithms are applied and compared: Decision Trees, Multilayer Perceptron, k-Nearest neighbors and Naive Bayes. Preliminary experiments in this work show that the promising detection rates can be obtained using such features regardless of the language of the tweets.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于公共特征的社交网络垃圾信息检测

在在线社交网络的背景下，垃圾资料不仅是不受欢迎的广告来源，而且是网络犯罪分子和恐怖分子用于各种恶意目的的严重安全威胁。最近，这类犯罪分子成功窃取了NatWest银行客户的多个账户。他们的攻击媒介是基于一个Twitter帐户发布的垃圾推文，该帐户看起来与NatWest客户支持帐户非常接近，并将用户引导到一个网络钓鱼网站的链接。在这项研究中，我们调查了Twitter中垃圾邮件配置文件的性质，目的是提高社交垃圾邮件检测。基于一组公开可用的特性，我们开发了垃圾邮件配置文件检测模型。在这个阶段，收集并分析了82个Twitter个人资料的数据集。通过特征工程，我们研究了10个可用于分类垃圾邮件配置文件的二进制和简单特征。此外，利用特征选择过程来识别检测垃圾邮件配置文件过程中最具影响力的特征。在特征选择方面，采用了ReliefF和Information Gain两种方法。在分类方面，采用了四种分类算法:决策树、多层感知器、k近邻和朴素贝叶斯。本工作的初步实验表明，无论推文的语言如何，使用这些特征都可以获得有希望的检测率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2017 8th International Conference on Information and Communication Systems (ICICS)

自引率

0.00%

发文量