Towards building a word similarity dictionary for personality bias classification of phishing email contents

Ke Ding, Nicholas Pantic, You Lu, S. Manna, M. Husain
{"title":"Towards building a word similarity dictionary for personality bias classification of phishing email contents","authors":"Ke Ding, Nicholas Pantic, You Lu, S. Manna, M. Husain","doi":"10.1109/ICOSC.2015.7050815","DOIUrl":null,"url":null,"abstract":"Phishing attacks are a form of social engineering technique used for stealing private information from users through emails. A general approach for phishing susceptibility analysis is to profile the user's personality using personality models such as the Five Factor Model (FFM) and find out the susceptibility for a set of phishing attempts. The FFM is a personality profiling system that scores participants on five separate personality traits: openness to experience (O), conscientiousness (C), extraversion (E), agreeableness (A), and neuroticism (N). However, existing approaches don't take into account the fact that based on the content, for example, a phishing email offering an enticing free prize might be very effective on a dominant O-personality (curious, open to new experience), but not to an N-personality (tendency of experiencing negative emotion). Therefore, it is necessary to consider the personality bias of the phishing email contents during the susceptibility analysis. In this paper, we have proposed a method to construct a dictionary based on the semantic similarity of prospective words describing the FFM. Words generated through this dictionary can be used to label the phishing emails according to the personality bias and serve as the key component of a personality bias classification system of phishing emails. We have validated our dictionary construction using a large public corpus of phishing email data which shows the potential of the proposed system in anti-phishing research.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"165 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSC.2015.7050815","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

Phishing attacks are a form of social engineering technique used for stealing private information from users through emails. A general approach for phishing susceptibility analysis is to profile the user's personality using personality models such as the Five Factor Model (FFM) and find out the susceptibility for a set of phishing attempts. The FFM is a personality profiling system that scores participants on five separate personality traits: openness to experience (O), conscientiousness (C), extraversion (E), agreeableness (A), and neuroticism (N). However, existing approaches don't take into account the fact that based on the content, for example, a phishing email offering an enticing free prize might be very effective on a dominant O-personality (curious, open to new experience), but not to an N-personality (tendency of experiencing negative emotion). Therefore, it is necessary to consider the personality bias of the phishing email contents during the susceptibility analysis. In this paper, we have proposed a method to construct a dictionary based on the semantic similarity of prospective words describing the FFM. Words generated through this dictionary can be used to label the phishing emails according to the personality bias and serve as the key component of a personality bias classification system of phishing emails. We have validated our dictionary construction using a large public corpus of phishing email data which shows the potential of the proposed system in anti-phishing research.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
构建用于网络钓鱼邮件内容人格偏见分类的词相似度词典
网络钓鱼攻击是一种社会工程技术,用于通过电子邮件窃取用户的私人信息。网络钓鱼易感性分析的一般方法是使用人格模型(如五因素模型(FFM))来分析用户的个性,并找出一组网络钓鱼尝试的易感性。FFM是一种性格分析系统,它根据参与者的五种不同的性格特征给他们打分:开放性(O),责任心(C),外向性(E),亲和性(A)和神经质(N)。然而,现有的方法并没有考虑到这样一个事实,即基于内容,例如,提供诱人的免费奖品的网络钓鱼电子邮件可能对占主导地位的O型人格(好奇,乐于接受新体验)非常有效,但对N型人格(倾向于体验负面情绪)就没有效果。因此,在敏感性分析中,有必要考虑网络钓鱼邮件内容的人格偏见。在本文中,我们提出了一种基于前置词的语义相似度来构建描述FFM的词典的方法。通过该词典生成的单词可以根据人格偏见对网络钓鱼邮件进行标记,并作为网络钓鱼邮件人格偏见分类系统的关键组成部分。我们使用大量网络钓鱼电子邮件数据验证了我们的词典构建,这表明了所提出的系统在反网络钓鱼研究中的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
NNB: An efficient nearest neighbor search method for hierarchical clustering on large datasets Aggregating financial services data without assumptions: A semantic data reference architecture Reducing search space for Web Service ranking using semantic logs and Semantic FP-Tree based association rule mining An approximation of betweenness centrality for Social Networks Performance analysis of Ensemble methods on Twitter sentiment analysis using NLP techniques
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1