基于用户行为图划分方法的意见垃圾邮件检测

Bundit Manaskasemsak, Chayada Chanmakho, Jakapong Klainongsuang, A. Rungsawang
{"title":"基于用户行为图划分方法的意见垃圾邮件检测","authors":"Bundit Manaskasemsak, Chayada Chanmakho, Jakapong Klainongsuang, A. Rungsawang","doi":"10.1145/3325773.3325783","DOIUrl":null,"url":null,"abstract":"Online reviews, an important source of user opinions, help not only other customers to make a decision but also manufacturers to improve quality of their products or services. Due to commercial reasons, untruthful reviews (spam) written to promote or demote certain products rather than they deserve have become a crucial problem. Although existing supervised approaches have shown the effectiveness of spam detection by using statistical learning, they require much expensive cost for labeling the training data. In this paper, we present BeGP that is a graph-partitioned approach for opinion spam detection. A set of characteristic features is first extracted and a user behavioral graph is constructed by connecting reviewers sharing those features to capture their similar behavior. BeGP is a semi-supervised scheme without requiring any training. Hence, it starts with a small subgraph of labeled spammers and afterwards iteratively expands by conducting connected other users as a resulted set of suspects. We demonstrate the effectiveness of BeGP on two real-world review datasets from Yelp.com. The result shows that it outperforms several state-of-the-art methods with accurately identifying spammers as well as review spams within the k-first order of ranking.","PeriodicalId":419017,"journal":{"name":"Proceedings of the 2019 3rd International Conference on Intelligent Systems, Metaheuristics & Swarm Intelligence","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Opinion Spam Detection through User Behavioral Graph Partitioning Approach\",\"authors\":\"Bundit Manaskasemsak, Chayada Chanmakho, Jakapong Klainongsuang, A. Rungsawang\",\"doi\":\"10.1145/3325773.3325783\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Online reviews, an important source of user opinions, help not only other customers to make a decision but also manufacturers to improve quality of their products or services. Due to commercial reasons, untruthful reviews (spam) written to promote or demote certain products rather than they deserve have become a crucial problem. Although existing supervised approaches have shown the effectiveness of spam detection by using statistical learning, they require much expensive cost for labeling the training data. In this paper, we present BeGP that is a graph-partitioned approach for opinion spam detection. A set of characteristic features is first extracted and a user behavioral graph is constructed by connecting reviewers sharing those features to capture their similar behavior. BeGP is a semi-supervised scheme without requiring any training. Hence, it starts with a small subgraph of labeled spammers and afterwards iteratively expands by conducting connected other users as a resulted set of suspects. We demonstrate the effectiveness of BeGP on two real-world review datasets from Yelp.com. The result shows that it outperforms several state-of-the-art methods with accurately identifying spammers as well as review spams within the k-first order of ranking.\",\"PeriodicalId\":419017,\"journal\":{\"name\":\"Proceedings of the 2019 3rd International Conference on Intelligent Systems, Metaheuristics & Swarm Intelligence\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-03-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2019 3rd International Conference on Intelligent Systems, Metaheuristics & Swarm Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3325773.3325783\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 3rd International Conference on Intelligent Systems, Metaheuristics & Swarm Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3325773.3325783","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

在线评论是用户意见的重要来源,不仅可以帮助其他客户做出决定,还可以帮助制造商提高产品或服务的质量。由于商业原因,不真实的评论(垃圾邮件),以促进或贬低某些产品,而不是他们应得的已经成为一个关键问题。虽然现有的监督方法通过统计学习显示了垃圾邮件检测的有效性,但它们需要花费昂贵的成本来标记训练数据。在本文中,我们提出了一种用于意见垃圾检测的图分区方法——BeGP。首先提取一组特征特征,并通过连接共享这些特征的评论者来捕获他们的相似行为来构建用户行为图。BeGP是一种不需要任何训练的半监督方案。因此,它从标记的垃圾邮件发送者的一个小子图开始,然后通过将连接的其他用户作为嫌疑人的结果集进行迭代扩展。我们在来自Yelp.com的两个真实世界的评论数据集上展示了BeGP的有效性。结果表明,它在准确识别垃圾邮件发送者以及在排名的第k个顺序内审查垃圾邮件方面优于几种最先进的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Opinion Spam Detection through User Behavioral Graph Partitioning Approach
Online reviews, an important source of user opinions, help not only other customers to make a decision but also manufacturers to improve quality of their products or services. Due to commercial reasons, untruthful reviews (spam) written to promote or demote certain products rather than they deserve have become a crucial problem. Although existing supervised approaches have shown the effectiveness of spam detection by using statistical learning, they require much expensive cost for labeling the training data. In this paper, we present BeGP that is a graph-partitioned approach for opinion spam detection. A set of characteristic features is first extracted and a user behavioral graph is constructed by connecting reviewers sharing those features to capture their similar behavior. BeGP is a semi-supervised scheme without requiring any training. Hence, it starts with a small subgraph of labeled spammers and afterwards iteratively expands by conducting connected other users as a resulted set of suspects. We demonstrate the effectiveness of BeGP on two real-world review datasets from Yelp.com. The result shows that it outperforms several state-of-the-art methods with accurately identifying spammers as well as review spams within the k-first order of ranking.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
An Online Password Guessing Method Based on Big Data Feature-Weighted Fuzzy K-Modes Clustering Epilepsy Detection in EEG Signal using Recurrent Neural Network Analysis of Ant Colony Optimization on a Dynamically Changing Optical Burst Switched Network with Impairments Gaussian Process Dynamical Autoencoder Model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1