Spam Detection Based on Nearest Community Classifier

2015 International Conference on Intelligent Networking and Collaborative Systems Pub Date : 2015-09-02 DOI:10.1109/INCoS.2015.75

Michal Prilepok, M. Kudelka

引用次数: 3

Abstract

Undesirable emails (spam) are increasingly becoming a big problem nowadays, not only for users, but also for Internet service providers. Therefore, the design of new algorithms detecting the spam is currently one of the research hot-topics. We define two requirements and use them simultaneously. The first requirement is a low rate of falsely detected emails which has an impact on the algorithm performance. The second requirement is a fast detection of spams. It minimizes the delay in receiving emails. In this paper, we focus our effort on the first requirement. To solve this problem we applied network community analysis. The approach is to find communities - groups of same emails. In this paper, we present a new nearest community classifier and apply it in the field of spam detection. The obtained results are very close to Bayesian Spam Filter. We achieved 93.78% accuracy. The algorithm can detect 80.72% of spam emails and 98.01% non-spam emails.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于最近社区分类器的垃圾邮件检测

不受欢迎的电子邮件(垃圾邮件)如今日益成为一个大问题，不仅对用户，而且对互联网服务提供商。因此，设计新的垃圾邮件检测算法是当前的研究热点之一。我们定义两个需求并同时使用它们。第一个要求是低误检率的邮件，这会影响算法的性能。第二个要求是快速检测垃圾邮件。它最大限度地减少了接收电子邮件的延迟。在本文中，我们将重点放在第一个需求上。为了解决这个问题，我们应用了网络社区分析。方法是找到社区——由相同的电子邮件组成的群体。本文提出了一种新的最近社区分类器，并将其应用于垃圾邮件检测领域。所得结果与贝叶斯垃圾邮件过滤器非常接近。准确率达到93.78%。该算法能检测出80.72%的垃圾邮件和98.01%的非垃圾邮件。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2015 International Conference on Intelligent Networking and Collaborative Systems

自引率

0.00%

发文量