垃圾邮件过滤器性能和抗攻击鲁棒性的实验评估

2005 International Conference on Collaborative Computing: Networking, Applications and Worksharing Pub Date : 2005-12-19 DOI:10.1109/COLCOM.2005.1651219

Steve Webb, Subramanyam Chitti, C. Pu

{"title":"垃圾邮件过滤器性能和抗攻击鲁棒性的实验评估","authors":"Steve Webb, Subramanyam Chitti, C. Pu","doi":"10.1109/COLCOM.2005.1651219","DOIUrl":null,"url":null,"abstract":"In this paper, we show experimentally that learning filters are able to classify large corpora of spam and legitimate email messages with a high degree of accuracy. The corpora in our experiments contain about half a million spam messages and a similar number of legitimate messages, making them two orders of magnitude larger than the corpora used in current research. The use of such large corpora represents a collaborative approach to spam filtering because the corpora combine spam and legitimate messages from many different sources. First, we show that this collaborative approach creates very accurate spam filters. Then, we introduce an effective attack against these filters which successfully degrades their ability to classify spam. Finally, we present an effective solution to the above attack which involves retraining the filters to accurately identify the attack messages","PeriodicalId":365186,"journal":{"name":"2005 International Conference on Collaborative Computing: Networking, Applications and Worksharing","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":"{\"title\":\"An experimental evaluation of spam filter performance and robustness against attack\",\"authors\":\"Steve Webb, Subramanyam Chitti, C. Pu\",\"doi\":\"10.1109/COLCOM.2005.1651219\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we show experimentally that learning filters are able to classify large corpora of spam and legitimate email messages with a high degree of accuracy. The corpora in our experiments contain about half a million spam messages and a similar number of legitimate messages, making them two orders of magnitude larger than the corpora used in current research. The use of such large corpora represents a collaborative approach to spam filtering because the corpora combine spam and legitimate messages from many different sources. First, we show that this collaborative approach creates very accurate spam filters. Then, we introduce an effective attack against these filters which successfully degrades their ability to classify spam. Finally, we present an effective solution to the above attack which involves retraining the filters to accurately identify the attack messages\",\"PeriodicalId\":365186,\"journal\":{\"name\":\"2005 International Conference on Collaborative Computing: Networking, Applications and Worksharing\",\"volume\":\"74 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2005-12-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"22\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2005 International Conference on Collaborative Computing: Networking, Applications and Worksharing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/COLCOM.2005.1651219\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 International Conference on Collaborative Computing: Networking, Applications and Worksharing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COLCOM.2005.1651219","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 22

摘要

在本文中，我们通过实验证明，学习过滤器能够以很高的准确率对大量垃圾邮件和合法电子邮件信息进行分类。我们实验中的语料库包含大约50万条垃圾邮件和类似数量的合法消息，使它们比当前研究中使用的语料库大两个数量级。这种大型语料库的使用代表了垃圾邮件过滤的协作方法，因为语料库将来自许多不同来源的垃圾邮件和合法消息结合在一起。首先，我们展示了这种协作方法创建了非常精确的垃圾邮件过滤器。然后，我们引入了一种针对这些过滤器的有效攻击，成功地降低了它们对垃圾邮件的分类能力。最后，我们提出了一种有效的解决方案，即重新训练过滤器以准确识别攻击消息

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

An experimental evaluation of spam filter performance and robustness against attack

In this paper, we show experimentally that learning filters are able to classify large corpora of spam and legitimate email messages with a high degree of accuracy. The corpora in our experiments contain about half a million spam messages and a similar number of legitimate messages, making them two orders of magnitude larger than the corpora used in current research. The use of such large corpora represents a collaborative approach to spam filtering because the corpora combine spam and legitimate messages from many different sources. First, we show that this collaborative approach creates very accurate spam filters. Then, we introduce an effective attack against these filters which successfully degrades their ability to classify spam. Finally, we present an effective solution to the above attack which involves retraining the filters to accurately identify the attack messages

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2005 International Conference on Collaborative Computing: Networking, Applications and Worksharing

自引率

0.00%

发文量

期刊最新文献

A collaborative and multi-agent system for e-mail filtering and classification Collaborative development of business applications Impact of sniffer deployment on indoor localization Localization of sensor networks considering energy accuracy tradeoffs Symbiotic multi-path routing with attractor selection