{"title":"Comparative Study on Different Classification Techniques for Spam Dataset","authors":"S. Elhamayed, Cairo Egypt Eri","doi":"10.17706/IJCCE.2018.7.4.189-194","DOIUrl":null,"url":null,"abstract":"Nowadays, people and companies use emails for information exchange, email messages, and etc., because they are the fastest and the cheapest way. The main problem that faces email messages is the undesirable emails which known as spams. Spams may cause overflow the internet with considerable copies of the same message or carry malicious content that harms user system and reduce the performance. The purpose of this work is to make a comparative study of several classification techniques on the basis of their performance parameters using spam dataset. The performance of the different classifiers is measured with different ratio of the testing and training dataset. Also, the performance of the classifiers is calculated with and without low variance filter. By applying the low variance filter the accuracy of the KNN classifier is enhanced with about 9% while the accuracy of the other classifier is decreased.","PeriodicalId":23787,"journal":{"name":"World Academy of Science, Engineering and Technology, International Journal of Electrical, Computer, Energetic, Electronic and Communication Engineering","volume":"27 1","pages":"189-194"},"PeriodicalIF":0.0000,"publicationDate":"2018-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Academy of Science, Engineering and Technology, International Journal of Electrical, Computer, Energetic, Electronic and Communication Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17706/IJCCE.2018.7.4.189-194","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Nowadays, people and companies use emails for information exchange, email messages, and etc., because they are the fastest and the cheapest way. The main problem that faces email messages is the undesirable emails which known as spams. Spams may cause overflow the internet with considerable copies of the same message or carry malicious content that harms user system and reduce the performance. The purpose of this work is to make a comparative study of several classification techniques on the basis of their performance parameters using spam dataset. The performance of the different classifiers is measured with different ratio of the testing and training dataset. Also, the performance of the classifiers is calculated with and without low variance filter. By applying the low variance filter the accuracy of the KNN classifier is enhanced with about 9% while the accuracy of the other classifier is decreased.