R. Wahyudi, Hendra Marcos, U. Hasanah, Bambang Pilu Hartato, Tri Astuti, Rizal Anjas Prasetyo
{"title":"Algorithm Evaluation for Classification “Phishing Website” Using Several Classification Algorithms","authors":"R. Wahyudi, Hendra Marcos, U. Hasanah, Bambang Pilu Hartato, Tri Astuti, Rizal Anjas Prasetyo","doi":"10.1109/ICITISEE.2018.8720975","DOIUrl":null,"url":null,"abstract":"Phishing websites are a fooling technique by making victims as if they are accessing legitimate sites. Data mining is a technique for extracting hidden information in order to benefit more from existing data. Data mining is the process of discovering regularity, patterns, and relationships in large datasets. In this study, data mining will be used to determine the effect of feature selection on algorithm C4.5 and CART on phishing website dataset. From the tests that have been done the effect of feature selection on the phishing website, dataset proved to overcome the longer computational time. From the performance measurement of both algorithms that have been done, CART algorithm has a higher accuracy value than the algorithm C4.5 with an accuracy of 94.4%, while the algorithm C4.5 has an accuracy of 94.3%, so it can be concluded that CART algorithm has better performance value compared with the C4.5 algorithm.","PeriodicalId":180051,"journal":{"name":"2018 3rd International Conference on Information Technology, Information System and Electrical Engineering (ICITISEE)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 3rd International Conference on Information Technology, Information System and Electrical Engineering (ICITISEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITISEE.2018.8720975","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Phishing websites are a fooling technique by making victims as if they are accessing legitimate sites. Data mining is a technique for extracting hidden information in order to benefit more from existing data. Data mining is the process of discovering regularity, patterns, and relationships in large datasets. In this study, data mining will be used to determine the effect of feature selection on algorithm C4.5 and CART on phishing website dataset. From the tests that have been done the effect of feature selection on the phishing website, dataset proved to overcome the longer computational time. From the performance measurement of both algorithms that have been done, CART algorithm has a higher accuracy value than the algorithm C4.5 with an accuracy of 94.4%, while the algorithm C4.5 has an accuracy of 94.3%, so it can be concluded that CART algorithm has better performance value compared with the C4.5 algorithm.