{"title":"PERBANDINGAN KINERJA VARIASI NAÏVE BAYES MULTIVARIATE BERNOULLI DAN NAÏVE BAYES MULTINOMIAL DALAM PENGKLASIFIKASIAN DOKUMEN TEKS","authors":"Widyawati Widyawati, S. Sutanto","doi":"10.47080/IFTECH.V2I1.859","DOIUrl":null,"url":null,"abstract":"Classification of text documents with large amounts will be a job that requires a lot of time, effort and cost of having to read text documents and then categorize them manually, therefore the automatic classification of text documents is needed. The algorithm developed is K-Nearest-Neighbor (KNN), Naïve Bayes, Support Vector Machine (SVM), Decision Tree (DT), Neural Network (NN) and Maximum Entropy. The algorithm used as the object of research is a variation of the Naïve Bayes algorithm, the Naïve Bayes Multivariate Bernoulli and the Naïve Bayes Multinomial. This study discusses whether there are differences between the Algorithms. The Naïve Bayes Multivariate Bernoulli algorithm and the Naïve Bayes Multinomial can be seen from the value of the agreement and the speed of the process of classifying text documents, as well as more information about the process of processing requests that are getting more and more requested. While the highest value using the non-stemming Naïve Bayes Bernoulli method is 71.33%, and the fastest processing time is required using the non-stemming Naïve Bayes method which requires 0.12 seconds processing time.","PeriodicalId":117120,"journal":{"name":"Journal of Innovation And Future Technology (IFTECH)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Innovation And Future Technology (IFTECH)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47080/IFTECH.V2I1.859","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Classification of text documents with large amounts will be a job that requires a lot of time, effort and cost of having to read text documents and then categorize them manually, therefore the automatic classification of text documents is needed. The algorithm developed is K-Nearest-Neighbor (KNN), Naïve Bayes, Support Vector Machine (SVM), Decision Tree (DT), Neural Network (NN) and Maximum Entropy. The algorithm used as the object of research is a variation of the Naïve Bayes algorithm, the Naïve Bayes Multivariate Bernoulli and the Naïve Bayes Multinomial. This study discusses whether there are differences between the Algorithms. The Naïve Bayes Multivariate Bernoulli algorithm and the Naïve Bayes Multinomial can be seen from the value of the agreement and the speed of the process of classifying text documents, as well as more information about the process of processing requests that are getting more and more requested. While the highest value using the non-stemming Naïve Bayes Bernoulli method is 71.33%, and the fastest processing time is required using the non-stemming Naïve Bayes method which requires 0.12 seconds processing time.