{"title":"基于朴素贝叶斯算法粒子群优化的电子政务投诉文本分类","authors":"T. Hariguna, Sarmini, A. Hananto","doi":"10.1109/CyberneticsCom55287.2022.9865585","DOIUrl":null,"url":null,"abstract":"A web-based online complaint portal is one of the e-government public services. The complaint's content must be categorised in order for it to be transmitted to the appropriate agency swiftly and properly. The most often used standard classification algorithms are the Naive Bayes Classifier (NBC) and k-Nearest Neighbor (k-NN), both of which classify just one label and must be tuned. The purpose of this project is to categorize complaint messages that include several labels simultaneously using NBC tuned for particle swarm optimization (PSO). The data source is the Open Data Jakarta and is partitioned into 70% training data and 30% test data for classification into seven labels. The NBC and k-NN algorithms are used to compare PSO's optimization performance. Cross-validation ten times revealed that optimizing NBC with PSO obtained an accuracy of 88.16%, much superior than k-NN at 83% and NBC at 70.57%. This optimization approach may be used to improve the efficacy of community-based e-government services.","PeriodicalId":178279,"journal":{"name":"2022 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"E-government Public Complaints Text Classification Using Particle Swarm Optimization in Naive Bayes Algorithm\",\"authors\":\"T. Hariguna, Sarmini, A. Hananto\",\"doi\":\"10.1109/CyberneticsCom55287.2022.9865585\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A web-based online complaint portal is one of the e-government public services. The complaint's content must be categorised in order for it to be transmitted to the appropriate agency swiftly and properly. The most often used standard classification algorithms are the Naive Bayes Classifier (NBC) and k-Nearest Neighbor (k-NN), both of which classify just one label and must be tuned. The purpose of this project is to categorize complaint messages that include several labels simultaneously using NBC tuned for particle swarm optimization (PSO). The data source is the Open Data Jakarta and is partitioned into 70% training data and 30% test data for classification into seven labels. The NBC and k-NN algorithms are used to compare PSO's optimization performance. Cross-validation ten times revealed that optimizing NBC with PSO obtained an accuracy of 88.16%, much superior than k-NN at 83% and NBC at 70.57%. This optimization approach may be used to improve the efficacy of community-based e-government services.\",\"PeriodicalId\":178279,\"journal\":{\"name\":\"2022 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom)\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CyberneticsCom55287.2022.9865585\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CyberneticsCom55287.2022.9865585","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
基于网络的网上投诉门户是电子政务公共服务的一种。投诉的内容必须分类,以便迅速和适当地转交给适当的机构。最常用的标准分类算法是朴素贝叶斯分类器(NBC)和k-最近邻分类器(k-NN),这两种算法都只分类一个标签,并且必须进行调优。本项目的目的是对同时包含多个标签的投诉消息进行分类,使用针对粒子群优化(PSO)进行调优的NBC。数据源为Open data Jakarta,将其划分为70%的训练数据和30%的测试数据,分类为7个标签。用NBC算法和k-NN算法比较了粒子群算法的优化性能。10次交叉验证表明,PSO优化NBC的准确率为88.16%,明显优于k-NN的83%和NBC的70.57%。这种优化方法可用于提高基于社区的电子政务服务的效率。
E-government Public Complaints Text Classification Using Particle Swarm Optimization in Naive Bayes Algorithm
A web-based online complaint portal is one of the e-government public services. The complaint's content must be categorised in order for it to be transmitted to the appropriate agency swiftly and properly. The most often used standard classification algorithms are the Naive Bayes Classifier (NBC) and k-Nearest Neighbor (k-NN), both of which classify just one label and must be tuned. The purpose of this project is to categorize complaint messages that include several labels simultaneously using NBC tuned for particle swarm optimization (PSO). The data source is the Open Data Jakarta and is partitioned into 70% training data and 30% test data for classification into seven labels. The NBC and k-NN algorithms are used to compare PSO's optimization performance. Cross-validation ten times revealed that optimizing NBC with PSO obtained an accuracy of 88.16%, much superior than k-NN at 83% and NBC at 70.57%. This optimization approach may be used to improve the efficacy of community-based e-government services.