{"title":"基于粒子群优化的分类特征选择","authors":"Lucija Brezočnik","doi":"10.1109/EUROCON.2017.8011255","DOIUrl":null,"url":null,"abstract":"This paper proposes a method for the problem of processing high-dimensional data. When one has thousands of features (attributes) in a dataset, it is hard to achieve an efficient feature selection. To cope with this problem, we propose the use of a binary particle swarm optimization algorithm combined with the C4.5 as a classifier in the fitness function for the selection of informative attributes. The results obtained on 11 datasets were analyzed statistically and reveal that the proposed method, called BPSO+C4.5, outperforms known classifiers, i.e., C4.5, Naive Bayes, and SVM.","PeriodicalId":114100,"journal":{"name":"IEEE EUROCON 2017 -17th International Conference on Smart Technologies","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"Feature selection for classification using particle swarm optimization\",\"authors\":\"Lucija Brezočnik\",\"doi\":\"10.1109/EUROCON.2017.8011255\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a method for the problem of processing high-dimensional data. When one has thousands of features (attributes) in a dataset, it is hard to achieve an efficient feature selection. To cope with this problem, we propose the use of a binary particle swarm optimization algorithm combined with the C4.5 as a classifier in the fitness function for the selection of informative attributes. The results obtained on 11 datasets were analyzed statistically and reveal that the proposed method, called BPSO+C4.5, outperforms known classifiers, i.e., C4.5, Naive Bayes, and SVM.\",\"PeriodicalId\":114100,\"journal\":{\"name\":\"IEEE EUROCON 2017 -17th International Conference on Smart Technologies\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE EUROCON 2017 -17th International Conference on Smart Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EUROCON.2017.8011255\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE EUROCON 2017 -17th International Conference on Smart Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EUROCON.2017.8011255","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Feature selection for classification using particle swarm optimization
This paper proposes a method for the problem of processing high-dimensional data. When one has thousands of features (attributes) in a dataset, it is hard to achieve an efficient feature selection. To cope with this problem, we propose the use of a binary particle swarm optimization algorithm combined with the C4.5 as a classifier in the fitness function for the selection of informative attributes. The results obtained on 11 datasets were analyzed statistically and reveal that the proposed method, called BPSO+C4.5, outperforms known classifiers, i.e., C4.5, Naive Bayes, and SVM.