Feature selection for fuzzy classifier using the spider monkey algorithm

IF 0.6 Q4 BUSINESS Biznes Informatika-Business Informatics Pub Date : 2019-06-30 DOI:10.17323/1998-0663.2019.2.29.42

I. Hodashinsky, M. Nemirovich-Danchenko, S. Samsonov

{"title":"Feature selection for fuzzy classifier using the spider monkey algorithm","authors":"I. Hodashinsky, M. Nemirovich-Danchenko, S. Samsonov","doi":"10.17323/1998-0663.2019.2.29.42","DOIUrl":null,"url":null,"abstract":"In this paper, we discuss the construction of fuzzy classifiers by dividing the task into the three following stages: the generation of a fuzzy rule base, the selection of relevant features, and the parameter optimization of membership functions for fuzzy rules. The structure of the fuzzy classifier is generated by forming the fuzzy rule base with use of the minimum and maximum feature values in each class. This allows us to generate the rule base with the minimum number of rules, which corresponds to the number of class labels in the dataset to be classified. Feature selection is carried out by a binary spider monkey optimization (BSMO) algorithm, which is a wrapper method. As a data preprocessing procedure, feature selection not only improves the efficiency of training algorithms but also enhances their generalization capability. In the process of feature selection, we investigate the dynamics of changes in classification accuracy, iteration by iteration, for various parameter values of the binary algorithm and analyze the effect of its parameters on its convergence rate. The parameter optimization of fuzzy rule antecedents uses another spider monkey optimization (SMO) algorithm that processes continuous numerical data. The performance of the fuzzy classifiers based on the rules and features selected by these algorithms is tested on some datasets from the KEEL repository. Comparison with two competitor algorithms on the same datasets is carried out. It is shown that fuzzy classifiers with the minimum number of rules and a significantly reduced number of features can be developed with their accuracy being statistically similar to that of the competitor classifiers.","PeriodicalId":41920,"journal":{"name":"Biznes Informatika-Business Informatics","volume":" ","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2019-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biznes Informatika-Business Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17323/1998-0663.2019.2.29.42","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BUSINESS","Score":null,"Total":0}

引用次数: 4

Abstract

In this paper, we discuss the construction of fuzzy classifiers by dividing the task into the three following stages: the generation of a fuzzy rule base, the selection of relevant features, and the parameter optimization of membership functions for fuzzy rules. The structure of the fuzzy classifier is generated by forming the fuzzy rule base with use of the minimum and maximum feature values in each class. This allows us to generate the rule base with the minimum number of rules, which corresponds to the number of class labels in the dataset to be classified. Feature selection is carried out by a binary spider monkey optimization (BSMO) algorithm, which is a wrapper method. As a data preprocessing procedure, feature selection not only improves the efficiency of training algorithms but also enhances their generalization capability. In the process of feature selection, we investigate the dynamics of changes in classification accuracy, iteration by iteration, for various parameter values of the binary algorithm and analyze the effect of its parameters on its convergence rate. The parameter optimization of fuzzy rule antecedents uses another spider monkey optimization (SMO) algorithm that processes continuous numerical data. The performance of the fuzzy classifiers based on the rules and features selected by these algorithms is tested on some datasets from the KEEL repository. Comparison with two competitor algorithms on the same datasets is carried out. It is shown that fuzzy classifiers with the minimum number of rules and a significantly reduced number of features can be developed with their accuracy being statistically similar to that of the competitor classifiers.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于蜘蛛猴算法的模糊分类器特征选择

在本文中，我们通过将任务划分为以下三个阶段来讨论模糊分类器的构建：模糊规则库的生成、相关特征的选择以及模糊规则隶属函数的参数优化。模糊分类器的结构是通过使用每个类别中的最小和最大特征值形成模糊规则库来生成的。这使我们能够生成具有最小规则数的规则库，该规则数对应于要分类的数据集中的类标签数。特征选择是通过二元蜘蛛猴优化（BSMO）算法进行的，这是一种包装方法。特征选择作为一种数据预处理过程，不仅提高了训练算法的效率，而且增强了算法的泛化能力。在特征选择过程中，我们研究了二进制算法的各种参数值的分类精度的动态变化，一次又一次的迭代，并分析了其参数对其收敛速度的影响。模糊规则前因的参数优化使用了另一种处理连续数值数据的蜘蛛猴优化（SMO）算法。基于这些算法选择的规则和特征的模糊分类器的性能在KEEL存储库的一些数据集上进行了测试。在相同的数据集上与两种竞争对手的算法进行了比较。结果表明，可以开发出规则数量最少、特征数量显著减少的模糊分类器，其精度在统计上与竞争对手的分类器相似。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Biznes Informatika-Business Informatics BUSINESS-

自引率

33.30%

发文量