Feature selection for fuzzy classifier using the spider monkey algorithm

I. Hodashinsky, M. Nemirovich-Danchenko, S. Samsonov
{"title":"Feature selection for fuzzy classifier using the spider monkey algorithm","authors":"I. Hodashinsky, M. Nemirovich-Danchenko, S. Samsonov","doi":"10.17323/1998-0663.2019.2.29.42","DOIUrl":null,"url":null,"abstract":"In this paper, we discuss the construction of fuzzy classifiers by dividing the task into the three following stages: the generation of a fuzzy rule base, the selection of relevant features, and the parameter optimization of membership functions for fuzzy rules. The structure of the fuzzy classifier is generated by forming the fuzzy rule base with use of the minimum and maximum feature values in each class. This allows us to generate the rule base with the minimum number of rules, which corresponds to the number of class labels in the dataset to be classified. Feature selection is carried out by a binary spider monkey optimization (BSMO) algorithm, which is a wrapper method. As a data preprocessing procedure, feature selection not only improves the efficiency of training algorithms but also enhances their generalization capability. In the process of feature selection, we investigate the dynamics of changes in classification accuracy, iteration by iteration, for various parameter values of the binary algorithm and analyze the effect of its parameters on its convergence rate. The parameter optimization of fuzzy rule antecedents uses another spider monkey optimization (SMO) algorithm that processes continuous numerical data. The performance of the fuzzy classifiers based on the rules and features selected by these algorithms is tested on some datasets from the KEEL repository. Comparison with two competitor algorithms on the same datasets is carried out. It is shown that fuzzy classifiers with the minimum number of rules and a significantly reduced number of features can be developed with their accuracy being statistically similar to that of the competitor classifiers.","PeriodicalId":41920,"journal":{"name":"Biznes Informatika-Business Informatics","volume":null,"pages":null},"PeriodicalIF":0.6000,"publicationDate":"2019-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biznes Informatika-Business Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17323/1998-0663.2019.2.29.42","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BUSINESS","Score":null,"Total":0}
引用次数: 4

Abstract

In this paper, we discuss the construction of fuzzy classifiers by dividing the task into the three following stages: the generation of a fuzzy rule base, the selection of relevant features, and the parameter optimization of membership functions for fuzzy rules. The structure of the fuzzy classifier is generated by forming the fuzzy rule base with use of the minimum and maximum feature values in each class. This allows us to generate the rule base with the minimum number of rules, which corresponds to the number of class labels in the dataset to be classified. Feature selection is carried out by a binary spider monkey optimization (BSMO) algorithm, which is a wrapper method. As a data preprocessing procedure, feature selection not only improves the efficiency of training algorithms but also enhances their generalization capability. In the process of feature selection, we investigate the dynamics of changes in classification accuracy, iteration by iteration, for various parameter values of the binary algorithm and analyze the effect of its parameters on its convergence rate. The parameter optimization of fuzzy rule antecedents uses another spider monkey optimization (SMO) algorithm that processes continuous numerical data. The performance of the fuzzy classifiers based on the rules and features selected by these algorithms is tested on some datasets from the KEEL repository. Comparison with two competitor algorithms on the same datasets is carried out. It is shown that fuzzy classifiers with the minimum number of rules and a significantly reduced number of features can be developed with their accuracy being statistically similar to that of the competitor classifiers.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于蜘蛛猴算法的模糊分类器特征选择
在本文中,我们通过将任务划分为以下三个阶段来讨论模糊分类器的构建:模糊规则库的生成、相关特征的选择以及模糊规则隶属函数的参数优化。模糊分类器的结构是通过使用每个类别中的最小和最大特征值形成模糊规则库来生成的。这使我们能够生成具有最小规则数的规则库,该规则数对应于要分类的数据集中的类标签数。特征选择是通过二元蜘蛛猴优化(BSMO)算法进行的,这是一种包装方法。特征选择作为一种数据预处理过程,不仅提高了训练算法的效率,而且增强了算法的泛化能力。在特征选择过程中,我们研究了二进制算法的各种参数值的分类精度的动态变化,一次又一次的迭代,并分析了其参数对其收敛速度的影响。模糊规则前因的参数优化使用了另一种处理连续数值数据的蜘蛛猴优化(SMO)算法。基于这些算法选择的规则和特征的模糊分类器的性能在KEEL存储库的一些数据集上进行了测试。在相同的数据集上与两种竞争对手的算法进行了比较。结果表明,可以开发出规则数量最少、特征数量显著减少的模糊分类器,其精度在统计上与竞争对手的分类器相似。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
33.30%
发文量
0
期刊最新文献
Modeling and optimization of strategies for making individual decisions in multi-agent socio-economic systems with the use of machine learning An intelligent method for generating a list of job profile requirements based on neural network language models using ESCO taxonomy and online job corpus Decision support technology for a seller on a marketplace in a competitive environment The present and future of the digital transformation of real estate: A systematic review of smart real estate A knowledge management system in the strategic development of universities
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1