Using fuzzy undersampling and fuzzy PCA to improve imbalanced classification through Rotation Forest algorithm

M. Hosseinzadeh, M. Eftekhari
DOI: 10.1109/CSICSSE.2015.7369242 (https://doi.org/10.1109/CSICSSE.2015.7369242)
Published in: 2015 International Symposium on Computer Science and Software Engineering (CSSE), 2015-08-01
Citations: 4

Abstract

This paper proposes a novel undersampling method that reduces the imbalance ratio of a dataset using fuzzy membership degrees, together with a new fuzzy principal component analysis (F-PCA) for classification with the Rotation Forest algorithm. In the undersampling phase, two membership functions are first defined on each feature (dimension): one represents the minority concept and the other the majority concept. Each data sample then receives a score based on its membership degrees in each dimension of the feature space. Majority samples with the highest scores are the best candidates for removal. During the training phase of Rotation Forest, F-PCA is applied to the fuzzified sample values produced in the undersampling phase. These values are also used to build the base classifiers of the ensemble. The results show that the proposed method is efficient and performs notably well compared with other state-of-the-art algorithms for the class imbalance problem.
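The undersampling phase described above can be illustrated with a minimal sketch. The abstract does not give the shape of the membership functions or the scoring formula, so everything specific here is an assumption: Gaussian membership functions centred on each class's per-feature mean, and a score equal to the mean over features of (minority membership minus majority membership). The names `gaussian_membership` and `fuzzy_undersample` are hypothetical and do not come from the paper.

```python
import numpy as np


def gaussian_membership(x, center, spread):
    """Membership degree in (0, 1] of x to a concept centred at `center`.

    ASSUMPTION: the Gaussian shape is illustrative; the abstract does not
    say which membership function the authors use.
    """
    return np.exp(-0.5 * ((x - center) / spread) ** 2)


def fuzzy_undersample(X, y, minority_label, n_remove):
    """Remove the `n_remove` highest-scoring majority samples.

    Returns the reduced X, the reduced y, and the indices that were dropped.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    is_min = y == minority_label
    eps = 1e-12  # avoids division by zero for constant features

    # Two membership functions per feature: one for the minority concept
    # and one for the majority concept.
    min_center = X[is_min].mean(axis=0)
    maj_center = X[~is_min].mean(axis=0)
    min_spread = X[is_min].std(axis=0) + eps
    maj_spread = X[~is_min].std(axis=0) + eps

    maj_idx = np.flatnonzero(~is_min)
    X_maj = X[maj_idx]
    mu_min = gaussian_membership(X_maj, min_center, min_spread)
    mu_maj = gaussian_membership(X_maj, maj_center, maj_spread)

    # ASSUMPTION: hypothetical score. A majority sample scores high when,
    # averaged over all features, it looks more like the minority concept
    # than like the majority concept.
    score = (mu_min - mu_maj).mean(axis=1)

    # Majority samples with the highest scores are removed first.
    drop = maj_idx[np.argsort(score)[::-1][:n_remove]]
    keep = np.setdiff1d(np.arange(len(y)), drop)
    return X[keep], y[keep], drop
```

Calling `fuzzy_undersample(X, y, minority_label=1, n_remove=k)` leaves every minority sample in place and drops `k` majority samples. The membership arrays `mu_min` and `mu_maj` are the kind of fuzzified values that the later F-PCA step would operate on, although this sketch does not implement F-PCA or Rotation Forest.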