{"title":"Using fuzzy undersampling and fuzzy PCA to improve imbalanced classification through Rotation Forest algorithm","authors":"M. Hosseinzadeh, M. Eftekhari","doi":"10.1109/CSICSSE.2015.7369242","DOIUrl":null,"url":null,"abstract":"This paper proposed a novel undersampling method to reduce the imbalance ratio of a dataset using fuzzy memberships degrees as well as utilizing a new fuzzy principal components analysis (F-PCA) for the classification through Rotation Forest algorithm. In the undersampling phase, first two membership functions are defined on each feature (dimension); one indicates the minority concept and the other shows majority concept. After that, each data sample receives a score based on its membership degrees in each dimension of the feature space. Majority samples with the highest scores are the best candidates of removal. Then during the Rotation Forest algorithm's train phase, a fuzzy Principal Component Analysis (F-PCA) is applied on the fuzzified values of samples which are produced in the undersampling phase. Moreover, these values are used to build the base classifiers of the ensemble. The obtained results illustrate the efficiency and noteworthy high performance of our proposed method comparing to the other state-of-the-art algorithms for class imbalance problem.","PeriodicalId":115653,"journal":{"name":"2015 International Symposium on Computer Science and Software Engineering (CSSE)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Symposium on Computer Science and Software Engineering (CSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSICSSE.2015.7369242","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
This paper proposed a novel undersampling method to reduce the imbalance ratio of a dataset using fuzzy memberships degrees as well as utilizing a new fuzzy principal components analysis (F-PCA) for the classification through Rotation Forest algorithm. In the undersampling phase, first two membership functions are defined on each feature (dimension); one indicates the minority concept and the other shows majority concept. After that, each data sample receives a score based on its membership degrees in each dimension of the feature space. Majority samples with the highest scores are the best candidates of removal. Then during the Rotation Forest algorithm's train phase, a fuzzy Principal Component Analysis (F-PCA) is applied on the fuzzified values of samples which are produced in the undersampling phase. Moreover, these values are used to build the base classifiers of the ensemble. The obtained results illustrate the efficiency and noteworthy high performance of our proposed method comparing to the other state-of-the-art algorithms for class imbalance problem.