{"title":"A Hybrid Method of Unsupervised Feature Selection Based on Ranking","authors":"Yun Li, Bao-Liang Lu, Zhong-fu Wu","doi":"10.1109/ICPR.2006.84","DOIUrl":null,"url":null,"abstract":"Feature selection is a key problem to pattern recognition. So far, most methods of feature selection focus on sample data where class information is available. For sample data without class labels, however, the related methods for feature selection are few. This paper proposes a new way of unsupervised feature selection. Our method is a hybrid approach based on ranking the features according to their relevance to clustering using a new ranking index which belongs to exponential entropy. Firstly a candidate feature subset is selected using a modified fuzzy feature evaluation index (FFEI) with a new method to calculate the feature weight, which makes the algorithm to be robust and independent of domain knowledge. Then a wrapper method is used to select compact feature subset from the candidate feature set based on the clustering performance. Experimental results on benchmark data sets indicate the effectiveness of the proposed method","PeriodicalId":236033,"journal":{"name":"18th International Conference on Pattern Recognition (ICPR'06)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"18th International Conference on Pattern Recognition (ICPR'06)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPR.2006.84","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 26
Abstract
Feature selection is a key problem to pattern recognition. So far, most methods of feature selection focus on sample data where class information is available. For sample data without class labels, however, the related methods for feature selection are few. This paper proposes a new way of unsupervised feature selection. Our method is a hybrid approach based on ranking the features according to their relevance to clustering using a new ranking index which belongs to exponential entropy. Firstly a candidate feature subset is selected using a modified fuzzy feature evaluation index (FFEI) with a new method to calculate the feature weight, which makes the algorithm to be robust and independent of domain knowledge. Then a wrapper method is used to select compact feature subset from the candidate feature set based on the clustering performance. Experimental results on benchmark data sets indicate the effectiveness of the proposed method