{"title":"An Enhanced Clustering of High Dimensional Datasets Using Unsupervised Quick Reduct Algorithm (USQR) With Rough Set Theory","authors":"P. Gomathi, S. Dhanabal, V. K. Kaliappan","doi":"10.1109/WCCCT.2014.55","DOIUrl":null,"url":null,"abstract":"The performance of K-means clustering algorithm is poor for high dimensions data set. The goal of this paper is to reduce the high dimensional data to a meaningful low dimensional data representation, so that the efficiency of clustering algorithm will be elevated. Hence to improve the efficiency of clustering analysis, unsupervised quick reduct algorithm (USQR) is used for selecting the features from high dimensional data. Then the selected features are used to find the initial centroid using k-MAM initialization technique for k-means. The initial centroids are finally used to find the clusters. The results are compared to k-means and k-MAM with USQR so that outperforms well, in terms of accuracy and number of iterations compared to the k-means, for high dimensional data.","PeriodicalId":421793,"journal":{"name":"2014 World Congress on Computing and Communication Technologies","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 World Congress on Computing and Communication Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCCCT.2014.55","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The performance of K-means clustering algorithm is poor for high dimensions data set. The goal of this paper is to reduce the high dimensional data to a meaningful low dimensional data representation, so that the efficiency of clustering algorithm will be elevated. Hence to improve the efficiency of clustering analysis, unsupervised quick reduct algorithm (USQR) is used for selecting the features from high dimensional data. Then the selected features are used to find the initial centroid using k-MAM initialization technique for k-means. The initial centroids are finally used to find the clusters. The results are compared to k-means and k-MAM with USQR so that outperforms well, in terms of accuracy and number of iterations compared to the k-means, for high dimensional data.