An Enhanced Clustering of High Dimensional Datasets Using Unsupervised Quick Reduct Algorithm (USQR) With Rough Set Theory

2014 World Congress on Computing and Communication Technologies Pub Date : 2014-04-03 DOI:10.1109/WCCCT.2014.55

P. Gomathi, S. Dhanabal, V. K. Kaliappan

引用次数: 0

Abstract

The performance of K-means clustering algorithm is poor for high dimensions data set. The goal of this paper is to reduce the high dimensional data to a meaningful low dimensional data representation, so that the efficiency of clustering algorithm will be elevated. Hence to improve the efficiency of clustering analysis, unsupervised quick reduct algorithm (USQR) is used for selecting the features from high dimensional data. Then the selected features are used to find the initial centroid using k-MAM initialization technique for k-means. The initial centroids are finally used to find the clusters. The results are compared to k-means and k-MAM with USQR so that outperforms well, in terms of accuracy and number of iterations compared to the k-means, for high dimensional data.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于粗糙集理论的无监督快速约简算法增强高维数据集聚类

对于高维数据集，k均值聚类算法的性能较差。本文的目标是将高维数据简化为有意义的低维数据表示，从而提高聚类算法的效率。因此，为了提高聚类分析的效率，采用无监督快速约简算法(USQR)从高维数据中选择特征。然后利用k-均值的k-MAM初始化技术，利用所选特征找到初始质心。最后使用初始质心来找到簇。将结果与USQR的k-means和k-MAM进行比较，以便在高维数据方面，与k-means相比，在精度和迭代次数方面表现出色。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 World Congress on Computing and Communication Technologies

自引率

0.00%

发文量