一种基于最近邻方法的聚类框架

2017 International Conference on Computing, Communication and Automation (ICCCA) Pub Date : 2017-05-01 DOI:10.1109/CCAA.2017.8229793

Suvendu Kanungo, A. Shukla

{"title":"一种基于最近邻方法的聚类框架","authors":"Suvendu Kanungo, A. Shukla","doi":"10.1109/CCAA.2017.8229793","DOIUrl":null,"url":null,"abstract":"In this digital world, we are facing the flood of data, but depriving for knowledge. The eminent need of mining is useful to extract the hidden pattern from the wide availability of vast amount of data. Clustering is one such useful mining tool to handle this unfavorable situation by carrying out crucial steps refers as cluster analysis. It is the process of a grouping of patterns into clusters based on similarity. Partition based clustering algorithms are widely accepted for much diverse application such as pattern analysis, image segmentation, identification system. Among the different variations of the partition based clustering, due to its monotony and ease of implementation K-means algorithm gained a lot of attraction in the various field of research. A severe problem associated with the algorithm is that it is highly sophisticated while selecting the initial centroid and may converge to a local optimum solution of the criterion function value if the initial centroid is not chosen accurately. Additionally, it requires the prior information regarding a number of clusters to be formed and the computation of K-means are expensive. K-means algorithm is a two-step process includes initialization and assignment step. This paper works on initialization step of the algorithm and proposed an efficient enhanced K-means clustering algorithm which eliminates the deficiency of the existing one. A new initialization approach has been introduced in the paper to drawn an initial cluster centers for K means Algorithm. The paper also compares proposed technique with K-means technique.","PeriodicalId":6627,"journal":{"name":"2017 International Conference on Computing, Communication and Automation (ICCCA)","volume":"21 1","pages":"164-169"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A novel clustering framework using farthest neighbour approach\",\"authors\":\"Suvendu Kanungo, A. Shukla\",\"doi\":\"10.1109/CCAA.2017.8229793\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this digital world, we are facing the flood of data, but depriving for knowledge. The eminent need of mining is useful to extract the hidden pattern from the wide availability of vast amount of data. Clustering is one such useful mining tool to handle this unfavorable situation by carrying out crucial steps refers as cluster analysis. It is the process of a grouping of patterns into clusters based on similarity. Partition based clustering algorithms are widely accepted for much diverse application such as pattern analysis, image segmentation, identification system. Among the different variations of the partition based clustering, due to its monotony and ease of implementation K-means algorithm gained a lot of attraction in the various field of research. A severe problem associated with the algorithm is that it is highly sophisticated while selecting the initial centroid and may converge to a local optimum solution of the criterion function value if the initial centroid is not chosen accurately. Additionally, it requires the prior information regarding a number of clusters to be formed and the computation of K-means are expensive. K-means algorithm is a two-step process includes initialization and assignment step. This paper works on initialization step of the algorithm and proposed an efficient enhanced K-means clustering algorithm which eliminates the deficiency of the existing one. A new initialization approach has been introduced in the paper to drawn an initial cluster centers for K means Algorithm. The paper also compares proposed technique with K-means technique.\",\"PeriodicalId\":6627,\"journal\":{\"name\":\"2017 International Conference on Computing, Communication and Automation (ICCCA)\",\"volume\":\"21 1\",\"pages\":\"164-169\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Computing, Communication and Automation (ICCCA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCAA.2017.8229793\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Computing, Communication and Automation (ICCCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCAA.2017.8229793","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

在这个数字化的世界里，我们面对着海量的数据，却缺乏知识。挖掘的突出需求有助于从广泛可用的大量数据中提取隐藏的模式。聚类就是这样一种有用的挖掘工具，通过执行称为聚类分析的关键步骤来处理这种不利情况。它是基于相似性将模式分组成簇的过程。基于分割的聚类算法在模式分析、图像分割、识别系统等方面有着广泛的应用。在各种基于分割的聚类算法中，K-means算法由于其单调性和易于实现的特点，在各个研究领域受到了广泛的关注。该算法的一个严重问题是初始质心的选取过于复杂，如果初始质心选取不准确，可能会收敛到准则函数值的局部最优解。此外，它需要形成许多簇的先验信息，并且K-means的计算成本很高。K-means算法分为初始化和赋值两步。本文对算法的初始化步骤进行了研究，提出了一种高效的增强k均值聚类算法，消除了现有算法的不足。本文提出了一种新的初始化方法来绘制K均值算法的初始聚类中心。本文还将该方法与K-means方法进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A novel clustering framework using farthest neighbour approach

In this digital world, we are facing the flood of data, but depriving for knowledge. The eminent need of mining is useful to extract the hidden pattern from the wide availability of vast amount of data. Clustering is one such useful mining tool to handle this unfavorable situation by carrying out crucial steps refers as cluster analysis. It is the process of a grouping of patterns into clusters based on similarity. Partition based clustering algorithms are widely accepted for much diverse application such as pattern analysis, image segmentation, identification system. Among the different variations of the partition based clustering, due to its monotony and ease of implementation K-means algorithm gained a lot of attraction in the various field of research. A severe problem associated with the algorithm is that it is highly sophisticated while selecting the initial centroid and may converge to a local optimum solution of the criterion function value if the initial centroid is not chosen accurately. Additionally, it requires the prior information regarding a number of clusters to be formed and the computation of K-means are expensive. K-means algorithm is a two-step process includes initialization and assignment step. This paper works on initialization step of the algorithm and proposed an efficient enhanced K-means clustering algorithm which eliminates the deficiency of the existing one. A new initialization approach has been introduced in the paper to drawn an initial cluster centers for K means Algorithm. The paper also compares proposed technique with K-means technique.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 International Conference on Computing, Communication and Automation (ICCCA)

自引率

0.00%

发文量

期刊最新文献

Sentiment analysis on product reviews BSS: Blockchain security over software defined network A detailed analysis of data consistency concepts in data exchange formats (JSON & XML) CBIR by cascading features & SVM ADANS: An agriculture domain question answering system using ontologies