一种基于最近邻方法的聚类框架

Suvendu Kanungo, A. Shukla
{"title":"一种基于最近邻方法的聚类框架","authors":"Suvendu Kanungo, A. Shukla","doi":"10.1109/CCAA.2017.8229793","DOIUrl":null,"url":null,"abstract":"In this digital world, we are facing the flood of data, but depriving for knowledge. The eminent need of mining is useful to extract the hidden pattern from the wide availability of vast amount of data. Clustering is one such useful mining tool to handle this unfavorable situation by carrying out crucial steps refers as cluster analysis. It is the process of a grouping of patterns into clusters based on similarity. Partition based clustering algorithms are widely accepted for much diverse application such as pattern analysis, image segmentation, identification system. Among the different variations of the partition based clustering, due to its monotony and ease of implementation K-means algorithm gained a lot of attraction in the various field of research. A severe problem associated with the algorithm is that it is highly sophisticated while selecting the initial centroid and may converge to a local optimum solution of the criterion function value if the initial centroid is not chosen accurately. Additionally, it requires the prior information regarding a number of clusters to be formed and the computation of K-means are expensive. K-means algorithm is a two-step process includes initialization and assignment step. This paper works on initialization step of the algorithm and proposed an efficient enhanced K-means clustering algorithm which eliminates the deficiency of the existing one. A new initialization approach has been introduced in the paper to drawn an initial cluster centers for K means Algorithm. The paper also compares proposed technique with K-means technique.","PeriodicalId":6627,"journal":{"name":"2017 International Conference on Computing, Communication and Automation (ICCCA)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A novel clustering framework using farthest neighbour approach\",\"authors\":\"Suvendu Kanungo, A. Shukla\",\"doi\":\"10.1109/CCAA.2017.8229793\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this digital world, we are facing the flood of data, but depriving for knowledge. The eminent need of mining is useful to extract the hidden pattern from the wide availability of vast amount of data. Clustering is one such useful mining tool to handle this unfavorable situation by carrying out crucial steps refers as cluster analysis. It is the process of a grouping of patterns into clusters based on similarity. Partition based clustering algorithms are widely accepted for much diverse application such as pattern analysis, image segmentation, identification system. Among the different variations of the partition based clustering, due to its monotony and ease of implementation K-means algorithm gained a lot of attraction in the various field of research. A severe problem associated with the algorithm is that it is highly sophisticated while selecting the initial centroid and may converge to a local optimum solution of the criterion function value if the initial centroid is not chosen accurately. Additionally, it requires the prior information regarding a number of clusters to be formed and the computation of K-means are expensive. K-means algorithm is a two-step process includes initialization and assignment step. This paper works on initialization step of the algorithm and proposed an efficient enhanced K-means clustering algorithm which eliminates the deficiency of the existing one. A new initialization approach has been introduced in the paper to drawn an initial cluster centers for K means Algorithm. The paper also compares proposed technique with K-means technique.\",\"PeriodicalId\":6627,\"journal\":{\"name\":\"2017 International Conference on Computing, Communication and Automation (ICCCA)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Computing, Communication and Automation (ICCCA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCAA.2017.8229793\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Computing, Communication and Automation (ICCCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCAA.2017.8229793","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

在这个数字化的世界里,我们面对着海量的数据,却缺乏知识。挖掘的突出需求有助于从广泛可用的大量数据中提取隐藏的模式。聚类就是这样一种有用的挖掘工具,通过执行称为聚类分析的关键步骤来处理这种不利情况。它是基于相似性将模式分组成簇的过程。基于分割的聚类算法在模式分析、图像分割、识别系统等方面有着广泛的应用。在各种基于分割的聚类算法中,K-means算法由于其单调性和易于实现的特点,在各个研究领域受到了广泛的关注。该算法的一个严重问题是初始质心的选取过于复杂,如果初始质心选取不准确,可能会收敛到准则函数值的局部最优解。此外,它需要形成许多簇的先验信息,并且K-means的计算成本很高。K-means算法分为初始化和赋值两步。本文对算法的初始化步骤进行了研究,提出了一种高效的增强k均值聚类算法,消除了现有算法的不足。本文提出了一种新的初始化方法来绘制K均值算法的初始聚类中心。本文还将该方法与K-means方法进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A novel clustering framework using farthest neighbour approach
In this digital world, we are facing the flood of data, but depriving for knowledge. The eminent need of mining is useful to extract the hidden pattern from the wide availability of vast amount of data. Clustering is one such useful mining tool to handle this unfavorable situation by carrying out crucial steps refers as cluster analysis. It is the process of a grouping of patterns into clusters based on similarity. Partition based clustering algorithms are widely accepted for much diverse application such as pattern analysis, image segmentation, identification system. Among the different variations of the partition based clustering, due to its monotony and ease of implementation K-means algorithm gained a lot of attraction in the various field of research. A severe problem associated with the algorithm is that it is highly sophisticated while selecting the initial centroid and may converge to a local optimum solution of the criterion function value if the initial centroid is not chosen accurately. Additionally, it requires the prior information regarding a number of clusters to be formed and the computation of K-means are expensive. K-means algorithm is a two-step process includes initialization and assignment step. This paper works on initialization step of the algorithm and proposed an efficient enhanced K-means clustering algorithm which eliminates the deficiency of the existing one. A new initialization approach has been introduced in the paper to drawn an initial cluster centers for K means Algorithm. The paper also compares proposed technique with K-means technique.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Sentiment analysis on product reviews BSS: Blockchain security over software defined network A detailed analysis of data consistency concepts in data exchange formats (JSON & XML) CBIR by cascading features & SVM ADANS: An agriculture domain question answering system using ontologies
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1