基于对象双属性的K-Means聚类算法

Tu Linli, D. Yanni, Chu Siyong
{"title":"基于对象双属性的K-Means聚类算法","authors":"Tu Linli, D. Yanni, Chu Siyong","doi":"10.1109/ICMTMA.2015.12","DOIUrl":null,"url":null,"abstract":"The K-means clustering algorithm have played an important role in the data analysis, pattern recognition, image processing, and market research. Classical K-means algorithm randomly selected initial cluster centers, so that the clustering results unstable. In this paper, through deeply study on classical k-means algorithm, we proposed a new K - means algorithm of Clustering based on double attributes of objects. The algorithm is based on the dissimilarity degree matrix which generated by high density set to construct the Huffman tree, and then according to K value to select initial cluster centers points in the Huffman tree, using this method effectively overcomes the defects of classical K-means algorithm for clustering random selection caused the initial cluster centers result unstable defects. In this paper, the new algorithm uses two UCI data sets to validate. The results of experiment show that the new k-means algorithm can choose the initial cluster center of high quality stable, so as to get better clustering results.","PeriodicalId":196962,"journal":{"name":"2015 Seventh International Conference on Measuring Technology and Mechatronics Automation","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A K-Means Clustering Algorithm Based on Double Attributes of Objects\",\"authors\":\"Tu Linli, D. Yanni, Chu Siyong\",\"doi\":\"10.1109/ICMTMA.2015.12\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The K-means clustering algorithm have played an important role in the data analysis, pattern recognition, image processing, and market research. Classical K-means algorithm randomly selected initial cluster centers, so that the clustering results unstable. In this paper, through deeply study on classical k-means algorithm, we proposed a new K - means algorithm of Clustering based on double attributes of objects. The algorithm is based on the dissimilarity degree matrix which generated by high density set to construct the Huffman tree, and then according to K value to select initial cluster centers points in the Huffman tree, using this method effectively overcomes the defects of classical K-means algorithm for clustering random selection caused the initial cluster centers result unstable defects. In this paper, the new algorithm uses two UCI data sets to validate. The results of experiment show that the new k-means algorithm can choose the initial cluster center of high quality stable, so as to get better clustering results.\",\"PeriodicalId\":196962,\"journal\":{\"name\":\"2015 Seventh International Conference on Measuring Technology and Mechatronics Automation\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 Seventh International Conference on Measuring Technology and Mechatronics Automation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMTMA.2015.12\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Seventh International Conference on Measuring Technology and Mechatronics Automation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMTMA.2015.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

K-means聚类算法在数据分析、模式识别、图像处理和市场研究中发挥了重要作用。经典K-means算法随机选择初始聚类中心,使得聚类结果不稳定。本文通过对经典K -means算法的深入研究,提出了一种新的基于对象双属性的K -means聚类算法。该算法基于高密度集生成的不相似度矩阵构造Huffman树,然后根据K值在Huffman树中选择初始聚类中心点,利用该方法有效克服了经典K-means算法进行聚类随机选择导致初始聚类中心结果不稳定的缺陷。本文采用两个UCI数据集对新算法进行验证。实验结果表明,新的k-means算法可以选择质量稳定的初始聚类中心,从而获得较好的聚类结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A K-Means Clustering Algorithm Based on Double Attributes of Objects
The K-means clustering algorithm have played an important role in the data analysis, pattern recognition, image processing, and market research. Classical K-means algorithm randomly selected initial cluster centers, so that the clustering results unstable. In this paper, through deeply study on classical k-means algorithm, we proposed a new K - means algorithm of Clustering based on double attributes of objects. The algorithm is based on the dissimilarity degree matrix which generated by high density set to construct the Huffman tree, and then according to K value to select initial cluster centers points in the Huffman tree, using this method effectively overcomes the defects of classical K-means algorithm for clustering random selection caused the initial cluster centers result unstable defects. In this paper, the new algorithm uses two UCI data sets to validate. The results of experiment show that the new k-means algorithm can choose the initial cluster center of high quality stable, so as to get better clustering results.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Based on Data Analysis about Risks of Bidding Decisions in Engineering Projects Study on Trusted Vitual Machine Platform Based on Cipher Card Meso-Structure Quantitative Research about Coals Based on the Digital Image Processing Technology News of Atomic Events Time Sequence Relationship Recognition Based on Function Word and Predicate Co-occurrence Research on Bridge Subsidence Control Based on Slip Casting Control
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1