Density Peak Clustering Algorithm Based on High Density Connection with Entropy Optimization

Weiguo Yi, Bin Ma, Siwei Ma, Heng Zhang
{"title":"Density Peak Clustering Algorithm Based on High Density Connection with Entropy Optimization","authors":"Weiguo Yi, Bin Ma, Siwei Ma, Heng Zhang","doi":"10.1109/ISPDS56360.2022.9874073","DOIUrl":null,"url":null,"abstract":"In this paper, an MM-HDC (Max Mean and High Density Connection) method was proposed to find the initial clustering center based on the maximum mean distance and fuse each cluster based on the high density Connection. Firstly, $\\Delta\\rho=70\\%$ was set to select the initial clustering centers and the mean distance was introduced. The selection of cluster centers will be stopped until the distance between the desired new mean center and some previously selected cluster centers is less than $2^{\\ast}d_{c}$, and the selection of initial cluster centers is completed. Then use the allocation policy of k-means to clustering all the data points by the distance between each initial clustering center and data points, constantly updated after cluster center, center for migration, until the old and the new cluster centers position changed little (the distance is very small), then stop update clustering center, and the last of the clustering results as the final clustering results. Finally, iterative fusion method is used for center fusion to get better clustering results. Experimental results of classical data sets show that the MM-HDC method is superior to the DPC algorithm and k-means algorithm, and the improved density peak clustering algorithm has higher accuracy. Moreover, The MM-HDC algorithm can obtain satisfactory results on the data set with special shape or uneven distribution.","PeriodicalId":280244,"journal":{"name":"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPDS56360.2022.9874073","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, an MM-HDC (Max Mean and High Density Connection) method was proposed to find the initial clustering center based on the maximum mean distance and fuse each cluster based on the high density Connection. Firstly, $\Delta\rho=70\%$ was set to select the initial clustering centers and the mean distance was introduced. The selection of cluster centers will be stopped until the distance between the desired new mean center and some previously selected cluster centers is less than $2^{\ast}d_{c}$, and the selection of initial cluster centers is completed. Then use the allocation policy of k-means to clustering all the data points by the distance between each initial clustering center and data points, constantly updated after cluster center, center for migration, until the old and the new cluster centers position changed little (the distance is very small), then stop update clustering center, and the last of the clustering results as the final clustering results. Finally, iterative fusion method is used for center fusion to get better clustering results. Experimental results of classical data sets show that the MM-HDC method is superior to the DPC algorithm and k-means algorithm, and the improved density peak clustering algorithm has higher accuracy. Moreover, The MM-HDC algorithm can obtain satisfactory results on the data set with special shape or uneven distribution.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于熵优化高密度连接的密度峰值聚类算法
本文提出了一种MM-HDC (Max Mean and High Density Connection)方法,基于最大平均距离找到初始聚类中心,并基于高密度连接融合各个聚类。首先,通过设置$\Delta\rho=70\%$选择初始聚类中心,引入平均距离;直到期望的新平均中心与先前选择的一些聚类中心之间的距离小于$2^{\ast}d_{c}$,然后停止聚类中心的选择,完成初始聚类中心的选择。然后使用k-means的分配策略,根据每个初始聚类中心与数据点之间的距离对所有数据点进行聚类,不断更新聚类中心后,对中心进行迁移,直到新旧聚类中心的位置变化很小(距离非常小),才停止更新聚类中心,并将最后的聚类结果作为最终聚类结果。最后,采用迭代融合方法进行中心融合,得到较好的聚类结果。经典数据集的实验结果表明,MM-HDC方法优于DPC算法和k-means算法,改进的密度峰聚类算法具有更高的准确率。此外,对于形状特殊或分布不均匀的数据集,MM-HDC算法也能获得满意的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Research on Intelligent Quality Inspection of Customer Service Under the “One Network” Operation Mode of Toll Roads Application of AE keying technology in film and television post-production Study on Artifact Classification Identification Based on Deep Learning Design of Real-time Target Detection System in CCD Vertical Target Coordinate Measurement An evaluation method of municipal pipeline cleaning effect based on image processing
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1