Stable Hierarchical Clustering Analysis Based on New Designed Cluster Validity Index

Erzhou Zhu, Binbin Zhu, Feng Liu
{"title":"Stable Hierarchical Clustering Analysis Based on New Designed Cluster Validity Index","authors":"Erzhou Zhu, Binbin Zhu, Feng Liu","doi":"10.1109/ICMCCE.2018.00146","DOIUrl":null,"url":null,"abstract":"Cluster validity index (CVI) is an important method for evaluating the effect of clustering results generated by clustering algorithms. Currently, many CVIs have proposed, but they are suffering from issues of unstable and narrow range of applications. Therefore, a new clustering validity index-NCVI, is proposed in this paper. Firstly, the NCVI index combines the idea of maximum spanning tree and Euclidean distance formula. The clustering results are obtained by using the average of the sum of the weights of the maximum spanning trees of each cluster and the minimum distance between the clusters using the Euclidean distance between each cluster center point. At the same time, based on the underlying algorithm (average link hierarchical clustering algorithm) to determine the optimal cluster number, combined with the new cluster validity index NCVI designed a new K value optimization algorithm (KVOA). Finally, the paper evaluates the validity of the newly proposed index (NCVI) through four simulation data sets and two UCI real data sets, and compares it with other six classical indicators. The experimental results show that the proposed index has a good performance advantage over other indexes in the tested data set.","PeriodicalId":198834,"journal":{"name":"2018 3rd International Conference on Mechanical, Control and Computer Engineering (ICMCCE)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 3rd International Conference on Mechanical, Control and Computer Engineering (ICMCCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMCCE.2018.00146","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Cluster validity index (CVI) is an important method for evaluating the effect of clustering results generated by clustering algorithms. Currently, many CVIs have proposed, but they are suffering from issues of unstable and narrow range of applications. Therefore, a new clustering validity index-NCVI, is proposed in this paper. Firstly, the NCVI index combines the idea of maximum spanning tree and Euclidean distance formula. The clustering results are obtained by using the average of the sum of the weights of the maximum spanning trees of each cluster and the minimum distance between the clusters using the Euclidean distance between each cluster center point. At the same time, based on the underlying algorithm (average link hierarchical clustering algorithm) to determine the optimal cluster number, combined with the new cluster validity index NCVI designed a new K value optimization algorithm (KVOA). Finally, the paper evaluates the validity of the newly proposed index (NCVI) through four simulation data sets and two UCI real data sets, and compares it with other six classical indicators. The experimental results show that the proposed index has a good performance advantage over other indexes in the tested data set.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于新设计聚类有效性指标的稳定分层聚类分析
聚类有效性指数(CVI)是评价聚类算法产生的聚类结果效果的重要方法。目前,已经提出了许多CVIs,但存在不稳定和应用范围狭窄的问题。为此,本文提出了一种新的聚类有效性指标——ncvi。首先,NCVI指数结合了最大生成树的思想和欧氏距离公式。利用聚类中心点之间的欧几里得距离,将每个聚类的最大生成树的权值和与聚类之间的最小距离的权值相加,得到聚类结果。同时,在底层算法(平均链路分层聚类算法)确定最优聚类数的基础上,结合新的聚类有效性指标NCVI设计了新的K值优化算法(KVOA)。最后,通过4个模拟数据集和2个UCI真实数据集对新指标NCVI的有效性进行了评价,并与其他6个经典指标进行了比较。实验结果表明,所提出的索引相对于测试数据集中的其他索引具有良好的性能优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Design Method of Primary Transmitting Coil for Realizing Large Uniform Magnetic Field Distribution Coordinated Control of Vehicle Lane Change and Speed at Intersection under V2X Application of High-Voltage Electrical Energy Meter in Smart Grid A Method for Exponential Component Storage Effect Evaluation Based on Bayesian Inference An Enhanced TLD Algorithm Based on Sparse Representation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1