{"title":"基于新设计聚类有效性指标的稳定分层聚类分析","authors":"Erzhou Zhu, Binbin Zhu, Feng Liu","doi":"10.1109/ICMCCE.2018.00146","DOIUrl":null,"url":null,"abstract":"Cluster validity index (CVI) is an important method for evaluating the effect of clustering results generated by clustering algorithms. Currently, many CVIs have proposed, but they are suffering from issues of unstable and narrow range of applications. Therefore, a new clustering validity index-NCVI, is proposed in this paper. Firstly, the NCVI index combines the idea of maximum spanning tree and Euclidean distance formula. The clustering results are obtained by using the average of the sum of the weights of the maximum spanning trees of each cluster and the minimum distance between the clusters using the Euclidean distance between each cluster center point. At the same time, based on the underlying algorithm (average link hierarchical clustering algorithm) to determine the optimal cluster number, combined with the new cluster validity index NCVI designed a new K value optimization algorithm (KVOA). Finally, the paper evaluates the validity of the newly proposed index (NCVI) through four simulation data sets and two UCI real data sets, and compares it with other six classical indicators. The experimental results show that the proposed index has a good performance advantage over other indexes in the tested data set.","PeriodicalId":198834,"journal":{"name":"2018 3rd International Conference on Mechanical, Control and Computer Engineering (ICMCCE)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Stable Hierarchical Clustering Analysis Based on New Designed Cluster Validity Index\",\"authors\":\"Erzhou Zhu, Binbin Zhu, Feng Liu\",\"doi\":\"10.1109/ICMCCE.2018.00146\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cluster validity index (CVI) is an important method for evaluating the effect of clustering results generated by clustering algorithms. Currently, many CVIs have proposed, but they are suffering from issues of unstable and narrow range of applications. Therefore, a new clustering validity index-NCVI, is proposed in this paper. Firstly, the NCVI index combines the idea of maximum spanning tree and Euclidean distance formula. The clustering results are obtained by using the average of the sum of the weights of the maximum spanning trees of each cluster and the minimum distance between the clusters using the Euclidean distance between each cluster center point. At the same time, based on the underlying algorithm (average link hierarchical clustering algorithm) to determine the optimal cluster number, combined with the new cluster validity index NCVI designed a new K value optimization algorithm (KVOA). Finally, the paper evaluates the validity of the newly proposed index (NCVI) through four simulation data sets and two UCI real data sets, and compares it with other six classical indicators. The experimental results show that the proposed index has a good performance advantage over other indexes in the tested data set.\",\"PeriodicalId\":198834,\"journal\":{\"name\":\"2018 3rd International Conference on Mechanical, Control and Computer Engineering (ICMCCE)\",\"volume\":\"115 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 3rd International Conference on Mechanical, Control and Computer Engineering (ICMCCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMCCE.2018.00146\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 3rd International Conference on Mechanical, Control and Computer Engineering (ICMCCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMCCE.2018.00146","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Stable Hierarchical Clustering Analysis Based on New Designed Cluster Validity Index
Cluster validity index (CVI) is an important method for evaluating the effect of clustering results generated by clustering algorithms. Currently, many CVIs have proposed, but they are suffering from issues of unstable and narrow range of applications. Therefore, a new clustering validity index-NCVI, is proposed in this paper. Firstly, the NCVI index combines the idea of maximum spanning tree and Euclidean distance formula. The clustering results are obtained by using the average of the sum of the weights of the maximum spanning trees of each cluster and the minimum distance between the clusters using the Euclidean distance between each cluster center point. At the same time, based on the underlying algorithm (average link hierarchical clustering algorithm) to determine the optimal cluster number, combined with the new cluster validity index NCVI designed a new K value optimization algorithm (KVOA). Finally, the paper evaluates the validity of the newly proposed index (NCVI) through four simulation data sets and two UCI real data sets, and compares it with other six classical indicators. The experimental results show that the proposed index has a good performance advantage over other indexes in the tested data set.