{"title":"Ward method of hierarchical clustering for non-Euclidean similarity measures","authors":"S. Miyamoto, Ryosuke Abe, Y. Endo, J. Takeshita","doi":"10.1109/SOCPAR.2015.7492784","DOIUrl":null,"url":null,"abstract":"The Ward linkage method in agglomerative hierarchical clustering is sometimes used for non-Euclidean similarity, i.e., non-positive definite matrix of similarity, which is not an adequate use of this method, since the square Euclidean distance should be its basis. Nevertheless, this paper shows that the Ward method for non positive-definite similarity can partly be justified. It is shown that the result from the Ward method to a non positive-definite and normalized similarity is almost the same as another result from the Ward method to a positive-definite matrix obtained from the original similarity by adding a positive constant to the diagonal elements. More precisely, the same clusters are generated by the same order from the both data. Only the levels of their generations are different.","PeriodicalId":409493,"journal":{"name":"2015 7th International Conference of Soft Computing and Pattern Recognition (SoCPaR)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 7th International Conference of Soft Computing and Pattern Recognition (SoCPaR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SOCPAR.2015.7492784","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 23
Abstract
The Ward linkage method in agglomerative hierarchical clustering is sometimes used for non-Euclidean similarity, i.e., non-positive definite matrix of similarity, which is not an adequate use of this method, since the square Euclidean distance should be its basis. Nevertheless, this paper shows that the Ward method for non positive-definite similarity can partly be justified. It is shown that the result from the Ward method to a non positive-definite and normalized similarity is almost the same as another result from the Ward method to a positive-definite matrix obtained from the original similarity by adding a positive constant to the diagonal elements. More precisely, the same clusters are generated by the same order from the both data. Only the levels of their generations are different.