New K-means Clustering Method Using Minkowski's Distance as its Metric

British Journal of Computer, Networking and Information Technology. Pub Date: 2021-07-06. DOI: 10.52589/BJCNIT-XEPSJBWX

Eric U.O., Michael O.O., Oberhiri-Orumah G., Chike H. N.


Citations: 0

Abstract



Cluster analysis is an unsupervised learning method that partitions data points, usually multidimensional, into groups (called clusters) such that members of one cluster are more similar (in some sense) to each other than to members of other clusters. In this paper, we propose a new k-means clustering method that uses the Minkowski distance as its metric in a normed vector space; the Minkowski distance generalizes both the Euclidean distance and the Manhattan distance. The k-means clustering methods discussed in this paper are Forgy's method, Lloyd's method, MacQueen's method, Hartigan and Wong's method, Likas' method and Faber's method, which use the usual Euclidean distance. On simulated and real-life data sets, the new k-means clustering method performed favourably in comparison with the existing methods in terms of minimizing the total intra-cluster variance.
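To make the idea in the abstract concrete, the following is a minimal sketch, not the authors' algorithm. It assumes the standard definition of the Minkowski distance of order p, d_p(x, y) = (sum_i |x_i - y_i|^p)^(1/p), which reduces to the Manhattan distance at p = 1 and the Euclidean distance at p = 2. The function names (`minkowski_distance`, `kmeans_minkowski`, `total_within_cluster_variance`), the Forgy-style initialisation, and the mean-based centroid update are illustrative assumptions; the abstract does not state how the proposed method updates centroids.

```python
import numpy as np


def minkowski_distance(x, y, p=2.0):
    """Minkowski distance of order p >= 1 (p=1: Manhattan, p=2: Euclidean)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return float(np.sum(np.abs(x - y) ** p) ** (1.0 / p))


def _assign(points, centroids, p):
    """Label each point with the index of its nearest centroid under d_p."""
    diffs = np.abs(points[:, None, :] - centroids[None, :, :])
    dists = np.sum(diffs ** p, axis=2) ** (1.0 / p)  # shape (n_points, k)
    return np.argmin(dists, axis=1)


def kmeans_minkowski(points, k, p=2.0, n_iter=100, seed=0):
    """Lloyd-style iteration with Minkowski-distance assignment.

    Assumption: centroids are updated as cluster means, as in ordinary
    k-means. The paper's own update rule is not given in the abstract.
    """
    points = np.asarray(points, dtype=float)
    rng = np.random.default_rng(seed)
    # Forgy-style start: k distinct data points serve as initial centroids.
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        labels = _assign(points, centroids, p)
        new_centroids = centroids.copy()
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:  # keep the old centroid if a cluster empties
                new_centroids[j] = members.mean(axis=0)
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    # Recompute labels so they always match the returned centroids.
    return _assign(points, centroids, p), centroids


def total_within_cluster_variance(points, labels, centroids):
    """Sum of squared Euclidean deviations of points from their centroids."""
    points = np.asarray(points, dtype=float)
    return float(np.sum((points - centroids[labels]) ** 2))
```

Running `kmeans_minkowski(data, k, p=1.0)` and `kmeans_minkowski(data, k, p=2.0)` on the same data and comparing `total_within_cluster_variance` mirrors, in spirit, the kind of comparison the abstract describes. Note that for p other than 2 the cluster mean is not guaranteed to minimise the summed Minkowski distances, so this sketch is only one of several possible designs.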

