Rexhep Rada, Erind Bedalli, Sokol Shurdhi, B. Çiço
{"title":"基于原型的聚类方法比较分析","authors":"Rexhep Rada, Erind Bedalli, Sokol Shurdhi, B. Çiço","doi":"10.1109/MECO58584.2023.10154917","DOIUrl":null,"url":null,"abstract":"In the machine learning domain, clustering is a fundamental unsupervised learning operation which aims to partition the instances of a dataset into clusters (i.e, groups, subsets) such that instances within the same cluster are much similar to each other and much different from the other clusters. In the broad spectrum of clustering methods, prototype-based methods characterize each cluster through a prototype (i.e. centroid) and a relocation scheme is employed to iteratively redistribute the instances into the clusters, guided by an objective function. In this paper, several prototype-based methods are brought into focus, including K-means, K-medoids, K-medians, Fuzzy C-means and Kernel K-means. These algorithms are experimentally analyzed on several original benchmark datasets, distorted benchmark datasets and synthetic datasets. The comparative analysis is oriented in two main aspects: accuracy and sensitivity to noise and outliers.","PeriodicalId":187825,"journal":{"name":"2023 12th Mediterranean Conference on Embedded Computing (MECO)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A comparative analysis on prototype-based clustering methods\",\"authors\":\"Rexhep Rada, Erind Bedalli, Sokol Shurdhi, B. Çiço\",\"doi\":\"10.1109/MECO58584.2023.10154917\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the machine learning domain, clustering is a fundamental unsupervised learning operation which aims to partition the instances of a dataset into clusters (i.e, groups, subsets) such that instances within the same cluster are much similar to each other and much different from the other clusters. In the broad spectrum of clustering methods, prototype-based methods characterize each cluster through a prototype (i.e. centroid) and a relocation scheme is employed to iteratively redistribute the instances into the clusters, guided by an objective function. In this paper, several prototype-based methods are brought into focus, including K-means, K-medoids, K-medians, Fuzzy C-means and Kernel K-means. These algorithms are experimentally analyzed on several original benchmark datasets, distorted benchmark datasets and synthetic datasets. The comparative analysis is oriented in two main aspects: accuracy and sensitivity to noise and outliers.\",\"PeriodicalId\":187825,\"journal\":{\"name\":\"2023 12th Mediterranean Conference on Embedded Computing (MECO)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 12th Mediterranean Conference on Embedded Computing (MECO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MECO58584.2023.10154917\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 12th Mediterranean Conference on Embedded Computing (MECO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MECO58584.2023.10154917","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A comparative analysis on prototype-based clustering methods
In the machine learning domain, clustering is a fundamental unsupervised learning operation which aims to partition the instances of a dataset into clusters (i.e, groups, subsets) such that instances within the same cluster are much similar to each other and much different from the other clusters. In the broad spectrum of clustering methods, prototype-based methods characterize each cluster through a prototype (i.e. centroid) and a relocation scheme is employed to iteratively redistribute the instances into the clusters, guided by an objective function. In this paper, several prototype-based methods are brought into focus, including K-means, K-medoids, K-medians, Fuzzy C-means and Kernel K-means. These algorithms are experimentally analyzed on several original benchmark datasets, distorted benchmark datasets and synthetic datasets. The comparative analysis is oriented in two main aspects: accuracy and sensitivity to noise and outliers.