{"title":"Evolutionary clustering framework based on distance matrix for arbitrary-shaped data sets","authors":"Cong Liu, Chunxue Wu, Linhua Jiang","doi":"10.1049/iet-spr.2015.0335","DOIUrl":null,"url":null,"abstract":"Data clustering plays a key role in both scientific and real-world applications. However, current clustering methods still face some challenges such as clustering arbitrary-shaped data sets and detecting the cluster number automatically. This study addresses the two challenges. A novel clustering analysis method, named automatic evolutionary clustering method based on distance (AED) matrix, is proposed to determine the proper cluster number automatically, and to find the optimal clustering result as well. In AED, a distance matrix is first obtained by using a specific distance metric such as Euclidean distance metric or path distance metric, and then this distance matrix is partitioned by an evolutionary clustering framework. In this framework, a fixed-length representation scheme is implemented to represent the clustering result, a novel cross-over scheme is introduced to increase the convergence speed, and a validity index is proposed to evaluate the intermediate clustering results and the final clustering results. AED is systematically compared with some state-of-the-art clustering methods on both hyper-spherical and irregular-shaped data sets, and the experimental results suggest that the authors approach not only successfully detects the correct cluster numbers but also achieves better accuracy for most of test problems.","PeriodicalId":272888,"journal":{"name":"IET Signal Process.","volume":"3 5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Signal Process.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1049/iet-spr.2015.0335","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Data clustering plays a key role in both scientific and real-world applications. However, current clustering methods still face some challenges such as clustering arbitrary-shaped data sets and detecting the cluster number automatically. This study addresses the two challenges. A novel clustering analysis method, named automatic evolutionary clustering method based on distance (AED) matrix, is proposed to determine the proper cluster number automatically, and to find the optimal clustering result as well. In AED, a distance matrix is first obtained by using a specific distance metric such as Euclidean distance metric or path distance metric, and then this distance matrix is partitioned by an evolutionary clustering framework. In this framework, a fixed-length representation scheme is implemented to represent the clustering result, a novel cross-over scheme is introduced to increase the convergence speed, and a validity index is proposed to evaluate the intermediate clustering results and the final clustering results. AED is systematically compared with some state-of-the-art clustering methods on both hyper-spherical and irregular-shaped data sets, and the experimental results suggest that the authors approach not only successfully detects the correct cluster numbers but also achieves better accuracy for most of test problems.