{"title":"Clustering Ensemble Based on a New Consensus Function with Hamming Distance","authors":"Jieting Huo, Weihong Li, Boyi Wang","doi":"10.1109/IEEC.2010.5533268","DOIUrl":null,"url":null,"abstract":"Unlike classification problems, there are no well known approaches to combining multiple clusterings which is more difficult than designing classifier ensembles since cluster labels are unknown. A new algorithm is to use Hamming distance as the similarity metric to find the best partition is proposed. Also a scheme for a selective initial cluster centers by Hamming distance is used in the consensus function, which help us to find the most likely different classes of the data. Experiment results show that the algorithm is more stable, higher performance and more efficiently than other compared methods.","PeriodicalId":307678,"journal":{"name":"2010 2nd International Symposium on Information Engineering and Electronic Commerce","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 2nd International Symposium on Information Engineering and Electronic Commerce","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEEC.2010.5533268","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Unlike classification problems, there are no well known approaches to combining multiple clusterings which is more difficult than designing classifier ensembles since cluster labels are unknown. A new algorithm is to use Hamming distance as the similarity metric to find the best partition is proposed. Also a scheme for a selective initial cluster centers by Hamming distance is used in the consensus function, which help us to find the most likely different classes of the data. Experiment results show that the algorithm is more stable, higher performance and more efficiently than other compared methods.