一种基于动态更新约束的半监督聚类分层聚类方法

Q2 Computer Science 自动化学报 Pub Date : 2015-01-01 DOI:10.16383/J.AAS.2015.C140859
Chenxi Zhou, Xun Liang, Jinshan Qi
{"title":"一种基于动态更新约束的半监督聚类分层聚类方法","authors":"Chenxi Zhou, Xun Liang, Jinshan Qi","doi":"10.16383/J.AAS.2015.C140859","DOIUrl":null,"url":null,"abstract":"A semi-supervised agglomerative hierarchical clustering method based on dynamically updating constraints is proposing in this research. Following the existing semi-supervised clustering algorithm, this method uses the must-link and cannot-link constraints. Instead of using the idea that the instances with must-link constraints are pre-clustered before agglomerating with the others, this method employs a more general and reasonable process. Firstly, must-link and cannot-link constraints are expanded to compose a constraints closure. Then, a standard agglomeration instructed by cannot-link constraints is processed. During this procedure, the must-link and cannot-link are dynamically updated according to the intermediate clustering results. This updating process guarantees the validity of the final results. The fundamental advantage of this method is omitting the pre-clustering process of the instances with must-link constraints. This modification ensures that data points gain a more reasonable agglomeration order, which may result in a significant improvement on the clustering results. This research also introduces an implementation of this model based on Ward0s method, leading to the C-Ward algorithm. The experimental analyses on both Artificial simulated datasets and real world datasets show that this method is much better than the others.","PeriodicalId":35798,"journal":{"name":"自动化学报","volume":"41 1","pages":"1253-1263"},"PeriodicalIF":0.0000,"publicationDate":"2015-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A semi-supervised agglomerative hierarchical clustering method based on dynamically updating constraints\",\"authors\":\"Chenxi Zhou, Xun Liang, Jinshan Qi\",\"doi\":\"10.16383/J.AAS.2015.C140859\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A semi-supervised agglomerative hierarchical clustering method based on dynamically updating constraints is proposing in this research. Following the existing semi-supervised clustering algorithm, this method uses the must-link and cannot-link constraints. Instead of using the idea that the instances with must-link constraints are pre-clustered before agglomerating with the others, this method employs a more general and reasonable process. Firstly, must-link and cannot-link constraints are expanded to compose a constraints closure. Then, a standard agglomeration instructed by cannot-link constraints is processed. During this procedure, the must-link and cannot-link are dynamically updated according to the intermediate clustering results. This updating process guarantees the validity of the final results. The fundamental advantage of this method is omitting the pre-clustering process of the instances with must-link constraints. This modification ensures that data points gain a more reasonable agglomeration order, which may result in a significant improvement on the clustering results. This research also introduces an implementation of this model based on Ward0s method, leading to the C-Ward algorithm. The experimental analyses on both Artificial simulated datasets and real world datasets show that this method is much better than the others.\",\"PeriodicalId\":35798,\"journal\":{\"name\":\"自动化学报\",\"volume\":\"41 1\",\"pages\":\"1253-1263\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"自动化学报\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://doi.org/10.16383/J.AAS.2015.C140859\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"自动化学报","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.16383/J.AAS.2015.C140859","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 2

摘要

提出了一种基于约束动态更新的半监督聚类分层聚类方法。该方法继承了现有的半监督聚类算法,采用了必须链接约束和不能链接约束。这种方法不使用具有必须链接约束的实例在与其他实例聚合之前预先聚集的想法,而是采用更通用和合理的过程。首先,将必须链接约束和不能链接约束展开为约束闭包。然后,对非链接约束下的标准团聚问题进行了处理。在此过程中,根据中间聚类结果动态更新必须链接和不能链接。这一更新过程保证了最终结果的有效性。该方法的根本优点是省去了具有必须链接约束的实例的预聚类过程。这种修改保证了数据点获得更合理的聚类顺序,这可能会显著改善聚类结果。本研究还介绍了一种基于Ward0s方法的模型实现,从而引出了C-Ward算法。在人工模拟数据集和实际数据集上进行的实验分析表明,该方法比其他方法都要好得多。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A semi-supervised agglomerative hierarchical clustering method based on dynamically updating constraints
A semi-supervised agglomerative hierarchical clustering method based on dynamically updating constraints is proposing in this research. Following the existing semi-supervised clustering algorithm, this method uses the must-link and cannot-link constraints. Instead of using the idea that the instances with must-link constraints are pre-clustered before agglomerating with the others, this method employs a more general and reasonable process. Firstly, must-link and cannot-link constraints are expanded to compose a constraints closure. Then, a standard agglomeration instructed by cannot-link constraints is processed. During this procedure, the must-link and cannot-link are dynamically updated according to the intermediate clustering results. This updating process guarantees the validity of the final results. The fundamental advantage of this method is omitting the pre-clustering process of the instances with must-link constraints. This modification ensures that data points gain a more reasonable agglomeration order, which may result in a significant improvement on the clustering results. This research also introduces an implementation of this model based on Ward0s method, leading to the C-Ward algorithm. The experimental analyses on both Artificial simulated datasets and real world datasets show that this method is much better than the others.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自动化学报
自动化学报 Computer Science-Computer Graphics and Computer-Aided Design
CiteScore
4.80
自引率
0.00%
发文量
6655
期刊介绍: ACTA AUTOMATICA SINICA is a joint publication of Chinese Association of Automation and the Institute of Automation, the Chinese Academy of Sciences. The objective is the high quality and rapid publication of the articles, with a strong focus on new trends, original theoretical and experimental research and developments, emerging technology, and industrial standards in automation.
期刊最新文献
Endocrine therapy and urogenital outcomes among women with a breast cancer diagnosis. Robust Approximations to Joint Chance-constrained Problems A Chebyshev-Gauss Pseudospectral Method for Solving Optimal Control Problems Forward Affine Point Set Matching Under Variational Bayesian Framework SAR Image Despeckling by Sparse Reconstruction Based on Shearlets
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1