Learning Common Semantics via Optimal Transport for Contrastive Multi-View Clustering

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society Pub Date : 2024-08-08 DOI:10.1109/TIP.2024.3436615

Qian Zhang;Lin Zhang;Ran Song;Runmin Cong;Yonghuai Liu;Wei Zhang

{"title":"Learning Common Semantics via Optimal Transport for Contrastive Multi-View Clustering","authors":"Qian Zhang;Lin Zhang;Ran Song;Runmin Cong;Yonghuai Liu;Wei Zhang","doi":"10.1109/TIP.2024.3436615","DOIUrl":null,"url":null,"abstract":"Multi-view clustering aims to learn discriminative representations from multi-view data. Although existing methods show impressive performance by leveraging contrastive learning to tackle the representation gap between every two views, they share the common limitation of not performing semantic alignment from a global perspective, resulting in the undermining of semantic patterns in multi-view data. This paper presents CSOT, namely Common Semantics via Optimal Transport, to boost contrastive multi-view clustering via semantic learning in a common space that integrates all views. Through optimal transport, the samples in multiple views are mapped to the joint clusters which represent the multi-view semantic patterns in the common space. With the semantic assignment derived from the optimal transport plan, we design a semantic learning module where the soft assignment vector works as a global supervision to enforce the model to learn consistent semantics among all views. Moreover, we propose a semantic-aware re-weighting strategy to treat samples differently according to their semantic significance, which improves the effectiveness of cross-view contrastive representation learning. Extensive experimental results demonstrate that CSOT achieves the state-of-the-art clustering performance.","PeriodicalId":94032,"journal":{"name":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10630657/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Multi-view clustering aims to learn discriminative representations from multi-view data. Although existing methods show impressive performance by leveraging contrastive learning to tackle the representation gap between every two views, they share the common limitation of not performing semantic alignment from a global perspective, resulting in the undermining of semantic patterns in multi-view data. This paper presents CSOT, namely Common Semantics via Optimal Transport, to boost contrastive multi-view clustering via semantic learning in a common space that integrates all views. Through optimal transport, the samples in multiple views are mapped to the joint clusters which represent the multi-view semantic patterns in the common space. With the semantic assignment derived from the optimal transport plan, we design a semantic learning module where the soft assignment vector works as a global supervision to enforce the model to learn consistent semantics among all views. Moreover, we propose a semantic-aware re-weighting strategy to treat samples differently according to their semantic significance, which improves the effectiveness of cross-view contrastive representation learning. Extensive experimental results demonstrate that CSOT achieves the state-of-the-art clustering performance.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过最佳传输学习共同语义，实现多视角对比聚类

多视图聚类旨在从多视图数据中学习判别表征。虽然现有的方法通过利用对比学习来解决每两个视图之间的表征差距，从而显示出令人印象深刻的性能，但它们都有一个共同的局限性，即没有从全局角度进行语义对齐，导致多视图数据中的语义模式受到破坏。本文提出的 CSOT（即通过最优传输的通用语义）通过在整合所有视图的通用空间中进行语义学习来促进对比性多视图聚类。通过最优传输，多个视图中的样本被映射到共同空间中代表多视图语义模式的联合聚类中。利用从最优传输计划中得出的语义分配，我们设计了一个语义学习模块，其中软分配向量作为全局监督，强制模型在所有视图中学习一致的语义。此外，我们还提出了一种语义感知再加权策略，根据样本的语义意义对其进行不同处理，从而提高了跨视图对比表征学习的效果。广泛的实验结果表明，CSOT 实现了最先进的聚类性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

自引率

0.00%

发文量