Auto-Pairing Positives Through Implicit Relation Circulation for Discriminative Self-Learning

IF 18.6 IEEE transactions on pattern analysis and machine intelligence Pub Date : 2025-01-09 DOI:10.1109/TPAMI.2025.3526802

Bo Pang;Zhenyu Wei;Jingli Lin;Cewu Lu

{"title":"Auto-Pairing Positives Through Implicit Relation Circulation for Discriminative Self-Learning","authors":"Bo Pang;Zhenyu Wei;Jingli Lin;Cewu Lu","doi":"10.1109/TPAMI.2025.3526802","DOIUrl":null,"url":null,"abstract":"Contrastive learning, a discriminative self-learning framework, is one of the most popular representation learning methods which has a wide range of application scenarios. Although relative techniques have been continuously updated in recent years, designing and seeking positive pairs are still inevitable. Just because of the requirement of explicit positive pairs, the utilization of contrastive learning is restricted in dense, multi-modal, and other scenarios where positive pairs are difficult to obtain. To solve this problem, in this paper, we design an auto-pairing mechanism called <bold>I</b>mplicit <bold>R</b>elation <bold>C</b>irculation (<bold>IRC</b>) for discriminative self-learning frameworks. Its core idea is to conduct a random walk among multiple feature groups we want to contrast but without explicit matchup, which we call the complex task (Task C). By linking the head and tail of the random walk to form a circulation with a simple task (task S) containing easy-obtaining pairs, we can apply cycle consistency as supervision guidance to gradually learn the wanted positive pairs among the random walk of feature groups automatically. We provide several amazing applications of IRC: we can learn 1) effective dense image pixel relations and representation with only image-level pairs; 2) 3D temporal point-level multi-modal point cloud relations and representation; and 3) even image representation with the help of language without off-the-shelf vision-language pairs. As an easy-to-use plug-and-play mechanism, we evaluate its universality and robustness with multiple self-learning algorithms, tasks, and datasets, achieving stable and significant improvements. As an illustrative example, IRC improves the SOTA performance by about 3.0 mIoU on image semantic segmentation, 1.5 mIoU on 3D segmentation, 1.3 mAP on 3D detection, and an average of 1.2 top1 accuracy on image classification with the help of the auto-learned positive pairs. Importantly, these improvements are achieved with little parameter and computation overhead. We hope IRC can provide the community with new insight into discriminative self-learning.","PeriodicalId":94034,"journal":{"name":"IEEE transactions on pattern analysis and machine intelligence","volume":"47 4","pages":"2739-2753"},"PeriodicalIF":18.6000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on pattern analysis and machine intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10835191/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Contrastive learning, a discriminative self-learning framework, is one of the most popular representation learning methods which has a wide range of application scenarios. Although relative techniques have been continuously updated in recent years, designing and seeking positive pairs are still inevitable. Just because of the requirement of explicit positive pairs, the utilization of contrastive learning is restricted in dense, multi-modal, and other scenarios where positive pairs are difficult to obtain. To solve this problem, in this paper, we design an auto-pairing mechanism called Implicit Relation Circulation (IRC) for discriminative self-learning frameworks. Its core idea is to conduct a random walk among multiple feature groups we want to contrast but without explicit matchup, which we call the complex task (Task C). By linking the head and tail of the random walk to form a circulation with a simple task (task S) containing easy-obtaining pairs, we can apply cycle consistency as supervision guidance to gradually learn the wanted positive pairs among the random walk of feature groups automatically. We provide several amazing applications of IRC: we can learn 1) effective dense image pixel relations and representation with only image-level pairs; 2) 3D temporal point-level multi-modal point cloud relations and representation; and 3) even image representation with the help of language without off-the-shelf vision-language pairs. As an easy-to-use plug-and-play mechanism, we evaluate its universality and robustness with multiple self-learning algorithms, tasks, and datasets, achieving stable and significant improvements. As an illustrative example, IRC improves the SOTA performance by about 3.0 mIoU on image semantic segmentation, 1.5 mIoU on 3D segmentation, 1.3 mAP on 3D detection, and an average of 1.2 top1 accuracy on image classification with the help of the auto-learned positive pairs. Importantly, these improvements are achieved with little parameter and computation overhead. We hope IRC can provide the community with new insight into discriminative self-learning.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于隐式关系循环的判别自学习自动配对阳性

对比学习是一种判别式的自学习框架，是目前最流行的表征学习方法之一，具有广泛的应用场景。虽然近年来相关技术不断更新，但设计和寻找正对仍然是不可避免的。正是由于明确的正对的要求，在密集、多模态等难以获得正对的场景下，对比学习的运用受到了限制。为了解决这一问题，本文设计了一种用于判别自学习框架的自动配对机制——隐式关系循环（IRC）。其核心思想是在多个需要对比但没有明确匹配的特征组之间进行随机行走，我们称之为复杂任务（task C）。通过将随机行走的头尾与包含易获得对的简单任务（task S）连接形成循环，利用循环一致性作为监督指导，逐步自动学习特征组随机行走中想要的正对。我们提供了几个令人惊叹的IRC应用：我们可以学习1)仅使用图像级对有效的密集图像像素关系和表示；2)三维时间点级多模态点云关系与表示；3)甚至在没有现成的视觉语言对的情况下，借助语言进行图像表示。作为一种易于使用的即插即用机制，我们用多个自学习算法、任务和数据集评估了它的通用性和鲁棒性，实现了稳定和显著的改进。例如，IRC在图像语义分割上提高了约3.0 mIoU，在3D分割上提高了1.5 mIoU，在3D检测上提高了1.3 mAP，在图像分类上平均提高了1.2 top1精度。重要的是，这些改进是以很少的参数和计算开销实现的。我们希望IRC能够为社区提供辨别性自学的新见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on pattern analysis and machine intelligence

自引率

0.00%

发文量

期刊最新文献

Spike Camera Optical Flow Estimation Based on Continuous Spike Streams. Bi-C²R: Bidirectional Continual Compatible Representation for Re-Indexing Free Lifelong Person Re-Identification. 2025 Reviewers List* Towards Transferable Defense Against Malicious Image Edits. Understanding Data Influence With Differential Approximation.