Title: Robust semi-supervised learning with reciprocal weighted mixing distribution alignment
Journal: Engineering Applications of Artificial Intelligence (JCR Q1, Automation & Control Systems; impact factor 7.5)
DOI: 10.1016/j.engappai.2024.109185
Publication date: 2024-09-02
Publication type: Journal Article
URL: https://www.sciencedirect.com/science/article/pii/S0952197624013435
Citations: 0
Abstract
Recent semi-supervised learning (SSL) methods have achieved great success owing to the impressive performance brought by the combination of pseudo-labeling and consistency regularization. These methods typically use pre-defined constant thresholds or dynamic thresholds to select the unlabeled samples that contribute to training. However, such thresholds may discard many correct pseudo-labels while admitting incorrect ones, and in distribution-mismatched scenarios threshold-adjustment strategies are often complex and ineffective. To alleviate this issue, we develop a simple yet powerful framework that abandons thresholding altogether and instead uses distribution alignment to softly adjust the predictions of a biased model. Specifically, first, we create two classifiers that predict the pseudo-label (i.e., the category a sample belongs to) and the complementary pseudo-label (i.e., a category the sample does not belong to), respectively. Second, by maintaining the distributions of pseudo-labels, complementary pseudo-labels, and their reverse versions from past iterations, we perform a reciprocal weighted mixing according to the predicted category weights. Third, a reciprocal distribution alignment is applied to the mixed distributions to adjust the predicted distributions. Finally, we propose an Implication Alignment Loss, which enforces consistency between predictions of the same implication obtained from different versions. We empirically demonstrate the effectiveness of the proposed method in comparison with state-of-the-art benchmarks. Notably, our method achieves a 1.18% error-rate reduction over the latest state-of-the-art method MutexMatch on CIFAR-10 with 2 labels per class, and it remains robust under mismatched distributions.
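The paper's exact reciprocal weighted mixing is not specified in the abstract, but the distribution-alignment primitive it builds on is well established in SSL: rescale a model's predicted class probabilities by the ratio of a target distribution to a running estimate of the model's own prediction distribution, then renormalize. The sketch below illustrates that primitive only; the class name `DistributionAligner` and the EMA momentum are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

class DistributionAligner:
    """Softly debias predictions toward a target class distribution.

    Keeps an exponential moving average (EMA) of the model's average
    predicted distribution over past batches, then rescales each new
    prediction by target / running_estimate and renormalizes.
    (Illustrative sketch of standard distribution alignment, not the
    paper's reciprocal weighted mixing.)
    """

    def __init__(self, num_classes, momentum=0.999, eps=1e-6):
        self.running = np.full(num_classes, 1.0 / num_classes)  # start uniform
        self.momentum = momentum
        self.eps = eps

    def update(self, batch_probs):
        # EMA over the batch-mean predicted distribution.
        self.running = (self.momentum * self.running
                        + (1.0 - self.momentum) * batch_probs.mean(axis=0))

    def align(self, probs, target=None):
        # Default target: uniform class distribution (assumption).
        if target is None:
            target = np.full_like(self.running, 1.0 / len(self.running))
        scaled = probs * (target / (self.running + self.eps))
        return scaled / scaled.sum(axis=-1, keepdims=True)  # renormalize rows
```

If the model is biased toward one class (its running distribution over-represents it), `align` shrinks that class's probability in each prediction, which is the "soft adjustment" role the abstract assigns to distribution alignment in place of hard threshold selection.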
Journal Introduction:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.