A masking, linkage and guidance framework for online class incremental learning

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pattern Recognition Pub Date : 2024-11-15 DOI:10.1016/j.patcog.2024.111185

Guoqiang Liang , Zhaojie Chen , Shibin Su , Shizhou Zhang , Yanning Zhang

{"title":"A masking, linkage and guidance framework for online class incremental learning","authors":"Guoqiang Liang , Zhaojie Chen , Shibin Su , Shizhou Zhang , Yanning Zhang","doi":"10.1016/j.patcog.2024.111185","DOIUrl":null,"url":null,"abstract":"<div><div>Due to the powerful ability to acquire new knowledge and preserve previously learned concepts from a dynamic data stream, continual learning has recently garnered substantial interest. Since training data can only be used once, online class incremental learning (OCIL) is more practical and difficult. Although replay-based OCIL methods have made great progress, there is still a severe class imbalance problem. Specifically, limited by the small memory size, the number of samples for new classes is much larger than that for old classes, which finally leads to task recency bias and abrupt feature drift. To alleviate this problem, we propose a masking, linkage, and guidance framework (MLG) for OCIL, which consists of three effective modules, i.e. batch-level logit mask (BLM, masking), batch-level feature cross fusion (BFCF, linkage) and accumulative mean feature distillation (AMFD, guidance). The former two focus on class imbalance problem while the last aims to alleviate abrupt feature drift. In BLM, we only activate the logits of classes occurring in a batch, which makes the model learn knowledge within each batch. The BFCF module employs a transformer encoder layer to fuse the sample features within a batch, which rebalances the gradients of classifier’s weights and implicitly learns the sample relationship. Instead of a strict regularization in traditional feature distillation, the proposed AMFD guides previously learned features to move on purpose, which can reduce abrupt feature drift and produce a clearer boundary in feature space. Extensive experiments on four popular datasets for OCIL have shown the effectiveness of proposed MLG framework.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"160 ","pages":"Article 111185"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320324009361","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Due to the powerful ability to acquire new knowledge and preserve previously learned concepts from a dynamic data stream, continual learning has recently garnered substantial interest. Since training data can only be used once, online class incremental learning (OCIL) is more practical and difficult. Although replay-based OCIL methods have made great progress, there is still a severe class imbalance problem. Specifically, limited by the small memory size, the number of samples for new classes is much larger than that for old classes, which finally leads to task recency bias and abrupt feature drift. To alleviate this problem, we propose a masking, linkage, and guidance framework (MLG) for OCIL, which consists of three effective modules, i.e. batch-level logit mask (BLM, masking), batch-level feature cross fusion (BFCF, linkage) and accumulative mean feature distillation (AMFD, guidance). The former two focus on class imbalance problem while the last aims to alleviate abrupt feature drift. In BLM, we only activate the logits of classes occurring in a batch, which makes the model learn knowledge within each batch. The BFCF module employs a transformer encoder layer to fuse the sample features within a batch, which rebalances the gradients of classifier’s weights and implicitly learns the sample relationship. Instead of a strict regularization in traditional feature distillation, the proposed AMFD guides previously learned features to move on purpose, which can reduce abrupt feature drift and produce a clearer boundary in feature space. Extensive experiments on four popular datasets for OCIL have shown the effectiveness of proposed MLG framework.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

在线班级增量学习的屏蔽、链接和引导框架

持续学习具有从动态数据流中获取新知识和保留以前所学概念的强大能力，因此最近引起了人们的极大兴趣。由于训练数据只能使用一次，在线类增量学习（OCIL）更加实用，也更加困难。虽然基于重放的 OCIL 方法已经取得了很大进展，但仍然存在严重的类不平衡问题。具体来说，受限于较小的内存容量，新类的样本数量远远大于旧类的样本数量，这最终会导致任务重现偏差和突然的特征漂移。为了缓解这一问题，我们为 OCIL 提出了一个掩码、链接和引导框架（MLG），它由三个有效模块组成，即批量级 logit 掩码（BLM，掩码）、批量级特征交叉融合（BFCF，链接）和累积平均特征提炼（AMFD，引导）。前两种方法主要针对类不平衡问题，而后一种方法则旨在缓解突然的特征漂移。在 BLM 中，我们只激活批次中出现的类的对数，这使得模型能在每个批次中学习知识。BFCF 模块采用转换编码器层来融合批次内的样本特征，从而重新平衡分类器权重的梯度，并隐式学习样本关系。所提出的 AMFD 不是传统特征蒸馏中的严格正则化，而是引导先前学习的特征有目的地移动，这可以减少突然的特征漂移，并在特征空间中产生更清晰的边界。在 OCIL 的四个流行数据集上进行的广泛实验表明了所提出的 MLG 框架的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Pattern Recognition 工程技术-工程：电子与电气

CiteScore

14.40

自引率

16.20%

发文量

683

审稿时长

5.6 months

期刊介绍： The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.