LCSL: Long-Tailed Classification via Self-Labeling

IEEE Transactions on Circuits and Systems for Video Technology, vol. 34, no. 11, pp. 12048-12058
Impact Factor: 11.1 | JCR Q1 (Engineering, Electrical & Electronic) | CAS Region 1 (Engineering & Technology)
Published: 2024-07-02 | DOI: 10.1109/TCSVT.2024.3421942
Duc-Quang Vu; Trang T. T. Phung; Jia-Ching Wang; Son T. Mai
Citations: 0

Abstract

Over the last decades, deep learning (DL) has proven to be a powerful and successful technique in many real-world applications, e.g., video surveillance or object detection. However, when class label distributions are highly skewed, DL classifiers tend to be biased towards majority classes during training. This leads to poor generalization on minority classes and consequently reduces overall accuracy. How to effectively deal with this long-tailed class distribution in DL, i.e., deep long-tailed classification (DLC), remains a challenging problem despite many research efforts. Among various approaches, data augmentation, which aims at generating more samples to reduce label imbalance, is the most common and practical one. However, simply relying on existing class-agnostic augmentation strategies without properly considering label differences can worsen the problem, since more head-class samples are inevitably augmented than tail-class ones. Moreover, none of the existing works consider the quality and suitability of augmented samples during the training process. Our proposed approach, called Long-tailed Classification via Self-Labeling (LCSL), is specifically designed to address these limitations. LCSL fundamentally differs from existing works in the way it iteratively exploits the preceding network during training to re-label the augmented samples and uses the output confidence to decide whether new samples belong to minority classes before adding them to the data. Not only does this help to reduce imbalance ratios among classes, but it also helps to reduce prediction uncertainty for minority classes by adding only more confident samples to the data. This incremental learning and generating scheme thus provides a new, robust approach for decreasing model over-fitting, thereby enhancing overall accuracy, especially for minority classes.
Extensive experiments demonstrate that LCSL achieves better performance than state-of-the-art long-tailed learning techniques on various standard benchmark datasets. More specifically, LCSL achieves accuracies of 85.8%, 54.4%, and 56.2% on CIFAR10-LT, CIFAR100-LT, and ImageNet-LT (with moderate to extreme imbalance ratios), respectively. The source code is available at https://github.com/vdquang1991/lcsl/ .
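The selection step the abstract describes — re-labeling augmented samples with the current network and keeping only confident predictions that fall into minority classes — can be sketched as follows. This is an illustrative reconstruction, not the authors' released code: the confidence threshold `tau`, the `tail_quantile` cutoff, and the class-count bookkeeping are all assumptions; see the linked repository for the actual implementation.

```python
import numpy as np

def select_augmented_samples(probs, class_counts, tau=0.95, tail_quantile=0.5):
    """Sketch of an LCSL-style self-labeling selection step (illustrative).

    Each augmented sample is re-labeled with the current model's prediction;
    it is kept only if that prediction is confident (max prob >= tau) and the
    predicted class is a minority (tail) class, so accepted samples shrink
    the imbalance ratio rather than grow it.

    probs:        (N, C) softmax outputs of the current network
    class_counts: (C,)   number of training samples per class so far
    Returns indices of accepted samples and their pseudo-labels.
    """
    pseudo_labels = probs.argmax(axis=1)   # re-label with the model
    confidence = probs.max(axis=1)         # prediction confidence
    # Treat classes at or below the median count as tail classes (assumed rule).
    cutoff = np.quantile(class_counts, tail_quantile)
    is_tail = class_counts[pseudo_labels] <= cutoff
    keep = (confidence >= tau) & is_tail
    return np.flatnonzero(keep), pseudo_labels[keep]

# Toy usage: 3 classes, class 2 is the tail class.
probs = np.array([[0.98, 0.01, 0.01],   # confident head-class -> rejected
                  [0.02, 0.02, 0.96],   # confident tail-class -> accepted
                  [0.40, 0.30, 0.30]])  # low confidence       -> rejected
counts = np.array([500, 400, 50])
idx, labels = select_augmented_samples(probs, counts)
# idx -> [1], labels -> [2]
```

Filtering on both confidence and predicted-class rarity is what distinguishes this from plain pseudo-labeling: a confident head-class sample is discarded because adding it would only deepen the imbalance.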