Learning feature alignment and dual correlation for few-shot image classification

IF 8.4 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE CAAI Transactions on Intelligence Technology Pub Date : 2023-12-05 DOI:10.1049/cit2.12273

Xilang Huang, Seon Han Choi

{"title":"Learning feature alignment and dual correlation for few-shot image classification","authors":"Xilang Huang, Seon Han Choi","doi":"10.1049/cit2.12273","DOIUrl":null,"url":null,"abstract":"<p>Few-shot image classification is the task of classifying novel classes using extremely limited labelled samples. To perform classification using the limited samples, one solution is to learn the feature alignment (FA) information between the labelled and unlabelled sample features. Most FA methods use the feature mean as the class prototype and calculate the correlation between prototype and unlabelled features to learn an alignment strategy. However, mean prototypes tend to degenerate informative features because spatial features at the same position may not be equally important for the final classification, leading to inaccurate correlation calculations. Therefore, the authors propose an effective intraclass FA strategy that aggregates semantically similar spatial features from an adaptive reference prototype in low-dimensional feature space to obtain an informative prototype feature map for precise correlation computation. Moreover, a dual correlation module to learn the hard and soft correlations was developed by the authors. This module combines the correlation information between the prototype and unlabelled features in both the original and learnable feature spaces, aiming to produce a comprehensive cross-correlation between the prototypes and unlabelled features. Using both FA and cross-attention modules, our model can maintain informative class features and capture important shared features for classification. Experimental results on three few-shot classification benchmarks show that the proposed method outperformed related methods and resulted in a 3% performance boost in the 1-shot setting by inserting the proposed module into the related methods.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 2","pages":"303-318"},"PeriodicalIF":8.4000,"publicationDate":"2023-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12273","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"CAAI Transactions on Intelligence Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cit2.12273","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Few-shot image classification is the task of classifying novel classes using extremely limited labelled samples. To perform classification using the limited samples, one solution is to learn the feature alignment (FA) information between the labelled and unlabelled sample features. Most FA methods use the feature mean as the class prototype and calculate the correlation between prototype and unlabelled features to learn an alignment strategy. However, mean prototypes tend to degenerate informative features because spatial features at the same position may not be equally important for the final classification, leading to inaccurate correlation calculations. Therefore, the authors propose an effective intraclass FA strategy that aggregates semantically similar spatial features from an adaptive reference prototype in low-dimensional feature space to obtain an informative prototype feature map for precise correlation computation. Moreover, a dual correlation module to learn the hard and soft correlations was developed by the authors. This module combines the correlation information between the prototype and unlabelled features in both the original and learnable feature spaces, aiming to produce a comprehensive cross-correlation between the prototypes and unlabelled features. Using both FA and cross-attention modules, our model can maintain informative class features and capture important shared features for classification. Experimental results on three few-shot classification benchmarks show that the proposed method outperformed related methods and resulted in a 3% performance boost in the 1-shot setting by inserting the proposed module into the related methods.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

学习特征对齐和双相关性，实现少镜头图像分类

少射图像分类是使用极其有限的标记样本对新类别进行分类的任务。为了使用有限的样本进行分类，一种解决方案是学习标记和未标记样本特征之间的特征对齐(FA)信息。大多数遗传算法使用特征均值作为类原型，通过计算原型与未标记特征之间的相关性来学习对齐策略。然而，由于同一位置的空间特征对最终分类的重要性可能不相同，平均原型往往会使信息特征退化，从而导致相关性计算不准确。因此，作者提出了一种有效的类内特征分析策略，该策略将语义相似的空间特征从自适应参考原型中聚集到低维特征空间中，以获得信息丰富的原型特征图，用于精确的相关性计算。此外，作者还开发了一个双相关模块来学习硬相关和软相关。该模块结合了原始特征空间和可学习特征空间中原型和未标记特征之间的相关信息，旨在产生原型和未标记特征之间的全面交叉相关。使用FA和交叉注意模块，我们的模型可以维护信息丰富的类特征，并捕获重要的共享特征进行分类。实验结果表明，通过将所提出的模块插入到相关方法中，所提出的方法优于相关方法，并且在1 - shot设置中性能提升了3%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

CAAI Transactions on Intelligence Technology COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

11.00

自引率

3.90%

发文量

134

审稿时长

35 weeks

期刊介绍： CAAI Transactions on Intelligence Technology is a leading venue for original research on the theoretical and experimental aspects of artificial intelligence technology. We are a fully open access journal co-published by the Institution of Engineering and Technology (IET) and the Chinese Association for Artificial Intelligence (CAAI) providing research which is openly accessible to read and share worldwide.