{"title":"Bi-focus cosine complement network for few-shot fine-grained image classification","authors":"Penghao Jia, Guanglei Gou, Yu Cheng, Aoxiang Ning","doi":"10.1016/j.patrec.2025.03.002","DOIUrl":null,"url":null,"abstract":"<div><div>Few-shot fine-grained image classification presents significant challenges due to the need to discern subtle differences with few samples. And the image backgrounds often contain substantial redundant information. Existing methods utilize features containing redundant information directly after extraction to compute distances for classification, which diminishes the model’s object-level distinguishing capability. Additionally, these methods struggle to integrate discriminative and non-discriminative features within hierarchical architectures. To address this, we propose a Bi-focus Cosine Complement Network (BfCCN). In BfCCN, the Bi-focus Module (BFM) harnesses both spatial and channel dimension information to precisely localize targets within images by mitigating the impact of background noise on the model. Moreover, the Cosine Complement Network (CCN) leverages multi-level and multi-type features to capture the fine-grained details typically present in shallow layers but absent in deeper layers, thereby enhancing the model’s ability to detect subtle distinctions by utilizing complementary cosine distance for classification. Experiments on three benchmark fine-grained datasets demonstrate the efficacy of our method. Codes will be available at <span><span>https://github.com/naivejph/BFCCN.git</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"191 ","pages":"Pages 44-50"},"PeriodicalIF":3.9000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167865525000868","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Few-shot fine-grained image classification presents significant challenges due to the need to discern subtle differences with few samples. And the image backgrounds often contain substantial redundant information. Existing methods utilize features containing redundant information directly after extraction to compute distances for classification, which diminishes the model’s object-level distinguishing capability. Additionally, these methods struggle to integrate discriminative and non-discriminative features within hierarchical architectures. To address this, we propose a Bi-focus Cosine Complement Network (BfCCN). In BfCCN, the Bi-focus Module (BFM) harnesses both spatial and channel dimension information to precisely localize targets within images by mitigating the impact of background noise on the model. Moreover, the Cosine Complement Network (CCN) leverages multi-level and multi-type features to capture the fine-grained details typically present in shallow layers but absent in deeper layers, thereby enhancing the model’s ability to detect subtle distinctions by utilizing complementary cosine distance for classification. Experiments on three benchmark fine-grained datasets demonstrate the efficacy of our method. Codes will be available at https://github.com/naivejph/BFCCN.git.
期刊介绍:
Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition.
Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.