{"title":"Research on Lightweight Few-Shot Learning Algorithm Based on Convolutional Block Attention Mechanism","authors":"Pang Qi, Yu Yanan, Haile Haftom Berihu","doi":"10.1142/s1469026823500207","DOIUrl":null,"url":null,"abstract":"Few-shot learning can solve new learning tasks in the condition of fewer samples. However, currently, the few-shot learning algorithms mostly use the ResNet as a backbone, which leads to a large number of model parameters. To deal with the problem, a lightweight backbone named DenseAttentionNet which is based on the Convolutional Block Attention Mechanism is proposed by comparing the parameter amount and the accuracy of few-shot classification with ResNet-12. Then, based on the DenseAttentionNet, a few-shot learning algorithm called Meta-DenseAttention is presented to balance the model parameters and the classification effect. The dense connection and attention mechanism are combined to meet the requirements of fewer parameters and to achieve a good classification effect for the first time. The experimental results show that the DenseAttentionNet, not only reduces the number of parameters by 55% but also outperforms other classic backbones in the classification effect compared with the ResNet-12 benchmark. In addition, Meta-DenseAttention has an accuracy of 56.57% (5way-1shot) and 72.73% (5way-5shot) on the miniImageNet, although the number of parameters is only 3.6[Formula: see text]M. The experimental results also show that the few-shot learning algorithm proposed in this paper not only guarantees classification accuracy but also has the characteristics of lightweight.","PeriodicalId":45994,"journal":{"name":"International Journal of Computational Intelligence and Applications","volume":" ","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2023-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computational Intelligence and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s1469026823500207","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Few-shot learning can solve new learning tasks in the condition of fewer samples. However, currently, the few-shot learning algorithms mostly use the ResNet as a backbone, which leads to a large number of model parameters. To deal with the problem, a lightweight backbone named DenseAttentionNet which is based on the Convolutional Block Attention Mechanism is proposed by comparing the parameter amount and the accuracy of few-shot classification with ResNet-12. Then, based on the DenseAttentionNet, a few-shot learning algorithm called Meta-DenseAttention is presented to balance the model parameters and the classification effect. The dense connection and attention mechanism are combined to meet the requirements of fewer parameters and to achieve a good classification effect for the first time. The experimental results show that the DenseAttentionNet, not only reduces the number of parameters by 55% but also outperforms other classic backbones in the classification effect compared with the ResNet-12 benchmark. In addition, Meta-DenseAttention has an accuracy of 56.57% (5way-1shot) and 72.73% (5way-5shot) on the miniImageNet, although the number of parameters is only 3.6[Formula: see text]M. The experimental results also show that the few-shot learning algorithm proposed in this paper not only guarantees classification accuracy but also has the characteristics of lightweight.
期刊介绍:
The International Journal of Computational Intelligence and Applications, IJCIA, is a refereed journal dedicated to the theory and applications of computational intelligence (artificial neural networks, fuzzy systems, evolutionary computation and hybrid systems). The main goal of this journal is to provide the scientific community and industry with a vehicle whereby ideas using two or more conventional and computational intelligence based techniques could be discussed. The IJCIA welcomes original works in areas such as neural networks, fuzzy logic, evolutionary computation, pattern recognition, hybrid intelligent systems, symbolic machine learning, statistical models, image/audio/video compression and retrieval.