Lequan Wang, Jin Duali, Ziqiang Chen, Guangqiu Chen, Gaotian Liu
{"title":"Weakly Supervised Fine-Grained Visual Classification Through Spatial Information Mining and Attention-guided Regularization","authors":"Lequan Wang, Jin Duali, Ziqiang Chen, Guangqiu Chen, Gaotian Liu","doi":"10.1109/ICCSS53909.2021.9722008","DOIUrl":null,"url":null,"abstract":"Over-fitting is a severe problem when we adopt deep neural networks with a large number parameters in fine-grained visual classification. Many data augmentation methods are proposed through weakly supervised learning to alleviate over-fitting issue. Different from those methods, we propose a weakly supervised attention-guided regularization by object parts’ attention maps to fine-tune the Fully Connected (FC) layer and relieve over-fitting issue during training in this paper. On the other hand, the neural units in the last convolutional layer contain the same receptive fields that limit recognition performance due to involving lots of background noises. To alleviate this issue, we devise a spatial information mining module with an auxiliary penalty loss to aggregate multi-scale receptive fields feature maps with the selected precedent layer. Comprehensive experiments are conducted to show our method achieves or surpasses state-of-the-art results on common fine-grained classification datasets.","PeriodicalId":435816,"journal":{"name":"2021 8th International Conference on Information, Cybernetics, and Computational Social Systems (ICCSS)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 8th International Conference on Information, Cybernetics, and Computational Social Systems (ICCSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSS53909.2021.9722008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Over-fitting is a severe problem when we adopt deep neural networks with a large number parameters in fine-grained visual classification. Many data augmentation methods are proposed through weakly supervised learning to alleviate over-fitting issue. Different from those methods, we propose a weakly supervised attention-guided regularization by object parts’ attention maps to fine-tune the Fully Connected (FC) layer and relieve over-fitting issue during training in this paper. On the other hand, the neural units in the last convolutional layer contain the same receptive fields that limit recognition performance due to involving lots of background noises. To alleviate this issue, we devise a spatial information mining module with an auxiliary penalty loss to aggregate multi-scale receptive fields feature maps with the selected precedent layer. Comprehensive experiments are conducted to show our method achieves or surpasses state-of-the-art results on common fine-grained classification datasets.