Gaoang Wang, Jenq-Neng Hwang, K. Williams, Farron Wallace, Craig S. Rose
Title: Shrinking Encoding with Two-Level Codebook Learning for Fine-Grained Fish Recognition
Published in: 2016 ICPR 2nd Workshop on Computer Vision for Analysis of Underwater Imagery (CVAUI), 2016-12-01
DOI: 10.1109/CVAUI.2016.018 (https://doi.org/10.1109/CVAUI.2016.018)
Citations: 17
Abstract
Bag-of-features (BoF) is a powerful image representation for image classification. Many codebook learning methods have been developed to find discriminative parts of images for fine-grained recognition. Building on the BoF framework, we propose a novel approach to fine-grained fish recognition that combines two-level codebook learning with shrinkage of the coding coefficients. In the BoF framework, when encoding is followed by max pooling, only the maximum-valued coefficient in each local spatial region is retained. However, that coefficient may come from a local descriptor that is not discriminative among fine-grained classes, which makes classification difficult. In this paper, a two-level codebook is learned to represent the importance between a local descriptor and each codeword among its k-nearest neighbors. A shrinkage function is also introduced to shrink unrelated coefficients after encoding. Our experimental results show that the proposed method achieves significant performance improvement on fine-grained fish recognition tasks.
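The pipeline outlined in the abstract (encode each local descriptor against its k nearest codewords, shrink weak coefficients, then max-pool) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' method: the Gaussian-weighted k-nearest-neighbor encoding, the soft-threshold form of the shrinkage function, and the names `knn_encode`, `shrink`, `max_pool`, and `threshold` are all hypothetical, and the two-level codebook learning step is omitted because the abstract does not specify it.

```python
import numpy as np


def knn_encode(descriptors, codebook, k=2):
    """Soft-assign each descriptor to its k nearest codewords (assumed scheme)."""
    # Squared Euclidean distances, shape (n_descriptors, n_codewords).
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    # Indices of the k closest codewords for every descriptor.
    nearest = np.argsort(d2, axis=1)[:, :k]
    rows = np.arange(len(descriptors))[:, None]
    # Gaussian weights on the k neighbors, normalized to sum to 1 per descriptor.
    weights = np.exp(-d2[rows, nearest])
    codes = np.zeros_like(d2)
    codes[rows, nearest] = weights / weights.sum(axis=1, keepdims=True)
    return codes


def shrink(codes, threshold):
    """Soft-threshold shrinkage (an assumption): small coefficients become zero."""
    return np.maximum(codes - threshold, 0.0)


def max_pool(codes):
    """Keep the largest coefficient per codeword over all descriptors in a region."""
    return codes.max(axis=0)


# Toy data: four 2-D codewords and two local descriptors from one spatial region.
codebook = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [5.0, 5.0]])
descriptors = np.array([[0.1, 0.0], [0.9, 0.1]])

codes = knn_encode(descriptors, codebook, k=2)
shrunk = shrink(codes, threshold=0.4)
pooled = max_pool(shrunk)
print(pooled)
```

With shrinkage applied before pooling, coefficients that only weakly relate a descriptor to a codeword are zeroed and so cannot win the max. That is the intuition the abstract gives for shrinking "unrelated coefficients after encoding".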