Modeling Image Patches with a Generic Dictionary of Mini-Epitomes.

George Papandreou, Liang-Chieh Chen, Alan L Yuille
{"title":"Modeling Image Patches with a Generic Dictionary of Mini-Epitomes.","authors":"George Papandreou,&nbsp;Liang-Chieh Chen,&nbsp;Alan L Yuille","doi":"10.1109/CVPR.2014.264","DOIUrl":null,"url":null,"abstract":"<p><p>The goal of this paper is to question the necessity of features like SIFT in categorical visual recognition tasks. As an alternative, we develop a generative model for the raw intensity of image patches and show that it can support image classification performance on par with optimized SIFT-based techniques in a bag-of-visual-words setting. Key ingredient of the proposed model is a compact dictionary of mini-epitomes, learned in an unsupervised fashion on a large collection of images. The use of epitomes allows us to explicitly account for photometric and position variability in image appearance. We show that this flexibility considerably increases the capacity of the dictionary to accurately approximate the appearance of image patches and support recognition tasks. For image classification, we develop histogram-based image encoding methods tailored to the epitomic representation, as well as an \"epitomic footprint\" encoding which is easy to visualize and highlights the generative nature of our model. We discuss in detail computational aspects and develop efficient algorithms to make the model scalable to large tasks. The proposed techniques are evaluated with experiments on the challenging PASCAL VOC 2007 image classification benchmark.</p>","PeriodicalId":74560,"journal":{"name":"Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition","volume":"2014 ","pages":"2059-2066"},"PeriodicalIF":0.0000,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/CVPR.2014.264","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPR.2014.264","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17

Abstract

The goal of this paper is to question the necessity of features like SIFT in categorical visual recognition tasks. As an alternative, we develop a generative model for the raw intensity of image patches and show that it can support image classification performance on par with optimized SIFT-based techniques in a bag-of-visual-words setting. A key ingredient of the proposed model is a compact dictionary of mini-epitomes, learned in an unsupervised fashion on a large collection of images. The use of epitomes allows us to explicitly account for photometric and position variability in image appearance. We show that this flexibility considerably increases the capacity of the dictionary to accurately approximate the appearance of image patches and support recognition tasks. For image classification, we develop histogram-based image encoding methods tailored to the epitomic representation, as well as an "epitomic footprint" encoding which is easy to visualize and highlights the generative nature of our model. We discuss computational aspects in detail and develop efficient algorithms that make the model scalable to large tasks. The proposed techniques are evaluated with experiments on the challenging PASCAL VOC 2007 image classification benchmark.
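To make the idea concrete, below is a minimal NumPy sketch (not the authors' code) of the matching and encoding scheme the abstract describes: each image patch is compared against every sub-window of every mini-epitome, with a crude mean/contrast normalization standing in for the paper's explicit photometric model, and an image is encoded as a hard-assignment histogram over best-matching epitome indices. The patch and epitome sizes, the normalization, the stride, and all function names are illustrative assumptions, not the paper's actual parameters or implementation.

import numpy as np

PATCH = 8       # assumed patch side length (pixels)
EPITOME = 12    # assumed mini-epitome side length; each epitome offers
                # (EPITOME - PATCH + 1)**2 candidate positions for a patch

def _normalize(x, eps=1e-8):
    """Remove the mean and scale to unit norm (crude photometric invariance)."""
    x = x - x.mean()
    return x / (np.linalg.norm(x) + eps)

def match_patch(patch, epitomes):
    """Return (best_epitome_index, best_offset, best_score) for one patch.

    epitomes: array of shape (K, EPITOME, EPITOME).
    The score is the correlation with the best-placed sub-window.
    """
    p = _normalize(patch.astype(np.float64))
    best = (-1, (0, 0), -np.inf)
    for k, e in enumerate(epitomes):
        for dy in range(EPITOME - PATCH + 1):
            for dx in range(EPITOME - PATCH + 1):
                w = _normalize(e[dy:dy + PATCH, dx:dx + PATCH].astype(np.float64))
                score = float(np.dot(p.ravel(), w.ravel()))
                if score > best[2]:
                    best = (k, (dy, dx), score)
    return best

def encode_image(image, epitomes, stride=4):
    """Hard-assignment histogram over epitome indices (a simplified stand-in
    for the paper's histogram-based encodings)."""
    K = len(epitomes)
    hist = np.zeros(K)
    H, W = image.shape
    for y in range(0, H - PATCH + 1, stride):
        for x in range(0, W - PATCH + 1, stride):
            k, _, _ = match_patch(image[y:y + PATCH, x:x + PATCH], epitomes)
            hist[k] += 1
    return hist / max(hist.sum(), 1.0)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    dictionary = rng.standard_normal((16, EPITOME, EPITOME))  # stand-in for a learned dictionary
    img = rng.standard_normal((64, 64))                       # stand-in for a grayscale image
    print(encode_image(img, dictionary, stride=8))

The exhaustive loop over sub-window offsets is what distinguishes epitomic matching from matching against fixed-size dictionary atoms; the paper develops efficient algorithms for this search, whereas the sketch above uses brute force purely for clarity.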
