Learning the Compositional Nature of Visual Objects

2007 IEEE Conference on Computer Vision and Pattern Recognition Pub Date : 2007-06-17 DOI:10.1109/CVPR.2007.383154

B. Ommer, J. Buhmann

引用次数: 60

Abstract

The compositional nature of visual objects significantly limits their representation complexity and renders learning of structured object models tractable. Adopting this modeling strategy we both (i) automatically decompose objects into a hierarchy of relevant compositions and we (ii) learn such a compositional representation for each category without supervision. The compositional structure supports feature sharing already on the lowest level of small image patches. Compositions are represented as probability distributions over their constituent parts and the relations between them. The global shape of objects is captured by a graphical model which combines all compositions. Inference based on the underlying statistical model is then employed to obtain a category level object recognition system. Experiments on large standard benchmark datasets underline the competitive recognition performance of this approach and they provide insights into the learned compositional structure of objects.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

学习视觉对象的组成性质

视觉对象的组成特性极大地限制了其表示的复杂性，并使结构化对象模型的学习变得易于处理。采用这种建模策略，我们(i)自动将对象分解为相关组合的层次结构，(ii)在没有监督的情况下为每个类别学习这样的组合表示。该组合结构支持已经在最低级别的小图像补丁上的特征共享。组合用其组成部分及其之间关系的概率分布来表示。物体的整体形状由图形模型捕获，图形模型结合了所有组成部分。然后利用基于底层统计模型的推理得到类别级目标识别系统。在大型标准基准数据集上的实验强调了该方法的竞争性识别性能，并提供了对学习到的对象组成结构的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2007 IEEE Conference on Computer Vision and Pattern Recognition

自引率

0.00%

发文量

期刊最新文献

Combining Region and Edge Cues for Image Segmentation in a Probabilistic Gaussian Mixture Framework Fast Human Pose Estimation using Appearance and Motion via Multi-Dimensional Boosting Regression Enhanced Level Building Algorithm for the Movement Epenthesis Problem in Sign Language Recognition Change Detection in a 3-d World Layered Graph Match with Graph Editing