{"title":"Towards a mathematical theory of primal sketch and sketchability","authors":"Cheng-en Guo, Song-Chun Zhu, Y. Wu","doi":"10.1109/ICCV.2003.1238631","DOIUrl":null,"url":null,"abstract":"In this paper, we present a mathematical theory for Marr's primal sketch. We first conduct a theoretical study of the descriptive Markov random field model and the generative wavelet/sparse coding model from the perspective of entropy and complexity. The competition between the two types of models defines the concept of \"sketchability\", which divides image into texture and geometry. We then propose a primal sketch model that integrates the two models and, in addition, a Gestalt field model for spatial organization. We also propose a sketching pursuit process that coordinates the competition between two pursuit algorithms: the matching pursuit (Mallat and Zhang, 1993) and the filter pursuit (Zhu, et al., 1997), that seek to explain the image by bases and filters respectively. The model can be used to learn a dictionary of image primitives, or textons in Julesz's language, for natural images. The primal sketch model is not only parsimonious for image representation, but produces meaningful sketches over a large number of generic images.","PeriodicalId":131580,"journal":{"name":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"112","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Ninth IEEE International Conference on Computer Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCV.2003.1238631","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 112
Abstract
In this paper, we present a mathematical theory for Marr's primal sketch. We first conduct a theoretical study of the descriptive Markov random field model and the generative wavelet/sparse coding model from the perspective of entropy and complexity. The competition between the two types of models defines the concept of "sketchability", which divides image into texture and geometry. We then propose a primal sketch model that integrates the two models and, in addition, a Gestalt field model for spatial organization. We also propose a sketching pursuit process that coordinates the competition between two pursuit algorithms: the matching pursuit (Mallat and Zhang, 1993) and the filter pursuit (Zhu, et al., 1997), that seek to explain the image by bases and filters respectively. The model can be used to learn a dictionary of image primitives, or textons in Julesz's language, for natural images. The primal sketch model is not only parsimonious for image representation, but produces meaningful sketches over a large number of generic images.
在本文中,我们提出了马尔原始草图的数学理论。首先从熵和复杂度的角度对描述性马尔可夫随机场模型和生成小波/稀疏编码模型进行了理论研究。两种模型之间的竞争定义了“可素描性”的概念,将图像分为纹理和几何。然后,我们提出了一个整合这两个模型的原始草图模型,以及一个空间组织的格式塔场模型。我们还提出了一种素描追踪过程,它协调了两种追踪算法之间的竞争:匹配追踪(Mallat and Zhang, 1993)和滤波追踪(Zhu, et al., 1997),这两种算法分别试图通过基和滤波器来解释图像。这个模型可以用来学习自然图像的图像原语字典,或者用Julesz的语言来说就是文本。原始草图模型不仅简化了图像表示,而且在大量通用图像上生成有意义的草图。