首页 > 最新文献

Proceedings Ninth IEEE International Conference on Computer Vision最新文献

英文 中文
"Perspective shape from shading" and viscosity solutions “透视形状从阴影”和粘度解决方案
Pub Date : 2003-10-13 DOI: 10.1109/ICCV.2003.1238433
E. Prados, O. Faugeras
This article proposes a solution of the Lambertian shape from shading (SFS) problem in the case of a pinhole camera model (performing a perspective projection). Our approach is based upon the notion of viscosity solutions of Hamilton-Jacobi equations. This approach allows us to naturally deal with nonsmooth solutions and provides a mathematical framework for proving correctness of our algorithms. Our work extends previous work in the area in three aspects. First, it models the camera as a pinhole whereas most authors assume an orthographic projection, thereby extending the applicability of shape from shading methods to more realistic images. In particular it extends the work of E. Prados et al. (2002) and E. Rouy et al. (1992). Second, by adapting the brightness equation to the perspective problem, we obtain a new partial differential equation (PDE). Results about the existence and uniqueness of its solution are also obtained. Third, it allows us to come up with a new approximation scheme and a new algorithm for computing numerical approximations of the "continuous" solution as well as a proof of their convergence toward that solution.
本文提出了一种解决针孔相机模型(进行透视投影)的兰伯特形状从阴影(SFS)问题的方法。我们的方法是基于汉密尔顿-雅可比方程的粘性解的概念。这种方法使我们能够自然地处理非光滑解,并为证明算法的正确性提供了一个数学框架。我们的工作从三个方面扩展了之前在该领域的工作。首先,它将相机建模为针孔,而大多数作者假设为正射影,从而将形状的适用性从阴影方法扩展到更逼真的图像。特别是,它扩展了E. Prados等人(2002)和E. Rouy等人(1992)的工作。其次,将亮度方程应用于透视问题,得到一个新的偏微分方程。得到了其解的存在唯一性。第三,它允许我们提出一种新的近似方案和一种新的算法来计算“连续”解的数值近似,并证明它们向该解收敛。
{"title":"\"Perspective shape from shading\" and viscosity solutions","authors":"E. Prados, O. Faugeras","doi":"10.1109/ICCV.2003.1238433","DOIUrl":"https://doi.org/10.1109/ICCV.2003.1238433","url":null,"abstract":"This article proposes a solution of the Lambertian shape from shading (SFS) problem in the case of a pinhole camera model (performing a perspective projection). Our approach is based upon the notion of viscosity solutions of Hamilton-Jacobi equations. This approach allows us to naturally deal with nonsmooth solutions and provides a mathematical framework for proving correctness of our algorithms. Our work extends previous work in the area in three aspects. First, it models the camera as a pinhole whereas most authors assume an orthographic projection, thereby extending the applicability of shape from shading methods to more realistic images. In particular it extends the work of E. Prados et al. (2002) and E. Rouy et al. (1992). Second, by adapting the brightness equation to the perspective problem, we obtain a new partial differential equation (PDE). Results about the existence and uniqueness of its solution are also obtained. Third, it allows us to come up with a new approximation scheme and a new algorithm for computing numerical approximations of the \"continuous\" solution as well as a proof of their convergence toward that solution.","PeriodicalId":131580,"journal":{"name":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129928454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 140
Linear multiview reconstruction of points, lines, planes and cameras using a reference plane 线性多视图重建点,线,平面和相机使用参考平面
Pub Date : 2003-10-13 DOI: 10.1109/ICCV.2003.1238629
C. Rother
This paper presents a new linear method for reconstructing simultaneously 3D features (points, lines and planes) and cameras from many perspective views by solving a single linear system. It assumes that a real or virtual reference plane is visible in all views. We call it the Direct Reference Plane (DRP) method. It is well known that the projection relationship between uncalibrated cameras and 3D features is nonlinear in the absence of a reference plane. With a known reference plane, points and cameras have a linear relationship, as shown by Rother and Carlsson (2001). The main contribution of this paper is that lines and cameras, as well as, planes and cameras also have a linear relationship. Consequently, all 3D features and all cameras can be reconstructed simultaneously from a single linear system, which handles missing image measurements naturally. A further contribution is an extensive experimental comparison, using real data, of different reference plane and nonreference plane reconstruction methods. For difficult reference plane scenarios, with point or line features, the DRP method is superior to all compared methods. Finally, an extensive list of reference plane scenarios is presented, which shows the wide applicability of the DRP method.
本文提出了一种新的线性方法,通过求解单个线性系统,从多个角度同时重建三维特征(点、线、面)和摄像机。它假设在所有视图中都可见真实或虚拟参考平面。我们称之为直接参考平面(DRP)方法。众所周知,在没有参考平面的情况下,未标定相机与三维特征之间的投影关系是非线性的。Rother和Carlsson(2001)表明,在已知参考平面的情况下,点和相机之间存在线性关系。本文的主要贡献在于线与相机,以及平面与相机也具有线性关系。因此,所有的3D特征和所有的相机可以从一个单一的线性系统同时重建,它处理缺失的图像测量自然。进一步的贡献是使用实际数据对不同参考平面和非参考平面重建方法进行了广泛的实验比较。对于具有点或线特征的复杂参考平面场景,DRP方法优于所有比较方法。最后,给出了一个广泛的参考平面场景列表,显示了DRP方法的广泛适用性。
{"title":"Linear multiview reconstruction of points, lines, planes and cameras using a reference plane","authors":"C. Rother","doi":"10.1109/ICCV.2003.1238629","DOIUrl":"https://doi.org/10.1109/ICCV.2003.1238629","url":null,"abstract":"This paper presents a new linear method for reconstructing simultaneously 3D features (points, lines and planes) and cameras from many perspective views by solving a single linear system. It assumes that a real or virtual reference plane is visible in all views. We call it the Direct Reference Plane (DRP) method. It is well known that the projection relationship between uncalibrated cameras and 3D features is nonlinear in the absence of a reference plane. With a known reference plane, points and cameras have a linear relationship, as shown by Rother and Carlsson (2001). The main contribution of this paper is that lines and cameras, as well as, planes and cameras also have a linear relationship. Consequently, all 3D features and all cameras can be reconstructed simultaneously from a single linear system, which handles missing image measurements naturally. A further contribution is an extensive experimental comparison, using real data, of different reference plane and nonreference plane reconstruction methods. For difficult reference plane scenarios, with point or line features, the DRP method is superior to all compared methods. Finally, an extensive list of reference plane scenarios is presented, which shows the wide applicability of the DRP method.","PeriodicalId":131580,"journal":{"name":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127683535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Dominant sets and hierarchical clustering 优势集与层次聚类
Pub Date : 2003-10-13 DOI: 10.1109/ICCV.2003.1238367
M. Pavan, M. Pelillo
Dominant sets are a new graph-theoretic concept that has proven to be relevant in partitional (flat) clustering as well as image segmentation problems. However, in many computer vision applications, such as the organization of an image database, it is important to provide the data to be clustered with a hierarchical organization, and it is not clear how to do this within the dominant set framework. We address precisely this problem, and present a simple and elegant solution to it. To this end, we consider a family of (continuous) quadratic programs, which contain a parameterized regularization term that controls the global shape of the energy landscape. When the regularization parameter is zero the local solutions are known to be in one-to-one correspondence with dominant sets, but when it is positive an interesting picture emerges. We determine bounds for the regularization parameter that allow us to exclude from the set of local solutions those inducing clusters of size smaller than a prescribed threshold. This suggests a new (divisive) hierarchical approach to clustering, which is based on the idea of properly varying the regularization parameter during the clustering process. Straightforward dynamics from evolutionary game theory are used to locate the solutions of the quadratic programs at each level of the hierarchy. We apply the proposed framework to the problem of organizing a shape database. Experiments with three different similarity matrices (and databases) reported in the literature have been conducted, and the results confirm the effectiveness of our approach.
优势集是一个新的图论概念,已被证明与分区(平面)聚类和图像分割问题相关。然而,在许多计算机视觉应用中,例如图像数据库的组织,重要的是提供要用分层组织聚类的数据,并且不清楚如何在主导集框架内做到这一点。我们正好解决了这个问题,并提出了一个简单而优雅的解决方案。为此,我们考虑了一组(连续的)二次规划,其中包含一个参数化正则化项,控制能源格局的全局形状。当正则化参数为零时,已知局部解与优势集是一对一对应的,但当正则化参数为正时,一个有趣的图像出现了。我们确定正则化参数的边界,使我们能够从局部解集中排除那些诱导大小小于规定阈值的簇。这提出了一种新的(分裂的)分层聚类方法,该方法基于在聚类过程中适当改变正则化参数的思想。从进化博弈论的直接动力学是用来定位在每一层次的二次规划的解决方案。我们将提出的框架应用于形状数据库的组织问题。用文献中报道的三种不同的相似矩阵(和数据库)进行了实验,结果证实了我们方法的有效性。
{"title":"Dominant sets and hierarchical clustering","authors":"M. Pavan, M. Pelillo","doi":"10.1109/ICCV.2003.1238367","DOIUrl":"https://doi.org/10.1109/ICCV.2003.1238367","url":null,"abstract":"Dominant sets are a new graph-theoretic concept that has proven to be relevant in partitional (flat) clustering as well as image segmentation problems. However, in many computer vision applications, such as the organization of an image database, it is important to provide the data to be clustered with a hierarchical organization, and it is not clear how to do this within the dominant set framework. We address precisely this problem, and present a simple and elegant solution to it. To this end, we consider a family of (continuous) quadratic programs, which contain a parameterized regularization term that controls the global shape of the energy landscape. When the regularization parameter is zero the local solutions are known to be in one-to-one correspondence with dominant sets, but when it is positive an interesting picture emerges. We determine bounds for the regularization parameter that allow us to exclude from the set of local solutions those inducing clusters of size smaller than a prescribed threshold. This suggests a new (divisive) hierarchical approach to clustering, which is based on the idea of properly varying the regularization parameter during the clustering process. Straightforward dynamics from evolutionary game theory are used to locate the solutions of the quadratic programs at each level of the hierarchy. We apply the proposed framework to the problem of organizing a shape database. Experiments with three different similarity matrices (and databases) reported in the literature have been conducted, and the results confirm the effectiveness of our approach.","PeriodicalId":131580,"journal":{"name":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129656569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 87
Combining gradient and albedo data for rotation invariant classification of 3D surface texture 结合梯度和反照率数据进行三维表面纹理旋转不变分类
Pub Date : 2003-10-13 DOI: 10.1109/ICCV.2003.1238437
Jiahua Wu, M. Chantler
We present a new texture classification scheme which is invariant to surface-rotation. Many texture classification approaches have been presented in the past that are image-rotation invariant. However, image rotation is not necessarily the same as surface rotation. We have therefore developed a classifier that uses invariants that are derived from surface properties rather than image properties. Previously we developed a scheme that used surface gradient (normal) fields estimated using photometric stereo. In this paper we augment these data with albedo information and also employ an additional feature set: the radial spectrum. We used 30 real textures to test the new classifier. A classification accuracy of 91% was achieved when albedo and gradient 1D polar and radial features were combined. The best performance was also achieved by using 2D albedo and gradient spectra. The classification accuracy is 99%.
提出了一种不受表面旋转影响的纹理分类方法。过去已经提出了许多图像旋转不变性的纹理分类方法。然而,图像旋转不一定与表面旋转相同。因此,我们开发了一种分类器,该分类器使用来自表面属性而不是图像属性的不变量。以前我们开发了一个方案,使用表面梯度(法向)场估计使用光度立体。在本文中,我们用反照率信息增强了这些数据,并使用了一个额外的特征集:径向光谱。我们使用了30个真实纹理来测试新的分类器。当反照率和梯度一维极、径向特征相结合时,分类精度达到91%。利用二维反照率和梯度光谱也取得了最好的效果。分类准确率达99%。
{"title":"Combining gradient and albedo data for rotation invariant classification of 3D surface texture","authors":"Jiahua Wu, M. Chantler","doi":"10.1109/ICCV.2003.1238437","DOIUrl":"https://doi.org/10.1109/ICCV.2003.1238437","url":null,"abstract":"We present a new texture classification scheme which is invariant to surface-rotation. Many texture classification approaches have been presented in the past that are image-rotation invariant. However, image rotation is not necessarily the same as surface rotation. We have therefore developed a classifier that uses invariants that are derived from surface properties rather than image properties. Previously we developed a scheme that used surface gradient (normal) fields estimated using photometric stereo. In this paper we augment these data with albedo information and also employ an additional feature set: the radial spectrum. We used 30 real textures to test the new classifier. A classification accuracy of 91% was achieved when albedo and gradient 1D polar and radial features were combined. The best performance was also achieved by using 2D albedo and gradient spectra. The classification accuracy is 99%.","PeriodicalId":131580,"journal":{"name":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129695218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Recognition with local features: the kernel recipe 局部特征识别:内核配方
Pub Date : 2003-10-13 DOI: 10.1109/ICCV.2003.1238351
C. Wallraven, B. Caputo, Arnulf B. A. Graf
Recent developments in computer vision have shown that local features can provide efficient representations suitable for robust object recognition. Support vector machines have been established as powerful learning algorithms with good generalization capabilities. We combine these two approaches and propose a general kernel method for recognition with local features. We show that the proposed kernel satisfies the Mercer condition and that it is, suitable for many established local feature frameworks. Large-scale recognition results are presented on three different databases, which demonstrate that SVMs with the proposed kernel perform better than standard matching techniques on local features. In addition, experiments on noisy and occluded images show that local feature representations significantly outperform global approaches.
计算机视觉的最新发展表明,局部特征可以提供适合于鲁棒目标识别的有效表示。支持向量机是一种强大的学习算法,具有良好的泛化能力。我们将这两种方法结合起来,提出了一种基于局部特征识别的通用核方法。我们证明了所提出的内核满足Mercer条件,并且适用于许多已建立的局部特征框架。在三个不同的数据库上给出了大规模的识别结果,结果表明,基于该核的支持向量机在局部特征上的识别效果优于标准匹配技术。此外,对噪声和遮挡图像的实验表明,局部特征表示明显优于全局方法。
{"title":"Recognition with local features: the kernel recipe","authors":"C. Wallraven, B. Caputo, Arnulf B. A. Graf","doi":"10.1109/ICCV.2003.1238351","DOIUrl":"https://doi.org/10.1109/ICCV.2003.1238351","url":null,"abstract":"Recent developments in computer vision have shown that local features can provide efficient representations suitable for robust object recognition. Support vector machines have been established as powerful learning algorithms with good generalization capabilities. We combine these two approaches and propose a general kernel method for recognition with local features. We show that the proposed kernel satisfies the Mercer condition and that it is, suitable for many established local feature frameworks. Large-scale recognition results are presented on three different databases, which demonstrate that SVMs with the proposed kernel perform better than standard matching techniques on local features. In addition, experiments on noisy and occluded images show that local feature representations significantly outperform global approaches.","PeriodicalId":131580,"journal":{"name":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124389466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 426
Unsupervised non-parametric region segmentation using level sets 使用水平集的无监督非参数区域分割
Pub Date : 2003-10-13 DOI: 10.1109/ICCV.2003.1238636
T. Kadir, M. Brady
We present a novel non-parametric unsupervised segmentation algorithm based on region competition (Zhu and Yuille, 1996); but implemented within a level sets framework (Osher and Sethian, 1988). The key novelty of the algorithm is that it can solve N /spl ges/ 2 class segmentation problems using just one embedded surface; this is achieved by controlling the merging and splitting behaviour of the level sets according to a minimum description length (MDL) (Leclerc (1989) and Rissanen (1985)) cost function. This is in contrast to N class region-based level set segmentation methods to date which operate by evolving multiple coupled embedded surfaces in parallel (Chan et al., 2002). Furthermore, it operates in an unsupervised manner; it is necessary neither to specify the value of N nor the class models a-priori. We argue that the level sets methodology provides a more convenient framework for the implementation of the region competition algorithm, which is conventionally implemented using region membership arrays due to the lack of a intrinsic curve representation. Finally, we generalise the Gaussian region model used in standard region competition to the non-parametric case. The region boundary motion and merge equations become simple expressions containing cross-entropy and entropy terms.
我们提出了一种新的基于区域竞争的非参数无监督分割算法(Zhu and Yuille, 1996);但要在关卡集框架内执行。该算法的新颖之处在于仅使用一个嵌入曲面就可以解决N /spl / 2类分割问题;这是通过根据最小描述长度(MDL) (Leclerc(1989)和Rissanen(1985))成本函数控制关卡集的合并和分裂行为来实现的。这与迄今为止N类基于区域的水平集分割方法形成对比,后者通过并行发展多个耦合嵌入表面来操作(Chan et al., 2002)。此外,它以一种无人监督的方式运作;既不需要指定N的值,也不需要先验地指定类模型。我们认为水平集方法为区域竞争算法的实现提供了一个更方便的框架,由于缺乏固有的曲线表示,区域竞争算法通常使用区域成员数组实现。最后,我们将标准区域竞争中的高斯区域模型推广到非参数情况。区域边界运动和合并方程变成包含交叉熵和熵项的简单表达式。
{"title":"Unsupervised non-parametric region segmentation using level sets","authors":"T. Kadir, M. Brady","doi":"10.1109/ICCV.2003.1238636","DOIUrl":"https://doi.org/10.1109/ICCV.2003.1238636","url":null,"abstract":"We present a novel non-parametric unsupervised segmentation algorithm based on region competition (Zhu and Yuille, 1996); but implemented within a level sets framework (Osher and Sethian, 1988). The key novelty of the algorithm is that it can solve N /spl ges/ 2 class segmentation problems using just one embedded surface; this is achieved by controlling the merging and splitting behaviour of the level sets according to a minimum description length (MDL) (Leclerc (1989) and Rissanen (1985)) cost function. This is in contrast to N class region-based level set segmentation methods to date which operate by evolving multiple coupled embedded surfaces in parallel (Chan et al., 2002). Furthermore, it operates in an unsupervised manner; it is necessary neither to specify the value of N nor the class models a-priori. We argue that the level sets methodology provides a more convenient framework for the implementation of the region competition algorithm, which is conventionally implemented using region membership arrays due to the lack of a intrinsic curve representation. Finally, we generalise the Gaussian region model used in standard region competition to the non-parametric case. The region boundary motion and merge equations become simple expressions containing cross-entropy and entropy terms.","PeriodicalId":131580,"journal":{"name":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124590306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 60
Filtering using a tree-based estimator 使用基于树的估计器进行过滤
Pub Date : 2003-10-13 DOI: 10.1109/ICCV.2003.1238467
B. Stenger, A. Thayananthan, P. Torr, R. Cipolla
Within this paper a new framework for Bayesian tracking is presented, which approximates the posterior distribution at multiple resolutions. We propose a tree-based representation of the distribution, where the leaves define a partition of the state space with piecewise constant density. The advantage of this representation is that regions with low probability mass can be rapidly discarded in a hierarchical search, and the distribution can be approximated to arbitrary precision. We demonstrate the effectiveness of the technique by using it for tracking 3D articulated and nonrigid motion in front of cluttered background. More specifically, we are interested in estimating the joint angles, position and orientation of a 3D hand model in order to drive an avatar.
本文提出了一种新的贝叶斯跟踪框架,它近似于多分辨率下的后验分布。我们提出了一种基于树的分布表示,其中叶子定义了状态空间的分段恒定密度分区。这种表示的优点是在分层搜索中可以快速丢弃质量概率较低的区域,并且分布可以近似到任意精度。我们通过使用它来跟踪杂乱背景前的3D关节和非刚性运动来证明该技术的有效性。更具体地说,我们感兴趣的是估计关节角度,3D手模型的位置和方向,以驱动化身。
{"title":"Filtering using a tree-based estimator","authors":"B. Stenger, A. Thayananthan, P. Torr, R. Cipolla","doi":"10.1109/ICCV.2003.1238467","DOIUrl":"https://doi.org/10.1109/ICCV.2003.1238467","url":null,"abstract":"Within this paper a new framework for Bayesian tracking is presented, which approximates the posterior distribution at multiple resolutions. We propose a tree-based representation of the distribution, where the leaves define a partition of the state space with piecewise constant density. The advantage of this representation is that regions with low probability mass can be rapidly discarded in a hierarchical search, and the distribution can be approximated to arbitrary precision. We demonstrate the effectiveness of the technique by using it for tracking 3D articulated and nonrigid motion in front of cluttered background. More specifically, we are interested in estimating the joint angles, position and orientation of a 3D hand model in order to drive an avatar.","PeriodicalId":131580,"journal":{"name":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122820834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 167
A Bayesian network framework for relational shape matching 关系形状匹配的贝叶斯网络框架
Pub Date : 2003-10-13 DOI: 10.1109/ICCV.2003.1238412
Anand Rangarajan, J. Coughlan, A. Yuille
A Bayesian network formulation for relational shape matching is presented. The main advantage of the relational shape matching approach is the obviation of the nonrigid spatial mappings used by recent nonrigid matching approaches. The basic variables that need to be estimated in the relational shape matching objective function are the global rotation and scale and the local displacements and correspondences. The new Bethe free energy approach is used to estimate the pairwise correspondences between links of the template graphs and the data. The resulting framework is useful in both registration and recognition contexts. Results are shown on hand-drawn templates and on 2D transverse T1-weighted MR images.
提出了一种关系形状匹配的贝叶斯网络公式。关系形状匹配方法的主要优点是避免了最近的非刚性匹配方法所使用的非刚性空间映射。在关系形状匹配目标函数中需要估计的基本变量是全局旋转和尺度以及局部位移和对应。采用新的贝特自由能方法估计模板图链接与数据的成对对应关系。得到的框架在注册和识别上下文中都很有用。结果显示在手绘模板和2D横向t1加权MR图像上。
{"title":"A Bayesian network framework for relational shape matching","authors":"Anand Rangarajan, J. Coughlan, A. Yuille","doi":"10.1109/ICCV.2003.1238412","DOIUrl":"https://doi.org/10.1109/ICCV.2003.1238412","url":null,"abstract":"A Bayesian network formulation for relational shape matching is presented. The main advantage of the relational shape matching approach is the obviation of the nonrigid spatial mappings used by recent nonrigid matching approaches. The basic variables that need to be estimated in the relational shape matching objective function are the global rotation and scale and the local displacements and correspondences. The new Bethe free energy approach is used to estimate the pairwise correspondences between links of the template graphs and the data. The resulting framework is useful in both registration and recognition contexts. Results are shown on hand-drawn templates and on 2D transverse T1-weighted MR images.","PeriodicalId":131580,"journal":{"name":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115517599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Improved fast gauss transform and efficient kernel density estimation 改进了快速高斯变换和高效核密度估计
Pub Date : 2003-10-13 DOI: 10.1109/ICCV.2003.1238383
Changjiang Yang, R. Duraiswami, N. Gumerov, L. Davis
Evaluating sums of multivariate Gaussians is a common computational task in computer vision and pattern recognition, including in the general and powerful kernel density estimation technique. The quadratic computational complexity of the summation is a significant barrier to the scalability of this algorithm to practical applications. The fast Gauss transform (FGT) has successfully accelerated the kernel density estimation to linear running time for low-dimensional problems. Unfortunately, the cost of a direct extension of the FGT to higher-dimensional problems grows exponentially with dimension, making it impractical for dimensions above 3. We develop an improved fast Gauss transform to efficiently estimate sums of Gaussians in higher dimensions, where a new multivariate expansion scheme and an adaptive space subdivision technique dramatically improve the performance. The improved FGT has been applied to the mean shift algorithm achieving linear computational complexity. Experimental results demonstrate the efficiency and effectiveness of our algorithm.
在计算机视觉和模式识别中,多变量高斯和的求值是一项常见的计算任务,包括通用的、功能强大的核密度估计技术。求和的二次计算复杂度是制约该算法在实际应用中的可扩展性的一个重要障碍。快速高斯变换(FGT)成功地将低维问题的核密度估计缩短到线性运行时间。不幸的是,将FGT直接扩展到高维问题的代价随着维数的增加呈指数增长,使得它对于3维以上的问题变得不切实际。我们开发了一种改进的快速高斯变换来有效地估计高维高斯和,其中一个新的多元展开方案和自适应空间细分技术显着提高了性能。将改进后的FGT应用于均值移位算法,实现了线性计算复杂度。实验结果证明了该算法的有效性和有效性。
{"title":"Improved fast gauss transform and efficient kernel density estimation","authors":"Changjiang Yang, R. Duraiswami, N. Gumerov, L. Davis","doi":"10.1109/ICCV.2003.1238383","DOIUrl":"https://doi.org/10.1109/ICCV.2003.1238383","url":null,"abstract":"Evaluating sums of multivariate Gaussians is a common computational task in computer vision and pattern recognition, including in the general and powerful kernel density estimation technique. The quadratic computational complexity of the summation is a significant barrier to the scalability of this algorithm to practical applications. The fast Gauss transform (FGT) has successfully accelerated the kernel density estimation to linear running time for low-dimensional problems. Unfortunately, the cost of a direct extension of the FGT to higher-dimensional problems grows exponentially with dimension, making it impractical for dimensions above 3. We develop an improved fast Gauss transform to efficiently estimate sums of Gaussians in higher dimensions, where a new multivariate expansion scheme and an adaptive space subdivision technique dramatically improve the performance. The improved FGT has been applied to the mean shift algorithm achieving linear computational complexity. Experimental results demonstrate the efficiency and effectiveness of our algorithm.","PeriodicalId":131580,"journal":{"name":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132416359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 516
A background layer model for object tracking through occlusion 一种通过遮挡进行目标跟踪的背景层模型
Pub Date : 2003-10-13 DOI: 10.1109/ICCV.2003.1238469
Yue Zhou, Hai Tao
Motion layer estimation has recently emerged as a promising object tracking method. In this paper, we extend previous research on layer-based tracker by introducing the concept of background occluding layers and explicitly inferring depth ordering of foreground layers. The background occluding layers lie in front of, behind, and in between foreground layers. Each pixel in the background regions belongs to one of these layers and occludes all the foreground layers behind it. Together with the foreground ordering, the complete information necessary for reliably tracking objects through occlusion is included in our representation. An MAP estimation framework is developed to simultaneously update the motion layer parameters, the ordering parameters, and the background occluding layers. Experimental results show that under various conditions with occlusion, including situations with moving objects undergoing complex motions or having complex interactions, our tracking algorithm is able to handle many difficult tracking tasks reliably.
运动层估计是最近出现的一种很有前途的目标跟踪方法。在本文中,我们通过引入背景遮挡层的概念和明确推断前景层的深度顺序,扩展了以往基于层的跟踪器研究。背景遮挡层位于前景层的前面、后面和中间。背景区域中的每个像素都属于这些图层中的一个,并遮挡其后面的所有前景图层。与前景排序一起,通过遮挡可靠地跟踪物体所需的完整信息包含在我们的表示中。提出了一种MAP估计框架,用于同时更新运动层参数、排序参数和背景遮挡层。实验结果表明,在各种遮挡条件下,包括运动物体进行复杂运动或具有复杂相互作用的情况下,我们的跟踪算法能够可靠地处理许多困难的跟踪任务。
{"title":"A background layer model for object tracking through occlusion","authors":"Yue Zhou, Hai Tao","doi":"10.1109/ICCV.2003.1238469","DOIUrl":"https://doi.org/10.1109/ICCV.2003.1238469","url":null,"abstract":"Motion layer estimation has recently emerged as a promising object tracking method. In this paper, we extend previous research on layer-based tracker by introducing the concept of background occluding layers and explicitly inferring depth ordering of foreground layers. The background occluding layers lie in front of, behind, and in between foreground layers. Each pixel in the background regions belongs to one of these layers and occludes all the foreground layers behind it. Together with the foreground ordering, the complete information necessary for reliably tracking objects through occlusion is included in our representation. An MAP estimation framework is developed to simultaneously update the motion layer parameters, the ordering parameters, and the background occluding layers. Experimental results show that under various conditions with occlusion, including situations with moving objects undergoing complex motions or having complex interactions, our tracking algorithm is able to handle many difficult tracking tasks reliably.","PeriodicalId":131580,"journal":{"name":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131888541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 109
期刊
Proceedings Ninth IEEE International Conference on Computer Vision
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1