首页 > 最新文献

Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)最新文献

英文 中文
Affine invariant medial axis and skew symmetry 仿射不变中轴和斜对称
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710814
P. Giblin, G. Sapiro
Affine invariant medial axes and symmetry sets of planar shapes are introduced and studied in this paper. Two different approaches are presented. The first one is based on affine invariant distances, and defines the symmetry set, a set containing the medial axis; as the closure of the locus of points on (at least) two affine normals an affine-equidistant from the corresponding points on the curve. The second approach is based on affine bitangent conics. In this case the symmetry set is defined as the closure of the locus of centers of conics with (at least) three-point contact with two or more distinct points on the curve. This is equivalent to conic and curve having, at those points, the same affine tangent, or the same Euclidean tangent and curvature. Although the two analogous definitions for the classical Euclidean symmetry set (medial axis) are equivalent, this is not the case for the affine group. We then show how to use the symmetry set to detect affine skew symmetry, proving that the contact based symmetry set is a straight line if and only if the given shape is the affine transformation of a symmetric object.
介绍并研究了平面形状的仿射不变内轴和对称集。提出了两种不同的方法。第一个是基于仿射不变距离,定义对称集,一个包含中轴线的集合;作为(至少)两条仿射法线上的点轨迹的闭包,与曲线上的对应点之间的仿射距离相等。第二种方法是基于仿射双代理经济。在这种情况下,对称集被定义为(至少)与曲线上的两个或多个不同的点有三点接触的圆锥曲线的中心轨迹的闭合。这等价于,在这些点上,圆锥曲线和曲线有相同的仿射切线,或者相同的欧几里得切线和曲率。虽然经典欧几里得对称集(中轴)的两个类似定义是等价的,但仿射群的情况并非如此。然后我们展示了如何使用对称集来检测仿射斜对称,证明了基于接触的对称集是一条直线当且仅当给定形状是对称物体的仿射变换。
{"title":"Affine invariant medial axis and skew symmetry","authors":"P. Giblin, G. Sapiro","doi":"10.1109/ICCV.1998.710814","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710814","url":null,"abstract":"Affine invariant medial axes and symmetry sets of planar shapes are introduced and studied in this paper. Two different approaches are presented. The first one is based on affine invariant distances, and defines the symmetry set, a set containing the medial axis; as the closure of the locus of points on (at least) two affine normals an affine-equidistant from the corresponding points on the curve. The second approach is based on affine bitangent conics. In this case the symmetry set is defined as the closure of the locus of centers of conics with (at least) three-point contact with two or more distinct points on the curve. This is equivalent to conic and curve having, at those points, the same affine tangent, or the same Euclidean tangent and curvature. Although the two analogous definitions for the classical Euclidean symmetry set (medial axis) are equivalent, this is not the case for the affine group. We then show how to use the symmetry set to detect affine skew symmetry, proving that the contact based symmetry set is a straight line if and only if the given shape is the affine transformation of a symmetric object.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134010672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
The cascaded Hough transform as an aid in aerial image interpretation 级联霍夫变换在航空图像判读中的辅助作用
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710702
T. Tuytelaars, L. Gool, M. Proesmans, T. Moons
Cartography and other applications of remote sensing have led to an increased interest in the (semi-)automatic interpretation of structures in aerial images of urban and suburban areas. Although these areas are particularly challenging because of their complexity, the degree of regularity in such man-made structures also helps to tackle the problems. The paper presents the iterated application of the Hough transform as a means to exploit such regularities. It shows how such 'Cascaded Hough Transform' (or CHT for short) yields straight lines, vanishing points, and vanishing lines. It also illustrates how the latter assist in improving the precision of the former. The examples are based on real aerial photographs.
地图学和遥感的其他应用使人们对城市和郊区航空图象结构的(半)自动判读越来越感兴趣。尽管这些地区因其复杂性而特别具有挑战性,但这些人造结构的规律性也有助于解决这些问题。本文介绍了霍夫变换的迭代应用,作为挖掘这种规律的一种手段。它展示了这种“级联霍夫变换”(简称CHT)如何产生直线,消失点和消失线。它还说明了后者如何帮助提高前者的精度。这些例子是基于真实的航空照片。
{"title":"The cascaded Hough transform as an aid in aerial image interpretation","authors":"T. Tuytelaars, L. Gool, M. Proesmans, T. Moons","doi":"10.1109/ICCV.1998.710702","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710702","url":null,"abstract":"Cartography and other applications of remote sensing have led to an increased interest in the (semi-)automatic interpretation of structures in aerial images of urban and suburban areas. Although these areas are particularly challenging because of their complexity, the degree of regularity in such man-made structures also helps to tackle the problems. The paper presents the iterated application of the Hough transform as a means to exploit such regularities. It shows how such 'Cascaded Hough Transform' (or CHT for short) yields straight lines, vanishing points, and vanishing lines. It also illustrates how the latter assist in improving the precision of the former. The examples are based on real aerial photographs.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134327763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 111
A maximum-flow formulation of the N-camera stereo correspondence problem n摄像机立体对应问题的最大流公式
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710763
S. Roy, I. Cox
This paper describes a new algorithm for solving the N-camera stereo correspondence problem by transforming it into a maximum-flow problem. Once solved, the minimum-cut associated to the maximum-flow yields a disparity surface for the whole image at once. This global approach to stereo analysis provides a more accurate and coherent depth map than the traditional line-by-line stereo. Moreover, the optimality of the depth surface is guaranteed and can be shown to be a generalization of the dynamic programming approach that is widely used in standard stereo. Results show improved depth estimation as well as better handling of depth discontinuities. While the worst case running time is O(n/sup 2/d/sup 2/log(nd)), the observed average running time is O(n/sup 1.2/ d/sup 1.3/) for an image size of n pixels and depth resolution d.
本文将n摄像机立体对应问题转化为最大流问题,提出了一种求解n摄像机立体对应问题的新算法。一旦解决,与最大流量相关的最小切割立即产生整个图像的视差表面。这种立体声分析的全局方法提供了比传统逐行立体声更准确和连贯的深度图。此外,该方法还保证了深度曲面的最优性,是对标准立体中广泛使用的动态规划方法的一种推广。结果表明,深度估计得到了改进,深度不连续性得到了更好的处理。最坏情况下的运行时间为0 (n/sup 2/d/sup 2/log(nd)),而对于图像大小为n像素、深度分辨率为d的情况,观察到的平均运行时间为0 (n/sup 1.2/ d/sup 1.3/)。
{"title":"A maximum-flow formulation of the N-camera stereo correspondence problem","authors":"S. Roy, I. Cox","doi":"10.1109/ICCV.1998.710763","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710763","url":null,"abstract":"This paper describes a new algorithm for solving the N-camera stereo correspondence problem by transforming it into a maximum-flow problem. Once solved, the minimum-cut associated to the maximum-flow yields a disparity surface for the whole image at once. This global approach to stereo analysis provides a more accurate and coherent depth map than the traditional line-by-line stereo. Moreover, the optimality of the depth surface is guaranteed and can be shown to be a generalization of the dynamic programming approach that is widely used in standard stereo. Results show improved depth estimation as well as better handling of depth discontinuities. While the worst case running time is O(n/sup 2/d/sup 2/log(nd)), the observed average running time is O(n/sup 1.2/ d/sup 1.3/) for an image size of n pixels and depth resolution d.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132157880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 604
Representation and self-similarity of shapes 形状的表征与自相似性
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710858
Tyng-Luh Liu, D. Geiger, R. Kohn
Representing shapes is a significant problem for vision systems that must recognize or classify objects. We derive a representation for a given shape by investigating its self-similarities, and constructing its shape axis (SA) and shape axis tree (SA-tree). We start with a shape, its boundary contour, and two different parameterizations for the contour. To measure its self-similarity we consider matching pairs of points (and their tangents) along the boundary contour, i.e., matching the two parameterizations. The matching, of self-similarity criteria may vary, e.g., co-circularity, parallelism, distance, region homogeneity. The loci of middle points of the pairing contour points are the shape axis and they can be grouped into a unique tree graph, the SA-tree. The shape axis for the co-circularity criteria is compared to the symmetry axis. An interpretation in terms of object parts is also presented.
对于必须识别或分类物体的视觉系统来说,表示形状是一个重要的问题。我们通过研究给定形状的自相似性,构造形状轴(SA)和形状轴树(SA-tree),推导出形状的表示。我们从一个形状,它的边界轮廓,以及轮廓的两种不同的参数化开始。为了测量其自相似性,我们考虑沿边界轮廓匹配点对(及其切线),即匹配两个参数化。自相似标准的匹配可能不同,如共圆度、平行度、距离、区域均匀性等。配对轮廓点的中间点轨迹为形状轴,它们可以组合成唯一的树状图SA-tree。将共圆准则的形状轴与对称轴进行比较。本文还提出了一种基于对象部分的解释。
{"title":"Representation and self-similarity of shapes","authors":"Tyng-Luh Liu, D. Geiger, R. Kohn","doi":"10.1109/ICCV.1998.710858","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710858","url":null,"abstract":"Representing shapes is a significant problem for vision systems that must recognize or classify objects. We derive a representation for a given shape by investigating its self-similarities, and constructing its shape axis (SA) and shape axis tree (SA-tree). We start with a shape, its boundary contour, and two different parameterizations for the contour. To measure its self-similarity we consider matching pairs of points (and their tangents) along the boundary contour, i.e., matching the two parameterizations. The matching, of self-similarity criteria may vary, e.g., co-circularity, parallelism, distance, region homogeneity. The loci of middle points of the pairing contour points are the shape axis and they can be grouped into a unique tree graph, the SA-tree. The shape axis for the co-circularity criteria is compared to the symmetry axis. An interpretation in terms of object parts is also presented.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132508492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 130
Transinformation for active object recognition 主动目标识别的信息转换
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710726
B. Schiele, J. Crowley
This article develops an analogy between object recognition and the transmission of information through a channel based on the statistical representation of the appearances of 3D objects. This analogy provides a means to quantitatively evaluate the contribution of individual receptive field vectors, and to predict the performance of the object recognition process. Transinformation also provides a quantitative measure of the discrimination provided by each viewpoint, thus permitting the determination of the most discriminant viewpoints. As an application, the article develops an active object recognition algorithm which is able to resolve ambiguities inherent in a single-view recognition algorithm.
本文在物体识别和通过基于三维物体外观的统计表示的通道传输信息之间进行了类比。这种类比提供了一种定量评估个体接受野向量贡献的方法,并预测对象识别过程的性能。跨信息还提供了对每个观点所提供的歧视的定量衡量,从而允许确定最具歧视性的观点。作为一种应用,本文开发了一种主动目标识别算法,该算法能够解决单视图识别算法固有的模糊性。
{"title":"Transinformation for active object recognition","authors":"B. Schiele, J. Crowley","doi":"10.1109/ICCV.1998.710726","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710726","url":null,"abstract":"This article develops an analogy between object recognition and the transmission of information through a channel based on the statistical representation of the appearances of 3D objects. This analogy provides a means to quantitatively evaluate the contribution of individual receptive field vectors, and to predict the performance of the object recognition process. Transinformation also provides a quantitative measure of the discrimination provided by each viewpoint, thus permitting the determination of the most discriminant viewpoints. As an application, the article develops an active object recognition algorithm which is able to resolve ambiguities inherent in a single-view recognition algorithm.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133273223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 83
Intensity and feature based stereo matching by disparity parameterization 基于强度和特征的视差参数化立体匹配
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710844
G. Wei, G. Hirzinger
In this paper, we propose a new solution to the stereo correspondence problem by including features an intensity based matching. The features we use are intensity gradients in both the x and y directions of the left and the deformed right images. Although a uniform smoothness constraint is still used, it is nevertheless applied only to non-feature regions. To avoid local minima in function minimization, we propose to parameterize the disparity function by hierarchical Gaussians. A simple stochastic gradient method is used to estimate the Gaussian weights. Experiments with various real stereo images show robust performances.
在本文中,我们提出了一种新的解决立体对应问题的方法,即基于强度的匹配包含特征。我们使用的特征是左图像和变形的右图像在x和y方向上的强度梯度。尽管仍然使用均匀平滑约束,但它仅适用于非特征区域。为了避免在函数最小化中出现局部极小值,我们提出用分层高斯函数参数化视差函数。用一种简单的随机梯度法估计高斯权重。对各种真实立体图像的实验结果表明,该方法具有较好的鲁棒性。
{"title":"Intensity and feature based stereo matching by disparity parameterization","authors":"G. Wei, G. Hirzinger","doi":"10.1109/ICCV.1998.710844","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710844","url":null,"abstract":"In this paper, we propose a new solution to the stereo correspondence problem by including features an intensity based matching. The features we use are intensity gradients in both the x and y directions of the left and the deformed right images. Although a uniform smoothness constraint is still used, it is nevertheless applied only to non-feature regions. To avoid local minima in function minimization, we propose to parameterize the disparity function by hierarchical Gaussians. A simple stochastic gradient method is used to estimate the Gaussian weights. Experiments with various real stereo images show robust performances.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114490295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Physics-based 3D position analysis of a soccer ball from monocular image sequences 基于物理的单目图像序列足球三维位置分析
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710797
Taeone Kim, Y. Seo, K. Hong
In this paper, we propose a method for locating 3D position of a soccer ball from monocular image sequence of soccer games. Toward this goal, we adopted ground-model-to-image transformation together with physics-based approach, that a ball follows the parabolic trajectory in the air. By using the transformation the heights of a ball can be easily calculated using simple triangular geometric relations given the start and the end position of the ball on the ground. Here the heights of a ball are determined in terms of a player's height. Even if the end position of a ball is not given on the ground due to kicking or heading of a falling ball before it touches the ground, the most probable trajectory can be determined by searching based on the physical fact that the ball follows a parabolic trajectory in the air. We have tested and experimented with a real image sequence the results of which seem promising.
本文提出了一种从足球比赛的单目图像序列中定位足球三维位置的方法。为了实现这一目标,我们采用了地面模型到图像的转换以及基于物理的方法,即球在空中遵循抛物线轨迹。通过使用变换,可以很容易地计算出球的高度,使用简单的三角形几何关系给定球在地面上的开始和结束位置。在这里,球的高度是由球员的身高决定的。即使一个球在落地前由于被踢或头球而不能在地面上给出它的最终位置,根据球在空中沿抛物线轨迹的物理事实,通过搜索可以确定最可能的轨迹。我们对一个真实的图像序列进行了测试和实验,结果似乎很有希望。
{"title":"Physics-based 3D position analysis of a soccer ball from monocular image sequences","authors":"Taeone Kim, Y. Seo, K. Hong","doi":"10.1109/ICCV.1998.710797","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710797","url":null,"abstract":"In this paper, we propose a method for locating 3D position of a soccer ball from monocular image sequence of soccer games. Toward this goal, we adopted ground-model-to-image transformation together with physics-based approach, that a ball follows the parabolic trajectory in the air. By using the transformation the heights of a ball can be easily calculated using simple triangular geometric relations given the start and the end position of the ball on the ground. Here the heights of a ball are determined in terms of a player's height. Even if the end position of a ball is not given on the ground due to kicking or heading of a falling ball before it touches the ground, the most probable trajectory can be determined by searching based on the physical fact that the ball follows a parabolic trajectory in the air. We have tested and experimented with a real image sequence the results of which seem promising.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114499960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 91
A task driven 3D object recognition system using Bayesian networks 基于贝叶斯网络的任务驱动三维目标识别系统
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710767
Björn Krebs, B. Korn, M. Burkhardt
In this paper we propose a general framework to build a task oriented 3D object recognition system for CAD based vision (CBV). Features from 3D space curves representing the object's rims provide sufficient information to allow identification and pose estimation of industrial CAD models. However, features relying on differential surface properties tend to be very vulnerable with respect to noise. To model the statistical behavior of the data we introduce Bayesian nets which model the relationship between objects and observable features. Furthermore, task oriented selection of the optimal action to reduce the uncertainty of recognition results is incorporated into the Bayesian nets. This enables the integration of intelligent recognition strategies depending on the already acquired evidence into a robust, and efficient, 3D CAD based recognition system.
本文提出了一个基于CAD视觉(CBV)的面向任务的三维物体识别系统的总体框架。来自代表物体边缘的三维空间曲线的特征提供了足够的信息,以允许工业CAD模型的识别和姿态估计。然而,依赖于不同表面特性的特征往往非常容易受到噪声的影响。为了对数据的统计行为进行建模,我们引入了贝叶斯网络,该网络对对象和可观察特征之间的关系进行建模。此外,在贝叶斯网络中引入了以任务为导向的最优行为选择,以减少识别结果的不确定性。这使得基于已经获得的证据的智能识别策略集成到一个强大、高效的基于3D CAD的识别系统中。
{"title":"A task driven 3D object recognition system using Bayesian networks","authors":"Björn Krebs, B. Korn, M. Burkhardt","doi":"10.1109/ICCV.1998.710767","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710767","url":null,"abstract":"In this paper we propose a general framework to build a task oriented 3D object recognition system for CAD based vision (CBV). Features from 3D space curves representing the object's rims provide sufficient information to allow identification and pose estimation of industrial CAD models. However, features relying on differential surface properties tend to be very vulnerable with respect to noise. To model the statistical behavior of the data we introduce Bayesian nets which model the relationship between objects and observable features. Furthermore, task oriented selection of the optimal action to reduce the uncertainty of recognition results is incorporated into the Bayesian nets. This enables the integration of intelligent recognition strategies depending on the already acquired evidence into a robust, and efficient, 3D CAD based recognition system.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131857788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
View-based object matching 基于视图的对象匹配
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710777
A. Shokoufandeh, I. Marsic, Sven J. Dickinson
We introduce a novel view-based object representation, called the saliency map graph (SMG), which captures the salient regions of an object view at multiple scales using a wavelet transform. This compact representation is highly invariant to translation, rotation (image and depth), and scaling, and offers the locality of representation required for occluded object recognition. To compare two saliency map graphs, we introduce two graph similarity algorithms. The first computes the topological similarity between two SMG's, providing a coarse-level matching of two graphs. The second computes the geometrical similarity between two SMG's, providing a fine-level matching of two graphs. We test and compare these two algorithms on a large database of model object views.
我们引入了一种新的基于视图的对象表示,称为显著性映射图(SMG),它使用小波变换在多个尺度上捕获对象视图的显著区域。这种紧凑的表示对平移、旋转(图像和深度)和缩放具有高度的不变性,并提供了遮挡物体识别所需的表示的局域性。为了比较两种显著性图,我们引入了两种图相似算法。第一个计算两个SMG之间的拓扑相似性,提供两个图的粗略匹配。第二个计算两个SMG之间的几何相似性,提供两个图的精细匹配。我们在一个大型模型对象视图数据库上对这两种算法进行了测试和比较。
{"title":"View-based object matching","authors":"A. Shokoufandeh, I. Marsic, Sven J. Dickinson","doi":"10.1109/ICCV.1998.710777","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710777","url":null,"abstract":"We introduce a novel view-based object representation, called the saliency map graph (SMG), which captures the salient regions of an object view at multiple scales using a wavelet transform. This compact representation is highly invariant to translation, rotation (image and depth), and scaling, and offers the locality of representation required for occluded object recognition. To compare two saliency map graphs, we introduce two graph similarity algorithms. The first computes the topological similarity between two SMG's, providing a coarse-level matching of two graphs. The second computes the geometrical similarity between two SMG's, providing a fine-level matching of two graphs. We test and compare these two algorithms on a large database of model object views.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128569046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Visual motion estimation and prediction: a probabilistic network model for temporal coherence 视觉运动估计与预测:时间相干性的概率网络模型
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710834
A. Yuille, Pierre-Yves Burgi, N. Grzywacz
We develop a theory for the temporal integration of visual motion motivated by psychophysical experiments. The theory proposes that input data are temporally grouped and used to predict and estimate motion flows in the image sequences. Our theory is expressed in terms of the Bayesian generalization of standard Kalman filtering which allows us to solve temporal grouping in conjunction with prediction and estimation. As demonstrated for tracking isolated contours the Bayesian formulation is superior to approaches which use data association as a first stage followed by conventional Kalman filtering. Our computer simulations demonstrate that our theory qualitatively accounts for several psychophysical experiments on motion occlusion and motion outliers.
在心理物理实验的基础上,提出了视觉运动的时间整合理论。该理论提出将输入数据暂时分组,并用于预测和估计图像序列中的运动流。我们的理论是用标准卡尔曼滤波的贝叶斯泛化来表达的,它允许我们解决与预测和估计相结合的时间分组。正如所示,对于跟踪孤立轮廓,贝叶斯公式优于使用数据关联作为第一阶段,然后是传统卡尔曼滤波的方法。我们的计算机模拟表明,我们的理论定性地解释了几个关于运动遮挡和运动异常值的心理物理实验。
{"title":"Visual motion estimation and prediction: a probabilistic network model for temporal coherence","authors":"A. Yuille, Pierre-Yves Burgi, N. Grzywacz","doi":"10.1109/ICCV.1998.710834","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710834","url":null,"abstract":"We develop a theory for the temporal integration of visual motion motivated by psychophysical experiments. The theory proposes that input data are temporally grouped and used to predict and estimate motion flows in the image sequences. Our theory is expressed in terms of the Bayesian generalization of standard Kalman filtering which allows us to solve temporal grouping in conjunction with prediction and estimation. As demonstrated for tracking isolated contours the Bayesian formulation is superior to approaches which use data association as a first stage followed by conventional Kalman filtering. Our computer simulations demonstrate that our theory qualitatively accounts for several psychophysical experiments on motion occlusion and motion outliers.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134478639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
期刊
Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1