首页 > 最新文献

2007 IEEE Conference on Computer Vision and Pattern Recognition最新文献

英文 中文
Optimal Step Nonrigid ICP Algorithms for Surface Registration 曲面配准的最优步非刚性ICP算法
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383165
Brian Amberg, S. Romdhani, T. Vetter
We show how to extend the ICP framework to nonrigid registration, while retaining the convergence properties of the original algorithm. The resulting optimal step nonrigid ICP framework allows the use of different regularisations, as long as they have an adjustable stiffness parameter. The registration loops over a series of decreasing stiffness weights, and incrementally deforms the template towards the target, recovering the whole range of global and local deformations. To find the optimal deformation for a given stiffness, optimal iterative closest point steps are used. Preliminary correspondences are estimated by a nearest-point search. Then the optimal deformation of the template for these fixed correspondences and the active stiffness is calculated. Afterwards the process continues with new correspondences found by searching from the displaced template vertices. We present an algorithm using a locally affine regularisation which assigns an affine transformation to each vertex and minimises the difference in the transformation of neighbouring vertices. It is shown that for this regularisation the optimal deformation for fixed correspondences and fixed stiffness can be determined exactly and efficiently. The method succeeds for a wide range of initial conditions, and handles missing data robustly. It is compared qualitatively and quantitatively to other algorithms using synthetic examples and real world data.
我们展示了如何将ICP框架扩展到非刚性配准,同时保留了原始算法的收敛性。所得到的最佳步骤非刚性ICP框架允许使用不同的正则化,只要它们具有可调的刚度参数。配准循环在一系列减小的刚度权值上,并逐渐使模板向目标变形,恢复全局和局部变形的整个范围。为求给定刚度下的最优变形,采用最优迭代最近点步。初步对应是通过最近点搜索估计的。然后计算模板在这些固定对应下的最优变形量和主动刚度。之后,该过程继续通过从移位的模板顶点搜索找到新的对应关系。我们提出了一种使用局部仿射正则化的算法,该算法为每个顶点分配一个仿射变换,并使相邻顶点的变换差异最小化。结果表明,这种正则化方法可以准确有效地确定固定对应和固定刚度的最优变形。该方法适用于多种初始条件,并能鲁棒地处理缺失数据。使用合成示例和真实世界数据,将其定性和定量地与其他算法进行比较。
{"title":"Optimal Step Nonrigid ICP Algorithms for Surface Registration","authors":"Brian Amberg, S. Romdhani, T. Vetter","doi":"10.1109/CVPR.2007.383165","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383165","url":null,"abstract":"We show how to extend the ICP framework to nonrigid registration, while retaining the convergence properties of the original algorithm. The resulting optimal step nonrigid ICP framework allows the use of different regularisations, as long as they have an adjustable stiffness parameter. The registration loops over a series of decreasing stiffness weights, and incrementally deforms the template towards the target, recovering the whole range of global and local deformations. To find the optimal deformation for a given stiffness, optimal iterative closest point steps are used. Preliminary correspondences are estimated by a nearest-point search. Then the optimal deformation of the template for these fixed correspondences and the active stiffness is calculated. Afterwards the process continues with new correspondences found by searching from the displaced template vertices. We present an algorithm using a locally affine regularisation which assigns an affine transformation to each vertex and minimises the difference in the transformation of neighbouring vertices. It is shown that for this regularisation the optimal deformation for fixed correspondences and fixed stiffness can be determined exactly and efficiently. The method succeeds for a wide range of initial conditions, and handles missing data robustly. It is compared qualitatively and quantitatively to other algorithms using synthetic examples and real world data.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126122679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 714
A minimal solution to the autocalibration of radial distortion 径向畸变自动标定的最小解
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383063
Z. Kukelova, T. Pajdla
Epipolar geometry and relative camera pose computation are examples of tasks which can be formulated as minimal problems and solved from a minimal number of image points. Finding the solution leads to solving systems of algebraic equations. Often, these systems are not trivial and therefore special algorithms have to be designed to achieve numerical robustness and computational efficiency. In this paper we provide a solution to the problem of estimating radial distortion and epipolar geometry from eight correspondences in two images. Unlike previous algorithms, which were able to solve the problem from nine correspondences only, we enforce the determinant of the fundamental matrix be zero. This leads to a system of eight quadratic and one cubic equation in nine variables. We simplify this system by eliminating six of these variables. Then, we solve the system by finding eigenvectors of an action matrix of a suitably chosen polynomial. We show how to construct the action matrix without computing complete Grobner basis, which provides an efficient and robust solver. The quality of the solver is demonstrated on synthetic and real data.
极几何和相对相机姿态计算是可以表述为最小问题并从最小数量的图像点解决的任务的例子。求解导致求解代数方程组。通常,这些系统不是微不足道的,因此必须设计特殊的算法来实现数值鲁棒性和计算效率。本文给出了一种从两幅图像的8个对应点估计径向畸变和极几何的方法。不像以前的算法,只能从9个对应中解决问题,我们强制基本矩阵的行列式为零。这导致了一个由九个变量的八个二次方程和一个三次方程组成的系统。我们通过消去其中的六个变量来简化这个系统。然后,我们通过寻找一个适当选择的多项式的作用矩阵的特征向量来求解系统。我们展示了如何在不计算完全Grobner基的情况下构造动作矩阵,从而提供了一个高效且鲁棒的求解器。通过综合数据和实际数据验证了该求解器的有效性。
{"title":"A minimal solution to the autocalibration of radial distortion","authors":"Z. Kukelova, T. Pajdla","doi":"10.1109/CVPR.2007.383063","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383063","url":null,"abstract":"Epipolar geometry and relative camera pose computation are examples of tasks which can be formulated as minimal problems and solved from a minimal number of image points. Finding the solution leads to solving systems of algebraic equations. Often, these systems are not trivial and therefore special algorithms have to be designed to achieve numerical robustness and computational efficiency. In this paper we provide a solution to the problem of estimating radial distortion and epipolar geometry from eight correspondences in two images. Unlike previous algorithms, which were able to solve the problem from nine correspondences only, we enforce the determinant of the fundamental matrix be zero. This leads to a system of eight quadratic and one cubic equation in nine variables. We simplify this system by eliminating six of these variables. Then, we solve the system by finding eigenvectors of an action matrix of a suitably chosen polynomial. We show how to construct the action matrix without computing complete Grobner basis, which provides an efficient and robust solver. The quality of the solver is demonstrated on synthetic and real data.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129482145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 79
Multi-scale Features for Detection and Segmentation of Rocks in Mars Images 火星图像中岩石的多尺度特征检测与分割
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383257
H. Dunlop, D. Thompson, David S. Wettergreen
Geologists and planetary scientists will benefit from methods for accurate segmentation of rocks in natural scenes. However, rocks are poorly suited for current visual segmentation techniques - they exhibit diverse morphologies and have no uniform property to distinguish them from background soil. We address this challenge with a novel detection and segmentation method incorporating features from multiple scales. These features include local attributes such as texture, object attributes such as shading and two-dimensional shape, and scene attributes such as the direction of illumination. Our method uses a superpixel segmentation followed by region-merging to search for the most probable groups of superpixels. A learned model of rock appearances identifies whole rocks by scoring candidate superpixel groupings. We evaluate our method's performance on representative images from the Mars Exploration Rover catalog.
地质学家和行星科学家将受益于在自然场景中精确分割岩石的方法。然而,岩石不适合当前的视觉分割技术——它们表现出不同的形态,没有统一的属性来区分它们与背景土壤。我们用一种新的检测和分割方法来解决这一挑战,该方法结合了来自多个尺度的特征。这些特征包括局部属性(如纹理)、对象属性(如阴影和二维形状)以及场景属性(如照明方向)。我们的方法使用超像素分割和区域合并来搜索最可能的超像素组。岩石外观的学习模型通过评分候选超像素分组来识别整个岩石。我们对来自火星探测车目录的代表性图像进行了性能评估。
{"title":"Multi-scale Features for Detection and Segmentation of Rocks in Mars Images","authors":"H. Dunlop, D. Thompson, David S. Wettergreen","doi":"10.1109/CVPR.2007.383257","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383257","url":null,"abstract":"Geologists and planetary scientists will benefit from methods for accurate segmentation of rocks in natural scenes. However, rocks are poorly suited for current visual segmentation techniques - they exhibit diverse morphologies and have no uniform property to distinguish them from background soil. We address this challenge with a novel detection and segmentation method incorporating features from multiple scales. These features include local attributes such as texture, object attributes such as shading and two-dimensional shape, and scene attributes such as the direction of illumination. Our method uses a superpixel segmentation followed by region-merging to search for the most probable groups of superpixels. A learned model of rock appearances identifies whole rocks by scoring candidate superpixel groupings. We evaluate our method's performance on representative images from the Mars Exploration Rover catalog.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129704478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 47
Inter-Reflection Compensation for Immersive Projection Display 沉浸式投影显示的反射间补偿
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383473
H. Habe, Nobuo Saeki, T. Matsuyama
This paper proposes an effective method for compensating inter-reflection in immersive projection displays (IPDs). Because IPDs project images onto a screen, which surrounds a viewer, we have perform out both geometric and photometric corrections. Our method compensates inter-reflection on the screen. It requires no special device, and approximates both diffuse and specular reflections on the screen using block-based photometric calibration.
提出了一种有效的沉浸式投影显示器间反射补偿方法。由于ipd将图像投射到观众周围的屏幕上,因此我们必须执行几何和光度校正。我们的方法补偿了屏幕上的相互反射。它不需要特殊的设备,并且使用基于块的光度校准来近似屏幕上的漫反射和镜面反射。
{"title":"Inter-Reflection Compensation for Immersive Projection Display","authors":"H. Habe, Nobuo Saeki, T. Matsuyama","doi":"10.1109/CVPR.2007.383473","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383473","url":null,"abstract":"This paper proposes an effective method for compensating inter-reflection in immersive projection displays (IPDs). Because IPDs project images onto a screen, which surrounds a viewer, we have perform out both geometric and photometric corrections. Our method compensates inter-reflection on the screen. It requires no special device, and approximates both diffuse and specular reflections on the screen using block-based photometric calibration.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128306584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Texture-Preserving Shadow Removal in Color Images Containing Curved Surfaces 包含曲面的彩色图像中保留纹理的阴影去除
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383081
E. Arbel, H. Hel-Or
Several approaches to shadow removal in color images have been introduced in recent years. Yet these methods fail in removing shadows that are cast on curved surfaces, as well as retaining the original texture of the image in shadow boundaries, known as penumbra regions. In this paper, we propose a novel approach which effectively removes shadows from curved surfaces while retaining the textural information in the penumbra, yielding high quality shadow-free images. Our approach aims at finding scale factors to cancel the effect of shadows, including penumbra regions where illumination changes gradually. Due to the fact that surface geometry is also taken into account when computing the scale factors, our method can handle a wider range of shadow images than current state-of-the-art methods, as demonstrated by several examples.
近年来介绍了几种消除彩色图像阴影的方法。然而,这些方法无法去除投射在曲面上的阴影,也无法在阴影边界(称为半影区域)中保留图像的原始纹理。在本文中,我们提出了一种新的方法,可以有效地去除曲面上的阴影,同时保留半影中的纹理信息,从而产生高质量的无阴影图像。我们的方法旨在寻找比例因子来消除阴影的影响,包括光照逐渐变化的半影区域。由于在计算比例因子时也考虑了表面几何形状,因此我们的方法可以处理比当前最先进的方法更大范围的阴影图像,如几个例子所示。
{"title":"Texture-Preserving Shadow Removal in Color Images Containing Curved Surfaces","authors":"E. Arbel, H. Hel-Or","doi":"10.1109/CVPR.2007.383081","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383081","url":null,"abstract":"Several approaches to shadow removal in color images have been introduced in recent years. Yet these methods fail in removing shadows that are cast on curved surfaces, as well as retaining the original texture of the image in shadow boundaries, known as penumbra regions. In this paper, we propose a novel approach which effectively removes shadows from curved surfaces while retaining the textural information in the penumbra, yielding high quality shadow-free images. Our approach aims at finding scale factors to cancel the effect of shadows, including penumbra regions where illumination changes gradually. Due to the fact that surface geometry is also taken into account when computing the scale factors, our method can handle a wider range of shadow images than current state-of-the-art methods, as demonstrated by several examples.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"27 1 Suppl 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127296230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 60
Pedestrian Detection in Infrared Images based on Local Shape Features 基于局部形状特征的红外图像行人检测
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383452
Li Zhang, Bo Wu, R. Nevatia
Use of IR images is advantageous for many surveillance applications where the systems must operate around the clock and external illumination is not always available. We investigate the methods derived from visible spectrum analysis for the task of human detection. Two feature classes (edgelets and HOG features) and two classification models(AdaBoost and SVM cascade) are extended to IR images. We find out that it is possible to get detection performance in IR images that is comparable to state-of-the-art results for visible spectrum images. It is also shown that the two domains share many features, likely originating from the silhouettes, in spite of the starkly different appearances of the two modalities.
红外图像的使用对于许多监视应用是有利的,这些应用中系统必须全天候运行,并且外部照明并不总是可用。我们研究了人类检测任务中可见光谱分析的方法。将两个特征类(edgelets和HOG特征)和两个分类模型(AdaBoost和SVM级联)扩展到红外图像。我们发现有可能在红外图像中获得与可见光谱图像的最先进结果相当的检测性能。研究还表明,尽管两种模式的外观截然不同,但这两个领域有许多共同的特征,可能源于轮廓。
{"title":"Pedestrian Detection in Infrared Images based on Local Shape Features","authors":"Li Zhang, Bo Wu, R. Nevatia","doi":"10.1109/CVPR.2007.383452","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383452","url":null,"abstract":"Use of IR images is advantageous for many surveillance applications where the systems must operate around the clock and external illumination is not always available. We investigate the methods derived from visible spectrum analysis for the task of human detection. Two feature classes (edgelets and HOG features) and two classification models(AdaBoost and SVM cascade) are extended to IR images. We find out that it is possible to get detection performance in IR images that is comparable to state-of-the-art results for visible spectrum images. It is also shown that the two domains share many features, likely originating from the silhouettes, in spite of the starkly different appearances of the two modalities.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132386221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 162
Gradient Intensity: A New Mutual Information-Based Registration Method 梯度强度:一种新的基于互信息的配准方法
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383425
R. Shams, P. Sadeghi, R. Kennedy
Conventional mutual information (Ml)-based registration using pixel intensities is time-consuming and ignores spatial information, which can lead to misalignment. We propose a method to overcome these limitation by acquiring initial estimates of transformation parameters. We introduce the concept of 'gradient intensity' as a measure of spatial strength of an image in a given direction. We determine the rotation parameter by maximizing the MI between gradient intensity histograms. Calculation of the gradient intensity MI function is extremely efficient. Our method is designed to be invariant to scale and translation between the images. We then obtain estimates of scale and translation parameters using methods based on the centroids of gradient images. The estimated parameters are used to initialize an optimization algorithm which is designed to converge more quickly than the standard Powell algorithm in close proximity of the minimum. Experiments show that our method significantly improves the performance of the registration task and reduces the overall computational complexity by an order of magnitude.
传统的基于互信息(Ml)的配准使用像素强度,耗时且忽略空间信息,可能导致不对齐。我们提出了一种通过获取变换参数的初始估计来克服这些限制的方法。我们引入了“梯度强度”的概念,作为给定方向上图像空间强度的度量。我们通过最大化梯度强度直方图之间的MI来确定旋转参数。梯度强度MI函数的计算非常高效。我们的方法对图像之间的缩放和平移具有不变性。然后,我们使用基于梯度图像质心的方法获得尺度和平移参数的估计。利用估计的参数初始化一个优化算法,该算法在接近最小值时比标准鲍威尔算法收敛得更快。实验表明,该方法显著提高了配准任务的性能,并将整体计算复杂度降低了一个数量级。
{"title":"Gradient Intensity: A New Mutual Information-Based Registration Method","authors":"R. Shams, P. Sadeghi, R. Kennedy","doi":"10.1109/CVPR.2007.383425","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383425","url":null,"abstract":"Conventional mutual information (Ml)-based registration using pixel intensities is time-consuming and ignores spatial information, which can lead to misalignment. We propose a method to overcome these limitation by acquiring initial estimates of transformation parameters. We introduce the concept of 'gradient intensity' as a measure of spatial strength of an image in a given direction. We determine the rotation parameter by maximizing the MI between gradient intensity histograms. Calculation of the gradient intensity MI function is extremely efficient. Our method is designed to be invariant to scale and translation between the images. We then obtain estimates of scale and translation parameters using methods based on the centroids of gradient images. The estimated parameters are used to initialize an optimization algorithm which is designed to converge more quickly than the standard Powell algorithm in close proximity of the minimum. Experiments show that our method significantly improves the performance of the registration task and reduces the overall computational complexity by an order of magnitude.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132969337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Sequential Architecture for Efficient Car Detection 高效汽车检测的顺序结构
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383499
Zhenfeng Zhu, Yao Zhao, Hanqing Lu
Based on multi-cue integration and hierarchical SVM, we present a sequential architecture for efficient car detection under complex outdoor scene in this paper. On the low level, two novel area templates based on edge and interest-point cues respectively are first constructed, which can be applied to forming the identities of visual perception to some extent and thus utilized to reject rapidly most of the negative non-car objects at the cost of missing few of the true ones. Moreover on the high level, both global structure and local texture cues are exploited to characterize the car objects precisely. To improve the computational efficiency of general SVM, a solution approximating based two-level hierarchical SVM is proposed. The experimental results show that the integration of global structure and local texture properties provides more powerful ability in discrimination of car objects from non-car ones. The final high detection performance also contributes to the utilizing of two novel low level visual cues and the hierarchical SVM.
本文基于多线索集成和分层支持向量机,提出了一种复杂户外场景下的高效车辆检测序列架构。在低层次上,首先构建了基于边缘和兴趣点线索的两个新的区域模板,可以在一定程度上用于形成视觉感知的身份,从而以丢失少量真实物体为代价快速拒绝大多数负面非汽车物体。此外,在高层次上,利用全局结构和局部纹理线索来精确表征汽车对象。为了提高一般支持向量机的计算效率,提出了一种基于近似解的两级分层支持向量机。实验结果表明,将全局结构和局部纹理属性相结合的方法能够更好地区分汽车和非汽车目标。最终的高检测性能还得益于两种新颖的低层次视觉线索和分层支持向量机的利用。
{"title":"Sequential Architecture for Efficient Car Detection","authors":"Zhenfeng Zhu, Yao Zhao, Hanqing Lu","doi":"10.1109/CVPR.2007.383499","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383499","url":null,"abstract":"Based on multi-cue integration and hierarchical SVM, we present a sequential architecture for efficient car detection under complex outdoor scene in this paper. On the low level, two novel area templates based on edge and interest-point cues respectively are first constructed, which can be applied to forming the identities of visual perception to some extent and thus utilized to reject rapidly most of the negative non-car objects at the cost of missing few of the true ones. Moreover on the high level, both global structure and local texture cues are exploited to characterize the car objects precisely. To improve the computational efficiency of general SVM, a solution approximating based two-level hierarchical SVM is proposed. The experimental results show that the integration of global structure and local texture properties provides more powerful ability in discrimination of car objects from non-car ones. The final high detection performance also contributes to the utilizing of two novel low level visual cues and the hierarchical SVM.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132058059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
On the Blind Classification of Time Series 时间序列的盲分类
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383333
A. Bissacco, Stefano Soatto
We propose a cord distance in the space of dynamical models that takes into account their dynamics, including transients, output maps and input distributions. In data analysis applications, as opposed to control, the input is often not known and is inferred as part of the (blind) identification. So it is an integral part of the model that should be considered when comparing different time series. Previous work on kernel distances between dynamical models assumed either identical or independent inputs. We extend it to arbitrary distributions, highlighting connections with system identification, independent component analysis, and optimal transport. The increased modeling power is demonstrated empirically on gait classification from simple visual features.
我们提出了一个动态模型空间中的弦距,考虑了它们的动态,包括瞬态,输出映射和输入分布。在数据分析应用程序中,与控制相反,输入通常是未知的,并作为(盲)识别的一部分进行推断。因此,在比较不同的时间序列时,它是模型不可分割的一部分。以前关于动态模型之间核距离的研究假设了相同或独立的输入。我们将其扩展到任意分布,强调与系统识别,独立组件分析和最优传输的联系。通过简单的视觉特征对步态进行分类,证明了该方法提高了建模能力。
{"title":"On the Blind Classification of Time Series","authors":"A. Bissacco, Stefano Soatto","doi":"10.1109/CVPR.2007.383333","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383333","url":null,"abstract":"We propose a cord distance in the space of dynamical models that takes into account their dynamics, including transients, output maps and input distributions. In data analysis applications, as opposed to control, the input is often not known and is inferred as part of the (blind) identification. So it is an integral part of the model that should be considered when comparing different time series. Previous work on kernel distances between dynamical models assumed either identical or independent inputs. We extend it to arbitrary distributions, highlighting connections with system identification, independent component analysis, and optimal transport. The increased modeling power is demonstrated empirically on gait classification from simple visual features.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"276 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132209602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
3D Pose Estimation Based on Multiple Monocular Cues 基于多个单目线索的三维姿态估计
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383352
Björn Barrois, C. Wöhler
In this study we propose an integrated approach to the problem of 3D pose estimation. The main difference to the majority of known methods is the usage of complementary image information, including intensity and polarisation state of the light reflected from the object surface, edge information, and absolute depth values obtained based on a depth from defocus approach. Our method is based on the comparison of the input image to synthetic images generated by an OpenGL-based renderer using model information about the object provided by CAD data. This comparison provides an error term which is minimised by an iterative optimisation algorithm. Although all six degrees of freedom are estimated, our method requires only a monocular camera, circumventing disadvantages of multiocular camera systems such as the need for external camera calibration. Our framework is open for the inclusion of independently acquired depth data. We evaluate our method on a toy example as well as in two realistic scenarios in the domain of industrial quality inspection. Our experiments regarding complex real-world objects located at a distance of about 0.5 m to the camera show that the algorithm achieves typical accuracies of better than 1 degree for the rotation angles, 1-2 image pixels for the lateral translations, and several millimetres or about 1 percent for the object distance.
在这项研究中,我们提出了一种集成的方法来解决三维姿态估计问题。与大多数已知方法的主要区别在于使用互补图像信息,包括从物体表面反射的光的强度和偏振状态、边缘信息和基于离焦方法获得的深度绝对值。我们的方法是基于输入图像与基于opengl的渲染器生成的合成图像的比较,该渲染器使用CAD数据提供的关于对象的模型信息。这种比较提供了一个由迭代优化算法最小化的误差项。虽然所有六个自由度都是估计的,但我们的方法只需要一个单目摄像机,避免了多目摄像机系统需要外部摄像机校准的缺点。我们的框架是开放的,可以包含独立获取的深度数据。我们在一个玩具示例以及工业质量检测领域的两个现实场景中评估了我们的方法。我们对距离相机约0.5 m的复杂现实世界物体进行的实验表明,该算法的旋转角度精度优于1度,横向平移精度优于1-2个图像像素,物体距离精度优于几毫米或约1%。
{"title":"3D Pose Estimation Based on Multiple Monocular Cues","authors":"Björn Barrois, C. Wöhler","doi":"10.1109/CVPR.2007.383352","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383352","url":null,"abstract":"In this study we propose an integrated approach to the problem of 3D pose estimation. The main difference to the majority of known methods is the usage of complementary image information, including intensity and polarisation state of the light reflected from the object surface, edge information, and absolute depth values obtained based on a depth from defocus approach. Our method is based on the comparison of the input image to synthetic images generated by an OpenGL-based renderer using model information about the object provided by CAD data. This comparison provides an error term which is minimised by an iterative optimisation algorithm. Although all six degrees of freedom are estimated, our method requires only a monocular camera, circumventing disadvantages of multiocular camera systems such as the need for external camera calibration. Our framework is open for the inclusion of independently acquired depth data. We evaluate our method on a toy example as well as in two realistic scenarios in the domain of industrial quality inspection. Our experiments regarding complex real-world objects located at a distance of about 0.5 m to the camera show that the algorithm achieves typical accuracies of better than 1 degree for the rotation angles, 1-2 image pixels for the lateral translations, and several millimetres or about 1 percent for the object distance.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132321531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
期刊
2007 IEEE Conference on Computer Vision and Pattern Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1