首页 > 最新文献

2007 IEEE Conference on Computer Vision and Pattern Recognition最新文献

英文 中文
Improved Legibility of Text for Multiprojector Tiled Displays 提高了多投影仪平铺显示文本的易读性
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383464
Philip Tuddenham, P. Robinson
Displaying small text on large multiprojector tiled displays is challenging. Problems arise because text is badly affected by the image-warping techniques that these displays apply to rectify projector misalignment. As a consequence, there has been little progress with important large-display applications that require small text, such as collaborative tutoring or Web-browsing. In this paper we present a new warping technique designed to preserve crisp text, based on recent work by Hereld and Stevens. Our technique produces good results, free of artifacts, when used in today's multiprojector displays. We evaluate the legibility of our technique against conventional interpolation-based warping and find that users prefer our technique. We describe an efficient and reusable implementation, and show how the increased legibility has allowed us to investigate two new applications.
在大型多投影仪平铺显示器上显示小文本是具有挑战性的。问题的出现是因为文本受到图像扭曲技术的严重影响,而这些技术是用来纠正投影仪的错位的。因此,对于需要小文本的重要的大屏幕应用程序,如协作辅导或网页浏览,进展甚微。在本文中,我们提出了一种新的扭曲技术,旨在保持清晰的文本,基于最近的工作,由赫尔德和史蒂文斯。我们的技术产生良好的结果,无伪影,当使用在今天的多投影机显示。我们评估了我们的技术对传统的基于插值的翘曲的易读性,发现用户更喜欢我们的技术。我们描述了一个高效且可重用的实现,并展示了增强的易读性如何使我们能够研究两个新的应用程序。
{"title":"Improved Legibility of Text for Multiprojector Tiled Displays","authors":"Philip Tuddenham, P. Robinson","doi":"10.1109/CVPR.2007.383464","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383464","url":null,"abstract":"Displaying small text on large multiprojector tiled displays is challenging. Problems arise because text is badly affected by the image-warping techniques that these displays apply to rectify projector misalignment. As a consequence, there has been little progress with important large-display applications that require small text, such as collaborative tutoring or Web-browsing. In this paper we present a new warping technique designed to preserve crisp text, based on recent work by Hereld and Stevens. Our technique produces good results, free of artifacts, when used in today's multiprojector displays. We evaluate the legibility of our technique against conventional interpolation-based warping and find that users prefer our technique. We describe an efficient and reusable implementation, and show how the increased legibility has allowed us to investigate two new applications.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129775190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
On-line Simultaneous Learning and Tracking of Visual Feature Graphs 视觉特征图的在线同步学习与跟踪
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383435
A. Declercq, J. Piater
Model learning and tracking are two important topics in computer vision. While there are many applications where one of them is used to support the other, there are currently only few where both aid each other simultaneously. In this work, we seek to incrementally learn a graphical model from tracking and to simultaneously use whatever has been learned to improve the tracking in the next frames. The main problem encountered in this situation is that the current intermediate model may be inconsistent with future observations, creating a bias in the tracking results. We propose an uncertain model that explicitly accounts for such uncertainties by representing relations by an appropriately weighted sum of informative (parametric) and uninformative (uniform) components. The method is completely unsupervised and operates in real time.
模型学习和模型跟踪是计算机视觉中的两个重要课题。虽然有许多应用程序使用其中一个来支持另一个,但目前只有少数应用程序可以同时相互帮助。在这项工作中,我们寻求从跟踪中逐步学习图形模型,并同时使用所学到的任何内容来改进下一帧的跟踪。在这种情况下遇到的主要问题是,当前的中间模型可能与未来的观测结果不一致,从而在跟踪结果中产生偏差。我们提出了一个不确定模型,该模型通过适当加权的信息(参数)和非信息(统一)分量来表示关系,从而明确地解释了这种不确定性。该方法是完全无监督的,是实时运行的。
{"title":"On-line Simultaneous Learning and Tracking of Visual Feature Graphs","authors":"A. Declercq, J. Piater","doi":"10.1109/CVPR.2007.383435","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383435","url":null,"abstract":"Model learning and tracking are two important topics in computer vision. While there are many applications where one of them is used to support the other, there are currently only few where both aid each other simultaneously. In this work, we seek to incrementally learn a graphical model from tracking and to simultaneously use whatever has been learned to improve the tracking in the next frames. The main problem encountered in this situation is that the current intermediate model may be inconsistent with future observations, creating a bias in the tracking results. We propose an uncertain model that explicitly accounts for such uncertainties by representing relations by an appropriately weighted sum of informative (parametric) and uninformative (uniform) components. The method is completely unsupervised and operates in real time.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128448904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Tracking-as-Recognition for Articulated Full-Body Human Motion Analysis 基于跟踪识别的关节式人体运动分析
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383130
Patrick Peursum, S. Venkatesh, G. West
This paper addresses the problem of markerless tracking of a human in full 3D with a high-dimensional (29D) body model. Most work in this area has been focused on achieving accurate tracking in order to replace marker-based motion capture, but do so at the cost of relying on relatively clean observing conditions. This paper takes a different perspective, proposing a body-tracking model that is explicitly designed to handle real-world conditions such as occlusions by scene objects, failure recovery, long-term tracking, auto-initialisation, generalisation to different people and integration with action recognition. To achieve these goals, an action's motions are modelled with a variant of the hierarchical hidden Markov model. The model is quantitatively evaluated with several tests, including comparison to the annealed particle filter, tracking different people and tracking with a reduced resolution and frame rate.
本文研究了利用高维(29D)人体模型对人体进行全三维无标记跟踪的问题。该领域的大多数工作都集中在实现准确的跟踪,以取代基于标记的运动捕捉,但这样做的代价是依赖相对清洁的观察条件。本文采用了不同的视角,提出了一种明确设计用于处理现实世界条件的身体跟踪模型,例如场景物体遮挡、故障恢复、长期跟踪、自动初始化、推广到不同的人以及与动作识别的集成。为了实现这些目标,一个动作的运动用层次隐马尔可夫模型的一种变体来建模。通过与退火粒子滤波的比较、跟踪不同的人以及降低分辨率和帧率的跟踪,对该模型进行了定量评价。
{"title":"Tracking-as-Recognition for Articulated Full-Body Human Motion Analysis","authors":"Patrick Peursum, S. Venkatesh, G. West","doi":"10.1109/CVPR.2007.383130","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383130","url":null,"abstract":"This paper addresses the problem of markerless tracking of a human in full 3D with a high-dimensional (29D) body model. Most work in this area has been focused on achieving accurate tracking in order to replace marker-based motion capture, but do so at the cost of relying on relatively clean observing conditions. This paper takes a different perspective, proposing a body-tracking model that is explicitly designed to handle real-world conditions such as occlusions by scene objects, failure recovery, long-term tracking, auto-initialisation, generalisation to different people and integration with action recognition. To achieve these goals, an action's motions are modelled with a variant of the hierarchical hidden Markov model. The model is quantitatively evaluated with several tests, including comparison to the annealed particle filter, tracking different people and tracking with a reduced resolution and frame rate.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"07 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128730469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 57
Flexible Object Models for Category-Level 3D Object Recognition 面向类别级三维对象识别的柔性对象模型
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383149
Akash M. Kushal, C. Schmid, J. Ponce
Today's category-level object recognition systems largely focus on fronto-parallel views of objects with characteristic texture patterns. To overcome these limitations, we propose a novel framework for visual object recognition where object classes are represented by assemblies of partial surface models (PSMs) obeying loose local geometric constraints. The PSMs themselves are formed of dense, locally rigid assemblies of image features. Since our model only enforces local geometric consistency, both at the level of model parts and at the level of individual features within the parts, it is robust to viewpoint changes and intra-class variability. The proposed approach has been implemented, and it outperforms the state-of-the-art algorithms for object detection and localization recently compared in [14] on the Pascal 2005 VOC Challenge Cars Test 1 data.
目前的分类级对象识别系统主要集中在具有特征纹理模式的对象的正面平行视图上。为了克服这些限制,我们提出了一种新的视觉对象识别框架,其中对象类由服从松散局部几何约束的部分表面模型(psm)的集合表示。psm本身由密集的、局部刚性的图像特征组合而成。由于我们的模型只强制局部几何一致性,无论是在模型部件的水平上还是在部件内的单个特征的水平上,它对视点变化和类内可变性都是鲁棒的。所提出的方法已经实现,并且最近在Pascal 2005 VOC挑战汽车测试1数据上与[14]相比,它优于最先进的对象检测和定位算法。
{"title":"Flexible Object Models for Category-Level 3D Object Recognition","authors":"Akash M. Kushal, C. Schmid, J. Ponce","doi":"10.1109/CVPR.2007.383149","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383149","url":null,"abstract":"Today's category-level object recognition systems largely focus on fronto-parallel views of objects with characteristic texture patterns. To overcome these limitations, we propose a novel framework for visual object recognition where object classes are represented by assemblies of partial surface models (PSMs) obeying loose local geometric constraints. The PSMs themselves are formed of dense, locally rigid assemblies of image features. Since our model only enforces local geometric consistency, both at the level of model parts and at the level of individual features within the parts, it is robust to viewpoint changes and intra-class variability. The proposed approach has been implemented, and it outperforms the state-of-the-art algorithms for object detection and localization recently compared in [14] on the Pascal 2005 VOC Challenge Cars Test 1 data.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128575467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 95
Research issues in image registration for remote sensing 遥感图像配准研究问题
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383423
R. Eastman, J. L. Moigne, N. Netanyahu
Image registration is an important element in data processing for remote sensing with many applications and a wide range of solutions. Despite considerable investigation the field has not settled on a definitive solution for most applications and a number of questions remain open. This article looks at selected research issues by surveying the experience of operational satellite teams, application-specific requirements for Earth science, and our experiments in the evaluation of image registration algorithms with emphasis on the comparison of algorithms for subpixel accuracy. We conclude that remote sensing applications put particular demands on image registration algorithms to take into account domain-specific knowledge of geometric transformations and image content.
图像配准是遥感数据处理的重要组成部分,具有广泛的应用和解决方案。尽管进行了大量的研究,但该领域尚未为大多数应用确定一个明确的解决方案,许多问题仍未解决。本文通过调查运营卫星团队的经验、地球科学的特定应用需求以及我们在评估图像配准算法方面的实验(重点是亚像素精度算法的比较)来研究选定的研究问题。我们得出结论,遥感应用对图像配准算法提出了特殊要求,以考虑几何变换和图像内容的领域特定知识。
{"title":"Research issues in image registration for remote sensing","authors":"R. Eastman, J. L. Moigne, N. Netanyahu","doi":"10.1109/CVPR.2007.383423","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383423","url":null,"abstract":"Image registration is an important element in data processing for remote sensing with many applications and a wide range of solutions. Despite considerable investigation the field has not settled on a definitive solution for most applications and a number of questions remain open. This article looks at selected research issues by surveying the experience of operational satellite teams, application-specific requirements for Earth science, and our experiments in the evaluation of image registration algorithms with emphasis on the comparison of algorithms for subpixel accuracy. We conclude that remote sensing applications put particular demands on image registration algorithms to take into account domain-specific knowledge of geometric transformations and image content.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129140810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Projector Calibration using Arbitrary Planes and Calibrated Camera 投影仪校准使用任意平面和校准相机
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383477
M. Kimura, M. Mochimaru, T. Kanade
In this paper, an easy calibration method for projector is proposed. The calibration handled in this paper is projective relation between 3D space and 2D pattern, and is not correction of trapezoid distortion in projected pattern. In projector-camera systems, especially for 3D measurement, such calibration is the basis of process. The projection from projector can be modeled as inverse projection of the pinhole camera, which is generally considered as perspective projection. In the existing systems, some special objects or devices are often used to calibrate projector, so that 3D-2D projection map can be measured for typical camera calibration methods. The proposed method utilizes projective geometry between camera and projector, so that it requires only pre-calibrated camera and a plane. It is easy to practice, easy to calculate, and reasonably accurate.
本文提出了一种简便的投影仪标定方法。本文所处理的校正是三维空间与二维图形之间的投影关系,而不是投影图形中梯形畸变的校正。在投影摄像机系统中,特别是在三维测量中,这种标定是整个过程的基础。投影机的投影可以建模为针孔相机的逆投影,一般认为是透视投影。在现有的系统中,通常使用一些特殊的物体或设备来校准投影仪,以便测量3D-2D投影地图,用于典型的摄像机校准方法。该方法利用摄像机和投影仪之间的投影几何,因此只需要预先校准的摄像机和一个平面。它易于实践,易于计算,而且相当准确。
{"title":"Projector Calibration using Arbitrary Planes and Calibrated Camera","authors":"M. Kimura, M. Mochimaru, T. Kanade","doi":"10.1109/CVPR.2007.383477","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383477","url":null,"abstract":"In this paper, an easy calibration method for projector is proposed. The calibration handled in this paper is projective relation between 3D space and 2D pattern, and is not correction of trapezoid distortion in projected pattern. In projector-camera systems, especially for 3D measurement, such calibration is the basis of process. The projection from projector can be modeled as inverse projection of the pinhole camera, which is generally considered as perspective projection. In the existing systems, some special objects or devices are often used to calibrate projector, so that 3D-2D projection map can be measured for typical camera calibration methods. The proposed method utilizes projective geometry between camera and projector, so that it requires only pre-calibrated camera and a plane. It is easy to practice, easy to calculate, and reasonably accurate.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129357368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 86
Towards Scalable Representations of Object Categories: Learning a Hierarchy of Parts 面向对象类别的可伸缩表示:学习零件的层次结构
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383269
S. Fidler, A. Leonardis
This paper proposes a novel approach to constructing a hierarchical representation of visual input that aims to enable recognition and detection of a large number of object categories. Inspired by the principles of efficient indexing (bottom-up,), robust matching (top-down,), and ideas of compositionality, our approach learns a hierarchy of spatially flexible compositions, i.e. parts, in an unsupervised, statistics-driven manner. Starting with simple, frequent features, we learn the statistically most significant compositions (parts composed of parts), which consequently define the next layer. Parts are learned sequentially, layer after layer, optimally adjusting to the visual data. Lower layers are learned in a category-independent way to obtain complex, yet sharable visual building blocks, which is a crucial step towards a scalable representation. Higher layers of the hierarchy, on the other hand, are constructed by using specific categories, achieving a category representation with a small number of highly generalizable parts that gained their structural flexibility through composition within the hierarchy. Built in this way, new categories can be efficiently and continuously added to the system by adding a small number of parts only in the higher layers. The approach is demonstrated on a large collection of images and a variety of object categories. Detection results confirm the effectiveness and robustness of the learned parts.
本文提出了一种新的方法来构建视觉输入的分层表示,旨在实现对大量对象类别的识别和检测。受高效索引(自下而上)、健壮匹配(自上而下)和组合性思想的启发,我们的方法以无监督、统计驱动的方式学习空间灵活组合(即部件)的层次结构。从简单、频繁的特征开始,我们学习统计上最重要的组成(由部分组成的部分),从而定义下一层。部分是按顺序学习的,一层又一层,以最佳方式调整视觉数据。较低的层以一种与类别无关的方式学习,以获得复杂但可共享的视觉构建块,这是迈向可扩展表示的关键一步。另一方面,层次结构的更高层是通过使用特定的类别来构建的,用少量高度一般化的部分实现类别表示,这些部分通过层次结构内的组合获得结构灵活性。通过这种方式构建,只需在较高层中添加少量部件,就可以高效且持续地将新类别添加到系统中。该方法在大量图像和各种对象类别上进行了演示。检测结果证实了学习部分的有效性和鲁棒性。
{"title":"Towards Scalable Representations of Object Categories: Learning a Hierarchy of Parts","authors":"S. Fidler, A. Leonardis","doi":"10.1109/CVPR.2007.383269","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383269","url":null,"abstract":"This paper proposes a novel approach to constructing a hierarchical representation of visual input that aims to enable recognition and detection of a large number of object categories. Inspired by the principles of efficient indexing (bottom-up,), robust matching (top-down,), and ideas of compositionality, our approach learns a hierarchy of spatially flexible compositions, i.e. parts, in an unsupervised, statistics-driven manner. Starting with simple, frequent features, we learn the statistically most significant compositions (parts composed of parts), which consequently define the next layer. Parts are learned sequentially, layer after layer, optimally adjusting to the visual data. Lower layers are learned in a category-independent way to obtain complex, yet sharable visual building blocks, which is a crucial step towards a scalable representation. Higher layers of the hierarchy, on the other hand, are constructed by using specific categories, achieving a category representation with a small number of highly generalizable parts that gained their structural flexibility through composition within the hierarchy. Built in this way, new categories can be efficiently and continuously added to the system by adding a small number of parts only in the higher layers. The approach is demonstrated on a large collection of images and a variety of object categories. Detection results confirm the effectiveness and robustness of the learned parts.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124673326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 217
Fast Sparse Gaussian Processes Learning for Man-Made Structure Classification 用于人工结构分类的快速稀疏高斯过程学习
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383441
Hang Zhou, D. Suter
Informative Vector Machine (IVM) is an efficient fast sparse Gaussian process's (GP) method previously suggested for active learning. It greatly reduces the computational cost of GP classification and makes the GP learning close to real time. We apply IVM for man-made structure classification (a two class problem). Our work includes the investigation of the performance of IVM with varied active data points as well as the effects of different choices of GP kernels. Satisfactory results have been obtained, showing that the approach keeps full GP classification performance and yet is significantly faster (by virtue if using a subset of the whole training data points).
信息向量机(IVM)是一种高效、快速的稀疏高斯过程(GP)主动学习方法。它大大降低了GP分类的计算成本,使GP学习接近实时性。我们将IVM应用于人工结构分类(一个两类问题)。我们的工作包括调查具有不同活动数据点的IVM的性能,以及不同选择GP内核的影响。已经获得了令人满意的结果,表明该方法保持了完整的GP分类性能,但速度明显更快(由于使用了整个训练数据点的子集)。
{"title":"Fast Sparse Gaussian Processes Learning for Man-Made Structure Classification","authors":"Hang Zhou, D. Suter","doi":"10.1109/CVPR.2007.383441","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383441","url":null,"abstract":"Informative Vector Machine (IVM) is an efficient fast sparse Gaussian process's (GP) method previously suggested for active learning. It greatly reduces the computational cost of GP classification and makes the GP learning close to real time. We apply IVM for man-made structure classification (a two class problem). Our work includes the investigation of the performance of IVM with varied active data points as well as the effects of different choices of GP kernels. Satisfactory results have been obtained, showing that the approach keeps full GP classification performance and yet is significantly faster (by virtue if using a subset of the whole training data points).","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124737615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Kernel-based Tracking from a Probabilistic Viewpoint 基于概率观点的核跟踪
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383240
Q. A. Nguyen, A. Robles-Kelly, Chunhua Shen
In this paper, we present a probabilistic formulation of kernel-based tracking methods based upon maximum likelihood estimation. To this end, we view the coordinates for the pixels in both, the target model and its candidate as random variables and make use of a generative model so as to cast the tracking task into a maximum likelihood framework. This, in turn, permits the use of the EM-algorithm to estimate a set of latent variables that can be used to update the target-center position. Once the latent variables have been estimated, we use the Kullback-Leibler divergence so as to minimise the mutual information between the target model and candidate distributions in order to develop a target-center update rule and a kernel bandwidth adjustment scheme. The method is very general in nature. We illustrate the utility of our approach for purposes of tracking on real-world video sequences using two alternative kernel functions.
在本文中,我们提出了一种基于极大似然估计的核跟踪方法的概率公式。为此,我们将目标模型及其候选模型中像素的坐标视为随机变量,并利用生成模型将跟踪任务置于最大似然框架中。这反过来又允许使用em算法来估计一组可用于更新目标中心位置的潜在变量。一旦潜在变量被估计,我们使用Kullback-Leibler散度来最小化目标模型和候选分布之间的互信息,从而制定目标中心更新规则和核带宽调整方案。这个方法在本质上是非常通用的。我们使用两个可选的核函数来说明我们的方法在跟踪真实世界视频序列方面的实用性。
{"title":"Kernel-based Tracking from a Probabilistic Viewpoint","authors":"Q. A. Nguyen, A. Robles-Kelly, Chunhua Shen","doi":"10.1109/CVPR.2007.383240","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383240","url":null,"abstract":"In this paper, we present a probabilistic formulation of kernel-based tracking methods based upon maximum likelihood estimation. To this end, we view the coordinates for the pixels in both, the target model and its candidate as random variables and make use of a generative model so as to cast the tracking task into a maximum likelihood framework. This, in turn, permits the use of the EM-algorithm to estimate a set of latent variables that can be used to update the target-center position. Once the latent variables have been estimated, we use the Kullback-Leibler divergence so as to minimise the mutual information between the target model and candidate distributions in order to develop a target-center update rule and a kernel bandwidth adjustment scheme. The method is very general in nature. We illustrate the utility of our approach for purposes of tracking on real-world video sequences using two alternative kernel functions.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126910673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Image Hallucination Using Neighbor Embedding over Visual Primitive Manifolds 基于邻域嵌入的视觉基元流形图像幻觉
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383001
Wei-liang Fan, D. Yeung
In this paper, we propose a novel learning-based method for image hallucination, with image super-resolution being a specific application that we focus on here. Given a low-resolution image, its underlying higher-resolution details are synthesized based on a set of training images. In order to build a compact yet descriptive training set, we investigate the characteristic local structures contained in large volumes of small image patches. Inspired by progress in manifold learning research, we take the assumption that small image patches in the low-resolution and high-resolution images form manifolds with similar local geometry in the corresponding image feature spaces. This assumption leads to a super-resolution approach which reconstructs the feature vector corresponding to an image patch by its neighbors in the feature space. In addition, the residual errors associated with the reconstructed image patches are also estimated to compensate for the information loss in the local averaging process. Experimental results show that our hallucination method can synthesize higher-quality images compared with other methods.
在本文中,我们提出了一种新的基于学习的图像幻觉方法,其中图像超分辨率是我们关注的一个具体应用。给定低分辨率图像,基于一组训练图像合成其底层高分辨率细节。为了建立一个紧凑的描述性训练集,我们研究了包含在大量小图像块中的特征局部结构。受流形学习研究进展的启发,我们假设低分辨率和高分辨率图像中的小图像块在相应的图像特征空间中形成具有相似局部几何形状的流形。这一假设导致了一种超分辨率方法,该方法通过特征空间中的邻域重构图像patch对应的特征向量。此外,还估计了重建图像块相关的残差,以补偿局部平均过程中的信息损失。实验结果表明,与其他方法相比,我们的幻觉方法可以合成更高质量的图像。
{"title":"Image Hallucination Using Neighbor Embedding over Visual Primitive Manifolds","authors":"Wei-liang Fan, D. Yeung","doi":"10.1109/CVPR.2007.383001","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383001","url":null,"abstract":"In this paper, we propose a novel learning-based method for image hallucination, with image super-resolution being a specific application that we focus on here. Given a low-resolution image, its underlying higher-resolution details are synthesized based on a set of training images. In order to build a compact yet descriptive training set, we investigate the characteristic local structures contained in large volumes of small image patches. Inspired by progress in manifold learning research, we take the assumption that small image patches in the low-resolution and high-resolution images form manifolds with similar local geometry in the corresponding image feature spaces. This assumption leads to a super-resolution approach which reconstructs the feature vector corresponding to an image patch by its neighbors in the feature space. In addition, the residual errors associated with the reconstructed image patches are also estimated to compensate for the information loss in the local averaging process. Experimental results show that our hallucination method can synthesize higher-quality images compared with other methods.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130643669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 127
期刊
2007 IEEE Conference on Computer Vision and Pattern Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1