首页 > 最新文献

2007 IEEE Conference on Computer Vision and Pattern Recognition最新文献

英文 中文
Combining Region and Edge Cues for Image Segmentation in a Probabilistic Gaussian Mixture Framework 结合区域和边缘线索的概率高斯混合框架图像分割
Pub Date : 2009-12-16 DOI: 10.1109/CVPR.2007.383232
Omer Rotem, H. Greenspan, J. Goldberger
In this paper we propose a new segmentation algorithm which combines patch-based information with edge cues under a probabilistic framework. We use a mixture of multiple Gaussians for building the statistical model with color and spatial features, and we incorporate edge information based on texture, color and brightness differences into the EM algorithm. We evaluate our results qualitatively and quantitatively on a large data-set of natural images and compare our results to other state-of-the-art methods.
本文提出了一种在概率框架下将基于补丁的信息与边缘线索相结合的分割算法。我们使用混合的多个高斯函数来构建具有颜色和空间特征的统计模型,并将基于纹理、颜色和亮度差异的边缘信息纳入EM算法。我们在大量自然图像数据集上定性和定量地评估我们的结果,并将我们的结果与其他最先进的方法进行比较。
{"title":"Combining Region and Edge Cues for Image Segmentation in a Probabilistic Gaussian Mixture Framework","authors":"Omer Rotem, H. Greenspan, J. Goldberger","doi":"10.1109/CVPR.2007.383232","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383232","url":null,"abstract":"In this paper we propose a new segmentation algorithm which combines patch-based information with edge cues under a probabilistic framework. We use a mixture of multiple Gaussians for building the statistical model with color and spatial features, and we incorporate edge information based on texture, color and brightness differences into the EM algorithm. We evaluate our results qualitatively and quantitatively on a large data-set of natural images and compare our results to other state-of-the-art methods.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"176 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115100294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Fast Human Pose Estimation using Appearance and Motion via Multi-Dimensional Boosting Regression 基于外观和运动的多维增强回归快速人体姿态估计
Pub Date : 2007-12-05 DOI: 10.1109/CVPR.2007.383129
A. Bissacco, Ming-Hsuan Yang, Stefano Soatto
We address the problem of estimating human pose in video sequences, where rough location has been determined. We exploit both appearance and motion information by defining suitable features of an image and its temporal neighbors, and learning a regression map to the parameters of a model of the human body using boosting techniques. Our algorithm can be viewed as a fast initialization step for human body trackers, or as a tracker itself. We extend gradient boosting techniques to learn a multi-dimensional map from (rotated and scaled) Haar features to the entire set of joint angles representing the full body pose. We test our approach by learning a map from image patches to body joint angles from synchronized video and motion capture walking data. We show how our technique enables learning an efficient real-time pose estimator, validated on publicly available datasets.
我们解决了在视频序列中估计人体姿势的问题,其中粗略的位置已经确定。我们通过定义图像及其时间邻居的合适特征来利用外观和运动信息,并使用增强技术学习到人体模型参数的回归映射。我们的算法可以被看作是人体跟踪器的快速初始化步骤,或者作为一个跟踪器本身。我们扩展了梯度增强技术,以学习从(旋转和缩放)Haar特征到代表全身姿势的整个关节角度集合的多维映射。我们通过从同步视频和动作捕捉步行数据中学习从图像补丁到身体关节角度的地图来测试我们的方法。我们展示了我们的技术如何能够学习一个有效的实时姿态估计器,并在公开可用的数据集上进行了验证。
{"title":"Fast Human Pose Estimation using Appearance and Motion via Multi-Dimensional Boosting Regression","authors":"A. Bissacco, Ming-Hsuan Yang, Stefano Soatto","doi":"10.1109/CVPR.2007.383129","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383129","url":null,"abstract":"We address the problem of estimating human pose in video sequences, where rough location has been determined. We exploit both appearance and motion information by defining suitable features of an image and its temporal neighbors, and learning a regression map to the parameters of a model of the human body using boosting techniques. Our algorithm can be viewed as a fast initialization step for human body trackers, or as a tracker itself. We extend gradient boosting techniques to learn a multi-dimensional map from (rotated and scaled) Haar features to the entire set of joint angles representing the full body pose. We test our approach by learning a map from image patches to body joint angles from synchronized video and motion capture walking data. We show how our technique enables learning an efficient real-time pose estimator, validated on publicly available datasets.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115857158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 136
An Embodied User Interface for Increasing Physical Activities in Game 增加游戏中体力活动的具体用户界面
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383479
S. Kim, W. Winchester, Yun-Bum Choi, J. Lee
The rate of obese has been increasing and obesity has emerged as a significant threat not only to the health but also in society. Obesity has adverse effects such as physical appearance, psychosocial consequences and metabolic disturbances. One of reasons causing these phenomena is most games have static and stationary user interfaces as input devices. These kinds of interfaces hold users at their computers and cause not only decreases of the strength of their health, but also blocks communications between family members. In this paper, we propose physical activity based interactive exercise called Punch Punch, which is played with virtual objects displaying on a large screen. The informal study revealed that the Punch Punch enhanced physical and social activities while playing games. The goal of this study is finding embodied user interfaces to increase physical and social activities.
肥胖率一直在上升,肥胖不仅是对健康的重大威胁,也是对社会的重大威胁。肥胖会产生不良影响,如外貌、心理社会后果和代谢紊乱。造成这种现象的原因之一是,大多数游戏都将静态和静态用户界面作为输入设备。这类界面将用户困在电脑前,不仅会降低他们的健康水平,还会阻碍家庭成员之间的交流。在本文中,我们提出了一种基于身体活动的交互式练习,称为Punch Punch,它是通过在大屏幕上显示虚拟物体来进行的。这项非正式研究表明,在玩游戏时,Punch Punch可以增强身体和社交活动。本研究的目的是寻找具体化的用户界面,以增加身体和社会活动。
{"title":"An Embodied User Interface for Increasing Physical Activities in Game","authors":"S. Kim, W. Winchester, Yun-Bum Choi, J. Lee","doi":"10.1109/CVPR.2007.383479","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383479","url":null,"abstract":"The rate of obese has been increasing and obesity has emerged as a significant threat not only to the health but also in society. Obesity has adverse effects such as physical appearance, psychosocial consequences and metabolic disturbances. One of reasons causing these phenomena is most games have static and stationary user interfaces as input devices. These kinds of interfaces hold users at their computers and cause not only decreases of the strength of their health, but also blocks communications between family members. In this paper, we propose physical activity based interactive exercise called Punch Punch, which is played with virtual objects displaying on a large screen. The informal study revealed that the Punch Punch enhanced physical and social activities while playing games. The goal of this study is finding embodied user interfaces to increase physical and social activities.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"435 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115603875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Crisp Weighted Support Vector Regression for robust single model estimation : application to object tracking in image sequences 稳健单模型估计的清晰加权支持向量回归:应用于图像序列中的目标跟踪
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383181
F. Dufrenois, J. Colliez, D. Hamad
Support Vector Regression (SVR) is now a well-established method for estimating real-valued functions. However, the standard SVR is not effective to deal with outliers and structured outliers in training data sets commonly encountered in computer vision applications. In this paper, we present a weighted version of SVM for regression. The proposed approach introduces an adaptive binary function that allows a dominant model from a degraded training dataset to be extracted. This binary function progressively separates inliers from outliers following a one-against-all decomposition. Experimental tests show the high robustness of the proposed approach against outliers and residual structured outliers. Next, we validate our algorithm for object tracking and for optic flow estimation.
支持向量回归(SVR)是目前公认的一种估计实值函数的方法。然而,对于计算机视觉应用中常见的训练数据集中的异常点和结构化异常点,标准的支持向量回归算法并不有效。在本文中,我们提出了一个加权版本的支持向量机的回归。提出的方法引入了一个自适应二值函数,允许从退化的训练数据集中提取主导模型。这个二元函数通过一一分解逐步将内线与离群分离。实验测试表明,该方法对异常值和残差结构化异常值具有较高的鲁棒性。接下来,我们验证了我们的算法用于目标跟踪和光流估计。
{"title":"Crisp Weighted Support Vector Regression for robust single model estimation : application to object tracking in image sequences","authors":"F. Dufrenois, J. Colliez, D. Hamad","doi":"10.1109/CVPR.2007.383181","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383181","url":null,"abstract":"Support Vector Regression (SVR) is now a well-established method for estimating real-valued functions. However, the standard SVR is not effective to deal with outliers and structured outliers in training data sets commonly encountered in computer vision applications. In this paper, we present a weighted version of SVM for regression. The proposed approach introduces an adaptive binary function that allows a dominant model from a degraded training dataset to be extracted. This binary function progressively separates inliers from outliers following a one-against-all decomposition. Experimental tests show the high robustness of the proposed approach against outliers and residual structured outliers. Next, we validate our algorithm for object tracking and for optic flow estimation.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"287 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123121482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Realizing Super-Resolution with Superimposed Projection 用叠加投影实现超分辨率
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383463
Niranjan Damera-Venkata, Nelson L. Chang
We consider the problem of rendering high-resolution images on a display composed of multiple superimposed lower-resolution projectors. A theoretical analysis of this problem in the literature previously concluded that the multi-projector superimposition of low resolution projectors cannot produce high resolution images. In our recent work, we showed to the contrary that super-resolution via multiple superimposed projectors is indeed theoretically achievable. This paper derives practical algorithms for real multi-projector systems that account for the intra- and inter-projector variations and that render high-quality, high-resolution content at real-time interactive frame rates. A camera is used to estimate the geometric, photometric, and color properties of each component projector in a calibration step. Given this parameter information, we demonstrate novel methods for efficiently generating optimal sub-frames so that the resulting projected image is as close as possible to the given high resolution images.
我们考虑在由多个叠加的低分辨率投影仪组成的显示器上呈现高分辨率图像的问题。先前文献对这一问题的理论分析得出结论,低分辨率投影机的多投影机叠加不能产生高分辨率图像。在我们最近的工作中,我们相反地证明了通过多个叠加投影仪的超分辨率在理论上确实是可以实现的。本文推导了实际多投影机系统的实用算法,该算法考虑了投影机内部和内部的变化,并以实时交互帧率呈现高质量,高分辨率的内容。在校准步骤中,摄像机用于估计每个投影仪组件的几何、光度和颜色属性。考虑到这些参数信息,我们展示了有效生成最优子帧的新方法,以便得到的投影图像尽可能接近给定的高分辨率图像。
{"title":"Realizing Super-Resolution with Superimposed Projection","authors":"Niranjan Damera-Venkata, Nelson L. Chang","doi":"10.1109/CVPR.2007.383463","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383463","url":null,"abstract":"We consider the problem of rendering high-resolution images on a display composed of multiple superimposed lower-resolution projectors. A theoretical analysis of this problem in the literature previously concluded that the multi-projector superimposition of low resolution projectors cannot produce high resolution images. In our recent work, we showed to the contrary that super-resolution via multiple superimposed projectors is indeed theoretically achievable. This paper derives practical algorithms for real multi-projector systems that account for the intra- and inter-projector variations and that render high-quality, high-resolution content at real-time interactive frame rates. A camera is used to estimate the geometric, photometric, and color properties of each component projector in a calibration step. Given this parameter information, we demonstrate novel methods for efficiently generating optimal sub-frames so that the resulting projected image is as close as possible to the given high resolution images.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116786390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 42
Two-View Motion Segmentation from Linear Programming Relaxation 基于线性规划松弛的二视图运动分割
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.382975
Hongdong Li
This paper studies the problem of multibody motion segmentation, which is an important, but challenging problem due to its well-known chicken-and-egg-type recursive character. We propose a new mixture-of-fundamental-matrices model to describe the multibody motions from two views. Based on the maximum likelihood estimation, in conjunction with a random sampling scheme, we show that the problem can be naturally formulated as a linear programming (LP) problem. Consequently, the motion segmentation problem can be solved efficiently by linear program relaxation. Experiments demonstrate that: without assuming the actual number of motions our method produces accurate segmentation result. This LP formulation has also other advantages, such as easy to handle outliers and easy to enforce prior knowledge etc.
本文研究了多体运动分割问题,这是一个重要而又具有挑战性的问题,因为它具有众所周知的鸡和蛋的递归性质。我们提出了一个新的混合基本矩阵模型,从两个角度来描述多体运动。基于极大似然估计,结合随机抽样方案,我们证明了该问题可以自然地表述为线性规划(LP)问题。因此,采用线性规划松弛法可以有效地解决运动分割问题。实验表明:在不假设实际运动数的情况下,该方法可以得到准确的分割结果。该LP公式还具有易于处理异常值和易于执行先验知识等优点。
{"title":"Two-View Motion Segmentation from Linear Programming Relaxation","authors":"Hongdong Li","doi":"10.1109/CVPR.2007.382975","DOIUrl":"https://doi.org/10.1109/CVPR.2007.382975","url":null,"abstract":"This paper studies the problem of multibody motion segmentation, which is an important, but challenging problem due to its well-known chicken-and-egg-type recursive character. We propose a new mixture-of-fundamental-matrices model to describe the multibody motions from two views. Based on the maximum likelihood estimation, in conjunction with a random sampling scheme, we show that the problem can be naturally formulated as a linear programming (LP) problem. Consequently, the motion segmentation problem can be solved efficiently by linear program relaxation. Experiments demonstrate that: without assuming the actual number of motions our method produces accurate segmentation result. This LP formulation has also other advantages, such as easy to handle outliers and easy to enforce prior knowledge etc.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116866357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 66
Automatic texture mapping on real 3D model 在真实的3D模型上自动纹理映射
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383482
T. Molinier, D. Fofi, P. Gorria, J. Salvi
We propose a full automatic technique to project virtual texture on a real textureless 3D object. Our system is composed of cameras and projector and are used to determine the pose of the object in the real world with the projector as reference and then estimate the image seen by the projector if it would be a camera.
提出了一种将虚拟纹理投影到真实无纹理三维物体上的全自动技术。我们的系统由摄像机和投影仪组成,用于确定现实世界中物体的姿态,投影仪作为参考,然后估计投影仪看到的图像,如果它是一个摄像机。
{"title":"Automatic texture mapping on real 3D model","authors":"T. Molinier, D. Fofi, P. Gorria, J. Salvi","doi":"10.1109/CVPR.2007.383482","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383482","url":null,"abstract":"We propose a full automatic technique to project virtual texture on a real textureless 3D object. Our system is composed of cameras and projector and are used to determine the pose of the object in the real world with the projector as reference and then estimate the image seen by the projector if it would be a camera.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"27 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120998183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Efficiently Determining Silhouette Consistency 有效确定轮廓一致性
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383161
Li Yi, D. Jacobs
Volume intersection is a frequently used technique to solve the Shape-From-Silhouette problem, which constructs a 3D object estimate from a set of silhouettes taken with calibrated cameras. It is natural to develop an efficient algorithm to determine the consistency of a set of silhouettes before performing time-consuming reconstruction, so that inaccurate silhouettes can be omitted. In this paper we first present a fast algorithm to determine the consistency of three silhouettes from known (but arbitrary) viewing directions, assuming the projection is scaled orthographic. The temporal complexity of the algorithm is linear in the number of points of the silhouette boundaries. We further prove that a set of more than three convex silhouettes are consistent if and only if any three of them are consistent. Another possible application of our approach is to determine the miscalibrated cameras in a large camera system. A consistent subset of cameras can be determined on the fly and miscalibrated cameras can also be recalibrated at a coarse scale. Real and synthesized data are used to demonstrate our results.
体交是一种常用的解决形状-轮廓问题的技术,它从一组标定的相机拍摄的轮廓中构建一个三维物体估计。在进行耗时的重建之前,自然需要开发一种高效的算法来确定一组轮廓的一致性,从而可以省略不准确的轮廓。在本文中,我们首先提出了一种快速算法,以确定从已知(但任意)的观看方向的三个轮廓的一致性,假设投影是比例正射影。该算法的时间复杂度在轮廓边界点的数量上是线性的。进一步证明了一组超过三个的凸轮廓是一致的当且仅当其中任意三个是一致的。我们的方法的另一个可能的应用是确定大型相机系统中的错误校准相机。相机的一致子集可以在飞行中确定,并且错误校准的相机也可以在粗尺度上重新校准。实际数据和综合数据被用来证明我们的结果。
{"title":"Efficiently Determining Silhouette Consistency","authors":"Li Yi, D. Jacobs","doi":"10.1109/CVPR.2007.383161","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383161","url":null,"abstract":"Volume intersection is a frequently used technique to solve the Shape-From-Silhouette problem, which constructs a 3D object estimate from a set of silhouettes taken with calibrated cameras. It is natural to develop an efficient algorithm to determine the consistency of a set of silhouettes before performing time-consuming reconstruction, so that inaccurate silhouettes can be omitted. In this paper we first present a fast algorithm to determine the consistency of three silhouettes from known (but arbitrary) viewing directions, assuming the projection is scaled orthographic. The temporal complexity of the algorithm is linear in the number of points of the silhouette boundaries. We further prove that a set of more than three convex silhouettes are consistent if and only if any three of them are consistent. Another possible application of our approach is to determine the miscalibrated cameras in a large camera system. A consistent subset of cameras can be determined on the fly and miscalibrated cameras can also be recalibrated at a coarse scale. Real and synthesized data are used to demonstrate our results.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127252811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
CRF-driven Implicit Deformable Model crf驱动的隐式变形模型
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383233
G. Tsechpenakis, Dimitris N. Metaxas
We present a topology independent solution for segmenting objects with texture patterns of any scale, using an implicit deformable model driven by conditional random fields (CRFs). Our model integrates region and edge information as image driven terms, whereas the probabilistic shape and internal (smoothness) terms use representations similar to the level-set based methods. The evolution of the model is solved as a MAP estimation problem, where the target conditional probability is decomposed into the internal term and the image-driven term. For the later, we use discriminative CRFs in two scales, pixel- and patch-based, to obtain smooth probability fields based on the corresponding image features. The advantages and novelties of our approach are (i) the integration of CRFs with implicit deformable models in a tightly coupled scheme, (ii) the use of CRFs which avoids ambiguities in the probability fields, (iii) the handling of local feature variations by updating the model interior statistics and processing at different spatial scales, and (v) the independence from the topology. We demonstrate the performance of our method in a wide variety of images, from the zebra and cheetah examples to the left and right ventricles in cardiac images.
我们提出了一种拓扑无关的解决方案,用于分割具有任意尺度纹理图案的物体,使用由条件随机场(CRFs)驱动的隐式可变形模型。我们的模型将区域和边缘信息集成为图像驱动项,而概率形状和内部(平滑)项使用类似于基于水平集的方法的表示。将模型的演化分解为MAP估计问题,将目标条件概率分解为内部项和图像驱动项。对于后者,我们使用基于像素和基于补丁的两种尺度的判别crf,根据相应的图像特征获得光滑的概率场。该方法的优点和新颖之处在于:(i)在紧密耦合方案中将crf与隐式可变形模型集成,(ii)使用crf避免了概率域中的模糊性,(iii)通过更新模型内部统计和在不同空间尺度上进行处理来处理局部特征变化,以及(v)与拓扑的独立性。我们在各种各样的图像中展示了我们的方法的性能,从斑马和猎豹的例子到心脏图像中的左心室和右心室。
{"title":"CRF-driven Implicit Deformable Model","authors":"G. Tsechpenakis, Dimitris N. Metaxas","doi":"10.1109/CVPR.2007.383233","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383233","url":null,"abstract":"We present a topology independent solution for segmenting objects with texture patterns of any scale, using an implicit deformable model driven by conditional random fields (CRFs). Our model integrates region and edge information as image driven terms, whereas the probabilistic shape and internal (smoothness) terms use representations similar to the level-set based methods. The evolution of the model is solved as a MAP estimation problem, where the target conditional probability is decomposed into the internal term and the image-driven term. For the later, we use discriminative CRFs in two scales, pixel- and patch-based, to obtain smooth probability fields based on the corresponding image features. The advantages and novelties of our approach are (i) the integration of CRFs with implicit deformable models in a tightly coupled scheme, (ii) the use of CRFs which avoids ambiguities in the probability fields, (iii) the handling of local feature variations by updating the model interior statistics and processing at different spatial scales, and (v) the independence from the topology. We demonstrate the performance of our method in a wide variety of images, from the zebra and cheetah examples to the left and right ventricles in cardiac images.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127357869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
Color Constancy using Natural Image Statistics 使用自然图像统计的色彩稳定性
Pub Date : 2007-06-17 DOI: 10.1109/CVPR.2007.383206
A. Gijsenij, T. Gevers
Although many color constancy methods exist, they are all based on specific assumptions such as the set of possible light sources, or the spatial and spectral characteristics of images. As a consequence, no algorithm can be considered as universal. However, with the large variety of available methods, the question is how to select the method that induces equivalent classes for different image characteristics. Furthermore, the subsequent question is how to combine the different algorithms in a proper way. To achieve selection and combining of color constancy algorithms, in this paper, natural image statistics are used to identify the most important characteristics of color images. Then, based on these image characteristics, the proper color constancy algorithm (or best combination of algorithms) is selected for a specific image. To capture the image characteristics, the Weibull parameterization (e.g. texture and contrast) is used. Experiments show that, on a large data set of 11,000 images, our approach outperforms current state-of-the-art single algorithms, as well as simple alternatives for combining several algorithms.
虽然存在许多颜色恒常性方法,但它们都是基于特定的假设,例如可能的光源集,或图像的空间和光谱特征。因此,没有一种算法可以被认为是通用的。然而,在可用的方法种类繁多的情况下,如何选择针对不同图像特征诱导等效类的方法是一个问题。此外,接下来的问题是如何以适当的方式结合不同的算法。为了实现颜色一致性算法的选择和组合,本文采用自然图像统计来识别彩色图像最重要的特征。然后,根据这些图像特征,为特定图像选择合适的颜色恒定算法(或算法的最佳组合)。为了捕获图像特征,使用威布尔参数化(例如纹理和对比度)。实验表明,在11,000张图像的大型数据集上,我们的方法优于当前最先进的单一算法,以及组合几种算法的简单替代方案。
{"title":"Color Constancy using Natural Image Statistics","authors":"A. Gijsenij, T. Gevers","doi":"10.1109/CVPR.2007.383206","DOIUrl":"https://doi.org/10.1109/CVPR.2007.383206","url":null,"abstract":"Although many color constancy methods exist, they are all based on specific assumptions such as the set of possible light sources, or the spatial and spectral characteristics of images. As a consequence, no algorithm can be considered as universal. However, with the large variety of available methods, the question is how to select the method that induces equivalent classes for different image characteristics. Furthermore, the subsequent question is how to combine the different algorithms in a proper way. To achieve selection and combining of color constancy algorithms, in this paper, natural image statistics are used to identify the most important characteristics of color images. Then, based on these image characteristics, the proper color constancy algorithm (or best combination of algorithms) is selected for a specific image. To capture the image characteristics, the Weibull parameterization (e.g. texture and contrast) is used. Experiments show that, on a large data set of 11,000 images, our approach outperforms current state-of-the-art single algorithms, as well as simple alternatives for combining several algorithms.","PeriodicalId":351008,"journal":{"name":"2007 IEEE Conference on Computer Vision and Pattern Recognition","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127493361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 256
期刊
2007 IEEE Conference on Computer Vision and Pattern Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1