首页 > 最新文献

2014 IEEE International Conference on Image Processing (ICIP)最新文献

英文 中文
A tool for fast ground truth generation for object detection and tracking from video 一个快速生成地面真相的工具,用于从视频中检测和跟踪目标
Pub Date : 2014-10-01 DOI: 10.1109/ICIP.2014.7025073
F. Comaschi, S. Stuijk, T. Basten, H. Corporaal
Object detection and tracking is one of the most important components in computer vision applications. To carefully evaluate the performance of detection and tracking algorithms, it is important to develop benchmark data sets. One of the most tedious and error-prone aspects when developing benchmarks, is the generation of the ground truth. This paper presents FAST-GT (FAst Semi-automatic Tool for Ground Truth generation), a new generic framework for the semiautomatic generation of ground truths. FAST-GT reduces the need for manual intervention thus speeding-up the ground-truthing process.
目标检测与跟踪是计算机视觉应用的重要组成部分之一。为了仔细评估检测和跟踪算法的性能,开发基准数据集非常重要。在开发基准时,最乏味和最容易出错的一个方面是生成基本事实。本文提出了FAst - gt (FAst Semi-automatic Tool for Ground Truth generation),一种新的地面真理半自动生成通用框架。FAST-GT减少了人工干预的需要,从而加快了地面测实过程。
{"title":"A tool for fast ground truth generation for object detection and tracking from video","authors":"F. Comaschi, S. Stuijk, T. Basten, H. Corporaal","doi":"10.1109/ICIP.2014.7025073","DOIUrl":"https://doi.org/10.1109/ICIP.2014.7025073","url":null,"abstract":"Object detection and tracking is one of the most important components in computer vision applications. To carefully evaluate the performance of detection and tracking algorithms, it is important to develop benchmark data sets. One of the most tedious and error-prone aspects when developing benchmarks, is the generation of the ground truth. This paper presents FAST-GT (FAst Semi-automatic Tool for Ground Truth generation), a new generic framework for the semiautomatic generation of ground truths. FAST-GT reduces the need for manual intervention thus speeding-up the ground-truthing process.","PeriodicalId":6856,"journal":{"name":"2014 IEEE International Conference on Image Processing (ICIP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72752699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Singular vector decomposition based adaptive transform for motion compensation residuals 基于奇异向量分解的运动补偿残差自适应变换
Pub Date : 2014-10-01 DOI: 10.1109/ICIP.2014.7025838
Xiaoran Cao, Yun He
Video coding standards commonly use discrete cosine transform (DCT) to transform the motion compensation (M-C) residuals. However, the MC residuals have much weaker correlation than image pixels, and DCT is not the optimized transform for them. In this paper, we propose an adaptive transform structure for MC residuals. Unlike traditional approaches which use a predefined transform core, we apply singular value decomposition (SVD) on the prediction block and use the eigenvector matrices as the transform core. Experiments show that this adaptive transform is more efficient compared with the traditional approach. An average 2.0% bit rate reduction is achieved when implemented on H.265/HEVC.
视频编码标准常用离散余弦变换(DCT)对运动补偿残差进行变换。然而,残差与图像像素的相关性要弱得多,DCT并不是残差的最佳变换。本文提出了一种MC残差的自适应变换结构。与传统方法使用预定义的变换核不同,我们在预测块上应用奇异值分解(SVD),并使用特征向量矩阵作为变换核。实验表明,与传统方法相比,这种自适应变换具有更高的效率。在H.265/HEVC上实现时,比特率平均降低2.0%。
{"title":"Singular vector decomposition based adaptive transform for motion compensation residuals","authors":"Xiaoran Cao, Yun He","doi":"10.1109/ICIP.2014.7025838","DOIUrl":"https://doi.org/10.1109/ICIP.2014.7025838","url":null,"abstract":"Video coding standards commonly use discrete cosine transform (DCT) to transform the motion compensation (M-C) residuals. However, the MC residuals have much weaker correlation than image pixels, and DCT is not the optimized transform for them. In this paper, we propose an adaptive transform structure for MC residuals. Unlike traditional approaches which use a predefined transform core, we apply singular value decomposition (SVD) on the prediction block and use the eigenvector matrices as the transform core. Experiments show that this adaptive transform is more efficient compared with the traditional approach. An average 2.0% bit rate reduction is achieved when implemented on H.265/HEVC.","PeriodicalId":6856,"journal":{"name":"2014 IEEE International Conference on Image Processing (ICIP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73399527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A gradient-like variational Bayesian approach: Application to microwave imaging for breast tumor detection 类梯度变分贝叶斯方法:在乳腺肿瘤微波成像检测中的应用
Pub Date : 2014-10-01 DOI: 10.1109/ICIP.2014.7025342
L. Gharsalli, B. Duchêne, A. Mohammad-Djafari, H. Ayasso
In this paper a nonlinear inverse scattering problem is solved by means of a variational Bayesian approach. The objective is to detect breast tumor from measurements of the scattered fields at different frequencies and for several illuminations. This inverse problem is known to be non linear and ill-posed. Thus, it needs to be regularized by introducing a priori information. Herein, prior information available on the sought object is that it is composed of a finite known number of different materials distributed in compact regions. It is accounted for by tackling the problem in a Bayesian framework. Then, the true joint posterior is approximated by a separable law by mean of a gradient-like variational Bayesian technique. The latter is adapted to complex valued contrast and used to compute the posterior estimators through a joint update of the shape parameters of the approximating marginals. Both permittivity and conductivity maps are reconstructed and the results obtained on synthetic data show a good reconstruction quality and a convergence faster than that of the classical variational Bayesian approach.
本文用变分贝叶斯方法求解了一个非线性逆散射问题。目的是通过测量不同频率和不同光照下的散射场来检测乳腺肿瘤。这个反问题是已知的非线性和不适定的。因此,需要通过引入先验信息对其进行正则化。在这里,所寻求的对象的先验信息是,它是由分布在紧凑区域的有限已知数量的不同材料组成的。这是通过在贝叶斯框架中处理问题来解释的。然后,利用类梯度变分贝叶斯技术,用可分离律逼近真关节后验。后者适用于复值对比,并通过联合更新近似边缘的形状参数来计算后验估计量。在综合数据上重建了介电常数图和电导率图,结果表明重建质量好,收敛速度快于经典变分贝叶斯方法。
{"title":"A gradient-like variational Bayesian approach: Application to microwave imaging for breast tumor detection","authors":"L. Gharsalli, B. Duchêne, A. Mohammad-Djafari, H. Ayasso","doi":"10.1109/ICIP.2014.7025342","DOIUrl":"https://doi.org/10.1109/ICIP.2014.7025342","url":null,"abstract":"In this paper a nonlinear inverse scattering problem is solved by means of a variational Bayesian approach. The objective is to detect breast tumor from measurements of the scattered fields at different frequencies and for several illuminations. This inverse problem is known to be non linear and ill-posed. Thus, it needs to be regularized by introducing a priori information. Herein, prior information available on the sought object is that it is composed of a finite known number of different materials distributed in compact regions. It is accounted for by tackling the problem in a Bayesian framework. Then, the true joint posterior is approximated by a separable law by mean of a gradient-like variational Bayesian technique. The latter is adapted to complex valued contrast and used to compute the posterior estimators through a joint update of the shape parameters of the approximating marginals. Both permittivity and conductivity maps are reconstructed and the results obtained on synthetic data show a good reconstruction quality and a convergence faster than that of the classical variational Bayesian approach.","PeriodicalId":6856,"journal":{"name":"2014 IEEE International Conference on Image Processing (ICIP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73412425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Task-driven dictionary learning for hyperspectral image classification with structured sparsity priors 基于任务驱动字典学习的结构化稀疏先验高光谱图像分类
Pub Date : 2014-10-01 DOI: 10.1109/ICIP.2014.7026065
Xiaoxia Sun, N. Nasrabadi, T. Tran
In hyperspectral pixel classification, previous research have shown that the sparse representation classifier can achieve a better performance when exploiting the neighboring test pixels through enforcing different structured sparsity priors. In this paper, we propose a supervised sparse-representation-based dictionary learning method with joint or Laplacian s-parsity priors. The proposed method has numerous advantages over the existing dictionary learning techniques. It uses a structured sparsity and provides a more robust and stable sparse coefficients. Besides, it is capable of reducing the classification error by jointly optimizing the dictionary and the classifier's parameters during the dictionary training stage.
在高光谱像素分类中,已有研究表明,稀疏表示分类器通过施加不同的结构稀疏先验,可以在利用相邻测试像素时获得更好的性能。在本文中,我们提出了一种基于联合或拉普拉斯s-parsity先验的监督稀疏表示字典学习方法。与现有的字典学习技术相比,该方法具有许多优点。它使用结构化稀疏性,并提供更鲁棒和稳定的稀疏系数。在字典训练阶段,通过对字典和分类器参数的联合优化,降低了分类误差。
{"title":"Task-driven dictionary learning for hyperspectral image classification with structured sparsity priors","authors":"Xiaoxia Sun, N. Nasrabadi, T. Tran","doi":"10.1109/ICIP.2014.7026065","DOIUrl":"https://doi.org/10.1109/ICIP.2014.7026065","url":null,"abstract":"In hyperspectral pixel classification, previous research have shown that the sparse representation classifier can achieve a better performance when exploiting the neighboring test pixels through enforcing different structured sparsity priors. In this paper, we propose a supervised sparse-representation-based dictionary learning method with joint or Laplacian s-parsity priors. The proposed method has numerous advantages over the existing dictionary learning techniques. It uses a structured sparsity and provides a more robust and stable sparse coefficients. Besides, it is capable of reducing the classification error by jointly optimizing the dictionary and the classifier's parameters during the dictionary training stage.","PeriodicalId":6856,"journal":{"name":"2014 IEEE International Conference on Image Processing (ICIP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73420374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Efficient 2D human pose estimation using mean-shift 利用mean-shift进行有效的二维人体姿态估计
Pub Date : 2014-10-01 DOI: 10.1109/ICIP.2014.7025685
A. R. Khalid, Ali Hassan, M. Taj
In 2D pose estimation, each limb is parametrized by it position(2D), scale(1D) and orientation(1D). One of the key bottlenecks is the exhaustive search in this 4D limb space where only a few maxima in the space are desired. To reduce the search space, we reformulate this problem in terms of finding the modes of a likelihood distribution and solve it using the Mean-Shift algorithm. Ours is the first paper in the pose estimation community to use such an approach. In addition, we describe a complete top-down approach that estimates limbs in a sequential pair-wise manner. This allows us to use Kinematic Constraints before processing, requiring us to perform search in only a small sub-region of the image for each limb. We finally devise a PCA based pose validation criteria that enables us to prune invalid hypotheses. Combining these search-space reduction techniques allows our method to generate results at par with the state-of-the-art, while saving more than 80% computations when compared to full image search.
在二维姿态估计中,每个肢体由其位置(2D)、尺度(1D)和方向(1D)进行参数化。其中一个关键的瓶颈是在这个四维分支空间中穷举搜索,在这个空间中只需要几个最大值。为了减少搜索空间,我们将这个问题重新表述为寻找似然分布的模式,并使用Mean-Shift算法来解决它。我们的论文是姿态估计领域使用这种方法的第一篇论文。此外,我们描述了一种完整的自上而下的方法,以顺序成对的方式估计肢体。这允许我们在处理之前使用运动学约束,要求我们只在图像的一小部分区域对每个肢体进行搜索。最后,我们设计了一个基于PCA的姿态验证标准,使我们能够修剪无效的假设。结合这些搜索空间缩减技术,我们的方法可以生成与最先进的结果相当的结果,同时与完整的图像搜索相比节省80%以上的计算。
{"title":"Efficient 2D human pose estimation using mean-shift","authors":"A. R. Khalid, Ali Hassan, M. Taj","doi":"10.1109/ICIP.2014.7025685","DOIUrl":"https://doi.org/10.1109/ICIP.2014.7025685","url":null,"abstract":"In 2D pose estimation, each limb is parametrized by it position(2D), scale(1D) and orientation(1D). One of the key bottlenecks is the exhaustive search in this 4D limb space where only a few maxima in the space are desired. To reduce the search space, we reformulate this problem in terms of finding the modes of a likelihood distribution and solve it using the Mean-Shift algorithm. Ours is the first paper in the pose estimation community to use such an approach. In addition, we describe a complete top-down approach that estimates limbs in a sequential pair-wise manner. This allows us to use Kinematic Constraints before processing, requiring us to perform search in only a small sub-region of the image for each limb. We finally devise a PCA based pose validation criteria that enables us to prune invalid hypotheses. Combining these search-space reduction techniques allows our method to generate results at par with the state-of-the-art, while saving more than 80% computations when compared to full image search.","PeriodicalId":6856,"journal":{"name":"2014 IEEE International Conference on Image Processing (ICIP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80111903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A simple and efficient algorithm for dot patterns reconstruction 一种简单有效的网点图案重建算法
Pub Date : 2014-10-01 DOI: 10.1109/ICIP.2014.7025958
Mahmoud Melkemi, M. Elbaz
The problem of reconstructing the shape of dot patterns (sampled planar connected regions) is extensively studied in the literature. Up till now, the existing works do not provide guarantee for the correctness of the obtained solution, usually the results was validated empirically according to human perception. In this article, we present a new algorithm that guarantees reconstruction of the shape for a set of points satisfying some density conditions. Many experimental results show that the algorithm usually gives an adequate reconstruction for non-uniformly and weakly-sampled patterns. An advantage of the algorithm is its simplicity. Once the Delaunay triangulation of the input data is computed, simple rules are applied to the Delaunay edges in order to select those belonging to the reconstruction graph.
点图案(采样平面连通区域)的形状重建问题在文献中得到了广泛的研究。到目前为止,现有的工作并不能保证所得到的解的正确性,通常是根据人的感知对结果进行经验验证。在本文中,我们提出了一种新的算法,保证了一组满足密度条件的点的形状重建。大量实验结果表明,该算法对非均匀和弱采样模式具有较好的重构效果。该算法的一个优点是简单。一旦输入数据的Delaunay三角化计算完成,简单的规则被应用到Delaunay边以选择那些属于重建图的边。
{"title":"A simple and efficient algorithm for dot patterns reconstruction","authors":"Mahmoud Melkemi, M. Elbaz","doi":"10.1109/ICIP.2014.7025958","DOIUrl":"https://doi.org/10.1109/ICIP.2014.7025958","url":null,"abstract":"The problem of reconstructing the shape of dot patterns (sampled planar connected regions) is extensively studied in the literature. Up till now, the existing works do not provide guarantee for the correctness of the obtained solution, usually the results was validated empirically according to human perception. In this article, we present a new algorithm that guarantees reconstruction of the shape for a set of points satisfying some density conditions. Many experimental results show that the algorithm usually gives an adequate reconstruction for non-uniformly and weakly-sampled patterns. An advantage of the algorithm is its simplicity. Once the Delaunay triangulation of the input data is computed, simple rules are applied to the Delaunay edges in order to select those belonging to the reconstruction graph.","PeriodicalId":6856,"journal":{"name":"2014 IEEE International Conference on Image Processing (ICIP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81374489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Robust object tracking via multi-task dynamic sparse model 基于多任务动态稀疏模型的鲁棒目标跟踪
Pub Date : 2014-10-01 DOI: 10.1109/ICIP.2014.7025078
Zhangjian Ji, Weiqiang Wang
Recently, sparse representation has been widely applied to some generative tracking methods, which learn the representation of each particle independently and do not consider the correlation between the representation of each particle in the time domain. In this paper, we formulate the object tracking in a particle filter framework as a multi-task dynamic sparse learning problem, which we denote as Multi-Task Dynamic Sparse Tracking(MTDST). By exploring the popular sparsity-inducing ℓ1, 2 mixed norms, we regularize the representation problem to enforce joint sparsity and learn the particle representations together. Meanwhile, we also introduce the innovation sparse term in the tracking model. As compared to previous methods, our method mines the independencies between particles and the correlation of particle representation in the time domain, which improves the tracking performance. In addition, because the loft least square is robust to the outliers, we adopt the loft least square to replace the least square to calculate the likelihood probability. In the updating scheme, we eliminate the influences of occlusion pixels when updating the templates. The comprehensive experiments on the several challenging image sequences demonstrate that the proposed method consistently outperforms the existing state-of-the-art methods.
近年来,稀疏表示被广泛应用于一些生成式跟踪方法中,这些方法独立地学习每个粒子的表示,而不考虑每个粒子在时域内表示之间的相关性。本文将粒子滤波框架下的目标跟踪问题表述为一个多任务动态稀疏学习问题,我们称之为多任务动态稀疏跟踪(MTDST)。通过探索流行的稀疏性诱导的1,2混合规范,我们正则化了表示问题以增强联合稀疏性,并一起学习了粒子表示。同时,在跟踪模型中引入了创新稀疏项。与以往的方法相比,我们的方法挖掘了粒子之间的独立性和粒子表示在时域上的相关性,提高了跟踪性能。此外,由于最小二乘法对异常值具有鲁棒性,我们采用最小二乘法代替最小二乘法来计算似然概率。在更新方案中,我们在更新模板时消除了遮挡像素的影响。在几个具有挑战性的图像序列上的综合实验表明,该方法始终优于现有的最先进的方法。
{"title":"Robust object tracking via multi-task dynamic sparse model","authors":"Zhangjian Ji, Weiqiang Wang","doi":"10.1109/ICIP.2014.7025078","DOIUrl":"https://doi.org/10.1109/ICIP.2014.7025078","url":null,"abstract":"Recently, sparse representation has been widely applied to some generative tracking methods, which learn the representation of each particle independently and do not consider the correlation between the representation of each particle in the time domain. In this paper, we formulate the object tracking in a particle filter framework as a multi-task dynamic sparse learning problem, which we denote as Multi-Task Dynamic Sparse Tracking(MTDST). By exploring the popular sparsity-inducing ℓ1, 2 mixed norms, we regularize the representation problem to enforce joint sparsity and learn the particle representations together. Meanwhile, we also introduce the innovation sparse term in the tracking model. As compared to previous methods, our method mines the independencies between particles and the correlation of particle representation in the time domain, which improves the tracking performance. In addition, because the loft least square is robust to the outliers, we adopt the loft least square to replace the least square to calculate the likelihood probability. In the updating scheme, we eliminate the influences of occlusion pixels when updating the templates. The comprehensive experiments on the several challenging image sequences demonstrate that the proposed method consistently outperforms the existing state-of-the-art methods.","PeriodicalId":6856,"journal":{"name":"2014 IEEE International Conference on Image Processing (ICIP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82121908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Multi-layer temporal graphical model for head pose estimation in real-world videos 真实视频中头部姿态估计的多层时间图形模型
Pub Date : 2014-10-01 DOI: 10.1109/ICIP.2014.7025686
Meltem Demirkus, Doina Precup, James J. Clark, T. Arbel
Head pose estimation has been receiving a lot of attention due to its wide range of possible applications. However, most approaches in the literature have focused on head pose estimation in controlled environments. Head pose estimation has recently begun to be applied to real-world environments. However, the focus has been on estimation from single images or video frames. Furthermore, most approaches frame the problem as classification into a set of coarse pose bins, rather than performing continuous pose estimation. The proposed multi-layer probabilistic temporal graphical model robustly estimates continuous head pose angle while leveraging the strengths of multiple features into account. Experiments performed on a large, real-world video database show that our approach not only significantly outperforms alternative head pose approaches, but also provides a pose probability assigned at each video frame, which permits robust temporal, probabilistic fusion of pose information over the entire video sequence.
头部姿态估计因其广泛的应用前景而备受关注。然而,文献中的大多数方法都集中在受控环境下的头部姿势估计。头部姿态估计最近开始应用于现实环境。然而,重点一直是对单个图像或视频帧的估计。此外,大多数方法将问题框架为将问题分类到一组粗糙的姿态箱中,而不是执行连续的姿态估计。提出的多层概率时间图模型在充分考虑多个特征的优势的同时,对连续头姿角进行鲁棒估计。在大型真实视频数据库上进行的实验表明,我们的方法不仅显著优于其他头部姿势方法,而且还提供了在每个视频帧分配的姿势概率,从而允许在整个视频序列上进行姿势信息的鲁棒时间概率融合。
{"title":"Multi-layer temporal graphical model for head pose estimation in real-world videos","authors":"Meltem Demirkus, Doina Precup, James J. Clark, T. Arbel","doi":"10.1109/ICIP.2014.7025686","DOIUrl":"https://doi.org/10.1109/ICIP.2014.7025686","url":null,"abstract":"Head pose estimation has been receiving a lot of attention due to its wide range of possible applications. However, most approaches in the literature have focused on head pose estimation in controlled environments. Head pose estimation has recently begun to be applied to real-world environments. However, the focus has been on estimation from single images or video frames. Furthermore, most approaches frame the problem as classification into a set of coarse pose bins, rather than performing continuous pose estimation. The proposed multi-layer probabilistic temporal graphical model robustly estimates continuous head pose angle while leveraging the strengths of multiple features into account. Experiments performed on a large, real-world video database show that our approach not only significantly outperforms alternative head pose approaches, but also provides a pose probability assigned at each video frame, which permits robust temporal, probabilistic fusion of pose information over the entire video sequence.","PeriodicalId":6856,"journal":{"name":"2014 IEEE International Conference on Image Processing (ICIP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82467481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Calibration of an industrial vision system using an ellipsoid 用椭球体标定工业视觉系统
Pub Date : 2014-10-01 DOI: 10.1109/ICIP.2014.7025700
J. Heather
A robust multi-camera calibration algorithm developed for an industrial vision system is described. An ellipsoid with a simple surface pattern and accurately known geometry is used as a calibration target. Our algorithm automatically detects the presence of the ball on the conveyor and accurately determines the position of its outline and marker lines in each image frame using efficient image processing techniques. A fast least-squares minimization is then performed to determine the optimal camera and motion parameters. The method is fully automatic and requires no human interaction or guidance, helping to minimize machine setup and maintenance times. The calibration algorithm has been demonstrated on real image captures and performance is quantified using simulated image sequences.
介绍了一种用于工业视觉系统的鲁棒多相机标定算法。使用具有简单表面图案和精确已知几何形状的椭球作为标定目标。我们的算法自动检测传送带上的球的存在,并使用高效的图像处理技术准确地确定其轮廓和标记线在每个图像帧中的位置。然后进行快速最小二乘最小化,以确定最佳的相机和运动参数。该方法是全自动的,无需人工交互或指导,有助于最大限度地减少机器设置和维护时间。标定算法已在实际图像捕获上进行了验证,并使用模拟图像序列对其性能进行了量化。
{"title":"Calibration of an industrial vision system using an ellipsoid","authors":"J. Heather","doi":"10.1109/ICIP.2014.7025700","DOIUrl":"https://doi.org/10.1109/ICIP.2014.7025700","url":null,"abstract":"A robust multi-camera calibration algorithm developed for an industrial vision system is described. An ellipsoid with a simple surface pattern and accurately known geometry is used as a calibration target. Our algorithm automatically detects the presence of the ball on the conveyor and accurately determines the position of its outline and marker lines in each image frame using efficient image processing techniques. A fast least-squares minimization is then performed to determine the optimal camera and motion parameters. The method is fully automatic and requires no human interaction or guidance, helping to minimize machine setup and maintenance times. The calibration algorithm has been demonstrated on real image captures and performance is quantified using simulated image sequences.","PeriodicalId":6856,"journal":{"name":"2014 IEEE International Conference on Image Processing (ICIP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76570603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Low-rank based compact representation of motion capture data 基于低秩的运动捕捉数据的紧凑表示
Pub Date : 2014-10-01 DOI: 10.1109/ICIP.2014.7025296
Junhui Hou, Lap-Pui Chau, Ying He, N. Magnenat-Thalmann
In this paper, we propose a practical, elegant and effective scheme for compact mocap data representation. Guided by our analysis of the unique properties of mocap data, the input mocap sequence is optimally segmented into a set of subsequences. Then, we project the subsequences onto a pair of computational orthogonal matrices to explore strong low-rank characteristic within and among the subsequences. The experimental results show that the proposed scheme is much more effective for reducing the data size, compared with the existing techniques.
在本文中,我们提出了一种实用、优雅和有效的紧凑动作捕捉数据表示方案。在我们对动作捕捉数据独特属性分析的指导下,输入动作捕捉序列被最佳分割成一组子序列。然后,我们将子序列投影到一对计算正交矩阵上,以探索子序列内部和子序列之间的强低秩特征。实验结果表明,与现有的算法相比,该算法能够有效地减小数据量。
{"title":"Low-rank based compact representation of motion capture data","authors":"Junhui Hou, Lap-Pui Chau, Ying He, N. Magnenat-Thalmann","doi":"10.1109/ICIP.2014.7025296","DOIUrl":"https://doi.org/10.1109/ICIP.2014.7025296","url":null,"abstract":"In this paper, we propose a practical, elegant and effective scheme for compact mocap data representation. Guided by our analysis of the unique properties of mocap data, the input mocap sequence is optimally segmented into a set of subsequences. Then, we project the subsequences onto a pair of computational orthogonal matrices to explore strong low-rank characteristic within and among the subsequences. The experimental results show that the proposed scheme is much more effective for reducing the data size, compared with the existing techniques.","PeriodicalId":6856,"journal":{"name":"2014 IEEE International Conference on Image Processing (ICIP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76102861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2014 IEEE International Conference on Image Processing (ICIP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1