首页 > 最新文献

2013 IEEE International Conference on Computer Vision最新文献

英文 中文
From Semi-supervised to Transfer Counting of Crowds 从半监督到人群转移计数
Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.270
Chen Change Loy, S. Gong, T. Xiang
Regression-based techniques have shown promising results for people counting in crowded scenes. However, most existing techniques require expensive and laborious data annotation for model training. In this study, we propose to address this problem from three perspectives: (1) Instead of exhaustively annotating every single frame, the most informative frames are selected for annotation automatically and actively. (2) Rather than learning from only labelled data, the abundant unlabelled data are exploited. (3) Labelled data from other scenes are employed to further alleviate the burden for data annotation. All three ideas are implemented in a unified active and semi-supervised regression framework with ability to perform transfer learning, by exploiting the underlying geometric structure of crowd patterns via manifold analysis. Extensive experiments validate the effectiveness of our approach.
基于回归的技术已经显示出在拥挤场景中计数的良好结果。然而,大多数现有技术需要昂贵且费力的数据注释来进行模型训练。在本研究中,我们建议从三个方面来解决这一问题:(1)不是对每一帧都进行详尽的注释,而是自动地、主动地选择信息量最大的帧进行注释。(2)利用大量的未标记数据,而不是仅仅从标记数据中学习。(3)利用其他场景的标注数据,进一步减轻数据标注的负担。所有这三个想法都是在一个统一的主动和半监督回归框架中实现的,该框架具有执行迁移学习的能力,通过流形分析利用群体模式的潜在几何结构。大量的实验验证了我们方法的有效性。
{"title":"From Semi-supervised to Transfer Counting of Crowds","authors":"Chen Change Loy, S. Gong, T. Xiang","doi":"10.1109/ICCV.2013.270","DOIUrl":"https://doi.org/10.1109/ICCV.2013.270","url":null,"abstract":"Regression-based techniques have shown promising results for people counting in crowded scenes. However, most existing techniques require expensive and laborious data annotation for model training. In this study, we propose to address this problem from three perspectives: (1) Instead of exhaustively annotating every single frame, the most informative frames are selected for annotation automatically and actively. (2) Rather than learning from only labelled data, the abundant unlabelled data are exploited. (3) Labelled data from other scenes are employed to further alleviate the burden for data annotation. All three ideas are implemented in a unified active and semi-supervised regression framework with ability to perform transfer learning, by exploiting the underlying geometric structure of crowd patterns via manifold analysis. Extensive experiments validate the effectiveness of our approach.","PeriodicalId":6351,"journal":{"name":"2013 IEEE International Conference on Computer Vision","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81721763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 142
Restoring an Image Taken through a Window Covered with Dirt or Rain 恢复通过被灰尘或雨水覆盖的窗户拍摄的图像
Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.84
D. Eigen, Dilip Krishnan, R. Fergus
Photographs taken through a window are often compromised by dirt or rain present on the window surface. Common cases of this include pictures taken from inside a vehicle, or outdoor security cameras mounted inside a protective enclosure. At capture time, defocus can be used to remove the artifacts, but this relies on achieving a shallow depth-of-field and placement of the camera close to the window. Instead, we present a post-capture image processing solution that can remove localized rain and dirt artifacts from a single image. We collect a dataset of clean/corrupted image pairs which are then used to train a specialized form of convolutional neural network. This learns how to map corrupted image patches to clean ones, implicitly capturing the characteristic appearance of dirt and water droplets in natural images. Our models demonstrate effective removal of dirt and rain in outdoor test conditions.
透过窗户拍摄的照片通常会受到窗户表面灰尘或雨水的影响。常见的情况包括从车内拍摄的照片,或安装在保护外壳内的室外安全摄像头。在捕捉时,散焦可以用来去除伪影,但这依赖于实现较浅的景深和相机靠近窗口的位置。相反,我们提出了一种捕获后的图像处理解决方案,可以从单个图像中去除局部的雨水和污垢伪影。我们收集了一个干净/损坏图像对的数据集,然后用于训练一种特殊形式的卷积神经网络。它学习如何将损坏的图像块映射到干净的图像块,隐含地捕获自然图像中污垢和水滴的特征外观。我们的模型证明了在室外测试条件下有效去除污垢和雨水。
{"title":"Restoring an Image Taken through a Window Covered with Dirt or Rain","authors":"D. Eigen, Dilip Krishnan, R. Fergus","doi":"10.1109/ICCV.2013.84","DOIUrl":"https://doi.org/10.1109/ICCV.2013.84","url":null,"abstract":"Photographs taken through a window are often compromised by dirt or rain present on the window surface. Common cases of this include pictures taken from inside a vehicle, or outdoor security cameras mounted inside a protective enclosure. At capture time, defocus can be used to remove the artifacts, but this relies on achieving a shallow depth-of-field and placement of the camera close to the window. Instead, we present a post-capture image processing solution that can remove localized rain and dirt artifacts from a single image. We collect a dataset of clean/corrupted image pairs which are then used to train a specialized form of convolutional neural network. This learns how to map corrupted image patches to clean ones, implicitly capturing the characteristic appearance of dirt and water droplets in natural images. Our models demonstrate effective removal of dirt and rain in outdoor test conditions.","PeriodicalId":6351,"journal":{"name":"2013 IEEE International Conference on Computer Vision","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81785454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 405
Detecting Irregular Curvilinear Structures in Gray Scale and Color Imagery Using Multi-directional Oriented Flux 利用多向定向磁通检测灰度和彩色图像中的不规则曲线结构
Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.196
Engin Türetken, C. Becker, Przemyslaw Glowacki, Fethallah Benmansour, P. Fua
We propose a new approach to detecting irregular curvilinear structures in noisy image stacks. In contrast to earlier approaches that rely on circular models of the cross-sections, ours allows for the arbitrarily-shaped ones that are prevalent in biological imagery. This is achieved by maximizing the image gradient flux along multiple directions and radii, instead of only two with a unique radius as is usually done. This yields a more complex optimization problem for which we propose a computationally efficient solution. We demonstrate the effectiveness of our approach on a wide range of challenging gray scale and color datasets and show that it outperforms existing techniques, especially on very irregular structures.
提出了一种检测噪声图像堆中不规则曲线结构的新方法。与早期依赖圆形截面模型的方法相比,我们的方法允许在生物图像中普遍存在的任意形状的截面。这是通过沿着多个方向和半径最大化图像梯度通量来实现的,而不是像通常那样只有两个具有唯一半径的方向。这产生了一个更复杂的优化问题,我们提出了一个计算效率高的解决方案。我们证明了我们的方法在各种具有挑战性的灰度和颜色数据集上的有效性,并表明它优于现有的技术,特别是在非常不规则的结构上。
{"title":"Detecting Irregular Curvilinear Structures in Gray Scale and Color Imagery Using Multi-directional Oriented Flux","authors":"Engin Türetken, C. Becker, Przemyslaw Glowacki, Fethallah Benmansour, P. Fua","doi":"10.1109/ICCV.2013.196","DOIUrl":"https://doi.org/10.1109/ICCV.2013.196","url":null,"abstract":"We propose a new approach to detecting irregular curvilinear structures in noisy image stacks. In contrast to earlier approaches that rely on circular models of the cross-sections, ours allows for the arbitrarily-shaped ones that are prevalent in biological imagery. This is achieved by maximizing the image gradient flux along multiple directions and radii, instead of only two with a unique radius as is usually done. This yields a more complex optimization problem for which we propose a computationally efficient solution. We demonstrate the effectiveness of our approach on a wide range of challenging gray scale and color datasets and show that it outperforms existing techniques, especially on very irregular structures.","PeriodicalId":6351,"journal":{"name":"2013 IEEE International Conference on Computer Vision","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82598900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 43
Manifold Based Face Synthesis from Sparse Samples 基于稀疏样本的流形人脸合成
Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.275
Hongteng Xu, H. Zha
Data sparsity has been a thorny issue for manifold-based image synthesis, and in this paper we address this critical problem by leveraging ideas from transfer learning. Specifically, we propose methods based on generating auxiliary data in the form of synthetic samples using transformations of the original sparse samples. To incorporate the auxiliary data, we propose a weighted data synthesis method, which adaptively selects from the generated samples for inclusion during the manifold learning process via a weighted iterative algorithm. To demonstrate the feasibility of the proposed method, we apply it to the problem of face image synthesis from sparse samples. Compared with existing methods, the proposed method shows encouraging results with good performance improvements.
对于基于流形的图像合成来说,数据稀疏性一直是一个棘手的问题,在本文中,我们通过利用迁移学习的思想来解决这个关键问题。具体来说,我们提出了基于原始稀疏样本变换生成合成样本形式的辅助数据的方法。为了整合辅助数据,我们提出了一种加权数据合成方法,该方法通过加权迭代算法自适应地从生成的样本中选择包含在流形学习过程中的样本。为了证明该方法的可行性,我们将其应用于基于稀疏样本的人脸图像合成问题。与现有方法相比,该方法取得了令人鼓舞的效果,性能得到了较好的提高。
{"title":"Manifold Based Face Synthesis from Sparse Samples","authors":"Hongteng Xu, H. Zha","doi":"10.1109/ICCV.2013.275","DOIUrl":"https://doi.org/10.1109/ICCV.2013.275","url":null,"abstract":"Data sparsity has been a thorny issue for manifold-based image synthesis, and in this paper we address this critical problem by leveraging ideas from transfer learning. Specifically, we propose methods based on generating auxiliary data in the form of synthetic samples using transformations of the original sparse samples. To incorporate the auxiliary data, we propose a weighted data synthesis method, which adaptively selects from the generated samples for inclusion during the manifold learning process via a weighted iterative algorithm. To demonstrate the feasibility of the proposed method, we apply it to the problem of face image synthesis from sparse samples. Compared with existing methods, the proposed method shows encouraging results with good performance improvements.","PeriodicalId":6351,"journal":{"name":"2013 IEEE International Conference on Computer Vision","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84027165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Slice Sampling Particle Belief Propagation 切片采样粒子信念传播
Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.144
Oliver Müller, M. Yang, B. Rosenhahn
Inference in continuous label Markov random fields is a challenging task. We use particle belief propagation (PBP) for solving the inference problem in continuous label space. Sampling particles from the belief distribution is typically done by using Metropolis-Hastings (MH) Markov chain Monte Carlo (MCMC) methods which involves sampling from a proposal distribution. This proposal distribution has to be carefully designed depending on the particular model and input data to achieve fast convergence. We propose to avoid dependence on a proposal distribution by introducing a slice sampling based PBP algorithm. The proposed approach shows superior convergence performance on an image denoising toy example. Our findings are validated on a challenging relational 2D feature tracking application.
连续标记马尔可夫随机场的推理是一个具有挑战性的任务。我们使用粒子信念传播(PBP)来解决连续标签空间中的推理问题。从信念分布中采样粒子通常采用Metropolis-Hastings (MH) Markov chain Monte Carlo (MCMC)方法,该方法涉及从建议分布中采样。必须根据特定的模型和输入数据仔细设计该建议分布,以实现快速收敛。我们提出通过引入基于切片采样的PBP算法来避免对提案分布的依赖。该方法在图像去噪示例中表现出优异的收敛性能。我们的发现在一个具有挑战性的关系2D特征跟踪应用程序上得到了验证。
{"title":"Slice Sampling Particle Belief Propagation","authors":"Oliver Müller, M. Yang, B. Rosenhahn","doi":"10.1109/ICCV.2013.144","DOIUrl":"https://doi.org/10.1109/ICCV.2013.144","url":null,"abstract":"Inference in continuous label Markov random fields is a challenging task. We use particle belief propagation (PBP) for solving the inference problem in continuous label space. Sampling particles from the belief distribution is typically done by using Metropolis-Hastings (MH) Markov chain Monte Carlo (MCMC) methods which involves sampling from a proposal distribution. This proposal distribution has to be carefully designed depending on the particular model and input data to achieve fast convergence. We propose to avoid dependence on a proposal distribution by introducing a slice sampling based PBP algorithm. The proposed approach shows superior convergence performance on an image denoising toy example. Our findings are validated on a challenging relational 2D feature tracking application.","PeriodicalId":6351,"journal":{"name":"2013 IEEE International Conference on Computer Vision","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90642623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Multi-view Normal Field Integration for 3D Reconstruction of Mirroring Objects 镜像对象三维重建的多视图法向场集成
Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.311
Michael Weinmann, Aljosa Osep, R. Ruiters, R. Klein
In this paper, we present a novel, robust multi-view normal field integration technique for reconstructing the full 3D shape of mirroring objects. We employ a turntable-based setup with several cameras and displays. These are used to display illumination patterns which are reflected by the object surface. The pattern information observed in the cameras enables the calculation of individual volumetric normal fields for each combination of camera, display and turntable angle. As the pattern information might be blurred depending on the surface curvature or due to non-perfect mirroring surface characteristics, we locally adapt the decoding to the finest still resolvable pattern resolution. In complex real-world scenarios, the normal fields contain regions without observations due to occlusions and outliers due to interreflections and noise. Therefore, a robust reconstruction using only normal information is challenging. Via a non-parametric clustering of normal hypotheses derived for each point in the scene, we obtain both the most likely local surface normal and a local surface consistency estimate. This information is utilized in an iterative min-cut based variational approach to reconstruct the surface geometry.
在本文中,我们提出了一种新的,鲁棒的多视图法向场积分技术,用于重建镜像对象的完整三维形状。我们采用了一个基于转盘的设置,有几个摄像头和显示器。这些用于显示被物体表面反射的照明模式。在相机中观察到的模式信息可以计算每个相机、显示器和转盘角度组合的单个体积法向场。由于图案信息可能会因表面曲率或镜像表面特征不完美而模糊,因此我们局部调整解码以获得最佳的可分辨图案分辨率。在复杂的现实世界场景中,法向场包含由于遮挡和由于相互反射和噪声而产生的异常值而没有观测的区域。因此,仅使用正常信息进行鲁棒重建是具有挑战性的。通过对场景中每个点的正态假设进行非参数聚类,我们获得了最可能的局部表面法向和局部表面一致性估计。该信息被用于基于迭代最小切割的变分方法来重建表面几何形状。
{"title":"Multi-view Normal Field Integration for 3D Reconstruction of Mirroring Objects","authors":"Michael Weinmann, Aljosa Osep, R. Ruiters, R. Klein","doi":"10.1109/ICCV.2013.311","DOIUrl":"https://doi.org/10.1109/ICCV.2013.311","url":null,"abstract":"In this paper, we present a novel, robust multi-view normal field integration technique for reconstructing the full 3D shape of mirroring objects. We employ a turntable-based setup with several cameras and displays. These are used to display illumination patterns which are reflected by the object surface. The pattern information observed in the cameras enables the calculation of individual volumetric normal fields for each combination of camera, display and turntable angle. As the pattern information might be blurred depending on the surface curvature or due to non-perfect mirroring surface characteristics, we locally adapt the decoding to the finest still resolvable pattern resolution. In complex real-world scenarios, the normal fields contain regions without observations due to occlusions and outliers due to interreflections and noise. Therefore, a robust reconstruction using only normal information is challenging. Via a non-parametric clustering of normal hypotheses derived for each point in the scene, we obtain both the most likely local surface normal and a local surface consistency estimate. This information is utilized in an iterative min-cut based variational approach to reconstruct the surface geometry.","PeriodicalId":6351,"journal":{"name":"2013 IEEE International Conference on Computer Vision","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89518611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 35
Unifying Nuclear Norm and Bilinear Factorization Approaches for Low-Rank Matrix Decomposition 低秩矩阵分解的统一核范数与双线性分解方法
Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.309
R. Cabral, F. D. L. Torre, J. Costeira, Alexandre Bernardino
Low rank models have been widely used for the representation of shape, appearance or motion in computer vision problems. Traditional approaches to fit low rank models make use of an explicit bilinear factorization. These approaches benefit from fast numerical methods for optimization and easy kernelization. However, they suffer from serious local minima problems depending on the loss function and the amount/type of missing data. Recently, these low-rank models have alternatively been formulated as convex problems using the nuclear norm regularizer, unlike factorization methods, their numerical solvers are slow and it is unclear how to kernelize them or to impose a rank a priori. This paper proposes a unified approach to bilinear factorization and nuclear norm regularization, that inherits the benefits of both. We analyze the conditions under which these approaches are equivalent. Moreover, based on this analysis, we propose a new optimization algorithm and a "rank continuation'' strategy that outperform state-of-the-art approaches for Robust PCA, Structure from Motion and Photometric Stereo with outliers and missing data.
在计算机视觉问题中,低秩模型被广泛用于形状、外观或运动的表示。传统的低秩模型拟合方法使用显式双线性分解。这些方法得益于快速的数值方法优化和易于核化。然而,由于损失函数和丢失数据的数量/类型,它们存在严重的局部最小问题。最近,这些低秩模型被表述为使用核范数正则化器的凸问题,与因式分解方法不同,它们的数值求解速度很慢,并且不清楚如何对它们进行核化或施加先验秩。本文提出了一种统一的双线性分解和核范数正则化方法,继承了两者的优点。我们分析了这些方法是等价的条件。此外,在此分析的基础上,我们提出了一种新的优化算法和“秩延续”策略,该策略优于具有异常值和缺失数据的鲁棒主成分分析、运动和光度立体结构的最先进方法。
{"title":"Unifying Nuclear Norm and Bilinear Factorization Approaches for Low-Rank Matrix Decomposition","authors":"R. Cabral, F. D. L. Torre, J. Costeira, Alexandre Bernardino","doi":"10.1109/ICCV.2013.309","DOIUrl":"https://doi.org/10.1109/ICCV.2013.309","url":null,"abstract":"Low rank models have been widely used for the representation of shape, appearance or motion in computer vision problems. Traditional approaches to fit low rank models make use of an explicit bilinear factorization. These approaches benefit from fast numerical methods for optimization and easy kernelization. However, they suffer from serious local minima problems depending on the loss function and the amount/type of missing data. Recently, these low-rank models have alternatively been formulated as convex problems using the nuclear norm regularizer, unlike factorization methods, their numerical solvers are slow and it is unclear how to kernelize them or to impose a rank a priori. This paper proposes a unified approach to bilinear factorization and nuclear norm regularization, that inherits the benefits of both. We analyze the conditions under which these approaches are equivalent. Moreover, based on this analysis, we propose a new optimization algorithm and a \"rank continuation'' strategy that outperform state-of-the-art approaches for Robust PCA, Structure from Motion and Photometric Stereo with outliers and missing data.","PeriodicalId":6351,"journal":{"name":"2013 IEEE International Conference on Computer Vision","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89812993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 181
Saliency Detection via Absorbing Markov Chain 吸收马尔可夫链的显著性检测
Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.209
Bowen Jiang, L. Zhang, Huchuan Lu, Chuan Yang, Ming-Hsuan Yang
In this paper, we formulate saliency detection via absorbing Markov chain on an image graph model. We jointly consider the appearance divergence and spatial distribution of salient objects and the background. The virtual boundary nodes are chosen as the absorbing nodes in a Markov chain and the absorbed time from each transient node to boundary absorbing nodes is computed. The absorbed time of transient node measures its global similarity with all absorbing nodes, and thus salient objects can be consistently separated from the background when the absorbed time is used as a metric. Since the time from transient node to absorbing nodes relies on the weights on the path and their spatial distance, the background region on the center of image may be salient. We further exploit the equilibrium distribution in an ergodic Markov chain to reduce the absorbed time in the long-range smooth background regions. Extensive experiments on four benchmark datasets demonstrate robustness and efficiency of the proposed method against the state-of-the-art methods.
本文利用吸收马尔可夫链的方法,在图像图模型上实现显著性检测。我们共同考虑显著目标与背景的外观差异和空间分布。选取虚边界节点作为马尔可夫链中的吸收节点,计算每个暂态节点到边界吸收节点的吸收时间。瞬态节点的吸收时间衡量其与所有吸收节点的全局相似度,因此当吸收时间作为度量时,可以一致地将显著目标从背景中分离出来。由于瞬态节点到吸收节点的时间依赖于路径上的权值及其空间距离,因此图像中心的背景区域可能比较突出。我们进一步利用遍历马尔可夫链中的平衡分布来减少远程平滑背景区的吸收时间。在四个基准数据集上进行的大量实验表明,该方法相对于最先进的方法具有鲁棒性和效率。
{"title":"Saliency Detection via Absorbing Markov Chain","authors":"Bowen Jiang, L. Zhang, Huchuan Lu, Chuan Yang, Ming-Hsuan Yang","doi":"10.1109/ICCV.2013.209","DOIUrl":"https://doi.org/10.1109/ICCV.2013.209","url":null,"abstract":"In this paper, we formulate saliency detection via absorbing Markov chain on an image graph model. We jointly consider the appearance divergence and spatial distribution of salient objects and the background. The virtual boundary nodes are chosen as the absorbing nodes in a Markov chain and the absorbed time from each transient node to boundary absorbing nodes is computed. The absorbed time of transient node measures its global similarity with all absorbing nodes, and thus salient objects can be consistently separated from the background when the absorbed time is used as a metric. Since the time from transient node to absorbing nodes relies on the weights on the path and their spatial distance, the background region on the center of image may be salient. We further exploit the equilibrium distribution in an ergodic Markov chain to reduce the absorbed time in the long-range smooth background regions. Extensive experiments on four benchmark datasets demonstrate robustness and efficiency of the proposed method against the state-of-the-art methods.","PeriodicalId":6351,"journal":{"name":"2013 IEEE International Conference on Computer Vision","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89981262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 585
Learning Slow Features for Behaviour Analysis 学习行为分析的慢特性
Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.353
Lazaros Zafeiriou, M. Nicolaou, S. Zafeiriou, Symeon Nikitidis, M. Pantic
A recently introduced latent feature learning technique for time varying dynamic phenomena analysis is the so called Slow Feature Analysis (SFA). SFA is a deterministic component analysis technique for multi-dimensional sequences that by minimizing the variance of the first order time derivative approximation of the input signal finds uncorrelated projections that extract slowly-varying features ordered by their temporal consistency and constancy. In this paper, we propose a number of extensions in both the deterministic and the probabilistic SFA optimization frameworks. In particular, we derive a novel deterministic SFA algorithm that is able to identify linear projections that extract the common slowest varying features of two or more sequences. In addition, we propose an Expectation Maximization (EM) algorithm to perform inference in a probabilistic formulation of SFA and similarly extend it in order to handle two and more time varying data sequences. Moreover, we demonstrate that the probabilistic SFA (EMSFA) algorithm that discovers the common slowest varying latent space of multiple sequences can be combined with dynamic time warping techniques for robust sequence time alignment. The proposed SFA algorithms were applied for facial behavior analysis demonstrating their usefulness and appropriateness for this task.
最近提出的一种用于时变动态现象分析的潜在特征学习技术是慢特征分析(SFA)。SFA是一种针对多维序列的确定性成分分析技术,它通过最小化输入信号的一阶时间导数近似的方差,找到不相关的投影,提取按时间一致性和恒定性排序的缓慢变化特征。在本文中,我们在确定性和概率SFA优化框架中提出了一些扩展。特别是,我们推导了一种新的确定性SFA算法,该算法能够识别线性投影,提取两个或多个序列的共同最慢变化特征。此外,我们提出了一种期望最大化(EM)算法来对SFA的概率公式进行推理,并对其进行类似的扩展,以处理两个或多个时变数据序列。此外,我们还证明了概率SFA (EMSFA)算法可以发现多个序列的共同最慢变化潜在空间,并与动态时间规整技术相结合以实现鲁棒序列时间对齐。将所提出的SFA算法应用于面部行为分析,证明了其有效性和适用性。
{"title":"Learning Slow Features for Behaviour Analysis","authors":"Lazaros Zafeiriou, M. Nicolaou, S. Zafeiriou, Symeon Nikitidis, M. Pantic","doi":"10.1109/ICCV.2013.353","DOIUrl":"https://doi.org/10.1109/ICCV.2013.353","url":null,"abstract":"A recently introduced latent feature learning technique for time varying dynamic phenomena analysis is the so called Slow Feature Analysis (SFA). SFA is a deterministic component analysis technique for multi-dimensional sequences that by minimizing the variance of the first order time derivative approximation of the input signal finds uncorrelated projections that extract slowly-varying features ordered by their temporal consistency and constancy. In this paper, we propose a number of extensions in both the deterministic and the probabilistic SFA optimization frameworks. In particular, we derive a novel deterministic SFA algorithm that is able to identify linear projections that extract the common slowest varying features of two or more sequences. In addition, we propose an Expectation Maximization (EM) algorithm to perform inference in a probabilistic formulation of SFA and similarly extend it in order to handle two and more time varying data sequences. Moreover, we demonstrate that the probabilistic SFA (EMSFA) algorithm that discovers the common slowest varying latent space of multiple sequences can be combined with dynamic time warping techniques for robust sequence time alignment. The proposed SFA algorithms were applied for facial behavior analysis demonstrating their usefulness and appropriateness for this task.","PeriodicalId":6351,"journal":{"name":"2013 IEEE International Conference on Computer Vision","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86581464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
A Convex Optimization Framework for Active Learning 主动学习的凸优化框架
Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.33
Ehsan Elhamifar, G. Sapiro, A. Yang, S. Shankar Sasrty
In many image/video/web classification problems, we have access to a large number of unlabeled samples. However, it is typically expensive and time consuming to obtain labels for the samples. Active learning is the problem of progressively selecting and annotating the most informative unlabeled samples, in order to obtain a high classification performance. Most existing active learning algorithms select only one sample at a time prior to retraining the classifier. Hence, they are computationally expensive and cannot take advantage of parallel labeling systems such as Mechanical Turk. On the other hand, algorithms that allow the selection of multiple samples prior to retraining the classifier, may select samples that have significant information overlap or they involve solving a non-convex optimization. More importantly, the majority of active learning algorithms are developed for a certain classifier type such as SVM. In this paper, we develop an efficient active learning framework based on convex programming, which can select multiple samples at a time for annotation. Unlike the state of the art, our algorithm can be used in conjunction with any type of classifiers, including those of the family of the recently proposed Sparse Representation-based Classification (SRC). We use the two principles of classifier uncertainty and sample diversity in order to guide the optimization program towards selecting the most informative unlabeled samples, which have the least information overlap. Our method can incorporate the data distribution in the selection process by using the appropriate dissimilarity between pairs of samples. We show the effectiveness of our framework in person detection, scene categorization and face recognition on real-world datasets.
在许多图像/视频/网络分类问题中,我们可以访问大量未标记的样本。然而,为样品获取标签通常是昂贵和耗时的。主动学习是逐步选择和标注信息量最大的未标记样本,以获得较高的分类性能的问题。大多数现有的主动学习算法在重新训练分类器之前一次只选择一个样本。因此,它们在计算上是昂贵的,并且不能利用类似Mechanical Turk这样的并行标记系统。另一方面,允许在重新训练分类器之前选择多个样本的算法可能会选择具有重要信息重叠的样本,或者它们涉及解决非凸优化。更重要的是,大多数主动学习算法都是针对某种分类器(如SVM)开发的。在本文中,我们开发了一个基于凸规划的高效主动学习框架,它可以一次选择多个样本进行标注。与目前的技术不同,我们的算法可以与任何类型的分类器结合使用,包括最近提出的基于稀疏表示的分类(SRC)系列。我们使用分类器不确定性和样本多样性两个原则来指导优化程序选择信息最丰富、信息重叠最少的未标记样本。我们的方法可以利用样本对之间的适当不相似性,将数据分布纳入选择过程。我们在真实世界的数据集上展示了我们的框架在人物检测、场景分类和人脸识别方面的有效性。
{"title":"A Convex Optimization Framework for Active Learning","authors":"Ehsan Elhamifar, G. Sapiro, A. Yang, S. Shankar Sasrty","doi":"10.1109/ICCV.2013.33","DOIUrl":"https://doi.org/10.1109/ICCV.2013.33","url":null,"abstract":"In many image/video/web classification problems, we have access to a large number of unlabeled samples. However, it is typically expensive and time consuming to obtain labels for the samples. Active learning is the problem of progressively selecting and annotating the most informative unlabeled samples, in order to obtain a high classification performance. Most existing active learning algorithms select only one sample at a time prior to retraining the classifier. Hence, they are computationally expensive and cannot take advantage of parallel labeling systems such as Mechanical Turk. On the other hand, algorithms that allow the selection of multiple samples prior to retraining the classifier, may select samples that have significant information overlap or they involve solving a non-convex optimization. More importantly, the majority of active learning algorithms are developed for a certain classifier type such as SVM. In this paper, we develop an efficient active learning framework based on convex programming, which can select multiple samples at a time for annotation. Unlike the state of the art, our algorithm can be used in conjunction with any type of classifiers, including those of the family of the recently proposed Sparse Representation-based Classification (SRC). We use the two principles of classifier uncertainty and sample diversity in order to guide the optimization program towards selecting the most informative unlabeled samples, which have the least information overlap. Our method can incorporate the data distribution in the selection process by using the appropriate dissimilarity between pairs of samples. We show the effectiveness of our framework in person detection, scene categorization and face recognition on real-world datasets.","PeriodicalId":6351,"journal":{"name":"2013 IEEE International Conference on Computer Vision","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87810141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 116
期刊
2013 IEEE International Conference on Computer Vision
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1