Learning Non-linear Reconstruction Models for Image Set Classification

2014 IEEE Conference on Computer Vision and Pattern Recognition Pub Date : 2014-06-23 DOI:10.1109/CVPR.2014.246

Munawar Hayat, Bennamoun, S. An

引用次数: 72

Abstract

We propose a deep learning framework for image set classification with application to face recognition. An Adaptive Deep Network Template (ADNT) is defined whose parameters are initialized by performing unsupervised pre-training in a layer-wise fashion using Gaussian Restricted Boltzmann Machines (GRBMs). The pre-initialized ADNT is then separately trained for images of each class and class-specific models are learnt. Based on the minimum reconstruction error from the learnt class-specific models, a majority voting strategy is used for classification. The proposed framework is extensively evaluated for the task of image set classification based face recognition on Honda/UCSD, CMU Mobo, YouTube Celebrities and a Kinect dataset. Our experimental results and comparisons with existing state-of-the-art methods show that the proposed method consistently achieves the best performance on all these datasets.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

学习用于图像集分类的非线性重建模型

我们提出了一种图像集分类的深度学习框架，并将其应用于人脸识别。定义了一个自适应深度网络模板(ADNT)，其参数通过使用高斯受限玻尔兹曼机(grbm)以分层方式执行无监督预训练来初始化。然后针对每个类的图像分别训练预初始化的ADNT，并学习特定于类的模型。基于学习到的类特定模型的最小重构误差，采用多数投票策略进行分类。在本田/UCSD、CMU Mobo、YouTube Celebrities和Kinect数据集上，对基于图像集分类的人脸识别任务进行了广泛的评估。我们的实验结果和与现有最先进的方法的比较表明，所提出的方法在所有这些数据集上都能达到最佳性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 IEEE Conference on Computer Vision and Pattern Recognition

自引率

0.00%

发文量

期刊最新文献

Enriching Visual Knowledge Bases via Object Discovery and Segmentation Multiple Structured-Instance Learning for Semantic Segmentation with Uncertain Training Data Parsing Occluded People L0 Norm Based Dictionary Learning by Proximal Methods with Global Convergence Generalized Pupil-centric Imaging and Analytical Calibration for a Non-frontal Camera