Evaluating Multi-task Learning for Multi-view Head-Pose Classification in Interactive Environments

2014 22nd International Conference on Pattern Recognition Pub Date : 2014-08-24 DOI:10.1109/ICPR.2014.717

Yan Yan, Subramanian Ramanathan, E. Ricci, O. Lanz, N. Sebe

引用次数: 12

Abstract

Social attention behavior offers vital cues towards inferring one's personality traits from interactive settings such as round-table meetings and cocktail parties. Head orientation is typically employed as a proxy for determining the social attention direction when faces are captured at low-resolution. Recently, multi-task learning has been proposed to robustly compute head pose under perspective and scale-based facial appearance variations when multiple, distant and large field-of-view cameras are employed for visual analysis in smart-room applications. In this paper, we evaluate the effectiveness of an SVM-based MTL (SVM+MTL) framework with various facial descriptors (KL, HOG, LBP, etc.). The KL+HOG feature combination is found to produce the best classification performance, with SVM+MTL outperforming classical SVM irrespective of the feature used.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

评价交互环境下多视角头姿分类的多任务学习

社会注意行为为从圆桌会议和鸡尾酒会等互动环境中推断一个人的个性特征提供了重要线索。当以低分辨率捕捉人脸时，头部方向通常被用作确定社会注意方向的代理。最近，多任务学习被提出，用于在智能房间应用中使用多个远距离和大视场摄像机进行视觉分析时，在基于视角和尺度的面部外观变化下稳健地计算头部姿势。在本文中，我们评估了基于SVM的MTL (SVM+MTL)框架与各种面部描述符(KL, HOG, LBP等)的有效性。发现KL+HOG特征组合产生最佳分类性能，无论使用何种特征，SVM+MTL都优于经典SVM。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 22nd International Conference on Pattern Recognition

自引率

0.00%

发文量

期刊最新文献

Real-Time Tracking via Deformable Structure Regression Learning Traffic Camera Anomaly Detection Velocity-Based Multiple Change-Point Inference for Unsupervised Segmentation of Human Movement Behavior Volume Reconstruction for MRI Anomaly Detection through Spatio-temporal Context Modeling in Crowded Scenes