Multi-task Recurrent Neural Network for Immediacy Prediction

2015 IEEE International Conference on Computer Vision (ICCV) Pub Date : 2015-12-07 DOI:10.1109/ICCV.2015.383

Xiao Chu, Wanli Ouyang, Wei Yang, Xiaogang Wang

引用次数: 47

Abstract

In this paper, we propose to predict immediacy for interacting persons from still images. A complete immediacy set includes interactions, relative distance, body leaning direction and standing orientation. These measures are found to be related to the attitude, social relationship, social interaction, action, nationality, and religion of the communicators. A large-scale dataset with 10,000 images is constructed, in which all the immediacy measures and the human poses are annotated. We propose a rich set of immediacy representations that help to predict immediacy from imperfect 1-person and 2-person pose estimation results. A multi-task deep recurrent neural network is constructed to take the proposed rich immediacy representation as input and learn the complex relationship among immediacy predictions multiple steps of refinement. The effectiveness of the proposed approach is proved through extensive experiments on the large scale dataset.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

即时预测的多任务递归神经网络

在本文中，我们提出从静止图像中预测互动人员的即时性。一个完整的即时性集合包括互动、相对距离、身体倾斜方向和站立方向。这些指标与传播者的态度、社会关系、社会互动、行为、国籍、宗教信仰等有关。构建了包含1万幅图像的大规模数据集，并对所有的即时性度量和人体姿势进行了注释。我们提出了一套丰富的即时性表示，有助于从不完美的1人或2人姿势估计结果中预测即时性。构建了一个多任务深度递归神经网络，将提出的丰富的直接性表示作为输入，并通过多个细化步骤学习直接性预测之间的复杂关系。通过大规模数据集的大量实验证明了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2015 IEEE International Conference on Computer Vision (ICCV)

自引率

0.00%

发文量

期刊最新文献

Listening with Your Eyes: Towards a Practical Visual Speech Recognition System Using Deep Boltzmann Machines Self-Calibration of Optical Lenses Single Image Pop-Up from Discriminatively Learned Parts Multi-task Recurrent Neural Network for Immediacy Prediction Low-Rank Tensor Approximation with Laplacian Scale Mixture Modeling for Multiframe Image Denoising