Human action recognition and retrieval using sole depth information

Proceedings of the 20th ACM international conference on Multimedia. Pub Date: 2012-10-29. DOI: 10.1145/2393347.2396381

Yan-Ching Lin, Min-Chun Hu, Wen-Huang Cheng, Yung-Huan Hsieh, Hong-Ming Chen

Citations: 86

Abstract

Observing the widespread use of Kinect-like depth cameras, in this work we investigate the problem of using depth data alone for human action recognition and retrieval in videos. We propose simple depth descriptors that require no learning optimization, achieve promising performance comparable to that of leading methods based on color images and videos, and can be effectively applied in real-time applications. Because depth cameras rely on infrared sensing, the proposed approach is especially useful under poor lighting conditions, e.g. surveillance environments without sufficient lighting. We also introduce a large Depth-included Human Action video dataset, DHA, which contains 357 videos of performed human actions belonging to 17 categories. To the best of our knowledge, DHA is one of the largest depth-included video datasets of human actions.


