Seeing actions through scene context

2013 Visual Communications and Image Processing (VCIP) Pub Date : 2013-11-01 DOI:10.1109/VCIP.2013.6706382

Hongbo Zhang, Songzhi Su, Shaozi Li, Duansheng Chen, Bineng Zhong, R. Ji

引用次数: 0

Abstract

Recognizing human actions is not alone, as hinted by the scene herein. In this paper, we investigate the possibility to boost the action recognition performance by exploiting their scene context associated. To this end, we model the scene as a mid-level “hidden layer” to bridge action descriptors and action categories. This is achieved via a scene topic model, in which hybrid visual descriptors including spatiotemporal action features and scene descriptors are first extracted from the video sequence. Then, we learn a joint probability distribution between scene and action by a Naive Bayesian N-earest Neighbor algorithm, which is adopted to jointly infer the action categories online by combining off-the-shelf action recognition algorithms. We demonstrate our merits by comparing to state-of-the-arts in several action recognition benchmarks.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过场景背景来观察动作

正如这里的场景所暗示的那样，识别人类的行为并不是唯一的。在本文中，我们研究了通过利用它们的场景上下文关联来提高动作识别性能的可能性。为此，我们将场景建模为中级“隐藏层”，以连接动作描述符和动作类别。这是通过场景主题模型实现的，其中首先从视频序列中提取包括时空动作特征和场景描述符的混合视觉描述符。然后，我们通过朴素贝叶斯n近邻算法学习场景和动作之间的联合概率分布，并结合现有的动作识别算法在线联合推断动作类别。我们通过在几个动作识别基准中比较最先进的技术来展示我们的优点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2013 Visual Communications and Image Processing (VCIP)

自引率

0.00%

发文量

期刊最新文献

New motherwavelet for pattern detection in IR image Improved disparity vector derivation in 3D-HEVC Learning non-negative locality-constrained Linear Coding for human action recognition Wavelet based smoke detection method with RGB Contrast-image and shape constrain Joint image denoising using self-similarity based low-rank approximations