SCC: Semantic Context Cascade for Efficient Action Detection

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2017-07-21 DOI:10.1109/CVPR.2017.338

Fabian Caba Heilbron, Wayner Barrios, Victor Escorcia, Bernard Ghanem

Citations: 98

Abstract

Despite recent advances in large-scale video analysis, action detection remains one of the most challenging unsolved problems in computer vision. This difficulty is due in part to the large volume of data that must be analyzed to detect actions in videos. Existing approaches have mitigated the computational cost, but they still lack the rich high-level semantics that would help them localize actions quickly. In this paper, we introduce a Semantic Context Cascade (SCC) model that aims to detect actions in long video sequences. By embracing semantic priors associated with human activities, SCC produces high-quality class-specific action proposals and prunes unrelated activities in a cascade fashion. Experimental results on ActivityNet show that SCC achieves state-of-the-art performance for action detection while operating in real time.


