Pre-Attentional Filtering in Compressed Video

2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI:10.1109/ICME.2005.1521445

J. Sánchez, R. L. Felip, Xavier Binefa

引用次数: 1

Abstract

We propose the use of attentional cascades based on the DCT and motion information contained in an MPEG coded stream. An attentional cascade is a sequence of very efficient classifiers that reject a large number of negative candidate regions, while keeping all the positive candidates. Working directly on the compressed domain has two main advantages: computationally expensive features are already computed, and the stream is only partially decoded without the additional cost of full decompression, which will be reached by a very small number of the initial candidate regions. We have applied these concepts to skin color detection, as a pre-attentive filtering prior to face detection, and to text region detection with particular focus on license plates for vehicle identification. In both cases, a reduction of the number of candidate regions close to 95% is achieved, which turns into an enormous performance increase in video indexing processes

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

压缩视频中的预注意滤波

我们建议使用基于DCT和MPEG编码流中包含的运动信息的注意力级联。注意级联是一个非常有效的分类器序列，它拒绝大量的负面候选区域，同时保留所有积极的候选区域。直接在压缩域上工作有两个主要优点:计算成本高的特征已经计算出来，并且流只被部分解码，而没有额外的完全解压缩成本，这将由非常少量的初始候选区域达到。我们已经将这些概念应用于肤色检测，作为人脸检测之前的预先注意过滤，以及文本区域检测，特别关注车牌用于车辆识别。在这两种情况下，候选区域的数量都减少了近95%，这在视频索引过程中带来了巨大的性能提升

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2005 IEEE International Conference on Multimedia and Expo

自引率

0.00%

发文量

期刊最新文献

Lossless image compression with tree coding of magnitude levels Maximizing the profit for cache replacement in a transcoding proxy Pre-Attentional Filtering in Compressed Video Annotation and detection of blended emotions in real human-human dialogs recorded in a call center Fast inter frame encoding based on modes pre-decision in H.264