Quasi Real-Time Summarization for Consumer Videos

2014 IEEE Conference on Computer Vision and Pattern Recognition Pub Date : 2014-06-23 DOI:10.1109/CVPR.2014.322

Bin Zhao, E. Xing

{"title":"Quasi Real-Time Summarization for Consumer Videos","authors":"Bin Zhao, E. Xing","doi":"10.1109/CVPR.2014.322","DOIUrl":null,"url":null,"abstract":"With the widespread availability of video cameras, we are facing an ever-growing enormous collection of unedited and unstructured video data. Due to lack of an automatic way to generate summaries from this large collection of consumer videos, they can be tedious and time consuming to index or search. In this work, we propose online video highlighting, a principled way of generating short video summarizing the most important and interesting contents of an unedited and unstructured video, costly both time-wise and financially for manual processing. Specifically, our method learns a dictionary from given video using group sparse coding, and updates atoms in the dictionary on-the-fly. A summary video is then generated by combining segments that cannot be sparsely reconstructed using the learned dictionary. The online fashion of our proposed method enables it to process arbitrarily long videos and start generating summaries before seeing the end of the video. Moreover, the processing time required by our proposed method is close to the original video length, achieving quasi real-time summarization speed. Theoretical analysis, together with experimental results on more than 12 hours of surveillance and YouTube videos are provided, demonstrating the effectiveness of online video highlighting.","PeriodicalId":319578,"journal":{"name":"2014 IEEE Conference on Computer Vision and Pattern Recognition","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"235","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Conference on Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPR.2014.322","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 235

Abstract

With the widespread availability of video cameras, we are facing an ever-growing enormous collection of unedited and unstructured video data. Due to lack of an automatic way to generate summaries from this large collection of consumer videos, they can be tedious and time consuming to index or search. In this work, we propose online video highlighting, a principled way of generating short video summarizing the most important and interesting contents of an unedited and unstructured video, costly both time-wise and financially for manual processing. Specifically, our method learns a dictionary from given video using group sparse coding, and updates atoms in the dictionary on-the-fly. A summary video is then generated by combining segments that cannot be sparsely reconstructed using the learned dictionary. The online fashion of our proposed method enables it to process arbitrarily long videos and start generating summaries before seeing the end of the video. Moreover, the processing time required by our proposed method is close to the original video length, achieving quasi real-time summarization speed. Theoretical analysis, together with experimental results on more than 12 hours of surveillance and YouTube videos are provided, demonstrating the effectiveness of online video highlighting.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

面向消费者视频的准实时摘要

随着摄像机的广泛使用，我们正面临着不断增长的大量未经编辑和非结构化的视频数据。由于缺乏从大量消费者视频中自动生成摘要的方法，因此索引或搜索它们可能是乏味且耗时的。在这项工作中，我们提出了在线视频突出显示，这是一种原则性的方法，可以生成短视频，总结未编辑和非结构化视频中最重要和最有趣的内容，人工处理在时间和经济上都很昂贵。具体来说，我们的方法使用组稀疏编码从给定的视频中学习字典，并动态更新字典中的原子。然后，将无法使用学习字典稀疏重建的片段组合在一起，生成摘要视频。我们提出的方法的在线方式使其能够处理任意长的视频，并在看到视频结束之前开始生成摘要。此外，该方法所需的处理时间接近原始视频长度，实现了准实时的摘要速度。理论分析，以及对超过12小时的监控和YouTube视频的实验结果，证明了在线视频突出的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 IEEE Conference on Computer Vision and Pattern Recognition

自引率

0.00%

发文量

期刊最新文献

Enriching Visual Knowledge Bases via Object Discovery and Segmentation Multiple Structured-Instance Learning for Semantic Segmentation with Uncertain Training Data Parsing Occluded People L0 Norm Based Dictionary Learning by Proximal Methods with Global Convergence Generalized Pupil-centric Imaging and Analytical Calibration for a Non-frontal Camera