Domain Specific and Idiom Adaptive Video Summarization

Proceedings of the ACM Multimedia Asia Pub Date : 2019-12-15 DOI:10.1145/3338533.3366603

Yi Dong, Chang Liu, Zhiqi Shen, Zhanning Gao, Pan Wang, Changgong Zhang, Peiran Ren, Xuansong Xie, Han Yu, Qingming Huang

{"title":"Domain Specific and Idiom Adaptive Video Summarization","authors":"Yi Dong, Chang Liu, Zhiqi Shen, Zhanning Gao, Pan Wang, Changgong Zhang, Peiran Ren, Xuansong Xie, Han Yu, Qingming Huang","doi":"10.1145/3338533.3366603","DOIUrl":null,"url":null,"abstract":"As short videos become an increasingly popular form of storytelling, there is a growing demand for video summarization to convey information concisely with a subset of video frames. Some criteria such as interestingness and diversity are used by existing efforts to pick appropriate segments of content. However, there lacks a mechanism to infuse insights from cinematography and persuasion into this process. As a result, the results of the video summarization sometimes deviate from the original. In addition, the exploration of the vast design space to create customized video summaries is costly for video producer. To address these challenges, we propose a domain specific and idiom adaptive video summarization approach. Specifically, our approach first segments the input video and extracts high-level information from each segment. Such labels are used to represent a collection of idioms and summarization metrics as submodular components which users can combine to create personalized summary styles in a variety of ways. In order to identify the importance of the idioms and metrics in different domains, we leverage max margin learning. Experimental results have validated the effectiveness of our approach. We also plan to release a dataset containing over 600 videos with expert annotations which can benefit further research in this area.","PeriodicalId":273086,"journal":{"name":"Proceedings of the ACM Multimedia Asia","volume":"27 6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ACM Multimedia Asia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3338533.3366603","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

As short videos become an increasingly popular form of storytelling, there is a growing demand for video summarization to convey information concisely with a subset of video frames. Some criteria such as interestingness and diversity are used by existing efforts to pick appropriate segments of content. However, there lacks a mechanism to infuse insights from cinematography and persuasion into this process. As a result, the results of the video summarization sometimes deviate from the original. In addition, the exploration of the vast design space to create customized video summaries is costly for video producer. To address these challenges, we propose a domain specific and idiom adaptive video summarization approach. Specifically, our approach first segments the input video and extracts high-level information from each segment. Such labels are used to represent a collection of idioms and summarization metrics as submodular components which users can combine to create personalized summary styles in a variety of ways. In order to identify the importance of the idioms and metrics in different domains, we leverage max margin learning. Experimental results have validated the effectiveness of our approach. We also plan to release a dataset containing over 600 videos with expert annotations which can benefit further research in this area.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

特定领域和习语自适应视频摘要

随着短视频成为一种越来越流行的叙事形式，人们对视频摘要的需求越来越大，希望通过视频帧的子集来简洁地传达信息。一些标准，如趣味性和多样性，被现有的努力用来挑选适当的内容片段。然而，缺乏一种机制来将电影摄影和说服的见解注入到这个过程中。因此，视频总结的结果有时会偏离原来的内容。此外，探索广阔的设计空间来制作定制化的视频摘要，对于视频制作者来说是非常昂贵的。为了解决这些问题，我们提出了一种针对特定领域和习语的视频摘要方法。具体来说，我们的方法首先对输入视频进行分割，并从每个片段中提取高级信息。这样的标签用于将一组习惯用法和摘要度量表示为子模块组件，用户可以将这些组件组合起来，以各种方式创建个性化的摘要样式。为了确定习语和度量在不同领域的重要性，我们利用最大边际学习。实验结果验证了该方法的有效性。我们还计划发布一个包含超过600个带有专家注释的视频的数据集，这有助于该领域的进一步研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the ACM Multimedia Asia

自引率

0.00%

发文量

期刊最新文献

Session details: Vision in Multimedia Domain Specific and Idiom Adaptive Video Summarization Multi-Label Image Classification with Attention Mechanism and Graph Convolutional Networks Session details: Brave New Idea Self-balance Motion and Appearance Model for Multi-object Tracking in UAV