Extraction of Semantic Keyframes Based on Visual Attention and Affective Models

2007 International Conference on Computational Intelligence and Security (CIS 2007) Pub Date : 2007-12-15 DOI:10.1109/CIS.2007.9

Zhicheng Zhao, A. Cai

引用次数: 6

Abstract

The Extraction of video keyframe is convenient for browsing and retrieving of video content. However, since the "keyframe" is a subjective concept which involves in vision and psychology, it is difficult to be described by low-level features of video. In this paper, we propose a method of keyframe extraction based on visual attention and affective models. To be concrete, film elements such as character, lighting and camera motion, crucial to human attention, are fused into a visual attention model, and the film is segmented into scenes according to a short-time memory model. The "scene importance" is then computed by using the affective arousal which determines audience's excitability in the 2D emotion space. Finally, according to the attention model and the scene importance, scene keyframes are extracted. Experimental results indicate that keyframes extracted by our approach are coincident with human perception, and would be in favor of further semantic analysis.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于视觉注意和情感模型的语义关键帧提取

视频关键帧的提取方便了视频内容的浏览和检索。然而，由于“关键帧”是一个涉及视觉和心理学的主观概念，很难用视频的低级特征来描述。本文提出了一种基于视觉注意和情感模型的关键帧提取方法。具体来说，将人物、灯光、镜头运动等对人的注意力至关重要的电影元素融合到视觉注意力模型中，并根据短时记忆模型将电影分割成场景。然后通过情感唤醒来计算“场景重要性”，情感唤醒决定了观众在2D情感空间中的兴奋性。最后，根据注意力模型和场景重要性提取场景关键帧。实验结果表明，该方法提取的关键帧与人类感知一致，有利于进一步的语义分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2007 International Conference on Computational Intelligence and Security (CIS 2007)

自引率

0.00%

发文量