Gaze-Driven Video Re-Editing

ACM Trans. Graph. Pub Date : 2015-03-02 DOI:10.1145/2699644

Eakta Jain, Yaser Sheikh, Ariel Shamir, J. Hodgins

引用次数: 44

Abstract

Given the current profusion of devices for viewing media, video content created at one aspect ratio is often viewed on displays with different aspect ratios. Many previous solutions address this problem by retargeting or resizing the video, but a more general solution would re-edit the video for the new display. Our method employs the three primary editing operations: pan, cut, and zoom. We let viewers implicitly reveal what is important in a video by tracking their gaze as they watch the video. We present an algorithm that optimizes the path of a cropping window based on the collected eyetracking data, finds places to cut, and computes the size of the cropping window. We present results on a variety of video clips, including close-up and distant shots, and stationary and moving cameras. We conduct two experiments to evaluate our results. First, we eyetrack viewers on the result videos generated by our algorithm, and second, we perform a subjective assessment of viewer preference. These experiments show that viewer gaze patterns are similar on our result videos and on the original video clips, and that viewers prefer our results to an optimized crop-and-warp algorithm.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

目光驱动的视频重新编辑

鉴于当前观看媒体的设备种类繁多，以一种宽高比创建的视频内容通常在不同宽高比的显示器上观看。许多以前的解决方案通过重新定位或调整视频大小来解决这个问题，但更通用的解决方案是为新显示重新编辑视频。我们的方法采用了三种主要的编辑操作:平移、剪切和缩放。我们让观众在观看视频时通过跟踪他们的目光来隐性地揭示视频中什么是重要的。我们提出了一种算法，该算法基于收集的眼动追踪数据优化裁剪窗口的路径，找到裁剪的位置，并计算裁剪窗口的大小。我们展示了各种视频剪辑的结果，包括特写和远距离拍摄，以及静止和移动的相机。我们进行了两个实验来评估我们的结果。首先，我们通过算法生成的结果视频来跟踪观众，其次，我们对观众的偏好进行主观评估。这些实验表明，观众的注视模式在我们的结果视频和原始视频剪辑上是相似的，并且观众更喜欢我们的结果，而不是优化的裁剪和扭曲算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ACM Trans. Graph.

自引率

0.00%

发文量

期刊最新文献

LuisaRender: A High-Performance Rendering Framework with Layered and Unified Interfaces on Stream Architectures BoolSurf: Boolean Operations on Surfaces SkinMixer: Blending 3D Animated Models PopStage: The Generation of Stage Cross-Editing Video Based on Spatio-Temporal Matching QuadStream: A Quad-Based Scene Streaming Architecture for Novel Viewpoint Reconstruction