FaceDirector: Continuous Control of Facial Performance in Video

2015 IEEE International Conference on Computer Vision (ICCV) Pub Date : 2015-12-07 DOI:10.1109/ICCV.2015.453

Charles Malleson, J. Bazin, Oliver Wang, D. Bradley, T. Beeler, A. Hilton, A. Sorkine-Hornung

Pages: 3979-3987

Citations: 18

Abstract

We present a method to continuously blend between multiple facial performances of an actor, which can contain different facial expressions or emotional states. As an example, given sad and angry video takes of a scene, our method empowers the movie director to specify arbitrary weighted combinations and smooth transitions between the two takes in post-production. Our contributions include (1) a robust nonlinear audio-visual synchronization technique that exploits complementary properties of audio and visual cues to automatically determine robust, dense spatiotemporal correspondences between takes, and (2) a seamless facial blending approach that provides the director full control to interpolate timing, facial expression, and local appearance, in order to generate novel performances after filming. In contrast to most previous works, our approach operates entirely in image space, avoiding the need of 3D facial reconstruction. We demonstrate that our method can synthesize visually believable performances with applications in emotion transition, performance correction, and timing control.
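
The abstract does not name the algorithm behind its nonlinear synchronization step. As a rough illustration of the general idea only, the sketch below swaps in classic dynamic time warping (DTW), a standard technique for nonlinear temporal alignment that is not confirmed to be the authors' method. It aligns two 1-D feature tracks (stand-ins for per-frame audio-visual features of two takes), then interpolates both timing and feature value along the alignment with a director-chosen weight `w`. The names `dtw_path` and `blend_takes` are hypothetical, and the scalar features are an assumption; the method described in the abstract operates on video in image space.

```python
from typing import List, Sequence, Tuple


def dtw_path(a: Sequence[float], b: Sequence[float]) -> List[Tuple[int, int]]:
    """Return a monotonic frame-to-frame alignment between two 1-D tracks.

    Illustrative stand-in (plain DTW), not the paper's synchronization method.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j - 1], cost[i - 1][j], cost[i][j - 1])

    # Backtrack from the last frame pair to the first.
    path: List[Tuple[int, int]] = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        candidates = [
            (cost[i - 1][j - 1], i - 1, j - 1),  # advance both takes
            (cost[i - 1][j], i - 1, j),          # advance take a only
            (cost[i][j - 1], i, j - 1),          # advance take b only
        ]
        _, i, j = min(candidates)
    path.reverse()
    return path


def blend_takes(
    a: Sequence[float], b: Sequence[float], w: float
) -> List[Tuple[float, float]]:
    """Interpolate timing and feature value along the alignment.

    w = 0 reproduces take a, w = 1 reproduces take b (hypothetical helper).
    """
    return [
        ((1 - w) * i + w * j, (1 - w) * a[i] + w * b[j])
        for i, j in dtw_path(a, b)
    ]


if __name__ == "__main__":
    take_a = [0.0, 1.0, 2.0, 1.0, 0.0]
    take_b = [0.0, 0.0, 1.0, 2.0, 1.0, 0.0]  # same shape, delayed by one frame
    for t, v in blend_takes(take_a, take_b, 0.5):
        print(f"time={t:.1f} value={v:.1f}")
```

A time-varying `w` would give a gradual transition from one take to the other rather than a fixed mix.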
