High-Fidelity Face Swapping with Style Blending

Xinyu Yang, Zhijin Guo, Mowen Xue, Zijian Shi
{"title":"High-Fidelity Face Swapping with Style Blending","authors":"Xinyu Yang, Zhijin Guo, Mowen Xue, Zijian Shi","doi":"10.56541/kupa8487","DOIUrl":null,"url":null,"abstract":"Face swapping is gaining significant traction, boosted by the plethora of human face synthesis with the deep learning methods. Recent works based on Generative Adversarial Nets (GAN) for face swapping often suffer from blending inconsistency, distortions and artefacts, as well as instability in training. In this work, we propose a novel end-to-end framework for high-fidelity face swapping, leveraging the high photorealistic face generation techniques from StyleGAN. Firstly, we invert the facial images into the style latent space by purposing a novel facial attributes encoder that is capable of extracting face essentials from the face image and projecting them to the style code in the latent space. We show that such inverted style code encapsulates facial attributes that are indispensable for face swapping task. Secondly, a carefully designed style blending module (SBM) is introduced for transferring the identity from a source image to the target by the multi-head attention (MHA) mechanism. We propose relevant constraints for guiding the learning of the SBM, leading to the effective blending of the Face ID from the source face to the target image. Finally, the blended style code can be translated back to the image space via the style decoder, benefiting from the training stability and the high quality of the generative capability of the style-based decoder. 
Extensive experiments demonstrate the superior quality of the face synthesis results (illustrated in Figure 1) of our face-swapping system compared with other state-of-the-art methods.","PeriodicalId":180076,"journal":{"name":"24th Irish Machine Vision and Image Processing Conference","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"24th Irish Machine Vision and Image Processing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56541/kupa8487","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

Face swapping is gaining significant traction, boosted by the abundance of human face synthesis methods based on deep learning. Recent face-swapping works based on Generative Adversarial Networks (GANs) often suffer from blending inconsistency, distortions and artefacts, as well as instability in training. In this work, we propose a novel end-to-end framework for high-fidelity face swapping that leverages the highly photorealistic face generation techniques of StyleGAN. First, we invert facial images into the style latent space by proposing a novel facial attribute encoder that extracts face essentials from the face image and projects them to a style code in the latent space. We show that such an inverted style code encapsulates facial attributes that are indispensable for the face-swapping task. Second, a carefully designed style blending module (SBM) is introduced to transfer the identity from a source image to the target via a multi-head attention (MHA) mechanism. We propose relevant constraints to guide the learning of the SBM, leading to effective blending of the face identity from the source face into the target image. Finally, the blended style code is translated back to image space via the style decoder, benefiting from the training stability and high-quality generative capability of the style-based decoder. Extensive experiments demonstrate the superior quality of the face synthesis results (illustrated in Figure 1) of our face-swapping system compared with other state-of-the-art methods.
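The paper itself does not include code. As a rough illustration of the cross-attention blending the abstract describes, the sketch below shows how a style blending module might combine a target style code with a source style code via multi-head attention: target style tokens act as queries and source tokens as keys/values, with a residual connection keeping the target's attributes dominant. The 18×512 W+ shape is an assumption carried over from StyleGAN, and the random projection matrices stand in for learned weights; this is not the authors' implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def blend_styles(w_target, w_source, n_heads=4, seed=0):
    """Blend a source-identity style code into a target style code with
    multi-head cross-attention (target tokens attend to source tokens)."""
    n_tokens, dim = w_target.shape          # e.g. 18 tokens of dim 512 in W+
    assert dim % n_heads == 0
    d_head = dim // n_heads
    rng = np.random.default_rng(seed)
    # Random projections stand in for learned Q/K/V weights (illustrative only).
    Wq, Wk, Wv = (rng.standard_normal((dim, dim)) / np.sqrt(dim) for _ in range(3))
    q = (w_target @ Wq).reshape(n_tokens, n_heads, d_head)
    k = (w_source @ Wk).reshape(n_tokens, n_heads, d_head)
    v = (w_source @ Wv).reshape(n_tokens, n_heads, d_head)
    # Scaled dot-product attention per head: scores have shape (heads, q, k).
    scores = np.einsum('qhd,khd->hqk', q, k) / np.sqrt(d_head)
    attn = softmax(scores, axis=-1)
    blended = np.einsum('hqk,khd->qhd', attn, v).reshape(n_tokens, dim)
    # Residual connection: keep target attributes (pose, expression) while
    # injecting source identity information.
    return w_target + blended

# Usage: blend two synthetic W+ codes and decode the result with a
# style-based generator (decoder not shown here).
w_t = np.random.default_rng(1).standard_normal((18, 512))
w_s = np.random.default_rng(2).standard_normal((18, 512))
w_blended = blend_styles(w_t, w_s)
```

In the full system described by the abstract, `w_blended` would be fed to the style decoder to synthesize the swapped face, and the learned constraints (identity, attribute, and reconstruction losses) would shape what the attention layers transfer.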