UPST-NeRF: Universal Photorealistic Style Transfer of Neural Radiance Fields for 3D Scene

Yaosen Chen, Qi Yuan, Zhiqiang Li, Yuegen Liu, Wei Wang, Chaoping Xie, Xuming Wen, Qien Yu
{"title":"UPST-NeRF: Universal Photorealistic Style Transfer of Neural Radiance Fields for 3D Scene","authors":"Yaosen Chen;Qi Yuan;Zhiqiang Li;Yuegen Liu;Wei Wang;Chaoping Xie;Xuming Wen;Qien Yu","doi":"10.1109/TVCG.2024.3378692","DOIUrl":null,"url":null,"abstract":"Photorealistic stylization of 3D scenes aims to generate photorealistic images from arbitrary novel views according to a given style image, while ensuring consistency when rendering video from different viewpoints. Some existing stylization methods using neural radiance fields can effectively predict stylized scenes by combining the features of the style image with multi-view images to train 3D scenes. However, these methods generate novel view images that contain undesirable artifacts. In addition, they cannot achieve universal photorealistic stylization for a 3D scene. Therefore, a stylization image needs to retrain a 3D scene representation network based on a neural radiation field. We propose a novel photorealistic 3D scene stylization transfer framework to address these issues. It can realize photorealistic 3D scene style transfer with a 2D style image for novel view video rendering. We first pre-trained a 2D photorealistic style transfer network, which can satisfy the photorealistic style transfer between any content image and style image. Then, we use voxel features to optimize a 3D scene and obtain the geometric representation of the scene. Finally, we jointly optimize a hypernetwork to realize the photorealistic style transfer of arbitrary style images. In the transfer stage, we use a pre-trained 2D photorealistic network to constrain the photorealistic style of different views and different style images in the 3D scene. The experimental results show that our method not only realizes the 3D photorealistic style transfer of arbitrary style images, but also outperforms the existing methods in terms of visual quality and consistency.","PeriodicalId":94035,"journal":{"name":"IEEE transactions on visualization and computer graphics","volume":"31 4","pages":"2045-2057"},"PeriodicalIF":6.5000,"publicationDate":"2024-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on visualization and computer graphics","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10475593/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

Photorealistic stylization of 3D scenes aims to generate photorealistic images from arbitrary novel views according to a given style image, while ensuring consistency when rendering video from different viewpoints. Some existing stylization methods using neural radiance fields can effectively predict stylized scenes by combining the features of the style image with multi-view images to train 3D scenes. However, these methods generate novel-view images that contain undesirable artifacts. In addition, they cannot achieve universal photorealistic stylization for a 3D scene: each style image requires retraining the neural-radiance-field-based 3D scene representation network. We propose a novel photorealistic 3D scene style transfer framework to address these issues. It realizes photorealistic 3D scene style transfer from a 2D style image for novel-view video rendering. We first pre-train a 2D photorealistic style transfer network that can perform photorealistic style transfer between any content image and style image. Then, we use voxel features to optimize a 3D scene and obtain its geometric representation. Finally, we jointly optimize a hypernetwork to realize photorealistic style transfer for arbitrary style images. In the transfer stage, the pre-trained 2D photorealistic network constrains the photorealistic style of different views and different style images in the 3D scene. Experimental results show that our method not only realizes photorealistic 3D style transfer for arbitrary style images, but also outperforms existing methods in terms of visual quality and consistency.
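To make the hypernetwork idea in the abstract concrete, below is a minimal sketch (not the authors' released code) of how a style-conditioned hypernetwork could emit the weights of a small color head applied to sampled voxel features; in the full pipeline, a frozen pre-trained 2D photorealistic style-transfer network would supply the style loss on rendered views. All module names, dimensions, and the two-layer head structure are illustrative assumptions.

```python
import torch
import torch.nn as nn

FEAT_DIM = 12    # per-sample voxel feature size (assumed)
HIDDEN = 64      # hidden width of the style-conditioned color head (assumed)
STYLE_DIM = 128  # style embedding size (assumed)


class ColorHypernetwork(nn.Module):
    """Maps a style embedding to the weights of a tiny color MLP."""

    def __init__(self):
        super().__init__()
        # Parameter counts for a 2-layer MLP: feat -> hidden -> rgb.
        self.n_w1, self.n_b1 = FEAT_DIM * HIDDEN, HIDDEN
        self.n_w2, self.n_b2 = HIDDEN * 3, 3
        total = self.n_w1 + self.n_b1 + self.n_w2 + self.n_b2
        self.net = nn.Sequential(
            nn.Linear(STYLE_DIM, 256), nn.ReLU(), nn.Linear(256, total))

    def forward(self, style_emb, voxel_feats):
        # style_emb: (STYLE_DIM,); voxel_feats: (N, FEAT_DIM) features
        # sampled along camera rays from the optimized voxel grid.
        p = self.net(style_emb)
        w1, b1, w2, b2 = torch.split(
            p, [self.n_w1, self.n_b1, self.n_w2, self.n_b2])
        h = torch.relu(voxel_feats @ w1.view(FEAT_DIM, HIDDEN) + b1)
        return torch.sigmoid(h @ w2.view(HIDDEN, 3) + b2)  # per-sample RGB


# Toy usage: one style embedding stylizes the colors of 1024 ray samples.
hyper = ColorHypernetwork()
rgb = hyper(torch.randn(STYLE_DIM), torch.randn(1024, FEAT_DIM))
print(rgb.shape)  # torch.Size([1024, 3])
```

Because the scene geometry (density) stays fixed and only the predicted color weights change with the style embedding, a single trained hypernetwork can, in principle, stylize the scene for arbitrary style images without per-style retraining, which is the "universal" property the abstract claims.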