Feature Field Fusion for few-shot novel view synthesis

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Image and Vision Computing Pub Date : 2025-04-01 Epub Date: 2025-02-22 DOI:10.1016/j.imavis.2025.105465

Junting Li , Yanghong Zhou , Jintu Fan , Dahua Shou , Sa Xu , P.Y. Mok

引用次数: 0

Abstract

Reconstructing neural radiance fields from limited or sparse views has given very promising potential for this field of research. Previous methods usually constrain the reconstruction process with additional priors, e.g. semantic-based or patch-based regularization. Nevertheless, such regularization is given to the synthesis of unseen views, which may not effectively assist the field of learning, in particular when the training views are sparse. Instead, we propose a feature Field Fusion (FFusion) NeRF in this paper that can learn structure and more details from features extracted from pre-trained neural networks for the sparse training views, and use as extra guide for the training of the RGB field. With such extra feature guides, FFusion predicts more accurate color and density when synthesizing novel views. Experimental results have shown that FFusion can effectively improve the quality of the synthesized novel views with only limited or sparse inputs.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于特征场融合的少镜头新视角合成

从有限或稀疏的视图中重建神经辐射场为这一领域的研究提供了非常有前景的前景。以前的方法通常用附加的先验约束重建过程，例如基于语义或基于补丁的正则化。然而，这种正则化被给予了看不见的视图的综合，这可能不能有效地辅助学习领域，特别是当训练视图是稀疏的时候。相反，我们在本文中提出了一种特征场融合（FFusion） NeRF，它可以从稀疏训练视图的预训练神经网络中提取的特征中学习结构和更多细节，并将其用作RGB场训练的额外指导。有了这些额外的功能指南，在合成新视图时，FFusion可以预测更准确的颜色和密度。实验结果表明，在输入有限或稀疏的情况下，融合算法可以有效地提高合成新视图的质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Image and Vision Computing 工程技术-工程：电子与电气

CiteScore

8.50

自引率

8.50%

发文量

143

审稿时长

7.8 months

期刊介绍： Image and Vision Computing has as a primary aim the provision of an effective medium of interchange for the results of high quality theoretical and applied research fundamental to all aspects of image interpretation and computer vision. The journal publishes work that proposes new image interpretation and computer vision methodology or addresses the application of such methods to real world scenes. It seeks to strengthen a deeper understanding in the discipline by encouraging the quantitative comparison and performance evaluation of the proposed methodology. The coverage includes: image interpretation, scene modelling, object recognition and tracking, shape analysis, monitoring and surveillance, active vision and robotic systems, SLAM, biologically-inspired computer vision, motion analysis, stereo vision, document image understanding, character and handwritten text recognition, face and gesture recognition, biometrics, vision-based human-computer interaction, human activity and behavior understanding, data fusion from multiple sensor inputs, image databases.