Video frame interpolation based on depthwise over-parameterized recurrent residual convolution

IF 1 4区计算机科学 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC Journal of Electronic Imaging Pub Date : 2024-08-01 DOI:10.1117/1.jei.33.4.043036

Xiaohui Yang, Weijing Liu, Shaowen Wang

引用次数: 0

Abstract

To effectively address the challenges of large motions, complex backgrounds and large occlusions in videos, we introduce an end-to-end method for video frame interpolation based on recurrent residual convolution and depthwise over-parameterized convolution in this paper. Specifically, we devise a U-Net architecture utilizing recurrent residual convolution to enhance the quality of interpolated frame. First, the recurrent residual U-Net feature extractor is employed to extract features from input frames, yielding the kernel for each pixel. Subsequently, an adaptive collaboration of flows is utilized to warp the input frames, which are then fed into the frame synthesis network to generate initial interpolated frames. Finally, the proposed network incorporates depthwise over-parameterized convolution to further enhance the quality of interpolated frame. Experimental results on various datasets demonstrate the superiority of our method over state-of-the-art techniques in both objective and subjective evaluations.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度过参数化递归残差卷积的视频帧插值技术

为了有效应对视频中的大运动、复杂背景和大遮挡等挑战，我们在本文中介绍了一种基于递归残差卷积和深度过参数化卷积的端到端视频帧插值方法。具体来说，我们设计了一种利用递归残差卷积的 U-Net 架构，以提高插值帧的质量。首先，利用递归残差 U-Net 特征提取器从输入帧中提取特征，为每个像素生成内核。随后，利用自适应协作流对输入帧进行翘曲，然后将其输入帧合成网络，生成初始插值帧。最后，建议的网络结合了深度过参数化卷积，以进一步提高插值帧的质量。在各种数据集上的实验结果表明，无论是客观评价还是主观评价，我们的方法都优于最先进的技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Electronic Imaging 工程技术-成像科学与照相技术

CiteScore

1.70

自引率

27.30%

发文量

341

审稿时长

4.0 months

期刊介绍： The Journal of Electronic Imaging publishes peer-reviewed papers in all technology areas that make up the field of electronic imaging and are normally considered in the design, engineering, and applications of electronic imaging systems.