Edge-Guided Video Super-Resolution Network
Haohsuan Tseng, Chih-Hung Kuo, Yiting Chen, Sinhong Lee
2022 IEEE International Conference on Recent Advances in Systems Science and Engineering (RASSE)
Published: 2022-11-07 · DOI: 10.1109/RASSE54974.2022.9989570
Citations: 0
Abstract
In this paper, we propose an edge-guided video super-resolution (EGVSR) network that utilizes the edge information of the image to effectively recover high-frequency details for high-resolution frames. The reconstruction process consists of two stages. In the first stage, the Coarse Frame Reconstruction Network (CFRN) generates coarse SR frames. In addition, we propose the Edge-Prediction Network (EPN) to capture the edge details that help supplement the missing high-frequency information. Unlike some prior SR works that tend to increase network depth or use attention mechanisms that reconstruct large objects well but neglect small ones, we propose the Attention Fusion Residual Block (AFRB) to process objects of different sizes. The AFRB, an enhanced version of the conventional residual block, performs fusion through a multi-scale channel attention mechanism and serves as the basic operation unit in both the CFRN and the EPN. In the second stage, we propose the Frame Refinement Network (FRN), which contains multiple convolution layers; through the FRN, we fuse and refine the coarse SR frames and the edge information learned in the first stage. Compared with state-of-the-art methods, our SR model improves PSNR by approximately 0.5% and SSIM by approximately 1.8% on the benchmark VID4 dataset while reducing the number of parameters by 54%.
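The abstract does not detail the AFRB's internals, but the idea of a residual block gated by multi-scale channel attention can be illustrated in a minimal NumPy sketch. Everything below is an assumption for illustration only: the function names, the choice of two pooling scales (global plus coarse 2×2 local), and the use of plain weight matrices in place of the block's convolutions are not the authors' implementation.

```python
import numpy as np

def _sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def multi_scale_channel_attention(feat, w_local, w_global):
    """Toy multi-scale channel attention (hypothetical stand-in for the AFRB core).

    feat: (C, H, W) feature map.
    w_local, w_global: (C, C) matrices standing in for the 1x1 convolutions
    of a local-context branch and a global-context branch.
    """
    C, H, W = feat.shape
    # Global branch: squeeze all spatial positions to one descriptor per channel.
    g = feat.mean(axis=(1, 2))                                   # (C,)
    # Local branch: average over 2x2 blocks first (coarse spatial context),
    # then reduce to a per-channel descriptor.
    l = feat[:, : H // 2 * 2, : W // 2 * 2]
    l = l.reshape(C, H // 2, 2, W // 2, 2).mean(axis=(2, 4))      # (C, H//2, W//2)
    l = l.mean(axis=(1, 2))                                      # (C,)
    # Fuse the two scales and gate each channel with a sigmoid weight.
    attn = _sigmoid(w_local @ l + w_global @ g)                   # (C,)
    return feat * attn[:, None, None]

def afrb(feat, w_local, w_global):
    """Residual-block variant: identity path plus attention-modulated features."""
    return feat + multi_scale_channel_attention(feat, w_local, w_global)
```

With zero weights the gate is sigmoid(0) = 0.5, so the block reduces to `1.5 * feat`, which makes the residual structure easy to check; in a trained network the learned branch weights would let the gate emphasize channels carrying features at either scale.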