{"title":"Spatio-Temporal Adaptive Weighted Fusion Network for Compressed Video Quality Enhancement","authors":"Tingrong Zhang;Xiaohai He;Qizhi Teng;Junxiong Cheng;Chao Ren","doi":"10.1109/TCSII.2024.3444052","DOIUrl":null,"url":null,"abstract":"In recent years, many deep learning-based methods for improving the quality of compressed video have emerged, some of which utilize multiple reference frames to enhance the target frame. However, most of these methods directly aggregate the temporal information of the reference frames, ignoring the spatial information within the target frame. In this brief, we propose a spatio-temporal information adaptive weighted fusion network (STAWFN) to enhance compressed video quality by dynamically integrating spatial information and temporal information. Specifically, we utilize well-designed temporal feature extractor (TFE) and spatial feature extractor (SFE) to extract temporal and spatial information, respectively. And then an adaptive weighted feature fusion module is employed to effectively fuse temporal information and spatial information. In addition, we construct multi-channel enhanced residual block to refine the fused features for better enhancement capability. Comprehensive test results on HEVC-compressed videos show that the proposed method can significantly enhance the objective and subjective quality of compressed videos and reach state-of-the-art performance.","PeriodicalId":13101,"journal":{"name":"IEEE Transactions on Circuits and Systems II: Express Briefs","volume":"71 12","pages":"5064-5068"},"PeriodicalIF":4.9000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Circuits and Systems II: Express Briefs","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10637290/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, many deep learning-based methods for improving the quality of compressed video have emerged, some of which utilize multiple reference frames to enhance the target frame. However, most of these methods directly aggregate the temporal information of the reference frames, ignoring the spatial information within the target frame. In this brief, we propose a spatio-temporal information adaptive weighted fusion network (STAWFN) to enhance compressed video quality by dynamically integrating spatial information and temporal information. Specifically, we utilize well-designed temporal feature extractor (TFE) and spatial feature extractor (SFE) to extract temporal and spatial information, respectively. And then an adaptive weighted feature fusion module is employed to effectively fuse temporal information and spatial information. In addition, we construct multi-channel enhanced residual block to refine the fused features for better enhancement capability. Comprehensive test results on HEVC-compressed videos show that the proposed method can significantly enhance the objective and subjective quality of compressed videos and reach state-of-the-art performance.
期刊介绍:
TCAS II publishes brief papers in the field specified by the theory, analysis, design, and practical implementations of circuits, and the application of circuit techniques to systems and to signal processing. Included is the whole spectrum from basic scientific theory to industrial applications. The field of interest covered includes:
Circuits: Analog, Digital and Mixed Signal Circuits and Systems
Nonlinear Circuits and Systems, Integrated Sensors, MEMS and Systems on Chip, Nanoscale Circuits and Systems, Optoelectronic
Circuits and Systems, Power Electronics and Systems
Software for Analog-and-Logic Circuits and Systems
Control aspects of Circuits and Systems.