Yushan Lv, Di Wu, Yuhang Li, Youdong Ding
Third International Seminar on Artificial Intelligence, Networking, and Information Technology. Published 2023-02-22. DOI: 10.1117/12.2667325
Multi-scale feature fusion network with spatial-temporal alignment for video denoising
Most existing video denoising methods, based on the PatchMatch algorithm or optical flow estimation, often produce blurring artifacts and denoise poorly on data with scale variation. To tackle these issues, we propose a multi-scale feature fusion network built on pyramid blocks at different scales and adaptive spatial-channel attention, which effectively extracts multi-scale feature information from noisy video data. Furthermore, we develop a spatial-temporal alignment module based on deformable convolution that aligns features implicitly and reduces blurring artifacts. Experiments on the public DAVIS and Set8 datasets show that the proposed method outperforms state-of-the-art algorithms in both visual and objective quality metrics.
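The alignment module described above relies on deformable convolution, whose key operation is sampling a feature map at learned, per-pixel fractional offsets rather than on a fixed grid. As a rough illustration of that sampling step (not the authors' implementation; the function name and NumPy formulation are our own), the sketch below bilinearly samples a single-channel feature map at offset locations, which is how a neighbor frame's features can be warped toward the reference frame:

```python
import numpy as np

def deformable_sample(feat, offsets):
    """Sample `feat` (H, W) at per-pixel fractional offsets.

    This is the sampling core of deformable-convolution alignment:
    each output pixel (y, x) reads feat at (y + dy, x + dx) using
    bilinear interpolation, with coordinates clamped to the borders.
    `offsets` has shape (H, W, 2), holding (dy, dx) per pixel; in a
    real network these offsets would be predicted by a small conv layer.
    """
    H, W = feat.shape
    ys, xs = np.meshgrid(np.arange(H), np.arange(W), indexing="ij")
    # Fractional sampling positions, clamped inside the feature map.
    py = np.clip(ys + offsets[..., 0], 0, H - 1)
    px = np.clip(xs + offsets[..., 1], 0, W - 1)
    # Integer corners surrounding each sampling position.
    y0 = np.floor(py).astype(int)
    x0 = np.floor(px).astype(int)
    y1 = np.minimum(y0 + 1, H - 1)
    x1 = np.minimum(x0 + 1, W - 1)
    wy = py - y0
    wx = px - x0
    # Bilinear blend of the four corner values.
    top = feat[y0, x0] * (1 - wx) + feat[y0, x1] * wx
    bot = feat[y1, x0] * (1 - wx) + feat[y1, x1] * wx
    return top * (1 - wy) + bot * wy
```

With all offsets zero the function is the identity; non-zero offsets warp the map, and because the interpolation is differentiable in the offsets, the offset-predicting layer can be trained end to end.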