WaveMixSR-V2: Enhancing Super-resolution with Higher Efficiency

arXiv - EE - Image and Video Processing Pub Date : 2024-09-16 DOI:arxiv-2409.10582

Pranav Jeevan, Neeraj Nixon, Amit Sethi

引用次数: 0

Abstract

Recent advancements in single image super-resolution have been predominantly driven by token mixers and transformer architectures. WaveMixSR utilized the WaveMix architecture, employing a two-dimensional discrete wavelet transform for spatial token mixing, achieving superior performance in super-resolution tasks with remarkable resource efficiency. In this work, we present an enhanced version of the WaveMixSR architecture by (1) replacing the traditional transpose convolution layer with a pixel shuffle operation and (2) implementing a multistage design for higher resolution tasks ($4\times$). Our experiments demonstrate that our enhanced model -- WaveMixSR-V2 -- outperforms other architectures in multiple super-resolution tasks, achieving state-of-the-art for the BSD100 dataset, while also consuming fewer resources, exhibits higher parameter efficiency, lower latency and higher throughput. Our code is available at https://github.com/pranavphoenix/WaveMixSR.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

WaveMixSR-V2：以更高的效率增强超分辨率

单图像超分辨率的最新进展主要是由令牌混合器和变换器架构推动的。WaveMixSR 利用 WaveMix 架构，采用二维离散小波变换进行空间令牌混合，在超分辨率任务中实现了卓越的性能和显著的资源效率。在这项工作中，我们提出了 WaveMixSR 架构的增强版本，具体做法是：（1）用像素洗牌操作取代传统的跨距卷积层；（2）针对更高分辨率任务（4 美元/次）实施多级设计。我们的实验证明，我们的增强型模型--WaveMixSR-V2--在多个超分辨率任务中的表现优于其他架构，在 BSD100 数据集上达到了最先进水平，同时还消耗更少的资源，表现出更高的参数效率、更低的延迟和更高的吞吐量。我们的代码见 https://github.com/pranavphoenix/WaveMixSR。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

arXiv - EE - Image and Video Processing

自引率

0.00%

发文量