图像压缩感知的多尺度通道蒸馏网络

IF 3.4 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Access Pub Date : 2025-01-09 DOI:10.1109/ACCESS.2025.3527756
Tianyu Zhang;Kuntao Ye;Yue Zhang;Rui Lu
{"title":"图像压缩感知的多尺度通道蒸馏网络","authors":"Tianyu Zhang;Kuntao Ye;Yue Zhang;Rui Lu","doi":"10.1109/ACCESS.2025.3527756","DOIUrl":null,"url":null,"abstract":"Recently, convolutional neural networks (CNNs) have demonstrated striking success in computer vision tasks. Methods based on CNNs for image compressive sensing (CS) have also gained prominence. However, existing methods tend to increase the depth of the network in feature space for better reconstruction quality, neglecting the hierarchical representation of intermediate features in pixel space. In order to coordinate the feature space and pixel space to complete the deep reconstruction of images, and further improve the reconstruction performance of current CS methods, we propose a multi-scale channel distillation network (MSCDN). This network first obtains images of multiple scales using a scale-space image decomposition method at the sampling stage, followed by sampling these decomposed images through a convolutional operation. In this way, multi-scale information in the compressed domain is aggregated. During the reconstruction phase, a low-frequency information recovery network generates a preliminary image, whereas a high-frequency feature aggregation network refines the image further. Specifically, we design a dual-branch deep reconstruction architecture with channel distillation residual block (CDRB) as the core component. One branch extracts features gradually by cascading multiple CDRB modules, thereby supplementing the initial reconstructed image with a large amount of high-frequency content in feature space. The other branch takes the initial reconstructed image as input and sequentially fuses the intermediate feature outputs by CDRBs to increase the local details of the image in pixel space. Combining outputs from both branches, we achieve an optimal reconstructed image. Extensive experimental results on four benchmark datasets demonstrate that MSCDN surpasses state-of-the-art CS methods not only in reconstruction accuracy but also in perceptual visual quality.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"13 ","pages":"9524-9537"},"PeriodicalIF":3.4000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10835084","citationCount":"0","resultStr":"{\"title\":\"Multi-Scale Channel Distillation Network for Image Compressive Sensing\",\"authors\":\"Tianyu Zhang;Kuntao Ye;Yue Zhang;Rui Lu\",\"doi\":\"10.1109/ACCESS.2025.3527756\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, convolutional neural networks (CNNs) have demonstrated striking success in computer vision tasks. Methods based on CNNs for image compressive sensing (CS) have also gained prominence. However, existing methods tend to increase the depth of the network in feature space for better reconstruction quality, neglecting the hierarchical representation of intermediate features in pixel space. In order to coordinate the feature space and pixel space to complete the deep reconstruction of images, and further improve the reconstruction performance of current CS methods, we propose a multi-scale channel distillation network (MSCDN). This network first obtains images of multiple scales using a scale-space image decomposition method at the sampling stage, followed by sampling these decomposed images through a convolutional operation. In this way, multi-scale information in the compressed domain is aggregated. During the reconstruction phase, a low-frequency information recovery network generates a preliminary image, whereas a high-frequency feature aggregation network refines the image further. Specifically, we design a dual-branch deep reconstruction architecture with channel distillation residual block (CDRB) as the core component. One branch extracts features gradually by cascading multiple CDRB modules, thereby supplementing the initial reconstructed image with a large amount of high-frequency content in feature space. The other branch takes the initial reconstructed image as input and sequentially fuses the intermediate feature outputs by CDRBs to increase the local details of the image in pixel space. Combining outputs from both branches, we achieve an optimal reconstructed image. Extensive experimental results on four benchmark datasets demonstrate that MSCDN surpasses state-of-the-art CS methods not only in reconstruction accuracy but also in perceptual visual quality.\",\"PeriodicalId\":13079,\"journal\":{\"name\":\"IEEE Access\",\"volume\":\"13 \",\"pages\":\"9524-9537\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-01-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10835084\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Access\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10835084/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Access","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10835084/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

最近,卷积神经网络(cnn)在计算机视觉任务中取得了惊人的成功。基于cnn的图像压缩感知(CS)方法也得到了重视。然而,现有方法倾向于增加特征空间中的网络深度以获得更好的重构质量,而忽略了中间特征在像素空间中的分层表示。为了协调特征空间和像素空间完成图像的深度重建,进一步提高现有CS方法的重建性能,我们提出了一种多尺度通道蒸馏网络(MSCDN)。该网络首先在采样阶段使用尺度空间图像分解方法获得多尺度图像,然后通过卷积运算对分解后的图像进行采样。这样,压缩域中的多尺度信息就被聚合了。在重建阶段,低频信息恢复网络生成初步图像,高频特征聚合网络进一步细化图像。具体来说,我们设计了一个以通道蒸馏残余块(CDRB)为核心组件的双分支深度重构架构。一个分支通过多个CDRB模块级联逐渐提取特征,从而在初始重构图像中补充大量特征空间中的高频内容。另一个分支以初始重构图像为输入,依次融合cdrb输出的中间特征,增加图像在像素空间的局部细节。结合两个分支的输出,我们得到了一个最优的重建图像。在四个基准数据集上的大量实验结果表明,MSCDN不仅在重建精度上优于最先进的CS方法,而且在感知视觉质量上也优于最先进的CS方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Multi-Scale Channel Distillation Network for Image Compressive Sensing
Recently, convolutional neural networks (CNNs) have demonstrated striking success in computer vision tasks. Methods based on CNNs for image compressive sensing (CS) have also gained prominence. However, existing methods tend to increase the depth of the network in feature space for better reconstruction quality, neglecting the hierarchical representation of intermediate features in pixel space. In order to coordinate the feature space and pixel space to complete the deep reconstruction of images, and further improve the reconstruction performance of current CS methods, we propose a multi-scale channel distillation network (MSCDN). This network first obtains images of multiple scales using a scale-space image decomposition method at the sampling stage, followed by sampling these decomposed images through a convolutional operation. In this way, multi-scale information in the compressed domain is aggregated. During the reconstruction phase, a low-frequency information recovery network generates a preliminary image, whereas a high-frequency feature aggregation network refines the image further. Specifically, we design a dual-branch deep reconstruction architecture with channel distillation residual block (CDRB) as the core component. One branch extracts features gradually by cascading multiple CDRB modules, thereby supplementing the initial reconstructed image with a large amount of high-frequency content in feature space. The other branch takes the initial reconstructed image as input and sequentially fuses the intermediate feature outputs by CDRBs to increase the local details of the image in pixel space. Combining outputs from both branches, we achieve an optimal reconstructed image. Extensive experimental results on four benchmark datasets demonstrate that MSCDN surpasses state-of-the-art CS methods not only in reconstruction accuracy but also in perceptual visual quality.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Access
IEEE Access COMPUTER SCIENCE, INFORMATION SYSTEMSENGIN-ENGINEERING, ELECTRICAL & ELECTRONIC
CiteScore
9.80
自引率
7.70%
发文量
6673
审稿时长
6 weeks
期刊介绍: IEEE Access® is a multidisciplinary, open access (OA), applications-oriented, all-electronic archival journal that continuously presents the results of original research or development across all of IEEE''s fields of interest. IEEE Access will publish articles that are of high interest to readers, original, technically correct, and clearly presented. Supported by author publication charges (APC), its hallmarks are a rapid peer review and publication process with open access to all readers. Unlike IEEE''s traditional Transactions or Journals, reviews are "binary", in that reviewers will either Accept or Reject an article in the form it is submitted in order to achieve rapid turnaround. Especially encouraged are submissions on: Multidisciplinary topics, or applications-oriented articles and negative results that do not fit within the scope of IEEE''s traditional journals. Practical articles discussing new experiments or measurement techniques, interesting solutions to engineering. Development of new or improved fabrication or manufacturing techniques. Reviews or survey articles of new or evolving fields oriented to assist others in understanding the new area.
期刊最新文献
Editorial Board IEEE Access™ Editorial Board Corrections to “The Recent Technologies to Curb the Second-Wave of COVID-19 Pandemic” Corrections to “Decentralized Asynchronous Formation Planning of Multirotor Aerial Vehicles in Dynamic Environments Using Flexible Formation Graphs and Tight Trajectory Hulls” Study on the Motion Patterns of Nested Test Cabin and Its Shock Response Spectrum Analysis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1