{"title":"图像压缩感知的多尺度通道蒸馏网络","authors":"Tianyu Zhang;Kuntao Ye;Yue Zhang;Rui Lu","doi":"10.1109/ACCESS.2025.3527756","DOIUrl":null,"url":null,"abstract":"Recently, convolutional neural networks (CNNs) have demonstrated striking success in computer vision tasks. Methods based on CNNs for image compressive sensing (CS) have also gained prominence. However, existing methods tend to increase the depth of the network in feature space for better reconstruction quality, neglecting the hierarchical representation of intermediate features in pixel space. In order to coordinate the feature space and pixel space to complete the deep reconstruction of images, and further improve the reconstruction performance of current CS methods, we propose a multi-scale channel distillation network (MSCDN). This network first obtains images of multiple scales using a scale-space image decomposition method at the sampling stage, followed by sampling these decomposed images through a convolutional operation. In this way, multi-scale information in the compressed domain is aggregated. During the reconstruction phase, a low-frequency information recovery network generates a preliminary image, whereas a high-frequency feature aggregation network refines the image further. Specifically, we design a dual-branch deep reconstruction architecture with channel distillation residual block (CDRB) as the core component. One branch extracts features gradually by cascading multiple CDRB modules, thereby supplementing the initial reconstructed image with a large amount of high-frequency content in feature space. The other branch takes the initial reconstructed image as input and sequentially fuses the intermediate feature outputs by CDRBs to increase the local details of the image in pixel space. Combining outputs from both branches, we achieve an optimal reconstructed image. Extensive experimental results on four benchmark datasets demonstrate that MSCDN surpasses state-of-the-art CS methods not only in reconstruction accuracy but also in perceptual visual quality.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"13 ","pages":"9524-9537"},"PeriodicalIF":3.4000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10835084","citationCount":"0","resultStr":"{\"title\":\"Multi-Scale Channel Distillation Network for Image Compressive Sensing\",\"authors\":\"Tianyu Zhang;Kuntao Ye;Yue Zhang;Rui Lu\",\"doi\":\"10.1109/ACCESS.2025.3527756\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, convolutional neural networks (CNNs) have demonstrated striking success in computer vision tasks. Methods based on CNNs for image compressive sensing (CS) have also gained prominence. However, existing methods tend to increase the depth of the network in feature space for better reconstruction quality, neglecting the hierarchical representation of intermediate features in pixel space. In order to coordinate the feature space and pixel space to complete the deep reconstruction of images, and further improve the reconstruction performance of current CS methods, we propose a multi-scale channel distillation network (MSCDN). This network first obtains images of multiple scales using a scale-space image decomposition method at the sampling stage, followed by sampling these decomposed images through a convolutional operation. In this way, multi-scale information in the compressed domain is aggregated. During the reconstruction phase, a low-frequency information recovery network generates a preliminary image, whereas a high-frequency feature aggregation network refines the image further. Specifically, we design a dual-branch deep reconstruction architecture with channel distillation residual block (CDRB) as the core component. One branch extracts features gradually by cascading multiple CDRB modules, thereby supplementing the initial reconstructed image with a large amount of high-frequency content in feature space. The other branch takes the initial reconstructed image as input and sequentially fuses the intermediate feature outputs by CDRBs to increase the local details of the image in pixel space. Combining outputs from both branches, we achieve an optimal reconstructed image. Extensive experimental results on four benchmark datasets demonstrate that MSCDN surpasses state-of-the-art CS methods not only in reconstruction accuracy but also in perceptual visual quality.\",\"PeriodicalId\":13079,\"journal\":{\"name\":\"IEEE Access\",\"volume\":\"13 \",\"pages\":\"9524-9537\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-01-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10835084\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Access\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10835084/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Access","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10835084/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Multi-Scale Channel Distillation Network for Image Compressive Sensing
Recently, convolutional neural networks (CNNs) have demonstrated striking success in computer vision tasks. Methods based on CNNs for image compressive sensing (CS) have also gained prominence. However, existing methods tend to increase the depth of the network in feature space for better reconstruction quality, neglecting the hierarchical representation of intermediate features in pixel space. In order to coordinate the feature space and pixel space to complete the deep reconstruction of images, and further improve the reconstruction performance of current CS methods, we propose a multi-scale channel distillation network (MSCDN). This network first obtains images of multiple scales using a scale-space image decomposition method at the sampling stage, followed by sampling these decomposed images through a convolutional operation. In this way, multi-scale information in the compressed domain is aggregated. During the reconstruction phase, a low-frequency information recovery network generates a preliminary image, whereas a high-frequency feature aggregation network refines the image further. Specifically, we design a dual-branch deep reconstruction architecture with channel distillation residual block (CDRB) as the core component. One branch extracts features gradually by cascading multiple CDRB modules, thereby supplementing the initial reconstructed image with a large amount of high-frequency content in feature space. The other branch takes the initial reconstructed image as input and sequentially fuses the intermediate feature outputs by CDRBs to increase the local details of the image in pixel space. Combining outputs from both branches, we achieve an optimal reconstructed image. Extensive experimental results on four benchmark datasets demonstrate that MSCDN surpasses state-of-the-art CS methods not only in reconstruction accuracy but also in perceptual visual quality.
IEEE AccessCOMPUTER SCIENCE, INFORMATION SYSTEMSENGIN-ENGINEERING, ELECTRICAL & ELECTRONIC
CiteScore
9.80
自引率
7.70%
发文量
6673
审稿时长
6 weeks
期刊介绍:
IEEE Access® is a multidisciplinary, open access (OA), applications-oriented, all-electronic archival journal that continuously presents the results of original research or development across all of IEEE''s fields of interest.
IEEE Access will publish articles that are of high interest to readers, original, technically correct, and clearly presented. Supported by author publication charges (APC), its hallmarks are a rapid peer review and publication process with open access to all readers. Unlike IEEE''s traditional Transactions or Journals, reviews are "binary", in that reviewers will either Accept or Reject an article in the form it is submitted in order to achieve rapid turnaround. Especially encouraged are submissions on:
Multidisciplinary topics, or applications-oriented articles and negative results that do not fit within the scope of IEEE''s traditional journals.
Practical articles discussing new experiments or measurement techniques, interesting solutions to engineering.
Development of new or improved fabrication or manufacturing techniques.
Reviews or survey articles of new or evolving fields oriented to assist others in understanding the new area.