首页 > 最新文献

IEEE Transactions on Computational Imaging最新文献

英文 中文
M3Depth: Wavelet-Enhanced Depth Estimation on Mars via Mutual Boosting of Dual-Modal Data M3Depth:基于双模态数据相互增强的小波增强火星深度估计
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-10 DOI: 10.1109/TCI.2025.3642761
Junjie Li;Jiawei Wang;Miyu Li;Yu Liu;Yumei Wang;Haitao Xu
Depth estimation plays a great potential role in obstacle avoidance and navigation for further Mars exploration missions. Compared to traditional stereo matching, learning-based stereo depth estimation provides a data-driven approach to infer dense and precise depth maps from stereo image pairs. However, these methods always suffer performance degradation in environments with sparse textures and lacking geometric constraints, such as the unstructured terrain of Mars. To address these challenges, we propose M3Depth, a depth estimation model tailored for Mars rovers. Considering the sparse and smooth texture of Martian terrain, which is primarily composed of low-frequency features, our model incorporates a convolutional kernel based on wavelet transform that effectively captures low-frequency response and expands the receptive field. Additionally, we introduce a consistency loss that explicitly models the complementary relationship between depth map and surface normal map, utilizing the surface normal as a geometric constraint to enhance the accuracy of depth estimation. Besides, a pixel-wise refinement module with mutual boosting mechanism is designed to iteratively refine both depth and surface normal predictions. Experimental results on synthetic Mars datasets with depth annotations show that M3Depth achieves a 16% improvement in depth estimation accuracy compared to other state-of-the-art methods in depth estimation. Furthermore, the model demonstrates strong applicability in real-world Martian scenarios, offering a promising solution for future Mars exploration missions.
深度估计在未来火星探测任务的避障和导航中发挥着巨大的潜在作用。与传统的立体匹配相比,基于学习的立体深度估计提供了一种数据驱动的方法,可以从立体图像对中推断出密集和精确的深度图。然而,这些方法在纹理稀疏和缺乏几何约束的环境中总是会出现性能下降,比如火星的非结构化地形。为了解决这些挑战,我们提出了M3Depth,这是一个为火星探测器量身定制的深度估计模型。考虑到火星地形纹理稀疏光滑,主要由低频特征组成,我们的模型结合了基于小波变换的卷积核,有效地捕获了低频响应,扩大了接收场。此外,我们引入了一致性损失,明确地模拟了深度图和表面法线图之间的互补关系,利用表面法线作为几何约束来提高深度估计的精度。此外,还设计了具有相互促进机制的逐像素细化模块,对深度和表面法线预测进行迭代细化。在具有深度标注的合成火星数据集上的实验结果表明,与其他最先进的深度估计方法相比,M3Depth的深度估计精度提高了16%。此外,该模型在真实的火星场景中具有很强的适用性,为未来的火星探测任务提供了一个有希望的解决方案。
{"title":"M3Depth: Wavelet-Enhanced Depth Estimation on Mars via Mutual Boosting of Dual-Modal Data","authors":"Junjie Li;Jiawei Wang;Miyu Li;Yu Liu;Yumei Wang;Haitao Xu","doi":"10.1109/TCI.2025.3642761","DOIUrl":"https://doi.org/10.1109/TCI.2025.3642761","url":null,"abstract":"Depth estimation plays a great potential role in obstacle avoidance and navigation for further Mars exploration missions. Compared to traditional stereo matching, learning-based stereo depth estimation provides a data-driven approach to infer dense and precise depth maps from stereo image pairs. However, these methods always suffer performance degradation in environments with sparse textures and lacking geometric constraints, such as the unstructured terrain of Mars. To address these challenges, we propose M<sup>3</sup>Depth, a depth estimation model tailored for Mars rovers. Considering the sparse and smooth texture of Martian terrain, which is primarily composed of low-frequency features, our model incorporates a convolutional kernel based on wavelet transform that effectively captures low-frequency response and expands the receptive field. Additionally, we introduce a consistency loss that explicitly models the complementary relationship between depth map and surface normal map, utilizing the surface normal as a geometric constraint to enhance the accuracy of depth estimation. Besides, a pixel-wise refinement module with mutual boosting mechanism is designed to iteratively refine both depth and surface normal predictions. Experimental results on synthetic Mars datasets with depth annotations show that M<sup>3</sup>Depth achieves a 16% improvement in depth estimation accuracy compared to other state-of-the-art methods in depth estimation. Furthermore, the model demonstrates strong applicability in real-world Martian scenarios, offering a promising solution for future Mars exploration missions.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"12 ","pages":"158-171"},"PeriodicalIF":4.8,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-Supervised Learning-Based Reconstruction of High-Resolution 4D Light Fields 基于自监督学习的高分辨率四维光场重建
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-10 DOI: 10.1109/TCI.2025.3642236
Jianxin Lei;Dongze Wu;Chengcai Xu;Hongcheng Gu;Guangquan Zhou;Junhui Hou;Ping Zhou
Hand-held light field (LF) cameras often exhibit low spatial resolution due to the inherent trade-off between spatial and angular dimensions. Existing supervised learning-based LF spatial super-resolution (SR) methods, which rely on pre-defined image degradation models, struggle to overcome the domain gap between the training phase—where LFs with natural resolution are used as ground truth—and the inference phase, which aims to reconstruct higher-resolution LFs, especially when applied to real-world data. To address this challenge, this paper introduces a novel self-supervised learning-based method for LF spatial SR, which can produce higher spatial resolution LF images than originally captured ones without pre-defined image degradation models. The self-supervised method incorporates a hybrid LF imaging prototype, a real-world hybrid LF dataset, and a self-supervised LF spatial SR framework. The prototype makes reference image pairs between low-resolution central-view sub-aperture images and high-resolution (HR) images. The self-supervised framework consists of a well-designed LF spatial SR network with hybrid input, a central-view synthesis network with an HR-aware loss that enables side-view sub-aperture images to learn high-frequency information from the only HR central view reference image, and a backward degradation network with an epipolar-plane image gradient loss to preserve LF parallax structures. Extensive experiments on both simulated and real-world datasets demonstrate the significant superiority of our approach over state-of-the-art ones in reconstructing higher spatial resolution LF images without pre-defined degradation.
手持光场(LF)相机往往表现出较低的空间分辨率,由于固有的权衡空间和角度的尺寸。现有的基于监督学习的LFs空间超分辨率(SR)方法依赖于预定义的图像退化模型,难以克服训练阶段(自然分辨率的LFs被用作基础真值)和推理阶段(旨在重建更高分辨率的LFs,特别是在应用于现实世界数据时)之间的域差距。为了解决这一问题,本文引入了一种新的基于自监督学习的LF空间SR方法,该方法可以产生比原始捕获的更高的空间分辨率的LF图像,而无需预先定义图像退化模型。自监督方法结合了混合LF成像原型、现实世界混合LF数据集和自监督LF空间SR框架。该原型在低分辨率中央视子孔径图像和高分辨率(HR)图像之间建立了参考图像对。自监督框架包括一个设计良好的带有混合输入的LF空间SR网络,一个具有HR感知损失的中心视图合成网络,使侧视子孔径图像能够从唯一的HR中心视图参考图像中学习高频信息,以及一个具有极平面图像梯度损失的向后退化网络,以保留LF视差结构。在模拟和真实数据集上进行的大量实验表明,我们的方法在重建更高空间分辨率的LF图像而没有预定义的退化方面比最先进的方法具有显著的优势。
{"title":"Self-Supervised Learning-Based Reconstruction of High-Resolution 4D Light Fields","authors":"Jianxin Lei;Dongze Wu;Chengcai Xu;Hongcheng Gu;Guangquan Zhou;Junhui Hou;Ping Zhou","doi":"10.1109/TCI.2025.3642236","DOIUrl":"https://doi.org/10.1109/TCI.2025.3642236","url":null,"abstract":"Hand-held light field (LF) cameras often exhibit low spatial resolution due to the inherent trade-off between spatial and angular dimensions. Existing supervised learning-based LF spatial super-resolution (SR) methods, which rely on pre-defined image degradation models, struggle to overcome the domain gap between the training phase—where LFs with natural resolution are used as ground truth—and the inference phase, which aims to reconstruct higher-resolution LFs, especially when applied to real-world data. To address this challenge, this paper introduces a novel self-supervised learning-based method for LF spatial SR, which can produce higher spatial resolution LF images than originally captured ones without pre-defined image degradation models. The self-supervised method incorporates a hybrid LF imaging prototype, a real-world hybrid LF dataset, and a self-supervised LF spatial SR framework. The prototype makes reference image pairs between low-resolution central-view sub-aperture images and high-resolution (HR) images. The self-supervised framework consists of a well-designed LF spatial SR network with hybrid input, a central-view synthesis network with an HR-aware loss that enables side-view sub-aperture images to learn high-frequency information from the only HR central view reference image, and a backward degradation network with an epipolar-plane image gradient loss to preserve LF parallax structures. Extensive experiments on both simulated and real-world datasets demonstrate the significant superiority of our approach over state-of-the-art ones in reconstructing higher spatial resolution LF images without pre-defined degradation.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"12 ","pages":"114-127"},"PeriodicalIF":4.8,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145830904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
List of Reviewers 审稿人名单
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-10 DOI: 10.1109/TCI.2025.3641749
{"title":"List of Reviewers","authors":"","doi":"10.1109/TCI.2025.3641749","DOIUrl":"https://doi.org/10.1109/TCI.2025.3641749","url":null,"abstract":"","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"1682-1685"},"PeriodicalIF":4.8,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11296854","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145729404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distribution-Adaptive Hierarchical Quantization Enhanced Binary Networks for Spectral Compressive Imaging 光谱压缩成像的分布自适应分层量化增强二值网络
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-08 DOI: 10.1109/TCI.2025.3641031
Mengying Jin;Liang Xiao;Zhihui Wei
Hyperspectral image processing faces significant challenges in storage and computation. Snapshot Compressive Imaging (SCI) effectively encodes three-dimensional data into two-dimensional measurements, facilitating efficient data acquisition. However, reconstructing high-quality data from these compressed measurements remains a formidable task. Binary Neural Networks (BNNs) have gained attention for their ability to reduce storage requirements and computational costs. Yet, they often struggle with accuracy loss, fixed quantization limits, and lack of domain knowledge utilization. To overcome these limitations, distribution-adaptive hierarchical quantization-enhanced binary networks are proposed to achieve efficient SCI reconstruction. First, an adaptive distribution strategy and a binary weight evaluation branch are proposed to improve representation accuracy. Second, a hierarchical quantization scheme is presented to enhance multiscale feature extraction while maintaining efficiency. Third, domain-specific priors and a novel sparsity constraint are incorporated to capture fine details and improve training stability. The experimental results demonstrate the superiority of our approach, achieving an increase of 1.98 dB in PSNR and an improvement of 0.055 in SSIM compared to state-of-the-art BNNs.
高光谱图像处理在存储和计算方面面临着重大挑战。快照压缩成像(SCI)有效地将三维数据编码为二维测量,促进了高效的数据采集。然而,从这些压缩测量中重建高质量的数据仍然是一项艰巨的任务。二进制神经网络(BNNs)因其降低存储需求和计算成本的能力而受到关注。然而,它们经常与准确性损失、固定量化限制和缺乏领域知识利用作斗争。为了克服这些限制,提出了分布自适应分层量化增强二值网络来实现高效的SCI重建。首先,提出了一种自适应分布策略和二值权评估分支来提高表征精度;其次,提出了一种分层量化方案,在保证提取效率的前提下增强多尺度特征提取。第三,结合领域特定先验和新的稀疏性约束来捕获精细细节,提高训练稳定性。实验结果证明了我们的方法的优越性,与最先进的bnn相比,PSNR提高了1.98 dB, SSIM提高了0.055。
{"title":"Distribution-Adaptive Hierarchical Quantization Enhanced Binary Networks for Spectral Compressive Imaging","authors":"Mengying Jin;Liang Xiao;Zhihui Wei","doi":"10.1109/TCI.2025.3641031","DOIUrl":"https://doi.org/10.1109/TCI.2025.3641031","url":null,"abstract":"Hyperspectral image processing faces significant challenges in storage and computation. Snapshot Compressive Imaging (SCI) effectively encodes three-dimensional data into two-dimensional measurements, facilitating efficient data acquisition. However, reconstructing high-quality data from these compressed measurements remains a formidable task. Binary Neural Networks (BNNs) have gained attention for their ability to reduce storage requirements and computational costs. Yet, they often struggle with accuracy loss, fixed quantization limits, and lack of domain knowledge utilization. To overcome these limitations, distribution-adaptive hierarchical quantization-enhanced binary networks are proposed to achieve efficient SCI reconstruction. First, an adaptive distribution strategy and a binary weight evaluation branch are proposed to improve representation accuracy. Second, a hierarchical quantization scheme is presented to enhance multiscale feature extraction while maintaining efficiency. Third, domain-specific priors and a novel sparsity constraint are incorporated to capture fine details and improve training stability. The experimental results demonstrate the superiority of our approach, achieving an increase of 1.98 dB in PSNR and an improvement of 0.055 in SSIM compared to state-of-the-art BNNs.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"12 ","pages":"86-101"},"PeriodicalIF":4.8,"publicationDate":"2025-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145830968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Moving Targets Imaging by SVD of a Space-Velocity MIMO Radar Data Driven Matrix 空速MIMO雷达数据驱动矩阵的SVD运动目标成像
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-05 DOI: 10.1109/TCI.2025.3640864
Liliana Borcea;Josselin Garnier
We introduce a method for Multiple Input Multiple Output (MIMO) radar imaging of moving targets in a strongly reflecting, complex stationary scenery (clutter). The radar system has fixed nearby antennas that play the dual role of sources and receivers. It gathers data either by emitting probing pulses from one antenna at a time, or by sending from all the antennas non-coherent, possibly orthogonal, waveforms. We show how to obtain from the measurements an imaging function that depends on search position and velocity and is approximately separable in these variables, for a single moving target. For multiple moving targets in clutter, the imaging function is a sum of separable functions. By sampling this imaging function on a position-velocity grid we obtain an imaging matrix whose Singular Value Decomposition (SVD) allows the separation of the clutter and the targets moving at different velocities. The decomposition also leads directly to estimates of the locations and motion of the targets. The imaging method is designed to work in strong clutter, with unknown and possibly heterogeneous statistics. It does not require prior estimation of the covariance matrix of the clutter response or of its rank. We give an analysis of the imaging method and illustrate how it works with numerical simulations.
介绍了一种多输入多输出(MIMO)雷达在强反射、复杂静止环境(杂波)下对运动目标的成像方法。雷达系统有固定的附近天线,扮演源和接收器的双重角色。它收集数据的方式要么是一次从一个天线发射探测脉冲,要么是从所有天线发射非相干的、可能是正交的波形。我们展示了如何从测量中获得成像函数,该函数取决于搜索位置和速度,并且在这些变量中近似可分,对于单个移动目标。对于杂波环境下的多个运动目标,其成像函数是可分离函数的和。通过在位置-速度网格上对该成像函数进行采样,得到了一个成像矩阵,该矩阵的奇异值分解(SVD)可以将杂波和以不同速度运动的目标分离开来。分解还直接导致对目标的位置和运动的估计。该成像方法设计用于强杂波,具有未知和可能异构的统计数据。它不需要事先估计杂波响应的协方差矩阵或其秩。我们对成像方法进行了分析,并用数值模拟说明了它是如何工作的。
{"title":"Moving Targets Imaging by SVD of a Space-Velocity MIMO Radar Data Driven Matrix","authors":"Liliana Borcea;Josselin Garnier","doi":"10.1109/TCI.2025.3640864","DOIUrl":"https://doi.org/10.1109/TCI.2025.3640864","url":null,"abstract":"We introduce a method for Multiple Input Multiple Output (MIMO) radar imaging of moving targets in a strongly reflecting, complex stationary scenery (clutter). The radar system has fixed nearby antennas that play the dual role of sources and receivers. It gathers data either by emitting probing pulses from one antenna at a time, or by sending from all the antennas non-coherent, possibly orthogonal, waveforms. We show how to obtain from the measurements an imaging function that depends on search position and velocity and is approximately separable in these variables, for a single moving target. For multiple moving targets in clutter, the imaging function is a sum of separable functions. By sampling this imaging function on a position-velocity grid we obtain an imaging matrix whose Singular Value Decomposition (SVD) allows the separation of the clutter and the targets moving at different velocities. The decomposition also leads directly to estimates of the locations and motion of the targets. The imaging method is designed to work in strong clutter, with unknown and possibly heterogeneous statistics. It does not require prior estimation of the covariance matrix of the clutter response or of its rank. We give an analysis of the imaging method and illustrate how it works with numerical simulations.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"12 ","pages":"172-186"},"PeriodicalIF":4.8,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multilevel Plug-and-Play Image Restoration 多层次即插即用图像恢复
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-05 DOI: 10.1109/TCI.2025.3640427
Nils Laurent;Julián Tachella;Elisa Riccietti;Nelly Pustelnik
Plug-and-play (PnP) image reconstruction methods leverage pretrained deep neural network denoisers as image priors to solve general inverse problems, and can obtain a competitive performance without having to train a network on a specific problem. Despite their flexibility, PnP methods often require several iterations to converge and their performance can be highly sensitive to the choice of the initialization and of the hyperparameters. In this paper, we propose a new multilevel PnP framework to accelerate the convergence of PnP methods in the context of large-scale images. The proposed scheme, following a coarse-to-fine strategy, is initialized at the coarsest scale and the resolution of the starting point is progressively improved to reach the fine level with the highest resolution. The scheme then combines classical PnP iterations with cheaper iterations, involving representations of the images at coarser scales. As a result of the combination of these two ingredients, the multilevel PnP scheme accelerates the convergence and improves the robustness to the choice of initialization and hyperparameters. In a series of experiments, including image inpainting, demosaicing, and deblurring, we show that the proposed multilevel PnP method outperforms other PnP methods in both speed and reconstruction performance.
即插即用(PnP)图像重建方法利用预训练的深度神经网络去噪作为图像先验来解决一般的逆问题,并且无需针对特定问题训练网络即可获得具有竞争力的性能。尽管具有灵活性,但PnP方法通常需要多次迭代才能收敛,并且其性能对初始化和超参数的选择高度敏感。在本文中,我们提出了一个新的多层PnP框架来加速PnP方法在大规模图像背景下的收敛。该方案采用从粗到细的策略,在最粗尺度上初始化,逐步提高起始点的分辨率,达到分辨率最高的精细级。然后,该方案结合了经典的PnP迭代和更便宜的迭代,包括在更粗的尺度上表示图像。由于这两种成分的结合,多层PnP方案加快了收敛速度,提高了对初始化和超参数选择的鲁棒性。在一系列的实验中,包括图像绘制,去马赛克和去模糊,我们表明,所提出的多层次PnP方法在速度和重建性能上都优于其他PnP方法。
{"title":"Multilevel Plug-and-Play Image Restoration","authors":"Nils Laurent;Julián Tachella;Elisa Riccietti;Nelly Pustelnik","doi":"10.1109/TCI.2025.3640427","DOIUrl":"https://doi.org/10.1109/TCI.2025.3640427","url":null,"abstract":"Plug-and-play (PnP) image reconstruction methods leverage pretrained deep neural network denoisers as image priors to solve general inverse problems, and can obtain a competitive performance without having to train a network on a specific problem. Despite their flexibility, PnP methods often require several iterations to converge and their performance can be highly sensitive to the choice of the initialization and of the hyperparameters. In this paper, we propose a new multilevel PnP framework to accelerate the convergence of PnP methods in the context of large-scale images. The proposed scheme, following a coarse-to-fine strategy, is initialized at the coarsest scale and the resolution of the starting point is progressively improved to reach the fine level with the highest resolution. The scheme then combines classical PnP iterations with cheaper iterations, involving representations of the images at coarser scales. As a result of the combination of these two ingredients, the multilevel PnP scheme accelerates the convergence and improves the robustness to the choice of initialization and hyperparameters. In a series of experiments, including image inpainting, demosaicing, and deblurring, we show that the proposed multilevel PnP method outperforms other PnP methods in both speed and reconstruction performance.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"12 ","pages":"445-456"},"PeriodicalIF":4.8,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convergence-Guaranteed Spectral CT Reconstruction via Internal and External Prior Mining 基于内外先验挖掘的收敛保证谱CT重建
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-11-24 DOI: 10.1109/TCI.2025.3636743
Chunyan Liu;Dianlin Hu;Jiangjun Peng;Hong Wang;Qianyu Shu;Jianjun Wang
Spectral computed tomography (CT) is an imaging technology that utilizes the absorption characteristics of different X-ray energies to obtain the X-ray attenuation characteristics of objects in different energy ranges. However, the limited number of photons detected by spectral CT under a specific X-ray spectrum leads to obvious projection data noise. Making full use of the various properties of the original data is an effective way to recover a clean image from a small amount of noisy projection data. This paper proposes a spectral CT reconstruction method based on representative coefficient image denoising under a low-rank decomposition framework. This method integrates model-driven internal low-rank and nonlocal priors, and data-driven external deep priors, aiming to fully exploit the inherent spectral correlation, nonlocal self-similarity and deep spatial features in spectral CT images. Specifically, we use low-rank decomposition to characterize the global low-rankness of spectral CT images under a plug-and-play framework, and jointly utilize nonlocal low-rankness and smoothness as well as deep image priors to denoise representative coefficient images. Therefore, the proposed method faithfully represents the real underlying information of images by cleverly combining internal and external, nonlocal and local priors. Meanwhile, we design an effective proximal alternating minimization (PAM) algorithm to solve the proposed reconstruction model and establish the theoretical guarantee of the proposed algorithm. Experimental results show that compared with existing popular algorithms, the proposed method can significantly reduce running time while improving spectral CT images quality.
光谱计算机断层扫描(CT)是一种利用不同x射线能量的吸收特性来获得不同能量范围内物体的x射线衰减特性的成像技术。然而,在特定的x射线光谱下,光谱CT检测到的光子数量有限,导致投影数据噪声明显。充分利用原始数据的各种属性是从少量噪声投影数据中恢复干净图像的有效途径。提出了一种低秩分解框架下基于代表性系数图像去噪的光谱CT重构方法。该方法将模型驱动的内部低秩和非局部先验和数据驱动的外部深度先验相结合,旨在充分利用光谱CT图像固有的光谱相关性、非局部自相似性和深度空间特征。具体而言,我们在即插即用框架下,利用低秩分解对光谱CT图像的全局低秩度进行表征,并联合利用非局部低秩度和平滑度以及深度图像先验对代表性系数图像进行去噪。因此,该方法将内部先验与外部先验、非局部先验与局部先验巧妙结合,忠实地反映了图像的真实底层信息。同时,我们设计了一种有效的邻域交替极小化(PAM)算法来求解所提出的重构模型,并为所提出的算法建立了理论保证。实验结果表明,与现有的流行算法相比,该方法在提高光谱CT图像质量的同时显著缩短了运行时间。
{"title":"Convergence-Guaranteed Spectral CT Reconstruction via Internal and External Prior Mining","authors":"Chunyan Liu;Dianlin Hu;Jiangjun Peng;Hong Wang;Qianyu Shu;Jianjun Wang","doi":"10.1109/TCI.2025.3636743","DOIUrl":"https://doi.org/10.1109/TCI.2025.3636743","url":null,"abstract":"Spectral computed tomography (CT) is an imaging technology that utilizes the absorption characteristics of different X-ray energies to obtain the X-ray attenuation characteristics of objects in different energy ranges. However, the limited number of photons detected by spectral CT under a specific X-ray spectrum leads to obvious projection data noise. Making full use of the various properties of the original data is an effective way to recover a clean image from a small amount of noisy projection data. This paper proposes a spectral CT reconstruction method based on representative coefficient image denoising under a low-rank decomposition framework. This method integrates model-driven internal low-rank and nonlocal priors, and data-driven external deep priors, aiming to fully exploit the inherent spectral correlation, nonlocal self-similarity and deep spatial features in spectral CT images. Specifically, we use low-rank decomposition to characterize the global low-rankness of spectral CT images under a plug-and-play framework, and jointly utilize nonlocal low-rankness and smoothness as well as deep image priors to denoise representative coefficient images. Therefore, the proposed method faithfully represents the real underlying information of images by cleverly combining internal and external, nonlocal and local priors. Meanwhile, we design an effective proximal alternating minimization (PAM) algorithm to solve the proposed reconstruction model and establish the theoretical guarantee of the proposed algorithm. Experimental results show that compared with existing popular algorithms, the proposed method can significantly reduce running time while improving spectral CT images quality.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"12 ","pages":"46-59"},"PeriodicalIF":4.8,"publicationDate":"2025-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145766180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast Correction for Geometric Distortion in PFA Wavefront Curvature Compensation PFA波前曲率补偿中几何畸变的快速校正
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-11-24 DOI: 10.1109/TCI.2025.3636746
Yining Zhang;Jixia Fan;Yanqi Liu;Xinhua Mao
In large-scene synthetic aperture radar (SAR) imaging, the selection of appropriate algorithms is crucial as it directly impacts processing efficiency and image fidelity. The Polar Format Algorithm (PFA) is widely used for its high-speed image formation capabilities. However, its reliance on the planar wavefront approximation inevitably introduces phase errors. A primary challenge arising from linear components of these errors is geometric distortion, which manifests as space-variant shift from actual positions. Traditional inverse warping correction method based on two-dimensional(2-D) interpolation suffers from high computational costs. To address this limitation, this paper proposes a separable 2-D interpolation framework that decouples the correction process into two one-dimensional (1-D) interpolations along azimuth and range axes. Through inverse solutions of geometric distortion functions, it is demonstrated that applying this framework to geometric distortion correction can effectively reduce the complexity while preserving image precision. Simulations and real-data comparisons validate that the proposed fast geometric distortion correction method significantly improve correction speed thus boosting overall computational efficiency.
在大场景合成孔径雷达(SAR)成像中,算法的选择至关重要,它直接影响到处理效率和图像保真度。极坐标格式算法(Polar Format Algorithm, PFA)因其高速图像生成能力而得到广泛应用。然而,它对平面波前近似的依赖不可避免地引入了相位误差。由这些误差的线性分量引起的一个主要挑战是几何畸变,它表现为与实际位置的空间变异偏移。传统的基于二维(2-D)插值的逆翘曲校正方法计算量大。为了解决这一限制,本文提出了一个可分离的二维插值框架,该框架将校正过程解耦为沿方位轴和距离轴的两个一维(一维)插值。通过几何畸变函数的反解,证明将该框架应用于几何畸变校正可以有效地降低校正复杂度,同时保持图像精度。仿真和实际数据对比验证了所提出的快速几何畸变校正方法显著提高了校正速度,从而提高了整体计算效率。
{"title":"Fast Correction for Geometric Distortion in PFA Wavefront Curvature Compensation","authors":"Yining Zhang;Jixia Fan;Yanqi Liu;Xinhua Mao","doi":"10.1109/TCI.2025.3636746","DOIUrl":"https://doi.org/10.1109/TCI.2025.3636746","url":null,"abstract":"In large-scene synthetic aperture radar (SAR) imaging, the selection of appropriate algorithms is crucial as it directly impacts processing efficiency and image fidelity. The Polar Format Algorithm (PFA) is widely used for its high-speed image formation capabilities. However, its reliance on the planar wavefront approximation inevitably introduces phase errors. A primary challenge arising from linear components of these errors is geometric distortion, which manifests as space-variant shift from actual positions. Traditional inverse warping correction method based on two-dimensional(2-D) interpolation suffers from high computational costs. To address this limitation, this paper proposes a separable 2-D interpolation framework that decouples the correction process into two one-dimensional (1-D) interpolations along azimuth and range axes. Through inverse solutions of geometric distortion functions, it is demonstrated that applying this framework to geometric distortion correction can effectively reduce the complexity while preserving image precision. Simulations and real-data comparisons validate that the proposed fast geometric distortion correction method significantly improve correction speed thus boosting overall computational efficiency.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"12 ","pages":"60-72"},"PeriodicalIF":4.8,"publicationDate":"2025-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145766194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rotational Motion Compensation and Sparse ISAR Imaging of Maneuvering Targets via Deep Unrolling Network 基于深度展开网络的机动目标旋转补偿与稀疏ISAR成像
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-11-24 DOI: 10.1109/TCI.2025.3636744
Yue Wang;Xueru Bai;Feng Zhou
Accurate rotational motion compensation is critical for achieving well-focused inverse synthetic aperture radar (ISAR) imaging of maneuvering targets. However, low signal-to-noise ratio (SNR) and incomplete echoes often lead to significant performance degradation in conventional methods. Furthermore, these methods rely heavily on manual parameter tuning, which limits their adaptability to varying SNR and data missing rate in practical applications. In this article, a novel deep unrolling network for ISAR imaging of maneuvering targets is proposed. Firstly, an iterative method, termed RMC-PDHG, is proposed for rotational motion compensation and well-focused ISAR imaging based on Primal-Dual Hybrid Gradient (PDHG), enabling accurate imaging of maneuvering targets under low SNR and incomplete echo conditions. On this basis, a rotational motion compensation and imaging network, i.e., RMC-PDHG-Net, is developed by unrolling the RMC-PDHG. This network incorporates a hypernetwork to dynamically generate optimal internal parameters such as the regularization coefficient and step size based on intermediate image features, thereby gaining robustness to varying SNR and data missing rate. Additionally, a two-stage training strategy combining unsupervised and supervised learning is proposed to improve rotation parameter estimation accuracy and image reconstruction quality. Experimental results on simulated and measured data have demonstrated the effectiveness and robustness of the proposed network.
精确的旋转运动补偿是实现反合成孔径雷达(ISAR)对机动目标成像的关键。然而,在传统的方法中,低信噪比和不完全回波往往导致性能显著下降。此外,这些方法严重依赖于人工参数调整,这限制了它们在实际应用中对不同信噪比和数据缺失率的适应性。本文提出了一种用于机动目标ISAR成像的深度展开网络。首先,提出了一种基于原始-双混合梯度(PDHG)的旋转运动补偿和聚焦ISAR成像的迭代方法rmmc -PDHG,实现了低信噪比和不完全回波条件下机动目标的精确成像。在此基础上,通过展开rmmc - pdhg,构建了一个旋转运动补偿成像网络rmmc - pdhg - net。该网络采用超网络,根据中间图像特征动态生成最优的正则化系数和步长等内部参数,从而获得对不同信噪比和数据缺失率的鲁棒性。此外,提出了一种无监督学习和监督学习相结合的两阶段训练策略,以提高旋转参数估计精度和图像重建质量。仿真和实测数据的实验结果证明了该网络的有效性和鲁棒性。
{"title":"Rotational Motion Compensation and Sparse ISAR Imaging of Maneuvering Targets via Deep Unrolling Network","authors":"Yue Wang;Xueru Bai;Feng Zhou","doi":"10.1109/TCI.2025.3636744","DOIUrl":"https://doi.org/10.1109/TCI.2025.3636744","url":null,"abstract":"Accurate rotational motion compensation is critical for achieving well-focused inverse synthetic aperture radar (ISAR) imaging of maneuvering targets. However, low signal-to-noise ratio (SNR) and incomplete echoes often lead to significant performance degradation in conventional methods. Furthermore, these methods rely heavily on manual parameter tuning, which limits their adaptability to varying SNR and data missing rate in practical applications. In this article, a novel deep unrolling network for ISAR imaging of maneuvering targets is proposed. Firstly, an iterative method, termed RMC-PDHG, is proposed for rotational motion compensation and well-focused ISAR imaging based on Primal-Dual Hybrid Gradient (PDHG), enabling accurate imaging of maneuvering targets under low SNR and incomplete echo conditions. On this basis, a rotational motion compensation and imaging network, i.e., RMC-PDHG-Net, is developed by unrolling the RMC-PDHG. This network incorporates a hypernetwork to dynamically generate optimal internal parameters such as the regularization coefficient and step size based on intermediate image features, thereby gaining robustness to varying SNR and data missing rate. Additionally, a two-stage training strategy combining unsupervised and supervised learning is proposed to improve rotation parameter estimation accuracy and image reconstruction quality. Experimental results on simulated and measured data have demonstrated the effectiveness and robustness of the proposed network.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"12 ","pages":"11-24"},"PeriodicalIF":4.8,"publicationDate":"2025-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145766225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Slice Knowledge-Driven System Matrix Calibration in Magnetic Particle Imaging 磁颗粒成像中多层知识驱动系统矩阵标定
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-11-24 DOI: 10.1109/TCI.2025.3636749
Pengyue Guo;Zechen Wei;Yu Zeng;Bingye Wang;Yidong Liao;Jiawei Hu;Lingwen Hou;Kai Liu;Ning He;Qibin Wang;Lei Li;Hui Hui;Yihan Wang;Shouping Zhu;Jie Tian
Magnetic particle imaging (MPI) is a novel tomographic imaging technique with high sensitivity and high temporal resolution. Reconstruction methods based on the system matrix (SM) enable accurate estimation of the concentration distribution of magnetic nanoparticles. However, SM calibration measurement is highly time-consuming, and the SM needs to be recalibrated whenever the scan parameters, particle types, or even the particle environment change. Although previous studies have proposed methods to accelerate SM calibration, these approaches do not fully exploit the similarity between the two-dimensional (2D) SMs of adjacent slices. In this study, we propose a multi-slice knowledge-driven SM calibration method, MKD-SM, which leverages knowledge obtained from multiple adjacent x-y slices to improve SM calibration accuracy at high downsampling ratios. Specifically, based on the significant similarity of the 2D SMs from adjacent x-y slices, MKD-SM employs a cross-misaligned sampling method to obtain the low-resolution (LR) SM within the field of view (FOV), ensuring that the 2D LR SMs obtained from adjacent slices exhibit complementarity. Additionally, we use a federated affinity fusion method to aggregate the complementary knowledge across multiple adjacent slices and utilize an architecture based on a cascade of CNNs and transformers for high-resolution (HR) SM recovery. Experimental results on the public OpenMPI dataset demonstrate that MKD-SM outperforms existing calibration methods, achieving higher SM calibration accuracy, particularly at high downsampling ratios. Ablation studies confirm the effectiveness of leveraging knowledge from adjacent slices. Furthermore, the proposed method has been successfully applied to an in-house field-free line (FFL) MPI scanner, enabling HR image reconstruction with LR SM measurements.
磁粉成像(MPI)是一种具有高灵敏度和高时间分辨率的层析成像技术。基于系统矩阵(SM)的重构方法能够准确估计磁性纳米颗粒的浓度分布。然而,SM校准测量非常耗时,每当扫描参数、颗粒类型甚至颗粒环境发生变化时,SM都需要重新校准。虽然以前的研究提出了加速SM校准的方法,但这些方法并没有充分利用相邻切片的二维SM之间的相似性。在这项研究中,我们提出了一种多片知识驱动的SM校准方法MKD-SM,该方法利用从多个相邻x-y切片获得的知识来提高高下采样比下的SM校准精度。具体而言,基于相邻x-y切片的二维微信号具有显著的相似性,MKD-SM采用交叉错位采样方法获得视场内的低分辨率微信号,确保相邻切片获得的二维微信号具有互补性。此外,我们使用联邦亲和融合方法来聚合多个相邻切片之间的互补知识,并利用基于cnn级联和变压器的架构进行高分辨率(HR) SM恢复。在OpenMPI公共数据集上的实验结果表明,MKD-SM优于现有的校准方法,特别是在高降采样率下,实现了更高的SM校准精度。消融研究证实了利用邻近切片知识的有效性。此外,该方法已成功应用于内部无场线(FFL) MPI扫描仪,实现了LR SM测量的HR图像重建。
{"title":"Multi-Slice Knowledge-Driven System Matrix Calibration in Magnetic Particle Imaging","authors":"Pengyue Guo;Zechen Wei;Yu Zeng;Bingye Wang;Yidong Liao;Jiawei Hu;Lingwen Hou;Kai Liu;Ning He;Qibin Wang;Lei Li;Hui Hui;Yihan Wang;Shouping Zhu;Jie Tian","doi":"10.1109/TCI.2025.3636749","DOIUrl":"https://doi.org/10.1109/TCI.2025.3636749","url":null,"abstract":"Magnetic particle imaging (MPI) is a novel tomographic imaging technique with high sensitivity and high temporal resolution. Reconstruction methods based on the system matrix (SM) enable accurate estimation of the concentration distribution of magnetic nanoparticles. However, SM calibration measurement is highly time-consuming, and the SM needs to be recalibrated whenever the scan parameters, particle types, or even the particle environment change. Although previous studies have proposed methods to accelerate SM calibration, these approaches do not fully exploit the similarity between the two-dimensional (2D) SMs of adjacent slices. In this study, we propose a multi-slice knowledge-driven SM calibration method, MKD-SM, which leverages knowledge obtained from multiple adjacent x-y slices to improve SM calibration accuracy at high downsampling ratios. Specifically, based on the significant similarity of the 2D SMs from adjacent x-y slices, MKD-SM employs a cross-misaligned sampling method to obtain the low-resolution (LR) SM within the field of view (FOV), ensuring that the 2D LR SMs obtained from adjacent slices exhibit complementarity. Additionally, we use a federated affinity fusion method to aggregate the complementary knowledge across multiple adjacent slices and utilize an architecture based on a cascade of CNNs and transformers for high-resolution (HR) SM recovery. Experimental results on the public OpenMPI dataset demonstrate that MKD-SM outperforms existing calibration methods, achieving higher SM calibration accuracy, particularly at high downsampling ratios. Ablation studies confirm the effectiveness of leveraging knowledge from adjacent slices. Furthermore, the proposed method has been successfully applied to an in-house field-free line (FFL) MPI scanner, enabling HR image reconstruction with LR SM measurements.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"12 ","pages":"25-36"},"PeriodicalIF":4.8,"publicationDate":"2025-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145766219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Computational Imaging
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1