首页 > 最新文献

IEEE Transactions on Computational Imaging最新文献

英文 中文
Using Randomized Nyström Preconditioners to Accelerate Variational Image Reconstruction 使用随机Nyström预处理加速变分图像重建
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-10-20 DOI: 10.1109/TCI.2025.3622903
Tao Hong;Zhaoyi Xu;Jason Hu;Jeffrey A. Fessler
Model-based iterative reconstruction plays a key role in solving inverse problems. However, the associated minimization problems are generally large-scale, nonsmooth, and sometimes even nonconvex, which present challenges in designing efficient iterative solvers. Preconditioning methods can significantly accelerate the convergence of iterative methods. In some applications, computing preconditioners on-the-fly is beneficial. Moreover, forward models in image reconstruction are typically represented as operators, and the corresponding explicit matrices are often unavailable, which brings additional challenges in designing preconditioners. Therefore, for practical use, computing and applying preconditioners should be computationally inexpensive. This paper adapts the randomized Nyström approximation to compute effective preconditioners that accelerate image reconstruction without requiring an explicit matrix for the forward model. We leverage modern GPU computational platforms to compute the preconditioner on-the-fly. Moreover, we propose efficient approaches for applying the preconditioners to problems with classical nonsmooth regularizers, i.e., wavelet, total variation, and Hessian Schatten-norm. Our numerical results on image deblurring, super-resolution with impulsive noise, and 2D computed tomography reconstruction illustrate the efficiency and effectiveness of the proposed preconditioner.
基于模型的迭代重构是求解逆问题的关键。然而,相关的最小化问题通常是大规模的、非光滑的,有时甚至是非凸的,这对设计有效的迭代求解器提出了挑战。预处理方法可以显著加快迭代方法的收敛速度。在某些应用中,动态计算预调节器是有益的。此外,图像重建中的正演模型通常用算子表示,而相应的显式矩阵往往不可用,这给预处理设计带来了额外的挑战。因此,在实际应用中,计算和应用预调节器应该在计算上便宜。本文采用随机Nyström近似来计算加速图像重建的有效预调节器,而不需要显式的正演模型矩阵。我们利用现代GPU计算平台来实时计算前置条件。此外,我们提出了将预条件应用于经典非光滑正则化问题的有效方法,即小波、总变分和Hessian schattenn -范数。我们在图像去模糊、带脉冲噪声的超分辨率和二维计算机断层扫描重建方面的数值结果表明了所提出的预调节器的效率和有效性。
{"title":"Using Randomized Nyström Preconditioners to Accelerate Variational Image Reconstruction","authors":"Tao Hong;Zhaoyi Xu;Jason Hu;Jeffrey A. Fessler","doi":"10.1109/TCI.2025.3622903","DOIUrl":"https://doi.org/10.1109/TCI.2025.3622903","url":null,"abstract":"Model-based iterative reconstruction plays a key role in solving inverse problems. However, the associated minimization problems are generally large-scale, nonsmooth, and sometimes even nonconvex, which present challenges in designing efficient iterative solvers. Preconditioning methods can significantly accelerate the convergence of iterative methods. In some applications, computing preconditioners on-the-fly is beneficial. Moreover, forward models in image reconstruction are typically represented as operators, and the corresponding explicit matrices are often unavailable, which brings additional challenges in designing preconditioners. Therefore, for practical use, computing and applying preconditioners should be computationally inexpensive. This paper adapts the randomized Nyström approximation to compute effective preconditioners that accelerate image reconstruction without requiring an explicit matrix for the forward model. We leverage modern GPU computational platforms to compute the preconditioner on-the-fly. Moreover, we propose efficient approaches for applying the preconditioners to problems with classical nonsmooth regularizers, i.e., wavelet, total variation, and Hessian Schatten-norm. Our numerical results on image deblurring, super-resolution with impulsive noise, and 2D computed tomography reconstruction illustrate the efficiency and effectiveness of the proposed preconditioner.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"1630-1643"},"PeriodicalIF":4.8,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145560746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-Fidelity Holographic Reconstruction via Physics-Aware Heterogeneous Self-Supervised Learning 基于物理意识异构自监督学习的高保真全息重建
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-10-17 DOI: 10.1109/TCI.2025.3623041
Sida Gao;Ziyang Li;Zhengyu Wu;Qi Li;Pengcheng Jia;Yutong Li;Shutian Liu;Zhengjun Liu
Single-shot digital holography has demonstrated great potential for capturing the complex amplitude of optical fields in lensless imaging systems. Nevertheless, accurately reconstructing intricate interference patterns remains challenging due to high-frequency texture distortion and limited generalization across diverse real-world conditions. To address these issues, we propose a physics-embedded network that explicitly incorporates angular spectrum propagation and spatial-domain amplitude constraints into a deep learning framework, ensuring both physical consistency and strong representational capacity. Building on this foundation, we devised HoloSSL, a physics-aware heterogeneous self-supervised learning framework for robust single-frame holographic reconstruction. The framework consists of two complementary stages: (1) physics-aware self-supervised pretraining using Hermite-Gaussian synthesized holograms to learn a prior-consistent mapping, and (2) unsupervised domain adaptation using real captured intensity-only holograms, where intensity fidelity and propagation consistency jointly guide label-free fine-tuning. To enhance texture representation, we design a swin-hourglass block that integrates cross-scale self-attention with frequency-aware modeling. Extensive simulations and real-world experiments demonstrate that HoloSSL outperforms state-of-the-art methods in terms of reconstruction fidelity, structural consistency, and robustness to noise, providing a new paradigm for adaptive, interpretable, and high-fidelity holographic imaging.
在无透镜成像系统中,单镜头数字全息术在捕捉复杂光场振幅方面显示出巨大的潜力。然而,由于高频纹理失真和在不同现实条件下的有限泛化,准确重建复杂的干涉图案仍然具有挑战性。为了解决这些问题,我们提出了一个物理嵌入式网络,该网络明确地将角频谱传播和空间域振幅约束纳入深度学习框架,确保物理一致性和强大的表示能力。在此基础上,我们设计了HoloSSL,一个用于鲁棒单帧全息重建的物理感知异构自监督学习框架。该框架包括两个互补的阶段:(1)使用Hermite-Gaussian合成全息图进行物理感知的自监督预训练,以学习先验一致性映射;(2)使用真实捕获的仅强度全息图进行无监督域自适应,其中强度保真度和传播一致性共同指导无标签微调。为了增强纹理表征,我们设计了一个天鹅沙漏块,将跨尺度自注意与频率感知建模相结合。广泛的模拟和现实世界的实验表明,HoloSSL在重建保真度、结构一致性和对噪声的鲁棒性方面优于最先进的方法,为自适应、可解释和高保真全息成像提供了新的范例。
{"title":"High-Fidelity Holographic Reconstruction via Physics-Aware Heterogeneous Self-Supervised Learning","authors":"Sida Gao;Ziyang Li;Zhengyu Wu;Qi Li;Pengcheng Jia;Yutong Li;Shutian Liu;Zhengjun Liu","doi":"10.1109/TCI.2025.3623041","DOIUrl":"https://doi.org/10.1109/TCI.2025.3623041","url":null,"abstract":"Single-shot digital holography has demonstrated great potential for capturing the complex amplitude of optical fields in lensless imaging systems. Nevertheless, accurately reconstructing intricate interference patterns remains challenging due to high-frequency texture distortion and limited generalization across diverse real-world conditions. To address these issues, we propose a physics-embedded network that explicitly incorporates angular spectrum propagation and spatial-domain amplitude constraints into a deep learning framework, ensuring both physical consistency and strong representational capacity. Building on this foundation, we devised HoloSSL, a physics-aware heterogeneous self-supervised learning framework for robust single-frame holographic reconstruction. The framework consists of two complementary stages: (1) physics-aware self-supervised pretraining using Hermite-Gaussian synthesized holograms to learn a prior-consistent mapping, and (2) unsupervised domain adaptation using real captured intensity-only holograms, where intensity fidelity and propagation consistency jointly guide label-free fine-tuning. To enhance texture representation, we design a swin-hourglass block that integrates cross-scale self-attention with frequency-aware modeling. Extensive simulations and real-world experiments demonstrate that HoloSSL outperforms state-of-the-art methods in terms of reconstruction fidelity, structural consistency, and robustness to noise, providing a new paradigm for adaptive, interpretable, and high-fidelity holographic imaging.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"1469-1478"},"PeriodicalIF":4.8,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint High-Order Motion Compensation and Imaging Algorithm for Ultrahigh-Resolution Spaceborne ISAR 超高分辨率星载ISAR联合高阶运动补偿与成像算法
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-10-17 DOI: 10.1109/TCI.2025.3623006
Yichen Zhou;Yong Wang
The acquisition of high-resolution images of space orbiting satellites by spaceborne ISAR has significant importance for enhancing space situation awareness. However, challenges persist with the increase of resolution. Two main issues arise: (1) The assumption of a stable imaging projection plane is invalid under the current imaging regime. (2) The high-order spatial-variant range cell migration and phase errors caused by complex relative motion is introduced. Therefore, it is important to optimize the existing ISAR imaging geometry models and imaging algorithms. In this paper, integration of high-order motion compensation and high-resolution imaging algorithm is proposed. Firstly, the spaceborne ISAR geometric imaging model is established, and the rationality of the model and the unique properties application to image space satellites target are investigated. Subsequently, by effectively utilizing the properties, a high-order range migration elimination method based on image partition is proposed, along with the image partition parameters estimation method. For the remaining phase error, a high-resolution imaging algorithm is introduced to eliminate error, which can overcome the influence of cross terms and strong scattering points. Finally, compared with the existing algorithms, simulation results validate the effectiveness and superiority of the proposed algorithm under different SNR.
星载ISAR获取空间轨道卫星高分辨率图像对增强空间态势感知具有重要意义。然而,随着分辨率的提高,挑战依然存在。出现了两个主要问题:(1)在目前的成像制度下,稳定成像投影平面的假设是无效的。(2)介绍了复杂相对运动引起的高阶空间变异性距离单元偏移和相位误差。因此,优化现有ISAR成像几何模型和成像算法具有重要意义。本文提出了将高阶运动补偿与高分辨率成像算法相结合的方法。首先,建立了星载ISAR几何成像模型,研究了该模型的合理性及其在空间卫星目标成像中的独特性能。随后,通过有效利用这些属性,提出了一种基于图像分割的高阶距离偏移消除方法,以及图像分割参数估计方法。对于剩余相位误差,引入高分辨率成像算法消除误差,克服交叉项和强散射点的影响。最后,通过与现有算法的比较,仿真结果验证了该算法在不同信噪比下的有效性和优越性。
{"title":"Joint High-Order Motion Compensation and Imaging Algorithm for Ultrahigh-Resolution Spaceborne ISAR","authors":"Yichen Zhou;Yong Wang","doi":"10.1109/TCI.2025.3623006","DOIUrl":"https://doi.org/10.1109/TCI.2025.3623006","url":null,"abstract":"The acquisition of high-resolution images of space orbiting satellites by spaceborne ISAR has significant importance for enhancing space situation awareness. However, challenges persist with the increase of resolution. Two main issues arise: (1) The assumption of a stable imaging projection plane is invalid under the current imaging regime. (2) The high-order spatial-variant range cell migration and phase errors caused by complex relative motion is introduced. Therefore, it is important to optimize the existing ISAR imaging geometry models and imaging algorithms. In this paper, integration of high-order motion compensation and high-resolution imaging algorithm is proposed. Firstly, the spaceborne ISAR geometric imaging model is established, and the rationality of the model and the unique properties application to image space satellites target are investigated. Subsequently, by effectively utilizing the properties, a high-order range migration elimination method based on image partition is proposed, along with the image partition parameters estimation method. For the remaining phase error, a high-resolution imaging algorithm is introduced to eliminate error, which can overcome the influence of cross terms and strong scattering points. Finally, compared with the existing algorithms, simulation results validate the effectiveness and superiority of the proposed algorithm under different SNR.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"1479-1493"},"PeriodicalIF":4.8,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graph Attention-Driven Bayesian Deep Unrolling for Dual-Peak Single-Photon Lidar Imaging 双峰单光子激光雷达成像的图注意力驱动贝叶斯深度展开
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-10-17 DOI: 10.1109/TCI.2025.3623000
Kyungmin Choi;JaKeoung Koo;Stephen McLaughlin;Abderrahim Halimi
Single-photon Lidar imaging offers a significant advantage in 3D imaging due to its high resolution and long-range capabilities, however it is challenging to apply in noisy environments with multiple targets per pixel. To tackle these challenges, several methods have been proposed. Statistical methods demonstrate interpretability on the inferred parameters, but they are often limited in their ability to handle complex scenes. Deep learning-based methods have shown superior performance in terms of accuracy and robustness, but they lack interpretability or they are limited to a single-peak per pixel. In this paper, we propose a deep unrolling algorithm for dual-peak single-photon Lidar imaging. We introduce a hierarchical Bayesian model for multiple targets and propose a neural network that unrolls the underlying statistical method. To support multiple targets, we adopt a dual depth maps representation and exploit geometric deep learning to extract features from the point cloud. The proposed method takes advantages of statistical methods and learning-based methods in terms of accuracy and quantifying uncertainty. The experimental results on synthetic and real data demonstrate the competitive performance when compared to existing methods, while also providing uncertainty information.
单光子激光雷达成像由于其高分辨率和远程能力,在3D成像中具有显着优势,但在每像素多个目标的嘈杂环境中应用具有挑战性。为了应对这些挑战,已经提出了几种方法。统计方法证明了对推断参数的可解释性,但它们在处理复杂场景的能力上往往受到限制。基于深度学习的方法在准确性和鲁棒性方面表现出优异的性能,但它们缺乏可解释性,或者它们仅限于每个像素的单个峰值。本文提出了一种双峰单光子激光雷达成像的深度展开算法。我们引入了一个多目标的层次贝叶斯模型,并提出了一个神经网络来展开底层统计方法。为了支持多个目标,我们采用双深度图表示,并利用几何深度学习从点云中提取特征。该方法在准确性和量化不确定性方面具有统计方法和基于学习的方法的优点。在综合数据和真实数据上的实验结果表明,与现有方法相比,该方法具有较强的竞争力,同时也提供了不确定性信息。
{"title":"Graph Attention-Driven Bayesian Deep Unrolling for Dual-Peak Single-Photon Lidar Imaging","authors":"Kyungmin Choi;JaKeoung Koo;Stephen McLaughlin;Abderrahim Halimi","doi":"10.1109/TCI.2025.3623000","DOIUrl":"https://doi.org/10.1109/TCI.2025.3623000","url":null,"abstract":"Single-photon Lidar imaging offers a significant advantage in 3D imaging due to its high resolution and long-range capabilities, however it is challenging to apply in noisy environments with multiple targets per pixel. To tackle these challenges, several methods have been proposed. Statistical methods demonstrate interpretability on the inferred parameters, but they are often limited in their ability to handle complex scenes. Deep learning-based methods have shown superior performance in terms of accuracy and robustness, but they lack interpretability or they are limited to a single-peak per pixel. In this paper, we propose a deep unrolling algorithm for dual-peak single-photon Lidar imaging. We introduce a hierarchical Bayesian model for multiple targets and propose a neural network that unrolls the underlying statistical method. To support multiple targets, we adopt a dual depth maps representation and exploit geometric deep learning to extract features from the point cloud. The proposed method takes advantages of statistical methods and learning-based methods in terms of accuracy and quantifying uncertainty. The experimental results on synthetic and real data demonstrate the competitive performance when compared to existing methods, while also providing uncertainty information.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"1494-1504"},"PeriodicalIF":4.8,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Limitations of Data-Driven Spectral Reconstruction: An Optics-Aware Analysis 数据驱动光谱重建的局限性:光学感知分析
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-10-17 DOI: 10.1109/TCI.2025.3622928
Qiang Fu;Matheus Souza;Eunsue Choi;Suhyun Shin;Seung-Hwan Baek;Wolfgang Heidrich
Hyperspectral imaging empowers machine vision systems with the distinct capability of identifying materials through recording their spectral signatures. Recent efforts in data-driven spectral reconstruction aim at extracting spectral information from RGB images captured by cost-effective RGB cameras, instead of dedicated hardware. Published work reports exceedingly high numerical scores for this reconstruction task, yet real-world performance lags substantially behind. In this paper we systematically analyze the performance of such methods with three groups of dedicated experiments. First, we evaluate the practical overfitting limitations with respect to current datasets by training the networks with less data, validating the trained models with unseen yet slightly modified data, and cross-dataset validation. Second, we reveal fundamental limitations in the ability of RGB to spectral methods to deal with metameric or near-metameric conditions, which have so far gone largely unnoticed due to the insufficiencies of existing datasets. We achieve this by validating the trained models with metamer data generated by metameric black theory and re-training the networks with various forms of metamers. This methodology can also be used for data augmentation as a partial mitigation of the dataset issues, although the RGB to spectral inverse problem remains fundamentally ill-posed. Finally, we analyze the potential for modifying the problem setting to achieve better performance by exploiting some form of optical encoding provided by either incidental optical aberrations or some form of deliberate optical design. Our experiments show that such approaches do indeed provide improved results under certain circumstances, however their overall performance is limited by the same dataset issues as in the plain RGB to spectral scenario. We therefore conclude that future progress on snapshot spectral imaging will heavily depend on the generation of improved datasets which can then be used to design effective optical encoding strategies.
高光谱成像通过记录材料的光谱特征,赋予机器视觉系统识别材料的独特能力。最近在数据驱动的光谱重建方面的努力旨在从低成本的RGB相机捕获的RGB图像中提取光谱信息,而不是专用硬件。已发表的研究报告显示,这种重建任务的数值得分极高,但实际表现却远远落后于此。本文通过三组专用实验系统地分析了这些方法的性能。首先,我们评估了相对于当前数据集的实际过拟合限制,方法是用较少的数据训练网络,用未见但略有修改的数据验证训练模型,以及跨数据集验证。其次,我们揭示了RGB光谱方法处理异谱或近异谱条件的能力的基本局限性,由于现有数据集的不足,迄今为止这些局限性在很大程度上被忽视了。我们通过使用元元黑理论生成的元元数据验证训练模型,并使用各种形式的元元重新训练网络来实现这一点。这种方法也可用于数据增强,作为部分缓解数据集问题的方法,尽管RGB与光谱的逆问题基本上仍然是病态的。最后,我们分析了修改问题设置的可能性,通过利用由偶然光学像差或某种形式的故意光学设计提供的某种形式的光学编码来实现更好的性能。我们的实验表明,这些方法确实在某些情况下提供了改进的结果,但是它们的整体性能受到与普通RGB到光谱场景相同的数据集问题的限制。因此,我们得出结论,快照光谱成像的未来进展将在很大程度上取决于改进数据集的生成,这些数据集可以用来设计有效的光学编码策略。
{"title":"Limitations of Data-Driven Spectral Reconstruction: An Optics-Aware Analysis","authors":"Qiang Fu;Matheus Souza;Eunsue Choi;Suhyun Shin;Seung-Hwan Baek;Wolfgang Heidrich","doi":"10.1109/TCI.2025.3622928","DOIUrl":"https://doi.org/10.1109/TCI.2025.3622928","url":null,"abstract":"Hyperspectral imaging empowers machine vision systems with the distinct capability of identifying materials through recording their spectral signatures. Recent efforts in data-driven spectral reconstruction aim at extracting spectral information from RGB images captured by cost-effective RGB cameras, instead of dedicated hardware. Published work reports exceedingly high numerical scores for this reconstruction task, yet real-world performance lags substantially behind. In this paper we <italic>systematically analyze</i> the performance of such methods with three groups of dedicated experiments. First, we evaluate the practical overfitting limitations with respect to current datasets by training the networks with less data, validating the trained models with unseen yet slightly modified data, and cross-dataset validation. Second, we reveal <italic>fundamental limitations</i> in the ability of RGB to spectral methods to deal with metameric or near-metameric conditions, which have so far gone largely unnoticed due to the insufficiencies of existing datasets. We achieve this by validating the trained models with metamer data generated by metameric black theory and re-training the networks with various forms of metamers. This methodology can also be used for data augmentation as a partial mitigation of the dataset issues, <italic>although the RGB to spectral inverse problem remains fundamentally ill-posed</i>. Finally, we analyze the potential for modifying the problem setting to achieve better performance by exploiting some form of optical encoding provided by either incidental optical aberrations or some form of deliberate optical design. Our experiments show that such approaches do indeed provide improved results under certain circumstances, however their overall performance is limited by the same dataset issues as in the plain RGB to spectral scenario. We therefore conclude that <italic>future progress on snapshot spectral imaging will heavily depend on the generation of improved datasets which can then be used to design effective optical encoding strategies</i>.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"1505-1520"},"PeriodicalIF":4.8,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11206531","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-Quality Diffractive Lens Infrared Computational Imaging via Severe Degradation Estimation Framework 基于严重退化估计框架的高质量衍射透镜红外计算成像
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-10-09 DOI: 10.1109/TCI.2025.3619811
Pengzhou Ji;Xiong Dun;Yujie Xing;Xuquan Wang;Jian Zhang;Hongmei Li;Kan Zhao;Hongfei Jiao;Zhanshan Wang;Xinbin Cheng
Diffractive lens (DLs) computational imaging promises lightweight infrared imaging. However, it is still challenging to accurately obtain the degradation of DLs computational imaging systems to achieve high-quality imaging. Degradation estimation methods based on image-to-image are the most feasible way, but existing methods face the problem of low accuracy for severe degradation estimation. In this paper, we proposed a severe degradation estimation framework based on KernelNet, and realized the improvement of the accuracy of severe degenerate estimation by more than 1.13 dB. Furthermore, DyUnet, an image restoration network, was proposed, which is based on dynamic convolution. The results of the mid-wave diffractive lens infrared computational imaging system indicated that our method achieved high-quality imaging results comparable to the conventional one, while the number of optical elements was reduced from 7 to 2, and the weight was reduced by 50$%$.
衍射透镜(DLs)计算成像承诺轻量级红外成像。然而,为了实现高质量的成像,如何准确地获得dl计算成像系统的退化仍然是一个挑战。基于图像对图像的退化估计方法是最可行的方法,但现有方法面临严重退化估计精度低的问题。本文提出了一种基于KernelNet的严重退化估计框架,实现了严重退化估计精度提高1.13 dB以上。在此基础上,提出了基于动态卷积的图像恢复网络DyUnet。中波衍射透镜红外计算成像系统的实验结果表明,该方法在将光学元件数量从7个减少到2个,重量减少50%的情况下,获得了与传统方法相当的高质量成像结果。
{"title":"High-Quality Diffractive Lens Infrared Computational Imaging via Severe Degradation Estimation Framework","authors":"Pengzhou Ji;Xiong Dun;Yujie Xing;Xuquan Wang;Jian Zhang;Hongmei Li;Kan Zhao;Hongfei Jiao;Zhanshan Wang;Xinbin Cheng","doi":"10.1109/TCI.2025.3619811","DOIUrl":"https://doi.org/10.1109/TCI.2025.3619811","url":null,"abstract":"Diffractive lens (DLs) computational imaging promises lightweight infrared imaging. However, it is still challenging to accurately obtain the degradation of DLs computational imaging systems to achieve high-quality imaging. Degradation estimation methods based on image-to-image are the most feasible way, but existing methods face the problem of low accuracy for severe degradation estimation. In this paper, we proposed a severe degradation estimation framework based on KernelNet, and realized the improvement of the accuracy of severe degenerate estimation by more than 1.13 dB. Furthermore, DyUnet, an image restoration network, was proposed, which is based on dynamic convolution. The results of the mid-wave diffractive lens infrared computational imaging system indicated that our method achieved high-quality imaging results comparable to the conventional one, while the number of optical elements was reduced from 7 to 2, and the weight was reduced by 50<inline-formula><tex-math>$%$</tex-math></inline-formula>.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"1458-1468"},"PeriodicalIF":4.8,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11197645","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145405368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prior Image Constrained Total Variation-Stokes for Cerebral Perfusion CT Imaging 脑灌注CT成像的先验图像约束总变异-斯托克斯
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-10-09 DOI: 10.1109/TCI.2025.3619821
Shanzhou Niu;Shuo Li;Tinghua Wang;Weiwen Wu;You Zhang;Jing Wang;Jianhua Ma
Cerebral perfusion computed tomography (CPCT) can non-invasively and rapidly assess blood flow circulation in the brain, making it widely adopted in clinical settings. However, the dynamic scanning protocol associated with CPCT entails substantial ionizing radiation exposure, leading to elevated radiation risks. Lowering the tube current can efficiently reduce radiation dose, but leads to significant image quality deterioration for standard filtered back-projection (FBP) algorithm due to increased quantum noise in measured projection data. In this study, we present an iterative image reconstruction method to improve the low-dose CPCT image quality, which uses the prior image constrained total variation-stokes (PICTVS) based on the penalized weighted least squares (PWLS) criterion. This method leverages information from the prior image to enhance the image quality of low-dose CPCT. Specifically, PICTVS utilizes high-quality geometric structural information from the prior image and fuses it into low-dose CPCT image reconstruction while preserving the main features of the target image. An effective alternating minimization method was developed to solve the objective function associated with the PWLS-PICTVS reconstruction. The novelty of the PWLS-PICTVS algorithm is listed as follows: (1) The PICTVS regularization incorporates the structural information of prior image into the target image where the gradients of both images align; (2) In image areas where the gradients differ, the PICTVS regularization employs total variation (TV) instead; and (3) The PICTVS regularization facilitates the integration of shared edge structure information from a high-quality prior image into the low-dose image while avoiding introducing mismatched anatomy information. Qualitative and quantitative analyses were conducted to assess the efficacy of the PWLS-PICTVS image reconstruction algorithm using a digital brain perfusion phantom and simulated low-dose clinical patient data. The experimental results show that the PWLS-PICTVS algorithm significantly improves noise suppression, streak artifact reduction, and edge preservation when compared with the other reconstruction methods. Importantly, the CPCT images reconstructed using the PWLS-PICTVS method yield more accurate hemodynamic parameter maps, enhancing their potential for clinical diagnosis.
脑灌注计算机断层扫描(CPCT)可以无创、快速评估脑血流循环,在临床中得到广泛应用。然而,与CPCT相关的动态扫描方案需要大量的电离辐射暴露,导致辐射风险升高。降低管电流可以有效地降低辐射剂量,但由于测量投影数据中的量子噪声增加,导致标准滤波反投影(FBP)算法的图像质量明显下降。在本研究中,我们提出了一种基于惩罚加权最小二乘(PWLS)准则的先验图像约束总变差-stokes (PICTVS)迭代图像重建方法来提高低剂量CPCT图像质量。该方法利用先验图像的信息来提高低剂量CPCT的图像质量。具体来说,PICTVS利用来自先验图像的高质量几何结构信息,并将其融合到低剂量CPCT图像重建中,同时保留目标图像的主要特征。提出了一种有效的交替最小化方法来求解与PWLS-PICTVS重建相关的目标函数。PWLS-PICTVS算法的新颖之处在于:(1)PICTVS正则化将先验图像的结构信息整合到两幅图像梯度对齐的目标图像中;(2)在梯度不同的图像区域,PICTVS正则化采用总变分(TV)代替;(3) PICTVS正则化有助于将高质量先验图像的共享边缘结构信息整合到低剂量图像中,同时避免引入不匹配的解剖信息。采用数字脑灌注幻象和模拟低剂量临床患者数据,对PWLS-PICTVS图像重建算法的有效性进行定性和定量分析。实验结果表明,与其他重建方法相比,PWLS-PICTVS算法在噪声抑制、条纹伪影抑制和边缘保持方面有显著提高。重要的是,使用PWLS-PICTVS方法重建的CPCT图像产生更准确的血流动力学参数图,增强了其临床诊断的潜力。
{"title":"Prior Image Constrained Total Variation-Stokes for Cerebral Perfusion CT Imaging","authors":"Shanzhou Niu;Shuo Li;Tinghua Wang;Weiwen Wu;You Zhang;Jing Wang;Jianhua Ma","doi":"10.1109/TCI.2025.3619821","DOIUrl":"https://doi.org/10.1109/TCI.2025.3619821","url":null,"abstract":"Cerebral perfusion computed tomography (CPCT) can non-invasively and rapidly assess blood flow circulation in the brain, making it widely adopted in clinical settings. However, the dynamic scanning protocol associated with CPCT entails substantial ionizing radiation exposure, leading to elevated radiation risks. Lowering the tube current can efficiently reduce radiation dose, but leads to significant image quality deterioration for standard filtered back-projection (FBP) algorithm due to increased quantum noise in measured projection data. In this study, we present an iterative image reconstruction method to improve the low-dose CPCT image quality, which uses the prior image constrained total variation-stokes (PICTVS) based on the penalized weighted least squares (PWLS) criterion. This method leverages information from the prior image to enhance the image quality of low-dose CPCT. Specifically, PICTVS utilizes high-quality geometric structural information from the prior image and fuses it into low-dose CPCT image reconstruction while preserving the main features of the target image. An effective alternating minimization method was developed to solve the objective function associated with the PWLS-PICTVS reconstruction. The novelty of the PWLS-PICTVS algorithm is listed as follows: (1) The PICTVS regularization incorporates the structural information of prior image into the target image where the gradients of both images align; (2) In image areas where the gradients differ, the PICTVS regularization employs total variation (TV) instead; and (3) The PICTVS regularization facilitates the integration of shared edge structure information from a high-quality prior image into the low-dose image while avoiding introducing mismatched anatomy information. Qualitative and quantitative analyses were conducted to assess the efficacy of the PWLS-PICTVS image reconstruction algorithm using a digital brain perfusion phantom and simulated low-dose clinical patient data. The experimental results show that the PWLS-PICTVS algorithm significantly improves noise suppression, streak artifact reduction, and edge preservation when compared with the other reconstruction methods. Importantly, the CPCT images reconstructed using the PWLS-PICTVS method yield more accurate hemodynamic parameter maps, enhancing their potential for clinical diagnosis.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"1447-1457"},"PeriodicalIF":4.8,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145351909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DROLL: Dual-Domain Reconstruction Network With a High-Fidelity Domain-Transform Operator Based on Learned Low-Rank Prior for Sparse-View CT Reconstruction 基于学习低秩先验的高保真域变换算子双域重建网络用于稀疏视图CT重建
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-10-06 DOI: 10.1109/TCI.2025.3617255
Haowen Zhang;Pengcheng Zhang;Yikun Zhang;Yang Chen;Yi Liu;Zhiguo Gui
Sparse-view CT reconstruction represents a prototypical ill-posed inverse problem. The implementation of deep learning solutions has proven to be highly successful in this field. The dual-domain reconstruction network achieves a favorable trade-off between reconstruction performance and computational cost by leveraging the powerful mapping capability of deep learning and the domain-transform relying on analytical reconstruction algorithms. However, further research is required to enhance the domain-transform in this field. Inspired by the successful utilization of low-rank prior in various medical imaging tasks, we proposed an end-to-end one-shot dual-domain network for sparse-view CT reconstruction. The domain-transform was designed as a high-fidelity multi-channel parallel back-projection in proposed network. In this way, feature maps between channels in the image domain imply strong low-rank priors. We implemented the singular value thresholding algorithm as a network layer, learning parameters and thresholds in a data-driven manner, fully leveraging the low-rank prior across channels to greatly reduce information loss and distortion during domain-transform. Moreover, we constructed a projection completion network based on dual attention mechanism that synthesizes missing view projections by effectively utilizing potential local correlation among projection domains during fan-beam scanning. In the image domain, a refine subnetwork based on Vision Transformer utilizes mix-scale features to implement two-dimensional filtering belonging to the back-projection filter algorithm. Extensive experiments on two clinically relevant datasets have demonstrated that the proposed network achieves competing performance on both quantitative metrics and visual quality.
稀疏视图CT重构是一个典型的病态逆问题。深度学习解决方案的实施在这个领域已经被证明是非常成功的。双域重构网络利用深度学习强大的映射能力和基于解析重构算法的域变换,在重构性能和计算成本之间取得了较好的平衡。但是,该领域的领域变换还需要进一步的研究。受低秩先验在各种医学成像任务中成功应用的启发,我们提出了一种端到端单次双域网络用于稀疏视图CT重建。在该网络中,域变换被设计为高保真的多通道并行反投影。这样,图像域中通道之间的特征映射意味着强低秩先验。我们将奇异值阈值算法作为网络层实现,以数据驱动的方式学习参数和阈值,充分利用跨通道的低秩先验,大大减少了域变换过程中的信息丢失和失真。此外,我们构建了一个基于双注意机制的投影补全网络,该网络通过有效利用扇形波束扫描过程中投影域之间潜在的局部相关性来合成缺失的视图投影。在图像域,基于Vision Transformer的细化子网利用混合尺度特征实现属于反投影滤波算法的二维滤波。在两个临床相关数据集上进行的大量实验表明,所提出的网络在定量指标和视觉质量上都达到了竞争性能。
{"title":"DROLL: Dual-Domain Reconstruction Network With a High-Fidelity Domain-Transform Operator Based on Learned Low-Rank Prior for Sparse-View CT Reconstruction","authors":"Haowen Zhang;Pengcheng Zhang;Yikun Zhang;Yang Chen;Yi Liu;Zhiguo Gui","doi":"10.1109/TCI.2025.3617255","DOIUrl":"https://doi.org/10.1109/TCI.2025.3617255","url":null,"abstract":"Sparse-view CT reconstruction represents a prototypical ill-posed inverse problem. The implementation of deep learning solutions has proven to be highly successful in this field. The dual-domain reconstruction network achieves a favorable trade-off between reconstruction performance and computational cost by leveraging the powerful mapping capability of deep learning and the domain-transform relying on analytical reconstruction algorithms. However, further research is required to enhance the domain-transform in this field. Inspired by the successful utilization of low-rank prior in various medical imaging tasks, we proposed an end-to-end one-shot dual-domain network for sparse-view CT reconstruction. The domain-transform was designed as a high-fidelity multi-channel parallel back-projection in proposed network. In this way, feature maps between channels in the image domain imply strong low-rank priors. We implemented the singular value thresholding algorithm as a network layer, learning parameters and thresholds in a data-driven manner, fully leveraging the low-rank prior across channels to greatly reduce information loss and distortion during domain-transform. Moreover, we constructed a projection completion network based on dual attention mechanism that synthesizes missing view projections by effectively utilizing potential local correlation among projection domains during fan-beam scanning. In the image domain, a refine subnetwork based on Vision Transformer utilizes mix-scale features to implement two-dimensional filtering belonging to the back-projection filter algorithm. Extensive experiments on two clinically relevant datasets have demonstrated that the proposed network achieves competing performance on both quantitative metrics and visual quality.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"1375-1390"},"PeriodicalIF":4.8,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145315292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MISD: Model-Informed Stable Diffusion Model for Limited Noisy Data CT Reconstruction 基于模型的有限噪声数据CT重建稳定扩散模型
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-10-02 DOI: 10.1109/TCI.2025.3617237
Qi Wang;Yufang Cai;Haijun Yu;Fenlin Liu;Weiwen Wu
Limited data computed tomography (LDCT) plays a critical role in accelerating the scanning process and reducing radiation exposure for patients. However, LDCT reconstruction is inherently an ill-posed inverse problem, often resulting in pronounced edge artifacts and the loss of fine structural details. In recent years, score-based generative models (SGMs) have shown great promise in LDCT reconstruction by alleviating the ill-posedness and enabling high-fidelity image recovery in the case of noise-free condition. However, in practical CT systems, measurement data is often contaminated by noise. The coexistence of noise and limited data presents significant challenges for SGM-based image reconstruction methods. To address this challenge, this study proposes a Model-Informed Stable Diffusion (MISD) model which integrates a sampling process with a generative prior in the image-space module and a physics prior in the projection-space module. The projection-space module incorporates physical information to establish a noise suppression mechanism, effectively reducing the impact of noise. At the same time, the image-space module uses a generative model to progressively reconstruct clear structures and features from an initial state characterized by pure noise. Together, these two modules form a cohesive mathematical framework, utilizing iterative optimization to gradually minimize the effects of noise and artifacts. Experimental results show that the MISD method consistently achieves higher quantitative metrics and recovers finer structural details than other state-of-the-art reconstruction techniques, both on simulated and real datasets.
有限数据计算机断层扫描(LDCT)在加速扫描过程和减少患者的辐射暴露方面起着至关重要的作用。然而,LDCT重建本质上是一个病态逆问题,经常导致明显的边缘伪影和精细结构细节的丢失。近年来,基于分数的生成模型(SGMs)在LDCT重建中显示出巨大的前景,它可以减轻图像的病态性,并在无噪声的情况下实现高保真图像恢复。然而,在实际的CT系统中,测量数据经常受到噪声的污染。噪声和有限数据的共存对基于sgm的图像重建方法提出了重大挑战。为了应对这一挑战,本研究提出了一种模型通知稳定扩散(MISD)模型,该模型将采样过程与图像空间模块中的生成先验和投影空间模块中的物理先验集成在一起。投影空间模块结合物理信息,建立噪声抑制机制,有效降低噪声的影响。同时,图像空间模块使用生成模型从纯噪声的初始状态逐步重建清晰的结构和特征。总之,这两个模块形成了一个有凝聚力的数学框架,利用迭代优化来逐渐减少噪声和伪影的影响。实验结果表明,与其他最先进的重建技术相比,MISD方法在模拟和真实数据集上都能获得更高的定量指标,并恢复更精细的结构细节。
{"title":"MISD: Model-Informed Stable Diffusion Model for Limited Noisy Data CT Reconstruction","authors":"Qi Wang;Yufang Cai;Haijun Yu;Fenlin Liu;Weiwen Wu","doi":"10.1109/TCI.2025.3617237","DOIUrl":"https://doi.org/10.1109/TCI.2025.3617237","url":null,"abstract":"Limited data computed tomography (LDCT) plays a critical role in accelerating the scanning process and reducing radiation exposure for patients. However, LDCT reconstruction is inherently an ill-posed inverse problem, often resulting in pronounced edge artifacts and the loss of fine structural details. In recent years, score-based generative models (SGMs) have shown great promise in LDCT reconstruction by alleviating the ill-posedness and enabling high-fidelity image recovery in the case of noise-free condition. However, in practical CT systems, measurement data is often contaminated by noise. The coexistence of noise and limited data presents significant challenges for SGM-based image reconstruction methods. To address this challenge, this study proposes a Model-Informed Stable Diffusion (MISD) model which integrates a sampling process with a generative prior in the image-space module and a physics prior in the projection-space module. The projection-space module incorporates physical information to establish a noise suppression mechanism, effectively reducing the impact of noise. At the same time, the image-space module uses a generative model to progressively reconstruct clear structures and features from an initial state characterized by pure noise. Together, these two modules form a cohesive mathematical framework, utilizing iterative optimization to gradually minimize the effects of noise and artifacts. Experimental results show that the MISD method consistently achieves higher quantitative metrics and recovers finer structural details than other state-of-the-art reconstruction techniques, both on simulated and real datasets.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"1434-1446"},"PeriodicalIF":4.8,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145352017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Marginal Importance of Known Distortions and Alignment in CASSI Systems CASSI系统中已知畸变和对准的边际重要性
IF 4.8 2区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-10-02 DOI: 10.1109/TCI.2025.3617235
Léo Paillet;Antoine Rouxel;Hervé Carfantan;Simon Lacroix;Antoine Monmayrant
Coded Aperture Snapshot Spectral Imagers (CASSI) are designed to acquire 2D coded acquisitions of hyperspectral scenes (HSS). Associated with reconstruction algorithms, they allow to analyze the HSS using a small amount of data. Various configurations of such imagers exist, with different distortions. In this article we show that reconstruction quality is relatively insensitive to these distortions, if known and correctly modeled. To this end, we introduce a differentiable ray-tracing-based model that incorporates aberrations and distortions to render coded hyperspectral acquisitions using CASSI. Such simulated acquisitions are used to train state-of-the-art hyperspectral cube reconstruction algorithms. We also adapted these algorithms through the use of a ray-tracing-based mapping function which accounts for aberrations and distortions. We evaluated four comparable CASSI systems with varying degree of optical aberrations and misalignments, using five state-of-the-art hyperspectral cube reconstruction algorithms. Our analyses show that if known and properly modeled, the effects of geometric distortions of the system and misalignment of the dispersive elements have a marginal impact on the overall reconstruction quality. Therefore, relaxing traditional constraints on measurement conformity and fidelity to the scene enables the development of novel imaging instruments, guided by performance metrics applied to the design or the processing of acquisitions. By providing a complete framework for design, simulation and evaluation, this work contributes to the optimization and exploration of new CASSI systems and more generally to the computational imaging community.
编码孔径快照光谱成像仪(CASSI)用于获取高光谱场景(HSS)的二维编码图像。与重建算法相结合,它们允许使用少量数据分析HSS。这种成像仪有不同的配置,具有不同的畸变。在本文中,我们表明,如果已知和正确建模,重建质量对这些扭曲相对不敏感。为此,我们引入了一种基于可微光线跟踪的模型,该模型结合了像差和畸变,使用CASSI渲染编码高光谱采集。这种模拟采集被用来训练最先进的高光谱立方体重建算法。我们还通过使用基于光线跟踪的映射功能来适应这些算法,该功能可以解释像差和扭曲。我们使用五种最先进的高光谱立方体重建算法,评估了四个具有不同程度光学像差和失调的可比较CASSI系统。我们的分析表明,如果已知并正确建模,系统的几何畸变和色散元素的不对准对整体重建质量的影响很小。因此,放松对测量一致性和对场景保真度的传统限制,可以开发新型成像仪器,并以应用于设计或采集处理的性能指标为指导。通过为设计、模拟和评估提供一个完整的框架,这项工作有助于优化和探索新的CASSI系统,并更广泛地为计算成像社区做出贡献。
{"title":"The Marginal Importance of Known Distortions and Alignment in CASSI Systems","authors":"Léo Paillet;Antoine Rouxel;Hervé Carfantan;Simon Lacroix;Antoine Monmayrant","doi":"10.1109/TCI.2025.3617235","DOIUrl":"https://doi.org/10.1109/TCI.2025.3617235","url":null,"abstract":"Coded Aperture Snapshot Spectral Imagers (CASSI) are designed to acquire 2D coded acquisitions of hyperspectral scenes (HSS). Associated with reconstruction algorithms, they allow to analyze the HSS using a small amount of data. Various configurations of such imagers exist, with different distortions. In this article we show that reconstruction quality is relatively insensitive to these distortions, if known and correctly modeled. To this end, we introduce a differentiable ray-tracing-based model that incorporates aberrations and distortions to render coded hyperspectral acquisitions using CASSI. Such simulated acquisitions are used to train state-of-the-art hyperspectral cube reconstruction algorithms. We also adapted these algorithms through the use of a ray-tracing-based mapping function which accounts for aberrations and distortions. We evaluated four comparable CASSI systems with varying degree of optical aberrations and misalignments, using five state-of-the-art hyperspectral cube reconstruction algorithms. Our analyses show that if known and properly modeled, the effects of geometric distortions of the system and misalignment of the dispersive elements have a marginal impact on the overall reconstruction quality. Therefore, relaxing traditional constraints on measurement conformity and fidelity to the scene enables the development of novel imaging instruments, guided by performance metrics applied to the design or the processing of acquisitions. By providing a complete framework for design, simulation and evaluation, this work contributes to the optimization and exploration of new CASSI systems and more generally to the computational imaging community.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"1391-1403"},"PeriodicalIF":4.8,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145315293","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Computational Imaging
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1