
Latest Publications in IEEE Transactions on Medical Imaging

Attention-Based Shape-Deformation Networks for Artifact-Free Geometry Reconstruction of Lumbar Spine From MR Images
Pub Date : 2025-07-15 DOI: 10.1109/TMI.2025.3588831
Linchen Qian;Jiasong Chen;Linhai Ma;Timur Urakov;Weiyong Gu;Liang Liang
Lumbar disc degeneration, a progressive structural wear and tear of the lumbar intervertebral disc, is regarded as a major contributor to low back pain, a significant global health concern. Automated lumbar spine geometry reconstruction from MR images would enable fast measurement of medical parameters to evaluate lumbar status and determine a suitable treatment. Existing image segmentation-based techniques often generate erroneous segments or unstructured point clouds that are unsuitable for medical parameter measurement. In this work, we present UNet-DeformSA and TransDeformer: novel attention-based deep neural networks that reconstruct the geometry of the lumbar spine with high spatial accuracy and mesh correspondence across patients, and we also present a variant of TransDeformer for error estimation. Specifically, we devise new attention modules with a new attention formula, which integrate tokenized image features and tokenized shape features to predict the displacements of the points on a shape template. The deformed template reveals the lumbar spine geometry in an image. Experiment results show that our networks generate artifact-free geometry outputs, and the variant of TransDeformer can predict the errors of a reconstructed geometry. Our code is available at https://github.com/linchenq/TransDeformer-Mesh.
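As a rough illustration of the attention-driven template deformation described above, the sketch below runs a single cross-attention step in which template-point tokens query tokenized image features and a linear head predicts a per-point displacement. The `deform_template` helper, all weight matrices, and all dimensions are hypothetical and do not reproduce the paper's actual attention formula.

```python
# Minimal sketch: shape (template point) tokens attend to image tokens,
# and the fused context is mapped to 2-D displacements per template point.
import torch
import torch.nn.functional as F

def deform_template(shape_tokens, image_tokens, w_q, w_k, w_v, w_out):
    """shape_tokens: (P, D), one token per template point;
    image_tokens: (T, D), tokenized image features."""
    q = shape_tokens @ w_q                                   # queries from the template
    k = image_tokens @ w_k                                   # keys from the image
    v = image_tokens @ w_v                                   # values from the image
    attn = F.softmax(q @ k.T / q.shape[-1] ** 0.5, dim=-1)   # (P, T) attention map
    fused = attn @ v                                         # (P, D) image context per point
    return fused @ w_out                                     # (P, 2) predicted displacements

P, T, D = 128, 256, 64
disp = deform_template(torch.randn(P, D), torch.randn(T, D),
                       torch.randn(D, D), torch.randn(D, D),
                       torch.randn(D, D), torch.randn(D, 2))
template = torch.randn(P, 2)
deformed = template + disp   # the deformed template reveals the geometry
```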
{"title":"Attention-Based Shape-Deformation Networks for Artifact-Free Geometry Reconstruction of Lumbar Spine From MR Images","authors":"Linchen Qian;Jiasong Chen;Linhai Ma;Timur Urakov;Weiyong Gu;Liang Liang","doi":"10.1109/TMI.2025.3588831","DOIUrl":"10.1109/TMI.2025.3588831","url":null,"abstract":"Lumbar disc degeneration, a progressive structural wear and tear of lumbar intervertebral disc, is regarded as an essential role on low back pain, a significant global health concern. Automated lumbar spine geometry reconstruction from MR images will enable fast measurement of medical parameters to evaluate the lumbar status, in order to determine a suitable treatment. Existing image segmentation-based techniques often generate erroneous segments or unstructured point clouds, unsuitable for medical parameter measurement. In this work, we present UNet-DeformSA and TransDeformer: novel attention-based deep neural networks that reconstruct the geometry of the lumbar spine with high spatial accuracy and mesh correspondence across patients, and we also present a variant of TransDeformer for error estimation. Specially, we devise new attention modules with a new attention formula, which integrate tokenized image features and tokenized shape features to predict the displacements of the points on a shape template. The deformed template reveals the lumbar spine geometry in an image. Experiment results show that our networks generate artifact-free geometry outputs, and the variant of TransDeformer can predict the errors of a reconstructed geometry. Our code is available at <uri>https://github.com/linchenq/TransDeformer-Mesh</uri>.","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 12","pages":"5258-5277"},"PeriodicalIF":0.0,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144639750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Bayesian Posterior Distribution Estimation of Kinetic Parameters in Dynamic Brain PET Using Generative Deep Learning Models
Pub Date : 2025-07-15 DOI: 10.1109/TMI.2025.3588859
Yanis Djebra;Xiaofeng Liu;Thibault Marin;Amal Tiss;Maeva Dhaynaut;Nicolas Guehl;Keith Johnson;Georges El Fakhri;Chao Ma;Jinsong Ouyang
Positron Emission Tomography (PET) is a valuable imaging method for studying molecular-level processes in the body, such as hyperphosphorylated tau (p-tau) protein aggregates, a hallmark of several neurodegenerative diseases including Alzheimer’s disease. P-tau density and cerebral perfusion can be quantified from dynamic PET images using tracer kinetic modeling techniques. However, noise in PET images leads to uncertainty in the estimated kinetic parameters, which can be quantified by estimating the posterior distribution of kinetic parameters using Bayesian inference (BI). Markov Chain Monte Carlo (MCMC) techniques are commonly used for posterior estimation but carry significant computational cost. This work proposes an Improved Denoising Diffusion Probabilistic Model (iDDPM)-based method to estimate the posterior distribution of kinetic parameters in dynamic PET, leveraging the high computational efficiency of deep learning. The performance of the proposed method was evaluated on a [18F]MK6240 study and compared to a Conditional Variational Autoencoder with dual decoder (CVAE-DD)-based method and a Wasserstein GAN with gradient penalty (WGAN-GP)-based method. Posterior distributions inferred from Metropolis-Hastings MCMC were used as reference. Our approach consistently outperformed the CVAE-DD and WGAN-GP methods and reduced computation time significantly relative to the MCMC method (over 230 times faster), inferring accurate (<0.67% mean error) and precise (<7.23% standard deviation error) posterior distributions.
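For context on the Metropolis-Hastings reference mentioned above, here is a minimal random-walk sampler for a toy mono-exponential kinetic model y(t) = K1·exp(-k2·t) with Gaussian noise. The model, prior, noise level, and step size are illustrative assumptions, not the paper's [18F]MK6240 setup; the diffusion-based method is trained to approximate posteriors like this at far lower cost.

```python
# Toy Metropolis-Hastings posterior sampler for kinetic parameters (K1, k2).
import numpy as np

rng = np.random.default_rng(0)
t = np.linspace(0.1, 10, 40)
true_k = np.array([1.5, 0.3])                           # ground-truth (K1, k2)
y = true_k[0] * np.exp(-true_k[1] * t) + rng.normal(0, 0.05, t.size)

def log_post(theta):
    if np.any(theta <= 0):                              # positivity prior
        return -np.inf
    model = theta[0] * np.exp(-theta[1] * t)
    return -0.5 * np.sum((y - model) ** 2) / 0.05 ** 2  # Gaussian likelihood

theta, samples = np.array([1.0, 1.0]), []
lp = log_post(theta)
for _ in range(20000):
    prop = theta + rng.normal(0, 0.02, 2)               # random-walk proposal
    lp_prop = log_post(prop)
    if np.log(rng.uniform()) < lp_prop - lp:            # accept/reject step
        theta, lp = prop, lp_prop
    samples.append(theta)
post = np.array(samples[5000:])                         # discard burn-in
print(post.mean(axis=0), post.std(axis=0))              # posterior mean / spread
```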
{"title":"Bayesian Posterior Distribution Estimation of Kinetic Parameters in Dynamic Brain PET Using Generative Deep Learning Models","authors":"Yanis Djebra;Xiaofeng Liu;Thibault Marin;Amal Tiss;Maeva Dhaynaut;Nicolas Guehl;Keith Johnson;Georges El Fakhri;Chao Ma;Jinsong Ouyang","doi":"10.1109/TMI.2025.3588859","DOIUrl":"10.1109/TMI.2025.3588859","url":null,"abstract":"Positron Emission Tomography (PET) is a valuable imaging method for studying molecular-level processes in the body, such as hyperphosphorylated tau (p-tau) protein aggregates, a hallmark of several neurodegenerative diseases including Alzheimer’s disease. P-tau density and cerebral perfusion can be quantified from dynamic PET images using tracer kinetic modeling techniques. However, noise in PET images leads to uncertainty in the estimated kinetic parameters, which can be quantified by estimating the posterior distribution of kinetic parameters using Bayesian inference (BI). Markov Chain Monte Carlo (MCMC) techniques are commonly used for posterior estimation but with significant computational needs. This work proposes an Improved Denoising Diffusion Probabilistic Model (iDDPM)-based method to estimate the posterior distribution of kinetic parameters in dynamic PET, leveraging the high computational efficiency of deep learning. The performance of the proposed method was evaluated on a [18F]MK6240 study and compared to a Conditional Variational Autoencoder with dual decoder (CVAE-DD)-based method and a Wasserstein GAN with gradient penalty (WGAN-GP)-based method. Posterior distributions inferred from Metropolis-Hasting MCMC were used as reference. Our approach consistently outperformed the CVAE-DD and WGAN-GP methods and offered significant reduction in computation time than the MCMC method (over 230 times faster), inferring accurate (<inline-formula> <tex-math>$lt {0}.{67},%$ </tex-math></inline-formula> mean error) and precise (<inline-formula> <tex-math>$lt {7}.{23},%$ </tex-math></inline-formula> standard deviation error) posterior distributions.","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 12","pages":"5089-5102"},"PeriodicalIF":0.0,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144639748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Region Uncertainty Estimation for Medical Image Segmentation With Noisy Labels
Pub Date : 2025-07-14 DOI: 10.1109/TMI.2025.3589058
Kai Han;Shuhui Wang;Jun Chen;Chengxuan Qian;Chongwen Lyu;Siqi Ma;Chengjian Qiu;Victor S. Sheng;Qingming Huang;Zhe Liu
The success of deep learning in 3D medical image segmentation hinges on training with a large dataset of fully annotated 3D volumes, which are difficult and time-consuming to acquire. Although recent foundation models (e.g., the Segment Anything Model, SAM) can utilize sparse annotations to reduce annotation costs, segmentation tasks involving organs and tissues with blurred boundaries remain challenging. To address this issue, we propose a region uncertainty estimation framework for Computed Tomography (CT) image segmentation using noisy labels. Specifically, we propose a sample-stratified training strategy that stratifies samples according to their varying label quality, prioritizing confident and fine-grained information at each training stage. This sample-to-voxel level processing enables more reliable supervision information to propagate to noisy-label data, thus effectively mitigating the impact of noisy annotations. Moreover, we design a boundary-guided regional uncertainty estimation module that adapts sample hierarchical training to assist in evaluating sample confidence. Experiments conducted across multiple CT datasets demonstrate the superiority of our proposed method over several competitive approaches under various noise conditions. Our proposed reliable label propagation strategy not only significantly reduces the cost of medical image annotation and robust model training but also improves segmentation performance in scenarios with imperfect annotations, thus paving the way towards the application of medical segmentation foundation models in low-resource and remote scenarios. Code will be available at https://github.com/KHan-UJS/NoisyLabel.
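A hedged sketch of the sample-stratification idea: rank samples by a per-sample loss from a warm-up model as a proxy for label quality, then train stage by stage, starting from the cleanest-looking tier. The number of tiers, the quality proxy, and the schedule are assumptions for illustration, not the paper's exact rule.

```python
# Stratify samples by a label-quality proxy and grow the training pool per stage.
import numpy as np

def stratify(per_sample_loss, n_tiers=3):
    """Split sample indices into tiers: low loss ~ likely clean labels."""
    order = np.argsort(per_sample_loss)
    return np.array_split(order, n_tiers)

losses = np.random.rand(100)          # stand-in for per-sample warm-up losses
tiers = stratify(losses)
for stage, _ in enumerate(tiers):
    # Stage 0 trains only on the most confident tier; later stages add noisier
    # tiers, letting reliable supervision propagate toward noisy-label samples.
    active = np.concatenate(tiers[:stage + 1])
    print(f"stage {stage}: training on {active.size} samples")
```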
{"title":"Region Uncertainty Estimation for Medical Image Segmentation With Noisy Labels","authors":"Kai Han;Shuhui Wang;Jun Chen;Chengxuan Qian;Chongwen Lyu;Siqi Ma;Chengjian Qiu;Victor S. Sheng;Qingming Huang;Zhe Liu","doi":"10.1109/TMI.2025.3589058","DOIUrl":"10.1109/TMI.2025.3589058","url":null,"abstract":"The success of deep learning in 3D medical image segmentation hinges on training with a large dataset of fully annotated 3D volumes, which are difficult and time-consuming to acquire. Although recent foundation models (e.g., segment anything model, SAM) can utilize sparse annotations to reduce annotation costs, segmentation tasks involving organs and tissues with blurred boundaries remain challenging. To address this issue, we propose a region uncertainty estimation framework for Computed Tomography (CT) image segmentation using noisy labels. Specifically, we propose a sample-stratified training strategy that stratifies samples according to their varying quality labels, prioritizing confident and fine-grained information at each training stage. This sample-to-voxel level processing enables more reliable supervision information to propagate to noisy label data, thus effectively mitigating the impact of noisy annotations. Moreover, we further design a boundary-guided regional uncertainty estimation module that adapts sample hierarchical training to assist in evaluating sample confidence. Experiments conducted across multiple CT datasets demonstrate the superiority of our proposed method over several competitive approaches under various noise conditions. Our proposed reliable label propagation strategy not only significantly reduces the cost of medical image annotation and robust model training but also improves the segmentation performance in scenarios with imperfect annotations, thus paving the way towards the application of medical segmentation foundation models under low-resource and remote scenarios. Code will be available at <uri>https://github.com/KHan-UJS/NoisyLabel</uri>","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 12","pages":"5197-5207"},"PeriodicalIF":0.0,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144629831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
MDPNet: Multiscale Dynamic Polyp-Focus Network for Enhancing Medical Image Polyp Segmentation
Pub Date : 2025-07-14 DOI: 10.1109/TMI.2025.3588503
Alpha Alimamy Kamara;Shiwen He;Abdul Joseph Fofanah;Rong Xu;Yuehan Chen
Colorectal cancer (CRC) is the most common malignant neoplasm in the digestive system and a primary cause of cancer-related mortality in the United States, exceeded only by lung and prostate cancers. The American Cancer Society estimates that in 2024, there will be approximately 152,810 new cases of colorectal cancer and 53,010 deaths in the United States, highlighting the critical need for early diagnosis and prevention. Precise polyp segmentation is crucial for early detection, as it improves treatability and survival rates. However, existing methods, such as the UNet architecture, struggle to capture long-range dependencies, to manage the variability in polyp shapes and sizes, and to handle the low contrast between polyps and the surrounding background. We propose a multiscale dynamic polyp-focus network (MDPNet) to solve these problems. It has three modules: dynamic polyp-focus (DPfocus), non-local multiscale attention pooling (NMAP), and learnable multiscale attention pooling (LMAP). DPfocus captures global pixel-to-polyp dependencies, preserving high-level semantics and emphasizing polyp-specific regions. NMAP stabilizes the model under varying polyp shapes, sizes, and contrasts by dynamically aggregating multiscale features with minimal data loss. LMAP enhances spatial representation by learning multiscale attention across different regions. This enables MDPNet to understand long-range dependencies and combine information from different levels of context, boosting the segmentation accuracy. Extensive experiments on four publicly available datasets demonstrate that MDPNet is effective and outperforms current state-of-the-art segmentation methods by 2–5% in overall accuracy across all datasets. This demonstrates that our method improves polyp segmentation accuracy, aiding early detection and treatment of colorectal cancer.
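To make the multiscale attention pooling idea concrete, here is a generic stand-in: pool a feature map at several scales, score each scale with a learned 1×1 convolution, and fuse the scales with softmax attention. The module name, scales, and fusion rule are assumptions; this is not the paper's NMAP or LMAP design.

```python
# Generic multiscale attention pooling: learned attention over pooled scales.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiscaleAttentionPool(nn.Module):
    def __init__(self, channels, scales=(1, 2, 4)):
        super().__init__()
        self.scales = scales
        self.score = nn.Conv2d(channels, 1, kernel_size=1)  # per-scale attention logit

    def forward(self, x):                       # x: (B, C, H, W)
        h, w = x.shape[-2:]
        feats, scores = [], []
        for s in self.scales:
            p = F.avg_pool2d(x, s)              # pool at scale s
            p = F.interpolate(p, size=(h, w), mode="bilinear", align_corners=False)
            feats.append(p)
            scores.append(self.score(p))        # (B, 1, H, W)
        attn = torch.softmax(torch.stack(scores, 0), dim=0)   # weights across scales
        return (torch.stack(feats, 0) * attn).sum(0)          # attention-weighted fusion

out = MultiscaleAttentionPool(32)(torch.randn(2, 32, 64, 64))
print(out.shape)                                # torch.Size([2, 32, 64, 64])
```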
{"title":"MDPNet: Multiscale Dynamic Polyp-Focus Network for Enhancing Medical Image Polyp Segmentation","authors":"Alpha Alimamy Kamara;Shiwen He;Abdul Joseph Fofanah;Rong Xu;Yuehan Chen","doi":"10.1109/TMI.2025.3588503","DOIUrl":"10.1109/TMI.2025.3588503","url":null,"abstract":"Colorectal cancer (CRC) is the most common malignant neoplasm in the digestive system and a primary cause of cancer-related mortality in the United States, exceeded only by lung and prostate cancers. The American Cancer Society estimates that in 2024, there will be approximately 152,810 new cases of colorectal cancer and 53,010 deaths in the United States, highlighting the critical need for early diagnosis and prevention. Precise polyp segmentation is crucial for early detection, as it improves treatability and survival rates. However, existing methods, such as the UNet architecture, struggle to capture long-range dependencies and manage the variability in polyp shapes and sizes, and the low contrast between polyps and the surrounding background. We propose a multiscale dynamic polyp-focus network (MDPNet) to solve these problems. It has three modules: dynamic polyp-focus (DPfocus), non-local multiscale attention pooling (NMAP), and learnable multiscale attention pooling (LMAP). DPfocus captures global pixel-to-polyp dependencies, preserving high-level semantics and emphasizing polyp-specific regions. NMAP stabilizes the model under varying polyp shapes, sizes, and contrasts by dynamically aggregating multiscale features with minimal data loss. LMAP enhances spatial representation by learning multiscale attention across different regions. This enables MDPNet to understand long-range dependencies and combine information from different levels of context, boosting the segmentation accuracy. Extensive experiments on four publicly available datasets demonstrate that MDPNet is effective and outperforms current state-of-the-art segmentation methods by 2–5% in overall accuracy across all datasets. This demonstrates that our method improves polyp segmentation accuracy, aiding early detection and treatment of colorectal cancer.","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 12","pages":"5208-5220"},"PeriodicalIF":0.0,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144630145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Clinical Stage Prompt Induced Multi-Modal Prognosis
Pub Date : 2025-07-14 DOI: 10.1109/TMI.2025.3588836
Ting Jin;Xingran Xie;Qingli Li;Xinxing Li;Yan Wang
Histology analysis of the tumor micro-environment integrated with genomic assays is widely regarded as the cornerstone for cancer analysis and survival prediction. This paper jointly incorporates genomics and Whole Slide Images (WSIs), and focuses on addressing the primary challenges involved in multi-modality prognosis analysis: 1) high-order relevance is difficult to model from dimensionally imbalanced gigapixel WSIs and tens of thousands of genetic sequences, and 2) the lack of medical expertise and clinical knowledge hampers the effectiveness of prognosis-oriented multi-modal fusion. Due to the nature of the prognosis task, statistical priors and clinical knowledge are essential for estimating the likelihood of survival over time, yet they have been under-studied. To this end, we propose a prognosis-oriented image-omics fusion framework, dubbed Clinical Stage Prompt induced Multimodal Prognosis (CiMP). Concretely, we leverage the capabilities of an advanced LLM to generate descriptions derived from structured clinical records and use the generated clinical staging prompts to deliberately query critical prognosis-related information from each modality. In addition, we propose a Group Multi-Head Self-Attention module to capture structured group-specific features within cohorts of genomic data. Experimental results on five TCGA datasets show the superiority of our proposed method, achieving state-of-the-art performance compared to previous multi-modal prognostic models. Furthermore, the clinical interpretability and discussion also highlight the immense potential for further medical applications. Our code will be released at https://github.com/DeepMed-Lab-ECNU/CiMP/.
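A minimal sketch of group-wise self-attention over genomic features: genes are bucketed into groups, each group is embedded as a token, and multi-head self-attention runs across the group tokens. The grouping scheme, dimensions, and class name are assumptions, not the paper's exact module.

```python
# Group tokens from bucketed genomic features, then self-attention across groups.
import torch
import torch.nn as nn

class GroupSelfAttention(nn.Module):
    def __init__(self, n_groups, group_dim, embed_dim=64, heads=4):
        super().__init__()
        self.embed = nn.Linear(group_dim, embed_dim)   # one token per gene group
        self.attn = nn.MultiheadAttention(embed_dim, heads, batch_first=True)

    def forward(self, grouped_genes):                  # (B, n_groups, group_dim)
        tokens = self.embed(grouped_genes)             # (B, n_groups, embed_dim)
        out, _ = self.attn(tokens, tokens, tokens)     # attention across groups
        return out

x = torch.randn(2, 10, 200)   # 10 groups of 200 gene-expression values each
print(GroupSelfAttention(10, 200)(x).shape)   # torch.Size([2, 10, 64])
```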
{"title":"Clinical Stage Prompt Induced Multi-Modal Prognosis","authors":"Ting Jin;Xingran Xie;Qingli Li;Xinxing Li;Yan Wang","doi":"10.1109/TMI.2025.3588836","DOIUrl":"10.1109/TMI.2025.3588836","url":null,"abstract":"Histology analysis of the tumor micro-environment integrated with genomic assays is widely regarded as the cornerstone for cancer analysis and survival prediction. This paper jointly incorporates genomics and Whole Slide Images (WSIs), and focuses on addressing the primary challenges involved in multi-modality prognosis analysis: 1) the high-order relevance is difficult to be modeled from dimensional imbalanced gigapixel WSIs and tens of thousands of genetic sequences, and 2) the lack of medical expertise and clinical knowledge hampers the effectiveness of prognosis-oriented multi-modal fusion. Due to the nature of the prognosis task, statistical priors and clinical knowledge are essential factors to provide the likelihood of survival over time, which, however, has been under-studied. To this end, we propose a prognosis-oriented image-omics fusion framework, dubbed Clinical Stage Prompt induced Multimodal Prognosis (CiMP). Concretely, we leverage the capabilities of the advanced LLM to generate descriptions derived from structured clinical records and utilize the generated clinical staging prompts to inquire critical prognosis-related information from each modality intentionally. In addition, we propose a Group Multi-Head Self-Attention module to capture structured group-specific features within cohorts of genomic data. Experimental results on five TCGA datasets show the superiority of our proposed method, achieving state-of-the-art performance compared to previous multi-modal prognostic models. Furthermore, the clinical interpretability and discussion also highlight the immense potential for further medical applications. Our code will be released at <uri>https://github.com/DeepMed-Lab-ECNU/CiMP/</uri>","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 12","pages":"5065-5076"},"PeriodicalIF":0.0,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144629830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Self-Supervised Upsampling for Reconstructions With Generalized Enhancement in Photoacoustic Computed Tomography
Pub Date : 2025-07-14 DOI: 10.1109/TMI.2025.3588789
Kexin Deng;Yan Luo;Hongzhi Zuo;Yuwen Chen;Liujie Gu;Mingyuan Liu;Hengrong Lan;Jianwen Luo;Cheng Ma
Photoacoustic computed tomography (PACT) is an emerging hybrid imaging modality with potential applications in biomedicine. A major roadblock to the widespread adoption of PACT is the limited number of detectors, which gives rise to spatial aliasing and manifests as streak artifacts in the reconstructed image. A brute-force solution is to increase the number of detectors, which, however, is often undesirable due to escalated costs. In this study, we present a novel self-supervised learning approach to overcome this long-standing challenge. We found that small blocks of PACT channel data show similarity across various downsampling rates. Based on this observation, a neural network trained on downsampled data can reliably perform accurate interpolation without requiring densely sampled ground-truth data, which is typically unavailable in practice. Our method has been validated through numerical simulations, controlled phantom experiments, and ex vivo and in vivo animal tests across multiple PACT systems. We have demonstrated that our technique provides an effective and cost-efficient solution to the under-sampling issue in PACT, thereby enhancing the capabilities of this imaging technology.
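A hedged sketch of the self-supervised pairing idea implied above: further subsample the already-sparse channel data and train a network to recover the dropped channels, exploiting the observed self-similarity across downsampling rates. The helper name and array shapes are illustrative assumptions.

```python
# Build a self-supervised (input, target) pair by further subsampling channels.
import numpy as np

def make_pair(sinogram, factor=2):
    """sinogram: (n_detectors, n_samples) PACT channel data.
    Returns (every `factor`-th channel as input, original as target)."""
    sparse = sinogram[::factor]        # simulates an even lower detector count
    return sparse, sinogram

data = np.random.randn(128, 1024)      # stand-in for measured channel data
x, y = make_pair(data)                 # train a network to interpolate x toward y
print(x.shape, y.shape)                # (64, 1024) (128, 1024)
```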
{"title":"Self-Supervised Upsampling for Reconstructions With Generalized Enhancement in Photoacoustic Computed Tomography","authors":"Kexin Deng;Yan Luo;Hongzhi Zuo;Yuwen Chen;Liujie Gu;Mingyuan Liu;Hengrong Lan;Jianwen Luo;Cheng Ma","doi":"10.1109/TMI.2025.3588789","DOIUrl":"10.1109/TMI.2025.3588789","url":null,"abstract":"Photoacoustic computed tomography (PACT) is an emerging hybrid imaging modality with potential applications in biomedicine. A major roadblock to the widespread adoption of PACT is the limited number of detectors, which gives rise to spatial aliasing and manifests as streak artifacts in the reconstructed image. A brute-force solution to the problem is to increase the number of detectors, which, however, is often undesirable due to escalated costs. In this study, we present a novel self-supervised learning approach, to overcome this long-standing challenge. We found that small blocks of PACT channel data show similarity at various downsampling rates. Based on this observation, a neural network trained on downsampled data can reliably perform accurate interpolation without requiring densely-sampled ground truth data, which is typically unavailable in real practice. Our method has undergone validation through numerical simulations, controlled phantom experiments, as well as ex vivo and in vivo animal tests, across multiple PACT systems. We have demonstrated that our technique provides an effective and cost-efficient solution to address the under-sampling issue in PACT, thereby enhancing the capabilities of this imaging technology.","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 12","pages":"5117-5127"},"PeriodicalIF":0.0,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144629835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
PAL: Boosting Skin Lesion Segmentation via Probabilistic Attribute Learning
Pub Date : 2025-07-11 DOI: 10.1109/TMI.2025.3588167
Yuchen Yuan;Xi Wang;Jinpeng Li;Guangyong Chen;Pheng-Ann Heng
Skin lesion segmentation is vital for the early detection, diagnosis, and treatment of melanoma, yet it remains challenging due to significant variations in lesion attributes (e.g., color, size, shape), ambiguous boundaries, and noise interference. Recent advancements have focused on capturing contextual information and incorporating boundary priors to handle challenging lesions. However, there has been limited exploration of the explicit analysis of the inherent patterns of skin lesions, a crucial aspect of the knowledge-driven decision-making process used by clinical experts. In this work, we introduce a novel approach called Probabilistic Attribute Learning (PAL), which leverages knowledge of lesion patterns to achieve enhanced performance on challenging lesions. Recognizing that the lesion patterns exhibited in each image can be properly depicted by disentangled attributes, we begin by explicitly estimating the distributions of these attributes as distinct Gaussian distributions, with the mean and variance indicating the most likely pattern of each attribute and its variation. Using Monte Carlo sampling, we iteratively draw multiple samples from these distributions to capture various potential patterns for each attribute. These samples are then merged through an effective attribute fusion technique, resulting in diverse representations that comprehensively depict the lesion class. By performing pixel-class proximity matching between each pixel-wise representation and the diverse class-wise representations, we significantly enhance the model’s robustness. Extensive experiments on two public skin lesion datasets and one unified polyp lesion dataset demonstrate the effectiveness and strong generalization ability of our method. Codes are available at https://github.com/IsYuchenYuan/PAL.
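A minimal sketch of the probabilistic-attribute idea: each disentangled lesion attribute is modeled as a Gaussian, and Monte Carlo samples are drawn with the reparameterization trick to yield diverse representations. Fusing samples by averaging is an assumption for illustration; it is not the paper's attribute fusion technique.

```python
# Reparameterized Monte Carlo sampling from per-attribute Gaussians.
import torch

def sample_attributes(mu, log_var, n_samples=8):
    """mu, log_var: (n_attr, D) per-attribute Gaussian parameters."""
    std = torch.exp(0.5 * log_var)
    eps = torch.randn(n_samples, *mu.shape)    # (S, n_attr, D) noise draws
    return mu + eps * std                      # reparameterized samples

mu, log_var = torch.zeros(4, 32), torch.zeros(4, 32)   # 4 attributes, dim 32
draws = sample_attributes(mu, log_var)                 # (8, 4, 32)
class_reprs = draws.mean(dim=1)                        # (8, 32) fused, one per draw
print(class_reprs.shape)
```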
{"title":"PAL: Boosting Skin Lesion Segmentation via Probabilistic Attribute Learning","authors":"Yuchen Yuan;Xi Wang;Jinpeng Li;Guangyong Chen;Pheng-Ann Heng","doi":"10.1109/TMI.2025.3588167","DOIUrl":"10.1109/TMI.2025.3588167","url":null,"abstract":"Skin lesion segmentation is vital for the early detection, diagnosis, and treatment of melanoma, yet it remains challenging due to significant variations in lesion attributes (e.g., color, size, shape), ambiguous boundaries, and noise interference. Recent advancements have focused on capturing contextual information and incorporating boundary priors to handle challenging lesions. However, there has been limited exploration on the explicit analysis of the inherent patterns of skin lesions, a crucial aspect of the knowledge-driven decision-making process used by clinical experts. In this work, we introduce a novel approach called Probabilistic Attribute Learning (PAL), which leverages knowledge of lesion patterns to achieve enhanced performance on challenging lesions. Recognizing that the lesion patterns exhibited in each image can be properly depicted by disentangled attributes, we begin by explicitly estimating the distributions of these attributes as distinct Gaussian distributions, with mean and variance indicating the most likely pattern of that attribute and its variation. Using Monte Carlo Sampling, we iteratively draw multiple samples from these distributions to capture various potential patterns for each attribute. These samples are then merged through an effective attribute fusion technique, resulting in diverse representations that comprehensively depict the lesion class. By performing pixel-class proximity matching between each pixel-wise representation and the diverse class-wise representations, we significantly enhance the model’s robustness. Extensive experiments on two public skin lesion datasets and one unified polyp lesion dataset demonstrate the effectiveness and strong generalization ability of our method. Codes are available at <uri>https://github.com/IsYuchenYuan/PAL</uri>","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 12","pages":"5183-5196"},"PeriodicalIF":0.0,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11078393","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144611278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Echocardiography Video Segmentation via Neighborhood Correlation Mining
Pub Date : 2025-07-11 DOI: 10.1109/TMI.2025.3588157
Xiaolong Deng;Huisi Wu
Accurate segmentation of the left ventricle in echocardiography is critical for diagnosing and treating cardiovascular diseases. However, accurate segmentation remains challenging due to the limitations of ultrasound imaging. Although numerous image and video segmentation methods have been proposed, existing methods still fail to solve this task effectively, constrained as it is by sparse annotations. To address this problem, we propose a novel semi-supervised segmentation framework for echocardiography named NCM-Net. We first propose the neighborhood correlation mining (NCM) module, which sufficiently mines the correlations between query features and their spatiotemporal neighborhoods to resist noise. The module also captures cross-scale contextual correlations between pixels spatially to further refine features, thus alleviating the impact of noise on echocardiography segmentation. To further improve segmentation accuracy, we propose unreliable-pixels masked attention (UMA): by masking reliable pixels, it pays extra attention to unreliable pixels to refine segmentation boundaries. Further, we apply cross-frame boundary constraints to the final predictions to optimize their temporal consistency. Through extensive experiments on two publicly available datasets, CAMUS and EchoNet-Dynamic, we demonstrate the effectiveness of the proposed method, which achieves state-of-the-art performance and outstanding temporal consistency. Codes are available at https://github.com/dengxl0520/NCMNet.
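To illustrate the correlation signal such a module mines, the sketch below computes cosine similarity between a query-pixel feature and features from its spatiotemporal neighborhood (nearby pixels in adjacent frames), then converts the correlations to weights. This is a simplified stand-in, not the NCM module itself.

```python
# Cosine correlation between a query feature and its spatiotemporal neighbors.
import torch
import torch.nn.functional as F

def neighborhood_correlation(query, neighbors):
    """query: (C,) feature at one pixel; neighbors: (N, C) features from the
    spatiotemporal neighborhood."""
    q = F.normalize(query, dim=0)
    n = F.normalize(neighbors, dim=1)
    return n @ q                         # (N,) cosine correlations in [-1, 1]

corr = neighborhood_correlation(torch.randn(64), torch.randn(9, 64))
weights = torch.softmax(corr, dim=0)     # down-weights noisy, uncorrelated neighbors
print(weights.sum())                     # ≈ tensor(1.)
```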
{"title":"Echocardiography Video Segmentation via Neighborhood Correlation Mining","authors":"Xiaolong Deng;Huisi Wu","doi":"10.1109/TMI.2025.3588157","DOIUrl":"10.1109/TMI.2025.3588157","url":null,"abstract":"Accurate segmentation of the left ventricle in echocardiography is critical for diagnosing and treating cardiovascular diseases. However, accurate segmentation remains challenging due to the limitations of ultrasound imaging. Although numerous image and video segmentation methods have been proposed, existing methods still fail to effectively solve this task, which is limited by sparsity annotations. To address this problem, we propose a novel semi-supervised segmentation framework named NCM-Net for echocardiography. We first propose the neighborhood correlation mining (NCM) module, which sufficiently mines the correlations between query features and their spatiotemporal neighborhoods to resist noise influence. The module also captures cross-scale contextual correlations between pixels spatially to further refine features, thus alleviating the impact of noise on echocardiography segmentation. To further improve segmentation accuracy, we propose using unreliable-pixels masked attention (UMA). By masking reliable pixels, it pays extra attention to unreliable pixels to refine the boundary of segmentation. Further, we use cross-frame boundary constraints on the final predictions to optimize their temporal consistency. Through extensive experiments on two publicly available datasets, CAMUS and EchoNet-Dynamic, we demonstrate the effectiveness of the proposed, which achieves state-of-the-art performance and outstanding temporal consistency. Codes are available at <uri>https://github.com/dengxl0520/NCMNet</uri>","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 12","pages":"5172-5182"},"PeriodicalIF":0.0,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144611122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Flow-Rate-Constrained Physics-Informed Neural Networks for Flow Field Error Correction in 4-D Flow Magnetic Resonance Imaging
Pub Date : 2025-07-10 DOI: 10.1109/TMI.2025.3587636
Jihun Kang;Eui Cheol Jung;Hyun Jung Koo;Dong Hyun Yang;Hojin Ha
In this study, we present enhanced physics-informed neural networks (PINNs) designed to address flow field errors in four-dimensional flow magnetic resonance imaging (4D Flow MRI). Flow field errors, which typically occur in high-velocity regions, lead to inaccurate velocity fields and underestimated flow rates. We propose incorporating flow rate constraints to ensure physical consistency across cross-sections. The proposed framework includes optimization strategies to improve convergence, stability, and accuracy: artificial viscosity modeling, projecting conflicting gradients (PCGrad), and Euclidean norm scaling were applied to balance loss functions during training. The performance was validated using 2D computational fluid dynamics (CFD) with synthetic error, in-vitro 4D flow MRI mimicking the aortic valve, and in-vivo 4D flow MRI from patients with aortic regurgitation and aortic stenosis. This study demonstrated considerable improvements in correcting flow field errors, denoising, and super-resolution. Notably, the proposed PINNs provided accurate flow rate reconstruction in stenotic and high-velocity regions. This approach extends the applicability of 4D flow MRI by providing reliable hemodynamics in the post-processing stage.
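A hedged sketch of how a flow-rate constraint can be coded: the volumetric flow rate through a cross-section is the discretized integral of normal velocity over its area, and penalizing rate differences between sections enforces conservation along the vessel. The discretization and loss form are assumptions about one reasonable implementation, not the paper's exact loss.

```python
# Flow-rate conservation loss: equal volumetric rate at every cross-section.
import torch

def flow_rate(v_normal, pixel_area):
    """v_normal: (n_pixels,) velocity normal to one cross-section."""
    return (v_normal * pixel_area).sum()          # Q = sum(v_n * dA)

def flow_rate_loss(sections, pixel_area):
    """sections: list of per-cross-section normal velocities along the vessel."""
    rates = torch.stack([flow_rate(v, pixel_area) for v in sections])
    return ((rates - rates.mean()) ** 2).mean()   # penalize rate disagreement

sections = [torch.randn(200, requires_grad=True) for _ in range(5)]
loss = flow_rate_loss(sections, pixel_area=1e-6)
loss.backward()    # would be summed with the PINN's other physics losses
```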
{"title":"Flow-Rate-Constrained Physics-Informed Neural Networks for Flow Field Error Correction in 4-D Flow Magnetic Resonance Imaging","authors":"Jihun Kang;Eui Cheol Jung;Hyun Jung Koo;Dong Hyun Yang;Hojin Ha","doi":"10.1109/TMI.2025.3587636","DOIUrl":"10.1109/TMI.2025.3587636","url":null,"abstract":"In this study, we present enhanced physics-informed neural networks (PINNs), which were designed to address flow field errors in four-dimensional flow magnetic resonance imaging (4D Flow MRI). Flow field errors, typically occurring in high-velocity regions, lead to inaccuracies in velocity fields and flow rate underestimation. We proposed incorporating flow rate constraints to ensure physical consistency across cross-sections. The proposed framework included optimization strategies to improve convergence, stability, and accuracy. Artificial viscosity modeling, projecting conflicting gradients (PCGrad), and Euclidean norm scaling were applied to balance loss functions during training. The performance was validated using 2D computational fluid dynamics (CFD) with synthetic error, in-vitro 4D flow MRI mimicking aortic valve, and in-vivo 4D flow MRI from patients with aortic regurgitation and aortic stenosis. This study demonstrated considerable improvements in correcting flow field errors, denoising, and super-resolution. Notably, the proposed PINNs provided accurate flow rate reconstruction in stenotic and high-velocity regions. This approach extends the applicability of 4D flow MRI by providing reliable hemodynamics in the post-processing stage.","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 12","pages":"5155-5171"},"PeriodicalIF":0.0,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144603140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Coupled Diffusion Models for Metal Artifact Reduction of Clinical Dental CBCT Images
Pub Date : 2025-07-08 DOI: 10.1109/TMI.2025.3587131
Zhouzhuo Zhang;Juncheng Yan;Yuxuan Shi;Zhiming Cui;Jun Xu;Dinggang Shen
Metal dental implants may introduce metal artifacts (MA) during the CBCT imaging process, causing significant interference in subsequent diagnosis. In recent years, many deep learning methods for metal artifact reduction (MAR) have been proposed. Due to the large gap between synthetic and clinical MA, supervised MAR methods may perform poorly in clinical settings, and many existing unsupervised MAR methods trained on clinical data suffer from incorrect dental morphology. To alleviate these problems, in this paper we propose Coupled Diffusion Models (CDM), a new MAR method for clinical dental CBCT images. Specifically, we train two diffusion models, one on clinical MA-degraded images and one on clean clinical images, to obtain prior information. During the denoising process, the noise-level variances are calculated from the MA images and the diffusion model priors. We then develop a noise transformation module between the two diffusion models that transforms the MA noise image into a new initial value for the denoising process. Our designs effectively exploit the inherent transformation between the misaligned MA-degraded images and clean images. Additionally, we introduce an MA-adaptive inference technique to better accommodate the MA degradation in different areas of an MA-degraded image. Experiments on our clinical dataset demonstrate that our CDM outperforms the comparison methods on both objective metrics and visual quality, especially for severe MA degradation. We will publicly release our code.
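As a rough illustration of mapping an estimated artifact noise level onto a diffusion schedule, the sketch below picks the intermediate timestep whose noise standard deviation matches an image-derived level and starts denoising there (SDEdit-style). The level estimator, the linear schedule, and the matching rule are all assumptions; the paper's noise transformation module is not reproduced here.

```python
# Match an estimated artifact level to a diffusion timestep to start denoising.
import numpy as np

betas = np.linspace(1e-4, 0.02, 1000)          # a standard linear beta schedule
alphas_bar = np.cumprod(1.0 - betas)
sigmas = np.sqrt(1.0 - alphas_bar)             # per-timestep noise std

def initial_timestep(ma_image, clean_ref_std):
    """Pick the timestep whose noise std best matches the artifact residual."""
    level = min(np.std(ma_image) / clean_ref_std, sigmas[-1])
    return int(np.argmin(np.abs(sigmas - level)))

t0 = initial_timestep(np.random.randn(64, 64) * 0.4, clean_ref_std=1.0)
print(t0)   # denoising would then run from t0 down to 0 on the transformed image
```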
{"title":"Coupled Diffusion Models for Metal Artifact Reduction of Clinical Dental CBCT Images","authors":"Zhouzhuo Zhang;Juncheng Yan;Yuxuan Shi;Zhiming Cui;Jun Xu;Dinggang Shen","doi":"10.1109/TMI.2025.3587131","DOIUrl":"10.1109/TMI.2025.3587131","url":null,"abstract":"Metal dental implants may introduce metal artifacts (MA) during the CBCT imaging process, causing significant interference in subsequent diagnosis. In recent years, many deep learning methods for metal artifact reduction (MAR) have been proposed. Due to the huge difference between synthetic and clinical MA, supervised learning MAR methods may perform poorly in clinical settings. Many existing unsupervised MAR methods trained on clinical data often suffer from incorrect dental morphology. To alleviate the above problems, in this paper, we propose a new MAR method of Coupled Diffusion Models (CDM) for clinical dental CBCT images. Specifically, we separately train two diffusion models on clinical MA-degraded images and clinical clean images to obtain prior information, respectively. During the denoising process, the variances of noise levels are calculated from MA images and the prior of diffusion models. Then we develop a noise transformation module between the two diffusion models to transform the MA noise image into a new initial value for the denoising process. Our designs effectively exploit the inherent transformation between the misaligned MA-degraded images and clean images. Additionally, we introduce an MA-adaptive inference technique to better accommodate the MA degradation in different areas of an MA-degraded image. Experiments on our clinical dataset demonstrate that our CDM outperforms the comparison methods on both objective metrics and visual quality, especially for severe MA degradation. We will publicly release our code.","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 12","pages":"5103-5116"},"PeriodicalIF":0.0,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144578618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0