IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL Biomedical Signal Processing and Control Pub Date : 2025-03-05 DOI:10.1016/j.bspc.2025.107735
Xiaoyan Kui , Bo Liu , Zanbo Sun , Qinsong Li , Min Zhang , Wei Liang , Beiji Zou
{"title":"Med-LVDM: Medical latent variational diffusion model for medical image translation","authors":"Xiaoyan Kui ,&nbsp;Bo Liu ,&nbsp;Zanbo Sun ,&nbsp;Qinsong Li ,&nbsp;Min Zhang ,&nbsp;Wei Liang ,&nbsp;Beiji Zou","doi":"10.1016/j.bspc.2025.107735","DOIUrl":null,"url":null,"abstract":"<div><div>Learning-based methods for medical image translation have proven effective in addressing the challenge of obtaining complete multimodal medical images in clinical practice, particularly when patients are allergic to contrast agents or critical illnesses. Recently, diffusion models have exhibited superior performance in various image-generation tasks and are expected to replace generative adversarial networks (GANs) for medical image translation. However, existing methods suffer from unintuitive training objectives and complex network structures that curtail their efficacy in this domain. To address this gap, we propose a novel medical latent variational diffusion model (Med-LVDM) for efficient medical image translation. Firstly, we introduce a new parametric representation based on the variational diffusion model (VDM) and optimize the training objective to the weighted mean square error between the synthetic and target images, which is intuitive and has fewer model parameters. Then, we map the diffusion training and sampling process to the latent space, significantly reducing computational complexity to enhance the feasibility of clinical applications. Finally, to capture global information without focusing solely on local features, we utilize U-ViT as the backbone for Med-LVDM to effectively adapt to the latent space representing abstract information rather than concrete pixel-level information. Extensive qualitative and quantitative results in multi-contrast MRI and cross-modality MRI-CT datasets demonstrate our superiority in translation quality compared to state-of-the-art methods. In particular, Med-LVDM achieved its highest SSIM and PSNR of 92.37% and 26.23 dB on the BraTS2018 dataset, 90.18% and 24.55 dB on the IXI dataset, 91.61% and 25.52 dB on the MRI-CT dataset.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"106 ","pages":"Article 107735"},"PeriodicalIF":4.9000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425002460","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

摘要

事实证明,基于学习的医学图像翻译方法能有效解决临床实践中获取完整多模态医学图像的难题,尤其是当患者对造影剂过敏或患有危重疾病时。最近,扩散模型在各种图像生成任务中表现出卓越的性能,有望取代生成式对抗网络(GAN)用于医学图像翻译。然而,现有的方法存在训练目标不直观、网络结构复杂等问题,从而降低了它们在这一领域的功效。针对这一缺陷,我们提出了一种新型医学潜变量扩散模型(Med-LVDM),用于高效医学图像翻译。首先,我们在变分扩散模型(VDM)的基础上引入了一种新的参数表示法,并将训练目标优化为合成图像与目标图像之间的加权均方误差,这样既直观又能减少模型参数。然后,我们将扩散训练和采样过程映射到潜空间,大大降低了计算复杂度,提高了临床应用的可行性。最后,为了捕捉全局信息而不只关注局部特征,我们利用 U-ViT 作为 Med-LVDM 的骨干,以有效适应代表抽象信息而非具体像素级信息的潜空间。多对比 MRI 和跨模态 MRI-CT 数据集的大量定性和定量结果表明,与最先进的方法相比,我们的翻译质量更胜一筹。其中,Med-LVDM 在 BraTS2018 数据集上的 SSIM 和 PSNR 分别达到 92.37% 和 26.23 dB,在 IXI 数据集上分别达到 90.18% 和 24.55 dB,在 MRI-CT 数据集上分别达到 91.61% 和 25.52 dB。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Med-LVDM: Medical latent variational diffusion model for medical image translation
Learning-based methods for medical image translation have proven effective in addressing the challenge of obtaining complete multimodal medical images in clinical practice, particularly when patients are allergic to contrast agents or critical illnesses. Recently, diffusion models have exhibited superior performance in various image-generation tasks and are expected to replace generative adversarial networks (GANs) for medical image translation. However, existing methods suffer from unintuitive training objectives and complex network structures that curtail their efficacy in this domain. To address this gap, we propose a novel medical latent variational diffusion model (Med-LVDM) for efficient medical image translation. Firstly, we introduce a new parametric representation based on the variational diffusion model (VDM) and optimize the training objective to the weighted mean square error between the synthetic and target images, which is intuitive and has fewer model parameters. Then, we map the diffusion training and sampling process to the latent space, significantly reducing computational complexity to enhance the feasibility of clinical applications. Finally, to capture global information without focusing solely on local features, we utilize U-ViT as the backbone for Med-LVDM to effectively adapt to the latent space representing abstract information rather than concrete pixel-level information. Extensive qualitative and quantitative results in multi-contrast MRI and cross-modality MRI-CT datasets demonstrate our superiority in translation quality compared to state-of-the-art methods. In particular, Med-LVDM achieved its highest SSIM and PSNR of 92.37% and 26.23 dB on the BraTS2018 dataset, 90.18% and 24.55 dB on the IXI dataset, 91.61% and 25.52 dB on the MRI-CT dataset.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
期刊最新文献
Gaussian regressed generative adversarial network based hermitian extreme gradient boosting for plant leaf disease detection Computer-aided diagnosis of spinal deformities based on keypoints detection in human back depth images Advancing cardiovascular risk prediction: A fusion of SVM models with fuzzy logic and the Sugeno integral Altered visual network modularity and communication in ADHD subtypes: Classification via source-localized EEG modules STD-YOLOv7:A small target detector for micronucleus based on YOLOv7
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1