mar - unet:基于U-Net的LDCT图像去噪,其中包含多个轻量级的基于注意力的模块和残差增强。

IF 3.1 3区 医学 Q2 ENGINEERING, BIOMEDICAL Physics in medicine and biology Pub Date : 2025-02-13 DOI:10.1088/1361-6560/adb19a
Hao Tang, Ningfeng Que, Yanwen Tian, Mingzhe Li, Alessandro Perelli, Yueyang Teng
{"title":"mar - unet:基于U-Net的LDCT图像去噪,其中包含多个轻量级的基于注意力的模块和残差增强。","authors":"Hao Tang, Ningfeng Que, Yanwen Tian, Mingzhe Li, Alessandro Perelli, Yueyang Teng","doi":"10.1088/1361-6560/adb19a","DOIUrl":null,"url":null,"abstract":"<p><p><i>Objective.</i>Computed tomography (CT) is a crucial medical imaging technique which uses x-ray radiation to identify cancer tissues. Since radiation poses a significant health risk, low dose acquisition procedures need to be adopted. However, low-dose CT (LDCT) can cause higher noise and artifacts which massively degrade the diagnosis.<i>Approach.</i>To denoise LDCT images more effectively, this paper proposes a deep learning method based on U-Net with multiple lightweight attention-based modules and residual reinforcement (MLAR-UNet). We integrate a U-Net architecture with several advanced modules, including Convolutional Block Attention Module (CBAM), Cross Residual Module (CR), Attention Cross Reinforcement Module (ACRM), and Convolution and Transformer Cross Attention Module (CTCAM). Among these modules, CBAM applies channel and spatial attention mechanisms to enhance local feature representation. However, serious detail loss caused by incorrect embedding of CBAM for LDCT denoising is verified in this study. To relieve this, we introduce CR to reduce information loss in deeper layers, preserving features more effectively. To address the excessive local attention of CBAM, we design ACRM, which incorporates Transformer to adjust the attention weights. Furthermore, we design CTCAM, which leverages a complex combination of Transformer and convolution to capture multi-scale information and compute more accurate attention weights.<i>Results.</i>Experiments verify the embedding rationality and validity of each module and show that the proposed MLAR-UNet denoises LDCT images more effectively and preserves more details than many state-of-the-art methods on clinical chest and abdominal CT datasets.<i>Significance.</i>The proposed MLAR-UNet not only demonstrates superior LDCT image denoising capability but also highlights the strong detail comprehension and negligible overheads of our designed ACRM and CTCAM. These findings provide a novel approach to integrating Transformer more efficiently in image processing.</p>","PeriodicalId":20185,"journal":{"name":"Physics in medicine and biology","volume":" ","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MLAR-UNet: LDCT image denoising based on U-Net with multiple lightweight attention-based modules and residual reinforcement.\",\"authors\":\"Hao Tang, Ningfeng Que, Yanwen Tian, Mingzhe Li, Alessandro Perelli, Yueyang Teng\",\"doi\":\"10.1088/1361-6560/adb19a\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p><i>Objective.</i>Computed tomography (CT) is a crucial medical imaging technique which uses x-ray radiation to identify cancer tissues. Since radiation poses a significant health risk, low dose acquisition procedures need to be adopted. However, low-dose CT (LDCT) can cause higher noise and artifacts which massively degrade the diagnosis.<i>Approach.</i>To denoise LDCT images more effectively, this paper proposes a deep learning method based on U-Net with multiple lightweight attention-based modules and residual reinforcement (MLAR-UNet). We integrate a U-Net architecture with several advanced modules, including Convolutional Block Attention Module (CBAM), Cross Residual Module (CR), Attention Cross Reinforcement Module (ACRM), and Convolution and Transformer Cross Attention Module (CTCAM). Among these modules, CBAM applies channel and spatial attention mechanisms to enhance local feature representation. However, serious detail loss caused by incorrect embedding of CBAM for LDCT denoising is verified in this study. To relieve this, we introduce CR to reduce information loss in deeper layers, preserving features more effectively. To address the excessive local attention of CBAM, we design ACRM, which incorporates Transformer to adjust the attention weights. Furthermore, we design CTCAM, which leverages a complex combination of Transformer and convolution to capture multi-scale information and compute more accurate attention weights.<i>Results.</i>Experiments verify the embedding rationality and validity of each module and show that the proposed MLAR-UNet denoises LDCT images more effectively and preserves more details than many state-of-the-art methods on clinical chest and abdominal CT datasets.<i>Significance.</i>The proposed MLAR-UNet not only demonstrates superior LDCT image denoising capability but also highlights the strong detail comprehension and negligible overheads of our designed ACRM and CTCAM. These findings provide a novel approach to integrating Transformer more efficiently in image processing.</p>\",\"PeriodicalId\":20185,\"journal\":{\"name\":\"Physics in medicine and biology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-02-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physics in medicine and biology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1088/1361-6560/adb19a\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics in medicine and biology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6560/adb19a","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

摘要

目的:计算机断层扫描(CT)是一种重要的医学成像技术,它利用x射线辐射来识别肿瘤组织。由于辐射对健康构成重大风险,因此需要采用低剂量获取程序。然而,低剂量CT (LDCT)会产生较高的噪声和伪影,从而大大降低了诊断。方法:为了更有效地去噪LDCT图像,本文提出了一种基于U-Net的深度学习方法,该方法包含多个轻量级的基于注意的模块和残差增强(mar - unet),我们将U-Net架构与卷积块注意模块(CBAM)、交叉残差模块(CR)、注意交叉增强模块(ACRM)和卷积和变形交叉注意模块(CTCAM)等几个高级模块集成在一起。在这些模块中,CBAM采用通道和空间注意机制来增强局部特征表征。然而,本研究验证了CBAM在LDCT去噪中不正确的嵌入会造成严重的细节损失。为了解决这个问题,我们引入CR来减少更深层的信息丢失,更有效地保留特征。为了解决CBAM的局部注意力过多的问题,我们设计了ACRM,该ACRM结合了Transformer来调整注意力权重。此外,我们设计了CTCAM,它利用变压器和卷积的复杂组合来捕获多尺度信息并计算更准确的注意力权重。结果:实验验证了每个模块嵌入的合理性和有效性,并表明所提出的MLAR-UNet在临床胸部和腹部CT数据集上比许多最先进的(SOTA)方法更有效地去噪LDCT图像,并保留了更多的细节。意义:所提出的mar - unet不仅显示了优越的LDCT图像去噪能力,而且突出了我们设计的ACRM和CTCAM的强大细节理解能力和可忽略不计的开销。这些发现为在图像处理中更有效地集成Transformer提供了一种新的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
MLAR-UNet: LDCT image denoising based on U-Net with multiple lightweight attention-based modules and residual reinforcement.

Objective.Computed tomography (CT) is a crucial medical imaging technique which uses x-ray radiation to identify cancer tissues. Since radiation poses a significant health risk, low dose acquisition procedures need to be adopted. However, low-dose CT (LDCT) can cause higher noise and artifacts which massively degrade the diagnosis.Approach.To denoise LDCT images more effectively, this paper proposes a deep learning method based on U-Net with multiple lightweight attention-based modules and residual reinforcement (MLAR-UNet). We integrate a U-Net architecture with several advanced modules, including Convolutional Block Attention Module (CBAM), Cross Residual Module (CR), Attention Cross Reinforcement Module (ACRM), and Convolution and Transformer Cross Attention Module (CTCAM). Among these modules, CBAM applies channel and spatial attention mechanisms to enhance local feature representation. However, serious detail loss caused by incorrect embedding of CBAM for LDCT denoising is verified in this study. To relieve this, we introduce CR to reduce information loss in deeper layers, preserving features more effectively. To address the excessive local attention of CBAM, we design ACRM, which incorporates Transformer to adjust the attention weights. Furthermore, we design CTCAM, which leverages a complex combination of Transformer and convolution to capture multi-scale information and compute more accurate attention weights.Results.Experiments verify the embedding rationality and validity of each module and show that the proposed MLAR-UNet denoises LDCT images more effectively and preserves more details than many state-of-the-art methods on clinical chest and abdominal CT datasets.Significance.The proposed MLAR-UNet not only demonstrates superior LDCT image denoising capability but also highlights the strong detail comprehension and negligible overheads of our designed ACRM and CTCAM. These findings provide a novel approach to integrating Transformer more efficiently in image processing.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Physics in medicine and biology
Physics in medicine and biology 医学-工程:生物医学
CiteScore
6.50
自引率
14.30%
发文量
409
审稿时长
2 months
期刊介绍: The development and application of theoretical, computational and experimental physics to medicine, physiology and biology. Topics covered are: therapy physics (including ionizing and non-ionizing radiation); biomedical imaging (e.g. x-ray, magnetic resonance, ultrasound, optical and nuclear imaging); image-guided interventions; image reconstruction and analysis (including kinetic modelling); artificial intelligence in biomedical physics and analysis; nanoparticles in imaging and therapy; radiobiology; radiation protection and patient dose monitoring; radiation dosimetry
期刊最新文献
Interlayer-aware postoperative facial appearance prediction in orthognathic surgery with bio-geometric guidance. GPDM: generation-prior diffusion model for accelerated direct attenuation and scatter correction of whole-body18F-FDG PET. DecX-Net: a dual-path feature decoupling network with layer-wise feature alignment for subsolid pulmonary nodule segmentation. Lung ion-fluoroscopy Guided Hadron therapy: LIGHT concept and proof-of-principle. Anatomy-based parameter selection and efficiency-quality balance in Radixact helical tomotherapy.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1