MLAR-UNet: LDCT image denoising based on U-Net with multiple lightweight attention-based modules and residual reinforcement.

IF 3.3 3区医学 Q2 ENGINEERING, BIOMEDICAL Physics in medicine and biology Pub Date : 2025-02-13 DOI:10.1088/1361-6560/adb19a

Hao Tang, Ningfeng Que, Yanwen Tian, Mingzhe Li, Alessandro Perelli, Yueyang Teng

{"title":"MLAR-UNet: LDCT image denoising based on U-Net with multiple lightweight attention-based modules and residual reinforcement.","authors":"Hao Tang, Ningfeng Que, Yanwen Tian, Mingzhe Li, Alessandro Perelli, Yueyang Teng","doi":"10.1088/1361-6560/adb19a","DOIUrl":null,"url":null,"abstract":"Objective.Computed tomography (CT) is a crucial medical imaging technique which uses x-ray radiation to identify cancer tissues. Since radiation poses a significant health risk, low dose acquisition procedures need to be adopted. However, low-dose CT (LDCT) can cause higher noise and artifacts which massively degrade the diagnosis.Approach.To denoise LDCT images more effectively, this paper proposes a deep learning method based on U-Net with multiple lightweight attention-based modules and residual reinforcement (MLAR-UNet). We integrate a U-Net architecture with several advanced modules, including Convolutional Block Attention Module (CBAM), Cross Residual Module (CR), Attention Cross Reinforcement Module (ACRM), and Convolution and Transformer Cross Attention Module (CTCAM). Among these modules, CBAM applies channel and spatial attention mechanisms to enhance local feature representation. However, serious detail loss caused by incorrect embedding of CBAM for LDCT denoising is verified in this study. To relieve this, we introduce CR to reduce information loss in deeper layers, preserving features more effectively. To address the excessive local attention of CBAM, we design ACRM, which incorporates Transformer to adjust the attention weights. Furthermore, we design CTCAM, which leverages a complex combination of Transformer and convolution to capture multi-scale information and compute more accurate attention weights.Results.Experiments verify the embedding rationality and validity of each module and show that the proposed MLAR-UNet denoises LDCT images more effectively and preserves more details than many state-of-the-art methods on clinical chest and abdominal CT datasets.Significance.The proposed MLAR-UNet not only demonstrates superior LDCT image denoising capability but also highlights the strong detail comprehension and negligible overheads of our designed ACRM and CTCAM. These findings provide a novel approach to integrating Transformer more efficiently in image processing.","PeriodicalId":20185,"journal":{"name":"Physics in medicine and biology","volume":" ","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics in medicine and biology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6560/adb19a","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Objective.Computed tomography (CT) is a crucial medical imaging technique which uses x-ray radiation to identify cancer tissues. Since radiation poses a significant health risk, low dose acquisition procedures need to be adopted. However, low-dose CT (LDCT) can cause higher noise and artifacts which massively degrade the diagnosis.Approach.To denoise LDCT images more effectively, this paper proposes a deep learning method based on U-Net with multiple lightweight attention-based modules and residual reinforcement (MLAR-UNet). We integrate a U-Net architecture with several advanced modules, including Convolutional Block Attention Module (CBAM), Cross Residual Module (CR), Attention Cross Reinforcement Module (ACRM), and Convolution and Transformer Cross Attention Module (CTCAM). Among these modules, CBAM applies channel and spatial attention mechanisms to enhance local feature representation. However, serious detail loss caused by incorrect embedding of CBAM for LDCT denoising is verified in this study. To relieve this, we introduce CR to reduce information loss in deeper layers, preserving features more effectively. To address the excessive local attention of CBAM, we design ACRM, which incorporates Transformer to adjust the attention weights. Furthermore, we design CTCAM, which leverages a complex combination of Transformer and convolution to capture multi-scale information and compute more accurate attention weights.Results.Experiments verify the embedding rationality and validity of each module and show that the proposed MLAR-UNet denoises LDCT images more effectively and preserves more details than many state-of-the-art methods on clinical chest and abdominal CT datasets.Significance.The proposed MLAR-UNet not only demonstrates superior LDCT image denoising capability but also highlights the strong detail comprehension and negligible overheads of our designed ACRM and CTCAM. These findings provide a novel approach to integrating Transformer more efficiently in image processing.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

求助全文

约1分钟内获得全文去求助

来源期刊

Physics in medicine and biology 医学-工程：生物医学

CiteScore

6.50

自引率

14.30%

发文量

409

审稿时长

2 months

期刊介绍： The development and application of theoretical, computational and experimental physics to medicine, physiology and biology. Topics covered are: therapy physics (including ionizing and non-ionizing radiation); biomedical imaging (e.g. x-ray, magnetic resonance, ultrasound, optical and nuclear imaging); image-guided interventions; image reconstruction and analysis (including kinetic modelling); artificial intelligence in biomedical physics and analysis; nanoparticles in imaging and therapy; radiobiology; radiation protection and patient dose monitoring; radiation dosimetry