Yanmei Li , Xiaoshuang Li , Jian Luo , Tao Yu , Jingshi Deng , Qibin Yang
{"title":"MLDAN:Multi-scale large kernel decomposition attention network for super-resolution of lung computed tomography images","authors":"Yanmei Li , Xiaoshuang Li , Jian Luo , Tao Yu , Jingshi Deng , Qibin Yang","doi":"10.1016/j.bspc.2025.107795","DOIUrl":null,"url":null,"abstract":"<div><div>Computed Tomography(CT) is widely used in medical diagnostics, with high-resolution(HR) CT images providing essential details for disease diagnosis. However, factors like equipment limitations and motion artifacts can result in low-resolution(LR) CT images. To address this, researchers have explored image super-resolution(SR) techniques to restore HR images, but challenges remain, particularly with lung CT images. One key challenge is the diversity of lung structures, such as variations in the shape and size of the lung parenchyma, bronchi, blood vessels, and lesions, which standard convolutional neural networks(CNNs) struggle to recover. In this paper, a method based on multi-scale large kernel decomposition attention network(MLDAN) for lung CT image super-resolution is proposed. This method captures detailed features of lung structures through multi-scale large kernel decomposition attention(MLDA) mechanism. To further enhance irregular lung edge details, we designed an anatomy-aware dynamic fusion module(ADFM), which adapts to complex lung features using deformable convolutions. Additionally, preserving global information between lung structures and their surrounding tissues is crucial for disease diagnosis. To maintain this, we incorporated MLDA as a Token Mixer within the MetaFormer architecture to efficiently model global dependencies. Experimental results on the public COVID-CT dataset and the lung cancer dataset IQ-OTH/NCCD show that the proposed method outperforms state-of-the-art(SOTA) techniques in both objective metrics(PSNR/SSIM) and visual quality, while maintaining a reduced parameter count. This method has significant potential for application and provides support for the early diagnosis of lung diseases. Codes are available at <span><span>https://github.com/tiantianbaipiao/MLDAN</span><svg><path></path></svg></span></div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"107 ","pages":"Article 107795"},"PeriodicalIF":4.9000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425003064","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Computed Tomography(CT) is widely used in medical diagnostics, with high-resolution(HR) CT images providing essential details for disease diagnosis. However, factors like equipment limitations and motion artifacts can result in low-resolution(LR) CT images. To address this, researchers have explored image super-resolution(SR) techniques to restore HR images, but challenges remain, particularly with lung CT images. One key challenge is the diversity of lung structures, such as variations in the shape and size of the lung parenchyma, bronchi, blood vessels, and lesions, which standard convolutional neural networks(CNNs) struggle to recover. In this paper, a method based on multi-scale large kernel decomposition attention network(MLDAN) for lung CT image super-resolution is proposed. This method captures detailed features of lung structures through multi-scale large kernel decomposition attention(MLDA) mechanism. To further enhance irregular lung edge details, we designed an anatomy-aware dynamic fusion module(ADFM), which adapts to complex lung features using deformable convolutions. Additionally, preserving global information between lung structures and their surrounding tissues is crucial for disease diagnosis. To maintain this, we incorporated MLDA as a Token Mixer within the MetaFormer architecture to efficiently model global dependencies. Experimental results on the public COVID-CT dataset and the lung cancer dataset IQ-OTH/NCCD show that the proposed method outperforms state-of-the-art(SOTA) techniques in both objective metrics(PSNR/SSIM) and visual quality, while maintaining a reduced parameter count. This method has significant potential for application and provides support for the early diagnosis of lung diseases. Codes are available at https://github.com/tiantianbaipiao/MLDAN
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.