Optimized YOLOv11 model for lung nodule detection

IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL Biomedical Signal Processing and Control Pub Date : 2025-03-29 DOI:10.1016/j.bspc.2025.107830
Zichao Liu , Lili Wei , Tingqiang Song
{"title":"Optimized YOLOv11 model for lung nodule detection","authors":"Zichao Liu ,&nbsp;Lili Wei ,&nbsp;Tingqiang Song","doi":"10.1016/j.bspc.2025.107830","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>This study proposes an advanced YOLOv11-based lung nodule detection algorithm that balances high accuracy with efficient computation, addressing the critical need for accurate and timely early diagnosis of lung cancer.</div></div><div><h3>Methods</h3><div>We replaced the traditional backbone with MobileNetV4, which employs reversible connections to prevent information loss and enhance feature representation, thereby improving the model’s efficiency in processing high-resolution CT scans. We developed a novel C2PSA module, C2PSA-MSDA, which integrates Multi-Scale Dilation Attention (MSDA) to capture multi-scale features more effectively. For the neck part, we introduced the new FreqFusion-BiFPN to enhance feature integration and boundary clarity, thereby reducing false positives. Additionally, we created a new C3k2 module, DyC3k2, to optimize feature fusion. We adopted Focal-inv-IoU for bounding box regression and Slide Loss for classification, which help the model focus more on high-quality predictions while still considering lower-quality ones, leading to more balanced and accurate detection.</div></div><div><h3>Results</h3><div>Extensive experiments on the LUNAR16 dataset and a proprietary dataset demonstrated significant improvements: precision increased by 4.15 %, recall by 3.23 %, mAP50 by 4.04 %, and mAP50-95 by 3.28 % compared to the baseline YOLOv11. These gains were achieved with a smaller model size (5.08 MB) and a processing speed of 135.2 frames per second (f/s). The model also performed well on the proprietary dataset, demonstrating strong generalization.</div></div><div><h3>Conclusion</h3><div>The results indicate that the improved algorithm achieves higher accuracy, real-time performance, and better generalization in lung nodule detection, highlighting its potential for clinical application in the early lung cancer diagnosis.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"107 ","pages":"Article 107830"},"PeriodicalIF":4.9000,"publicationDate":"2025-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425003416","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives

This study proposes an advanced YOLOv11-based lung nodule detection algorithm that balances high accuracy with efficient computation, addressing the critical need for accurate and timely early diagnosis of lung cancer.

Methods

We replaced the traditional backbone with MobileNetV4, which employs reversible connections to prevent information loss and enhance feature representation, thereby improving the model’s efficiency in processing high-resolution CT scans. We developed a novel C2PSA module, C2PSA-MSDA, which integrates Multi-Scale Dilation Attention (MSDA) to capture multi-scale features more effectively. For the neck part, we introduced the new FreqFusion-BiFPN to enhance feature integration and boundary clarity, thereby reducing false positives. Additionally, we created a new C3k2 module, DyC3k2, to optimize feature fusion. We adopted Focal-inv-IoU for bounding box regression and Slide Loss for classification, which help the model focus more on high-quality predictions while still considering lower-quality ones, leading to more balanced and accurate detection.

Results

Extensive experiments on the LUNAR16 dataset and a proprietary dataset demonstrated significant improvements: precision increased by 4.15 %, recall by 3.23 %, mAP50 by 4.04 %, and mAP50-95 by 3.28 % compared to the baseline YOLOv11. These gains were achieved with a smaller model size (5.08 MB) and a processing speed of 135.2 frames per second (f/s). The model also performed well on the proprietary dataset, demonstrating strong generalization.

Conclusion

The results indicate that the improved algorithm achieves higher accuracy, real-time performance, and better generalization in lung nodule detection, highlighting its potential for clinical application in the early lung cancer diagnosis.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于肺结节检测的优化 YOLOv11 模型
目的提出一种先进的基于yolov11的肺结节检测算法,该算法兼顾了高精度和高效的计算,解决了准确、及时的肺癌早期诊断的迫切需求。方法采用MobileNetV4取代传统的骨干网,利用可逆连接防止信息丢失,增强特征表征,提高模型处理高分辨率CT扫描的效率。我们开发了一种新的C2PSA模块,C2PSA-MSDA,它集成了多尺度扩张注意(MSDA),可以更有效地捕获多尺度特征。对于颈部,我们引入了新的FreqFusion-BiFPN,以增强特征集成和边界清晰度,从而减少误报。此外,我们还创建了一个新的C3k2模块DyC3k2来优化特征融合。我们采用Focal-inv-IoU进行边界盒回归,采用Slide Loss进行分类,这样可以帮助模型更加关注高质量的预测,同时也会考虑低质量的预测,从而使检测更加平衡和准确。结果在LUNAR16数据集和专有数据集上的大量实验表明,与基线YOLOv11相比,精度提高了4.15%,召回率提高了3.23%,mAP50提高了4.04%,mAP50-95提高了3.28%。这些增益是在较小的模型大小(5.08 MB)和每秒135.2帧(f/s)的处理速度下实现的。该模型在专有数据集上也表现良好,显示出较强的泛化能力。结论改进算法在肺结节检测中具有更高的准确率、实时性和更好的泛化能力,在早期肺癌诊断中具有临床应用潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
期刊最新文献
PulseAI: An automated machine learning-based augmentation index detector for arterial stiffness monitoring from cuff-based measurements 3D CNN-based method for automatic reorientation of 11C-acetate cardiac PET images using anchor point detection SWDL: Stratum-Wise Difference Learning with deep Laplacian pyramid for semi-supervised 3D intracranial hemorrhage segmentation Diffusion model-based medical ultrasound segmentation network in ultrasound image The application of convolutional neural networks for brain age prediction: A systematic review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1