Image detection method for multi-category lesions in wireless capsule endoscopy based on deep learning models.

IF 4.3 3区 医学 Q1 GASTROENTEROLOGY & HEPATOLOGY World Journal of Gastroenterology Pub Date : 2024-12-28 DOI:10.3748/wjg.v30.i48.5111
Zhi-Guo Xiao, Xian-Qing Chen, Dong Zhang, Xin-Yuan Li, Wen-Xin Dai, Wen-Hui Liang
{"title":"Image detection method for multi-category lesions in wireless capsule endoscopy based on deep learning models.","authors":"Zhi-Guo Xiao, Xian-Qing Chen, Dong Zhang, Xin-Yuan Li, Wen-Xin Dai, Wen-Hui Liang","doi":"10.3748/wjg.v30.i48.5111","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Wireless capsule endoscopy (WCE) has become an important noninvasive and portable tool for diagnosing digestive tract diseases and has been propelled by advancements in medical imaging technology. However, the complexity of the digestive tract structure, and the diversity of lesion types, results in different sites and types of lesions distinctly appearing in the images, posing a challenge for the accurate identification of digestive tract diseases.</p><p><strong>Aim: </strong>To propose a deep learning-based lesion detection model to automatically identify and accurately label digestive tract lesions, thereby improving the diagnostic efficiency of doctors, and creating significant clinical application value.</p><p><strong>Methods: </strong>In this paper, we propose a neural network model, WCE_Detection, for the accurate detection and classification of 23 classes of digestive tract lesion images. First, since multicategory lesion images exhibit various shapes and scales, a multidetection head strategy is adopted in the object detection network to increase the model's robustness for multiscale lesion detection. Moreover, a bidirectional feature pyramid network (BiFPN) is introduced, which effectively fuses shallow semantic features by adding skip connections, significantly reducing the detection error rate. On the basis of the above, we utilize the Swin Transformer with its unique self-attention mechanism and hierarchical structure in conjunction with the BiFPN feature fusion technique to enhance the feature representation of multicategory lesion images.</p><p><strong>Results: </strong>The model constructed in this study achieved an mAP50 of 91.5% for detecting 23 lesions. More than eleven single-category lesions achieved an mAP50 of over 99.4%, and more than twenty lesions had an mAP50 value of over 80%. These results indicate that the model outperforms other state-of-the-art models in the end-to-end integrated detection of human digestive tract lesion images.</p><p><strong>Conclusion: </strong>The deep learning-based object detection network detects multiple digestive tract lesions in WCE images with high accuracy, improving the diagnostic efficiency of doctors, and demonstrating significant clinical application value.</p>","PeriodicalId":23778,"journal":{"name":"World Journal of Gastroenterology","volume":"30 48","pages":"5111-5129"},"PeriodicalIF":4.3000,"publicationDate":"2024-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11612692/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Journal of Gastroenterology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3748/wjg.v30.i48.5111","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Wireless capsule endoscopy (WCE) has become an important noninvasive and portable tool for diagnosing digestive tract diseases and has been propelled by advancements in medical imaging technology. However, the complexity of the digestive tract structure, and the diversity of lesion types, results in different sites and types of lesions distinctly appearing in the images, posing a challenge for the accurate identification of digestive tract diseases.

Aim: To propose a deep learning-based lesion detection model to automatically identify and accurately label digestive tract lesions, thereby improving the diagnostic efficiency of doctors, and creating significant clinical application value.

Methods: In this paper, we propose a neural network model, WCE_Detection, for the accurate detection and classification of 23 classes of digestive tract lesion images. First, since multicategory lesion images exhibit various shapes and scales, a multidetection head strategy is adopted in the object detection network to increase the model's robustness for multiscale lesion detection. Moreover, a bidirectional feature pyramid network (BiFPN) is introduced, which effectively fuses shallow semantic features by adding skip connections, significantly reducing the detection error rate. On the basis of the above, we utilize the Swin Transformer with its unique self-attention mechanism and hierarchical structure in conjunction with the BiFPN feature fusion technique to enhance the feature representation of multicategory lesion images.

Results: The model constructed in this study achieved an mAP50 of 91.5% for detecting 23 lesions. More than eleven single-category lesions achieved an mAP50 of over 99.4%, and more than twenty lesions had an mAP50 value of over 80%. These results indicate that the model outperforms other state-of-the-art models in the end-to-end integrated detection of human digestive tract lesion images.

Conclusion: The deep learning-based object detection network detects multiple digestive tract lesions in WCE images with high accuracy, improving the diagnostic efficiency of doctors, and demonstrating significant clinical application value.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度学习模型的无线胶囊内窥镜多类别病变图像检测方法。
背景:随着医学影像技术的进步,无线胶囊内窥镜(WCE)已成为诊断消化道疾病的一种重要的无创便携式工具。然而,消化道结构的复杂性和病变类型的多样性,导致不同部位、不同类型的病变在图像中明显出现,给消化道疾病的准确识别带来了挑战。目的:提出一种基于深度学习的病变检测模型,自动识别并准确标记消化道病变,从而提高医生的诊断效率,具有重要的临床应用价值。方法:本文提出了一种神经网络模型WCE_Detection,用于23类消化道病变图像的准确检测和分类。首先,由于多类别病变图像具有不同的形状和尺度,因此在目标检测网络中采用了多检测头策略,以提高模型对多尺度病变检测的鲁棒性。引入双向特征金字塔网络(BiFPN),通过增加跳跃连接有效融合浅层语义特征,显著降低检测错误率。在此基础上,我们利用Swin Transformer独特的自关注机制和分层结构,结合BiFPN特征融合技术增强多类别病变图像的特征表征。结果:本研究构建的模型检测23个病变的mAP50为91.5%。超过99.4%的单一类型病变有11个以上,超过80%的病变有20多个。这些结果表明,该模型在人体消化道病变图像的端到端综合检测方面优于其他最先进的模型。结论:基于深度学习的目标检测网络对WCE图像中消化道多发病变的检测准确率较高,提高了医生的诊断效率,具有重要的临床应用价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
World Journal of Gastroenterology
World Journal of Gastroenterology 医学-胃肠肝病学
CiteScore
7.80
自引率
4.70%
发文量
464
审稿时长
2.4 months
期刊介绍: The primary aims of the WJG are to improve diagnostic, therapeutic and preventive modalities and the skills of clinicians and to guide clinical practice in gastroenterology and hepatology.
期刊最新文献
Exploring the links between gallstone disease, non-alcoholic fatty liver disease, and kidney stones: A path to comprehensive prevention. Exploring the therapeutic potential of glucagon-like peptide 1 agonists in metabolic disorders. Gel immersion in endoscopy: Exploring potential applications. Imaging characteristics of brain microstructure and cerebral perfusion in Crohn's disease patients with anxiety: A prospective comparative study. Impact of microplastics on the human digestive system: From basic to clinical.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1