Effectiveness of encoder-decoder deep learning approach for colorectal polyp segmentation in colonoscopy images

IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Applied Intelligence Pub Date : 2025-01-10 DOI:10.1007/s10489-024-06167-6
Ameer Hamza, Muhammad Bilal, Muhammad Ramzan, Nadia Malik
{"title":"Effectiveness of encoder-decoder deep learning approach for colorectal polyp segmentation in colonoscopy images","authors":"Ameer Hamza,&nbsp;Muhammad Bilal,&nbsp;Muhammad Ramzan,&nbsp;Nadia Malik","doi":"10.1007/s10489-024-06167-6","DOIUrl":null,"url":null,"abstract":"<div><p>Colorectal cancer is considered one of the deadliest diseases, contributing to an alarming increase in annual deaths worldwide, with colorectal polyps recognized as precursors to this malignancy. Early and accurate detection of these polyps is crucial for reducing the mortality rate of colorectal cancer. However, the manual detection of polyps is a time-consuming process and requires the expertise of trained medical professionals. Moreover, it often misses polyps due to their varied size, color, and texture. Computer-aided diagnosis systems offer potential improvements, but they often struggle with precision in complex visual environments. This study presents an enhanced deep learning approach using encoder-decoder architecture for colorectal polyp segmentation to capture and utilize complex feature representations. Our approach introduces an enhanced dual attention mechanism, combining spatial and channel-wise attention to focus precisely on critical features. Channel-wise attention, implemented via an optimized Squeeze-and-Excitation (S&amp;E) block, allows the network to capture comprehensive contextual information and interrelationships among different channels, ensuring a more refined feature selection process. The experimental results showed that the proposed model achieved a mean Intersection over Union (IoU) of 0.9054 and 0.9277, a dice coefficient of 0.9006 and 0.9128, a precision of 0.8985 and 0.9517, a recall of 0.9190 and 0.9094, and an accuracy of 0.9806 and 0.9907 on the Kvasir-SEG and CVC-ClinicDB datasets, respectively. Moreover, the proposed model outperforms the existing state-of-the-art resulting in improved patient outcomes with the potential to enhance the early detection of colorectal polyps.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 4","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-024-06167-6","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Colorectal cancer is considered one of the deadliest diseases, contributing to an alarming increase in annual deaths worldwide, with colorectal polyps recognized as precursors to this malignancy. Early and accurate detection of these polyps is crucial for reducing the mortality rate of colorectal cancer. However, the manual detection of polyps is a time-consuming process and requires the expertise of trained medical professionals. Moreover, it often misses polyps due to their varied size, color, and texture. Computer-aided diagnosis systems offer potential improvements, but they often struggle with precision in complex visual environments. This study presents an enhanced deep learning approach using encoder-decoder architecture for colorectal polyp segmentation to capture and utilize complex feature representations. Our approach introduces an enhanced dual attention mechanism, combining spatial and channel-wise attention to focus precisely on critical features. Channel-wise attention, implemented via an optimized Squeeze-and-Excitation (S&E) block, allows the network to capture comprehensive contextual information and interrelationships among different channels, ensuring a more refined feature selection process. The experimental results showed that the proposed model achieved a mean Intersection over Union (IoU) of 0.9054 and 0.9277, a dice coefficient of 0.9006 and 0.9128, a precision of 0.8985 and 0.9517, a recall of 0.9190 and 0.9094, and an accuracy of 0.9806 and 0.9907 on the Kvasir-SEG and CVC-ClinicDB datasets, respectively. Moreover, the proposed model outperforms the existing state-of-the-art resulting in improved patient outcomes with the potential to enhance the early detection of colorectal polyps.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
编码器-解码器深度学习方法在结肠镜图像中结肠息肉分割中的有效性
结直肠癌被认为是最致命的疾病之一,导致全球每年死亡人数的惊人增长,结直肠癌息肉被认为是这种恶性肿瘤的前兆。早期准确发现这些息肉对于降低结直肠癌的死亡率至关重要。然而,人工检测息肉是一个耗时的过程,需要训练有素的医疗专业人员的专业知识。此外,由于息肉的大小、颜色和质地各异,它经常会遗漏息肉。计算机辅助诊断系统提供了潜在的改进,但它们在复杂的视觉环境中往往难以达到精度。本研究提出了一种增强的深度学习方法,使用编码器-解码器架构用于结肠直肠息肉分割,以捕获和利用复杂的特征表示。我们的方法引入了一种增强的双重注意机制,将空间和渠道的注意结合起来,精确地关注关键特征。通过优化的挤压和激励(S&;E)块实现的通道智能关注,允许网络捕获不同通道之间的综合上下文信息和相互关系,确保更精细的特征选择过程。实验结果表明,该模型在Kvasir-SEG和CVC-ClinicDB数据集上的平均IoU分别为0.9054和0.9277,骰子系数分别为0.9006和0.9128,精度分别为0.8985和0.9517,召回率分别为0.9190和0.9094,准确率分别为0.9806和0.9907。此外,所提出的模型优于现有的最先进的技术,从而改善了患者的预后,有可能提高结肠直肠息肉的早期发现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Applied Intelligence
Applied Intelligence 工程技术-计算机:人工智能
CiteScore
6.60
自引率
20.80%
发文量
1361
审稿时长
5.9 months
期刊介绍: With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance. The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.
期刊最新文献
Multi-view learning based on product and process metrics for software defect prediction MVSRF: Point cloud semantic segmentation and optimization method for granular construction objects WLKA-RVS: a retinal vessel segmentation method using weighted large kernel attention DAAR: Dual attention cooperative adaptive pruning rate by data-driven for filter pruning Chaotic opposition-based plant propagation algorithm for engineering problem
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1