Context Aware Automatic Polyp Segmentation Network With Mask Attention

IEEE transactions on artificial intelligence Pub Date : 2024-03-11 DOI:10.1109/TAI.2024.3375832

Praveer Saxena;Ashish Kumar Bhandari

{"title":"Context Aware Automatic Polyp Segmentation Network With Mask Attention","authors":"Praveer Saxena;Ashish Kumar Bhandari","doi":"10.1109/TAI.2024.3375832","DOIUrl":null,"url":null,"abstract":"Colorectal cancer stands out as a major factor in cancer-related fatalities. The prevention of colorectal cancer may be aided by early polyp diagnosis. Colonoscopy is a widely used procedure for the diagnosis of polyps, but it is highly dependent on the skills of the medical practitioner. Automatic polyp segmentation using computer-aided diagnosis can help medical practitioners detect even those polyps missed by humans, and this early detection of polyps can save precious human lives. Due to the lack of distinct edges, poor contrast between the foreground and background, and great variety of polyps, automatic segmentation of polyps is quite difficult. Although there are several deep learning-based strategies for segmenting polyps, typical convolutional neural network (CNN)-based algorithms lack long-range dependencies and lose spatial information because of consecutive convolution and pooling. In this research, a novel encoder–decoder-based segmentation architecture has been proposed in an effort to identify distinguishing features that can be used to precisely separate the polyps. The proposed architecture combines the strengths of a pretrained ResNet50 encoder, residual block, our proposed multiscale dilation block, and the mask attention block. Multiscale dilation block enables us to extract features at different scales for better feature representation. The mask attention block utilizes a generated auxiliary mask in order to concentrate on important image features. To evaluate the proposed architecture, several polyp segmentation datasets have been used. The obtained findings show that the suggested architecture performs better than several state-of-the-art (SOTA) approaches for segmenting the polyps.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 7","pages":"3510-3523"},"PeriodicalIF":0.0000,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10466639/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Colorectal cancer stands out as a major factor in cancer-related fatalities. The prevention of colorectal cancer may be aided by early polyp diagnosis. Colonoscopy is a widely used procedure for the diagnosis of polyps, but it is highly dependent on the skills of the medical practitioner. Automatic polyp segmentation using computer-aided diagnosis can help medical practitioners detect even those polyps missed by humans, and this early detection of polyps can save precious human lives. Due to the lack of distinct edges, poor contrast between the foreground and background, and great variety of polyps, automatic segmentation of polyps is quite difficult. Although there are several deep learning-based strategies for segmenting polyps, typical convolutional neural network (CNN)-based algorithms lack long-range dependencies and lose spatial information because of consecutive convolution and pooling. In this research, a novel encoder–decoder-based segmentation architecture has been proposed in an effort to identify distinguishing features that can be used to precisely separate the polyps. The proposed architecture combines the strengths of a pretrained ResNet50 encoder, residual block, our proposed multiscale dilation block, and the mask attention block. Multiscale dilation block enables us to extract features at different scales for better feature representation. The mask attention block utilizes a generated auxiliary mask in order to concentrate on important image features. To evaluate the proposed architecture, several polyp segmentation datasets have been used. The obtained findings show that the suggested architecture performs better than several state-of-the-art (SOTA) approaches for segmenting the polyps.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

具有掩码注意力的上下文感知自动息肉分割网络

大肠癌是导致癌症相关死亡的主要因素。早期诊断息肉有助于预防大肠癌。结肠镜检查是一种广泛使用的息肉诊断程序，但它在很大程度上依赖于医生的技能。使用计算机辅助诊断技术进行息肉自动分割可以帮助医疗从业人员发现那些被人类遗漏的息肉，而息肉的早期发现可以挽救宝贵的生命。由于息肉缺乏明显的边缘、前景与背景对比度差以及种类繁多，息肉的自动分割相当困难。虽然有几种基于深度学习的息肉分割策略，但典型的基于卷积神经网络（CNN）的算法缺乏长程依赖性，并且由于连续卷积和池化而丢失了空间信息。本研究提出了一种基于编码器-解码器的新型分割架构，旨在找出可用于精确分离息肉的区别特征。所提出的架构结合了预训练的 ResNet50 编码器、残差块、我们提出的多尺度扩张块和掩膜关注块的优势。多尺度扩张块使我们能够提取不同尺度的特征，从而获得更好的特征表示。掩码关注块利用生成的辅助掩码，以便将注意力集中在重要的图像特征上。为了评估所提出的架构，我们使用了几个息肉分割数据集。结果表明，在息肉分割方面，建议的架构比几种最先进的（SOTA）方法表现更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊