CATNet: A Cross Attention and Texture-Aware Network for Polyp Segmentation

IF 3 4区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC International Journal of Imaging Systems and Technology Pub Date : 2024-11-16 DOI:10.1002/ima.23220
Zhifang Deng, Yangdong Wu
{"title":"CATNet: A Cross Attention and Texture-Aware Network for Polyp Segmentation","authors":"Zhifang Deng,&nbsp;Yangdong Wu","doi":"10.1002/ima.23220","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Polyp segmentation is a challenging task, as some polyps exhibit similar textures to surrounding tissues, making them difficult to distinguish. Therefore, we present a parallel cross-attention and texture-aware network to address this challenging task. CATNet incorporates the parallel cross-attention mechanism, Residual Feature Fusion Module, and texture-aware module. Initially, polyp images undergo processing in our backbone network to extract multi-level polyp features. Subsequently, the parallel cross-attention mechanism sequentially captures channel and spatial dependencies across multi-scale polyp features, thereby yielding enhanced representations. These enhanced representations are then input into multiple texture-aware modules, which facilitate polyp segmentation by accentuating subtle textural disparities between polyps and the background. Finally, the Residual Feature Fusion module integrates the segmentation results with the previous layer of enhanced representations. This process serves to eliminate background noise and enhance intricate details. We assess the efficacy of our proposed method across five distinct polyp datasets. On three unseen datasets, CVC-300, CVC-ColonDB, and ETIS. We achieve mDice scores of 0.916, 0.817, and 0.777, respectively. Experimental results unequivocally demonstrate the superior performance of our approach over current models. The proposed CATNet addresses the challenges posed by textural similarities, setting a benchmark for future advancements in automated polyp detection and segmentation.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"34 6","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.23220","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Polyp segmentation is a challenging task, as some polyps exhibit similar textures to surrounding tissues, making them difficult to distinguish. Therefore, we present a parallel cross-attention and texture-aware network to address this challenging task. CATNet incorporates the parallel cross-attention mechanism, Residual Feature Fusion Module, and texture-aware module. Initially, polyp images undergo processing in our backbone network to extract multi-level polyp features. Subsequently, the parallel cross-attention mechanism sequentially captures channel and spatial dependencies across multi-scale polyp features, thereby yielding enhanced representations. These enhanced representations are then input into multiple texture-aware modules, which facilitate polyp segmentation by accentuating subtle textural disparities between polyps and the background. Finally, the Residual Feature Fusion module integrates the segmentation results with the previous layer of enhanced representations. This process serves to eliminate background noise and enhance intricate details. We assess the efficacy of our proposed method across five distinct polyp datasets. On three unseen datasets, CVC-300, CVC-ColonDB, and ETIS. We achieve mDice scores of 0.916, 0.817, and 0.777, respectively. Experimental results unequivocally demonstrate the superior performance of our approach over current models. The proposed CATNet addresses the challenges posed by textural similarities, setting a benchmark for future advancements in automated polyp detection and segmentation.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
CATNet:用于息肉分割的交叉注意和纹理感知网络
息肉分割是一项具有挑战性的任务,因为有些息肉的纹理与周围组织相似,因此难以区分。因此,我们提出了一种并行交叉注意和纹理感知网络来解决这一具有挑战性的任务。CATNet 包含并行交叉注意机制、残留特征融合模块和纹理感知模块。首先,息肉图像在主干网络中进行处理,提取多层次息肉特征。随后,并行交叉注意机制会依次捕捉多尺度息肉特征之间的通道和空间依赖关系,从而生成增强表征。这些增强表征随后被输入多个纹理感知模块,通过突出息肉与背景之间微妙的纹理差异,促进息肉分割。最后,残差特征融合模块将分割结果与前一层增强表征进行整合。这一过程可消除背景噪音,增强复杂细节。我们在五个不同的息肉数据集上评估了我们提出的方法的有效性。在三个未见数据集(CVC-300、CVC-ColonDB 和 ETIS)上,我们的 mDice 得分均为 0.5。我们的 mDice 分数分别达到 0.916、0.817 和 0.777。实验结果清楚地表明,我们的方法比现有模型性能更优越。提出的 CATNet 解决了纹理相似性带来的挑战,为未来息肉自动检测和分割的进步树立了标杆。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International Journal of Imaging Systems and Technology
International Journal of Imaging Systems and Technology 工程技术-成像科学与照相技术
CiteScore
6.90
自引率
6.10%
发文量
138
审稿时长
3 months
期刊介绍: The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals. IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging. The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered. The scope of the journal includes, but is not limited to, the following in the context of biomedical research: Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.; Neuromodulation and brain stimulation techniques such as TMS and tDCS; Software and hardware for imaging, especially related to human and animal health; Image segmentation in normal and clinical populations; Pattern analysis and classification using machine learning techniques; Computational modeling and analysis; Brain connectivity and connectomics; Systems-level characterization of brain function; Neural networks and neurorobotics; Computer vision, based on human/animal physiology; Brain-computer interface (BCI) technology; Big data, databasing and data mining.
期刊最新文献
Predicting the Early Detection of Breast Cancer Using Hybrid Machine Learning Systems and Thermographic Imaging CATNet: A Cross Attention and Texture-Aware Network for Polyp Segmentation VMC-UNet: A Vision Mamba-CNN U-Net for Tumor Segmentation in Breast Ultrasound Image Suppression of the Tissue Component With the Total Least-Squares Algorithm to Improve Second Harmonic Imaging of Ultrasound Contrast Agents Segmentation and Classification of Breast Masses From the Whole Mammography Images Using Transfer Learning and BI-RADS Characteristics
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1