ACU-TransNet: Attention and convolution-augmented UNet-transformer network for polyp segmentation.

IF 1.7 | Zone 3 (Medicine) | Q3 INSTRUMENTS & INSTRUMENTATION | Journal of X-Ray Science and Technology | Pub Date: 2024-10-12 | DOI: 10.3233/XST-240076
Lei Huang, Yun Wu
Citations: 0

Abstract

ACU-TransNet: Attention and convolution-augmented UNet-transformer network for polyp segmentation.

Background: UNet has achieved great success in medical image segmentation. However, due to the inherent locality of convolution operations, UNet is deficient in capturing global features and long-range dependencies of polyps, resulting in less accurate polyp recognition for complex morphologies and backgrounds. Transformers, with their sequential operations, are better at perceiving global features but lack low-level details, leading to limited localization ability. If the advantages of both architectures can be effectively combined, the accuracy of polyp segmentation can be further improved.

Methods: In this paper, we propose an attention and convolution-augmented UNet-Transformer Network (ACU-TransNet) for polyp segmentation. This network is composed of the comprehensive attention UNet and the Transformer head, sequentially connected by the bridge layer. On the one hand, the comprehensive attention UNet enhances specific feature extraction through deformable convolution and channel attention in the first layer of the encoder and achieves more accurate shape extraction through spatial attention and channel attention in the decoder. On the other hand, the Transformer head supplements fine-grained information through convolutional attention and acquires hierarchical global characteristics from the feature maps.
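
To make the attention terms in the Methods paragraph concrete, here is a minimal sketch of channel gating and spatial gating on a small feature map. It is an illustration under loud assumptions, not the authors' implementation: the gates are parameter-free (a sigmoid of an average), whereas practical channel and spatial attention modules use learned weights, and the function names `channel_attention` and `spatial_attention` are hypothetical. The abstract does not specify the exact modules, the deformable convolution, or the bridge layer, so none of those are modeled here.

```python
import math


def sigmoid(x):
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(fmap):
    """Rescale each channel by a gate derived from its global average.

    fmap is a list of C channels, each an H x W nested list of floats.
    Assumption: the gate is sigmoid(mean of the channel), with no learned weights.
    """
    out = []
    for channel in fmap:
        count = sum(len(row) for row in channel)
        mean = sum(sum(row) for row in channel) / count
        gate = sigmoid(mean)
        out.append([[value * gate for value in row] for row in channel])
    return out


def spatial_attention(fmap):
    """Rescale each pixel by a gate derived from the mean across channels.

    Assumption: the gate at (i, j) is sigmoid(mean over channels at (i, j)),
    again with no learned weights.
    """
    channels = len(fmap)
    height = len(fmap[0])
    width = len(fmap[0][0])
    gates = [
        [sigmoid(sum(fmap[c][i][j] for c in range(channels)) / channels)
         for j in range(width)]
        for i in range(height)
    ]
    return [
        [[fmap[c][i][j] * gates[i][j] for j in range(width)]
         for i in range(height)]
        for c in range(channels)
    ]


# Toy input: two channels of size 2 x 2.
feature_map = [
    [[0.0, 0.0], [0.0, 0.0]],
    [[2.0, 2.0], [2.0, 2.0]],
]
refined = spatial_attention(channel_attention(feature_map))
```

Channel gating decides which feature channels to emphasize, while spatial gating decides where in the image to emphasize them; the abstract attributes both to the decoder and channel attention to the first encoder layer.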

Results: ACU-TransNet could comprehensively learn dataset features and enhance colonoscopy interpretability for polyp detection.

Conclusion: Experimental results on the CVC-ClinicDB and Kvasir-SEG datasets demonstrate that ACU-TransNet outperforms existing state-of-the-art methods, showcasing its robustness.

Source journal
CiteScore: 4.90
Self-citation rate: 23.30%
Articles published: 150
Review time: 3 months
About the journal: Research areas within the scope of the journal include:
Interaction of x-rays with matter: x-ray phenomena, biological effects of radiation, radiation safety and optical constants.
X-ray sources: x-rays from synchrotrons, x-ray lasers, plasmas, and other sources, conventional or unconventional.
Optical elements: grazing incidence optics, multilayer mirrors, zone plates, gratings, other diffraction optics.
Optical instruments: interferometers, spectrometers, microscopes, telescopes, microprobes.
Latest articles from this journal
Industrial digital radiographic image denoising based on improved KBNet.
Research on the effectiveness of multi-view slice correction strategy based on deep learning in high pitch helical CT reconstruction.
A fully linearized ADMM algorithm for optimization based image reconstruction.
A reconstruction method for ptychography based on residual dense network.
Can AI generate diagnostic reports for radiologist approval on CXR images? A multi-reader and multi-case observer performance study.