A CNN-Transformer-based Approach for Medical Image Segmentation

Thi-Thao Tran, Dinh-Thien Vu, Thi-Hue Nguyen, Van-Truong Pham
{"title":"A CNN-Transformer-based Approach for Medical Image Segmentation","authors":"Thi-Thao Tran, Dinh-Thien Vu, Thi-Hue Nguyen, Van-Truong Pham","doi":"10.1109/ICSSE58758.2023.10227162","DOIUrl":null,"url":null,"abstract":"Advances in deep convolutional neural networks (CNNs) have shown excellent performances on image processing applications including segmentation for medical images. Nevertheless, CNN-based approaches like the Fully Convolutional Neural Networks (FCNs), Unet and variants for image segmentation often meet difficulties when expressing long-range dependency because of the locality properties of convolutional operations. In an alternative, the network models based on transformers have global context of the image and features, thus better expressing long-range dependency. Though having advantages, the transformer-based approach often lacks local information context, thus limiting certain applications like medical images. In the current study, we propose a new model that can inherit advantages of both global and local contexts of the two above approaches by using CNN and Transformer branches, and introduced the Convmixer and Progressive Atrous Spatial Pyramidal Pooling modules in the bottlenecks of each branches. The proposed model has been validated on various medical image databases including the Data science bowls 2018, and GlaS datasets. High evaluation scores including Dice score and Intersection Over Union metric have shown performance of the proposed segmentation model while compared with recent neural network models.","PeriodicalId":280745,"journal":{"name":"2023 International Conference on System Science and Engineering (ICSSE)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on System Science and Engineering (ICSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSSE58758.2023.10227162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Advances in deep convolutional neural networks (CNNs) have shown excellent performances on image processing applications including segmentation for medical images. Nevertheless, CNN-based approaches like the Fully Convolutional Neural Networks (FCNs), Unet and variants for image segmentation often meet difficulties when expressing long-range dependency because of the locality properties of convolutional operations. In an alternative, the network models based on transformers have global context of the image and features, thus better expressing long-range dependency. Though having advantages, the transformer-based approach often lacks local information context, thus limiting certain applications like medical images. In the current study, we propose a new model that can inherit advantages of both global and local contexts of the two above approaches by using CNN and Transformer branches, and introduced the Convmixer and Progressive Atrous Spatial Pyramidal Pooling modules in the bottlenecks of each branches. The proposed model has been validated on various medical image databases including the Data science bowls 2018, and GlaS datasets. High evaluation scores including Dice score and Intersection Over Union metric have shown performance of the proposed segmentation model while compared with recent neural network models.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于cnn变换的医学图像分割方法
深度卷积神经网络(cnn)在医学图像分割等图像处理应用中表现出优异的性能。然而,基于cnn的方法,如全卷积神经网络(Fully Convolutional Neural Networks, fcn)、Unet和图像分割的变体,由于卷积运算的局部性,在表达远程依赖关系时往往会遇到困难。另一方面,基于变压器的网络模型具有图像和特征的全局上下文,从而更好地表达远程依赖性。尽管具有优势,但基于转换器的方法通常缺乏本地信息上下文,从而限制了诸如医学图像之类的某些应用。在本研究中,我们提出了一种新的模型,通过使用CNN和Transformer分支来继承上述两种方法的全局和局部上下文优势,并在每个分支的瓶颈处引入了Convmixer和Progressive Atrous Spatial Pyramidal Pooling模块。该模型已在各种医学图像数据库(包括2018年数据科学碗和GlaS数据集)上进行了验证。与最近的神经网络模型相比,包括Dice分数和Intersection Over Union度量在内的高评价分数显示了所提出的分割模型的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Deep Q-Network (DQN) Approach for Automatic Vehicles Applied in the Intelligent Transportation System (ITS) Improvement in Proportional Energy Sharing and DC Bus Voltage Restoring for DC Microgrid in the Islanded Operation Mode A New Buck-Boost Converter Structure With Improved Efficiency Performance of Energy Harvesting Aided Multi-hop Mobile Relay Networks With and Without Using Cooperative Communication A New Novel of Prescribed Optimal Control and Its Application for Smart Damping System
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1