A hybrid network integrating convolution and transformer for thymoma segmentation

IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Intelligent medicine Pub Date : 2023-08-01 DOI:10.1016/j.imed.2022.06.003
Jingyuan Li , Wenfang Sun , Xiulong Feng , Karen M. von Deneen , Wen Wang , Guangbin Cui , Yi Zhang
{"title":"A hybrid network integrating convolution and transformer for thymoma segmentation","authors":"Jingyuan Li ,&nbsp;Wenfang Sun ,&nbsp;Xiulong Feng ,&nbsp;Karen M. von Deneen ,&nbsp;Wen Wang ,&nbsp;Guangbin Cui ,&nbsp;Yi Zhang","doi":"10.1016/j.imed.2022.06.003","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Manual segmentation of thymoma is an onerous, labor-intensive, and subjective task for radiologists. Accordingly, the development of an automatic and efficient method for thymoma segmentation can be valuable for the early detection and diagnosis of this malignancy.</p></div><div><h3>Methods</h3><p>Three hundred and ten subjects were enrolled in this retrospective study and all underwent CECT scans. All the scans were manually labeled by four experienced radiologists. The successful application of convolution neural networks (CNNs) and Transformer in computer vision led us to propose a hybrid CNN–Transformer architecture, named transformer attention Net (TA-Net), that would allow the utilization of both local information from CNN features and the global information encoded by Transformers. U-Net was used as the basic structure and Transformers were inserted into convolution blocks in the encoder. In addition, attention gates were embedded in skip connections to highlight salient features. Comparison of the accuracy, intersection over Union (IoU), Dice score, and Boundary F1 contour matching score (BFScore) between the predicted segmentation and the manual labels were utilized to evaluate segmentation performance.</p></div><div><h3>Results</h3><p>For thymoma segmentation using TA-Net, the accuracy, Dice score, IoU, and BFScore were 92.49%, 89.92%, 83.80%, and 0.8945, respectively, and no significant differences were detected among tumor types and enhanced phases. Our proposed method achieved the best performance when compared with state-of-the-art methods.</p></div><div><h3>Conclusion</h3><p>The proposed method, which combines CNNs with Transformer, achives outstanding performance in thymoma segmentation compared with previous methods. TA-Net may provide consistent and reproducible delineation, thereby assisting radiologists in clinical applications.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"3 3","pages":"Pages 164-172"},"PeriodicalIF":4.4000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent medicine","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266710262200050X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Manual segmentation of thymoma is an onerous, labor-intensive, and subjective task for radiologists. Accordingly, the development of an automatic and efficient method for thymoma segmentation can be valuable for the early detection and diagnosis of this malignancy.

Methods

Three hundred and ten subjects were enrolled in this retrospective study and all underwent CECT scans. All the scans were manually labeled by four experienced radiologists. The successful application of convolution neural networks (CNNs) and Transformer in computer vision led us to propose a hybrid CNN–Transformer architecture, named transformer attention Net (TA-Net), that would allow the utilization of both local information from CNN features and the global information encoded by Transformers. U-Net was used as the basic structure and Transformers were inserted into convolution blocks in the encoder. In addition, attention gates were embedded in skip connections to highlight salient features. Comparison of the accuracy, intersection over Union (IoU), Dice score, and Boundary F1 contour matching score (BFScore) between the predicted segmentation and the manual labels were utilized to evaluate segmentation performance.

Results

For thymoma segmentation using TA-Net, the accuracy, Dice score, IoU, and BFScore were 92.49%, 89.92%, 83.80%, and 0.8945, respectively, and no significant differences were detected among tumor types and enhanced phases. Our proposed method achieved the best performance when compared with state-of-the-art methods.

Conclusion

The proposed method, which combines CNNs with Transformer, achives outstanding performance in thymoma segmentation compared with previous methods. TA-Net may provide consistent and reproducible delineation, thereby assisting radiologists in clinical applications.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种融合卷积和变换器的混合网络用于胸腺瘤分割
背景胸腺瘤的手工分割对放射科医生来说是一项繁重、劳动密集和主观的任务。因此,开发一种自动有效的胸腺瘤分割方法对这种恶性肿瘤的早期检测和诊断具有价值。方法310名受试者参加了这项回顾性研究,所有受试者都接受了CECT扫描。所有的扫描都是由四位经验丰富的放射科医生手动标记的。卷积神经网络(CNNs)和Transformer在计算机视觉中的成功应用使我们提出了一种混合的CNN-Transformer架构,称为Transformer注意力网(TA-Net),该架构将允许利用来自CNN特征的局部信息和Transformer编码的全局信息。U-Net被用作基本结构,变换器被插入编码器的卷积块中。此外,在跳跃连接中嵌入了注意力门,以突出突出的特征。利用预测分割和手动标签之间的准确性、联合交集(IoU)、骰子分数和边界F1轮廓匹配分数(BFScore)的比较来评估分割性能。结果TA-Net对胸腺瘤的分割准确率、Dice评分、IoU和BFScore分别为92.49%、89.92%、83.80%和0.8945,肿瘤类型和增强期之间无显著差异。与最先进的方法相比,我们提出的方法实现了最佳性能。结论与以往的方法相比,该方法将细胞神经网络与Transformer相结合,在胸腺瘤分割中取得了显著的效果。TA Net可以提供一致且可重复的描绘,从而帮助放射科医生进行临床应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Intelligent medicine
Intelligent medicine Surgery, Radiology and Imaging, Artificial Intelligence, Biomedical Engineering
CiteScore
5.20
自引率
0.00%
发文量
19
期刊最新文献
Impact of data balancing a multiclass dataset before the creation of association rules to study bacterial vaginosis Neuropsychological detection and prediction using machine learning algorithms: a comprehensive review Improved neurological diagnoses and treatment strategies via automated human brain tissue segmentation from clinical magnetic resonance imaging Increasing the accuracy and reproducibility of positron emission tomography radiomics for predicting pelvic lymph node metastasis in patients with cervical cancer using 3D local binary pattern-based texture features A clinical decision support system using rough set theory and machine learning for disease prediction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1