Jingyuan Li , Wenfang Sun , Xiulong Feng , Karen M. von Deneen , Wen Wang , Guangbin Cui , Yi Zhang
{"title":"一种融合卷积和变换器的混合网络用于胸腺瘤分割","authors":"Jingyuan Li , Wenfang Sun , Xiulong Feng , Karen M. von Deneen , Wen Wang , Guangbin Cui , Yi Zhang","doi":"10.1016/j.imed.2022.06.003","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Manual segmentation of thymoma is an onerous, labor-intensive, and subjective task for radiologists. Accordingly, the development of an automatic and efficient method for thymoma segmentation can be valuable for the early detection and diagnosis of this malignancy.</p></div><div><h3>Methods</h3><p>Three hundred and ten subjects were enrolled in this retrospective study and all underwent CECT scans. All the scans were manually labeled by four experienced radiologists. The successful application of convolution neural networks (CNNs) and Transformer in computer vision led us to propose a hybrid CNN–Transformer architecture, named transformer attention Net (TA-Net), that would allow the utilization of both local information from CNN features and the global information encoded by Transformers. U-Net was used as the basic structure and Transformers were inserted into convolution blocks in the encoder. In addition, attention gates were embedded in skip connections to highlight salient features. Comparison of the accuracy, intersection over Union (IoU), Dice score, and Boundary F1 contour matching score (BFScore) between the predicted segmentation and the manual labels were utilized to evaluate segmentation performance.</p></div><div><h3>Results</h3><p>For thymoma segmentation using TA-Net, the accuracy, Dice score, IoU, and BFScore were 92.49%, 89.92%, 83.80%, and 0.8945, respectively, and no significant differences were detected among tumor types and enhanced phases. Our proposed method achieved the best performance when compared with state-of-the-art methods.</p></div><div><h3>Conclusion</h3><p>The proposed method, which combines CNNs with Transformer, achives outstanding performance in thymoma segmentation compared with previous methods. TA-Net may provide consistent and reproducible delineation, thereby assisting radiologists in clinical applications.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"3 3","pages":"Pages 164-172"},"PeriodicalIF":4.4000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A hybrid network integrating convolution and transformer for thymoma segmentation\",\"authors\":\"Jingyuan Li , Wenfang Sun , Xiulong Feng , Karen M. von Deneen , Wen Wang , Guangbin Cui , Yi Zhang\",\"doi\":\"10.1016/j.imed.2022.06.003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><p>Manual segmentation of thymoma is an onerous, labor-intensive, and subjective task for radiologists. Accordingly, the development of an automatic and efficient method for thymoma segmentation can be valuable for the early detection and diagnosis of this malignancy.</p></div><div><h3>Methods</h3><p>Three hundred and ten subjects were enrolled in this retrospective study and all underwent CECT scans. All the scans were manually labeled by four experienced radiologists. The successful application of convolution neural networks (CNNs) and Transformer in computer vision led us to propose a hybrid CNN–Transformer architecture, named transformer attention Net (TA-Net), that would allow the utilization of both local information from CNN features and the global information encoded by Transformers. U-Net was used as the basic structure and Transformers were inserted into convolution blocks in the encoder. In addition, attention gates were embedded in skip connections to highlight salient features. Comparison of the accuracy, intersection over Union (IoU), Dice score, and Boundary F1 contour matching score (BFScore) between the predicted segmentation and the manual labels were utilized to evaluate segmentation performance.</p></div><div><h3>Results</h3><p>For thymoma segmentation using TA-Net, the accuracy, Dice score, IoU, and BFScore were 92.49%, 89.92%, 83.80%, and 0.8945, respectively, and no significant differences were detected among tumor types and enhanced phases. Our proposed method achieved the best performance when compared with state-of-the-art methods.</p></div><div><h3>Conclusion</h3><p>The proposed method, which combines CNNs with Transformer, achives outstanding performance in thymoma segmentation compared with previous methods. TA-Net may provide consistent and reproducible delineation, thereby assisting radiologists in clinical applications.</p></div>\",\"PeriodicalId\":73400,\"journal\":{\"name\":\"Intelligent medicine\",\"volume\":\"3 3\",\"pages\":\"Pages 164-172\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2023-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Intelligent medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S266710262200050X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent medicine","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266710262200050X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
A hybrid network integrating convolution and transformer for thymoma segmentation
Background
Manual segmentation of thymoma is an onerous, labor-intensive, and subjective task for radiologists. Accordingly, the development of an automatic and efficient method for thymoma segmentation can be valuable for the early detection and diagnosis of this malignancy.
Methods
Three hundred and ten subjects were enrolled in this retrospective study and all underwent CECT scans. All the scans were manually labeled by four experienced radiologists. The successful application of convolution neural networks (CNNs) and Transformer in computer vision led us to propose a hybrid CNN–Transformer architecture, named transformer attention Net (TA-Net), that would allow the utilization of both local information from CNN features and the global information encoded by Transformers. U-Net was used as the basic structure and Transformers were inserted into convolution blocks in the encoder. In addition, attention gates were embedded in skip connections to highlight salient features. Comparison of the accuracy, intersection over Union (IoU), Dice score, and Boundary F1 contour matching score (BFScore) between the predicted segmentation and the manual labels were utilized to evaluate segmentation performance.
Results
For thymoma segmentation using TA-Net, the accuracy, Dice score, IoU, and BFScore were 92.49%, 89.92%, 83.80%, and 0.8945, respectively, and no significant differences were detected among tumor types and enhanced phases. Our proposed method achieved the best performance when compared with state-of-the-art methods.
Conclusion
The proposed method, which combines CNNs with Transformer, achives outstanding performance in thymoma segmentation compared with previous methods. TA-Net may provide consistent and reproducible delineation, thereby assisting radiologists in clinical applications.