Xiaodong Zhang, Yongquan Zhang, Changmiao Wang, Lin Li, Fengjun Zhu, Yang Sun, Tong Mo, Qingmao Hu, Jinping Xu, Dezhi Cao
Journal: Insights into Imaging. Published: 2024-09-12. DOI: 10.1186/s13244-024-01803-8 (https://doi.org/10.1186/s13244-024-01803-8). Impact factor: 4.1; JCR: Q1 (Radiology, Nuclear Medicine & Medical Imaging).
Focal cortical dysplasia lesion segmentation using multiscale transformer
Accurate segmentation of focal cortical dysplasia (FCD) lesions from MR images plays an important role in surgical planning and decision-making but remains challenging for radiologists and clinicians. In this study, we introduce a novel transformer-based model designed for the end-to-end segmentation of FCD lesions from multi-channel MR images. The core innovation of our proposed model is the integration of a convolutional neural network-based encoder-decoder structure with a multiscale transformer to augment the feature representation of lesions in the global field of view. Transformer pathways, composed of memory- and computation-efficient dual-self-attention modules, leverage feature maps from varying depths of the encoder to discern long-range interdependencies among feature positions and channels, thereby emphasizing areas and channels relevant to lesions. The proposed model was trained and evaluated on a publicly available dataset comprising MR images of 85 patients, using both subject-level and voxel-level metrics. Experimental results indicate that our model offers superior performance both quantitatively and qualitatively. It successfully identified lesions in 82.4% of patients, with a low false-positive lesion cluster rate of 0.176 ± 0.381 per patient. Furthermore, the model achieved an average Dice coefficient of 0.410 ± 0.288, outperforming five established methods. Integration of the transformer could enhance the feature representation and segmentation performance of FCD lesions. The proposed model has the potential to serve as a valuable assistive tool for physicians, enabling rapid and accurate identification of FCD lesions. The source code and pre-trained model weights are available at https://github.com/zhangxd0530/MS-DSA-NET.
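As a rough illustration of the dual-self-attention idea described in the abstract (attention over feature positions and over feature channels), the following NumPy sketch applies both kinds of attention to a flattened feature map. It is a simplified, hypothetical version: it uses no learned projections, it does not reproduce the memory- and computation-efficient design of the authors' modules, and the names `softmax` and `dual_self_attention` are assumptions made for illustration. The authors' actual implementation is in the linked repository.

```python
import numpy as np


def softmax(z, axis=-1):
    """Numerically stable softmax along the given axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def dual_self_attention(x):
    """Apply position and channel self-attention to a feature map.

    x has shape (C, N): C channels, N flattened spatial positions.
    Returns the attended features plus both attention matrices.
    Assumption: plain dot-product similarity, no learned weights.
    """
    c, n = x.shape

    # Position attention: each position attends to every other position.
    pos_weights = softmax(x.T @ x / np.sqrt(c), axis=-1)   # (N, N)
    pos_out = x @ pos_weights.T                            # (C, N)

    # Channel attention: each channel attends to every other channel.
    chan_weights = softmax(x @ x.T / np.sqrt(n), axis=-1)  # (C, C)
    chan_out = chan_weights @ x                            # (C, N)

    # Residual sum of the input and the two attention branches.
    return x + pos_out + chan_out, pos_weights, chan_weights


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    features = rng.standard_normal((4, 6))  # 4 channels, 6 positions
    out, pos_w, chan_w = dual_self_attention(features)
    print(out.shape, pos_w.shape, chan_w.shape)
```

Note that the position-attention matrix grows quadratically with the number of positions, which is one reason efficient variants matter for 3D MR volumes.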
This multiscale transformer-based model performs segmentation of focal cortical dysplasia lesions, aiming to help radiologists and clinicians make accurate and efficient preoperative evaluations of focal cortical dysplasia patients from MR images.
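The voxel-level Dice coefficient reported in the abstract measures the overlap between a predicted lesion mask P and a reference mask T as 2|P ∩ T| / (|P| + |T|). The sketch below computes it in plain Python for flattened binary masks. The function name `dice_coefficient` and the convention of returning 1.0 when both masks are empty are assumptions for illustration, not details taken from the paper.

```python
from typing import Sequence


def dice_coefficient(pred: Sequence[int], truth: Sequence[int]) -> float:
    """Dice overlap between two flattened binary masks of equal length."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of voxels")

    # Voxels marked as lesion in both masks.
    intersection = sum(1 for p, t in zip(pred, truth) if p and t)
    # Total lesion voxels across the two masks.
    total = sum(1 for p in pred if p) + sum(1 for t in truth if t)

    if total == 0:
        # Assumed convention: two empty masks agree perfectly.
        return 1.0
    return 2.0 * intersection / total


if __name__ == "__main__":
    predicted = [1, 1, 0, 0, 1]
    reference = [1, 0, 0, 1, 1]
    # 2 shared voxels, 3 + 3 lesion voxels in total: 2 * 2 / 6
    print(round(dice_coefficient(predicted, reference), 3))  # 0.667
```

A Dice value of 0 means no overlap and 1 means identical masks, so the reported mean of 0.410 reflects partial overlap averaged across patients.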
About the journal:
Insights into Imaging (I³) is a peer-reviewed open access journal published under the brand SpringerOpen. All content published in the journal is freely available online to anyone, anywhere!
I³ continuously updates scientific knowledge and progress in best-practice standards in radiology through the publication of original articles and state-of-the-art reviews and opinions, along with recommendations and statements from the leading radiological societies in Europe.
Founded by the European Society of Radiology (ESR), I³ creates a platform for educational material, guidelines and recommendations, and a forum for topics of controversy.
A balanced combination of review articles, original papers, short communications from European radiological congresses and information on society matters makes I³ an indispensable source for current information in this field.
I³ is owned by the ESR; however, authors retain copyright to their articles under the Creative Commons Attribution License (see Copyright and License Agreement). All articles can be read, redistributed and reused for free, as long as the author of the original work is cited properly.
The open access fees (article-processing charges) for this journal are kindly sponsored by ESR for all Members.
The journal went open access in 2012, which means that all articles published since then are freely available online.