{"title":"3D ShiftBTS: Shift Operation for 3D Multimodal Brain Tumor Segmentation.","authors":"Guangqi Yang, Xiaoxin Guo, Haoran Zhang, Zhenyuan Zheng, Hongliang Dong, Songbai Xu","doi":"10.1109/JBHI.2025.3552166","DOIUrl":null,"url":null,"abstract":"<p><p>Recently, ShiftViT and its variants have attracted much attention for their simple and efficient shift operation, showing excellent efficacy in several tasks on natural images, surpassing Swin Transformer. However, considering the complexity of 3D multimodal images, which have higher dimensions than natural images, and the relative stability of the human tissue structure in medical images, the applicability of shift operation on 3D multimodal medical data has yet to be determined. This paper demonstrates that ShiftViT has enormous potential in 3D multimodal medical image analysis. Using 3D medical image segmentation as a representative downstream task, we investigate how shift operation can improve model performance. First, applying ShiftViT to 3D multimodal medical images not only effectively extracts global information but also significantly enhances the model's performance. Second, as a plug-and-play strategy, the shift operation can be integrated with other modules without adding additional computational burden, proving its flexibility in the overall system. Finally, we further investigate the generalizability of the shift operation by introducing a cascaded attention module, which provides useful insights to improve the generalizability of 3D medical image segmentation models. Through this study, we extend the application scope of ShiftViT and bring new exploration directions to the field of 3D multimodal medical image analysis. Our research results prove the feasibility of applying ShiftViT in 3D multimodal medical images and provide an effective and scalable model, which is expected to further promote the development of medical image processing technology. The code will be published in https://github.com/ydlam/3D-ShiftBTS.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Biomedical and Health Informatics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1109/JBHI.2025.3552166","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, ShiftViT and its variants have attracted much attention for their simple and efficient shift operation, showing excellent efficacy in several tasks on natural images, surpassing Swin Transformer. However, considering the complexity of 3D multimodal images, which have higher dimensions than natural images, and the relative stability of the human tissue structure in medical images, the applicability of shift operation on 3D multimodal medical data has yet to be determined. This paper demonstrates that ShiftViT has enormous potential in 3D multimodal medical image analysis. Using 3D medical image segmentation as a representative downstream task, we investigate how shift operation can improve model performance. First, applying ShiftViT to 3D multimodal medical images not only effectively extracts global information but also significantly enhances the model's performance. Second, as a plug-and-play strategy, the shift operation can be integrated with other modules without adding additional computational burden, proving its flexibility in the overall system. Finally, we further investigate the generalizability of the shift operation by introducing a cascaded attention module, which provides useful insights to improve the generalizability of 3D medical image segmentation models. Through this study, we extend the application scope of ShiftViT and bring new exploration directions to the field of 3D multimodal medical image analysis. Our research results prove the feasibility of applying ShiftViT in 3D multimodal medical images and provide an effective and scalable model, which is expected to further promote the development of medical image processing technology. The code will be published in https://github.com/ydlam/3D-ShiftBTS.
期刊介绍:
IEEE Journal of Biomedical and Health Informatics publishes original papers presenting recent advances where information and communication technologies intersect with health, healthcare, life sciences, and biomedicine. Topics include acquisition, transmission, storage, retrieval, management, and analysis of biomedical and health information. The journal covers applications of information technologies in healthcare, patient monitoring, preventive care, early disease diagnosis, therapy discovery, and personalized treatment protocols. It explores electronic medical and health records, clinical information systems, decision support systems, medical and biological imaging informatics, wearable systems, body area/sensor networks, and more. Integration-related topics like interoperability, evidence-based medicine, and secure patient data are also addressed.