Ensemble Learning for Three-dimensional Medical Image Segmentation of Organ at Risk in Brachytherapy Using Double U-Net, Bi-directional ConvLSTM U-Net, and Transformer Network.
{"title":"Ensemble Learning for Three-dimensional Medical Image Segmentation of Organ at Risk in Brachytherapy Using Double U-Net, Bi-directional ConvLSTM U-Net, and Transformer Network.","authors":"Soniya Pal, Raj Pal Singh, Anuj Kumar","doi":"10.4103/jmp.jmp_160_24","DOIUrl":null,"url":null,"abstract":"<p><strong>Aim: </strong>This article presents a novel approach to automate the segmentation of organ at risk (OAR) for high-dose-rate brachytherapy patients using three deep learning models combined with ensemble learning techniques. It aims to improve the accuracy and efficiency of segmentation.</p><p><strong>Materials and methods: </strong>The dataset comprised computed tomography (CT) scans of 60 patients obtained from our own institutional image bank and 10 patients from the other institute, all in Digital Imaging and Communications in Medicine format. Experienced radiation oncologists manually segmented four OARs for each scan. Each scan was preprocessed and three models, Double U-Net (DUN), Bi-directional ConvLSTM U-Net (BCUN), and Transformer Networks (TN), were trained on reduced CT scans (240 × 240 × 128) due to memory limitations. Ensemble learning techniques were employed to enhance accuracy and segmentation metrics. Testing and validation were conducted on 12 patients from our institute (OID) and 10 patients from another institute (DID).</p><p><strong>Results: </strong>For DID test dataset, using the ensemble learning technique combining Transformer Network (TN) and BCUN, i.e., TN + BCUN, the average Dice similarity coefficient (DSC) ranged from 0.992 to 0.998, and for DUN and BCUN (DUN + BCUN) combination, the average DSC ranged from 0.990 to 0.993, which reflecting high segmentation accuracy. The 95% Hausdorff distance (HD) ranged from 0.9 to 1.2 mm for TN + BCUN and 1.1 to 1.4 mm for DUN + BCUN, demonstrating precise segmentation boundaries.</p><p><strong>Conclusion: </strong>The proposed method leverages the strengths of each network architecture. The DUN setup excels in sequential processing, the BCUN captures spatiotemporal dependencies, and transformer networks provide a robust understanding of global context. This combination enables efficient and accurate segmentation, surpassing human expert performance in both time and accuracy.</p>","PeriodicalId":51719,"journal":{"name":"Journal of Medical Physics","volume":"49 4","pages":"574-582"},"PeriodicalIF":0.7000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11801097/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4103/jmp.jmp_160_24","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/18 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Aim: This article presents a novel approach to automate the segmentation of organ at risk (OAR) for high-dose-rate brachytherapy patients using three deep learning models combined with ensemble learning techniques. It aims to improve the accuracy and efficiency of segmentation.
Materials and methods: The dataset comprised computed tomography (CT) scans of 60 patients obtained from our own institutional image bank and 10 patients from the other institute, all in Digital Imaging and Communications in Medicine format. Experienced radiation oncologists manually segmented four OARs for each scan. Each scan was preprocessed and three models, Double U-Net (DUN), Bi-directional ConvLSTM U-Net (BCUN), and Transformer Networks (TN), were trained on reduced CT scans (240 × 240 × 128) due to memory limitations. Ensemble learning techniques were employed to enhance accuracy and segmentation metrics. Testing and validation were conducted on 12 patients from our institute (OID) and 10 patients from another institute (DID).
Results: For DID test dataset, using the ensemble learning technique combining Transformer Network (TN) and BCUN, i.e., TN + BCUN, the average Dice similarity coefficient (DSC) ranged from 0.992 to 0.998, and for DUN and BCUN (DUN + BCUN) combination, the average DSC ranged from 0.990 to 0.993, which reflecting high segmentation accuracy. The 95% Hausdorff distance (HD) ranged from 0.9 to 1.2 mm for TN + BCUN and 1.1 to 1.4 mm for DUN + BCUN, demonstrating precise segmentation boundaries.
Conclusion: The proposed method leverages the strengths of each network architecture. The DUN setup excels in sequential processing, the BCUN captures spatiotemporal dependencies, and transformer networks provide a robust understanding of global context. This combination enables efficient and accurate segmentation, surpassing human expert performance in both time and accuracy.
期刊介绍:
JOURNAL OF MEDICAL PHYSICS is the official journal of Association of Medical Physicists of India (AMPI). The association has been bringing out a quarterly publication since 1976. Till the end of 1993, it was known as Medical Physics Bulletin, which then became Journal of Medical Physics. The main objective of the Journal is to serve as a vehicle of communication to highlight all aspects of the practice of medical radiation physics. The areas covered include all aspects of the application of radiation physics to biological sciences, radiotherapy, radiodiagnosis, nuclear medicine, dosimetry and radiation protection. Papers / manuscripts dealing with the aspects of physics related to cancer therapy / radiobiology also fall within the scope of the journal.