Yafei Dong , Thibault Marin , Yue Zhuo , Elie Najem , Maryam Moteabbed , Fangxu Xing , Arnaud Beddok , Rita Maria Lahoud , Laura Rozenblum , Zhiyuan Ding , Xiaofeng Liu , Kira Grogg , Jonghye Woo , Yen-Lin E. Chen , Ruth Lim , Chao Ma , Georges El Fakhri
{"title":"Gross tumor volume confidence maps prediction for soft tissue sarcomas from multi-modality medical images using a diffusion model","authors":"Yafei Dong , Thibault Marin , Yue Zhuo , Elie Najem , Maryam Moteabbed , Fangxu Xing , Arnaud Beddok , Rita Maria Lahoud , Laura Rozenblum , Zhiyuan Ding , Xiaofeng Liu , Kira Grogg , Jonghye Woo , Yen-Lin E. Chen , Ruth Lim , Chao Ma , Georges El Fakhri","doi":"10.1016/j.phro.2025.100734","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and purpose:</h3><div>Accurate delineation of the gross tumor volume (GTV) is essential for radiotherapy of soft tissue sarcomas. However, manual GTV delineation from multi-modality images is time-consuming. Furthermore, GTV delineation is subject to inter- and intra-reader variability, which reduces the reproducibility of treatment planning. To address these issues, this work aims to develop a highly accurate automatic delineation technique modeling reader variability for soft tissue sarcomas using deep learning.</div></div><div><h3>Materials and methods:</h3><div>We employed a publicly available soft tissue sarcoma dataset consisting of Fluorodeoxyglucose Positron Emission Tomography (FDG-PET), X-ray Computed Tomography (CT), and pre-contrast T1-weighted Magnetic Resonance Imaging (MRI) scans for 51 patients, of which 49 were selected for analysis. The GTVs were delineated by six experienced readers, each reader performing GTV contouring multiple times for every patient. The confidence maps were calculated by averaging the labels provided by all readers, resulting in values ranging from 0 to 1. We developed and trained a diffusion model-based neural network to predict confidence maps of GTV for soft tissue sarcomas from multi-modality medical images.</div></div><div><h3>Results:</h3><div>Quantitative analysis showed that the proposed diffusion model performed competitively with U-Net-based models, frequently ranking first or second across five evaluation metrics: Dice Index, Hausdorff Distance, Recall, Precision, and Brier Score. Additionally, experiments evaluating the impact of different imaging modalities demonstrated that incorporating multi-modality image inputs provided improved performance compared to single-modality and dual-modality inputs.</div></div><div><h3>Conclusion:</h3><div>The proposed diffusion model is capable of predicting accurate confidence maps of GTV for soft tissue sarcomas from multi-modality inputs.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"33 ","pages":"Article 100734"},"PeriodicalIF":3.4000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics and Imaging in Radiation Oncology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405631625000399","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background and purpose:
Accurate delineation of the gross tumor volume (GTV) is essential for radiotherapy of soft tissue sarcomas. However, manual GTV delineation from multi-modality images is time-consuming. Furthermore, GTV delineation is subject to inter- and intra-reader variability, which reduces the reproducibility of treatment planning. To address these issues, this work aims to develop a highly accurate automatic delineation technique modeling reader variability for soft tissue sarcomas using deep learning.
Materials and methods:
We employed a publicly available soft tissue sarcoma dataset consisting of Fluorodeoxyglucose Positron Emission Tomography (FDG-PET), X-ray Computed Tomography (CT), and pre-contrast T1-weighted Magnetic Resonance Imaging (MRI) scans for 51 patients, of which 49 were selected for analysis. The GTVs were delineated by six experienced readers, each reader performing GTV contouring multiple times for every patient. The confidence maps were calculated by averaging the labels provided by all readers, resulting in values ranging from 0 to 1. We developed and trained a diffusion model-based neural network to predict confidence maps of GTV for soft tissue sarcomas from multi-modality medical images.
Results:
Quantitative analysis showed that the proposed diffusion model performed competitively with U-Net-based models, frequently ranking first or second across five evaluation metrics: Dice Index, Hausdorff Distance, Recall, Precision, and Brier Score. Additionally, experiments evaluating the impact of different imaging modalities demonstrated that incorporating multi-modality image inputs provided improved performance compared to single-modality and dual-modality inputs.
Conclusion:
The proposed diffusion model is capable of predicting accurate confidence maps of GTV for soft tissue sarcomas from multi-modality inputs.