Pub Date : 2025-10-01DOI: 10.1016/j.phro.2025.100886
Libing Zhu , Yi Rong , Nathan Y. Yu , Jason M. Holmes , Carlos E. Vargas , Sarah E. James , Lu Shang , Jean-Claude M. Rwigema , Quan Chen
Background and purpose
Deep-learning auto-segmentation (DLAS) performance in radiotherapy may change over time due to data shift/drift or practice changes, yet guidance for quality assurance is lacking. This study developed a practical framework for prospective performance monitoring using retrospective data.
Methods
A total of 464 prostate cases over 20 months were retrospectively collected. Two commercial DLAS models were clinically used: model A (2D U-Net, January 2022–January 2023) and model B (3D U-Net, February–August 2023). The agreement between DLAS and clinical contours was assessed using Dice Similarity Coefficient (DSC), 95th percentile Hausdorff Distance (HD95), and Surface DSC with a 2 mm tolerance (SDSC). Statistical process control charts were created to monitor performance drift and model switching. The first 150 cases were used to define organ-specific control limits with two and three standard deviations of monthly mean values, .
Results
2 and 3-based control limits were established for the monthly average charts, ranging from DSC 0.82–0.97, HD95 1.4–10.5 mm, and SDSC 0.45–0.91 across organs. Model A showed stable performance, with 9–13 months per organ remaining within the 3 thresholds. In contrast, model B demonstrated a marked performance shift (p < 0.001), with all five organs exceeding both thresholds across all 7 months. The 2 thresholds were more sensitive in detecting mild deviations for model A, while both limits effectively identified the substantial drift of model B.
Conclusion
The monitoring system effectively detected out-of-distribution outliers and clinical practice changes, providing a reliable framework for early detection of monthly performance degradation.
{"title":"Establishing prospective performance monitoring for real-world implementation of deep learning-based auto-segmentation in prostate cancer radiotherapy","authors":"Libing Zhu , Yi Rong , Nathan Y. Yu , Jason M. Holmes , Carlos E. Vargas , Sarah E. James , Lu Shang , Jean-Claude M. Rwigema , Quan Chen","doi":"10.1016/j.phro.2025.100886","DOIUrl":"10.1016/j.phro.2025.100886","url":null,"abstract":"<div><h3>Background and purpose</h3><div>Deep-learning auto-segmentation (DLAS) performance in radiotherapy may change over time due to data shift/drift or practice changes, yet guidance for quality assurance is lacking. This study developed a practical framework for prospective performance monitoring using retrospective data.</div></div><div><h3>Methods</h3><div>A total of 464 prostate cases over 20 months were retrospectively collected. Two commercial DLAS models were clinically used: model A (2D U-Net, January 2022–January 2023) and model B (3D U-Net, February–August 2023). The agreement between DLAS and clinical contours was assessed using Dice Similarity Coefficient (DSC), 95th percentile Hausdorff Distance (HD95), and Surface DSC with a 2 mm tolerance (SDSC). Statistical process control charts were created to monitor performance drift and model switching. The first 150 cases were used to define organ-specific control limits with two and three standard deviations of monthly mean values, <span><math><mrow><msub><mi>σ</mi><mover><mrow><mi>x</mi></mrow><mrow><mo>¯</mo></mrow></mover></msub></mrow></math></span>.</div></div><div><h3>Results</h3><div>2<span><math><mrow><msub><mi>σ</mi><mover><mrow><mi>x</mi></mrow><mrow><mo>¯</mo></mrow></mover></msub></mrow></math></span> and 3<span><math><mrow><msub><mi>σ</mi><mover><mrow><mi>x</mi></mrow><mrow><mo>¯</mo></mrow></mover></msub></mrow></math></span>-based control limits were established for the monthly average charts, ranging from DSC 0.82–0.97, HD95 1.4–10.5 mm, and SDSC 0.45–0.91 across organs. Model A showed stable performance, with 9–13 months per organ remaining within the 3<span><math><mrow><msub><mi>σ</mi><mover><mrow><mi>x</mi></mrow><mrow><mo>¯</mo></mrow></mover></msub></mrow></math></span> thresholds. In contrast, model B demonstrated a marked performance shift (p < 0.001), with all five organs exceeding both thresholds across all 7 months. The 2<span><math><mrow><msub><mi>σ</mi><mover><mrow><mi>x</mi></mrow><mrow><mo>¯</mo></mrow></mover></msub></mrow></math></span> thresholds were more sensitive in detecting mild deviations for model A, while both limits effectively identified the substantial drift of model B.</div></div><div><h3>Conclusion</h3><div>The monitoring system effectively detected out-of-distribution outliers and clinical practice changes, providing a reliable framework for early detection of monthly performance degradation.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"36 ","pages":"Article 100886"},"PeriodicalIF":3.3,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145736394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Oligometastatic disease represents limited metastatic burden, and local ablative therapies such as stereotactic body radiotherapy (SBRT) may improve survival. However, inter-institutional variability in target segmentation and treatment planning can compromise treatment quality. This study aimed to evaluate the segmentation variability and dose distribution quality of SBRT in oligometastatic settings using a multi-institutional dummy run approach.
Methods and materials
Sixty-nine institutions were provided with two anonymized cases of adrenal and spine metastases to delineate targets and organs at risk (OARs) and create intensity-modulated radiotherapy plans following a protocol. Variability was quantified using the Dice similarity coefficient (DSC), Hausdorff distance, and mean distance to agreement. Plan qualities were assessed using the Paddick conformity index, modified gradient index, and a new three-dimensional conformity–gradient index (3D-CGI). Knowledge-based planning (KBP) was applied to explore potential improvements in OAR sparing.
Results
All submitted plans met protocol dose constraints. However, substantial segmentation variability was observed, particularly for the spine case. Among 136 plans, 79% demonstrated acceptable conformity and dose gradients, with 3D-CGI < 6 correlating with favorable distributions. Mean DSC was 0.93 for the clinical target volume and 0.76 for the cauda equina, which showed the highest variability. KBP reduced OAR doses for the adrenal case but showed limited impact for the spine case.
Conclusions
Although dose constraints were achieved, segmentation variability remained substantial, particularly for the cauda equina in the spine case. These findings emphasize inter-institutional differences and the need for standardization and tools to improve SBRT consistency.
{"title":"A multi-institutional dummy run on segmentation variability and plan quality of stereotactic body radiotherapy for oligometastatic disease","authors":"Hideaki Hirashima , Yukinori Matsuo , Satoshi Ishikura , Mitsuhiro Nakamura , Ikuno Nishibuchi , Daisuke Kawahara , Yoshihisa Shimada , Yoshiro Nakahara , Teiji Nishio , Naoto Shikama , Shun-ichi Watanabe , Isamu Okamoto , Toshiyuki Ishiba , Fumikata Hara , Tadahiko Shien , Takashi Mizowaki","doi":"10.1016/j.phro.2025.100857","DOIUrl":"10.1016/j.phro.2025.100857","url":null,"abstract":"<div><h3>Background and purpose</h3><div>Oligometastatic disease represents limited metastatic burden, and local ablative therapies such as stereotactic body radiotherapy (SBRT) may improve survival. However, inter-institutional variability in target segmentation and treatment planning can compromise treatment quality. This study aimed to evaluate the segmentation variability and dose distribution quality of SBRT in oligometastatic settings using a multi-institutional dummy run approach.</div></div><div><h3>Methods and materials</h3><div>Sixty-nine institutions were provided with two anonymized cases of adrenal and spine metastases to delineate targets and organs at risk (OARs) and create intensity-modulated radiotherapy plans following a protocol. Variability was quantified using the Dice similarity coefficient (DSC), Hausdorff distance, and mean distance to agreement. Plan qualities were assessed using the Paddick conformity index, modified gradient index, and a new three-dimensional conformity–gradient index (3D-CGI). Knowledge-based planning (KBP) was applied to explore potential improvements in OAR sparing.</div></div><div><h3>Results</h3><div>All submitted plans met protocol dose constraints. However, substantial segmentation variability was observed, particularly for the spine case. Among 136 plans, 79% demonstrated acceptable conformity and dose gradients, with 3D-CGI < 6 correlating with favorable distributions. Mean DSC was 0.93 for the clinical target volume and 0.76 for the cauda equina, which showed the highest variability. KBP reduced OAR doses for the adrenal case but showed limited impact for the spine case.</div></div><div><h3>Conclusions</h3><div>Although dose constraints were achieved, segmentation variability remained substantial, particularly for the cauda equina in the spine case. These findings emphasize inter-institutional differences and the need for standardization and tools to improve SBRT consistency.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"36 ","pages":"Article 100857"},"PeriodicalIF":3.3,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145520095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01DOI: 10.1016/j.phro.2025.100863
Zhunhao Zheng , Junqiang Chen , Xiaolin Ge , Meng Yan , Ling Li , Qifeng Wang , Xiaomin Wang , Xin Wang , Wenyang Liu , Yonggang Shi , Xiaofeng Wang , Hongyun Shi , Zhilong Yu , Qingsong Pang , Zefen Xiao , Wencheng Zhang
Background and purpose
To integrate clinical characteristics, radiomics, and dosiomics to provide accurate and individualized prediction of radiation pneumonitis (RP) in elderly patients aged 70 and over with esophageal cancer receiving radiotherapy.
Materials and methods
Based on a phase III clinical study (NCT02979691) that included elderly patients with esophageal squamous cell carcinoma (ESCC) who received definitive radiotherapy, we selected a total of 229 patients with available computed tomography (CT) and dose images. Radiomic and dosiomics features were extracted from both lungs. The patients were randomly assigned to either the training group (N = 161) or the test group (N = 68) in a 7:3 ratio. In the training set, logistic regression (LR) was applied to calculate the radiomic score (R score) and dosiomic score (D score). The constructed multivariate LR and ridge regression prediction models were evaluated using the test set. The endpoint of the predictive model is defined as a grade ≥ 2 RP. Discrimination and prediction were assessed by calculating the area under curve (AUC) of the receiver operating characteristic curve and plotting calibration and decision curve analyses (DCA).
Results
The hybrid LR model integrating R score, D score and clinical characteristics had the best clinical applicability. The hybrid model demonstrated superior predictive performance on the test set, achieving an area under the curve (AUC) of 0.76, while the combined clinical and DVH model achieved an AUC of 0.70.
Conclusions
A hybrid model combining radiomics and dosiomics with clinical characteristics showed the best performance for predicting RP.
{"title":"A machine learning approach for radiation pneumonitis prediction in elderly esophageal cancer patients by integrating baseline computed tomography radiomics, dosiomics, and clinical characteristics","authors":"Zhunhao Zheng , Junqiang Chen , Xiaolin Ge , Meng Yan , Ling Li , Qifeng Wang , Xiaomin Wang , Xin Wang , Wenyang Liu , Yonggang Shi , Xiaofeng Wang , Hongyun Shi , Zhilong Yu , Qingsong Pang , Zefen Xiao , Wencheng Zhang","doi":"10.1016/j.phro.2025.100863","DOIUrl":"10.1016/j.phro.2025.100863","url":null,"abstract":"<div><h3>Background and purpose</h3><div>To integrate clinical characteristics, radiomics, and dosiomics to provide accurate and individualized prediction of radiation pneumonitis (RP) in elderly patients aged 70 and over with esophageal cancer receiving radiotherapy.</div></div><div><h3>Materials and methods</h3><div>Based on a phase III clinical study (NCT02979691) that included elderly patients with esophageal squamous cell carcinoma (ESCC) who received definitive radiotherapy, we selected a total of 229 patients with available computed tomography (CT) and dose images. Radiomic and dosiomics features were extracted from both lungs. The patients were randomly assigned to either the training group (N = 161) or the test group (N = 68) in a 7:3 ratio. In the training set, logistic regression (LR) was applied to calculate the radiomic score (R score) and dosiomic score (D score). The constructed multivariate LR and ridge regression prediction models were evaluated using the test set. The endpoint of the predictive model is defined as a grade ≥ 2 RP. Discrimination and prediction were assessed by calculating the area under curve (AUC) of the receiver operating characteristic curve and plotting calibration and decision curve analyses (DCA).</div></div><div><h3>Results</h3><div>The hybrid LR model integrating R score, D score and clinical characteristics had the best clinical applicability. The hybrid model demonstrated superior predictive performance on the test set, achieving an area under the curve (AUC) of 0.76, while the combined clinical and DVH model achieved an AUC of 0.70.</div></div><div><h3>Conclusions</h3><div>A hybrid model combining radiomics and dosiomics with clinical characteristics showed the best performance for predicting RP.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"36 ","pages":"Article 100863"},"PeriodicalIF":3.3,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145576373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01DOI: 10.1016/j.phro.2025.100877
Mikkel Skaarup, Nikolaj Kylling Gyldenløve Jensen
Background and purpose
The desire to avoid tattooing radiotherapy patients lead us to implement surface-guided initial patient set-up (SGRT). To validate SGRT we investigated setup precision, user attitude, and impact on radiotherapy technician (RTT) workflow.
Materials and methods
During a six-month period, initial setup was investigated on six linear accelerators (Truebeam, Varian), each equipped with a thermo-optical surface camera (ExacTrac Dynamic, Brainlab). Precision was assessed by comparing couch shifts based on x-ray imaging acquired after initial setup and number of x-ray imaging procedures for each fraction to data from the prior year, using a tattoo-based setup. The data was split into subgroups (thoracic, abdominal/pelvic, palliative and miscellaneous (gastrointestinal, head and neck, cranial and extremities)). User attitude and impact on RTT workflow was assessed qualitatively by questionnaire. RTTs were asked to rate how SGRT compared to tattoo-based setup and record the need for manual adjustments of the patient (e.g. pushing, lifting or pulling). Questionnaires were repeated 1.5 years after implementation.
Results
We included 460 patients setup with SGRT, and 468 patients with tattoo-based methods. Median couch shifts and repeated imaging were comparable overall (0.6 cm and 9 % respectively for both setup methods), SGRT performed better for the thoracic and miscellaneous sites subgroups. RTTs preferred SGRT to laser and tattoo initial setup for >90 % of fractions. Manual adjustments were reduced with SGRT (15 % of fractions) compared to tattoo-based (60 % of fractions).
Conclusions
SGRT achieved the same or better precision as tattoo-based initial setup while providing a better workflow and reduced physical adjustments performed by the RTTs by 75 %.
{"title":"Workflow evaluation of surface-guided initial patient set-up in radiotherapy","authors":"Mikkel Skaarup, Nikolaj Kylling Gyldenløve Jensen","doi":"10.1016/j.phro.2025.100877","DOIUrl":"10.1016/j.phro.2025.100877","url":null,"abstract":"<div><h3>Background and purpose</h3><div>The desire to avoid tattooing radiotherapy patients lead us to implement surface-guided initial patient set-up (SGRT). To validate SGRT we investigated setup precision, user attitude, and impact on radiotherapy technician (RTT) workflow.</div></div><div><h3>Materials and methods</h3><div>During a six-month period, initial setup was investigated on six linear accelerators (Truebeam, Varian), each equipped with a thermo-optical surface camera (ExacTrac Dynamic, Brainlab). Precision was assessed by comparing couch shifts based on x-ray imaging acquired after initial setup and number of x-ray imaging procedures for each fraction to data from the prior year, using a tattoo-based setup. The data was split into subgroups (thoracic, abdominal/pelvic, palliative and miscellaneous (gastrointestinal, head and neck, cranial and extremities)). User attitude and impact on RTT workflow was assessed qualitatively by questionnaire. RTTs were asked to rate how SGRT compared to tattoo-based setup and record the need for manual adjustments of the patient (e.g. pushing, lifting or pulling). Questionnaires were repeated 1.5 years after implementation.</div></div><div><h3>Results</h3><div>We included 460 patients setup with SGRT, and 468 patients with tattoo-based methods. Median couch shifts and repeated imaging were comparable overall (0.6 cm and 9 % respectively for both setup methods), SGRT performed better for the thoracic and miscellaneous sites subgroups. RTTs preferred SGRT to laser and tattoo initial setup for >90 % of fractions. Manual adjustments were reduced with SGRT (15 % of fractions) compared to tattoo-based (60 % of fractions).</div></div><div><h3>Conclusions</h3><div>SGRT achieved the same or better precision as tattoo-based initial setup while providing a better workflow and reduced physical adjustments performed by the RTTs by 75 %.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"36 ","pages":"Article 100877"},"PeriodicalIF":3.3,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145680998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01DOI: 10.1016/j.phro.2025.100852
Arkadiy Dushatskiy , Peter A.N. Bosman , Karel A. Hinnen , Jan Wiersma , Henrike Westerveld , Bradley R. Pieters , Tanja Alderliesten
Background and Purpose
Recently, we introduced a novel Deep Learning (DL) based (semi-)automatic method for medical image segmentation. Unlike classical DL segmentation methods, it produces multiple segmentation variants (reflecting the variation of manual segmentations) instead of just one. Potentially, with this approach, there is a higher chance that a clinician prefers one of automatically produced segmentation variants. This work focuses on evaluating this method on prostate segmentation in MRI scans used for brachytherapy and investigating its potential clinical usefulness.
Materials and Methods
Three experienced radiation oncologists graded (per-slice and per-scan) segmentations produced by our method, reference segmentations (manually created and used for brachytherapy treatment planning) and segmentations produced by a classical DL method. The study was retrospective and the way the segmentation was generated (our method, classical DL method, or manually) was blinded for the clinicians. The grades reflect the amount of manual correction required. Additionally, the clinicians were asked to rank segmentations to evaluate which one is preferred for each scan. The study was performed on 13 prostate cancer patients.
Results
Segmentations produced by our method are graded as requiring no manual correction in 292/576 (51 %) slices compared to 240/576 (42 %) slices in the case of the segmentations produced by a classical DL method. Furthermore, in fewer slices, 38 (6.6 %) vs. 48 (8.3 %), segmentations by our method were graded as unacceptable.
Conclusion
Our study has demonstrated that deep-learning-based segmentation methods can produce high-quality segmentations. Our method was evaluated better than a classical DL method, indicating the potential for integration into clinical practice.
{"title":"Evaluating the quality of multiple automatically produced segmentation variants of the prostate on Magnetic Resonance Imaging scans for brachytherapy","authors":"Arkadiy Dushatskiy , Peter A.N. Bosman , Karel A. Hinnen , Jan Wiersma , Henrike Westerveld , Bradley R. Pieters , Tanja Alderliesten","doi":"10.1016/j.phro.2025.100852","DOIUrl":"10.1016/j.phro.2025.100852","url":null,"abstract":"<div><h3>Background and Purpose</h3><div>Recently, we introduced a novel Deep Learning (DL) based (semi-)automatic method for medical image segmentation. Unlike classical DL segmentation methods, it produces multiple segmentation variants (reflecting the variation of manual segmentations) instead of just one. Potentially, with this approach, there is a higher chance that a clinician prefers one of automatically produced segmentation variants. This work focuses on evaluating this method on prostate segmentation in MRI scans used for brachytherapy and investigating its potential clinical usefulness.</div></div><div><h3>Materials and Methods</h3><div>Three experienced radiation oncologists graded (per-slice and per-scan) segmentations produced by our method, reference segmentations (manually created and used for brachytherapy treatment planning) and segmentations produced by a classical DL method. The study was retrospective and the way the segmentation was generated (our method, classical DL method, or manually) was blinded for the clinicians. The grades reflect the amount of manual correction required. Additionally, the clinicians were asked to rank segmentations to evaluate which one is preferred for each scan. The study was performed on 13 prostate cancer patients.</div></div><div><h3>Results</h3><div>Segmentations produced by our method are graded as requiring no manual correction in 292/576 (51 %) slices compared to 240/576 (42 %) slices in the case of the segmentations produced by a classical DL method. Furthermore, in fewer slices, 38 (6.6 %) vs. 48 (8.3 %), segmentations by our method were graded as unacceptable.</div></div><div><h3>Conclusion</h3><div>Our study has demonstrated that deep-learning-based segmentation methods can produce high-quality segmentations. Our method was evaluated better than a classical DL method, indicating the potential for integration into clinical practice.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"36 ","pages":"Article 100852"},"PeriodicalIF":3.3,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145681030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01DOI: 10.1016/j.phro.2025.100844
Maria Thor , Aditya Apte , Milan Grkovski , Charles B. Simone II , Daphna Y. Gelblum , Masoud Zarepisheh , Puneeth Iyengar , Abraham J. Wu , Jacob Y. Shin , Tafadzwa Chaunzwa , Jennifer Ma , David Billing , Mark Dunphy , Jamie E. Chaft , Daniel R. Gomez , Joseph O. Deasy , Narek Shaverdian
Background and purpose
Early onset radiation pneumonitis (RPEarly) after concurrent chemoradiotherapy (cCRT) can lead to consolidation immunotherapy (IO) discontinuation, and poor survival in locally advanced non-small cell lung cancer (LA-NSCLC). This work assessed the external validity of a previously published RPEarly risk model.
Material and methods
The RPEarly risk model utilizes pretreatment 18F-FDG PET/CT imaging of the normal lungs and the mean lung dose (MLD). The 90th percentile of the standardized uptake value (SUVP90) and the MLD model parameters from the previous derivation cohort (N = 160) were applied in the independent cohort (50 consecutive LA-NSCLC patients treated with cCRT and IO) where model performance was evaluated (area under the receiver-operating characteristic curve (AUC), p-values, and the Hosmer-Lemeshow test (pHL)).
Results
Seven patients (14 %) developed RPEarly. Model performance of the previously developed SUVP90 and MLD model improved with re-fitting (AUC = 0.76 vs. 0.72; p = 0.01 vs. 0.10; pHL = 0.66 vs. 0.94). Above a clinically desirable 10 % predicted RPEarly, after refitting model coefficients in the combined derivation and validation cohorts (N = 210), the MLD was 13 ± 2.2 EQD23 Gy (SUVP90 = 1.2 ± 0.3) above the RPEarly risk threshold vs. 8.5 ± 2.6 EQD23 Gy (0.9 ± 0.2) below the threshold. For an SUVP90 of 1.1 and an MLD of 11 Gy EQD23 Gy, 25/27 patients developing RPEarly were captured.
Conclusion
The previously developed SUVP90 and MLD-based risk model for RPEarly demonstrated a high probability to correctly predict RPEarly in the independent cohort. This now validated RPEarly risk model with derived high-risk indications could enable personalized thoracic RT planning to reduce the risk of RPEarly and of discontinuing life-prolonging IO post-cCRT.
背景和目的同步放化疗(cCRT)后早发性放射性肺炎(RPEarly)可导致局部晚期非小细胞肺癌(LA-NSCLC)的巩固免疫治疗(IO)中断和生存率低。这项工作评估了先前发表的RPEarly风险模型的外部有效性。材料和方法RPEarly risk模型采用预处理的18F-FDG PET/CT正常肺成像和平均肺剂量(MLD)。在独立队列(50例连续接受cCRT和IO治疗的LA-NSCLC患者)中,采用标准化摄取值(SUVP90)的第90百分位和先前衍生队列(N = 160)的MLD模型参数,评估模型性能(接受者-工作特征曲线下面积(AUC)、p值和Hosmer-Lemeshow检验(pHL))。结果早期发病7例(14%)。先前开发的SUVP90和MLD模型的模型性能通过重新拟合得到改善(AUC = 0.76 vs. 0.72; p = 0.01 vs. 0.10; pHL = 0.66 vs. 0.94)。在推导和验证联合队列(N = 210)中修正模型系数后,在临床所需的10%以上预测RPEarly, MLD比RPEarly风险阈值高13±2.2 EQD23 Gy (SUVP90 = 1.2±0.3),比阈值低8.5±2.6 EQD23 Gy(0.9±0.2)。SUVP90为1.1,MLD为11 Gy EQD23 Gy,捕获了25/27的早期发展患者。结论先前建立的基于SUVP90和mld的RPEarly风险模型在独立队列中正确预测RPEarly的概率很高。现在,这个经过验证的RPEarly风险模型及其衍生的高风险适应症可以实现个性化的胸部RT计划,以降低RPEarly的风险和ccrt后停止延长生命的IO的风险。
{"title":"Prospective validation of a pretreatment 18F-FDG PET/CT and mean lung dose model for early radiation pneumonitis","authors":"Maria Thor , Aditya Apte , Milan Grkovski , Charles B. Simone II , Daphna Y. Gelblum , Masoud Zarepisheh , Puneeth Iyengar , Abraham J. Wu , Jacob Y. Shin , Tafadzwa Chaunzwa , Jennifer Ma , David Billing , Mark Dunphy , Jamie E. Chaft , Daniel R. Gomez , Joseph O. Deasy , Narek Shaverdian","doi":"10.1016/j.phro.2025.100844","DOIUrl":"10.1016/j.phro.2025.100844","url":null,"abstract":"<div><h3>Background and purpose</h3><div>Early onset radiation pneumonitis (RP<sub>Early</sub>) after concurrent chemoradiotherapy (cCRT) can lead to consolidation immunotherapy (IO) discontinuation, and poor survival in locally advanced non-small cell lung cancer (LA-NSCLC). This work assessed the external validity of a previously published RP<sub>Early</sub> risk model.</div></div><div><h3>Material and methods</h3><div>The RP<sub>Early</sub> risk model utilizes pretreatment 18F-FDG PET/CT imaging of the normal lungs and the mean lung dose (MLD). The 90th percentile of the standardized uptake value (SUV<sub>P90</sub>) and the MLD model parameters from the previous derivation cohort (N = 160) were applied in the independent cohort (50 consecutive LA-NSCLC patients treated with cCRT and IO) where model performance was evaluated (area under the receiver-operating characteristic curve (AUC), <em>p-values</em>, and the Hosmer-Lemeshow test (<em>pHL</em>)).</div></div><div><h3>Results</h3><div>Seven patients (14 %) developed RP<sub>Early</sub>. Model performance of the previously developed SUV<sub>P90</sub> and MLD model improved with re-fitting (AUC = 0.76 <em>vs.</em> 0.72; p = 0.01 <em>vs.</em> 0.10; pHL = 0.66 <em>vs.</em> 0.94). Above a clinically desirable 10 % predicted RP<sub>Early</sub>, after refitting model coefficients in the combined derivation and validation cohorts (N = 210), the MLD was 13 ± 2.2 EQD2<sub>3</sub> Gy (SUV<sub>P90</sub> = 1.2 ± 0.3) above the RP<sub>Early</sub> risk threshold <em>vs.</em> 8.5 ± 2.6 EQD2<sub>3</sub> Gy (0.9 ± 0.2) below the threshold. For an SUV<sub>P90</sub> of 1.1 and an MLD of 11 Gy EQD2<sub>3</sub> Gy, 25/27 patients developing RP<sub>Early</sub> were captured.</div></div><div><h3>Conclusion</h3><div>The previously developed SUV<sub>P90</sub> and MLD-based risk model for RP<sub>Early</sub> demonstrated a high probability to correctly predict RP<sub>Early</sub> in the independent cohort. This now validated RP<sub>Early</sub> risk model with derived high-risk indications could enable personalized thoracic RT planning to reduce the risk of RP<sub>Early</sub> and of discontinuing life-prolonging IO post-cCRT.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"36 ","pages":"Article 100844"},"PeriodicalIF":3.3,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145221817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01DOI: 10.1016/j.phro.2025.100841
Chuyan Wang , Haoping Xu , Zhenkui Wang , Li Tong , Xijing Zhang , Fuhua Yan , Jiayi Chen , Yingli Yang
Background and purpose
Pelvic radiotherapy-induced bone marrow (BM) damage adversely affects patient prognosis. Progress in BM-sparing radiotherapy is limited by the lack of standardized BM quantification and the inherent constraints of magnetic resonance spectroscopy (MRS), the current gold standard for BM magnetic resonance imaging (MRI). Proton density fat fraction (PDFF), derived from DIXON-based MRI, has emerged as an imaging biomarker for detecting BM changes. This study evaluated the potential of DIXON-based MRI in pelvic BM for radiotherapy.
Materials and methods
Three existing DIXON-based techniques were optimized and compared to establish clinical protocols. In vitro measurements were performed using fat phantoms calibrated against thermogravimetric analysis, while in vivo measurements were based on data from 30 volunteers with MRS serving as the reference standard. Quantitative accuracy was assessed using mean absolute error (MAE), repeatability via intra-class correlation coefficients (ICCs), and image quality using an ACR phantom.
Results
Comprehensive evaluation identified optimal parameters for each DIXON-based sequence. For in vitro measurements, the MAE for MRS was 3.5 % and the highest MAE across three optimized DIXON-based sequences was 5.9 %. For in vivo measurements, linear regressions between MRS and each of the optimized DIXON-based sequence resulted in R2 ≥ 0.93 and MAE ≤ 7.6 %. All three optimized DIXON-based sequences demonstrated high repeatability (ICCs ≥ 0.97) and clearly visualized BM with varying fat fractions, with no consistently outperforming in image quality.
Conclusion
For BM assessment, this study demonstrated DIXON-based PDFF quantification achieved high accuracy, repeatability, and image quality, supporting its potential for radiotherapy.
{"title":"Leveraging dixon-based magnetic resonance imaging for pelvic bone marrow imaging in radiotherapy","authors":"Chuyan Wang , Haoping Xu , Zhenkui Wang , Li Tong , Xijing Zhang , Fuhua Yan , Jiayi Chen , Yingli Yang","doi":"10.1016/j.phro.2025.100841","DOIUrl":"10.1016/j.phro.2025.100841","url":null,"abstract":"<div><h3>Background and purpose</h3><div>Pelvic radiotherapy-induced bone marrow (BM) damage adversely affects patient prognosis. Progress in BM-sparing radiotherapy is limited by the lack of standardized BM quantification and the inherent constraints of magnetic resonance spectroscopy (MRS), the current gold standard for BM magnetic resonance imaging (MRI). Proton density fat fraction (PDFF), derived from DIXON-based MRI, has emerged as an imaging biomarker for detecting BM changes. This study evaluated the potential of DIXON-based MRI in pelvic BM for radiotherapy.</div></div><div><h3>Materials and methods</h3><div>Three existing DIXON-based techniques were optimized and compared to establish clinical protocols. <em>In vitro</em> measurements were performed using fat phantoms calibrated against thermogravimetric analysis, while <em>in vivo</em> measurements were based on data from 30 volunteers with MRS serving as the reference standard. Quantitative accuracy was assessed using mean absolute error (MAE), repeatability via intra-class correlation coefficients (ICCs), and image quality using an ACR phantom.</div></div><div><h3>Results</h3><div>Comprehensive evaluation identified optimal parameters for each DIXON-based sequence. For <em>in vitro</em> measurements, the MAE for MRS was 3.5 % and the highest MAE across three optimized DIXON-based sequences was 5.9 %. For <em>in vivo</em> measurements, linear regressions between MRS and each of the optimized DIXON-based sequence resulted in R<sup>2</sup> ≥ 0.93 and MAE ≤ 7.6 %. All three optimized DIXON-based sequences demonstrated high repeatability (ICCs ≥ 0.97) and clearly visualized BM with varying fat fractions, with no consistently outperforming in image quality.</div></div><div><h3>Conclusion</h3><div>For BM assessment, this study demonstrated DIXON-based PDFF quantification achieved high accuracy, repeatability, and image quality, supporting its potential for radiotherapy.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"36 ","pages":"Article 100841"},"PeriodicalIF":3.3,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145221820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01DOI: 10.1016/j.phro.2025.100861
Gabriel Adrian, Raphaël Moeckli, Elke Beyreuther, Ludvig P. Muren, Brita Singers Sørensen
{"title":"New modalities for ultra-high dose rate irradiation and FLASH experiments","authors":"Gabriel Adrian, Raphaël Moeckli, Elke Beyreuther, Ludvig P. Muren, Brita Singers Sørensen","doi":"10.1016/j.phro.2025.100861","DOIUrl":"10.1016/j.phro.2025.100861","url":null,"abstract":"","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"36 ","pages":"Article 100861"},"PeriodicalIF":3.3,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145466706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Photon-counting detector computed tomography (PCD-CT) offers advancements over conventional energy-integrating detector computed tomography (EID-CT). This study is the first to investigate the use of PCD-CT for delineation purposes.
Materials and methods
Twelve patients with head-and-neck cancer were prospectively included. Delineation of clinical target volume (CTV) and organs at risk (OARs) was performed by three physicians using EID-CT (120 kVp) and PCD-CT at virtual monoenergetic levels of 140 keV and 50 keV. Segmentation accuracy was assessed and contouring-confidence was rated.
Results
The ratio of the mean computed tomography dose index (CTDI) for EID-CT/PCD-CT for all patients was 2.72. The study revealed high inter-observer agreement for the mandible, oral cavity, parotid glands, and submandibular glands, with DSCs exceeding 0.8 across both EID-CT and PCD-CT. No differences in delineation agreement were observed, while a significant improvement was found for the lips (p = 0.001) and oral cavity (p = 0.003) when combining 140 keV and 50 keV images in PCD-CT compared to EID-CT. PCD-CT outperformed EID-CT in terms of contouring confidence.
Conclusion
The study established the feasibility of PCD-CT for radiotherapy planning. PCD-CT-based delineation provides results comparable to EID-CT, with the added advantages of reduced radiation dose, improved image quality and higher delineation confidence. Significant improvements in delineating specific structures, such as the lips and oral cavity, were observed when combining multiple energy levels in PCD-CT.
{"title":"First implementation of photon-counting dector computed tomography for optimizing segmentation in head-and-neck cancer radiotherapy","authors":"Niccolò Bertini , Hubert S. Gabryś , Hatem Alkadhi , Lotte Wilke , Patrick Wohlfahrt , Serena Psoroulas , Eugenia Vlaskou , Laura Motisi , Matthias Guckenberger , Stephanie Tanadini-Lang , Panagiotis Balermpas","doi":"10.1016/j.phro.2025.100864","DOIUrl":"10.1016/j.phro.2025.100864","url":null,"abstract":"<div><h3>Background and purpose</h3><div>Photon-counting detector computed tomography (PCD-CT) offers advancements over conventional energy-integrating detector computed tomography (EID-CT). This study is the first to investigate the use of PCD-CT for delineation purposes.</div></div><div><h3>Materials and methods</h3><div>Twelve patients with head-and-neck cancer were prospectively included. Delineation of clinical target volume (CTV) and organs at risk (OARs) was performed by three physicians using EID-CT (120 kVp) and PCD-CT at virtual monoenergetic levels of 140 keV and 50 keV. Segmentation accuracy was assessed and contouring-confidence was rated.</div></div><div><h3>Results</h3><div>The ratio of the mean computed tomography dose index (CTDI) for EID-CT/PCD-CT for all patients was 2.72. The study revealed high inter-observer agreement for the mandible, oral cavity, parotid glands, and submandibular glands, with DSCs exceeding 0.8 across both EID-CT and PCD-CT. No differences in delineation agreement were observed, while a significant improvement was found for the lips (p = 0.001) and oral cavity (p = 0.003) when combining 140 keV and 50 keV images in PCD-CT compared to EID-CT. PCD-CT outperformed EID-CT in terms of contouring confidence.</div></div><div><h3>Conclusion</h3><div>The study established the feasibility of PCD-CT for radiotherapy planning. PCD-CT-based delineation provides results comparable to EID-CT, with the added advantages of reduced radiation dose, improved image quality and higher delineation confidence. Significant improvements in delineating specific structures, such as the lips and oral cavity, were observed when combining multiple energy levels in PCD-CT.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"36 ","pages":"Article 100864"},"PeriodicalIF":3.3,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145614338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01DOI: 10.1016/j.phro.2025.100869
Miriam Eckl , Nour Alfakhori , Marvin Willam , Hans Oppitz , Constantin Dreher , Michael Ehmann , Judit Boda-Heggemann , Frank A. Giordano , Jens Fleckenstein
Background and purpose
Cone beam computed tomography (CBCT)-based approaches for online adaptive radiation therapy (oART) have recently become clinically available for ultra-hypofractionated prostate stereotactic body radiation therapy (SBRT). This work assessed the changes of relevant dose-volume-histogram (DVH) parameters and their robustness against morphologic variations during adaptation.
Materials and methods
Sixteen prostate cancer patients were treated with oART in an SBRT regimen (40 Gy in 5 treatment sessions (Tx) to the clinical target volume (CTV), PACE-B constraints). Two CBCTs were acquired daily: CBCT1 for adaptive planning and CBCT2 after adaptation for position verification. Adapted plans optimized on CBCT1 (ARTCBCT1) were recalculated on CBCT2 (ARTCBCT2) and compared to treatment plans on CBCT1 after image guidance (IGRTCBCT1) for relevant DVH metrics: V40Gy(CTV), V37Gy(bladder), V36Gy(rectum). Spearman’s rank coefficients r with p-values (5% significance level) were determined to analyze correlations between adaptation time (ΔT) and bladder filling as well as Tx and median prostate volume.
Results
oART improved median V40Gy(CTV) from 86% in IGRTCBCT1 to 94% in ARTCBCT2. Inter-fractional prostate swelling ( was responsible for CTV deviations. Bladder filling () and rectal gas migration during the median adaptation time increased V37Gy(bladder) from 4.9 cm3 in ARTCBCT1 to 6.5 cm3 in ARTCBCT2 and V36Gy(rectum) from 0.5 cm3 to 0.6 cm3 and led to 10 constraint violations, each.
Conclusion
Compared to IGRT, daily oART substantially improved CTV coverage. Besides inter-fractional prostate swelling, constraint violations originated from seminal vesicles motion, rectal gas or bladder filling during adaptation. Treatment adaptation times should therefore be minimized whenever possible.
{"title":"A cone-beam computed tomography based workflow for online adaptive ultra-hypofractionated radiotherapy of prostate cancer","authors":"Miriam Eckl , Nour Alfakhori , Marvin Willam , Hans Oppitz , Constantin Dreher , Michael Ehmann , Judit Boda-Heggemann , Frank A. Giordano , Jens Fleckenstein","doi":"10.1016/j.phro.2025.100869","DOIUrl":"10.1016/j.phro.2025.100869","url":null,"abstract":"<div><h3>Background and purpose</h3><div>Cone beam computed tomography (CBCT)-based approaches for online adaptive radiation therapy (oART) have recently become clinically available for ultra-hypofractionated prostate stereotactic body radiation therapy (SBRT). This work assessed the changes of relevant dose-volume-histogram (DVH) parameters and their robustness against morphologic variations during adaptation.</div></div><div><h3>Materials and methods</h3><div>Sixteen prostate cancer patients were treated with oART in an SBRT regimen (40<!--> <!-->Gy in 5 treatment sessions (Tx) to the clinical target volume (CTV), PACE-B constraints). Two CBCTs were acquired daily: CBCT1 for adaptive planning and CBCT2 after adaptation for position verification. Adapted plans optimized on CBCT1 (ART<sub>CBCT1</sub>) were recalculated on CBCT2 (ART<sub>CBCT2</sub>) and compared to treatment plans on CBCT1 after image guidance (IGRT<sub>CBCT1</sub>) for relevant DVH metrics: V<sub>40Gy</sub>(CTV), V<sub>37Gy</sub>(bladder), V<sub>36Gy</sub>(rectum). Spearman’s rank coefficients r with p-values (5% significance level) were determined to analyze correlations between adaptation time (ΔT) and bladder filling as well as Tx and median prostate volume.</div></div><div><h3>Results</h3><div>oART improved median V<sub>40Gy</sub>(CTV) from 86% in IGRT<sub>CBCT1</sub> to 94% in ART<sub>CBCT2</sub>. Inter-fractional prostate swelling (<span><math><mrow><msub><mi>r</mi><mrow><mi>T</mi><mi>x</mi><mo>,</mo><mi>V</mi><mi>o</mi><mi>l</mi><mo>(</mo><mi>p</mi><mi>r</mi><mi>o</mi><mi>s</mi><mi>t</mi><mi>a</mi><mi>t</mi><mi>e</mi><mo>)</mo></mrow></msub><mrow><mo>=</mo><mn>0.98</mn><mo>,</mo><mi>p</mi><mo>=</mo><mn>0.005</mn><mo>)</mo></mrow></mrow></math></span> was responsible for CTV deviations. Bladder filling (<span><math><mrow><msub><mi>r</mi><mrow><mi>Δ</mi><mi>T</mi><mo>,</mo><mi>V</mi><mi>o</mi><mi>l</mi><mo>(</mo><mi>b</mi><mi>l</mi><mi>a</mi><mi>d</mi><mi>d</mi><mi>e</mi><mi>r</mi><mo>)</mo></mrow></msub><mo>=</mo><mn>0.34</mn><mo>,</mo><mi>p</mi><mo>=</mo><mn>0.002</mn></mrow></math></span>) and rectal gas migration during the median adaptation time <span><math><mrow><mi>Δ</mi><mi>T</mi><mo>=</mo><mn>24.0</mn><mspace></mspace><mi>m</mi><mi>i</mi><mi>n</mi></mrow></math></span> increased V<sub>37Gy</sub>(bladder) from 4.9 cm<sup>3</sup> in ART<sub>CBCT1</sub> to 6.5 cm<sup>3</sup> in ART<sub>CBCT2</sub> and V<sub>36Gy</sub>(rectum) from 0.5 cm<sup>3</sup> to 0.6 cm<sup>3</sup> and led to 10 constraint violations, each.</div></div><div><h3>Conclusion</h3><div>Compared to IGRT, daily oART substantially improved CTV coverage. Besides inter-fractional prostate swelling, constraint violations originated from seminal vesicles motion, rectal gas or bladder filling during adaptation. Treatment adaptation times should therefore be minimized whenever possible.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"36 ","pages":"Article 100869"},"PeriodicalIF":3.3,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145576374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}