Yongheng Yan , Xin Sun , Yuanhua Chen , Zihan Sun , SenXiang Yan , Zhongjie Lu , Feng Zhao
{"title":"Optimizing fractionation schedules for de-escalation radiotherapy in head and neck cancers using deep reinforcement learning","authors":"Yongheng Yan , Xin Sun , Yuanhua Chen , Zihan Sun , SenXiang Yan , Zhongjie Lu , Feng Zhao","doi":"10.1016/j.radonc.2025.110833","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>: Patients with locally-advanced head and neck squamous cell carcinomas (HNSCCs), particularly those related to human papillomavirus (HPV), often achieve good locoregional control (LRC), yet they suffer significant toxicities from standard chemoradiotherapy. This study aims to optimize the daily dose fractionation based on individual responses to radiotherapy (RT), minimizing toxicity while maintaining a low risk of LRC failure.</div></div><div><h3>Method:</h3><div>A virtual environment was developed to simulate tumor dynamics under RT for optimizing dose schedules. Patients predicted to maintain LRC were selected for de-escalation experiments. The proliferation saturation index (PSI) and linear-quadratic model were used to predict responses. A deep reinforcement learning (DRL) agent optimized fractionation schemes by interacting with the simulation environment, aiming to reduce the OAR’s biologically effective dose (BED) while preserving LRC. The impact of model uncertainty was analyzed and a support vector machine (SVM) model was used to segment parameter space and identify patients more robust to noise.</div></div><div><h3>Results:</h3><div>Personalized de-escalation plans were compared to conventional RT in a cohort of 5000 virtual patients. Personalized fractionation reduced the tumor dose and OAR’s BED by 29%, with an average OAR BED reduction of 5.61 ± 2.96 Gy. Prognostic outcomes were nearly identical, with 99.80% of patients in the low-risk LRC failure group. Model uncertainty impacted dosimetric indicators and prognosis, but the high-BED benefit group showed greater robustness to noise. SVM decision boundaries defined parameters range for patient selection.</div></div><div><h3>Conclusion:</h3><div>Optimizing fractionated doses based on patient responses minimizes toxicity while maintaining LRC in HNSCCs. Stratifying patients can mitigate model uncertainty and reduce treatment risks.</div></div>","PeriodicalId":21041,"journal":{"name":"Radiotherapy and Oncology","volume":"207 ","pages":"Article 110833"},"PeriodicalIF":4.9000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiotherapy and Oncology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167814025001288","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose
: Patients with locally-advanced head and neck squamous cell carcinomas (HNSCCs), particularly those related to human papillomavirus (HPV), often achieve good locoregional control (LRC), yet they suffer significant toxicities from standard chemoradiotherapy. This study aims to optimize the daily dose fractionation based on individual responses to radiotherapy (RT), minimizing toxicity while maintaining a low risk of LRC failure.
Method:
A virtual environment was developed to simulate tumor dynamics under RT for optimizing dose schedules. Patients predicted to maintain LRC were selected for de-escalation experiments. The proliferation saturation index (PSI) and linear-quadratic model were used to predict responses. A deep reinforcement learning (DRL) agent optimized fractionation schemes by interacting with the simulation environment, aiming to reduce the OAR’s biologically effective dose (BED) while preserving LRC. The impact of model uncertainty was analyzed and a support vector machine (SVM) model was used to segment parameter space and identify patients more robust to noise.
Results:
Personalized de-escalation plans were compared to conventional RT in a cohort of 5000 virtual patients. Personalized fractionation reduced the tumor dose and OAR’s BED by 29%, with an average OAR BED reduction of 5.61 ± 2.96 Gy. Prognostic outcomes were nearly identical, with 99.80% of patients in the low-risk LRC failure group. Model uncertainty impacted dosimetric indicators and prognosis, but the high-BED benefit group showed greater robustness to noise. SVM decision boundaries defined parameters range for patient selection.
Conclusion:
Optimizing fractionated doses based on patient responses minimizes toxicity while maintaining LRC in HNSCCs. Stratifying patients can mitigate model uncertainty and reduce treatment risks.
期刊介绍:
Radiotherapy and Oncology publishes papers describing original research as well as review articles. It covers areas of interest relating to radiation oncology. This includes: clinical radiotherapy, combined modality treatment, translational studies, epidemiological outcomes, imaging, dosimetry, and radiation therapy planning, experimental work in radiobiology, chemobiology, hyperthermia and tumour biology, as well as data science in radiation oncology and physics aspects relevant to oncology.Papers on more general aspects of interest to the radiation oncologist including chemotherapy, surgery and immunology are also published.