首页 > 最新文献

JCO Clinical Cancer Informatics最新文献

英文 中文
Time-Series Clustering Captures Patterns of Early Immune Effector Cell-Associated Hematotoxicity That Are Predictable Using Tree-Based Models. 时间序列聚类捕捉早期免疫效应细胞相关血液毒性的模式,使用基于树的模型可预测。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-14 DOI: 10.1200/CCI-25-00148
Emily C Liang, Yein Jeon, Yang Qiao, Xiancheng Wu, Jennifer J Huang, Andrew J Portuguese, Ryan Basom, Aiko Torkelson, Delaney Kirchmeier, Kristina Braathen, Andrew J Cowan, Mazyar Shadman, Alexandre V Hirayama, Brian G Till, Erik L Kimble, Qian Wu, Jordan Gauthier

Purpose: Immune effector cell-associated hematotoxicity (ICAHT) is a major cause of nonrelapse mortality after chimeric antigen receptor (CAR) T-cell therapy. We hypothesized that unsupervised time-series clustering could better identify archetypal patterns of early hematotoxicity compared to the early ICAHT (eICAHT) grading system.

Methods: We applied unsupervised k-means time-series clustering based on Euclidean distances to longitudinal absolute neutrophil count (ANC) data from days +0 through +30 post-CAR T-cell infusion in 691 patients treated at our center (training set: n = 483, 70%; test set: n = 208, 30%).

Results: Within our training set, we identified an optimal cluster solution based on four ANC recovery clusters, which were labeled as very good, good, poor, and very poor. We trained a random forest (RF) model including the top five most important features (day +3, +4, +5, +26, and +27 ANC values) to predict the cluster assignments. Within our test set, we applied the RF model to predict cluster assignments. Compared with the eICAHT criteria, the RF-predicted clusters were more compact and better separated (Dunn index: 0.078 v 0.034; average silhouette width: 0.12 v 0.010). In addition, the RF model identified patients in the good recovery cluster with intermediate overall survival (hazard ratio [HR], 1.70 [95% CI, 1.05 to 2.74]; P = .029; reference, very good), which was not captured by grade 2 eICAHT (HR, 1.37 [95% CI, 0.80 to 2.35]; P = .25; reference, grade 0-1).

Conclusion: Unsupervised time-series clustering identified distinct and clinically relevant patterns of hematotoxicity after CAR T-cell therapy. We trained and tested an RF model that accurately predicted cluster assignments using only five features. Predictions can be generated using our online web application.

目的:免疫效应细胞相关血液毒性(ICAHT)是嵌合抗原受体(CAR) t细胞治疗后非复发性死亡的主要原因。我们假设,与早期ICAHT (eICAHT)分级系统相比,无监督时间序列聚类可以更好地识别早期血液毒性的原型模式。方法:我们将基于欧几里得距离的无监督k-均值时间序列聚类应用于691名在我们中心接受治疗的患者(训练集:n = 483,70%;测试集:n = 208,30%) car - t细胞输注后+0至+30天的纵向绝对中性粒细胞计数(ANC)数据。结果:在我们的训练集中,我们确定了一个基于四个ANC恢复集群的最优集群解决方案,这些集群被标记为非常好、好、差和非常差。我们训练了一个随机森林(RF)模型,包括前五个最重要的特征(日+3、+4、+5、+26和+27 ANC值)来预测聚类分配。在我们的测试集中,我们应用RF模型来预测集群分配。与eICAHT标准相比,rf预测的聚类更紧凑,分离性更好(Dunn指数:0.078 v 0.034;平均轮廓宽度:0.12 v 0.010)。此外,RF模型确定了处于良好恢复组的患者,总生存率为中等(风险比[HR], 1.70 [95% CI, 1.05至2.74];P = 0.029;参考文献,非常好),2级eICAHT未捕获这些患者(风险比[HR], 1.37 [95% CI, 0.80至2.35];P = 0.25;参考文献,0-1级)。结论:无监督的时间序列聚类识别出CAR - t细胞治疗后血液毒性的独特和临床相关模式。我们训练并测试了一个RF模型,该模型仅使用五个特征就能准确地预测聚类分配。预测可以使用我们的在线web应用程序生成。
{"title":"Time-Series Clustering Captures Patterns of Early Immune Effector Cell-Associated Hematotoxicity That Are Predictable Using Tree-Based Models.","authors":"Emily C Liang, Yein Jeon, Yang Qiao, Xiancheng Wu, Jennifer J Huang, Andrew J Portuguese, Ryan Basom, Aiko Torkelson, Delaney Kirchmeier, Kristina Braathen, Andrew J Cowan, Mazyar Shadman, Alexandre V Hirayama, Brian G Till, Erik L Kimble, Qian Wu, Jordan Gauthier","doi":"10.1200/CCI-25-00148","DOIUrl":"10.1200/CCI-25-00148","url":null,"abstract":"<p><strong>Purpose: </strong>Immune effector cell-associated hematotoxicity (ICAHT) is a major cause of nonrelapse mortality after chimeric antigen receptor (CAR) T-cell therapy. We hypothesized that unsupervised time-series clustering could better identify archetypal patterns of early hematotoxicity compared to the early ICAHT (eICAHT) grading system.</p><p><strong>Methods: </strong>We applied unsupervised k-means time-series clustering based on Euclidean distances to longitudinal absolute neutrophil count (ANC) data from days +0 through +30 post-CAR T-cell infusion in 691 patients treated at our center (training set: n = 483, 70%; test set: n = 208, 30%).</p><p><strong>Results: </strong>Within our training set, we identified an optimal cluster solution based on four ANC recovery clusters, which were labeled as very good, good, poor, and very poor. We trained a random forest (RF) model including the top five most important features (day +3, +4, +5, +26, and +27 ANC values) to predict the cluster assignments. Within our test set, we applied the RF model to predict cluster assignments. Compared with the eICAHT criteria, the RF-predicted clusters were more compact and better separated (Dunn index: 0.078 <i>v</i> 0.034; average silhouette width: 0.12 <i>v</i> 0.010). In addition, the RF model identified patients in the good recovery cluster with intermediate overall survival (hazard ratio [HR], 1.70 [95% CI, 1.05 to 2.74]; <i>P</i> = .029; reference, very good), which was not captured by grade 2 eICAHT (HR, 1.37 [95% CI, 0.80 to 2.35]; <i>P</i> = .25; reference, grade 0-1).</p><p><strong>Conclusion: </strong>Unsupervised time-series clustering identified distinct and clinically relevant patterns of hematotoxicity after CAR T-cell therapy. We trained and tested an RF model that accurately predicted cluster assignments using only five features. Predictions can be generated using our online web application.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500148"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12810859/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145985901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of Real-World Response to First-Line Immunotherapy and Chemotherapy on Subsequent Treatment Outcomes in Patients With Advanced or Metastatic Non-Small Cell Lung Cancer. 一线免疫治疗和化疗的真实世界反应对晚期或转移性非小细胞肺癌患者后续治疗结果的影响
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-21 DOI: 10.1200/CCI-25-00207
Jyoti Malhotra, Shilpa Viswanathan, Shivani K Mhatre, Inderjit K Dhillon, Riddhi Patel, Nicole Yohn, Furaha Kariburyo-Yay, Xinye Li, Biagio Ricciuti

Purpose: This study examined real-world overall survival (rwOS) in patients with advanced or metastatic non-small cell lung cancer (a/mNSCLC) treated with combination immunotherapy (IO) and platinum chemotherapy in first line (1L), followed by second-line or beyond (2L+) non-IO, nonplatinum chemotherapy and explored the association between real-world duration of response (rwDOR) to 1L treatment and rwOS on the 2L+ treatment.

Methods: This study used two US-based data sets: ConcertAI Patient360 NSCLC data set (ConcertAI) and the Flatiron Health Research Database (FHRD), and included adults with a/mNSCLC diagnosed from January 1, 2018, to March 31, 2023 (data cutoff: March 31, 2024). Kaplan-Meier and multivariate Cox regression analyses estimated rwOS for the index regimen by rwDOR to 1L.

Results: Patients with rwDOR ≤6 (≈60%) v >6 months (≈40%) in 1L were similar across the 596 ConcertAI patients and 1,094 FHRD patients. Across the ConcertAI data set/FHRD, 52.6%/55.7% of patients achieved complete/partial response as real-world best overall response to 1L combination IO and platinum chemotherapy and 17.8%/19.1% had stable disease. The median rwOS on 2L+ treatment was 8.3 v 5.2 months (P = .001; ConcertAI) and 8.3 v 5.1 months (P < .001; FHRD) for patients with 1L rwDOR >6 v ≤6 months. The adjusted hazard ratio for patients with 1L rwDOR >6 v ≤6 months was 0.74 (95% CI, 0.61 to 0.90; P = .002) and 0.76 (95% CI, 0.67 to 0.88; P < .001) in the ConcertAI data set and FHRD, respectively.

Conclusion: Our findings demonstrate that patients with rwDOR ≥6 months on 1L combination IO and platinum chemotherapy exhibit longer rwOS on subsequent treatments. This emphasizes the need for 1L treatments that extend DOR and delay the onset of acquired resistance, which remains an unmet need for approximately 60% of patients who do not achieve a sustained response in clinical practice.

目的:本研究考察了在一线(1L)联合免疫治疗(IO)和铂类化疗后,二线或二线以上(2L+)非IO、非铂类化疗的晚期或转移性非小细胞肺癌(a/mNSCLC)患者的真实总生存期(rwOS),并探讨了1L治疗的真实反应时间(rwDOR)与2L+治疗的rwOS之间的关系。方法:本研究使用了两个基于美国的数据集:ConcertAI Patient360 NSCLC数据集(ConcertAI)和Flatiron健康研究数据库(FHRD),并纳入了2018年1月1日至2023年3月31日诊断为a/mNSCLC的成年人(数据截止日期:2024年3月31日)。Kaplan-Meier和多变量Cox回归分析以rwDOR为1L估计指标方案的rwOS。结果:596例ConcertAI患者和1094例FHRD患者在1L中rwDOR≤6(≈60%)v >6个月(≈40%)的患者相似。在ConcertAI数据集/FHRD中,52.6%/55.7%的患者对1L IO联合铂化疗达到完全/部分缓解,达到真实世界最佳总体缓解,17.8%/19.1%的患者病情稳定。2L+治疗的中位rwOS为8.3 v 5.2个月(P = 0.001; ConcertAI), 1L rwDOR≤6个月的患者中位rwOS为8.3 v 5.1个月(P < 0.001; FHRD)。在ConcertAI数据集和FHRD中,1L rwDOR≤6个月患者的校正危险比分别为0.74 (95% CI, 0.61 ~ 0.90, P = 0.002)和0.76 (95% CI, 0.67 ~ 0.88, P < 0.001)。结论:我们的研究结果表明,rwDOR≥6个月的1L IO联合铂化疗患者在后续治疗中表现出更长的rwOS。这强调了l治疗的必要性,延长DOR和延迟获得性耐药的发生,对于在临床实践中没有实现持续反应的大约60%的患者来说,这仍然是一个未满足的需求。
{"title":"Impact of Real-World Response to First-Line Immunotherapy and Chemotherapy on Subsequent Treatment Outcomes in Patients With Advanced or Metastatic Non-Small Cell Lung Cancer.","authors":"Jyoti Malhotra, Shilpa Viswanathan, Shivani K Mhatre, Inderjit K Dhillon, Riddhi Patel, Nicole Yohn, Furaha Kariburyo-Yay, Xinye Li, Biagio Ricciuti","doi":"10.1200/CCI-25-00207","DOIUrl":"https://doi.org/10.1200/CCI-25-00207","url":null,"abstract":"<p><strong>Purpose: </strong>This study examined real-world overall survival (rwOS) in patients with advanced or metastatic non-small cell lung cancer (a/mNSCLC) treated with combination immunotherapy (IO) and platinum chemotherapy in first line (1L), followed by second-line or beyond (2L+) non-IO, nonplatinum chemotherapy and explored the association between real-world duration of response (rwDOR) to 1L treatment and rwOS on the 2L+ treatment.</p><p><strong>Methods: </strong>This study used two US-based data sets: ConcertAI Patient360 NSCLC data set (ConcertAI) and the Flatiron Health Research Database (FHRD), and included adults with a/mNSCLC diagnosed from January 1, 2018, to March 31, 2023 (data cutoff: March 31, 2024). Kaplan-Meier and multivariate Cox regression analyses estimated rwOS for the index regimen by rwDOR to 1L.</p><p><strong>Results: </strong>Patients with rwDOR ≤6 (≈60%) <i>v</i> >6 months (≈40%) in 1L were similar across the 596 ConcertAI patients and 1,094 FHRD patients. Across the ConcertAI data set/FHRD, 52.6%/55.7% of patients achieved complete/partial response as real-world best overall response to 1L combination IO and platinum chemotherapy and 17.8%/19.1% had stable disease. The median rwOS on 2L+ treatment was 8.3 <i>v</i> 5.2 months (<i>P</i> = .001; ConcertAI) and 8.3 <i>v</i> 5.1 months (<i>P</i> < .001; FHRD) for patients with 1L rwDOR >6 <i>v</i> ≤6 months. The adjusted hazard ratio for patients with 1L rwDOR >6 <i>v</i> ≤6 months was 0.74 (95% CI, 0.61 to 0.90; <i>P</i> = .002) and 0.76 (95% CI, 0.67 to 0.88; <i>P</i> < .001) in the ConcertAI data set and FHRD, respectively.</p><p><strong>Conclusion: </strong>Our findings demonstrate that patients with rwDOR ≥6 months on 1L combination IO and platinum chemotherapy exhibit longer rwOS on subsequent treatments. This emphasizes the need for 1L treatments that extend DOR and delay the onset of acquired resistance, which remains an unmet need for approximately 60% of patients who do not achieve a sustained response in clinical practice.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500207"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146020548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Novel Electronic Health Record-Based Data Commons for Pancreatic Cancer. 新型基于胰腺癌电子健康记录的数据共享
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-09 DOI: 10.1200/CCI-24-00265
Kaleem S Ahmed, Clayton T Marcinak, Muhammad Maisam Ali, Sheriff M Issaka, Yonghe Yan, Gabriel McMahan, Thomas Callaci, Noelle K LoConte, Andrea Shiefelbein, Sharon Weber, Majid Afshar, Matthew M Churpek, Jomol Mathew, Syed Nabeel Zafar

Purpose: Clinical research in pancreatic cancer (PC) has been limited because of a lack of granular data in national data sets. An electronic health record (EHR)-based data set specifically designed for PC has immense potential to advance research. This study describes the creation of an EHR-based data commons for patients with PC.

Methods: We generated an index cohort of adult patients at our institution diagnosed with PC (International Classification of Diseases for Oncology, codes C25.0-25.9) between January 1, 2010, and December 31, 2023. To develop the Pancreatic Cancer Data Commons (PCDC), we linked six data sources: (1) institutional EHR data, (2) cancer-specific data from the North American Association of Central Cancer Registries, (3) surgical outcomes from the National Surgical Quality Improvement Program, (4) community-level data from the American Community Survey, (5) national mortality data from Obituary.com, and (6) genomic data from the cBioPortal for Cancer Genomics. We evaluated the feasibility of using the Observational Medical Outcomes Partnership common data model. The data set is stored on a cloud-based, Health Insurance Portability and Accountability Act-secure, and National Institute of Standards and Technology-compliant server.

Results: The PCDC currently includes data of 3,542 unique patients. The mean age at diagnosis is 66.6 ± 11.7 years; 53.3% is male, and 92.2% is White. Linkage to six national data sets increased the completeness of cancer-specific data from 31.3% to 71.6%. Most patients presented at stage IV (43.6%), followed by stage I (22.6%). As of the latest update, 1,074 (30.3%) patients were still alive.

Conclusion: The PCDC is a centralized resource that solves a gap in PC research. The ability to securely link and analyze protected patient data is a strategic step toward enhancing clinical research and optimizing care for patients with PC. Our future work includes expanding the PCDC to multiple centers using common data models.

目的:由于缺乏国家数据集的颗粒数据,胰腺癌(PC)的临床研究受到限制。专门为个人电脑设计的基于电子健康记录(EHR)的数据集具有推进研究的巨大潜力。本研究描述了为PC患者创建基于ehr的数据共享。方法:我们对2010年1月1日至2023年12月31日在我院诊断为PC(国际肿瘤疾病分类,代码C25.0-25.9)的成年患者进行了索引队列研究。为了开发胰腺癌数据共享(PCDC),我们链接了六个数据源:(1)机构电子病历数据,(2)来自北美中央癌症登记处协会的癌症特异性数据,(3)来自国家手术质量改进计划的手术结果,(4)来自美国社区调查的社区水平数据,(5)来自Obituary.com的全国死亡率数据,(6)来自癌症基因组学cBioPortal的基因组数据。我们评估了使用观察性医疗结果伙伴关系通用数据模型的可行性。数据集存储在基于云的、健康保险可移植性和责任法案安全的、符合美国国家标准与技术研究所标准的服务器上。结果:PCDC目前包括3,542例独特患者的数据。平均诊断年龄66.6±11.7岁;男性占53.3%,白人占92.2%。与六个国家数据集的联系将癌症特异性数据的完整性从31.3%提高到71.6%。大多数患者出现在IV期(43.6%),其次是I期(22.6%)。截至最新数据,1074名(30.3%)患者仍然活着。结论:PCDC是解决PC研究空白的集中资源。安全链接和分析受保护的患者数据的能力是加强临床研究和优化PC患者护理的战略步骤。我们未来的工作包括使用通用数据模型将PCDC扩展到多个中心。
{"title":"Novel Electronic Health Record-Based Data Commons for Pancreatic Cancer.","authors":"Kaleem S Ahmed, Clayton T Marcinak, Muhammad Maisam Ali, Sheriff M Issaka, Yonghe Yan, Gabriel McMahan, Thomas Callaci, Noelle K LoConte, Andrea Shiefelbein, Sharon Weber, Majid Afshar, Matthew M Churpek, Jomol Mathew, Syed Nabeel Zafar","doi":"10.1200/CCI-24-00265","DOIUrl":"10.1200/CCI-24-00265","url":null,"abstract":"<p><strong>Purpose: </strong>Clinical research in pancreatic cancer (PC) has been limited because of a lack of granular data in national data sets. An electronic health record (EHR)-based data set specifically designed for PC has immense potential to advance research. This study describes the creation of an EHR-based data commons for patients with PC.</p><p><strong>Methods: </strong>We generated an index cohort of adult patients at our institution diagnosed with PC (International Classification of Diseases for Oncology, codes C25.0-25.9) between January 1, 2010, and December 31, 2023. To develop the Pancreatic Cancer Data Commons (PCDC), we linked six data sources: (1) institutional EHR data, (2) cancer-specific data from the North American Association of Central Cancer Registries, (3) surgical outcomes from the National Surgical Quality Improvement Program, (4) community-level data from the American Community Survey, (5) national mortality data from Obituary.com, and (6) genomic data from the cBioPortal for Cancer Genomics. We evaluated the feasibility of using the Observational Medical Outcomes Partnership common data model. The data set is stored on a cloud-based, Health Insurance Portability and Accountability Act-secure, and National Institute of Standards and Technology-compliant server.</p><p><strong>Results: </strong>The PCDC currently includes data of 3,542 unique patients. The mean age at diagnosis is 66.6 ± 11.7 years; 53.3% is male, and 92.2% is White. Linkage to six national data sets increased the completeness of cancer-specific data from 31.3% to 71.6%. Most patients presented at stage IV (43.6%), followed by stage I (22.6%). As of the latest update, 1,074 (30.3%) patients were still alive.</p><p><strong>Conclusion: </strong>The PCDC is a centralized resource that solves a gap in PC research. The ability to securely link and analyze protected patient data is a strategic step toward enhancing clinical research and optimizing care for patients with PC. Our future work includes expanding the PCDC to multiple centers using common data models.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2400265"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12795310/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145946747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PREPARE ALL: An Artificial Intelligence Tool for Predicting Relapse in Children With Acute Lymphoblastic Leukemia. PREPARE ALL:预测急性淋巴细胞白血病儿童复发的人工智能工具。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-21 DOI: 10.1200/CCI-25-00222
Subikksha Saravanan, Raghunathan Rengaswamy, Gaurav Narula, Sameer Bakhshi, Rachna Seth, Nandana Das, Manash Pratim Gogoi, Shripad Banavali, Prasanth Srinivasan, Gargi Das, T K Balaji, Shekar Krishnan, Vaskar Saha, Vijayalakshmi Ramshankar, Venkatraman Radhakrishnan

Purpose: The Pediatric Relapse Prediction and Risk Evaluation for Acute Lymphoblastic Leukemia (PREPARE-ALL) tool aims to predict relapse in pediatric ALL by integrating clinical expertise with artificial intelligence and machine learning (ML), particularly Extreme Gradient Boosting (XGBoost). PREPARE-ALL demonstrates that multicenter, protocol-driven clinical and laboratory data can be used through ML to generate reproducible relapse predictions with greater sensitivity than individual clinician assessments.

Methods: PREPARE-ALL was developed using data from the ICiCLe ALL-14 pretrial cohort across five centers, incorporating 33 clinical and laboratory features.

Results: Among 2,252 patients enrolled in the study, 565 (25.1%) relapsed. Using an 80:20 train-test split, XGBoost achieved a sensitivity of 68.5% (245/447 relapses detected). Additional metrics included a positive predictive value of 31.3%, a negative predictive value of 82.8%, an accuracy of 54.8%, and a specificity of 50.3%. Key predictors of relapse included high hyperdiploidy and BCR-ABL1 fusion positive, positive measurable residual disease status at the end of induction, sex, age, highest presenting WBC, and final risk group. Three clinicians scored the validation data set; the developed model achieved a higher recall (68.5%) compared with clinical judgment (approximately 31%-36%).

Conclusion: PREPARE-ALL identifies twice as many relapses as clinicians and serves as a practical decision-support tool for early relapse triage and treatment planning, enabling timely therapeutic adjustments and improved outcomes in pediatric ALL.

目的:儿科急性淋巴细胞白血病复发预测和风险评估(PREPARE-ALL)工具旨在通过将临床专业知识与人工智能和机器学习(ML),特别是极限梯度增强(XGBoost)相结合,预测儿科ALL的复发。PREPARE-ALL表明,通过ML可以使用多中心、协议驱动的临床和实验室数据来生成可重复的复发预测,其灵敏度高于单个临床医生的评估。方法:PREPARE-ALL是利用来自5个中心的ICiCLe ALL-14试验前队列的数据开发的,包括33个临床和实验室特征。结果:在纳入研究的2252例患者中,565例(25.1%)复发。使用80:20的列车测试分割,XGBoost实现了68.5%的灵敏度(检测到245/447次复发)。其他指标包括阳性预测值31.3%,阴性预测值82.8%,准确率54.8%,特异性50.3%。复发的关键预测因素包括高二倍体和BCR-ABL1融合阳性,诱导结束时可测量的阳性残留疾病状态,性别,年龄,最高呈现WBC和最终危险组。三位临床医生对验证数据集进行评分;与临床判断(约31%-36%)相比,开发的模型实现了更高的召回率(68.5%)。结论:prep -ALL识别的复发率是临床医生的两倍,可作为早期复发分诊和治疗计划的实用决策支持工具,能够及时调整治疗并改善儿科ALL的预后。
{"title":"PREPARE ALL: An Artificial Intelligence Tool for Predicting Relapse in Children With Acute Lymphoblastic Leukemia.","authors":"Subikksha Saravanan, Raghunathan Rengaswamy, Gaurav Narula, Sameer Bakhshi, Rachna Seth, Nandana Das, Manash Pratim Gogoi, Shripad Banavali, Prasanth Srinivasan, Gargi Das, T K Balaji, Shekar Krishnan, Vaskar Saha, Vijayalakshmi Ramshankar, Venkatraman Radhakrishnan","doi":"10.1200/CCI-25-00222","DOIUrl":"10.1200/CCI-25-00222","url":null,"abstract":"<p><strong>Purpose: </strong>The Pediatric Relapse Prediction and Risk Evaluation for Acute Lymphoblastic Leukemia (PREPARE-ALL) tool aims to predict relapse in pediatric ALL by integrating clinical expertise with artificial intelligence and machine learning (ML), particularly Extreme Gradient Boosting (XGBoost). PREPARE-ALL demonstrates that multicenter, protocol-driven clinical and laboratory data can be used through ML to generate reproducible relapse predictions with greater sensitivity than individual clinician assessments.</p><p><strong>Methods: </strong>PREPARE-ALL was developed using data from the ICiCLe ALL-14 pretrial cohort across five centers, incorporating 33 clinical and laboratory features.</p><p><strong>Results: </strong>Among 2,252 patients enrolled in the study, 565 (25.1%) relapsed. Using an 80:20 train-test split, XGBoost achieved a sensitivity of 68.5% (245/447 relapses detected). Additional metrics included a positive predictive value of 31.3%, a negative predictive value of 82.8%, an accuracy of 54.8%, and a specificity of 50.3%. Key predictors of relapse included high hyperdiploidy and BCR-ABL1 fusion positive, positive measurable residual disease status at the end of induction, sex, age, highest presenting WBC, and final risk group. Three clinicians scored the validation data set; the developed model achieved a higher recall (68.5%) compared with clinical judgment (approximately 31%-36%).</p><p><strong>Conclusion: </strong>PREPARE-ALL identifies twice as many relapses as clinicians and serves as a practical decision-support tool for early relapse triage and treatment planning, enabling timely therapeutic adjustments and improved outcomes in pediatric ALL.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500222"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146020517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Causal Cascade of Symptoms on Patient Functioning and Health-Related Quality of Life in Non-Small Cell Lung Cancer and Metastatic Breast Cancer. 非小细胞肺癌和转移性乳腺癌患者功能和健康相关生活质量的因果级联症状
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-23 DOI: 10.1200/CCI-25-00127
Donald E Stull

Purpose: Health-related quality of life (HRQoL) is a valuable counterpart to other end points in oncology trials. However, when analyzing the effect of treatment on HRQoL, results are often mixed. Cancer and its treatment can have direct and indirect effects on symptoms such as pain, nausea, and fatigue. These, in turn, can affect the patients' physical and social functioning and ultimately their HRQoL. This causal cascade requires methodologies, such as structural equation modeling, to estimate direct, indirect, and total effects of symptoms on more distal HRQoL outcomes.

Methods: Data were obtained from two multicenter, randomized trials available from Project Data Sphere, LLC. One trial included patients with non-small cell lung cancer; the other included patients with metastatic breast cancer. Structural equation models were evaluated; models included symptom items from the European Organization for Research and Treatment of Cancer (EORTC) core questionnaire, QLQ-C30 (version 3). A causal cascade of individual symptoms at two time points was hypothesized to affect fatigue, which, in turn, had direct and indirect effects on the functional and HRQoL measures.

Results: Results showed a logical, causal ordering among EORTC QLQ-C30 outcomes. In the final model, the effects of most symptoms on functional domains and the quality of life (QoL) scale were through their direct effect on fatigue, which, in turn, affected patient functioning and HRQoL.

Conclusion: Results strongly suggest that when examining the effects of symptoms on EORTC QLQ-C30 functional scales or HRQoL, symptoms would generally not show direct effects on these more distal outcomes. The results from the two clinical trials demonstrated that anticipating a causal cascade from specific symptoms to functional domains and HRQoL would yield more meaningful results.

目的:健康相关生活质量(HRQoL)是肿瘤学试验中与其他终点相对应的有价值的指标。然而,在分析治疗对HRQoL的影响时,结果往往喜忧参半。癌症及其治疗可对疼痛、恶心和疲劳等症状产生直接或间接的影响。这些反过来又会影响患者的身体和社交功能,最终影响他们的HRQoL。这种因果级联需要结构方程模型等方法来估计症状对更远端HRQoL结果的直接、间接和总影响。方法:数据来自Project Data Sphere, LLC提供的两项多中心随机试验。一项试验纳入非小细胞肺癌患者;另一组包括转移性乳腺癌患者。对结构方程模型进行评价;模型包括来自欧洲癌症研究和治疗组织(EORTC)核心问卷QLQ-C30(版本3)的症状项目。假设在两个时间点上个体症状的因果级联会影响疲劳,而疲劳反过来又对功能和HRQoL测量产生直接和间接影响。结果:结果显示EORTC QLQ-C30结果之间存在逻辑因果顺序。在最后的模型中,大多数症状对功能域和生活质量(QoL)量表的影响是通过它们对疲劳的直接影响来实现的,疲劳反过来又影响患者的功能和HRQoL。结论:结果强烈提示,在检查症状对EORTC QLQ-C30功能量表或HRQoL的影响时,症状通常不会对这些更远的结果显示直接影响。两项临床试验的结果表明,预测从特定症状到功能领域和HRQoL的因果级联将产生更有意义的结果。
{"title":"Causal Cascade of Symptoms on Patient Functioning and Health-Related Quality of Life in Non-Small Cell Lung Cancer and Metastatic Breast Cancer.","authors":"Donald E Stull","doi":"10.1200/CCI-25-00127","DOIUrl":"10.1200/CCI-25-00127","url":null,"abstract":"<p><strong>Purpose: </strong>Health-related quality of life (HRQoL) is a valuable counterpart to other end points in oncology trials. However, when analyzing the effect of treatment on HRQoL, results are often mixed. Cancer and its treatment can have direct and indirect effects on symptoms such as pain, nausea, and fatigue. These, in turn, can affect the patients' physical and social functioning and ultimately their HRQoL. This causal cascade requires methodologies, such as structural equation modeling, to estimate direct, indirect, and total effects of symptoms on more distal HRQoL outcomes.</p><p><strong>Methods: </strong>Data were obtained from two multicenter, randomized trials available from Project Data Sphere, LLC. One trial included patients with non-small cell lung cancer; the other included patients with metastatic breast cancer. Structural equation models were evaluated; models included symptom items from the European Organization for Research and Treatment of Cancer (EORTC) core questionnaire, QLQ-C30 (version 3). A causal cascade of individual symptoms at two time points was hypothesized to affect fatigue, which, in turn, had direct and indirect effects on the functional and HRQoL measures.</p><p><strong>Results: </strong>Results showed a logical, causal ordering among EORTC QLQ-C30 outcomes. In the final model, the effects of most symptoms on functional domains and the quality of life (QoL) scale were through their direct effect on fatigue, which, in turn, affected patient functioning and HRQoL.</p><p><strong>Conclusion: </strong>Results strongly suggest that when examining the effects of symptoms on EORTC QLQ-C30 functional scales or HRQoL, symptoms would generally not show direct effects on these more distal outcomes. The results from the two clinical trials demonstrated that anticipating a causal cascade from specific symptoms to functional domains and HRQoL would yield more meaningful results.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500127"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12834281/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146042168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ClinBioNGS: A Clinical Bioinformatics Pipeline for Integrated Analysis of Somatic Next-Generation Sequencing Cancer Panels. ClinBioNGS:用于下一代体细胞测序癌症面板综合分析的临床生物信息学管道。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-08 DOI: 10.1200/CCI-25-00221
Raúl Marín, Ania Alay, Maria Ajenjo-Bauza, Sara Hijazo-Pechero, Carla Montironi, Eva Hernández-Illán, Pedro Jares, Daniel Azuara, Mar Varela, August Vidal, Joan Anton Puig-Butillé, Conxi Lázaro, Víctor Moreno, David Cordero, Ernest Nadal, Xavier Solé

Purpose: Somatic next-generation sequencing (NGS) panels are widely used in precision oncology to detect clinically actionable genomic alterations. However, interpreting diverse DNA and RNA alterations remains challenging because of the complexity of tumor-only data and the limitations of current pipelines, which are often proprietary, noncustomizable, or lack visual reports to support clinical interpretation. We present ClinBioNGS, an open-source, panel-agnostic bioinformatics pipeline for the comprehensive analysis of somatic NGS cancer panels in both clinical and translational settings.

Materials and methods: ClinBioNGS is a modular, fully containerized workflow implemented in Nextflow. It supports integrated analysis of DNA and RNA data, including multicaller small variant detection, copy number alteration (CNA) profiling, gene fusion and splice variant identification, and evaluation of complex genomic biomarkers such as tumor mutational burden and microsatellite instability. Variants are annotated and prioritized using established clinical frameworks. The results are compiled in a self-contained interactive HTML report with dynamic tables and informative visualizations to facilitate clinical interpretation. Validation included SEQC2 reference data sets across six commercial panels, and benchmarking was performed on 2,024 clinical tumor samples analyzed with three commercial platforms.

Results: ClinBioNGS achieved high accuracy in SEQC2 validation, with precision (0.987-1.000), recall (0.920-0.997), and F1 scores (0.956-0.999) across diverse panels. In a clinical benchmarking with real-world data, the pipeline demonstrated high concordance with commercial solutions for small variants (97%), CNAs (89%), and RNA alterations (94%), while also identifying additional high-confidence alterations missed by vendor pipelines.

Conclusion: ClinBioNGS provides a robust, flexible, and transparent solution for standardized analysis of somatic NGS cancer panels. It supports reproducible, clinically oriented interpretation of genomic data and is freely available for noncommercial research-use only at GitHub.

目的:体细胞下一代测序(NGS)面板广泛应用于精确肿瘤学,以检测临床可操作的基因组改变。然而,解释不同的DNA和RNA改变仍然具有挑战性,因为仅肿瘤数据的复杂性和当前管道的局限性,这些管道通常是专有的,不可定制的,或者缺乏支持临床解释的可视化报告。我们提出ClinBioNGS,一个开源的、与小组无关的生物信息学管道,用于临床和转化环境中体细胞NGS癌症小组的综合分析。材料和方法:ClinBioNGS是在Nextflow中实现的模块化、完全容器化的工作流。它支持DNA和RNA数据的集成分析,包括多靶点小变异检测、拷贝数改变(CNA)分析、基因融合和剪接变异鉴定,以及复杂基因组生物标志物的评估,如肿瘤突变负担和微卫星不稳定性。变体使用已建立的临床框架进行注释和优先排序。结果汇编在一个独立的交互式HTML报告与动态表格和信息可视化,以方便临床解释。验证包括六个商业小组的SEQC2参考数据集,并在三个商业平台分析的2024个临床肿瘤样本上进行基准测试。结果:ClinBioNGS在SEQC2验证中具有较高的准确性,精密度(0.987-1.000),召回率(0.920-0.997),F1评分(0.956-0.999)。在真实世界数据的临床基准测试中,该管道与商业解决方案在小变异(97%)、CNAs(89%)和RNA改变(94%)方面表现出了高度的一致性,同时也识别了供应商管道遗漏的额外高置信度改变。结论:ClinBioNGS为体细胞NGS癌组的标准化分析提供了一个强大、灵活和透明的解决方案。它支持可重复的、临床导向的基因组数据解释,并且仅在GitHub上免费提供非商业研究使用。
{"title":"ClinBioNGS: A Clinical Bioinformatics Pipeline for Integrated Analysis of Somatic Next-Generation Sequencing Cancer Panels.","authors":"Raúl Marín, Ania Alay, Maria Ajenjo-Bauza, Sara Hijazo-Pechero, Carla Montironi, Eva Hernández-Illán, Pedro Jares, Daniel Azuara, Mar Varela, August Vidal, Joan Anton Puig-Butillé, Conxi Lázaro, Víctor Moreno, David Cordero, Ernest Nadal, Xavier Solé","doi":"10.1200/CCI-25-00221","DOIUrl":"https://doi.org/10.1200/CCI-25-00221","url":null,"abstract":"<p><strong>Purpose: </strong>Somatic next-generation sequencing (NGS) panels are widely used in precision oncology to detect clinically actionable genomic alterations. However, interpreting diverse DNA and RNA alterations remains challenging because of the complexity of tumor-only data and the limitations of current pipelines, which are often proprietary, noncustomizable, or lack visual reports to support clinical interpretation. We present ClinBioNGS, an open-source, panel-agnostic bioinformatics pipeline for the comprehensive analysis of somatic NGS cancer panels in both clinical and translational settings.</p><p><strong>Materials and methods: </strong>ClinBioNGS is a modular, fully containerized workflow implemented in Nextflow. It supports integrated analysis of DNA and RNA data, including multicaller small variant detection, copy number alteration (CNA) profiling, gene fusion and splice variant identification, and evaluation of complex genomic biomarkers such as tumor mutational burden and microsatellite instability. Variants are annotated and prioritized using established clinical frameworks. The results are compiled in a self-contained interactive HTML report with dynamic tables and informative visualizations to facilitate clinical interpretation. Validation included SEQC2 reference data sets across six commercial panels, and benchmarking was performed on 2,024 clinical tumor samples analyzed with three commercial platforms.</p><p><strong>Results: </strong>ClinBioNGS achieved high accuracy in SEQC2 validation, with precision (0.987-1.000), recall (0.920-0.997), and F1 scores (0.956-0.999) across diverse panels. In a clinical benchmarking with real-world data, the pipeline demonstrated high concordance with commercial solutions for small variants (97%), CNAs (89%), and RNA alterations (94%), while also identifying additional high-confidence alterations missed by vendor pipelines.</p><p><strong>Conclusion: </strong>ClinBioNGS provides a robust, flexible, and transparent solution for standardized analysis of somatic NGS cancer panels. It supports reproducible, clinically oriented interpretation of genomic data and is freely available for noncommercial research-use only at GitHub.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500221"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145935982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simulation-Based Evaluation of a Large Language Model-Enabled Clinical Decision Support Platform in Oncology. 基于模拟的肿瘤临床决策支持平台的大型语言模型评估。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-28 DOI: 10.1200/CCI-25-00244
Nesrine Lajmi, Mehul Patel, Gareth Obery, Archana Dorge, Ashish Sharma, Jack Halligan, Ernest Lo

Purpose: A core clinical task is to synthesize fragmented patient data into a coherent summary to support decision making. However, electronic health record (EHR) inefficiencies burden clinicians and contribute to their cognitive overload and burnout. This study evaluated the impact of a large language model (LLM)-enabled clinical decision support (LLM-CDS) platform compared with a simulated EHR (SimEPR) on workflow efficiency and user experience in generating accurate clinical summaries during tumor board preparation and explored its applicability to consultation preparation, referrals, treatment planning, and patient communication.

Methods: In a remote, within-participant simulation, 26 oncologists from the United Kingdom, United States, Spain, and Singapore reviewed synthetic breast cancer cases and created comprehensive summaries for tumor board discussions using both LLM-CDS and SimEPR. LLM-CDS provided editable LLM-generated summaries; SimEPR required manual composition. Time to task completion was recorded. An independent reviewer assessed summary quality based on completeness, correctness, and conciseness. Participants also completed surveys on usability, cognitive load, and feature acceptability.

Results: LLM-CDS significantly reduced the summary completion time compared with SimEPR (6:55 v 8:47 minutes; P < .001). Summary completeness was rated higher with LLM-CDS (mean score, 3.93 v 3.13), whereas correctness and conciseness were similar. Overall, 87% of participants would recommend LLM-CDS and 96% would anticipate time savings. The system usability scale score for LLM-CDS was 65.7. Although perceived cognitive load was lower with LLM-CDS, the difference was not statistically significant. The LLM summary was the most valued, with 92% finding it useful for the tumor board and consultation preparation.

Conclusion: The LLM-CDS platform improved the efficiency and completeness of clinical summarization. Strong user acceptance and anticipated time savings underscore the potential for streamlining a range of oncology workflows.

目的:一项核心临床任务是将零散的患者数据综合成连贯的摘要,以支持决策。然而,电子健康记录(EHR)效率低下给临床医生带来负担,并导致他们的认知超载和倦怠。本研究评估了大型语言模型(LLM)支持的临床决策支持(LLM- cds)平台与模拟EHR (SimEPR)在肿瘤板制备过程中生成准确临床摘要的工作流程效率和用户体验方面的影响,并探讨了其在会诊准备、转诊、治疗计划和患者沟通方面的适用性。方法:在远程参与者模拟中,来自英国、美国、西班牙和新加坡的26名肿瘤学家回顾了合成乳腺癌病例,并使用LLM-CDS和SimEPR为肿瘤委员会讨论创建了综合摘要。LLM-CDS提供可编辑的llm生成的摘要;SimEPR需要手工合成。记录完成任务的时间。一个独立的审稿人根据完整性、正确性和简洁性评估摘要的质量。参与者还完成了关于可用性、认知负荷和功能可接受性的调查。结果:与SimEPR相比,LLM-CDS显著缩短了汇总完成时间(6:55 vs 8:47分钟;P < 0.001)。LLM-CDS的总结完整性评分更高(平均评分3.93 v 3.13),而正确性和简洁性相似。总体而言,87%的参与者会推荐LLM-CDS, 96%的人预计会节省时间。LLM-CDS的系统可用性量表得分为65.7分。虽然LLM-CDS组的认知负荷较低,但差异无统计学意义。法学硕士总结是最有价值的,92%的人认为它对肿瘤板和会诊准备有用。结论:LLM-CDS平台提高了临床总结的效率和完整性。强大的用户接受度和预期的时间节省强调了简化肿瘤工作流程的潜力。
{"title":"Simulation-Based Evaluation of a Large Language Model-Enabled Clinical Decision Support Platform in Oncology.","authors":"Nesrine Lajmi, Mehul Patel, Gareth Obery, Archana Dorge, Ashish Sharma, Jack Halligan, Ernest Lo","doi":"10.1200/CCI-25-00244","DOIUrl":"https://doi.org/10.1200/CCI-25-00244","url":null,"abstract":"<p><strong>Purpose: </strong>A core clinical task is to synthesize fragmented patient data into a coherent summary to support decision making. However, electronic health record (EHR) inefficiencies burden clinicians and contribute to their cognitive overload and burnout. This study evaluated the impact of a large language model (LLM)-enabled clinical decision support (LLM-CDS) platform compared with a simulated EHR (SimEPR) on workflow efficiency and user experience in generating accurate clinical summaries during tumor board preparation and explored its applicability to consultation preparation, referrals, treatment planning, and patient communication.</p><p><strong>Methods: </strong>In a remote, within-participant simulation, 26 oncologists from the United Kingdom, United States, Spain, and Singapore reviewed synthetic breast cancer cases and created comprehensive summaries for tumor board discussions using both LLM-CDS and SimEPR. LLM-CDS provided editable LLM-generated summaries; SimEPR required manual composition. Time to task completion was recorded. An independent reviewer assessed summary quality based on completeness, correctness, and conciseness. Participants also completed surveys on usability, cognitive load, and feature acceptability.</p><p><strong>Results: </strong>LLM-CDS significantly reduced the summary completion time compared with SimEPR (6:55 <i>v</i> 8:47 minutes; <i>P</i> < .001). Summary completeness was rated higher with LLM-CDS (mean score, 3.93 <i>v</i> 3.13), whereas correctness and conciseness were similar. Overall, 87% of participants would recommend LLM-CDS and 96% would anticipate time savings. The system usability scale score for LLM-CDS was 65.7. Although perceived cognitive load was lower with LLM-CDS, the difference was not statistically significant. The LLM summary was the most valued, with 92% finding it useful for the tumor board and consultation preparation.</p><p><strong>Conclusion: </strong>The LLM-CDS platform improved the efficiency and completeness of clinical summarization. Strong user acceptance and anticipated time savings underscore the potential for streamlining a range of oncology workflows.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500244"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12863596/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146114927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing the Detection Power of Genome-Wide Copy Number Variation Profiles in Prostate Cancer Using Simulated Shallow Whole-Genome Sequencing Data. 利用模拟浅全基因组测序数据评估前列腺癌全基因组拷贝数变异谱的检测能力。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-23 DOI: 10.1200/CCI-25-00240
Samhita Pamidimarri Naga, Peter H J Slootbeek, Sofie H Tolmeijer, Christian Gillissen, Marjolijn J L Ligtenberg, Niven Mehra, Richarda M de Voer

Purpose: Shallow whole-genome sequencing (sWGS) is a cost-effective approach for detecting genome wide copy number profiles in tumor samples. In metastatic castration-resistant prostate cancer (mCRPC), recognizing homologous recombination deficiency (HRD) and tandem duplication (TD) genomic profiles may contribute to improved treatment choices such as poly (ADP-ribose) polymerase inhibitors. This study aims to determine the minimum sequencing depth and tumor content (TC) required to accurately identify these clinically significant genomic profiles using sWGS.

Materials and methods: Whole-genome sequencing (WGS) data from 168 tumor and matched normal biopsies from 155 patients with mCRPC were mixed in silico to generate a set of 3,360 mixtures with varying TCs (original, 20%, 10%, 5%, 3%) and sequencing depths (original, 5×, 2×, 1×, 0.1×). Copy number variations (CNVs) were analyzed using ichorCNA and WisecondorX at different window sizes.

Results: An average sequencing depth of 1× at 20% TC was found to be sufficient to detect CNVs with high sensitivity (>0.85) and high specificity (>0.95). For HRD and TD profile detection, ichorCNA at a 50 Kb window size was optimal and a reliable detection of HRD profiles was achieved with a very strong correlation of R = 0.88 (P < 2.2e-16). Detection of TD profiles also remained accurate at these parameters with a strong correlation of R = 0.72 (P < 2.2e-16), although the median length of duplication events increased at lower depths. TC estimation by ichorCNA strongly correlated with full-depth WGS of diploid genomes.

Conclusion: In this study, through in silico simulations of WGS data, we demonstrate that the genomic scars of two druggable genomic profiles, HRD and TD, can be reliably detected in mCRPC with 1× average sequencing depth and ≥20% TC. Further research is required to correlate these markers with outcome of specific treatments using sWGS.

目的:浅全基因组测序(sWGS)是检测肿瘤样本基因组拷贝数谱的一种经济有效的方法。在转移性去势抵抗性前列腺癌(mCRPC)中,认识到同源重组缺陷(HRD)和串联重复(TD)基因组谱可能有助于改善治疗选择,如聚(adp -核糖)聚合酶抑制剂。本研究旨在确定使用sWGS准确识别这些具有临床意义的基因组图谱所需的最小测序深度和肿瘤含量(TC)。材料和方法:将来自155例mCRPC患者的168例肿瘤和匹配的正常活检组织的全基因组测序(WGS)数据在硅片上混合,生成一组3360个不同tc(原始、20%、10%、5%、3%)和测序深度(原始、5×、2×、1×、0.1×)的混合物。使用ichorCNA和WisecondorX分析不同窗口大小下的拷贝数变异(CNVs)。结果:在20% TC下,平均测序深度为1×足以检测CNVs,具有高灵敏度(>0.85)和高特异性(>0.95)。对于HRD和TD剖面检测,50 Kb窗口大小的ichorCNA是最佳的,并且实现了可靠的HRD剖面检测,相关性非常强,R = 0.88 (P < 2.2e-16)。尽管重复事件的中位数长度在较低深度增加,但在这些参数下,TD剖面的检测仍然保持准确,相关性为R = 0.72 (P < 2.2 -16)。ichorCNA估计的TC与二倍体基因组的全深度WGS密切相关。结论:本研究通过对WGS数据的计算机模拟,证明在平均测序深度为1倍、TC≥20%的mCRPC中,可以可靠地检测到HRD和TD两种可药物基因组谱的基因组疤痕。需要进一步的研究将这些标记物与使用sWGS的特定治疗结果联系起来。
{"title":"Assessing the Detection Power of Genome-Wide Copy Number Variation Profiles in Prostate Cancer Using Simulated Shallow Whole-Genome Sequencing Data.","authors":"Samhita Pamidimarri Naga, Peter H J Slootbeek, Sofie H Tolmeijer, Christian Gillissen, Marjolijn J L Ligtenberg, Niven Mehra, Richarda M de Voer","doi":"10.1200/CCI-25-00240","DOIUrl":"10.1200/CCI-25-00240","url":null,"abstract":"<p><strong>Purpose: </strong>Shallow whole-genome sequencing (sWGS) is a cost-effective approach for detecting genome wide copy number profiles in tumor samples. In metastatic castration-resistant prostate cancer (mCRPC), recognizing homologous recombination deficiency (HRD) and tandem duplication (TD) genomic profiles may contribute to improved treatment choices such as poly (ADP-ribose) polymerase inhibitors. This study aims to determine the minimum sequencing depth and tumor content (TC) required to accurately identify these clinically significant genomic profiles using sWGS.</p><p><strong>Materials and methods: </strong>Whole-genome sequencing (WGS) data from 168 tumor and matched normal biopsies from 155 patients with mCRPC were mixed in silico to generate a set of 3,360 mixtures with varying TCs (original, 20%, 10%, 5%, 3%) and sequencing depths (original, 5×, 2×, 1×, 0.1×). Copy number variations (CNVs) were analyzed using ichorCNA and WisecondorX at different window sizes.</p><p><strong>Results: </strong>An average sequencing depth of 1× at 20% TC was found to be sufficient to detect CNVs with high sensitivity (>0.85) and high specificity (>0.95). For HRD and TD profile detection, ichorCNA at a 50 Kb window size was optimal and a reliable detection of HRD profiles was achieved with a very strong correlation of R = 0.88 (<i>P</i> < 2.2e-16). Detection of TD profiles also remained accurate at these parameters with a strong correlation of R = 0.72 (<i>P</i> < 2.2e-16), although the median length of duplication events increased at lower depths. TC estimation by ichorCNA strongly correlated with full-depth WGS of diploid genomes.</p><p><strong>Conclusion: </strong>In this study, through in silico simulations of WGS data, we demonstrate that the genomic scars of two druggable genomic profiles, HRD and TD, can be reliably detected in mCRPC with 1× average sequencing depth and ≥20% TC. Further research is required to correlate these markers with outcome of specific treatments using sWGS.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500240"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12834301/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146042138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting the Cure Model Appropriateness in Randomized Clinical Trials With Long-Term Survivors. 在长期幸存者随机临床试验中检测治疗模式的适宜性。
IF 2.8 Q2 ONCOLOGY Pub Date : 2025-12-01 Epub Date: 2025-12-15 DOI: 10.1200/CCI-25-00084
Cheryl Kouadio, Subodh Selukar, Megan Othus, Sylvie Chevret

Purpose: To evaluate the appropriateness of a cure model when analyzing right-censored end points of a randomized clinical trial (RCT) in malignancy in the presence of long-term survivors. We aim to derive how the ratio estimation of censored cured subjects (RECeUS), previously proposed for a homogeneous population, could be extended for use in RCTs.

Methods: Based on the RECeUS method, four decision rules were considered to assess the appropriateness of a cure model. They considered the eligibility conditions to be met: in both arms, in at least one randomized arm, in the entire sample, or when only considering an average of the conditions, respectively. A simulation study was performed to evaluate their performance and the impact of the link function when considering the appropriateness of cure models. We also illustrate the method using two real data examples from two RCTs conducted in patients with acute leukemia and COVID-19 disease.

Results: Simulation results show that the best decision rule that can be applied in all considered treatment effect scenarios might be to check the criteria in at least one randomized arm. Regardless of the rules, the cure model appeared to be appropriate in both RCT data.

Conclusion: When analyzing survival data from RCTs, the appropriateness of a cure model could be considered in the face of a plateau shape of the survival curves. To ensure that the presence of such a plateau in the survival curves is a reliable indicator of the presence of cured patients in the population, the RECeUS method should be used in each randomized arm separately, with criteria met in at least one randomized arm.

目的:在分析恶性肿瘤长期存活患者的随机临床试验(RCT)右截尾终点时,评估一种治愈模型的适宜性。我们的目标是推导出如何将先前针对同质人群提出的审查治愈受试者(RECeUS)的比率估计扩展到随机对照试验中。方法:基于RECeUS方法,采用四种决策规则来评估治疗模型的适宜性。他们考虑了需要满足的资格条件:在两个组中,在至少一个随机组中,在整个样本中,或者分别只考虑条件的平均值。在考虑治疗模型的适当性时,进行了模拟研究以评估它们的性能和链接函数的影响。我们还使用来自急性白血病和COVID-19疾病患者的两项随机对照试验的两个真实数据示例来说明该方法。结果:模拟结果显示,在所有考虑的治疗效果场景中,最佳决策规则可能是至少在一个随机分组中检查标准。无论规则如何,治愈模型在两个RCT数据中似乎都是合适的。结论:在分析随机对照试验的生存数据时,面对生存曲线的平台形状,可以考虑治疗模型的适用性。为了确保生存曲线中存在这样一个平台是人群中存在治愈患者的可靠指标,应在每个随机分组中单独使用RECeUS方法,至少在一个随机分组中满足标准。
{"title":"Detecting the Cure Model Appropriateness in Randomized Clinical Trials With Long-Term Survivors.","authors":"Cheryl Kouadio, Subodh Selukar, Megan Othus, Sylvie Chevret","doi":"10.1200/CCI-25-00084","DOIUrl":"https://doi.org/10.1200/CCI-25-00084","url":null,"abstract":"<p><strong>Purpose: </strong>To evaluate the appropriateness of a cure model when analyzing right-censored end points of a randomized clinical trial (RCT) in malignancy in the presence of long-term survivors. We aim to derive how the ratio estimation of censored cured subjects (RECeUS), previously proposed for a homogeneous population, could be extended for use in RCTs.</p><p><strong>Methods: </strong>Based on the RECeUS method, four decision rules were considered to assess the appropriateness of a cure model. They considered the eligibility conditions to be met: in both arms, in at least one randomized arm, in the entire sample, or when only considering an average of the conditions, respectively. A simulation study was performed to evaluate their performance and the impact of the link function when considering the appropriateness of cure models. We also illustrate the method using two real data examples from two RCTs conducted in patients with acute leukemia and COVID-19 disease.</p><p><strong>Results: </strong>Simulation results show that the best decision rule that can be applied in all considered treatment effect scenarios might be to check the criteria in at least one randomized arm. Regardless of the rules, the cure model appeared to be appropriate in both RCT data.</p><p><strong>Conclusion: </strong>When analyzing survival data from RCTs, the appropriateness of a cure model could be considered in the face of a plateau shape of the survival curves. To ensure that the presence of such a plateau in the survival curves is a reliable indicator of the presence of cured patients in the population, the RECeUS method should be used in each randomized arm separately, with criteria met in at least one randomized arm.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500084"},"PeriodicalIF":2.8,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145764401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward Clinical Readiness: Critical Reflections on PATHOMIQ_PRAD and Artificial Intelligence Histologic Classifiers in Prostate Cancer. 迈向临床准备:对前列腺癌病理分级和人工智能组织学分类的批判性思考。
IF 2.8 Q2 ONCOLOGY Pub Date : 2025-12-01 Epub Date: 2025-11-26 DOI: 10.1200/CCI-25-00227
Schawanya Kaewpitoon Rattanapitoon, Thirayu Meererksom, Nav La, Nathkapach Kaewpitoon Rattanapitoon
{"title":"Toward Clinical Readiness: Critical Reflections on PATHOMIQ_PRAD and Artificial Intelligence Histologic Classifiers in Prostate Cancer.","authors":"Schawanya Kaewpitoon Rattanapitoon, Thirayu Meererksom, Nav La, Nathkapach Kaewpitoon Rattanapitoon","doi":"10.1200/CCI-25-00227","DOIUrl":"https://doi.org/10.1200/CCI-25-00227","url":null,"abstract":"","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500227"},"PeriodicalIF":2.8,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145642549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JCO Clinical Cancer Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1