首页 > 最新文献

JCO Clinical Cancer Informatics最新文献

英文 中文
Exploring the Past and Current Landscape of Biomarker-Driven Clinical Trials Through Large Language Models. 通过大型语言模型探索生物标志物驱动的临床试验的过去和现在的景观。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-02-01 Epub Date: 2026-02-03 DOI: 10.1200/CCI-25-00028
Margaret Guo, Evan Passalacqua, Erik Bao, Brenda Miao, Atul Butte, Travis Zack

Purpose: Biomarkers, or specific somatic alterations, are increasingly required for clinical trial eligibility. Finding and enrolling patients with these biomarkers is essential not only for continuous progress in the treatment of disease but also for democratizing clinical trial participation. Here, we use data from the National Cancer Institute Clinical Trials Reporting Program (NCI CTRP), combined with large language model applications, to survey the current landscape of cancer clinical trials.

Methods: We extracted 20,894 trials from Cancer.gov from the application programming interface (API) of the NCI CTRP. We quantified biomarker rates in cancer subtypes, described the geographic distribution of trial sites, and identified failure causes for these trials. Finally, we built an application from this API to match patients with clinical trials.

Results: We showed that 5,044 of the 20,894 interventional clinical trials contained biomarker eligibility data and trials tended to cluster around large academic centers and cities. We identified 630 biomarkers in 36 cancer subtypes and show that most biomarkers are used as eligibility criteria for multiple cancer subtypes. We highlight that the difficulties with accrual and sponsorship were the most common reason for discontinuing clinical trials. Finally, we demonstrate a novel method to automatically match natural language queries with eligible clinical trials, NCI Clinical Trials Navigator.

Conclusion: A survey of our clinical genomics showed that many individuals likely have mutations that would make them eligible for biomarker-driven trials. We used the NCI Clinical Trials database to show that the distribution of biomarker trials across the United States limits access for many patients and likely leads to the frequent trial termination because of inadequate accrual. Finally, we built an automated publicly available tool that can improve patient-to-trial biomarker-based matching.

目的:生物标志物,或特定的体细胞改变,越来越需要临床试验资格。寻找和招募具有这些生物标志物的患者不仅对疾病治疗的持续进展至关重要,而且对临床试验参与的民主化也至关重要。在这里,我们使用来自国家癌症研究所临床试验报告计划(NCI CTRP)的数据,结合大型语言模型应用程序,来调查癌症临床试验的现状。方法:我们从NCI CTRP的应用程序编程接口(API)中提取Cancer.gov上的20,894项试验。我们量化了癌症亚型的生物标志物率,描述了试验地点的地理分布,并确定了这些试验失败的原因。最后,我们从这个API构建了一个应用程序,将患者与临床试验相匹配。结果:我们发现20,894项介入临床试验中有5,044项包含生物标志物合格性数据,并且试验倾向于集中在大型学术中心和城市周围。我们在36种癌症亚型中鉴定了630种生物标志物,并表明大多数生物标志物可作为多种癌症亚型的合格标准。我们强调,应计和赞助的困难是终止临床试验的最常见原因。最后,我们展示了一种新的方法来自动匹配自然语言查询与符合条件的临床试验,NCI临床试验导航。结论:我们的临床基因组学调查显示,许多个体可能有突变,这将使他们有资格进行生物标志物驱动的试验。我们使用NCI临床试验数据库显示,生物标志物试验在美国的分布限制了许多患者的可及性,并可能由于累积不足而导致频繁的试验终止。最后,我们建立了一个自动化的公开工具,可以改善患者对试验生物标志物的匹配。
{"title":"Exploring the Past and Current Landscape of Biomarker-Driven Clinical Trials Through Large Language Models.","authors":"Margaret Guo, Evan Passalacqua, Erik Bao, Brenda Miao, Atul Butte, Travis Zack","doi":"10.1200/CCI-25-00028","DOIUrl":"10.1200/CCI-25-00028","url":null,"abstract":"<p><strong>Purpose: </strong>Biomarkers, or specific somatic alterations, are increasingly required for clinical trial eligibility. Finding and enrolling patients with these biomarkers is essential not only for continuous progress in the treatment of disease but also for democratizing clinical trial participation. Here, we use data from the National Cancer Institute Clinical Trials Reporting Program (NCI CTRP), combined with large language model applications, to survey the current landscape of cancer clinical trials.</p><p><strong>Methods: </strong>We extracted 20,894 trials from Cancer.gov from the application programming interface (API) of the NCI CTRP. We quantified biomarker rates in cancer subtypes, described the geographic distribution of trial sites, and identified failure causes for these trials. Finally, we built an application from this API to match patients with clinical trials.</p><p><strong>Results: </strong>We showed that 5,044 of the 20,894 interventional clinical trials contained biomarker eligibility data and trials tended to cluster around large academic centers and cities. We identified 630 biomarkers in 36 cancer subtypes and show that most biomarkers are used as eligibility criteria for multiple cancer subtypes. We highlight that the difficulties with accrual and sponsorship were the most common reason for discontinuing clinical trials. Finally, we demonstrate a novel method to automatically match natural language queries with eligible clinical trials, NCI Clinical Trials Navigator.</p><p><strong>Conclusion: </strong>A survey of our clinical genomics showed that many individuals likely have mutations that would make them eligible for biomarker-driven trials. We used the NCI Clinical Trials database to show that the distribution of biomarker trials across the United States limits access for many patients and likely leads to the frequent trial termination because of inadequate accrual. Finally, we built an automated publicly available tool that can improve patient-to-trial biomarker-based matching.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500028"},"PeriodicalIF":2.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12871862/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146114969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Health Care Worker Perspectives After New Electronic Health Record Implementation in an Oncology Ambulatory Clinic: Qualitative and Quality-Improvement Insights. 在肿瘤门诊实施新的电子健康记录后的卫生保健工作者的观点:定性和质量改进的见解。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-02-01 Epub Date: 2026-02-05 DOI: 10.1200/CCI-25-00322
Luxiga Thanabalachandran, Khaled Zaza, Renee Hartzell, Kimberley Miller, Geneviève C Digby, Taylor Moffat, Melinda Mushonga, Kristin Wright, Melanie Powis, John Drover, Siddhartha Srivastava, Monika K Krzyzanowska, Yuchen Li

Purpose: Electronic health record (EHR) systems aim to improve efficiency, care coordination, and patient safety, yet implementation often introduces workflow challenges and staff burden. In 2024, the Cancer Centre of Southeastern Ontario (CCSEO), a regional academic cancer center in Canada, transitioned from a hybrid paper-electronic system to a fully integrated regional EHR. Although hospital EHR adoption has been studied, limited research has examined its impact within ambulatory oncology care, particularly among nonphysician staff, or how institutions responded to the findings. Our study explored oncology healthcare worker perspectives on EHR implementation at CCSEO and identified resulting quality-improvement (QI) initiatives.

Methods: Using purposeful maximum variation sampling, we recruited clinical, administrative, and research staff. Semistructured interviews explored workflow efficiency, documentation burden, staff wellness, patient safety, communication, and training. Data were audio-recorded, transcribed, and analyzed thematically using MAXQDA.

Results: Nineteen interviews were conducted until thematic saturation. Three major themes emerged. (1) Efficiency and workflow: Staff valued consolidated records and regional connectivity but reported navigation complexity, time burden, duplicate orders, reliance on multiple programs, and frequent workarounds. (2) Staff and patient wellness: Staff noted limited training, increased workload, cognitive overload, and reliance on peer support contributed to burnout. (3) Patient safety: Identified risks included order and medication errors, communication breakdowns, poor system visualization, imaging delays, and wristband or labeling issues. Several QI initiatives were implemented in response, including education and navigation rounds, formation of working groups, and integration of artificial intelligence.

Conclusion: EHR implementation introduced both benefits and challenges in oncology workflows. Findings informed multidisciplinary QI initiatives targeting role-specific training, workflow optimization, and safety, offering a framework for other cancer centers transitioning to new EHR systems.

目的:电子健康记录(EHR)系统旨在提高效率、护理协调和患者安全,但实施通常会带来工作流程挑战和工作人员负担。2024年,安大略省东南部癌症中心(CCSEO),加拿大的一个区域性学术癌症中心,从混合纸张电子系统过渡到完全集成的区域性电子健康档案。虽然已经对医院电子病历的采用进行了研究,但有限的研究已经检查了它对门诊肿瘤护理的影响,特别是对非医生工作人员的影响,或者机构如何回应研究结果。我们的研究探讨了肿瘤医护人员对CCSEO实施电子病历的看法,并确定了由此产生的质量改进(QI)举措。方法:采用有目的的最大变异抽样,我们招募了临床、行政和研究人员。半结构化访谈探讨了工作流程效率、文档负担、员工健康、患者安全、沟通和培训。使用MAXQDA对数据进行录音、转录和主题分析。结果:进行了19次访谈,直到主题饱和。出现了三个主要主题。(1)效率和工作流程:员工重视统一记录和区域连通性,但报告导航复杂性、时间负担、重复订单、依赖多个程序以及频繁的解决方案。(2)员工和患者健康:员工指出,培训有限、工作量增加、认知超载以及对同伴支持的依赖是导致倦怠的原因。(3)患者安全:已确定的风险包括医嘱和用药错误、沟通中断、系统可视化不良、成像延迟以及腕带或标签问题。作为回应,实施了几项全民智能倡议,包括教育和导航轮、工作组的组建以及人工智能的整合。结论:电子健康档案的实施给肿瘤学工作流程带来了好处和挑战。研究结果为针对特定角色培训、工作流程优化和安全性的多学科QI倡议提供了信息,为其他癌症中心过渡到新的电子健康档案系统提供了框架。
{"title":"Health Care Worker Perspectives After New Electronic Health Record Implementation in an Oncology Ambulatory Clinic: Qualitative and Quality-Improvement Insights.","authors":"Luxiga Thanabalachandran, Khaled Zaza, Renee Hartzell, Kimberley Miller, Geneviève C Digby, Taylor Moffat, Melinda Mushonga, Kristin Wright, Melanie Powis, John Drover, Siddhartha Srivastava, Monika K Krzyzanowska, Yuchen Li","doi":"10.1200/CCI-25-00322","DOIUrl":"https://doi.org/10.1200/CCI-25-00322","url":null,"abstract":"<p><strong>Purpose: </strong>Electronic health record (EHR) systems aim to improve efficiency, care coordination, and patient safety, yet implementation often introduces workflow challenges and staff burden. In 2024, the Cancer Centre of Southeastern Ontario (CCSEO), a regional academic cancer center in Canada, transitioned from a hybrid paper-electronic system to a fully integrated regional EHR. Although hospital EHR adoption has been studied, limited research has examined its impact within ambulatory oncology care, particularly among nonphysician staff, or how institutions responded to the findings. Our study explored oncology healthcare worker perspectives on EHR implementation at CCSEO and identified resulting quality-improvement (QI) initiatives.</p><p><strong>Methods: </strong>Using purposeful maximum variation sampling, we recruited clinical, administrative, and research staff. Semistructured interviews explored workflow efficiency, documentation burden, staff wellness, patient safety, communication, and training. Data were audio-recorded, transcribed, and analyzed thematically using MAXQDA.</p><p><strong>Results: </strong>Nineteen interviews were conducted until thematic saturation. Three major themes emerged. (1) Efficiency and workflow: Staff valued consolidated records and regional connectivity but reported navigation complexity, time burden, duplicate orders, reliance on multiple programs, and frequent workarounds. (2) Staff and patient wellness: Staff noted limited training, increased workload, cognitive overload, and reliance on peer support contributed to burnout. (3) Patient safety: Identified risks included order and medication errors, communication breakdowns, poor system visualization, imaging delays, and wristband or labeling issues. Several QI initiatives were implemented in response, including education and navigation rounds, formation of working groups, and integration of artificial intelligence.</p><p><strong>Conclusion: </strong>EHR implementation introduced both benefits and challenges in oncology workflows. Findings informed multidisciplinary QI initiatives targeting role-specific training, workflow optimization, and safety, offering a framework for other cancer centers transitioning to new EHR systems.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500322"},"PeriodicalIF":2.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146127255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SNF-CLIMEDIN: A Randomized Trial of Digital Support and Intervention in Patients With Advanced Non-Small Cell Lung Cancer. A Hellenic Cooperative Oncology Group Study. SNF-CLIMEDIN:一项晚期非小细胞肺癌患者数字支持和干预的随机试验希腊合作肿瘤小组研究。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-02-01 Epub Date: 2026-02-03 DOI: 10.1200/CCI-25-00234
Paris A Kosmidis, Thanos Kosmidis, Kyriaki Papadopoulou, Nikolaos Korfiatis, Athanasios Vozikis, Sofia Lampaki, Panagiota Economopoulou, Elena Fountzilas, Athina Christopoulou, Epaminondas Samantas, Anastasios Vagionas, Giannis Socrates Mountzios, Georgios Goumas, Nikolaos Tsoukalas, Ilias Athanasiadis, Dimitris Bafaloukos, Chris Panopoulos, Margarita Ioanna Koufaki, George Fountzilas, Georgios Petrakis, Helena Linardou

Purpose: This trial aims to investigate the effectiveness of online digital intervention in patients with non-small cell lung cancer (NSCLC) in terms of adverse events (AEs) and quality of life (QoL).

Methods: This randomized trial recruited 200 patients with advanced NSCLC (March 2022-October 2023). All patients received standard-of-care precise treatment, predominantly immunochemotherapy. The study was designed to assess AEs and QoL improvement. Through the CareAcross online platform, all patients received information about their disease and treatment and reported any of the 22 predefined AEs at any time. Patients were randomly assigned 1:1 in the intervention (A) and control (B) arm; patients in arm A automatically received, additionally, evidence-based guidance for the reported AEs. EuroQol 5-dimension 5-level responses were collected at baseline and at each treatment cycle. Resulting scores were compared between baseline and after the sixth cycle. In addition, patient case-level hospitalization data were collected and costs were estimated based on reimbursed costs as defined by the Ministry of Health, enabling a post hoc analysis.

Results: Clinical characteristics were well-balanced. More AEs were reported by patients online versus to their clinicians (P < .01). Among the 22 AEs, 17 improved more in arm A, with the improvement in rash and stomatitis being statistically significant. In QoL, there was no improvement in any of the five EuroQol 5-Dimension dimensions. Digital intervention was cost-saving with lower mean costs for hospitalization (P < .001). Overall response rate, progression-free survival, and overall survival were not statistically different between the two arms, ensuring comparable clinical outcome.

Conclusion: Digital oncology tends to improve selected AEs and is cost saving. Patients report, digitally, more informative AEs. Digital oncology can be a complementary tool to the oncology team and warrants further exploration.

目的:本试验旨在探讨在线数字干预在非小细胞肺癌(NSCLC)患者不良事件(ae)和生活质量(QoL)方面的有效性。方法:该随机试验招募了200例晚期NSCLC患者(2022年3月至2023年10月)。所有患者都接受了标准的精确治疗,主要是免疫化疗。该研究旨在评估ae和QoL的改善。通过CareAcross在线平台,所有患者都可以获得有关其疾病和治疗的信息,并随时报告22个预定义ae中的任何一个。患者按1:1随机分配到干预组(A)和对照组(B);另外,A组的患者自动接受报告ae的循证指导。在基线和每个治疗周期收集EuroQol 5维5级反应。结果得分在基线和第六个周期后进行比较。此外,还收集了患者病例级住院数据,并根据卫生部确定的报销费用估算了费用,以便进行事后分析。结果:临床特征平衡良好。患者在线报告的不良事件多于向临床医生报告的不良事件(P < 0.01)。在22例ae中,A组有17例改善较多,其中皮疹和口炎的改善有统计学意义。在生活质量方面,5个EuroQol 5-Dimension维度中的任何一个都没有改善。数字干预节约成本,平均住院费用较低(P < 0.001)。两组的总有效率、无进展生存期和总生存期无统计学差异,确保了可比较的临床结果。结论:数字肿瘤学倾向于改善选定的ae,节省成本。患者报告,数字化,更翔实的ae。数字肿瘤学可以成为肿瘤学团队的补充工具,值得进一步探索。
{"title":"SNF-CLIMEDIN: A Randomized Trial of Digital Support and Intervention in Patients With Advanced Non-Small Cell Lung Cancer. A Hellenic Cooperative Oncology Group Study.","authors":"Paris A Kosmidis, Thanos Kosmidis, Kyriaki Papadopoulou, Nikolaos Korfiatis, Athanasios Vozikis, Sofia Lampaki, Panagiota Economopoulou, Elena Fountzilas, Athina Christopoulou, Epaminondas Samantas, Anastasios Vagionas, Giannis Socrates Mountzios, Georgios Goumas, Nikolaos Tsoukalas, Ilias Athanasiadis, Dimitris Bafaloukos, Chris Panopoulos, Margarita Ioanna Koufaki, George Fountzilas, Georgios Petrakis, Helena Linardou","doi":"10.1200/CCI-25-00234","DOIUrl":"https://doi.org/10.1200/CCI-25-00234","url":null,"abstract":"<p><strong>Purpose: </strong>This trial aims to investigate the effectiveness of online digital intervention in patients with non-small cell lung cancer (NSCLC) in terms of adverse events (AEs) and quality of life (QoL).</p><p><strong>Methods: </strong>This randomized trial recruited 200 patients with advanced NSCLC (March 2022-October 2023). All patients received standard-of-care precise treatment, predominantly immunochemotherapy. The study was designed to assess AEs and QoL improvement. Through the CareAcross online platform, all patients received information about their disease and treatment and reported any of the 22 predefined AEs at any time. Patients were randomly assigned 1:1 in the intervention (A) and control (B) arm; patients in arm A automatically received, additionally, evidence-based guidance for the reported AEs. EuroQol 5-dimension 5-level responses were collected at baseline and at each treatment cycle. Resulting scores were compared between baseline and after the sixth cycle. In addition, patient case-level hospitalization data were collected and costs were estimated based on reimbursed costs as defined by the Ministry of Health, enabling a post hoc analysis.</p><p><strong>Results: </strong>Clinical characteristics were well-balanced. More AEs were reported by patients online versus to their clinicians (<i>P</i> < .01). Among the 22 AEs, 17 improved more in arm A, with the improvement in rash and stomatitis being statistically significant. In QoL, there was no improvement in any of the five EuroQol 5-Dimension dimensions. Digital intervention was cost-saving with lower mean costs for hospitalization (<i>P</i> < .001). Overall response rate, progression-free survival, and overall survival were not statistically different between the two arms, ensuring comparable clinical outcome.</p><p><strong>Conclusion: </strong>Digital oncology tends to improve selected AEs and is cost saving. Patients report, digitally, more informative AEs. Digital oncology can be a complementary tool to the oncology team and warrants further exploration.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500234"},"PeriodicalIF":2.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146114999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Validation of Claims-Based Algorithms to Classify Thoracic Radiation Therapy Courses. 基于索赔的胸椎放射治疗过程分类算法的验证。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-02-01 Epub Date: 2026-01-16 DOI: 10.1200/CCI-25-00266
Shane S Neibart, Nicholas Lin, Jacob Hogan, Shalini Moningi, Benjamin H Kann, Raymond H Mak, Miranda Lam

Purpose: Routinely collected administrative data provide insights into health care utilization and outcomes but lack detailed clinical information, such as the specific site and intent of radiation therapy (RT). This study aimed to validate claims-based algorithms to accurately identify thoracic RT (TRT) and curative-intent RT in administrative databases.

Methods: Patients at our institution with lung cancer and any RT Current Procedural Terminology (CPT) code from October 2015 to January 2024 were analyzed. RT claims were organized by treatment episode, and RT details were manually abstracted from the electronic health record to classify episodes as TRT or non-TRT and curative or noncurative. A priori algorithms were defined as the presence of respiratory motion management codes, >14 treatment codes (except for stereotactic body RT [SBRT] courses), with or without exclusive thoracic malignancy diagnosis codes. Positive predictive value (PPV) was computed for each episode, stratified by modality (three-dimensional conformal RT [3DCRT], intensity-modulated RT [IMRT], and SBRT). Algorithms were considered acceptable if the lower bound of the Clopper-Pearson 95% CI for PPV exceeded 70%.

Results: A total of 3,846 RT episodes were analyzed. The primary a priori TRT algorithm achieved a PPV of 97% (95% CI, 96 to 98) for IMRT, 99% (95% CI, 97 to 99) for SBRT, and 87% (95% CI, 81 to 92) for 3DCRT. Performance declined when exclusive thoracic malignancy diagnosis codes were excluded. For curative-intent RT, PPVs were 87% for IMRT, 90% for SBRT, and 55% for 3DCRT.

Conclusion: Clinically informed algorithms can accurately identify TRT in claims data, achieving high PPVs particularly for IMRT and SBRT courses. These algorithms can be applied in claims databases to assess RT toxicity and effectiveness. External validation across diverse data sets will be important to confirm generalizability.

目的:常规收集的管理数据提供了对医疗保健利用和结果的见解,但缺乏详细的临床信息,如放射治疗(RT)的具体部位和意图。本研究旨在验证基于索赔的算法,以准确识别管理数据库中的胸部RT (TRT)和治疗目的RT。方法:对我院2015年10月至2024年1月收治的肺癌及所有RT现行程序术语(CPT)编码患者进行分析。根据治疗事件组织RT声明,并从电子健康记录中手动提取RT详细信息,将发作分为TRT或非TRT,治愈或不可治愈。先验算法被定义为存在呼吸运动管理代码,bbbb14治疗代码(立体定向体RT [SBRT]疗程除外),具有或不具有排他胸部恶性肿瘤诊断代码。计算每次发作的阳性预测值(PPV),并按模式(三维适形放疗[3DCRT]、调强放疗[IMRT]和SBRT)分层。如果PPV的Clopper-Pearson 95% CI下界超过70%,则认为算法是可接受的。结果:共分析了3846例RT发作。初级先验TRT算法对于IMRT的PPV为97% (95% CI, 96 ~ 98),对于SBRT的PPV为99% (95% CI, 97 ~ 99),对于3DCRT的PPV为87% (95% CI, 81 ~ 92)。排除胸腔恶性肿瘤诊断代码后,诊断效果下降。对于治疗目的RT, IMRT的ppv为87%,SBRT为90%,3DCRT为55%。结论:临床知情算法可以准确识别索赔数据中的TRT,特别是在IMRT和SBRT疗程中实现高ppv。这些算法可以应用于索赔数据库,以评估RT的毒性和有效性。跨不同数据集的外部验证对于确认泛化性非常重要。
{"title":"Validation of Claims-Based Algorithms to Classify Thoracic Radiation Therapy Courses.","authors":"Shane S Neibart, Nicholas Lin, Jacob Hogan, Shalini Moningi, Benjamin H Kann, Raymond H Mak, Miranda Lam","doi":"10.1200/CCI-25-00266","DOIUrl":"https://doi.org/10.1200/CCI-25-00266","url":null,"abstract":"<p><strong>Purpose: </strong>Routinely collected administrative data provide insights into health care utilization and outcomes but lack detailed clinical information, such as the specific site and intent of radiation therapy (RT). This study aimed to validate claims-based algorithms to accurately identify thoracic RT (TRT) and curative-intent RT in administrative databases.</p><p><strong>Methods: </strong>Patients at our institution with lung cancer and any RT Current Procedural Terminology (CPT) code from October 2015 to January 2024 were analyzed. RT claims were organized by treatment episode, and RT details were manually abstracted from the electronic health record to classify episodes as TRT or non-TRT and curative or noncurative. A priori algorithms were defined as the presence of respiratory motion management codes, >14 treatment codes (except for stereotactic body RT [SBRT] courses), with or without exclusive thoracic malignancy diagnosis codes. Positive predictive value (PPV) was computed for each episode, stratified by modality (three-dimensional conformal RT [3DCRT], intensity-modulated RT [IMRT], and SBRT). Algorithms were considered acceptable if the lower bound of the Clopper-Pearson 95% CI for PPV exceeded 70%.</p><p><strong>Results: </strong>A total of 3,846 RT episodes were analyzed. The primary a priori TRT algorithm achieved a PPV of 97% (95% CI, 96 to 98) for IMRT, 99% (95% CI, 97 to 99) for SBRT, and 87% (95% CI, 81 to 92) for 3DCRT. Performance declined when exclusive thoracic malignancy diagnosis codes were excluded. For curative-intent RT, PPVs were 87% for IMRT, 90% for SBRT, and 55% for 3DCRT.</p><p><strong>Conclusion: </strong>Clinically informed algorithms can accurately identify TRT in claims data, achieving high PPVs particularly for IMRT and SBRT courses. These algorithms can be applied in claims databases to assess RT toxicity and effectiveness. External validation across diverse data sets will be important to confirm generalizability.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500266"},"PeriodicalIF":2.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146114914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-Supervised Transformer-Based Pipeline for Liver Tumor Segmentation and Type Classification. 基于自监督变压器的肝脏肿瘤分割与类型分类。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-02-01 Epub Date: 2026-01-30 DOI: 10.1200/CCI-25-00135
Ramtin Mojtahedi, Mohammad Hamghalam, Jacob J Peoples, William R Jarnagin, Richard K G Do, Amber L Simpson

Purpose: It is essential to detect and segment liver tumors to guide treatment and track disease progression. To reduce the need for large annotated data sets, we present an end-to-end pipeline that uses self-supervised pretraining to improve segmentation and then classifies tumor types with a separate pretrained classifier applied to the segmented tumor regions.

Methods: First, we pretrained the encoder of a transformer-based network using a self-supervised approach on unlabeled abdominal computed tomography images. Subsequently, we fine-tuned the segmentation network to segment the liver and tumors, and the tumor regions were classified using a pretrained convolutional neural network (Inception-v3 architecture) as intrahepatic cholangiocarcinoma (ICC), hepatocellular carcinoma (HCC), or colorectal liver metastases (CRLMs). We evaluated 459 images (155 HCC, 107 ICC, 197 CRLM). For external testing, we used an independent public data set (n = 40).

Results: Averaged across HCC, ICC, and CRLM, in comparison with a supervised baseline (no pretraining), self-supervised pretraining improved the liver Dice similarity coefficient (DSC) by 6.4 percentage points and reduced the 95th-percentile Hausdorff distance (HD95) by 32.97 mm. For tumors, the DSC increased by 6.0 percentage points and the HD95 decreased by 3.2 mm. Tumor type classification achieved AUC 0.98 (95% CI, 0.96 to 1.00) and accuracy 96% (95% CI, 92% to 99%). Segmentation performance on the external data was close to the internal cohort with tumor DSC 0.73, intersection over union (IoU) 0.60, and HD95 30.98 mm and liver DSC 0.91, IoU 0.83, and HD95 29.67 mm.

Conclusion: The proposed self-supervised, end-to-end pipeline improves liver tumor segmentation and provides accurate tumor type classification, supporting reliable radiologic assessment, treatment planning, and improved prognostication for patients with liver cancer.

目的:肝肿瘤的检测和分割对指导治疗和跟踪疾病进展至关重要。为了减少对大型注释数据集的需求,我们提出了一个端到端管道,该管道使用自监督预训练来改进分割,然后使用一个单独的预训练分类器对分割的肿瘤区域进行肿瘤类型分类。方法:首先,我们使用自监督方法对基于变压器的网络的编码器进行预训练,该网络使用未标记的腹部计算机断层扫描图像。随后,我们对分割网络进行了微调,以分割肝脏和肿瘤,并使用预训练的卷积神经网络(Inception-v3架构)将肿瘤区域分类为肝内胆管癌(ICC)、肝细胞癌(HCC)或结直肠癌肝转移瘤(crlm)。我们评估了459张图像(HCC 155张,ICC 107张,CRLM 197张)。对于外部测试,我们使用独立的公共数据集(n = 40)。结果:在HCC、ICC和CRLM中,与监督基线(无预训练)相比,自我监督预训练将肝脏Dice相似系数(DSC)提高了6.4个百分点,将第95百分位Hausdorff距离(HD95)降低了32.97 mm。对于肿瘤,DSC增加了6.0个百分点,HD95下降了3.2毫米。肿瘤类型分类达到AUC 0.98 (95% CI, 0.96 ~ 1.00),准确率96% (95% CI, 92% ~ 99%)。外部数据的分割性能接近内部队列,肿瘤DSC为0.73,IoU为0.60,HD95为30.98 mm,肝脏DSC为0.91,IoU为0.83,HD95为29.67 mm。结论:提出的自我监督的端到端管道改善了肝脏肿瘤分割,提供了准确的肿瘤类型分类,支持可靠的放射评估,治疗计划,改善了肝癌患者的预后。
{"title":"Self-Supervised Transformer-Based Pipeline for Liver Tumor Segmentation and Type Classification.","authors":"Ramtin Mojtahedi, Mohammad Hamghalam, Jacob J Peoples, William R Jarnagin, Richard K G Do, Amber L Simpson","doi":"10.1200/CCI-25-00135","DOIUrl":"10.1200/CCI-25-00135","url":null,"abstract":"<p><strong>Purpose: </strong>It is essential to detect and segment liver tumors to guide treatment and track disease progression. To reduce the need for large annotated data sets, we present an end-to-end pipeline that uses self-supervised pretraining to improve segmentation and then classifies tumor types with a separate pretrained classifier applied to the segmented tumor regions.</p><p><strong>Methods: </strong>First, we pretrained the encoder of a transformer-based network using a self-supervised approach on unlabeled abdominal computed tomography images. Subsequently, we fine-tuned the segmentation network to segment the liver and tumors, and the tumor regions were classified using a pretrained convolutional neural network (Inception-v3 architecture) as intrahepatic cholangiocarcinoma (ICC), hepatocellular carcinoma (HCC), or colorectal liver metastases (CRLMs). We evaluated 459 images (155 HCC, 107 ICC, 197 CRLM). For external testing, we used an independent public data set (n = 40).</p><p><strong>Results: </strong>Averaged across HCC, ICC, and CRLM, in comparison with a supervised baseline (no pretraining), self-supervised pretraining improved the liver Dice similarity coefficient (DSC) by 6.4 percentage points and reduced the 95th-percentile Hausdorff distance (HD<sub>95</sub>) by 32.97 mm. For tumors, the DSC increased by 6.0 percentage points and the HD<sub>95</sub> decreased by 3.2 mm. Tumor type classification achieved AUC 0.98 (95% CI, 0.96 to 1.00) and accuracy 96% (95% CI, 92% to 99%). Segmentation performance on the external data was close to the internal cohort with tumor DSC 0.73, intersection over union (IoU) 0.60, and HD<sub>95</sub> 30.98 mm and liver DSC 0.91, IoU 0.83, and HD<sub>95</sub> 29.67 mm.</p><p><strong>Conclusion: </strong>The proposed self-supervised, end-to-end pipeline improves liver tumor segmentation and provides accurate tumor type classification, supporting reliable radiologic assessment, treatment planning, and improved prognostication for patients with liver cancer.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500135"},"PeriodicalIF":2.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12866948/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146094763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of the Population Health Institute Model of Health for Identifying Cancer Catchment Area Priorities. 人口健康研究所健康模型在确定癌症集中地区优先事项中的应用。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-02-01 Epub Date: 2026-02-05 DOI: 10.1200/CCI-25-00126
Amy Trentham-Dietz, Thomas P Lawler, Ronald E Gangnon, Allison R Dahlke, Noelle K LoConte, Earlise C Ward, Christine P Muganda, Shaneda Warren Andersen, Marjory L Givens

Purpose: The University of Wisconsin Population Health Institute (PHI) Model of Health, grounded in models developed over a decade ago, provides a framework for prioritizing health-related investments including setting agendas, implementing policies, and sharing resources for improving community health and health equity. The model includes multiple determinants of health and two broad health outcomes (length and quality of life). We adapted the PHI Model of Health to cancer outcomes.

Methods: Using county-level publicly available data, health factor summary measures were derived in three areas: health infrastructure including health promotion and clinical care, physical environment, and social and economic factors. A composite health factor z-score was calculated as the weighted (40%, 15%, and 45%, respectively) average of the summary measures for each county, and k-means clustering was used to create unequally sized county groups with lower (healthier) to higher (less healthy) z-scores. We fit age-adjusted negative binomial regression models to estimate rate ratios and 95% CI for cancer mortality in relation to county health factor cluster.

Results: Age-adjusted cancer mortality rates increased across the 10 county health factor clusters for all-cancers as well as for lung, colorectal, breast, and prostate cancers. Rate ratios generally increased across the 10 health factor clusters for all cancers combined and for specific cancer types. Compared with counties with the most favorable health factor conditions, the counties with the least favorable conditions had an all-cancer mortality rate ratio of 1.49 (95% CI, 1.39 to 1.60).

Conclusion: The PHI model of health adapted to cancer outcomes provides an approach for linking community-specific conditions to the interventions that hold promise to directly address drivers of the cancer burden.

目的:威斯康星大学人口健康研究所(PHI)健康模型以十多年前开发的模型为基础,为确定健康相关投资的优先次序提供了一个框架,包括制定议程、实施政策和共享资源,以改善社区健康和健康公平。该模型包括多种健康决定因素和两种广泛的健康结果(寿命和生活质量)。我们将PHI健康模型应用于癌症结果。方法:利用县级公开数据,从卫生基础设施(包括健康促进和临床护理)、自然环境和社会经济因素三个方面得出健康因素汇总测度。综合健康因子z-得分计算为每个县的加权(分别为40%、15%和45%)平均值,k-均值聚类用于创建大小不等的县组,其z-得分较低(较健康)到较高(较不健康)。我们拟合年龄调整后的负二项回归模型来估计与县健康因素集群相关的癌症死亡率的比率和95% CI。结果:在10个县的所有癌症以及肺癌、结直肠癌、乳腺癌和前列腺癌的健康因素集群中,年龄调整后的癌症死亡率都有所增加。所有癌症和特定癌症类型的10个健康因素群的比率普遍增加。与健康因素条件最有利的县相比,条件最不利的县的所有癌症死亡率比为1.49 (95% CI, 1.39 ~ 1.60)。结论:适应癌症结果的PHI健康模型提供了一种将社区特定条件与有望直接解决癌症负担驱动因素的干预措施联系起来的方法。
{"title":"Application of the Population Health Institute Model of Health for Identifying Cancer Catchment Area Priorities.","authors":"Amy Trentham-Dietz, Thomas P Lawler, Ronald E Gangnon, Allison R Dahlke, Noelle K LoConte, Earlise C Ward, Christine P Muganda, Shaneda Warren Andersen, Marjory L Givens","doi":"10.1200/CCI-25-00126","DOIUrl":"https://doi.org/10.1200/CCI-25-00126","url":null,"abstract":"<p><strong>Purpose: </strong>The University of Wisconsin Population Health Institute (PHI) Model of Health, grounded in models developed over a decade ago, provides a framework for prioritizing health-related investments including setting agendas, implementing policies, and sharing resources for improving community health and health equity. The model includes multiple determinants of health and two broad health outcomes (length and quality of life). We adapted the PHI Model of Health to cancer outcomes.</p><p><strong>Methods: </strong>Using county-level publicly available data, health factor summary measures were derived in three areas: health infrastructure including health promotion and clinical care, physical environment, and social and economic factors. A composite health factor z-score was calculated as the weighted (40%, 15%, and 45%, respectively) average of the summary measures for each county, and k-means clustering was used to create unequally sized county groups with lower (healthier) to higher (less healthy) z-scores. We fit age-adjusted negative binomial regression models to estimate rate ratios and 95% CI for cancer mortality in relation to county health factor cluster.</p><p><strong>Results: </strong>Age-adjusted cancer mortality rates increased across the 10 county health factor clusters for all-cancers as well as for lung, colorectal, breast, and prostate cancers. Rate ratios generally increased across the 10 health factor clusters for all cancers combined and for specific cancer types. Compared with counties with the most favorable health factor conditions, the counties with the least favorable conditions had an all-cancer mortality rate ratio of 1.49 (95% CI, 1.39 to 1.60).</p><p><strong>Conclusion: </strong>The PHI model of health adapted to cancer outcomes provides an approach for linking community-specific conditions to the interventions that hold promise to directly address drivers of the cancer burden.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500126"},"PeriodicalIF":2.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146127245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the Readiness of Saudi Oncology Real-World Data for Standardization and Quality Enhancement. 评估沙特肿瘤学真实世界数据标准化和质量提高的准备情况。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-02-01 Epub Date: 2026-01-16 DOI: 10.1200/CCI-25-00248
Almaha Alfakhri, Ohoud Almadani, Ibrahim Asiri, Nada Alsuhebany, Ahmed Alanazi, Turki Althunian

Purpose: Real-world data (RWD) are increasingly used in oncology research, regulatory decisions, and clinical practice; however, variability in data quality and lack of standardization remain major limitations. This study assessed the readiness of oncology RWD from Saudi health care centers for standardization and evaluated their completeness and accuracy.

Methods: Deidentified electronic health records for adult patients (18 years and older) diagnosed with breast cancer, thyroid cancer, colorectal cancer, gastric cancer, hepatocellular carcinoma, or renal cell carcinoma were extracted from five health care centers within the Saudi Real-World Evidence Network. Readiness for standardization was evaluated by assessing alignment with data elements in the Minimal Common Oncology Data Elements (mCODE) framework, a standardized and clinically focused oncology data model. Data quality was evaluated using two dimensions: completeness, defined as the proportion of patients with at least one entered value for each element; and accuracy, defined as the proportion of correct entries based on verification checks (including plausibility and consistency). Outcomes were calculated at the element level and weighted to generate domain- and center-level proportions.

Results: A total of 20,671 oncology patients were included. Overall weighted alignment with mCODE domains was moderate (62.43%). The patient domain showed the highest alignment (71.43%), whereas the outcome domain exhibited significant gaps. Data completeness was low to moderate (49.02%), with higher levels in common cancers (54.33%) than in rare cancers (51.50%). Data accuracy was high overall (95.03%), with rare cancers showing higher accuracy (98.76%) than common cancers (94.62%).

Conclusion: Saudi oncology RWD show moderate alignment with mCODE, with consistently high accuracy across domains. However, gaps in data completeness highlight the need for broader adoption of standardized data frameworks to support interoperability and enable nationwide research and regulatory use.

目的:真实世界数据(RWD)越来越多地用于肿瘤研究、监管决策和临床实践;然而,数据质量的可变性和缺乏标准化仍然是主要的限制。本研究评估了沙特卫生保健中心肿瘤RWD的标准化准备情况,并评估了其完整性和准确性。方法:从沙特真实世界证据网络的五个卫生保健中心提取诊断为乳腺癌、甲状腺癌、结直肠癌、胃癌、肝细胞癌或肾细胞癌的成年患者(18岁及以上)的未识别电子健康记录。通过评估与最小通用肿瘤数据元素(mCODE)框架中数据元素的一致性来评估标准化的准备情况,mCODE框架是一种标准化的临床肿瘤数据模型。数据质量通过两个维度进行评估:完整性,定义为每个元素至少有一个输入值的患者比例;准确性,定义为基于验证检查(包括合理性和一致性)的正确条目的比例。结果在元素水平上计算,并加权以产生领域和中心水平的比例。结果:共纳入肿瘤患者20671例。与mCODE域的总体加权一致性中等(62.43%)。患者域显示出最高的一致性(71.43%),而结果域显示出显著的差距。数据完整性为中低(49.02%),常见癌症(54.33%)高于罕见癌症(51.50%)。总体而言,数据准确性较高(95.03%),其中罕见癌症的准确性(98.76%)高于常见癌症(94.62%)。结论:沙特肿瘤RWD显示出与mCODE的中等一致性,跨域具有一致的高准确性。然而,数据完整性方面的差距突出表明,需要更广泛地采用标准化数据框架,以支持互操作性,并使全国范围的研究和监管使用成为可能。
{"title":"Evaluating the Readiness of Saudi Oncology Real-World Data for Standardization and Quality Enhancement.","authors":"Almaha Alfakhri, Ohoud Almadani, Ibrahim Asiri, Nada Alsuhebany, Ahmed Alanazi, Turki Althunian","doi":"10.1200/CCI-25-00248","DOIUrl":"https://doi.org/10.1200/CCI-25-00248","url":null,"abstract":"<p><strong>Purpose: </strong>Real-world data (RWD) are increasingly used in oncology research, regulatory decisions, and clinical practice; however, variability in data quality and lack of standardization remain major limitations. This study assessed the readiness of oncology RWD from Saudi health care centers for standardization and evaluated their completeness and accuracy.</p><p><strong>Methods: </strong>Deidentified electronic health records for adult patients (18 years and older) diagnosed with breast cancer, thyroid cancer, colorectal cancer, gastric cancer, hepatocellular carcinoma, or renal cell carcinoma were extracted from five health care centers within the Saudi Real-World Evidence Network. Readiness for standardization was evaluated by assessing alignment with data elements in the Minimal Common Oncology Data Elements (mCODE) framework, a standardized and clinically focused oncology data model. Data quality was evaluated using two dimensions: completeness, defined as the proportion of patients with at least one entered value for each element; and accuracy, defined as the proportion of correct entries based on verification checks (including plausibility and consistency). Outcomes were calculated at the element level and weighted to generate domain- and center-level proportions.</p><p><strong>Results: </strong>A total of 20,671 oncology patients were included. Overall weighted alignment with mCODE domains was moderate (62.43%). The patient domain showed the highest alignment (71.43%), whereas the outcome domain exhibited significant gaps. Data completeness was low to moderate (49.02%), with higher levels in common cancers (54.33%) than in rare cancers (51.50%). Data accuracy was high overall (95.03%), with rare cancers showing higher accuracy (98.76%) than common cancers (94.62%).</p><p><strong>Conclusion: </strong>Saudi oncology RWD show moderate alignment with mCODE, with consistently high accuracy across domains. However, gaps in data completeness highlight the need for broader adoption of standardized data frameworks to support interoperability and enable nationwide research and regulatory use.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500248"},"PeriodicalIF":2.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146114921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Novel R Shiny Tool for Survival Analysis With Time-Varying Covariate in Oncology Studies: Overcoming Biases and Enhancing Collaboration. 肿瘤研究中时变协变量生存分析的新R闪亮工具:克服偏见和加强合作。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-02-01 Epub Date: 2026-01-30 DOI: 10.1200/CCI-25-00225
Yimei Li, Yang Qiao, Fei Gao, Jordan Gauthier, Qiang Ed Zhang, Jenna Voutsinas, Wendy Leisenring, Ted Gooley, Corinne Summers, Alexandre Hirayama, Cameron J Turtle, Rebecca Gardner, Jarcy Zee, Qian Vicky Wu

Purpose: Our study is motivated by evaluating the role of hematopoietic cell transplantation (HCT) after chimeric antigen receptor T-cell (CAR-T) therapy for ALL, a debated topic. Because patients may receive HCT at different times after CAR-T infusion or never, HCT post-CAR-T should be considered as a time-varying covariate (TVC).

Methods: Standard Cox models and Kaplan-Meier (KM) curves (naïve method) assume that TVC status is known and fixed at baseline, which can yield biased estimates. Landmark analysis is a popular alternative but depends on a chosen landmark time. Time-dependent (TD) Cox model is better suited for TVC although visualizing survival curves is complex. The newly proposed Smith-Zee method generates appropriate survival curves from TD Cox models.

Results: To address these challenges, we developed an open-source R Shiny tool integrating multiple models (naïve Cox, landmark Cox, and TD Cox) and curves (naïve KM, landmark KM, Smith-Zee, and Extended KM) to facilitate TVC analysis. Reanalysis of post-CAR-T HCT's effect on leukemia-free survival (LFS) showed consistent results between naïve and TD Cox models, whereas landmark analyses varied by landmark time. A separate data analysis of chronic graft-versus-host disease and survival showed that substantial differences emerged across statistical methods. Simulations revealed increased bias in naïve methods when TVC changed late and minimal bias when TVC changes occurred early relative to time to events.

Conclusion: We recommend TD Cox models and Smith-Zee curves for robust TVC analysis. Our R Shiny tool supports standardized analyses without requiring data sharing, thereby promoting collaboration across different institutions and providing a practical tool to advance survival analysis in oncology research.

目的:本研究的动机是评估嵌合抗原受体t细胞(CAR-T)治疗ALL后造血细胞移植(HCT)的作用,这是一个有争议的话题。由于患者在CAR-T输注后可能在不同时间接受HCT或从未接受过HCT,因此CAR-T后的HCT应被视为时变协变量(TVC)。方法:标准Cox模型和Kaplan-Meier (KM)曲线(naïve方法)假设TVC状态是已知的,并且在基线处是固定的,这可能产生有偏差的估计。里程碑分析是一种流行的替代方法,但取决于所选择的里程碑时间。虽然生存曲线的可视化比较复杂,但时间依赖(TD) Cox模型更适合TVC。新提出的Smith-Zee方法从TD Cox模型中生成合适的生存曲线。为了解决这些挑战,我们开发了一个开源R Shiny工具,集成了多个模型(naïve Cox, landmark Cox和TD Cox)和曲线(naïve KM, landmark KM, Smith-Zee和Extended KM),以促进TVC分析。重新分析car - t后HCT对无白血病生存(LFS)的影响,naïve和TD Cox模型之间的结果一致,而里程碑分析因里程碑时间而异。一项关于慢性移植物抗宿主病和生存率的独立数据分析显示,不同统计方法之间存在实质性差异。模拟显示,当TVC变化较晚时,naïve方法的偏差增加,而当TVC变化相对于事件发生的时间较早时,偏差最小。结论:我们推荐TD - Cox模型和Smith-Zee曲线进行稳健的TVC分析。我们的R Shiny工具支持标准化分析,而不需要数据共享,从而促进不同机构之间的合作,并为肿瘤研究中的生存分析提供实用工具。
{"title":"Novel R Shiny Tool for Survival Analysis With Time-Varying Covariate in Oncology Studies: Overcoming Biases and Enhancing Collaboration.","authors":"Yimei Li, Yang Qiao, Fei Gao, Jordan Gauthier, Qiang Ed Zhang, Jenna Voutsinas, Wendy Leisenring, Ted Gooley, Corinne Summers, Alexandre Hirayama, Cameron J Turtle, Rebecca Gardner, Jarcy Zee, Qian Vicky Wu","doi":"10.1200/CCI-25-00225","DOIUrl":"https://doi.org/10.1200/CCI-25-00225","url":null,"abstract":"<p><strong>Purpose: </strong>Our study is motivated by evaluating the role of hematopoietic cell transplantation (HCT) after chimeric antigen receptor T-cell (CAR-T) therapy for ALL, a debated topic. Because patients may receive HCT at different times after CAR-T infusion or never, HCT post-CAR-T should be considered as a time-varying covariate (TVC).</p><p><strong>Methods: </strong>Standard Cox models and Kaplan-Meier (KM) curves (naïve method) assume that TVC status is known and fixed at baseline, which can yield biased estimates. Landmark analysis is a popular alternative but depends on a chosen landmark time. Time-dependent (TD) Cox model is better suited for TVC although visualizing survival curves is complex. The newly proposed Smith-Zee method generates appropriate survival curves from TD Cox models.</p><p><strong>Results: </strong>To address these challenges, we developed an open-source R Shiny tool integrating multiple models (naïve Cox, landmark Cox, and TD Cox) and curves (naïve KM, landmark KM, Smith-Zee, and Extended KM) to facilitate TVC analysis. Reanalysis of post-CAR-T HCT's effect on leukemia-free survival (LFS) showed consistent results between naïve and TD Cox models, whereas landmark analyses varied by landmark time. A separate data analysis of chronic graft-versus-host disease and survival showed that substantial differences emerged across statistical methods. Simulations revealed increased bias in naïve methods when TVC changed late and minimal bias when TVC changes occurred early relative to time to events.</p><p><strong>Conclusion: </strong>We recommend TD Cox models and Smith-Zee curves for robust TVC analysis. Our R Shiny tool supports standardized analyses without requiring data sharing, thereby promoting collaboration across different institutions and providing a practical tool to advance survival analysis in oncology research.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500225"},"PeriodicalIF":2.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146094767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clinical Trial Patient Matching: A Real-Time, Common Data Model and Artificial Intelligence-Driven System for Semiautomated Patient Prescreening in Cancer Clinical Trials. 临床试验患者匹配:用于癌症临床试验半自动患者预筛选的实时、通用数据模型和人工智能驱动系统。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-09 DOI: 10.1200/CCI-25-00262
Guannan Gong, Jessica Liu, Sameer Pandya, Cristian Taborda, Nathalie Wiesendanger, Nate Price, Will Byron, Andreas Coppi, Patrick Young, Christina Wiess, Haley Dunning, Courtney Barganier, Rachel Brodeur, Neal Fischbach, Patricia LoRusso, Lajos Pusztai, So Yeon Kim, Mariya Rozenblit, Michael Cecchini, Anne Mongiu, Lourdes Mendez, Edward Kaftan, Charles Torre, Harlan Krumholz, Ian Krop, Wade Schulz, Maryam Lustberg, Pamela L Kunz

Purpose: Cancer clinical trial enrollment remains critically low at 5%-7% of adult patients despite exponential growth in available trials. Manual patient-trial matching represents a fundamental bottleneck, whereas current artificial intelligence (AI) and machine learning patient-trial matching systems lack data standardization and compatibility across health systems. We developed and validated a semiautomated clinical trial patient matching (CTPM) tool to improve recruitment efficiency and scalability.

Methods: We created a hybrid rules-based and natural language processing (NLP)-based pipeline that automatically screens patients using structured and unstructured electronic health record data standardized to the Observational Medical Outcomes Partnership (OMOP) common data model. CTPM performance was first evaluated on one metastatic colorectal cancer (CRC) trial by comparing CTPM accuracy and efficiency to manual chart review. Following the single-trial validation, we then implemented the system across 29 clinical trials spanning multiple cancer specialties and phases.

Results: For the single CRC trial, CTPM achieved 94% retrospective and 88% prospective accuracy, matching gold standard clinical chart review with 100% sensitivity. Implementation reduced chart review workload 10-fold and screening time by 41% (3.1 to 1.8 minutes per chart) for those patients who did undergo review. Since September 2022, the system has screened 98,348 patients across 29 trials, identifying 825 eligible candidates and facilitating 117 patient enrollments with 9%-37% consent rates.

Conclusion: This AI and NLP tool demonstrates improved efficiency in clinical trial recruitment by enabling research teams to focus on qualified candidates rather than exhaustive chart reviews. The OMOP-based framework supports scalability across health systems, with potential to address enrollment challenges that limit patient access to clinical trials.

目的:尽管现有临床试验呈指数级增长,但癌症临床试验的入组率仍极低,仅为成人患者的5%-7%。人工患者-试验匹配是一个基本瓶颈,而当前的人工智能(AI)和机器学习患者-试验匹配系统缺乏跨卫生系统的数据标准化和兼容性。我们开发并验证了一种半自动临床试验患者匹配(CTPM)工具,以提高招募效率和可扩展性。方法:我们创建了一个基于规则和基于自然语言处理(NLP)的混合管道,该管道使用结构化和非结构化电子健康记录数据自动筛选患者,这些数据标准化到观察性医疗结果合作伙伴关系(OMOP)通用数据模型。CTPM的性能首次在一项转移性结直肠癌(CRC)试验中通过比较CTPM的准确性和效率与手动图表审查来评估。在单次试验验证之后,我们在29项临床试验中实施了该系统,涵盖了多个癌症专科和阶段。结果:对于单个结直肠癌试验,CTPM达到94%的回顾性和88%的前瞻性准确性,与金标准临床图表审查100%的敏感性相匹配。对于那些确实接受了检查的患者,实施将图表审查工作量减少了10倍,筛查时间减少了41%(每张图表3.1至1.8分钟)。自2022年9月以来,该系统已经在29项试验中筛选了98,348名患者,确定了825名符合条件的候选人,并促进了117名患者的登记,同意率为9%-37%。结论:该AI和NLP工具通过使研究团队专注于合格的候选人而不是详尽的图表审查,提高了临床试验招募的效率。基于omop的框架支持跨卫生系统的可扩展性,有可能解决限制患者获得临床试验的注册挑战。
{"title":"Clinical Trial Patient Matching: A Real-Time, Common Data Model and Artificial Intelligence-Driven System for Semiautomated Patient Prescreening in Cancer Clinical Trials.","authors":"Guannan Gong, Jessica Liu, Sameer Pandya, Cristian Taborda, Nathalie Wiesendanger, Nate Price, Will Byron, Andreas Coppi, Patrick Young, Christina Wiess, Haley Dunning, Courtney Barganier, Rachel Brodeur, Neal Fischbach, Patricia LoRusso, Lajos Pusztai, So Yeon Kim, Mariya Rozenblit, Michael Cecchini, Anne Mongiu, Lourdes Mendez, Edward Kaftan, Charles Torre, Harlan Krumholz, Ian Krop, Wade Schulz, Maryam Lustberg, Pamela L Kunz","doi":"10.1200/CCI-25-00262","DOIUrl":"https://doi.org/10.1200/CCI-25-00262","url":null,"abstract":"<p><strong>Purpose: </strong>Cancer clinical trial enrollment remains critically low at 5%-7% of adult patients despite exponential growth in available trials. Manual patient-trial matching represents a fundamental bottleneck, whereas current artificial intelligence (AI) and machine learning patient-trial matching systems lack data standardization and compatibility across health systems. We developed and validated a semiautomated clinical trial patient matching (CTPM) tool to improve recruitment efficiency and scalability.</p><p><strong>Methods: </strong>We created a hybrid rules-based and natural language processing (NLP)-based pipeline that automatically screens patients using structured and unstructured electronic health record data standardized to the Observational Medical Outcomes Partnership (OMOP) common data model. CTPM performance was first evaluated on one metastatic colorectal cancer (CRC) trial by comparing CTPM accuracy and efficiency to manual chart review. Following the single-trial validation, we then implemented the system across 29 clinical trials spanning multiple cancer specialties and phases.</p><p><strong>Results: </strong>For the single CRC trial, CTPM achieved 94% retrospective and 88% prospective accuracy, matching gold standard clinical chart review with 100% sensitivity. Implementation reduced chart review workload 10-fold and screening time by 41% (3.1 to 1.8 minutes per chart) for those patients who did undergo review. Since September 2022, the system has screened 98,348 patients across 29 trials, identifying 825 eligible candidates and facilitating 117 patient enrollments with 9%-37% consent rates.</p><p><strong>Conclusion: </strong>This AI and NLP tool demonstrates improved efficiency in clinical trial recruitment by enabling research teams to focus on qualified candidates rather than exhaustive chart reviews. The OMOP-based framework supports scalability across health systems, with potential to address enrollment challenges that limit patient access to clinical trials.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500262"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145946722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extraction of Treatments and Responses From Non-Small Cell Lung Cancer Clinical Notes Using Natural Language Processing. 基于自然语言处理的非小细胞肺癌临床记录的治疗和反应提取。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-08 DOI: 10.1200/CCI-25-00138
Sonish Sivarajkumar, Subhash Edupuganti, David Lazris, Manisha Bhattacharya, Michael Davis, Devin Dressman, Roby Thomas, Yan Hu, Yang Ren, Hua Xu, Ping Yang, Yufei Huang, Yanshan Wang

Purpose: Manual extraction of treatment outcomes from unstructured oncology clinical notes is a significant challenge for real-world evidence (RWE) generation. This study aimed to develop and evaluate a robust natural language processing (NLP) system to automatically extract cancer treatments and their associated RECIST-based response categories (complete response, partial response, stable disease, and progressive disease) from non-small cell lung cancer (NSCLC) clinical notes.

Methods: This retrospective NLP development and validation study used a corpus of 250 NSCLC oncology notes from University of Pittsburgh Medical Center (UPMC) Hillman Cancer Center, annotated by physician experts. An end-to-end NLP pipeline was designed, integrating a rule-based module for entity extraction (treatments and responses) and a machine learning module using biomedical clinical bidirectional encoder representations from transformers for relation classification. The system's performance was evaluated on a held-out test set, with partial external validation for relation extraction on a Mayo Clinic data set.

Results: The NLP system achieved high overall accuracy. On the UPMC test set (64 notes), the relation classification model attained an area under the receiver operating characteristic curve of 0.94 and an F1 score of 0.92 for linking treatments with documented responses. The rule-based entity extraction demonstrated a macro-averaged F1 score of 0.87 (precision 0.98, recall 0.81). Although precision was high for chemotherapy and most response types (1.00), recall for cancer surgery was 0.45. External validation at Mayo Clinic showed moderate relation extraction F1 scores (range: 0.51-0.64).

Conclusion: The proposed NLP system can reliably extract structured treatment and response information from unstructured NSCLC oncology notes with high accuracy. This automated approach can assist in abstracting critical cancer treatment outcomes from clinical narrative text, thereby streamlining real-world data analysis and supporting the generation of RWE in oncology.

目的:从非结构化肿瘤临床记录中手动提取治疗结果是对真实世界证据(RWE)生成的重大挑战。本研究旨在开发和评估一个强大的自然语言处理(NLP)系统,从非小细胞肺癌(NSCLC)的临床记录中自动提取癌症治疗及其相关的基于recist的反应类别(完全缓解、部分缓解、疾病稳定和疾病进展)。方法:这项回顾性NLP开发和验证研究使用了匹兹堡大学医学中心(UPMC) Hillman癌症中心250份非小细胞肺癌肿瘤笔记的语料库,由医师专家注释。设计了端到端NLP管道,集成了用于实体提取(治疗和响应)的基于规则的模块和用于关系分类的生物医学临床双向编码器表示的机器学习模块。系统的性能在一个hold -out测试集上进行了评估,并在梅奥诊所数据集上对关系提取进行了部分外部验证。结果:NLP系统总体准确率较高。在UPMC测试集(64个注释)上,关系分类模型在接受者工作特征曲线下的面积为0.94,将处理与记录的反应联系起来的F1得分为0.92。基于规则的实体提取的宏观平均F1得分为0.87(精度0.98,召回率0.81)。虽然化疗和大多数反应类型的准确率很高(1.00),但癌症手术的召回率为0.45。梅奥诊所的外部验证显示提取F1评分有中等相关性(范围:0.51-0.64)。结论:NLP系统能够可靠、准确地从非结构化的NSCLC肿瘤记录中提取结构化的治疗和反应信息。这种自动化方法可以帮助从临床叙述文本中提取关键的癌症治疗结果,从而简化现实世界的数据分析,并支持肿瘤学中RWE的生成。
{"title":"Extraction of Treatments and Responses From Non-Small Cell Lung Cancer Clinical Notes Using Natural Language Processing.","authors":"Sonish Sivarajkumar, Subhash Edupuganti, David Lazris, Manisha Bhattacharya, Michael Davis, Devin Dressman, Roby Thomas, Yan Hu, Yang Ren, Hua Xu, Ping Yang, Yufei Huang, Yanshan Wang","doi":"10.1200/CCI-25-00138","DOIUrl":"10.1200/CCI-25-00138","url":null,"abstract":"<p><strong>Purpose: </strong>Manual extraction of treatment outcomes from unstructured oncology clinical notes is a significant challenge for real-world evidence (RWE) generation. This study aimed to develop and evaluate a robust natural language processing (NLP) system to automatically extract cancer treatments and their associated RECIST-based response categories (complete response, partial response, stable disease, and progressive disease) from non-small cell lung cancer (NSCLC) clinical notes.</p><p><strong>Methods: </strong>This retrospective NLP development and validation study used a corpus of 250 NSCLC oncology notes from University of Pittsburgh Medical Center (UPMC) Hillman Cancer Center, annotated by physician experts. An end-to-end NLP pipeline was designed, integrating a rule-based module for entity extraction (treatments and responses) and a machine learning module using biomedical clinical bidirectional encoder representations from transformers for relation classification. The system's performance was evaluated on a held-out test set, with partial external validation for relation extraction on a Mayo Clinic data set.</p><p><strong>Results: </strong>The NLP system achieved high overall accuracy. On the UPMC test set (64 notes), the relation classification model attained an area under the receiver operating characteristic curve of 0.94 and an F1 score of 0.92 for linking treatments with documented responses. The rule-based entity extraction demonstrated a macro-averaged F1 score of 0.87 (precision 0.98, recall 0.81). Although precision was high for chemotherapy and most response types (1.00), recall for cancer surgery was 0.45. External validation at Mayo Clinic showed moderate relation extraction F1 scores (range: 0.51-0.64).</p><p><strong>Conclusion: </strong>The proposed NLP system can reliably extract structured treatment and response information from unstructured NSCLC oncology notes with high accuracy. This automated approach can assist in abstracting critical cancer treatment outcomes from clinical narrative text, thereby streamlining real-world data analysis and supporting the generation of RWE in oncology.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500138"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12788794/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145936045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JCO Clinical Cancer Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1