首页 > 最新文献

JCO Clinical Cancer Informatics最新文献

英文 中文
Artificial Intelligence-Assisted Error Detection in Complex Clinical Documentation: Leveraging Large Language Models to Enhance Patient Safety in Oncology. 复杂临床文件中的人工智能辅助错误检测:利用大型语言模型来提高肿瘤患者的安全性。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-06 DOI: 10.1200/CCI-25-00194
Peter May, Sina Nokodian, Christoph Nuernbergk, Manuel Knauer, Maike Hefter, Aaron Becker von Rose, Florian Bassermann, Johannes Jung

Purpose: In high-risk specialties such as oncology, errors in clinical documentation can have severe consequences, highlighting a need for enhanced safety checks. We therefore aimed to evaluate the capability of frontier large language models (LLMs) to identify and correct errors in complex clinical documentation in oncology.

Methods: We conducted a two-phase evaluation. First, we assessed LLMs (GPT o4-mini and Gemini 2.5 Pro) on 1,000 synthetic clinical hematology/oncology vignettes with controlled errors, benchmarking against human expert data for error flag detection and sentence localization. Second, we evaluated advanced LLMs and a local LLM (Gemma 3 27B) against six clinicians in detecting single, predefined, and clinically relevant errors, such as wrong risk classifications or omission of critical medication within 90 synthetic discharge summaries from oncologic patients.

Results: LLMs outperformed human benchmark in error flag and sentence localization tasks, with Gemini 2.5 Pro achieving top accuracies of 0.928 and 0.915, respectively. Results were robust across subgroups and scalable, with simultaneous processing of up to 50 vignettes. Within complex discharge summaries, Gemini 2.5 Pro and GPT o4-mini-high identified 97.8% and 87.8% of injected errors, respectively, substantially exceeding the 47.8% average detection rate of human specialists. Gemma 3 27B detected 35.6% of errors. Analysis of error detection overlap revealed a synergistic potential for hybrid human-artificial intelligence (AI) systems.

Conclusion: Frontier LLMs exhibit superior error-detection capabilities and speed compared with both local models and human specialists, who are inherently time-constrained. Although synthetic data provide a controlled testbed, real-world evaluation across diverse errors and documentation styles remains critical. Advanced LLMs can serve as powerful assistants for clinical documentation reviews, substantially reducing the risk of oversight and clinician workload. Integrating LLM-driven error flagging into electronic health record workflows offers a promising strategy for enhancing documentation accuracy, treatment quality, and patient safety in oncology.

目的:在肿瘤等高风险专科,临床文件中的错误可能会产生严重后果,强调需要加强安全检查。因此,我们旨在评估前沿大语言模型(llm)识别和纠正肿瘤学复杂临床文献错误的能力。方法:采用两阶段评价方法。首先,我们评估了LLMs (GPT o4-mini和Gemini 2.5 Pro)在1000个具有控制误差的临床血液学/肿瘤学合成图像上的性能,并与人类专家数据进行了基准测试,用于错误标记检测和句子定位。其次,我们评估了高级LLM和本地LLM (Gemma 327b)与六名临床医生在检测单个,预定义和临床相关错误方面的差异,例如在90例肿瘤患者的合成出院总结中错误的风险分类或遗漏关键药物。结果:llm在错误标记和句子定位任务上优于人类基准,Gemini 2.5 Pro的最高准确率分别为0.928和0.915。结果是稳健的跨亚组和可扩展的,同时处理多达50个小插曲。在复杂的出院总结中,Gemini 2.5 Pro和GPT o4-mini-high分别识别了97.8%和87.8%的注射错误,大大超过了人类专家47.8%的平均检出率。Gemma 327b检测出35.6%的错误。错误检测重叠分析揭示了混合人类-人工智能(AI)系统的协同潜力。结论:与本地模型和受时间限制的人类专家相比,Frontier llm具有更好的错误检测能力和速度。尽管合成数据提供了一个可控的测试平台,但是在真实世界中对各种错误和文档风格进行评估仍然至关重要。高级法学硕士可以作为临床文件审查的有力助手,大大减少监督风险和临床医生的工作量。将法学硕士驱动的错误标记集成到电子健康记录工作流程中,为提高肿瘤学文档准确性、治疗质量和患者安全性提供了一种有前途的策略。
{"title":"Artificial Intelligence-Assisted Error Detection in Complex Clinical Documentation: Leveraging Large Language Models to Enhance Patient Safety in Oncology.","authors":"Peter May, Sina Nokodian, Christoph Nuernbergk, Manuel Knauer, Maike Hefter, Aaron Becker von Rose, Florian Bassermann, Johannes Jung","doi":"10.1200/CCI-25-00194","DOIUrl":"10.1200/CCI-25-00194","url":null,"abstract":"<p><strong>Purpose: </strong>In high-risk specialties such as oncology, errors in clinical documentation can have severe consequences, highlighting a need for enhanced safety checks. We therefore aimed to evaluate the capability of frontier large language models (LLMs) to identify and correct errors in complex clinical documentation in oncology.</p><p><strong>Methods: </strong>We conducted a two-phase evaluation. First, we assessed LLMs (GPT o4-mini and Gemini 2.5 Pro) on 1,000 synthetic clinical hematology/oncology vignettes with controlled errors, benchmarking against human expert data for error flag detection and sentence localization. Second, we evaluated advanced LLMs and a local LLM (Gemma 3 27B) against six clinicians in detecting single, predefined, and clinically relevant errors, such as wrong risk classifications or omission of critical medication within 90 synthetic discharge summaries from oncologic patients.</p><p><strong>Results: </strong>LLMs outperformed human benchmark in error flag and sentence localization tasks, with Gemini 2.5 Pro achieving top accuracies of 0.928 and 0.915, respectively. Results were robust across subgroups and scalable, with simultaneous processing of up to 50 vignettes. Within complex discharge summaries, Gemini 2.5 Pro and GPT o4-mini-high identified 97.8% and 87.8% of injected errors, respectively, substantially exceeding the 47.8% average detection rate of human specialists. Gemma 3 27B detected 35.6% of errors. Analysis of error detection overlap revealed a synergistic potential for hybrid human-artificial intelligence (AI) systems.</p><p><strong>Conclusion: </strong>Frontier LLMs exhibit superior error-detection capabilities and speed compared with both local models and human specialists, who are inherently time-constrained. Although synthetic data provide a controlled testbed, real-world evaluation across diverse errors and documentation styles remains critical. Advanced LLMs can serve as powerful assistants for clinical documentation reviews, substantially reducing the risk of oversight and clinician workload. Integrating LLM-driven error flagging into electronic health record workflows offers a promising strategy for enhancing documentation accuracy, treatment quality, and patient safety in oncology.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500194"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12794695/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145913830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Survival Models in Health Care by Balancing Imbalanced Cohorts: A Novel Approach. 通过平衡不平衡队列改善医疗保健中的生存模式:一种新方法。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-28 DOI: 10.1200/CCI-25-00190
Catherine Ning, Dimitris Bertsimas, Per Eystein Lønning, Federico N Auecio, Richard Burkhart, Felix Balzer, Stefan Buettner, Hideo Baba, Itaru Endo, Georgios Stasinos, Johan Gagnière, Cornelis Verhoef, Martin E Kreis, Georgios Antonios Margonis

Purpose: We explore whether survival model performance in underrepresented high- and low-risk subgroups-regions of the prognostic spectrum where clinical decisions are most consequential-can be improved through targeted restructuring of the training data set. Rather than modifying model architecture, we propose a novel risk-stratified sampling method that addresses imbalances in prognostic subgroup density to support more reliable learning in underrepresented tail strata.

Methods: We introduce a novel methodology that partitions patients by baseline prognostic risk and applies matching within each stratum to equalize representation across the risk distribution. We implement this framework on a cohort of 1,799 patients with resected colorectal liver metastases (CRLM), including 1,197 who received adjuvant chemotherapy and 602 who did not. All models used in this study are Cox proportional hazards models trained on the same set of selected variables. Model performance is assessed via Harrell's C index and Integrated Calibration Index, with internal validation using Efron's bias-corrected bootstrapping. External validation is conducted on two independent CRLM data sets.

Results: Cox models trained on risk-balanced cohorts showed consistent improvements in internal validation compared with models trained on the full data set. The proposed approach preserved overall model calibration while noticeably improving stratified C index values in underrepresented high- and low-risk strata of the external cohorts.

Conclusion: Our findings suggest that survival model performance in observational oncology cohorts can be meaningfully improved through targeted rebalancing of the training data across prognostic risk strata. This approach offers a practical and model-agnostic complement to existing methods, especially in applications where predictive reliability across the full risk continuum is critical to downstream clinical decisions.

目的:我们探索在未被充分代表的高风险和低风险亚组(临床决策最重要的预后范围区域)中,生存模型的表现是否可以通过有针对性地重组训练数据集来改善。我们提出了一种新的风险分层抽样方法,而不是修改模型结构,该方法解决了预测子群密度的不平衡,以支持在代表性不足的尾层中更可靠的学习。方法:我们引入了一种新的方法,根据基线预后风险对患者进行分区,并在每个阶层内进行匹配,以均衡整个风险分布的代表性。我们在1799例切除的结直肠肝转移(CRLM)患者中实施了这一框架,其中1197例接受了辅助化疗,602例未接受辅助化疗。本研究中使用的所有模型都是在同一组选定变量上训练的Cox比例风险模型。模型性能通过Harrell的C指数和集成校准指数进行评估,并使用Efron的偏差校正引导进行内部验证。在两个独立的CRLM数据集上进行外部验证。结果:与在完整数据集上训练的模型相比,在风险平衡队列上训练的Cox模型在内部验证方面显示出一致的改进。所提出的方法保留了整体模型校准,同时显着改善了外部队列中代表性不足的高风险和低风险阶层的分层C指数值。结论:我们的研究结果表明,通过对预后风险层的训练数据进行有针对性的再平衡,观察性肿瘤队列中的生存模型性能可以得到有意义的改善。这种方法为现有方法提供了一种实用的、与模型无关的补充,特别是在整个风险连续体的预测可靠性对下游临床决策至关重要的应用中。
{"title":"Improving Survival Models in Health Care by Balancing Imbalanced Cohorts: A Novel Approach.","authors":"Catherine Ning, Dimitris Bertsimas, Per Eystein Lønning, Federico N Auecio, Richard Burkhart, Felix Balzer, Stefan Buettner, Hideo Baba, Itaru Endo, Georgios Stasinos, Johan Gagnière, Cornelis Verhoef, Martin E Kreis, Georgios Antonios Margonis","doi":"10.1200/CCI-25-00190","DOIUrl":"https://doi.org/10.1200/CCI-25-00190","url":null,"abstract":"<p><strong>Purpose: </strong>We explore whether survival model performance in underrepresented high- and low-risk subgroups-regions of the prognostic spectrum where clinical decisions are most consequential-can be improved through targeted restructuring of the training data set. Rather than modifying model architecture, we propose a novel risk-stratified sampling method that addresses imbalances in prognostic subgroup density to support more reliable learning in underrepresented tail strata.</p><p><strong>Methods: </strong>We introduce a novel methodology that partitions patients by baseline prognostic risk and applies matching within each stratum to equalize representation across the risk distribution. We implement this framework on a cohort of 1,799 patients with resected colorectal liver metastases (CRLM), including 1,197 who received adjuvant chemotherapy and 602 who did not. All models used in this study are Cox proportional hazards models trained on the same set of selected variables. Model performance is assessed via Harrell's C index and Integrated Calibration Index, with internal validation using Efron's bias-corrected bootstrapping. External validation is conducted on two independent CRLM data sets.</p><p><strong>Results: </strong>Cox models trained on risk-balanced cohorts showed consistent improvements in internal validation compared with models trained on the full data set. The proposed approach preserved overall model calibration while noticeably improving stratified C index values in underrepresented high- and low-risk strata of the external cohorts.</p><p><strong>Conclusion: </strong>Our findings suggest that survival model performance in observational oncology cohorts can be meaningfully improved through targeted rebalancing of the training data across prognostic risk strata. This approach offers a practical and model-agnostic complement to existing methods, especially in applications where predictive reliability across the full risk continuum is critical to downstream clinical decisions.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500190"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12854512/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146094800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge Representation of a Multicenter Adolescent and Young Adult Cancer Infrastructure: Development of the STRONG AYA Knowledge Graph. 多中心青少年和青年癌症基础设施的知识表示:STRONG AYA知识图谱的发展。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-14 DOI: 10.1200/CCI-25-00177
Joshi Hogenboom, Varsha Gouthamchand, Charlotte Cairns, Silvie H M Janssen, Kirsty Way, Andre L A J Dekker, Winette T A van der Graaf, Anne-Sophie Darlington, Olga Husson, Leonard Y L Wee, Johan van Soest, Aiara Lobo Gomes

Purpose: Rare diseases are difficult to fully capture, and regularly call for large, geographically dispersed initiatives. Such initiatives are often met with data harmonization challenges. These challenges render data incompatible and impede successful realization. The STRONG AYA project is such an initiative, specifically focusing on adolescent and young adult (AYAs) with cancer. STRONG AYA is setting up a federated data infrastructure containing data of varying format. Here, we elaborate on how we used health care-agnostic semantic web technologies to overcome such challenges.

Methods: We structured the STRONG AYA case-mix and core outcome measures concepts and their properties as knowledge graphs. Having identified the corresponding standard terminologies, we developed a semantic map on the basis of the knowledge graphs and the here introduced annotation helper plugin for Flyover. Flyover is a tool that converts structured data into resource description framework (RDF) triples and enables semantic interoperability. As a demonstration, we mapped data that are to be included in the STRONG AYA infrastructure.

Results: The knowledge graphs provided a comprehensive overview of the large number of STRONG AYA concepts. The semantic terminology mapping and annotation helper allowed us to query data with incomprehensible terminologies, without changing them. Both the knowledge graphs and semantic map were made available on a Hugo webpage for increased transparency and understanding.

Conclusion: The use of semantic web technologies, such as RDF and knowledge graphs, is a viable solution to overcome challenges regarding data interoperability and reusability for a federated AYA cancer data infrastructure without being bound to rigid standardized schemas. The linkage of semantically meaningful concepts to otherwise incomprehensible data elements demonstrates how by using these domain-agnostic technologies we made nonstandardized health care data interoperable.

目的:罕见病很难完全捕捉,经常需要大规模的、地理上分散的行动。此类举措经常遇到数据协调方面的挑战。这些挑战使得数据不兼容,阻碍了成功的实现。STRONG AYA项目就是这样一项倡议,专门针对患有癌症的青少年和年轻人。STRONG AYA正在建立一个包含不同格式数据的联邦数据基础设施。在这里,我们详细阐述了我们如何使用与医疗保健无关的语义web技术来克服这些挑战。方法:我们将STRONG AYA病例组合和核心结果测量概念及其属性构建为知识图。在确定了相应的标准术语之后,我们在知识图的基础上开发了一个语义图,并在这里介绍了Flyover的注释助手插件。Flyover是一种将结构化数据转换为资源描述框架(RDF)三元组并支持语义互操作性的工具。作为演示,我们映射了将包含在STRONG AYA基础结构中的数据。结果:知识图谱提供了大量STRONG AYA概念的全面概述。语义术语映射和注释帮助器允许我们查询具有难以理解的术语的数据,而无需更改它们。知识图和语义图都在Hugo网页上提供,以增加透明度和理解。结论:使用语义web技术,如RDF和知识图,是一种可行的解决方案,可以克服联邦AYA癌症数据基础设施的数据互操作性和可重用性方面的挑战,而无需绑定到严格的标准化模式。语义上有意义的概念与其他方面难以理解的数据元素之间的联系表明,通过使用这些与领域无关的技术,我们如何使非标准化的医疗保健数据具有互操作性。
{"title":"Knowledge Representation of a Multicenter Adolescent and Young Adult Cancer Infrastructure: Development of the STRONG AYA Knowledge Graph.","authors":"Joshi Hogenboom, Varsha Gouthamchand, Charlotte Cairns, Silvie H M Janssen, Kirsty Way, Andre L A J Dekker, Winette T A van der Graaf, Anne-Sophie Darlington, Olga Husson, Leonard Y L Wee, Johan van Soest, Aiara Lobo Gomes","doi":"10.1200/CCI-25-00177","DOIUrl":"10.1200/CCI-25-00177","url":null,"abstract":"<p><strong>Purpose: </strong>Rare diseases are difficult to fully capture, and regularly call for large, geographically dispersed initiatives. Such initiatives are often met with data harmonization challenges. These challenges render data incompatible and impede successful realization. The STRONG AYA project is such an initiative, specifically focusing on adolescent and young adult (AYAs) with cancer. STRONG AYA is setting up a federated data infrastructure containing data of varying format. Here, we elaborate on how we used health care-agnostic semantic web technologies to overcome such challenges.</p><p><strong>Methods: </strong>We structured the STRONG AYA case-mix and core outcome measures concepts and their properties as knowledge graphs. Having identified the corresponding standard terminologies, we developed a semantic map on the basis of the knowledge graphs and the here introduced annotation helper plugin for <i>Flyover</i>. <i>Flyover</i> is a tool that converts structured data into resource description framework (RDF) triples and enables semantic interoperability. As a demonstration, we mapped data that are to be included in the STRONG AYA infrastructure.</p><p><strong>Results: </strong>The knowledge graphs provided a comprehensive overview of the large number of STRONG AYA concepts. The semantic terminology mapping and annotation helper allowed us to query data with incomprehensible terminologies, without changing them. Both the knowledge graphs and semantic map were made available on a Hugo webpage for increased transparency and understanding.</p><p><strong>Conclusion: </strong>The use of semantic web technologies, such as RDF and knowledge graphs, is a viable solution to overcome challenges regarding data interoperability and reusability for a federated AYA cancer data infrastructure without being bound to rigid standardized schemas. The linkage of semantically meaningful concepts to otherwise incomprehensible data elements demonstrates how by using these domain-agnostic technologies we made nonstandardized health care data interoperable.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500177"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12834280/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145985727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synoptic Multidisciplinary Team Meeting Workflows to Promote Guideline-Based Classification of Resectability in Pancreatic Cancer: A Multicenter Prospective Study. 综合多学科小组会议工作流程以促进基于指南的胰腺癌可切除性分类:一项多中心前瞻性研究。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-06 DOI: 10.1200/CCI-25-00255
William McGahan, Nick Butler, Thomas O'Rourke, Bernard Mark Smithers, David Cavallucci

Purpose: To address incomplete and inconsistent classification of pancreatic cancer resectability according to International Association of Pancreatology (IAP) anatomic, biologic, and conditional criteria.

Materials and methods: We designed, implemented, and evaluated an interoperable, web-based platform that captured structured pretreatment data and performed algorithm-driven resectability classification. Linked modules supported referral to and discussion at multidisciplinary team meetings (MDTMs) at two quaternary hospitals (June 2021-February 2022) and populated downstream documentation. In a pre-post study, Pearson χ2 test and multivariable logistic regression (odds ratios [ORs] with 95% CI) compared data completeness (primary end point), as well as the distribution of IAP-defined resectability and treatment intent (secondary end points). In the postintervention cohort, overall survival (OS) was stratified by IAP resectability using Kaplan-Meier curves and compared using the log-rank test. Hazard ratios (HRs) with 95% CIs and log-rank statistics were calculated for individual resectability criteria using Cox models. All tests were two-sided with nominal significance (P < .05). An embedded module evaluated workflow integration and user experience.

Results: Ninety-five patients with pancreatic cancer were referred to MDTMs during the intervention period, of whom 71 were eligible. Compared with 71 preintervention patients, the system improved documentation of tumor-vessel relationships (OR, 9.39 [95% CI, 4.43 to 21.7]), locoregional lymphadenopathy (OR, 30.5 [95% CI, 11.1 to 102]), and performance status (PS; OR, 3.34 [95% CI, 1.67 to 6.85]), reducing the number with unknown resectability (OR, 0.10 [95% CI, 0.03 to 0.25]). PS ≥ 2 (HR, 2.16 [95% CI, 1.06 to 4.43]) and serum CA19.9 ≥ 500 U/mL (HR, 1.94 [95% CI, 1.03 to 3.63]) were significantly associated with OS, whereas anatomic criteria were not.

Discussion: A synoptic intervention integrated into MDTM workflows across multiple sites improved structured data capture, reduced unknown resectability, and highlighted the relevance of biologic and conditional criteria in addition to tumor anatomy.

目的:根据国际胰脏学协会(IAP)的解剖学、生物学和条件标准,解决胰腺癌可切除性分类不完整和不一致的问题。材料和方法:我们设计、实现并评估了一个可互操作的基于web的平台,该平台捕获结构化预处理数据并执行算法驱动的可切除性分类。链接模块支持两家第四医院(2021年6月至2022年2月)的多学科小组会议(MDTMs)的转诊和讨论,并填充下游文件。在一项前后研究中,Pearson χ2检验和多变量logistic回归(95% CI的比值比[ORs])比较了数据的完整性(主要终点),以及iap定义的可切除性和治疗意图的分布(次要终点)。在干预后队列中,使用Kaplan-Meier曲线对IAP可切除性进行总生存率(OS)分层,并使用log-rank检验进行比较。使用Cox模型计算个体可切除性标准的95% ci和log-rank统计的风险比(hr)。所有检验均为双侧检验,具有名义显著性(P < 0.05)。一个嵌入式模块评估工作流集成和用户体验。结果:干预期间95例胰腺癌患者转介到MDTMs,其中71例符合条件。与71例干预前患者相比,该系统改善了肿瘤与血管关系(OR, 9.39 [95% CI, 4.43至21.7])、局部区域淋巴结病变(OR, 30.5 [95% CI, 11.1至102])和功能状态(PS; OR, 3.34 [95% CI, 1.67至6.85])的记录,减少了未知可切除性的患者数量(OR, 0.10 [95% CI, 0.03至0.25])。PS≥2 (HR, 2.16 [95% CI, 1.06 ~ 4.43])和血清CA19.9≥500 U/mL (HR, 1.94 [95% CI, 1.03 ~ 3.63])与OS有显著相关性,而解剖标准与OS无显著相关性。讨论:将综合干预整合到MDTM跨多个部位的工作流程中,改善了结构化数据捕获,减少了未知的可切除性,并强调了除肿瘤解剖外生物学和条件标准的相关性。
{"title":"Synoptic Multidisciplinary Team Meeting Workflows to Promote Guideline-Based Classification of Resectability in Pancreatic Cancer: A Multicenter Prospective Study.","authors":"William McGahan, Nick Butler, Thomas O'Rourke, Bernard Mark Smithers, David Cavallucci","doi":"10.1200/CCI-25-00255","DOIUrl":"10.1200/CCI-25-00255","url":null,"abstract":"<p><strong>Purpose: </strong>To address incomplete and inconsistent classification of pancreatic cancer resectability according to International Association of Pancreatology (IAP) anatomic, biologic, and conditional criteria.</p><p><strong>Materials and methods: </strong>We designed, implemented, and evaluated an interoperable, web-based platform that captured structured pretreatment data and performed algorithm-driven resectability classification. Linked modules supported referral to and discussion at multidisciplinary team meetings (MDTMs) at two quaternary hospitals (June 2021-February 2022) and populated downstream documentation. In a pre-post study, Pearson χ<sup>2</sup> test and multivariable logistic regression (odds ratios [ORs] with 95% CI) compared data completeness (primary end point), as well as the distribution of IAP-defined resectability and treatment intent (secondary end points). In the postintervention cohort, overall survival (OS) was stratified by IAP resectability using Kaplan-Meier curves and compared using the log-rank test. Hazard ratios (HRs) with 95% CIs and log-rank statistics were calculated for individual resectability criteria using Cox models. All tests were two-sided with nominal significance (<i>P</i> < .05). An embedded module evaluated workflow integration and user experience.</p><p><strong>Results: </strong>Ninety-five patients with pancreatic cancer were referred to MDTMs during the intervention period, of whom 71 were eligible. Compared with 71 preintervention patients, the system improved documentation of tumor-vessel relationships (OR, 9.39 [95% CI, 4.43 to 21.7]), locoregional lymphadenopathy (OR, 30.5 [95% CI, 11.1 to 102]), and performance status (PS; OR, 3.34 [95% CI, 1.67 to 6.85]), reducing the number with <i>unknown</i> resectability (OR, 0.10 [95% CI, 0.03 to 0.25]). PS ≥ 2 (HR, 2.16 [95% CI, 1.06 to 4.43]) and serum CA19.9 ≥ 500 U/mL (HR, 1.94 [95% CI, 1.03 to 3.63]) were significantly associated with OS, whereas anatomic criteria were not.</p><p><strong>Discussion: </strong>A synoptic intervention integrated into MDTM workflows across multiple sites improved structured data capture, reduced <i>unknown</i> resectability, and highlighted the relevance of biologic and conditional criteria in addition to tumor anatomy.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500255"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145913860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tumor Board-Inspired Multiagent Artificial Intelligence System for Interpreting Oncology Guidelines. 肿瘤委员会启发的多智能体人工智能系统解读肿瘤学指南。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-07 DOI: 10.1200/CCI-25-00286
Jiasheng Wang, Kirti Arora, David M Swoboda, Aziz Nazha

Purpose: Clinical guidelines are essential for evidence-based oncology care but are often long, complex, and difficult to navigate. We developed a multiagent artificial intelligence (AI) system to accurately retrieve and interpret guideline content in response to guideline-based clinical questions.

Methods: We included 34 ASCO guidelines published between January 2021 and December 2024. Using a multiagent framework, we assigned distinct roles to AI agents: a Coordinator Agent selected the relevant guideline, specialized Tumor Board Agents extracted information from text, tables, and figures, and a Reviewer Agent synthesized a final answer. A total of 100 open-ended questions were created on the basis of the guideline content. The system's performance was compared with GPT-4o, Claude 3.7, Gemini 2.5 flash, DeepSeek-R1, and the ASCO Guidelines Assistant.

Results: The multi-agent system achieved (94% [95% CI, 89.3 to 98.7]) accuracy in selecting the correct guidelines and (90% [95% CI, 84.1 to 95.9]) accuracy in answering questions. This significantly outperformed GPT-4o (48%), Claude 3.7 (49%), Gemini 2.5 (50%), DeepSeek-R1 (58%), and the ASCO Guidelines Assistant (67%, all P < .01, McNemar's test). Most errors were due to incorrect guideline selection or misinterpretation; no hallucinated answers were observed. Removing the Coordinator Agent reduced accuracy to 40%, and excluding tables and figures reduced accuracy to 51%.

Conclusion: By assigning specialized tasks to AI agents and incorporating visual elements from clinical guidelines, our system outperformed existing tools in accurately answering oncology questions. This pilot study, limited to ASCO guidelines, may improve access to guideline-based care.

目的:临床指南对循证肿瘤学治疗至关重要,但往往冗长、复杂且难以驾驭。我们开发了一个多智能体人工智能(AI)系统,以准确检索和解释指南内容,以响应基于指南的临床问题。方法:我们纳入了2021年1月至2024年12月期间发表的34份ASCO指南。使用多代理框架,我们为人工智能代理分配了不同的角色:协调代理选择相关指南,专门的肿瘤委员会代理从文本、表格和数字中提取信息,审稿人代理合成最终答案。在指南内容的基础上,共设置了100个开放式问题。该系统的性能与gpt - 40、Claude 3.7、Gemini 2.5 flash、DeepSeek-R1和ASCO指南助手进行了比较。结果:多智能体系统在选择正确指南方面达到(94% [95% CI, 89.3至98.7])准确率,在回答问题方面达到(90% [95% CI, 84.1至95.9])准确率。这明显优于gpt - 40 (48%), Claude 3.7 (49%), Gemini 2.5 (50%), DeepSeek-R1(58%)和ASCO指南助理(67%,均P < 0.01, McNemar试验)。大多数错误是由于指南选择不正确或误解所致;没有观察到有幻觉的答案。删除协调代理将准确性降低到40%,排除表格和数字将准确性降低到51%。结论:通过将专业任务分配给人工智能代理,并结合临床指南中的视觉元素,我们的系统在准确回答肿瘤问题方面优于现有工具。这项试点研究,仅限于ASCO指南,可能会改善获得基于指南的护理。
{"title":"Tumor Board-Inspired Multiagent Artificial Intelligence System for Interpreting Oncology Guidelines.","authors":"Jiasheng Wang, Kirti Arora, David M Swoboda, Aziz Nazha","doi":"10.1200/CCI-25-00286","DOIUrl":"https://doi.org/10.1200/CCI-25-00286","url":null,"abstract":"<p><strong>Purpose: </strong>Clinical guidelines are essential for evidence-based oncology care but are often long, complex, and difficult to navigate. We developed a multiagent artificial intelligence (AI) system to accurately retrieve and interpret guideline content in response to guideline-based clinical questions.</p><p><strong>Methods: </strong>We included 34 ASCO guidelines published between January 2021 and December 2024. Using a multiagent framework, we assigned distinct roles to AI agents: a Coordinator Agent selected the relevant guideline, specialized Tumor Board Agents extracted information from text, tables, and figures, and a Reviewer Agent synthesized a final answer. A total of 100 open-ended questions were created on the basis of the guideline content. The system's performance was compared with GPT-4o, Claude 3.7, Gemini 2.5 flash, DeepSeek-R1, and the ASCO Guidelines Assistant.</p><p><strong>Results: </strong>The multi-agent system achieved (94% [95% CI, 89.3 to 98.7]) accuracy in selecting the correct guidelines and (90% [95% CI, 84.1 to 95.9]) accuracy in answering questions. This significantly outperformed GPT-4o (48%), Claude 3.7 (49%), Gemini 2.5 (50%), DeepSeek-R1 (58%), and the ASCO Guidelines Assistant (67%, all <i>P</i> < .01, McNemar's test). Most errors were due to incorrect guideline selection or misinterpretation; no hallucinated answers were observed. Removing the Coordinator Agent reduced accuracy to 40%, and excluding tables and figures reduced accuracy to 51%.</p><p><strong>Conclusion: </strong>By assigning specialized tasks to AI agents and incorporating visual elements from clinical guidelines, our system outperformed existing tools in accurately answering oncology questions. This pilot study, limited to ASCO guidelines, may improve access to guideline-based care.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500286"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145918949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative Artificial Intelligence Successfully Automates Data Extraction From Unstructured Magnetic Resonance Imaging Reports: Feasibility in Prostate Cancer Care. 生成式人工智能成功地从非结构化磁共振成像报告中自动提取数据:前列腺癌护理的可行性。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-07 DOI: 10.1200/CCI-24-00334
Anobel Y Odisho, Andrew W Liu, William A Pace, Marvin N Carlisle, Robert Krumm, Janet E Cowan, Peter R Carroll, Matthew R Cooperberg

Purpose: Radiology reports are stored as plain text in most electronic health records, rendering the data computationally inaccessible. Large language models are powerful tools for analyzing unstructured text but relatively untested in urologic oncology. We aimed to develop a pipeline to extract data from plain text prostate magnetic resonance imaging (MRI) reports using GPT4.0 and compare the accuracy to manually abstracted data.

Methods: We developed a data pipeline using a secure, enterprise-wide deployment of OpenAI's GPT-4.0 to automatically extract data elements from prostate MRI report text when presented with prostate MRI reports. Identical prompts and reports were sent multiple times to determine response variability. We extracted 15 data elements per report and compared accuracy to a manually abstracted gold standard.

Results: Across 424 prostate MRI reports, GPT-4.0 response accuracy was consistently above 95%. Individual field accuracies were 98.3% (96.3%-99.3%) for prostate-specific antigen density, 97.4% (95.4%-98.7%) for extracapsular extension, and 98.1% (96.3%-99.2%) for TNM stage, and had a median of 98.1% (96.3%-99.2%), a mean of 97.2% (95.2%-98.3%), and a range of 99.8% (98.7%-100.0%) for number of suspicious lesions to 87.7% (84.2%-90.7%) for identification of lesion location in the base of the prostate. Response variability over five repeated runs ranged from 0.14% to 3.61%, differed based on the data element extracted (P < .001), and was inversely correlated with accuracy (P < .001). In disagreements between manual and GPT-4.0 extracted data, GPT-4.0 responses were more often deemed correct by an additional reviewer.

Conclusion: GPT-4.0 had high accuracy with low variability in extracting data points from prostate cancer MRI reports with low upfront programming requirements. This represents an effective tool to expedite medical data extraction for clinical and research use cases.

目的:在大多数电子健康记录中,放射学报告以纯文本形式存储,使得数据在计算上不可访问。大型语言模型是分析非结构化文本的强大工具,但在泌尿肿瘤学中尚未经过测试。我们的目标是开发一个使用GPT4.0从纯文本前列腺磁共振成像(MRI)报告中提取数据的管道,并将其与手动提取数据的准确性进行比较。方法:我们开发了一个数据管道,使用安全的、企业级部署的OpenAI的GPT-4.0,当出现前列腺MRI报告时,自动从前列腺MRI报告文本中提取数据元素。多次发送相同的提示和报告,以确定响应的可变性。我们为每个报告提取了15个数据元素,并将准确性与手动提取的黄金标准进行了比较。结果:在424份前列腺MRI报告中,GPT-4.0反应准确率始终在95%以上。前列腺特异性抗原密度、包膜外延伸和TNM分期的准确率分别为98.3%(96.3% ~ 99.3%)、97.4%(95.4% ~ 98.7%)和98.1%(96.3% ~ 99.2%),中位数为98.1%(96.3% ~ 99.2%),平均值为97.2%(95.2% ~ 98.3%),可疑病变数量的准确率为99.8%(98.7% ~ 100.0%),前列腺基底部病变位置的准确率为87.7%(84.2% ~ 90.7%)。五次重复运行的响应变异性范围为0.14%至3.61%,根据提取的数据元素不同而不同(P < .001),并且与准确性呈负相关(P < .001)。在手动和GPT-4.0提取的数据不一致时,GPT-4.0的回答通常被额外的审稿人认为是正确的。结论:GPT-4.0从前列腺癌MRI报告中提取数据点的准确性高,变异性低,前期编程要求低。这是一种有效的工具,可以加快临床和研究用例的医疗数据提取。
{"title":"Generative Artificial Intelligence Successfully Automates Data Extraction From Unstructured Magnetic Resonance Imaging Reports: Feasibility in Prostate Cancer Care.","authors":"Anobel Y Odisho, Andrew W Liu, William A Pace, Marvin N Carlisle, Robert Krumm, Janet E Cowan, Peter R Carroll, Matthew R Cooperberg","doi":"10.1200/CCI-24-00334","DOIUrl":"https://doi.org/10.1200/CCI-24-00334","url":null,"abstract":"<p><strong>Purpose: </strong>Radiology reports are stored as plain text in most electronic health records, rendering the data computationally inaccessible. Large language models are powerful tools for analyzing unstructured text but relatively untested in urologic oncology. We aimed to develop a pipeline to extract data from plain text prostate magnetic resonance imaging (MRI) reports using GPT4.0 and compare the accuracy to manually abstracted data.</p><p><strong>Methods: </strong>We developed a data pipeline using a secure, enterprise-wide deployment of OpenAI's GPT-4.0 to automatically extract data elements from prostate MRI report text when presented with prostate MRI reports. Identical prompts and reports were sent multiple times to determine response variability. We extracted 15 data elements per report and compared accuracy to a manually abstracted gold standard.</p><p><strong>Results: </strong>Across 424 prostate MRI reports, GPT-4.0 response accuracy was consistently above 95%. Individual field accuracies were 98.3% (96.3%-99.3%) for prostate-specific antigen density, 97.4% (95.4%-98.7%) for extracapsular extension, and 98.1% (96.3%-99.2%) for TNM stage, and had a median of 98.1% (96.3%-99.2%), a mean of 97.2% (95.2%-98.3%), and a range of 99.8% (98.7%-100.0%) for number of suspicious lesions to 87.7% (84.2%-90.7%) for identification of lesion location in the base of the prostate. Response variability over five repeated runs ranged from 0.14% to 3.61%, differed based on the data element extracted (<i>P</i> < .001), and was inversely correlated with accuracy (<i>P</i> < .001). In disagreements between manual and GPT-4.0 extracted data, GPT-4.0 responses were more often deemed correct by an additional reviewer.</p><p><strong>Conclusion: </strong>GPT-4.0 had high accuracy with low variability in extracting data points from prostate cancer MRI reports with low upfront programming requirements. This represents an effective tool to expedite medical data extraction for clinical and research use cases.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2400334"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145919369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-Series Clustering Captures Patterns of Early Immune Effector Cell-Associated Hematotoxicity That Are Predictable Using Tree-Based Models. 时间序列聚类捕捉早期免疫效应细胞相关血液毒性的模式,使用基于树的模型可预测。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-14 DOI: 10.1200/CCI-25-00148
Emily C Liang, Yein Jeon, Yang Qiao, Xiancheng Wu, Jennifer J Huang, Andrew J Portuguese, Ryan Basom, Aiko Torkelson, Delaney Kirchmeier, Kristina Braathen, Andrew J Cowan, Mazyar Shadman, Alexandre V Hirayama, Brian G Till, Erik L Kimble, Qian Wu, Jordan Gauthier

Purpose: Immune effector cell-associated hematotoxicity (ICAHT) is a major cause of nonrelapse mortality after chimeric antigen receptor (CAR) T-cell therapy. We hypothesized that unsupervised time-series clustering could better identify archetypal patterns of early hematotoxicity compared to the early ICAHT (eICAHT) grading system.

Methods: We applied unsupervised k-means time-series clustering based on Euclidean distances to longitudinal absolute neutrophil count (ANC) data from days +0 through +30 post-CAR T-cell infusion in 691 patients treated at our center (training set: n = 483, 70%; test set: n = 208, 30%).

Results: Within our training set, we identified an optimal cluster solution based on four ANC recovery clusters, which were labeled as very good, good, poor, and very poor. We trained a random forest (RF) model including the top five most important features (day +3, +4, +5, +26, and +27 ANC values) to predict the cluster assignments. Within our test set, we applied the RF model to predict cluster assignments. Compared with the eICAHT criteria, the RF-predicted clusters were more compact and better separated (Dunn index: 0.078 v 0.034; average silhouette width: 0.12 v 0.010). In addition, the RF model identified patients in the good recovery cluster with intermediate overall survival (hazard ratio [HR], 1.70 [95% CI, 1.05 to 2.74]; P = .029; reference, very good), which was not captured by grade 2 eICAHT (HR, 1.37 [95% CI, 0.80 to 2.35]; P = .25; reference, grade 0-1).

Conclusion: Unsupervised time-series clustering identified distinct and clinically relevant patterns of hematotoxicity after CAR T-cell therapy. We trained and tested an RF model that accurately predicted cluster assignments using only five features. Predictions can be generated using our online web application.

目的:免疫效应细胞相关血液毒性(ICAHT)是嵌合抗原受体(CAR) t细胞治疗后非复发性死亡的主要原因。我们假设,与早期ICAHT (eICAHT)分级系统相比,无监督时间序列聚类可以更好地识别早期血液毒性的原型模式。方法:我们将基于欧几里得距离的无监督k-均值时间序列聚类应用于691名在我们中心接受治疗的患者(训练集:n = 483,70%;测试集:n = 208,30%) car - t细胞输注后+0至+30天的纵向绝对中性粒细胞计数(ANC)数据。结果:在我们的训练集中,我们确定了一个基于四个ANC恢复集群的最优集群解决方案,这些集群被标记为非常好、好、差和非常差。我们训练了一个随机森林(RF)模型,包括前五个最重要的特征(日+3、+4、+5、+26和+27 ANC值)来预测聚类分配。在我们的测试集中,我们应用RF模型来预测集群分配。与eICAHT标准相比,rf预测的聚类更紧凑,分离性更好(Dunn指数:0.078 v 0.034;平均轮廓宽度:0.12 v 0.010)。此外,RF模型确定了处于良好恢复组的患者,总生存率为中等(风险比[HR], 1.70 [95% CI, 1.05至2.74];P = 0.029;参考文献,非常好),2级eICAHT未捕获这些患者(风险比[HR], 1.37 [95% CI, 0.80至2.35];P = 0.25;参考文献,0-1级)。结论:无监督的时间序列聚类识别出CAR - t细胞治疗后血液毒性的独特和临床相关模式。我们训练并测试了一个RF模型,该模型仅使用五个特征就能准确地预测聚类分配。预测可以使用我们的在线web应用程序生成。
{"title":"Time-Series Clustering Captures Patterns of Early Immune Effector Cell-Associated Hematotoxicity That Are Predictable Using Tree-Based Models.","authors":"Emily C Liang, Yein Jeon, Yang Qiao, Xiancheng Wu, Jennifer J Huang, Andrew J Portuguese, Ryan Basom, Aiko Torkelson, Delaney Kirchmeier, Kristina Braathen, Andrew J Cowan, Mazyar Shadman, Alexandre V Hirayama, Brian G Till, Erik L Kimble, Qian Wu, Jordan Gauthier","doi":"10.1200/CCI-25-00148","DOIUrl":"10.1200/CCI-25-00148","url":null,"abstract":"<p><strong>Purpose: </strong>Immune effector cell-associated hematotoxicity (ICAHT) is a major cause of nonrelapse mortality after chimeric antigen receptor (CAR) T-cell therapy. We hypothesized that unsupervised time-series clustering could better identify archetypal patterns of early hematotoxicity compared to the early ICAHT (eICAHT) grading system.</p><p><strong>Methods: </strong>We applied unsupervised k-means time-series clustering based on Euclidean distances to longitudinal absolute neutrophil count (ANC) data from days +0 through +30 post-CAR T-cell infusion in 691 patients treated at our center (training set: n = 483, 70%; test set: n = 208, 30%).</p><p><strong>Results: </strong>Within our training set, we identified an optimal cluster solution based on four ANC recovery clusters, which were labeled as very good, good, poor, and very poor. We trained a random forest (RF) model including the top five most important features (day +3, +4, +5, +26, and +27 ANC values) to predict the cluster assignments. Within our test set, we applied the RF model to predict cluster assignments. Compared with the eICAHT criteria, the RF-predicted clusters were more compact and better separated (Dunn index: 0.078 <i>v</i> 0.034; average silhouette width: 0.12 <i>v</i> 0.010). In addition, the RF model identified patients in the good recovery cluster with intermediate overall survival (hazard ratio [HR], 1.70 [95% CI, 1.05 to 2.74]; <i>P</i> = .029; reference, very good), which was not captured by grade 2 eICAHT (HR, 1.37 [95% CI, 0.80 to 2.35]; <i>P</i> = .25; reference, grade 0-1).</p><p><strong>Conclusion: </strong>Unsupervised time-series clustering identified distinct and clinically relevant patterns of hematotoxicity after CAR T-cell therapy. We trained and tested an RF model that accurately predicted cluster assignments using only five features. Predictions can be generated using our online web application.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500148"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12810859/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145985901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of Real-World Response to First-Line Immunotherapy and Chemotherapy on Subsequent Treatment Outcomes in Patients With Advanced or Metastatic Non-Small Cell Lung Cancer. 一线免疫治疗和化疗的真实世界反应对晚期或转移性非小细胞肺癌患者后续治疗结果的影响
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-21 DOI: 10.1200/CCI-25-00207
Jyoti Malhotra, Shilpa Viswanathan, Shivani K Mhatre, Inderjit K Dhillon, Riddhi Patel, Nicole Yohn, Furaha Kariburyo-Yay, Xinye Li, Biagio Ricciuti

Purpose: This study examined real-world overall survival (rwOS) in patients with advanced or metastatic non-small cell lung cancer (a/mNSCLC) treated with combination immunotherapy (IO) and platinum chemotherapy in first line (1L), followed by second-line or beyond (2L+) non-IO, nonplatinum chemotherapy and explored the association between real-world duration of response (rwDOR) to 1L treatment and rwOS on the 2L+ treatment.

Methods: This study used two US-based data sets: ConcertAI Patient360 NSCLC data set (ConcertAI) and the Flatiron Health Research Database (FHRD), and included adults with a/mNSCLC diagnosed from January 1, 2018, to March 31, 2023 (data cutoff: March 31, 2024). Kaplan-Meier and multivariate Cox regression analyses estimated rwOS for the index regimen by rwDOR to 1L.

Results: Patients with rwDOR ≤6 (≈60%) v >6 months (≈40%) in 1L were similar across the 596 ConcertAI patients and 1,094 FHRD patients. Across the ConcertAI data set/FHRD, 52.6%/55.7% of patients achieved complete/partial response as real-world best overall response to 1L combination IO and platinum chemotherapy and 17.8%/19.1% had stable disease. The median rwOS on 2L+ treatment was 8.3 v 5.2 months (P = .001; ConcertAI) and 8.3 v 5.1 months (P < .001; FHRD) for patients with 1L rwDOR >6 v ≤6 months. The adjusted hazard ratio for patients with 1L rwDOR >6 v ≤6 months was 0.74 (95% CI, 0.61 to 0.90; P = .002) and 0.76 (95% CI, 0.67 to 0.88; P < .001) in the ConcertAI data set and FHRD, respectively.

Conclusion: Our findings demonstrate that patients with rwDOR ≥6 months on 1L combination IO and platinum chemotherapy exhibit longer rwOS on subsequent treatments. This emphasizes the need for 1L treatments that extend DOR and delay the onset of acquired resistance, which remains an unmet need for approximately 60% of patients who do not achieve a sustained response in clinical practice.

目的:本研究考察了在一线(1L)联合免疫治疗(IO)和铂类化疗后,二线或二线以上(2L+)非IO、非铂类化疗的晚期或转移性非小细胞肺癌(a/mNSCLC)患者的真实总生存期(rwOS),并探讨了1L治疗的真实反应时间(rwDOR)与2L+治疗的rwOS之间的关系。方法:本研究使用了两个基于美国的数据集:ConcertAI Patient360 NSCLC数据集(ConcertAI)和Flatiron健康研究数据库(FHRD),并纳入了2018年1月1日至2023年3月31日诊断为a/mNSCLC的成年人(数据截止日期:2024年3月31日)。Kaplan-Meier和多变量Cox回归分析以rwDOR为1L估计指标方案的rwOS。结果:596例ConcertAI患者和1094例FHRD患者在1L中rwDOR≤6(≈60%)v >6个月(≈40%)的患者相似。在ConcertAI数据集/FHRD中,52.6%/55.7%的患者对1L IO联合铂化疗达到完全/部分缓解,达到真实世界最佳总体缓解,17.8%/19.1%的患者病情稳定。2L+治疗的中位rwOS为8.3 v 5.2个月(P = 0.001; ConcertAI), 1L rwDOR≤6个月的患者中位rwOS为8.3 v 5.1个月(P < 0.001; FHRD)。在ConcertAI数据集和FHRD中,1L rwDOR≤6个月患者的校正危险比分别为0.74 (95% CI, 0.61 ~ 0.90, P = 0.002)和0.76 (95% CI, 0.67 ~ 0.88, P < 0.001)。结论:我们的研究结果表明,rwDOR≥6个月的1L IO联合铂化疗患者在后续治疗中表现出更长的rwOS。这强调了l治疗的必要性,延长DOR和延迟获得性耐药的发生,对于在临床实践中没有实现持续反应的大约60%的患者来说,这仍然是一个未满足的需求。
{"title":"Impact of Real-World Response to First-Line Immunotherapy and Chemotherapy on Subsequent Treatment Outcomes in Patients With Advanced or Metastatic Non-Small Cell Lung Cancer.","authors":"Jyoti Malhotra, Shilpa Viswanathan, Shivani K Mhatre, Inderjit K Dhillon, Riddhi Patel, Nicole Yohn, Furaha Kariburyo-Yay, Xinye Li, Biagio Ricciuti","doi":"10.1200/CCI-25-00207","DOIUrl":"https://doi.org/10.1200/CCI-25-00207","url":null,"abstract":"<p><strong>Purpose: </strong>This study examined real-world overall survival (rwOS) in patients with advanced or metastatic non-small cell lung cancer (a/mNSCLC) treated with combination immunotherapy (IO) and platinum chemotherapy in first line (1L), followed by second-line or beyond (2L+) non-IO, nonplatinum chemotherapy and explored the association between real-world duration of response (rwDOR) to 1L treatment and rwOS on the 2L+ treatment.</p><p><strong>Methods: </strong>This study used two US-based data sets: ConcertAI Patient360 NSCLC data set (ConcertAI) and the Flatiron Health Research Database (FHRD), and included adults with a/mNSCLC diagnosed from January 1, 2018, to March 31, 2023 (data cutoff: March 31, 2024). Kaplan-Meier and multivariate Cox regression analyses estimated rwOS for the index regimen by rwDOR to 1L.</p><p><strong>Results: </strong>Patients with rwDOR ≤6 (≈60%) <i>v</i> >6 months (≈40%) in 1L were similar across the 596 ConcertAI patients and 1,094 FHRD patients. Across the ConcertAI data set/FHRD, 52.6%/55.7% of patients achieved complete/partial response as real-world best overall response to 1L combination IO and platinum chemotherapy and 17.8%/19.1% had stable disease. The median rwOS on 2L+ treatment was 8.3 <i>v</i> 5.2 months (<i>P</i> = .001; ConcertAI) and 8.3 <i>v</i> 5.1 months (<i>P</i> < .001; FHRD) for patients with 1L rwDOR >6 <i>v</i> ≤6 months. The adjusted hazard ratio for patients with 1L rwDOR >6 <i>v</i> ≤6 months was 0.74 (95% CI, 0.61 to 0.90; <i>P</i> = .002) and 0.76 (95% CI, 0.67 to 0.88; <i>P</i> < .001) in the ConcertAI data set and FHRD, respectively.</p><p><strong>Conclusion: </strong>Our findings demonstrate that patients with rwDOR ≥6 months on 1L combination IO and platinum chemotherapy exhibit longer rwOS on subsequent treatments. This emphasizes the need for 1L treatments that extend DOR and delay the onset of acquired resistance, which remains an unmet need for approximately 60% of patients who do not achieve a sustained response in clinical practice.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500207"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146020548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Novel Electronic Health Record-Based Data Commons for Pancreatic Cancer. 新型基于胰腺癌电子健康记录的数据共享
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-09 DOI: 10.1200/CCI-24-00265
Kaleem S Ahmed, Clayton T Marcinak, Muhammad Maisam Ali, Sheriff M Issaka, Yonghe Yan, Gabriel McMahan, Thomas Callaci, Noelle K LoConte, Andrea Shiefelbein, Sharon Weber, Majid Afshar, Matthew M Churpek, Jomol Mathew, Syed Nabeel Zafar

Purpose: Clinical research in pancreatic cancer (PC) has been limited because of a lack of granular data in national data sets. An electronic health record (EHR)-based data set specifically designed for PC has immense potential to advance research. This study describes the creation of an EHR-based data commons for patients with PC.

Methods: We generated an index cohort of adult patients at our institution diagnosed with PC (International Classification of Diseases for Oncology, codes C25.0-25.9) between January 1, 2010, and December 31, 2023. To develop the Pancreatic Cancer Data Commons (PCDC), we linked six data sources: (1) institutional EHR data, (2) cancer-specific data from the North American Association of Central Cancer Registries, (3) surgical outcomes from the National Surgical Quality Improvement Program, (4) community-level data from the American Community Survey, (5) national mortality data from Obituary.com, and (6) genomic data from the cBioPortal for Cancer Genomics. We evaluated the feasibility of using the Observational Medical Outcomes Partnership common data model. The data set is stored on a cloud-based, Health Insurance Portability and Accountability Act-secure, and National Institute of Standards and Technology-compliant server.

Results: The PCDC currently includes data of 3,542 unique patients. The mean age at diagnosis is 66.6 ± 11.7 years; 53.3% is male, and 92.2% is White. Linkage to six national data sets increased the completeness of cancer-specific data from 31.3% to 71.6%. Most patients presented at stage IV (43.6%), followed by stage I (22.6%). As of the latest update, 1,074 (30.3%) patients were still alive.

Conclusion: The PCDC is a centralized resource that solves a gap in PC research. The ability to securely link and analyze protected patient data is a strategic step toward enhancing clinical research and optimizing care for patients with PC. Our future work includes expanding the PCDC to multiple centers using common data models.

目的:由于缺乏国家数据集的颗粒数据,胰腺癌(PC)的临床研究受到限制。专门为个人电脑设计的基于电子健康记录(EHR)的数据集具有推进研究的巨大潜力。本研究描述了为PC患者创建基于ehr的数据共享。方法:我们对2010年1月1日至2023年12月31日在我院诊断为PC(国际肿瘤疾病分类,代码C25.0-25.9)的成年患者进行了索引队列研究。为了开发胰腺癌数据共享(PCDC),我们链接了六个数据源:(1)机构电子病历数据,(2)来自北美中央癌症登记处协会的癌症特异性数据,(3)来自国家手术质量改进计划的手术结果,(4)来自美国社区调查的社区水平数据,(5)来自Obituary.com的全国死亡率数据,(6)来自癌症基因组学cBioPortal的基因组数据。我们评估了使用观察性医疗结果伙伴关系通用数据模型的可行性。数据集存储在基于云的、健康保险可移植性和责任法案安全的、符合美国国家标准与技术研究所标准的服务器上。结果:PCDC目前包括3,542例独特患者的数据。平均诊断年龄66.6±11.7岁;男性占53.3%,白人占92.2%。与六个国家数据集的联系将癌症特异性数据的完整性从31.3%提高到71.6%。大多数患者出现在IV期(43.6%),其次是I期(22.6%)。截至最新数据,1074名(30.3%)患者仍然活着。结论:PCDC是解决PC研究空白的集中资源。安全链接和分析受保护的患者数据的能力是加强临床研究和优化PC患者护理的战略步骤。我们未来的工作包括使用通用数据模型将PCDC扩展到多个中心。
{"title":"Novel Electronic Health Record-Based Data Commons for Pancreatic Cancer.","authors":"Kaleem S Ahmed, Clayton T Marcinak, Muhammad Maisam Ali, Sheriff M Issaka, Yonghe Yan, Gabriel McMahan, Thomas Callaci, Noelle K LoConte, Andrea Shiefelbein, Sharon Weber, Majid Afshar, Matthew M Churpek, Jomol Mathew, Syed Nabeel Zafar","doi":"10.1200/CCI-24-00265","DOIUrl":"10.1200/CCI-24-00265","url":null,"abstract":"<p><strong>Purpose: </strong>Clinical research in pancreatic cancer (PC) has been limited because of a lack of granular data in national data sets. An electronic health record (EHR)-based data set specifically designed for PC has immense potential to advance research. This study describes the creation of an EHR-based data commons for patients with PC.</p><p><strong>Methods: </strong>We generated an index cohort of adult patients at our institution diagnosed with PC (International Classification of Diseases for Oncology, codes C25.0-25.9) between January 1, 2010, and December 31, 2023. To develop the Pancreatic Cancer Data Commons (PCDC), we linked six data sources: (1) institutional EHR data, (2) cancer-specific data from the North American Association of Central Cancer Registries, (3) surgical outcomes from the National Surgical Quality Improvement Program, (4) community-level data from the American Community Survey, (5) national mortality data from Obituary.com, and (6) genomic data from the cBioPortal for Cancer Genomics. We evaluated the feasibility of using the Observational Medical Outcomes Partnership common data model. The data set is stored on a cloud-based, Health Insurance Portability and Accountability Act-secure, and National Institute of Standards and Technology-compliant server.</p><p><strong>Results: </strong>The PCDC currently includes data of 3,542 unique patients. The mean age at diagnosis is 66.6 ± 11.7 years; 53.3% is male, and 92.2% is White. Linkage to six national data sets increased the completeness of cancer-specific data from 31.3% to 71.6%. Most patients presented at stage IV (43.6%), followed by stage I (22.6%). As of the latest update, 1,074 (30.3%) patients were still alive.</p><p><strong>Conclusion: </strong>The PCDC is a centralized resource that solves a gap in PC research. The ability to securely link and analyze protected patient data is a strategic step toward enhancing clinical research and optimizing care for patients with PC. Our future work includes expanding the PCDC to multiple centers using common data models.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2400265"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12795310/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145946747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PREPARE ALL: An Artificial Intelligence Tool for Predicting Relapse in Children With Acute Lymphoblastic Leukemia. PREPARE ALL:预测急性淋巴细胞白血病儿童复发的人工智能工具。
IF 2.8 Q2 ONCOLOGY Pub Date : 2026-01-01 Epub Date: 2026-01-21 DOI: 10.1200/CCI-25-00222
Subikksha Saravanan, Raghunathan Rengaswamy, Gaurav Narula, Sameer Bakhshi, Rachna Seth, Nandana Das, Manash Pratim Gogoi, Shripad Banavali, Prasanth Srinivasan, Gargi Das, T K Balaji, Shekar Krishnan, Vaskar Saha, Vijayalakshmi Ramshankar, Venkatraman Radhakrishnan

Purpose: The Pediatric Relapse Prediction and Risk Evaluation for Acute Lymphoblastic Leukemia (PREPARE-ALL) tool aims to predict relapse in pediatric ALL by integrating clinical expertise with artificial intelligence and machine learning (ML), particularly Extreme Gradient Boosting (XGBoost). PREPARE-ALL demonstrates that multicenter, protocol-driven clinical and laboratory data can be used through ML to generate reproducible relapse predictions with greater sensitivity than individual clinician assessments.

Methods: PREPARE-ALL was developed using data from the ICiCLe ALL-14 pretrial cohort across five centers, incorporating 33 clinical and laboratory features.

Results: Among 2,252 patients enrolled in the study, 565 (25.1%) relapsed. Using an 80:20 train-test split, XGBoost achieved a sensitivity of 68.5% (245/447 relapses detected). Additional metrics included a positive predictive value of 31.3%, a negative predictive value of 82.8%, an accuracy of 54.8%, and a specificity of 50.3%. Key predictors of relapse included high hyperdiploidy and BCR-ABL1 fusion positive, positive measurable residual disease status at the end of induction, sex, age, highest presenting WBC, and final risk group. Three clinicians scored the validation data set; the developed model achieved a higher recall (68.5%) compared with clinical judgment (approximately 31%-36%).

Conclusion: PREPARE-ALL identifies twice as many relapses as clinicians and serves as a practical decision-support tool for early relapse triage and treatment planning, enabling timely therapeutic adjustments and improved outcomes in pediatric ALL.

目的:儿科急性淋巴细胞白血病复发预测和风险评估(PREPARE-ALL)工具旨在通过将临床专业知识与人工智能和机器学习(ML),特别是极限梯度增强(XGBoost)相结合,预测儿科ALL的复发。PREPARE-ALL表明,通过ML可以使用多中心、协议驱动的临床和实验室数据来生成可重复的复发预测,其灵敏度高于单个临床医生的评估。方法:PREPARE-ALL是利用来自5个中心的ICiCLe ALL-14试验前队列的数据开发的,包括33个临床和实验室特征。结果:在纳入研究的2252例患者中,565例(25.1%)复发。使用80:20的列车测试分割,XGBoost实现了68.5%的灵敏度(检测到245/447次复发)。其他指标包括阳性预测值31.3%,阴性预测值82.8%,准确率54.8%,特异性50.3%。复发的关键预测因素包括高二倍体和BCR-ABL1融合阳性,诱导结束时可测量的阳性残留疾病状态,性别,年龄,最高呈现WBC和最终危险组。三位临床医生对验证数据集进行评分;与临床判断(约31%-36%)相比,开发的模型实现了更高的召回率(68.5%)。结论:prep -ALL识别的复发率是临床医生的两倍,可作为早期复发分诊和治疗计划的实用决策支持工具,能够及时调整治疗并改善儿科ALL的预后。
{"title":"PREPARE ALL: An Artificial Intelligence Tool for Predicting Relapse in Children With Acute Lymphoblastic Leukemia.","authors":"Subikksha Saravanan, Raghunathan Rengaswamy, Gaurav Narula, Sameer Bakhshi, Rachna Seth, Nandana Das, Manash Pratim Gogoi, Shripad Banavali, Prasanth Srinivasan, Gargi Das, T K Balaji, Shekar Krishnan, Vaskar Saha, Vijayalakshmi Ramshankar, Venkatraman Radhakrishnan","doi":"10.1200/CCI-25-00222","DOIUrl":"10.1200/CCI-25-00222","url":null,"abstract":"<p><strong>Purpose: </strong>The Pediatric Relapse Prediction and Risk Evaluation for Acute Lymphoblastic Leukemia (PREPARE-ALL) tool aims to predict relapse in pediatric ALL by integrating clinical expertise with artificial intelligence and machine learning (ML), particularly Extreme Gradient Boosting (XGBoost). PREPARE-ALL demonstrates that multicenter, protocol-driven clinical and laboratory data can be used through ML to generate reproducible relapse predictions with greater sensitivity than individual clinician assessments.</p><p><strong>Methods: </strong>PREPARE-ALL was developed using data from the ICiCLe ALL-14 pretrial cohort across five centers, incorporating 33 clinical and laboratory features.</p><p><strong>Results: </strong>Among 2,252 patients enrolled in the study, 565 (25.1%) relapsed. Using an 80:20 train-test split, XGBoost achieved a sensitivity of 68.5% (245/447 relapses detected). Additional metrics included a positive predictive value of 31.3%, a negative predictive value of 82.8%, an accuracy of 54.8%, and a specificity of 50.3%. Key predictors of relapse included high hyperdiploidy and BCR-ABL1 fusion positive, positive measurable residual disease status at the end of induction, sex, age, highest presenting WBC, and final risk group. Three clinicians scored the validation data set; the developed model achieved a higher recall (68.5%) compared with clinical judgment (approximately 31%-36%).</p><p><strong>Conclusion: </strong>PREPARE-ALL identifies twice as many relapses as clinicians and serves as a practical decision-support tool for early relapse triage and treatment planning, enabling timely therapeutic adjustments and improved outcomes in pediatric ALL.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500222"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146020517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JCO Clinical Cancer Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1