Purpose: The Pediatric Relapse Prediction and Risk Evaluation for Acute Lymphoblastic Leukemia (PREPARE-ALL) tool aims to predict relapse in pediatric ALL by integrating clinical expertise with artificial intelligence and machine learning (ML), particularly Extreme Gradient Boosting (XGBoost). PREPARE-ALL demonstrates that multicenter, protocol-driven clinical and laboratory data can be used through ML to generate reproducible relapse predictions with greater sensitivity than individual clinician assessments.
Methods: PREPARE-ALL was developed using data from the ICiCLe ALL-14 pretrial cohort across five centers, incorporating 33 clinical and laboratory features.
Results: Among 2,252 patients enrolled in the study, 565 (25.1%) relapsed. Using an 80:20 train-test split, XGBoost achieved a sensitivity of 68.5% (245/447 relapses detected). Additional metrics included a positive predictive value of 31.3%, a negative predictive value of 82.8%, an accuracy of 54.8%, and a specificity of 50.3%. Key predictors of relapse included high hyperdiploidy and BCR-ABL1 fusion positive, positive measurable residual disease status at the end of induction, sex, age, highest presenting WBC, and final risk group. Three clinicians scored the validation data set; the developed model achieved a higher recall (68.5%) compared with clinical judgment (approximately 31%-36%).
Conclusion: PREPARE-ALL identifies twice as many relapses as clinicians and serves as a practical decision-support tool for early relapse triage and treatment planning, enabling timely therapeutic adjustments and improved outcomes in pediatric ALL.
{"title":"PREPARE ALL: An Artificial Intelligence Tool for Predicting Relapse in Children With Acute Lymphoblastic Leukemia.","authors":"Subikksha Saravanan, Raghunathan Rengaswamy, Gaurav Narula, Sameer Bakhshi, Rachna Seth, Nandana Das, Manash Pratim Gogoi, Shripad Banavali, Prasanth Srinivasan, Gargi Das, T K Balaji, Shekar Krishnan, Vaskar Saha, Vijayalakshmi Ramshankar, Venkatraman Radhakrishnan","doi":"10.1200/CCI-25-00222","DOIUrl":"10.1200/CCI-25-00222","url":null,"abstract":"<p><strong>Purpose: </strong>The Pediatric Relapse Prediction and Risk Evaluation for Acute Lymphoblastic Leukemia (PREPARE-ALL) tool aims to predict relapse in pediatric ALL by integrating clinical expertise with artificial intelligence and machine learning (ML), particularly Extreme Gradient Boosting (XGBoost). PREPARE-ALL demonstrates that multicenter, protocol-driven clinical and laboratory data can be used through ML to generate reproducible relapse predictions with greater sensitivity than individual clinician assessments.</p><p><strong>Methods: </strong>PREPARE-ALL was developed using data from the ICiCLe ALL-14 pretrial cohort across five centers, incorporating 33 clinical and laboratory features.</p><p><strong>Results: </strong>Among 2,252 patients enrolled in the study, 565 (25.1%) relapsed. Using an 80:20 train-test split, XGBoost achieved a sensitivity of 68.5% (245/447 relapses detected). Additional metrics included a positive predictive value of 31.3%, a negative predictive value of 82.8%, an accuracy of 54.8%, and a specificity of 50.3%. Key predictors of relapse included high hyperdiploidy and BCR-ABL1 fusion positive, positive measurable residual disease status at the end of induction, sex, age, highest presenting WBC, and final risk group. Three clinicians scored the validation data set; the developed model achieved a higher recall (68.5%) compared with clinical judgment (approximately 31%-36%).</p><p><strong>Conclusion: </strong>PREPARE-ALL identifies twice as many relapses as clinicians and serves as a practical decision-support tool for early relapse triage and treatment planning, enabling timely therapeutic adjustments and improved outcomes in pediatric ALL.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500222"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146020517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-01Epub Date: 2026-01-28DOI: 10.1200/CCI-25-00244
Nesrine Lajmi, Mehul Patel, Gareth Obery, Archana Dorge, Ashish Sharma, Jack Halligan, Ernest Lo
Purpose: A core clinical task is to synthesize fragmented patient data into a coherent summary to support decision making. However, electronic health record (EHR) inefficiencies burden clinicians and contribute to their cognitive overload and burnout. This study evaluated the impact of a large language model (LLM)-enabled clinical decision support (LLM-CDS) platform compared with a simulated EHR (SimEPR) on workflow efficiency and user experience in generating accurate clinical summaries during tumor board preparation and explored its applicability to consultation preparation, referrals, treatment planning, and patient communication.
Methods: In a remote, within-participant simulation, 26 oncologists from the United Kingdom, United States, Spain, and Singapore reviewed synthetic breast cancer cases and created comprehensive summaries for tumor board discussions using both LLM-CDS and SimEPR. LLM-CDS provided editable LLM-generated summaries; SimEPR required manual composition. Time to task completion was recorded. An independent reviewer assessed summary quality based on completeness, correctness, and conciseness. Participants also completed surveys on usability, cognitive load, and feature acceptability.
Results: LLM-CDS significantly reduced the summary completion time compared with SimEPR (6:55 v 8:47 minutes; P < .001). Summary completeness was rated higher with LLM-CDS (mean score, 3.93 v 3.13), whereas correctness and conciseness were similar. Overall, 87% of participants would recommend LLM-CDS and 96% would anticipate time savings. The system usability scale score for LLM-CDS was 65.7. Although perceived cognitive load was lower with LLM-CDS, the difference was not statistically significant. The LLM summary was the most valued, with 92% finding it useful for the tumor board and consultation preparation.
Conclusion: The LLM-CDS platform improved the efficiency and completeness of clinical summarization. Strong user acceptance and anticipated time savings underscore the potential for streamlining a range of oncology workflows.
目的:一项核心临床任务是将零散的患者数据综合成连贯的摘要,以支持决策。然而,电子健康记录(EHR)效率低下给临床医生带来负担,并导致他们的认知超载和倦怠。本研究评估了大型语言模型(LLM)支持的临床决策支持(LLM- cds)平台与模拟EHR (SimEPR)在肿瘤板制备过程中生成准确临床摘要的工作流程效率和用户体验方面的影响,并探讨了其在会诊准备、转诊、治疗计划和患者沟通方面的适用性。方法:在远程参与者模拟中,来自英国、美国、西班牙和新加坡的26名肿瘤学家回顾了合成乳腺癌病例,并使用LLM-CDS和SimEPR为肿瘤委员会讨论创建了综合摘要。LLM-CDS提供可编辑的llm生成的摘要;SimEPR需要手工合成。记录完成任务的时间。一个独立的审稿人根据完整性、正确性和简洁性评估摘要的质量。参与者还完成了关于可用性、认知负荷和功能可接受性的调查。结果:与SimEPR相比,LLM-CDS显著缩短了汇总完成时间(6:55 vs 8:47分钟;P < 0.001)。LLM-CDS的总结完整性评分更高(平均评分3.93 v 3.13),而正确性和简洁性相似。总体而言,87%的参与者会推荐LLM-CDS, 96%的人预计会节省时间。LLM-CDS的系统可用性量表得分为65.7分。虽然LLM-CDS组的认知负荷较低,但差异无统计学意义。法学硕士总结是最有价值的,92%的人认为它对肿瘤板和会诊准备有用。结论:LLM-CDS平台提高了临床总结的效率和完整性。强大的用户接受度和预期的时间节省强调了简化肿瘤工作流程的潜力。
{"title":"Simulation-Based Evaluation of a Large Language Model-Enabled Clinical Decision Support Platform in Oncology.","authors":"Nesrine Lajmi, Mehul Patel, Gareth Obery, Archana Dorge, Ashish Sharma, Jack Halligan, Ernest Lo","doi":"10.1200/CCI-25-00244","DOIUrl":"https://doi.org/10.1200/CCI-25-00244","url":null,"abstract":"<p><strong>Purpose: </strong>A core clinical task is to synthesize fragmented patient data into a coherent summary to support decision making. However, electronic health record (EHR) inefficiencies burden clinicians and contribute to their cognitive overload and burnout. This study evaluated the impact of a large language model (LLM)-enabled clinical decision support (LLM-CDS) platform compared with a simulated EHR (SimEPR) on workflow efficiency and user experience in generating accurate clinical summaries during tumor board preparation and explored its applicability to consultation preparation, referrals, treatment planning, and patient communication.</p><p><strong>Methods: </strong>In a remote, within-participant simulation, 26 oncologists from the United Kingdom, United States, Spain, and Singapore reviewed synthetic breast cancer cases and created comprehensive summaries for tumor board discussions using both LLM-CDS and SimEPR. LLM-CDS provided editable LLM-generated summaries; SimEPR required manual composition. Time to task completion was recorded. An independent reviewer assessed summary quality based on completeness, correctness, and conciseness. Participants also completed surveys on usability, cognitive load, and feature acceptability.</p><p><strong>Results: </strong>LLM-CDS significantly reduced the summary completion time compared with SimEPR (6:55 <i>v</i> 8:47 minutes; <i>P</i> < .001). Summary completeness was rated higher with LLM-CDS (mean score, 3.93 <i>v</i> 3.13), whereas correctness and conciseness were similar. Overall, 87% of participants would recommend LLM-CDS and 96% would anticipate time savings. The system usability scale score for LLM-CDS was 65.7. Although perceived cognitive load was lower with LLM-CDS, the difference was not statistically significant. The LLM summary was the most valued, with 92% finding it useful for the tumor board and consultation preparation.</p><p><strong>Conclusion: </strong>The LLM-CDS platform improved the efficiency and completeness of clinical summarization. Strong user acceptance and anticipated time savings underscore the potential for streamlining a range of oncology workflows.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500244"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12863596/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146114927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-01Epub Date: 2026-01-08DOI: 10.1200/CCI-25-00221
Raúl Marín, Ania Alay, Maria Ajenjo-Bauza, Sara Hijazo-Pechero, Carla Montironi, Eva Hernández-Illán, Pedro Jares, Daniel Azuara, Mar Varela, August Vidal, Joan Anton Puig-Butillé, Conxi Lázaro, Víctor Moreno, David Cordero, Ernest Nadal, Xavier Solé
Purpose: Somatic next-generation sequencing (NGS) panels are widely used in precision oncology to detect clinically actionable genomic alterations. However, interpreting diverse DNA and RNA alterations remains challenging because of the complexity of tumor-only data and the limitations of current pipelines, which are often proprietary, noncustomizable, or lack visual reports to support clinical interpretation. We present ClinBioNGS, an open-source, panel-agnostic bioinformatics pipeline for the comprehensive analysis of somatic NGS cancer panels in both clinical and translational settings.
Materials and methods: ClinBioNGS is a modular, fully containerized workflow implemented in Nextflow. It supports integrated analysis of DNA and RNA data, including multicaller small variant detection, copy number alteration (CNA) profiling, gene fusion and splice variant identification, and evaluation of complex genomic biomarkers such as tumor mutational burden and microsatellite instability. Variants are annotated and prioritized using established clinical frameworks. The results are compiled in a self-contained interactive HTML report with dynamic tables and informative visualizations to facilitate clinical interpretation. Validation included SEQC2 reference data sets across six commercial panels, and benchmarking was performed on 2,024 clinical tumor samples analyzed with three commercial platforms.
Results: ClinBioNGS achieved high accuracy in SEQC2 validation, with precision (0.987-1.000), recall (0.920-0.997), and F1 scores (0.956-0.999) across diverse panels. In a clinical benchmarking with real-world data, the pipeline demonstrated high concordance with commercial solutions for small variants (97%), CNAs (89%), and RNA alterations (94%), while also identifying additional high-confidence alterations missed by vendor pipelines.
Conclusion: ClinBioNGS provides a robust, flexible, and transparent solution for standardized analysis of somatic NGS cancer panels. It supports reproducible, clinically oriented interpretation of genomic data and is freely available for noncommercial research-use only at GitHub.
{"title":"ClinBioNGS: A Clinical Bioinformatics Pipeline for Integrated Analysis of Somatic Next-Generation Sequencing Cancer Panels.","authors":"Raúl Marín, Ania Alay, Maria Ajenjo-Bauza, Sara Hijazo-Pechero, Carla Montironi, Eva Hernández-Illán, Pedro Jares, Daniel Azuara, Mar Varela, August Vidal, Joan Anton Puig-Butillé, Conxi Lázaro, Víctor Moreno, David Cordero, Ernest Nadal, Xavier Solé","doi":"10.1200/CCI-25-00221","DOIUrl":"https://doi.org/10.1200/CCI-25-00221","url":null,"abstract":"<p><strong>Purpose: </strong>Somatic next-generation sequencing (NGS) panels are widely used in precision oncology to detect clinically actionable genomic alterations. However, interpreting diverse DNA and RNA alterations remains challenging because of the complexity of tumor-only data and the limitations of current pipelines, which are often proprietary, noncustomizable, or lack visual reports to support clinical interpretation. We present ClinBioNGS, an open-source, panel-agnostic bioinformatics pipeline for the comprehensive analysis of somatic NGS cancer panels in both clinical and translational settings.</p><p><strong>Materials and methods: </strong>ClinBioNGS is a modular, fully containerized workflow implemented in Nextflow. It supports integrated analysis of DNA and RNA data, including multicaller small variant detection, copy number alteration (CNA) profiling, gene fusion and splice variant identification, and evaluation of complex genomic biomarkers such as tumor mutational burden and microsatellite instability. Variants are annotated and prioritized using established clinical frameworks. The results are compiled in a self-contained interactive HTML report with dynamic tables and informative visualizations to facilitate clinical interpretation. Validation included SEQC2 reference data sets across six commercial panels, and benchmarking was performed on 2,024 clinical tumor samples analyzed with three commercial platforms.</p><p><strong>Results: </strong>ClinBioNGS achieved high accuracy in SEQC2 validation, with precision (0.987-1.000), recall (0.920-0.997), and F1 scores (0.956-0.999) across diverse panels. In a clinical benchmarking with real-world data, the pipeline demonstrated high concordance with commercial solutions for small variants (97%), CNAs (89%), and RNA alterations (94%), while also identifying additional high-confidence alterations missed by vendor pipelines.</p><p><strong>Conclusion: </strong>ClinBioNGS provides a robust, flexible, and transparent solution for standardized analysis of somatic NGS cancer panels. It supports reproducible, clinically oriented interpretation of genomic data and is freely available for noncommercial research-use only at GitHub.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500221"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145935982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-01Epub Date: 2026-01-23DOI: 10.1200/CCI-25-00240
Samhita Pamidimarri Naga, Peter H J Slootbeek, Sofie H Tolmeijer, Christian Gillissen, Marjolijn J L Ligtenberg, Niven Mehra, Richarda M de Voer
Purpose: Shallow whole-genome sequencing (sWGS) is a cost-effective approach for detecting genome wide copy number profiles in tumor samples. In metastatic castration-resistant prostate cancer (mCRPC), recognizing homologous recombination deficiency (HRD) and tandem duplication (TD) genomic profiles may contribute to improved treatment choices such as poly (ADP-ribose) polymerase inhibitors. This study aims to determine the minimum sequencing depth and tumor content (TC) required to accurately identify these clinically significant genomic profiles using sWGS.
Materials and methods: Whole-genome sequencing (WGS) data from 168 tumor and matched normal biopsies from 155 patients with mCRPC were mixed in silico to generate a set of 3,360 mixtures with varying TCs (original, 20%, 10%, 5%, 3%) and sequencing depths (original, 5×, 2×, 1×, 0.1×). Copy number variations (CNVs) were analyzed using ichorCNA and WisecondorX at different window sizes.
Results: An average sequencing depth of 1× at 20% TC was found to be sufficient to detect CNVs with high sensitivity (>0.85) and high specificity (>0.95). For HRD and TD profile detection, ichorCNA at a 50 Kb window size was optimal and a reliable detection of HRD profiles was achieved with a very strong correlation of R = 0.88 (P < 2.2e-16). Detection of TD profiles also remained accurate at these parameters with a strong correlation of R = 0.72 (P < 2.2e-16), although the median length of duplication events increased at lower depths. TC estimation by ichorCNA strongly correlated with full-depth WGS of diploid genomes.
Conclusion: In this study, through in silico simulations of WGS data, we demonstrate that the genomic scars of two druggable genomic profiles, HRD and TD, can be reliably detected in mCRPC with 1× average sequencing depth and ≥20% TC. Further research is required to correlate these markers with outcome of specific treatments using sWGS.
{"title":"Assessing the Detection Power of Genome-Wide Copy Number Variation Profiles in Prostate Cancer Using Simulated Shallow Whole-Genome Sequencing Data.","authors":"Samhita Pamidimarri Naga, Peter H J Slootbeek, Sofie H Tolmeijer, Christian Gillissen, Marjolijn J L Ligtenberg, Niven Mehra, Richarda M de Voer","doi":"10.1200/CCI-25-00240","DOIUrl":"10.1200/CCI-25-00240","url":null,"abstract":"<p><strong>Purpose: </strong>Shallow whole-genome sequencing (sWGS) is a cost-effective approach for detecting genome wide copy number profiles in tumor samples. In metastatic castration-resistant prostate cancer (mCRPC), recognizing homologous recombination deficiency (HRD) and tandem duplication (TD) genomic profiles may contribute to improved treatment choices such as poly (ADP-ribose) polymerase inhibitors. This study aims to determine the minimum sequencing depth and tumor content (TC) required to accurately identify these clinically significant genomic profiles using sWGS.</p><p><strong>Materials and methods: </strong>Whole-genome sequencing (WGS) data from 168 tumor and matched normal biopsies from 155 patients with mCRPC were mixed in silico to generate a set of 3,360 mixtures with varying TCs (original, 20%, 10%, 5%, 3%) and sequencing depths (original, 5×, 2×, 1×, 0.1×). Copy number variations (CNVs) were analyzed using ichorCNA and WisecondorX at different window sizes.</p><p><strong>Results: </strong>An average sequencing depth of 1× at 20% TC was found to be sufficient to detect CNVs with high sensitivity (>0.85) and high specificity (>0.95). For HRD and TD profile detection, ichorCNA at a 50 Kb window size was optimal and a reliable detection of HRD profiles was achieved with a very strong correlation of R = 0.88 (<i>P</i> < 2.2e-16). Detection of TD profiles also remained accurate at these parameters with a strong correlation of R = 0.72 (<i>P</i> < 2.2e-16), although the median length of duplication events increased at lower depths. TC estimation by ichorCNA strongly correlated with full-depth WGS of diploid genomes.</p><p><strong>Conclusion: </strong>In this study, through in silico simulations of WGS data, we demonstrate that the genomic scars of two druggable genomic profiles, HRD and TD, can be reliably detected in mCRPC with 1× average sequencing depth and ≥20% TC. Further research is required to correlate these markers with outcome of specific treatments using sWGS.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"10 ","pages":"e2500240"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12834301/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146042138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Purpose: To evaluate the appropriateness of a cure model when analyzing right-censored end points of a randomized clinical trial (RCT) in malignancy in the presence of long-term survivors. We aim to derive how the ratio estimation of censored cured subjects (RECeUS), previously proposed for a homogeneous population, could be extended for use in RCTs.
Methods: Based on the RECeUS method, four decision rules were considered to assess the appropriateness of a cure model. They considered the eligibility conditions to be met: in both arms, in at least one randomized arm, in the entire sample, or when only considering an average of the conditions, respectively. A simulation study was performed to evaluate their performance and the impact of the link function when considering the appropriateness of cure models. We also illustrate the method using two real data examples from two RCTs conducted in patients with acute leukemia and COVID-19 disease.
Results: Simulation results show that the best decision rule that can be applied in all considered treatment effect scenarios might be to check the criteria in at least one randomized arm. Regardless of the rules, the cure model appeared to be appropriate in both RCT data.
Conclusion: When analyzing survival data from RCTs, the appropriateness of a cure model could be considered in the face of a plateau shape of the survival curves. To ensure that the presence of such a plateau in the survival curves is a reliable indicator of the presence of cured patients in the population, the RECeUS method should be used in each randomized arm separately, with criteria met in at least one randomized arm.
{"title":"Detecting the Cure Model Appropriateness in Randomized Clinical Trials With Long-Term Survivors.","authors":"Cheryl Kouadio, Subodh Selukar, Megan Othus, Sylvie Chevret","doi":"10.1200/CCI-25-00084","DOIUrl":"https://doi.org/10.1200/CCI-25-00084","url":null,"abstract":"<p><strong>Purpose: </strong>To evaluate the appropriateness of a cure model when analyzing right-censored end points of a randomized clinical trial (RCT) in malignancy in the presence of long-term survivors. We aim to derive how the ratio estimation of censored cured subjects (RECeUS), previously proposed for a homogeneous population, could be extended for use in RCTs.</p><p><strong>Methods: </strong>Based on the RECeUS method, four decision rules were considered to assess the appropriateness of a cure model. They considered the eligibility conditions to be met: in both arms, in at least one randomized arm, in the entire sample, or when only considering an average of the conditions, respectively. A simulation study was performed to evaluate their performance and the impact of the link function when considering the appropriateness of cure models. We also illustrate the method using two real data examples from two RCTs conducted in patients with acute leukemia and COVID-19 disease.</p><p><strong>Results: </strong>Simulation results show that the best decision rule that can be applied in all considered treatment effect scenarios might be to check the criteria in at least one randomized arm. Regardless of the rules, the cure model appeared to be appropriate in both RCT data.</p><p><strong>Conclusion: </strong>When analyzing survival data from RCTs, the appropriateness of a cure model could be considered in the face of a plateau shape of the survival curves. To ensure that the presence of such a plateau in the survival curves is a reliable indicator of the presence of cured patients in the population, the RECeUS method should be used in each randomized arm separately, with criteria met in at least one randomized arm.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500084"},"PeriodicalIF":2.8,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145764401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yiming Xue, Yunzheng Zhu, Luoting Zhuang, YongKyung Oh, Ricky Taira, Denise R Aberle, Ashley Elizabeth Prosper, William Hsu, Yannan Lin
Purpose: Tobacco use is a major risk factor for diseases such as cancer. Granular quantitative details of smoking (eg, pack years and years since quitting) are essential for assessing disease risk and determining eligibility for lung cancer screening (LCS). However, existing natural language processing (NLP) tools struggle to extract detailed quantitative smoking data from clinical narratives.
Methods: We cross-validated four pretrained Bidirectional Encoder Representations from Transformers (BERT)-based models-BERT, BioBERT, ClinicalBERT, and MedBERT-by fine-tuning them on 90% of 3,261 sentences mentioning smoking history to extract six quantitative smoking history variables from clinical narratives. The model with the highest cross-validated micro-averaged F1 scores across most variables was selected as the final SmokeBERT model and was further fine-tuned on the 90% training data. Model performance was evaluated on a 10% holdout test set and an external validation set containing 3,191 sentences.
Results: ClinicalBERT was selected as the final model based on cross-validation and was fine-tuned on the training data to create the SmokeBERT model. Compared with the state-of-the-art rule-based NLP model and the Generative Pre-trained Transformer Open Source Series 20 billion parameter model, SmokeBERT demonstrated superior performance in smoking data extraction (overall F1 score, holdout test: 0.97 v 0.88-0.90; external validation: 0.86 v 0.72-0.79) and in identifying LCS-eligible patients (97% v 59%-97% for ≥20 pack-years and 100% v 60%-84% for ≤15 years since quitting).
Conclusion: We developed SmokeBERT, a fine-tuned BERT-based model optimized for extracting detailed quantitative smoking histories. Future work includes evaluating performance on larger clinical data sets and developing a multilingual, language-agnostic version of SmokeBERT.
目的:烟草使用是癌症等疾病的一个主要危险因素。吸烟的细粒度定量细节(例如,吸烟年数和戒烟后的年数)对于评估疾病风险和确定肺癌筛查(LCS)的资格至关重要。然而,现有的自然语言处理(NLP)工具难以从临床叙述中提取详细的定量吸烟数据。方法:我们交叉验证了基于变形金刚(BERT)模型的四种预训练双向编码器表示——BERT、BioBERT、ClinicalBERT和medbert——通过对3261个提到吸烟史的句子中的90%进行微调,从临床叙述中提取出6个定量吸烟史变量。在大多数变量中交叉验证的微平均F1得分最高的模型被选为最终的SmokeBERT模型,并在90%的训练数据上进一步微调。模型的性能在10%的保留测试集和包含3191个句子的外部验证集上进行评估。结果:在交叉验证的基础上,ClinicalBERT被选择为最终模型,并在训练数据的基础上进行微调,建立了SmokeBERT模型。与最先进的基于规则的NLP模型和生成式预训练变压器开源系列200亿参数模型相比,SmokeBERT在吸烟数据提取(总体F1评分,坚持测试:0.97 v 0.88-0.90;外部验证:0.86 v 0.72-0.79)和识别lcs合格患者(≥20包年为97% v 59%-97%,戒烟≤15年为100% v 60%-84%)方面表现出更优越的性能。结论:我们开发了一个微调的基于bert的模型,用于提取详细的定量吸烟史。未来的工作包括评估在更大的临床数据集上的表现,以及开发一个多语言、语言无关的smoke - bert版本。
{"title":"SmokeBERT: A Bidirectional Encoder Representations From Transformers-Based Model for Quantitative Smoking History Extraction From Clinical Narratives to Improve Lung Cancer Screening.","authors":"Yiming Xue, Yunzheng Zhu, Luoting Zhuang, YongKyung Oh, Ricky Taira, Denise R Aberle, Ashley Elizabeth Prosper, William Hsu, Yannan Lin","doi":"10.1200/CCI-25-00223","DOIUrl":"10.1200/CCI-25-00223","url":null,"abstract":"<p><strong>Purpose: </strong>Tobacco use is a major risk factor for diseases such as cancer. Granular quantitative details of smoking (eg, pack years and years since quitting) are essential for assessing disease risk and determining eligibility for lung cancer screening (LCS). However, existing natural language processing (NLP) tools struggle to extract detailed quantitative smoking data from clinical narratives.</p><p><strong>Methods: </strong>We cross-validated four pretrained Bidirectional Encoder Representations from Transformers (BERT)-based models-BERT, BioBERT, ClinicalBERT, and MedBERT-by fine-tuning them on 90% of 3,261 sentences mentioning smoking history to extract six quantitative smoking history variables from clinical narratives. The model with the highest cross-validated micro-averaged F1 scores across most variables was selected as the final SmokeBERT model and was further fine-tuned on the 90% training data. Model performance was evaluated on a 10% holdout test set and an external validation set containing 3,191 sentences.</p><p><strong>Results: </strong>ClinicalBERT was selected as the final model based on cross-validation and was fine-tuned on the training data to create the SmokeBERT model. Compared with the state-of-the-art rule-based NLP model and the Generative Pre-trained Transformer Open Source Series 20 billion parameter model, SmokeBERT demonstrated superior performance in smoking data extraction (overall F1 score, holdout test: 0.97 <i>v</i> 0.88-0.90; external validation: 0.86 <i>v</i> 0.72-0.79) and in identifying LCS-eligible patients (97% <i>v</i> 59%-97% for ≥20 pack-years and 100% <i>v</i> 60%-84% for ≤15 years since quitting).</p><p><strong>Conclusion: </strong>We developed SmokeBERT, a fine-tuned BERT-based model optimized for extracting detailed quantitative smoking histories. Future work includes evaluating performance on larger clinical data sets and developing a multilingual, language-agnostic version of SmokeBERT.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500223"},"PeriodicalIF":2.8,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145656443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-01Epub Date: 2025-12-05DOI: 10.1200/CCI-25-00132
David N Karp, Khaldoun Hamade, Christopher M McNair, Amy E Leader
Purpose: Cancer centers and health systems are tasked with deciding where to deploy community interventions to reduce the burden of cancer within their catchment areas. Few methods exist to prioritize communities in a systematic manner, considering features of individuals, populations, systems, and policies. We developed a geographically informed index to prioritize census tracts based on community need, with an initial focus on identifying communities in need of breast cancer screening (BCS) interventions.
Methods: This study used publicly available data to select variables known to be associated with disparities in BCS rates. Variables were identified from five categories: economic stability, education access and quality, neighborhood and built environment, social and community context, and health status and health care access and quality. Data were analyzed at the census tract level across the Sidney Kimmel Comprehensive Cancer Center catchment (N = 1,216). Principal component analysis was applied to 23 variables, and five principal components were selected to construct a composite measure using a weighted sum. The resulting index values were used to stratify the data set for further analysis and mapped for visualization.
Results: The analysis produced the Community Need Priority Index (CNPI)-BCS, with values ranging from 0 to 1 (mean, 0.259; standard deviation [SD], 0.161). The top quintile (Q5, n = 243) represented the highest-need communities. Q5 tracts were primarily concentrated in Philadelphia, Camden, and Delaware counties. Philadelphia County had the highest average (mean, 0.364; SD, 1.78) and the most tracts in the top quintile (45%, n = 175). Montgomery county had the lowest average (mean, 0.169; SD, 0.092).
Conclusion: This novel methodological approach considered the complex nature of multiple, intersectional barriers to good health to identify priority areas of need within cancer center catchment areas.
目的:癌症中心和卫生系统的任务是决定在何处部署社区干预措施,以减轻其集水区内的癌症负担。考虑到个人、群体、系统和政策的特点,很少有方法以系统的方式对社区进行优先排序。我们开发了一个地理信息指数,根据社区需求对人口普查区进行优先排序,最初的重点是确定需要乳腺癌筛查(BCS)干预的社区。方法:本研究使用公开可用的数据来选择已知与BCS发病率差异相关的变量。变量从五个类别中确定:经济稳定性、教育机会和质量、邻里和建成环境、社会和社区背景、健康状况和卫生保健机会和质量。数据在Sidney Kimmel综合癌症中心集水区的人口普查区水平上进行分析(N = 1,216)。对23个变量进行主成分分析,选取5个主成分,采用加权和构建复合测度。所得的指标值用于对数据集进行分层,以便进一步分析,并将其映射为可视化。结果:分析产生了社区需求优先指数(CNPI)-BCS,其值范围为0到1(平均值0.259;标准差[SD], 0.161)。前五分之一(Q5, n = 243)代表需求最高的社区。Q5主要集中在费城、卡姆登和特拉华州。费城县的平均值最高(平均值0.364;标准差1.78),前五分位数的土地最多(45%,n = 175)。蒙哥马利县的平均值最低(平均值0.169;标准差0.092)。结论:这种新颖的方法方法考虑了多种交叉的健康障碍的复杂性,以确定癌症中心集水区内的优先需求领域。
{"title":"Development of a Composite Measure to Identify Priority Areas of Need for Cancer Screening Interventions.","authors":"David N Karp, Khaldoun Hamade, Christopher M McNair, Amy E Leader","doi":"10.1200/CCI-25-00132","DOIUrl":"10.1200/CCI-25-00132","url":null,"abstract":"<p><strong>Purpose: </strong>Cancer centers and health systems are tasked with deciding where to deploy community interventions to reduce the burden of cancer within their catchment areas. Few methods exist to prioritize communities in a systematic manner, considering features of individuals, populations, systems, and policies. We developed a geographically informed index to prioritize census tracts based on community need, with an initial focus on identifying communities in need of breast cancer screening (BCS) interventions.</p><p><strong>Methods: </strong>This study used publicly available data to select variables known to be associated with disparities in BCS rates. Variables were identified from five categories: economic stability, education access and quality, neighborhood and built environment, social and community context, and health status and health care access and quality. Data were analyzed at the census tract level across the Sidney Kimmel Comprehensive Cancer Center catchment (N = 1,216). Principal component analysis was applied to 23 variables, and five principal components were selected to construct a composite measure using a weighted sum. The resulting index values were used to stratify the data set for further analysis and mapped for visualization.</p><p><strong>Results: </strong>The analysis produced the Community Need Priority Index (CNPI)-BCS, with values ranging from 0 to 1 (mean, 0.259; standard deviation [SD], 0.161). The top quintile (Q5, n = 243) represented the highest-need communities. Q5 tracts were primarily concentrated in Philadelphia, Camden, and Delaware counties. Philadelphia County had the highest average (mean, 0.364; SD, 1.78) and the most tracts in the top quintile (45%, n = 175). Montgomery county had the lowest average (mean, 0.169; SD, 0.092).</p><p><strong>Conclusion: </strong>This novel methodological approach considered the complex nature of multiple, intersectional barriers to good health to identify priority areas of need within cancer center catchment areas.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500132"},"PeriodicalIF":2.8,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12822900/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145688619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-01Epub Date: 2025-12-22DOI: 10.1200/CCI-25-00297
Mahima Akula, Ryan W Huey, Arthur S Hong
{"title":"Dissonance in the Sole Quality Measure for Outpatient Chemotherapy, OP-35.","authors":"Mahima Akula, Ryan W Huey, Arthur S Hong","doi":"10.1200/CCI-25-00297","DOIUrl":"10.1200/CCI-25-00297","url":null,"abstract":"","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500297"},"PeriodicalIF":2.8,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12724631/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145812123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-01Epub Date: 2025-12-11DOI: 10.1200/CCI-25-00310
Ning Liao, Cheukfai Li, Charles M Balch
{"title":"Reply to: Critical Role of Model Selection in Evaluating AI Performance for Tumor Board Decision Making.","authors":"Ning Liao, Cheukfai Li, Charles M Balch","doi":"10.1200/CCI-25-00310","DOIUrl":"https://doi.org/10.1200/CCI-25-00310","url":null,"abstract":"","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500310"},"PeriodicalIF":2.8,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}