首页 > 最新文献

JCO Clinical Cancer Informatics最新文献

英文 中文
Which Patients With Cancer Access Their Clinical Notes? A Disparities Analysis. 哪些癌症患者可以查阅他们的临床记录?A差异分析。
IF 2.8 Q2 ONCOLOGY Pub Date : 2025-10-01 Epub Date: 2025-10-16 DOI: 10.1200/CCI-24-00254
Susan Chimonas, Charlie White, Kenneth Seier, Fernanda Polubriaginof, Chelsea Michael, Chasity Walters, Allison Lipitz-Snyderman, Gilad Kuperman

Purpose: Access to clinical notes enhances patient engagement and trust, and the 21st Century Cures Act enabled immediate electronic patient access in April 2021. Yet, technological advances may perpetuate disparities, which remain understudied. Understanding whether inequities in note access exist in oncology would highlight challenges around making this foundational health information available to all patients receiving ongoing, complex medical care.

Materials and methods: This study at a high-volume specialty cancer center explored disparities around clinical notes posted to patients' portal accounts from September 1, 2021, to August 31, 2022, and accessed by March 19, 2024. Logistic and Poisson regression were used to identify patient characteristics associated with note access and note opening rates.

Results: The study included 124,554 patients and 815,104 clinical notes, 43.7% of which (356,290) were accessed. Although modest differences in access rates emerged around sex, age, and marital status, larger disparities appeared for ethnicity, race, and language: Black patients (odds ratio [OR], 0.63 [95% CI, 0.60 to 0.66]; P < .001; incidence rate ratio [IRR], 0.74 [95% CI, 0.73 to 0.76]; P < .001), Hispanic patients (OR, 0.85 [95% CI, 0.80 to 0.90]; P < .001; IRR, 0.90 [95% CI, 0.89 to 0.92]; P < .001), and non-English-preferred language speakers (OR, 0.78 [95% CI, 0.72 to 0.84]; P < .001; IRR, 0.82 [95% CI, 0.80 to 0.84]; P < .001) were, respectively, 37%, 15%, and 22% less likely to open at least one note, and opened 26%, 10%, and 18% percent fewer notes, compared with white, non-Hispanic, and English-preferred patients, respectively.

Conclusion: This analysis highlighted disparities, by race, ethnicity, and language, in cancer patients' accessing clinical notes. Tailored interventions are crucial to ensure that diverse groups benefit from digital health care resources.

目的:临床记录的访问增强了患者的参与和信任,21世纪治愈法案于2021年4月实现了患者的即时电子访问。然而,技术进步可能会使差距永久化,这方面的研究尚未充分。了解肿瘤病历获取方面是否存在不公平现象,将凸显向所有正在接受复杂医疗护理的患者提供这一基础健康信息的挑战。材料和方法:本研究在一个高容量的专业癌症中心进行,探讨了从2021年9月1日到2022年8月31日发布到患者门户账户的临床记录之间的差异,并在2024年3月19日之前访问。使用逻辑回归和泊松回归来确定与病历获取和病历打开率相关的患者特征。结果:共纳入124554例患者和815104份临床记录,其中43.7%(356290份)被查阅。虽然适度差异率出现性的访问,年龄,婚姻状况,更大的差距出现在种族,种族,和语言:黑人患者(比值比(或),0.63(95%可信区间,0.60至0.66);P <措施;发病率比(IRR), 0.74(95%可信区间,0.73至0.76);P <措施),拉美裔患者(或者,0.85(95%可信区间,0.80至0.90);P <措施;IRR, 0.90(95%可信区间,0.89至0.92);P <措施),和non-English-preferred语言(或者,0.78(95%可信区间,0.72至0.84);P <措施;IRR, 0.82 [95% CI, 0.80 ~ 0.84];P < 0.001)分别比白人、非西班牙裔和偏好英语的患者少打开至少一个音符的可能性分别为37%、15%和22%,打开音符的可能性分别为26%、10%和18%。结论:该分析突出了不同种族、民族和语言在癌症患者获取临床记录方面的差异。量身定制的干预措施对于确保不同群体从数字医疗保健资源中受益至关重要。
{"title":"Which Patients With Cancer Access Their Clinical Notes? A Disparities Analysis.","authors":"Susan Chimonas, Charlie White, Kenneth Seier, Fernanda Polubriaginof, Chelsea Michael, Chasity Walters, Allison Lipitz-Snyderman, Gilad Kuperman","doi":"10.1200/CCI-24-00254","DOIUrl":"https://doi.org/10.1200/CCI-24-00254","url":null,"abstract":"<p><strong>Purpose: </strong>Access to clinical notes enhances patient engagement and trust, and the 21st Century Cures Act enabled immediate electronic patient access in April 2021. Yet, technological advances may perpetuate disparities, which remain understudied. Understanding whether inequities in note access exist in oncology would highlight challenges around making this foundational health information available to all patients receiving ongoing, complex medical care.</p><p><strong>Materials and methods: </strong>This study at a high-volume specialty cancer center explored disparities around clinical notes posted to patients' portal accounts from September 1, 2021, to August 31, 2022, and accessed by March 19, 2024. Logistic and Poisson regression were used to identify patient characteristics associated with note access and note opening rates.</p><p><strong>Results: </strong>The study included 124,554 patients and 815,104 clinical notes, 43.7% of which (356,290) were accessed. Although modest differences in access rates emerged around sex, age, and marital status, larger disparities appeared for ethnicity, race, and language: Black patients (odds ratio [OR], 0.63 [95% CI, 0.60 to 0.66]; <i>P</i> < .001; incidence rate ratio [IRR], 0.74 [95% CI, 0.73 to 0.76]; <i>P</i> < .001), Hispanic patients (OR, 0.85 [95% CI, 0.80 to 0.90]; <i>P</i> < .001; IRR, 0.90 [95% CI, 0.89 to 0.92]; <i>P</i> < .001), and non-English-preferred language speakers (OR, 0.78 [95% CI, 0.72 to 0.84]; <i>P</i> < .001; IRR, 0.82 [95% CI, 0.80 to 0.84]; <i>P</i> < .001) were, respectively, 37%, 15%, and 22% less likely to open at least one note, and opened 26%, 10%, and 18% percent fewer notes, compared with white, non-Hispanic, and English-preferred patients, respectively.</p><p><strong>Conclusion: </strong>This analysis highlighted disparities, by race, ethnicity, and language, in cancer patients' accessing clinical notes. Tailored interventions are crucial to ensure that diverse groups benefit from digital health care resources.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2400254"},"PeriodicalIF":2.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145310033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating Dimensionality Reduction for Patient-Reported Outcome-Based Survival Modeling in Patients With Head and Neck Cancer. 评估头颈癌患者报告的基于结果的生存模型的降维效果。
IF 2.8 Q2 ONCOLOGY Pub Date : 2025-10-01 Epub Date: 2025-10-15 DOI: 10.1200/CCI-25-00069
Eric Ababio Anyimadu, Yaohua Wang, Amy C Moreno, Clifton David Fuller, Xinhua Zhang, G Elisabeta Marai, Guadalupe M Canahuate

Purpose: This study aims to improve survival modeling in head and neck cancer (HNC) by integrating patient-reported outcomes (PROs) using dimensionality reduction techniques. PROs capture symptom severity across the treatment timeline and offer key insights for personalized care. However, their high dimensionality poses challenges such as overfitting and computational complexity. This work focuses on transforming and incorporating PRO data to enhance model performance in HNC.

Materials and methods: We analyzed retrospective data of 923 patients with HNC treated at the University of Texas MD Anderson Cancer Center between 2010 and 2021. Baseline clinical data including demographic, treatment, and disease characteristics were used to build a reference survival model. PRO data, capturing symptom ratings, were integrated using dimensionality reduction techniques: principal component analysis (PCA), autoencoders (AEs), and patient clustering. These reduced representations, combined with clinical data, were input into Cox proportional hazards models to predict overall survival (OS) and progression-free survival (PFS). Model performance was assessed using the concordance index, time-dependent AUC, Brier score for calibration, and hazard ratios for predictor significance.

Results: Cox models incorporating PCA and AE outperformed the clinical-only reference model for both OS and PFS. The PCA-based model achieved the highest C-indices (0.74 for OS and 0.64 for PFS), followed by the AE model (0.73 and 0.63) and the clustering model (0.72 and 0.62). Time-dependent AUCs reinforced these results, with PCA showing the highest average AUC over 36 months. All models were well-calibrated, with low Brier scores. Key predictors included age, disease stage, and tumor subsite.

Conclusion: Dimensionality reduction techniques improve survival prediction in patients with HNC by effectively incorporating PRO data, potentially providing greater insights into more personalized treatment strategies.

目的:本研究旨在通过使用降维技术整合患者报告的预后(PROs),改善头颈癌(HNC)的生存模型。专业医生在整个治疗过程中捕捉症状的严重程度,并为个性化护理提供关键见解。然而,它们的高维性带来了过拟合和计算复杂性等挑战。这项工作的重点是转换和合并PRO数据,以提高HNC中的模型性能。材料和方法:我们分析了2010年至2021年间在德克萨斯大学MD安德森癌症中心接受治疗的923例HNC患者的回顾性数据。基线临床数据包括人口统计学、治疗和疾病特征,用于建立参考生存模型。捕获症状评分的PRO数据使用降维技术进行整合:主成分分析(PCA)、自动编码器(AEs)和患者聚类。这些减少的表征,结合临床数据,被输入到Cox比例风险模型中,以预测总生存期(OS)和无进展生存期(PFS)。使用一致性指数、随时间变化的AUC、Brier评分进行校准和风险比进行预测显著性评估模型性能。结果:结合PCA和AE的Cox模型在OS和PFS方面都优于临床参考模型。基于pca的模型的c指数最高(OS为0.74,PFS为0.64),其次是AE模型(0.73和0.63)和聚类模型(0.72和0.62)。时间依赖的AUC强化了这些结果,PCA显示36个月的平均AUC最高。所有模型都经过良好校准,Brier评分较低。关键预测因素包括年龄、疾病分期和肿瘤亚位点。结论:降维技术通过有效地整合PRO数据,提高了HNC患者的生存预测,可能为更个性化的治疗策略提供更深入的见解。
{"title":"Evaluating Dimensionality Reduction for Patient-Reported Outcome-Based Survival Modeling in Patients With Head and Neck Cancer.","authors":"Eric Ababio Anyimadu, Yaohua Wang, Amy C Moreno, Clifton David Fuller, Xinhua Zhang, G Elisabeta Marai, Guadalupe M Canahuate","doi":"10.1200/CCI-25-00069","DOIUrl":"10.1200/CCI-25-00069","url":null,"abstract":"<p><strong>Purpose: </strong>This study aims to improve survival modeling in head and neck cancer (HNC) by integrating patient-reported outcomes (PROs) using dimensionality reduction techniques. PROs capture symptom severity across the treatment timeline and offer key insights for personalized care. However, their high dimensionality poses challenges such as overfitting and computational complexity. This work focuses on transforming and incorporating PRO data to enhance model performance in HNC.</p><p><strong>Materials and methods: </strong>We analyzed retrospective data of 923 patients with HNC treated at the University of Texas MD Anderson Cancer Center between 2010 and 2021. Baseline clinical data including demographic, treatment, and disease characteristics were used to build a reference survival model. PRO data, capturing symptom ratings, were integrated using dimensionality reduction techniques: principal component analysis (PCA), autoencoders (AEs), and patient clustering. These reduced representations, combined with clinical data, were input into Cox proportional hazards models to predict overall survival (OS) and progression-free survival (PFS). Model performance was assessed using the concordance index, time-dependent AUC, Brier score for calibration, and hazard ratios for predictor significance.</p><p><strong>Results: </strong>Cox models incorporating PCA and AE outperformed the clinical-only reference model for both OS and PFS. The PCA-based model achieved the highest C-indices (0.74 for OS and 0.64 for PFS), followed by the AE model (0.73 and 0.63) and the clustering model (0.72 and 0.62). Time-dependent AUCs reinforced these results, with PCA showing the highest average AUC over 36 months. All models were well-calibrated, with low Brier scores. Key predictors included age, disease stage, and tumor subsite.</p><p><strong>Conclusion: </strong>Dimensionality reduction techniques improve survival prediction in patients with HNC by effectively incorporating PRO data, potentially providing greater insights into more personalized treatment strategies.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500069"},"PeriodicalIF":2.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12529990/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145304315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development and Validation of a Claims-Based Algorithm for Identifying Incident Colorectal Cancer and Determining Progression Phases. 基于索赔的识别结直肠癌事件和确定进展阶段的算法的开发和验证。
IF 2.8 Q2 ONCOLOGY Pub Date : 2025-10-01 Epub Date: 2025-10-23 DOI: 10.1200/CCI-25-00107
Nobukazu Agatsuma, Takahiro Utsumi, Takahiro Inoue, Yukari Tanaka, Yoshitaka Nishikawa, Takahiro Horimatsu, Yuki Nakanishi, Mitsuhiro Nikaido, Takeshi Seta, Nobuaki Hoshino, Yoshimitsu Takahashi, Takeo Nakayama, Hiroshi Seno

Purpose: Health insurance claims comprising diagnosis and treatment information offer insights into clinical practice and medical care costs. However, inaccurate diagnosis codes listed in claims and the absence of staging information limit the understanding of colorectal cancer (CRC)-related clinical practice. We developed and validated an algorithm to accurately identify incident CRC cases and their progression phases using claims data.

Methods: We conducted a retrospective study using claims data from three Japanese institutions, including two designated cancer care hospitals (DCCHs), between April 2016 and August 2022. An algorithm that uses CRC-associated diagnostic codes and claim codes for CRC-specific treatments was developed to identify incident CRC cases and classify patients into three progression phases (treatment-sequenced groups: endoscopic, surgical, and noncurative). The algorithm was refined using cohorts from two DCCHs in April-September 2017 and April-September 2019 to enhance performance metrics, with validity tested at these hospitals during different periods and at another hospital. The performance metrics of the algorithm included positive predictive value (PPV), sensitivity in identifying incident CRC, and accuracy in determining progression phases.

Results: The performance metrics of the algorithm were enhanced by filtering prevalent cases, selecting CRC-specific treatments, and targeting invasive CRC cases. The algorithm for identifying incident invasive CRC achieved high PPVs (91.2% [95% CI, 89.5 to 92.7] and 94.4% [95% CI, 87.6 to 97.6]), sensitivities (94.6% [95% CI, 93.1 to 95.7] and 100.0% [95% CI, 95.7 to 100.0]), and progression phase accuracies (91.5% [95% CI, 89.7 to 93.0] and 97.6% [95% CI, 91.8 to 99.4]) in two validation cohorts.

Conclusion: The developed algorithm accurately identified incident invasive CRC cases and determined their progression phases using claims data. Application of this algorithm could contribute to research on real-world practices and medical care costs associated with CRC.

目的:包括诊断和治疗信息的健康保险索赔提供了对临床实践和医疗保健费用的见解。然而,索赔中列出的不准确的诊断代码和缺乏分期信息限制了对结直肠癌(CRC)相关临床实践的理解。我们开发并验证了一种算法,可以使用索赔数据准确识别CRC事件病例及其进展阶段。方法:我们对2016年4月至2022年8月期间来自三家日本机构(包括两家指定癌症护理医院(DCCHs))的索赔数据进行了回顾性研究。研究人员开发了一种算法,该算法使用CRC相关的诊断代码和CRC特异性治疗的索赔代码来识别CRC事件病例,并将患者分为三个进展阶段(治疗顺序组:内窥镜、手术和不可治愈)。在2017年4月至9月和2019年4月至9月期间,使用两家DCCHs的队列对该算法进行了改进,以提高绩效指标,并在不同时期和另一家医院对这些医院进行了有效性测试。该算法的性能指标包括阳性预测值(PPV)、识别CRC事件的敏感性和确定进展阶段的准确性。结果:通过过滤流行病例、选择CRC特异性治疗方法和靶向侵袭性CRC病例,提高了算法的性能指标。在两个验证队列中,识别侵袭性结直肠癌的算法实现了较高的ppv (91.2% [95% CI, 89.5至92.7]和94.4% [95% CI, 87.6至97.6])、灵敏度(94.6% [95% CI, 93.1至95.7]和100.0% [95% CI, 95.7至100.0])和进展阶段准确性(91.5% [95% CI, 89.7至93.0]和97.6% [95% CI, 91.8至99.4])。结论:开发的算法能够准确识别侵袭性结直肠癌病例,并根据索赔数据确定其进展阶段。该算法的应用有助于研究与CRC相关的现实实践和医疗费用。
{"title":"Development and Validation of a Claims-Based Algorithm for Identifying Incident Colorectal Cancer and Determining Progression Phases.","authors":"Nobukazu Agatsuma, Takahiro Utsumi, Takahiro Inoue, Yukari Tanaka, Yoshitaka Nishikawa, Takahiro Horimatsu, Yuki Nakanishi, Mitsuhiro Nikaido, Takeshi Seta, Nobuaki Hoshino, Yoshimitsu Takahashi, Takeo Nakayama, Hiroshi Seno","doi":"10.1200/CCI-25-00107","DOIUrl":"https://doi.org/10.1200/CCI-25-00107","url":null,"abstract":"<p><strong>Purpose: </strong>Health insurance claims comprising diagnosis and treatment information offer insights into clinical practice and medical care costs. However, inaccurate diagnosis codes listed in claims and the absence of staging information limit the understanding of colorectal cancer (CRC)-related clinical practice. We developed and validated an algorithm to accurately identify incident CRC cases and their progression phases using claims data.</p><p><strong>Methods: </strong>We conducted a retrospective study using claims data from three Japanese institutions, including two designated cancer care hospitals (DCCHs), between April 2016 and August 2022. An algorithm that uses CRC-associated diagnostic codes and claim codes for CRC-specific treatments was developed to identify incident CRC cases and classify patients into three progression phases (treatment-sequenced groups: endoscopic, surgical, and noncurative). The algorithm was refined using cohorts from two DCCHs in April-September 2017 and April-September 2019 to enhance performance metrics, with validity tested at these hospitals during different periods and at another hospital. The performance metrics of the algorithm included positive predictive value (PPV), sensitivity in identifying incident CRC, and accuracy in determining progression phases.</p><p><strong>Results: </strong>The performance metrics of the algorithm were enhanced by filtering prevalent cases, selecting CRC-specific treatments, and targeting invasive CRC cases. The algorithm for identifying incident invasive CRC achieved high PPVs (91.2% [95% CI, 89.5 to 92.7] and 94.4% [95% CI, 87.6 to 97.6]), sensitivities (94.6% [95% CI, 93.1 to 95.7] and 100.0% [95% CI, 95.7 to 100.0]), and progression phase accuracies (91.5% [95% CI, 89.7 to 93.0] and 97.6% [95% CI, 91.8 to 99.4]) in two validation cohorts.</p><p><strong>Conclusion: </strong>The developed algorithm accurately identified incident invasive CRC cases and determined their progression phases using claims data. Application of this algorithm could contribute to research on real-world practices and medical care costs associated with CRC.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500107"},"PeriodicalIF":2.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145356784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mitigating Ethical Issues for Large Language Models in Oncology: A Systematic Review. 减轻肿瘤大语言模型的伦理问题:系统综述。
IF 2.8 Q2 ONCOLOGY Pub Date : 2025-09-01 Epub Date: 2025-09-24 DOI: 10.1200/CCI-25-00076
Shuang Zhou, Xingyi Liu, Zidu Xu, Zaifu Zhan, Meijia Song, Jun Wang, Shiao Liu, Hua Xu, Rui Zhang

Purpose: Large language models (LLMs) have demonstrated remarkable versatility in oncology applications, such as cancer staging and survival analysis. Despite their potential, ethical concerns such as data privacy breaches, bias in training data, lack of transparency, and risks associated with erroneous outputs pose significant challenges to their adoption in high-stakes oncology settings. Therefore, we aim to explore the ethical challenges associated with LLM-based applications in oncology and evaluate emerging techniques designed to address these issues.

Methods: Following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses framework, a systematic review was conducted to evaluate publications related to ethical issues of LLMs in oncology across eight academic databases (eg, PubMed, Web of Science, and Embase) between January 1, 2019, and December 31, 2024.

Results: The search retrieved 4,319 published articles, of which 65 publications were preserved and included in our analysis. We identified six prevalent ethical challenges in oncology, including trust, equity, privacy, transparency, nonmaleficence, and accountability. We then evaluated emerging technical solutions to mitigate ethical challenges and summarized evaluation metrics used to assess these solutions' effectiveness.

Conclusion: This review provides actionable recommendations for responsibly deploying LLMs in oncology, ensuring adherence to ethical guidelines, and fostering improved patient outcomes. By bridging technical and clinical perspectives, this review offers a foundational framework for advancing ethical artificial intelligence applications in oncology and highlights areas for future research.

目的:大型语言模型(llm)在肿瘤分期和生存分析等肿瘤学应用中表现出了显著的多功能性。尽管它们具有潜力,但诸如数据隐私泄露、训练数据偏差、缺乏透明度以及与错误输出相关的风险等伦理问题,对它们在高风险肿瘤学环境中的采用构成了重大挑战。因此,我们的目标是探索与基于法学硕士的肿瘤学应用相关的伦理挑战,并评估旨在解决这些问题的新兴技术。方法:根据系统评价和荟萃分析框架的首选报告项目,对2019年1月1日至2024年12月31日期间8个学术数据库(如PubMed、Web of Science和Embase)中与肿瘤学法学硕士伦理问题相关的出版物进行系统评价。结果:检索到4319篇已发表文章,其中65篇被保留并纳入我们的分析。我们确定了肿瘤学中六个普遍的伦理挑战,包括信任、公平、隐私、透明度、非恶意和问责制。然后,我们评估了新兴的技术解决方案,以减轻道德挑战,并总结了用于评估这些解决方案有效性的评估指标。结论:本综述为负责任地在肿瘤学中部署法学硕士提供了可操作的建议,确保遵守伦理准则,并促进改善患者预后。通过连接技术和临床观点,本综述为推进肿瘤伦理人工智能应用提供了一个基础框架,并强调了未来的研究领域。
{"title":"Mitigating Ethical Issues for Large Language Models in Oncology: A Systematic Review.","authors":"Shuang Zhou, Xingyi Liu, Zidu Xu, Zaifu Zhan, Meijia Song, Jun Wang, Shiao Liu, Hua Xu, Rui Zhang","doi":"10.1200/CCI-25-00076","DOIUrl":"10.1200/CCI-25-00076","url":null,"abstract":"<p><strong>Purpose: </strong>Large language models (LLMs) have demonstrated remarkable versatility in oncology applications, such as cancer staging and survival analysis. Despite their potential, ethical concerns such as data privacy breaches, bias in training data, lack of transparency, and risks associated with erroneous outputs pose significant challenges to their adoption in high-stakes oncology settings. Therefore, we aim to explore the ethical challenges associated with LLM-based applications in oncology and evaluate emerging techniques designed to address these issues.</p><p><strong>Methods: </strong>Following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses framework, a systematic review was conducted to evaluate publications related to ethical issues of LLMs in oncology across eight academic databases (eg, PubMed, Web of Science, and Embase) between January 1, 2019, and December 31, 2024.</p><p><strong>Results: </strong>The search retrieved 4,319 published articles, of which 65 publications were preserved and included in our analysis. We identified six prevalent ethical challenges in oncology, including trust, equity, privacy, transparency, nonmaleficence, and accountability. We then evaluated emerging technical solutions to mitigate ethical challenges and summarized evaluation metrics used to assess these solutions' effectiveness.</p><p><strong>Conclusion: </strong>This review provides actionable recommendations for responsibly deploying LLMs in oncology, ensuring adherence to ethical guidelines, and fostering improved patient outcomes. By bridging technical and clinical perspectives, this review offers a foundational framework for advancing ethical artificial intelligence applications in oncology and highlights areas for future research.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500076"},"PeriodicalIF":2.8,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145139554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Oncology-Specific Question Answering With Large Language Models Through Fine-Tuned Embeddings With Synthetic Data. 通过合成数据的微调嵌入,用大型语言模型增强肿瘤特定问题的回答。
IF 2.8 Q2 ONCOLOGY Pub Date : 2025-09-01 Epub Date: 2025-09-05 DOI: 10.1200/CCI-25-00011
Kun-Han Lu, Sina Mehdinia, Kingson Man, Chi Wah Wong, Allen Mao, Zahra Eftekhari

Purpose: The recent advancements of retrieval-augmented generation (RAG) and large language models (LLMs) have revolutionized the extraction of real-world evidence from unstructured electronic health records (EHRs) in oncology. This study aims to enhance RAG's effectiveness by implementing a retriever encoder specifically designed for oncology EHRs, with the goal of improving the precision and relevance of retrieved clinical notes for oncology-related queries.

Methods: Our model was pretrained with more than six million oncology notes from 209,135 patients at City of Hope. The model was subsequently fine-tuned into a sentence transformer model using 12,371 query-passage training pairs. Specifically, the passages were obtained from actual patient notes, whereas the query was synthesized by an LLM. We evaluated the retrieval performance of our model by comparing it with six widely used embedding models on 50 oncology questions across 10 categories based on Normalized Discounted Cumulative Gain (NDCG), Precision, and Recall.

Results: In our test data set comprising 53 patients, our model exceeded the performance of the runner-up model by 9% for NDCG (evaluated at the top 10 results), 7% for Precision (top 10), and 6% for Recall (top 10). Our model showed exceptional retrieval performance across all metrics for oncology-specific categories, including biomarkers assessed, current diagnosis, disease status, laboratory results, tumor characteristics, and tumor staging.

Conclusion: Our findings highlight the effectiveness of pretrained contextual embeddings and sentence transformers in retrieving pertinent notes from oncology EHRs. The innovative use of LLM-synthesized query-passage pairs for data augmentation was proven to be effective. This fine-tuning approach holds significant promise in specialized fields like health care, where acquiring annotated data is challenging.

目的:检索增强生成(RAG)和大型语言模型(llm)的最新进展彻底改变了肿瘤学中从非结构化电子健康记录(EHRs)中提取真实世界证据的方法。本研究旨在通过实现一个专门为肿瘤电子病历设计的检索编码器来提高RAG的有效性,目的是提高肿瘤相关查询检索临床记录的准确性和相关性。方法:我们的模型是用来自希望之城209,135名患者的600多万份肿瘤笔记进行预训练的。该模型随后使用12,371个查询通道训练对微调为句子转换模型。具体来说,这些段落是从实际的病人笔记中获得的,而查询是由LLM合成的。基于归一化贴现累积增益(NDCG)、精度和召回率,我们通过将模型与六种广泛使用的嵌入模型在10个类别的50个肿瘤学问题上进行比较,评估了模型的检索性能。结果:在包含53名患者的测试数据集中,我们的模型在NDCG(以前10名的结果进行评估)、精度(前10名)和召回率(前10名)方面的表现分别超过了第二名模型9%、7%和6%。我们的模型在肿瘤特定类别的所有指标上都显示出卓越的检索性能,包括评估的生物标志物、当前诊断、疾病状态、实验室结果、肿瘤特征和肿瘤分期。结论:我们的研究结果强调了预先训练的上下文嵌入和句子转换在从肿瘤学电子病历中检索相关笔记方面的有效性。将llm合成的查询通道对创新地用于数据增强已被证明是有效的。这种微调方法在医疗保健等专业领域具有重要的前景,在这些领域获取带注释的数据具有挑战性。
{"title":"Enhancing Oncology-Specific Question Answering With Large Language Models Through Fine-Tuned Embeddings With Synthetic Data.","authors":"Kun-Han Lu, Sina Mehdinia, Kingson Man, Chi Wah Wong, Allen Mao, Zahra Eftekhari","doi":"10.1200/CCI-25-00011","DOIUrl":"https://doi.org/10.1200/CCI-25-00011","url":null,"abstract":"<p><strong>Purpose: </strong>The recent advancements of retrieval-augmented generation (RAG) and large language models (LLMs) have revolutionized the extraction of real-world evidence from unstructured electronic health records (EHRs) in oncology. This study aims to enhance RAG's effectiveness by implementing a retriever encoder specifically designed for oncology EHRs, with the goal of improving the precision and relevance of retrieved clinical notes for oncology-related queries.</p><p><strong>Methods: </strong>Our model was pretrained with more than six million oncology notes from 209,135 patients at City of Hope. The model was subsequently fine-tuned into a sentence transformer model using 12,371 query-passage training pairs. Specifically, the passages were obtained from actual patient notes, whereas the query was synthesized by an LLM. We evaluated the retrieval performance of our model by comparing it with six widely used embedding models on 50 oncology questions across 10 categories based on Normalized Discounted Cumulative Gain (NDCG), Precision, and Recall.</p><p><strong>Results: </strong>In our test data set comprising 53 patients, our model exceeded the performance of the runner-up model by 9% for NDCG (evaluated at the top 10 results), 7% for Precision (top 10), and 6% for Recall (top 10). Our model showed exceptional retrieval performance across all metrics for oncology-specific categories, including biomarkers assessed, current diagnosis, disease status, laboratory results, tumor characteristics, and tumor staging.</p><p><strong>Conclusion: </strong>Our findings highlight the effectiveness of pretrained contextual embeddings and sentence transformers in retrieving pertinent notes from oncology EHRs. The innovative use of LLM-synthesized query-passage pairs for data augmentation was proven to be effective. This fine-tuning approach holds significant promise in specialized fields like health care, where acquiring annotated data is challenging.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500011"},"PeriodicalIF":2.8,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145006898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Erratum: Breast, Cervical, and Colorectal Cancer Screening Among New Jersey Medicaid Enrollees: 2017-2022. 勘误:乳腺癌,宫颈癌和结直肠癌筛查在新泽西州医疗补助登记者:2017-2022。
IF 2.8 Q2 ONCOLOGY Pub Date : 2025-09-01 Epub Date: 2025-09-10 DOI: 10.1200/CCI-25-00256
Ann M Nguyen, Adriana Waldron-Corredor, Feng-Yi Liu, Xiaoling Yun, Jose Nova, Anita Y Kinney, Joel C Cantor, Jennifer Tsui
{"title":"Erratum: Breast, Cervical, and Colorectal Cancer Screening Among New Jersey Medicaid Enrollees: 2017-2022.","authors":"Ann M Nguyen, Adriana Waldron-Corredor, Feng-Yi Liu, Xiaoling Yun, Jose Nova, Anita Y Kinney, Joel C Cantor, Jennifer Tsui","doi":"10.1200/CCI-25-00256","DOIUrl":"https://doi.org/10.1200/CCI-25-00256","url":null,"abstract":"","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500256"},"PeriodicalIF":2.8,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145031163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid ReGex and Natural Language Inference Model as a Zero-Shot Classifier for Extracting Data From Medical Reports. 混合ReGex和自然语言推理模型作为零采样分类器从医疗报告中提取数据。
IF 2.8 Q2 ONCOLOGY Pub Date : 2025-09-01 Epub Date: 2025-09-22 DOI: 10.1200/CCI-25-00130
Nicolas Wagneur, Olivier Capitain, Stéphane Supiot, Florent Le Borgne, François Bocquet, Mario Campone, Tanguy Perennec

Purpose: This study presents a new method based on regular expressions (ReGex) and artificial intelligence for extracting relevant medical data from clinical reports. This hybrid approach is designed to address the limitations of each technique. The pipeline is evaluated for its effectiveness in extracting key clinical information from prostate cancer medical reports.

Methods: We developed a hybrid pipeline that combines ReGex for initial data extraction with a Natural Language Inference model for classification. This approach was retrospectively applied to 1,000 reports randomly selected among all consultation reports of patients with prostate cancer treated at the institute, focusing on identifying key clinical information such as rectal bleeding, dysuria, pollakiuria, and hematuria. The model's performance was evaluated using precision, recall, accuracy, F1-score, and Cohen's kappa coefficient.

Results: The pipeline demonstrated high performance, with precision scores ranging from 0.778 to 0.954 and recall consistently high at 0.920 to 1.00. F1-scores indicated balanced accuracy across symptoms, and Cohen's kappa values (0.871 to 0.951) reflected strong agreement with physician-labeled data.

Conclusion: The proposed pipeline is both efficient and fast while being computationally lightweight. It achieves high accuracy in extracting medical data from clinical reports, making it an effective and practical tool for clinical research and health care applications.

目的:提出一种基于正则表达式(ReGex)和人工智能的临床报告相关医学数据提取新方法。这种混合方法旨在解决每种技术的局限性。该管道在从前列腺癌医学报告中提取关键临床信息方面的有效性得到了评估。方法:我们开发了一个混合管道,该管道结合了用于初始数据提取的ReGex和用于分类的自然语言推理模型。该方法回顾性应用于在该所治疗的前列腺癌患者的所有会诊报告中随机抽取的1000份报告,重点识别直肠出血、排尿困难、尿疹、血尿等关键临床信息。模型的性能评估使用精度,召回率,准确度,f1得分,和科恩的kappa系数。结果:管道表现出良好的性能,精度得分在0.778 ~ 0.954之间,召回率始终保持在0.920 ~ 1.00之间。f1评分表明各症状的准确性平衡,Cohen的kappa值(0.871至0.951)与医生标记的数据高度一致。结论:所提出的流水线既高效又快速,而且计算量轻。从临床报告中提取医疗数据的准确性较高,是临床研究和医疗保健应用的有效实用工具。
{"title":"Hybrid ReGex and Natural Language Inference Model as a Zero-Shot Classifier for Extracting Data From Medical Reports.","authors":"Nicolas Wagneur, Olivier Capitain, Stéphane Supiot, Florent Le Borgne, François Bocquet, Mario Campone, Tanguy Perennec","doi":"10.1200/CCI-25-00130","DOIUrl":"https://doi.org/10.1200/CCI-25-00130","url":null,"abstract":"<p><strong>Purpose: </strong>This study presents a new method based on regular expressions (ReGex) and artificial intelligence for extracting relevant medical data from clinical reports. This hybrid approach is designed to address the limitations of each technique. The pipeline is evaluated for its effectiveness in extracting key clinical information from prostate cancer medical reports.</p><p><strong>Methods: </strong>We developed a hybrid pipeline that combines ReGex for initial data extraction with a Natural Language Inference model for classification. This approach was retrospectively applied to 1,000 reports randomly selected among all consultation reports of patients with prostate cancer treated at the institute, focusing on identifying key clinical information such as rectal bleeding, dysuria, pollakiuria, and hematuria. The model's performance was evaluated using precision, recall, accuracy, F1-score, and Cohen's kappa coefficient.</p><p><strong>Results: </strong>The pipeline demonstrated high performance, with precision scores ranging from 0.778 to 0.954 and recall consistently high at 0.920 to 1.00. F1-scores indicated balanced accuracy across symptoms, and Cohen's kappa values (0.871 to 0.951) reflected strong agreement with physician-labeled data.</p><p><strong>Conclusion: </strong>The proposed pipeline is both efficient and fast while being computationally lightweight. It achieves high accuracy in extracting medical data from clinical reports, making it an effective and practical tool for clinical research and health care applications.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500130"},"PeriodicalIF":2.8,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145126390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial Intelligence-Based Model Exploiting Hematoxylin and Eosin Images to Predict Rare Gene Mutations in Patients With Lung Adenocarcinoma. 利用苏木精和伊红图像预测肺腺癌患者罕见基因突变的人工智能模型。
IF 2.8 Q2 ONCOLOGY Pub Date : 2025-09-01 Epub Date: 2025-09-26 DOI: 10.1200/CCI-25-00093
Peiling Yu, Weixing Chen, Nan Liu, Yang Yu, Hongyu Guo, Yinan Yuan, Weilin Guo, Yini Alatan, Jinming Zhao, Hongbo Su, Siru Nie, Xiaoyu Cui, Yuan Miao

Purpose: Accurately identifying gene mutations in lung cancer is crucial for treatment, while molecular diagnostic methods are time-consuming and complex. This study aims to develop an advanced deep learning model to address this issue.

Methods: In this study, the ResNeXt101 model framework was established to predict the gene mutation status in lung adenocarcinoma. The model was trained and validated using data from two cohorts: cohort 1, comprising 144 patients from the First Affiliated Hospital of China Medical University, and cohort 2, which includes 69 patients from the The Cancer Genome Atlas-Lung Adenocarcinoma public database. The model was trained and validated on the two data sets, respectively, and they served as external test sets for each other to further verify the performance of the model. Additionally, we tested the trained model on a metastatic cancer data set, which included metastases to organs outside the lungs. The performance of the model was evaluated using the AUC, accuracy, precision, recall, and F1 score.

Results: In cohort 1, the model achieved an AUC ranging from 0.93 to 1. In the external test on cohort 2, it performed well in predicting five of the six genes (AUC = 0.85-1). When tested on the metastatic cancer data set, it successfully predicted mutations of three of the six genes (AUC = 0.72-0.80).

Conclusion: The artificial intelligence model developed in this study has a high accuracy in predicting gene mutations in lung adenocarcinoma, which is conducive to improving the management of patients with lung adenocarcinoma and promoting precision medicine.

目的:准确识别肺癌基因突变对治疗至关重要,而分子诊断方法耗时且复杂。本研究旨在开发一种先进的深度学习模型来解决这个问题。方法:本研究建立ResNeXt101模型框架,预测肺腺癌基因突变状态。该模型使用来自两个队列的数据进行训练和验证:队列1包括来自中国医科大学第一附属医院的144名患者,队列2包括来自癌症基因组图谱-肺腺癌公共数据库的69名患者。分别在两个数据集上对模型进行训练和验证,并互为外部测试集,进一步验证模型的性能。此外,我们在转移性癌症数据集上测试了训练模型,其中包括肺外器官的转移。使用AUC、准确度、精密度、召回率和F1分数来评估模型的性能。结果:在队列1中,该模型的AUC范围为0.93 ~ 1。在队列2的外部测试中,该方法预测了6个基因中的5个(AUC = 0.85-1)。当在转移性癌症数据集上进行测试时,它成功地预测了六个基因中的三个基因的突变(AUC = 0.72-0.80)。结论:本研究建立的人工智能模型在预测肺腺癌基因突变方面具有较高的准确性,有利于提高肺腺癌患者的管理水平,促进精准医疗。
{"title":"Artificial Intelligence-Based Model Exploiting Hematoxylin and Eosin Images to Predict Rare Gene Mutations in Patients With Lung Adenocarcinoma.","authors":"Peiling Yu, Weixing Chen, Nan Liu, Yang Yu, Hongyu Guo, Yinan Yuan, Weilin Guo, Yini Alatan, Jinming Zhao, Hongbo Su, Siru Nie, Xiaoyu Cui, Yuan Miao","doi":"10.1200/CCI-25-00093","DOIUrl":"10.1200/CCI-25-00093","url":null,"abstract":"<p><strong>Purpose: </strong>Accurately identifying gene mutations in lung cancer is crucial for treatment, while molecular diagnostic methods are time-consuming and complex. This study aims to develop an advanced deep learning model to address this issue.</p><p><strong>Methods: </strong>In this study, the ResNeXt101 model framework was established to predict the gene mutation status in lung adenocarcinoma. The model was trained and validated using data from two cohorts: cohort 1, comprising 144 patients from the First Affiliated Hospital of China Medical University, and cohort 2, which includes 69 patients from the The Cancer Genome Atlas-Lung Adenocarcinoma public database. The model was trained and validated on the two data sets, respectively, and they served as external test sets for each other to further verify the performance of the model. Additionally, we tested the trained model on a metastatic cancer data set, which included metastases to organs outside the lungs. The performance of the model was evaluated using the AUC, accuracy, precision, recall, and F1 score.</p><p><strong>Results: </strong>In cohort 1, the model achieved an AUC ranging from 0.93 to 1. In the external test on cohort 2, it performed well in predicting five of the six genes (AUC = 0.85-1). When tested on the metastatic cancer data set, it successfully predicted mutations of three of the six genes (AUC = 0.72-0.80).</p><p><strong>Conclusion: </strong>The artificial intelligence model developed in this study has a high accuracy in predicting gene mutations in lung adenocarcinoma, which is conducive to improving the management of patients with lung adenocarcinoma and promoting precision medicine.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500093"},"PeriodicalIF":2.8,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12487657/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145180005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clinician's Artificial Intelligence Checklist and Evaluation Questionnaire: Tools for Oncologists to Assess Artificial Intelligence and Machine Learning Models. 临床医生的人工智能清单和评估问卷:肿瘤学家评估人工智能和机器学习模型的工具。
IF 2.8 Q2 ONCOLOGY Pub Date : 2025-09-01 Epub Date: 2025-09-17 DOI: 10.1200/CCI-25-00067
Nadia S Siddiqui, Yazan Bouchi, Syed Jawad Hussain Shah, Saeed Alqarni, Suraj Sood, Yugyung Lee, John Park, John Kang

Advancements in oncology are accelerating in the fields of artificial intelligence (AI) and machine learning. The complexity and multidisciplinary nature of oncology necessitate a cautious approach to evaluating AI models. The surge in development of AI tools highlights a need for organized evaluation methods. Currently, widely accepted guidelines are aimed at developers and do not provide necessary technical background for clinicians. Additionally, published guides introducing clinicians to AI in medicine often lack user-friendly evaluation tools or lack specificity to oncology. This paper provides background on model development and proposes a yes/no checklist and questionnaire designed to help oncologists effectively assess AI models. The yes/no checklist is intended to be used as a more efficient scan of whether the model conforms to published best standards. The open-ended questionnaire is intended for a more in-depth survey. The checklist and the questionnaire were developed by clinical and AI researchers. Initial discussions identified broad domains, gradually narrowing to model development points relevant to clinical practice. The development process included two literature searches to align with current best practices. Insights from 24 articles were integrated to refine the questionnaire and the checklist. The developed tools are intended for use by clinicians in the field of oncology looking to evaluate AI models. Cases of four AI applications in oncology are analyzed, demonstrating utility in real-world scenarios and enhancing case-based learning for clinicians. These tools highlight the interdisciplinary nature of effective AI integration in oncology.

肿瘤学在人工智能(AI)和机器学习领域的进展正在加速。肿瘤学的复杂性和多学科性质需要谨慎地评估人工智能模型。人工智能工具的迅猛发展凸显了对有组织的评估方法的需求。目前,广泛接受的指南是针对开发人员的,并没有为临床医生提供必要的技术背景。此外,向临床医生介绍医学人工智能的出版指南往往缺乏用户友好的评估工具或缺乏肿瘤学的特异性。本文提供了模型开发的背景,并提出了一个是/否清单和问卷,旨在帮助肿瘤学家有效地评估人工智能模型。是/否检查表用于更有效地扫描模型是否符合已发布的最佳标准。开放式问卷旨在进行更深入的调查。检查表和问卷由临床和人工智能研究人员开发。最初的讨论确定了广泛的领域,逐渐缩小到与临床实践相关的模型开发点。开发过程包括两次文献检索,以与当前的最佳实践保持一致。整合了24篇文章的见解,以完善问卷和检查表。开发的工具旨在供肿瘤领域的临床医生使用,以评估人工智能模型。分析了四种人工智能在肿瘤学中的应用案例,展示了在现实世界场景中的效用,并加强了临床医生基于案例的学习。这些工具突出了人工智能在肿瘤学中有效整合的跨学科性质。
{"title":"Clinician's Artificial Intelligence Checklist and Evaluation Questionnaire: Tools for Oncologists to Assess Artificial Intelligence and Machine Learning Models.","authors":"Nadia S Siddiqui, Yazan Bouchi, Syed Jawad Hussain Shah, Saeed Alqarni, Suraj Sood, Yugyung Lee, John Park, John Kang","doi":"10.1200/CCI-25-00067","DOIUrl":"https://doi.org/10.1200/CCI-25-00067","url":null,"abstract":"<p><p>Advancements in oncology are accelerating in the fields of artificial intelligence (AI) and machine learning. The complexity and multidisciplinary nature of oncology necessitate a cautious approach to evaluating AI models. The surge in development of AI tools highlights a need for organized evaluation methods. Currently, widely accepted guidelines are aimed at developers and do not provide necessary technical background for clinicians. Additionally, published guides introducing clinicians to AI in medicine often lack user-friendly evaluation tools or lack specificity to oncology. This paper provides background on model development and proposes a yes/no checklist and questionnaire designed to help oncologists effectively assess AI models. The yes/no checklist is intended to be used as a more efficient scan of whether the model conforms to published best standards. The open-ended questionnaire is intended for a more in-depth survey. The checklist and the questionnaire were developed by clinical and AI researchers. Initial discussions identified broad domains, gradually narrowing to model development points relevant to clinical practice. The development process included two literature searches to align with current best practices. Insights from 24 articles were integrated to refine the questionnaire and the checklist. The developed tools are intended for use by clinicians in the field of oncology looking to evaluate AI models. Cases of four AI applications in oncology are analyzed, demonstrating utility in real-world scenarios and enhancing case-based learning for clinicians. These tools highlight the interdisciplinary nature of effective AI integration in oncology.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500067"},"PeriodicalIF":2.8,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145082288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Readability of Lay Abstracts and Summaries for Urologic Oncology Literature Using Generative Artificial Intelligence: BRIDGE-AI 6 Randomized Controlled Trial. 利用生成式人工智能提高泌尿外科肿瘤学文献摘要的可读性:BRIDGE-AI 6随机对照试验
IF 2.8 Q2 ONCOLOGY Pub Date : 2025-09-01 Epub Date: 2025-09-10 DOI: 10.1200/CCI-25-00042
Conner Ganjavi, Ethan Layne, Francesco Cei, Karanvir Gill, Vasileios Magoulianitis, Andre Abreu, Mitchell Goldenberg, Mihir M Desai, Inderbir Gill, Giovanni E Cacciamani

Purpose: To evaluate a generative artificial intelligence (GAI) framework for creating readable lay abstracts and summaries (LASs) of urologic oncology research, while maintaining accuracy, completeness, and clarity, for the purpose of assessing their comprehension and perception among patients and caregivers.

Methods: Forty original abstracts (OAs) on prostate, bladder, kidney, and testis cancers from leading journals were selected. LASs were generated using a free GAI tool, with three versions per abstract for consistency. Readability was compared with OAs using validated metrics. Two independent reviewers assessed accuracy, completeness, and clarity and identified AI hallucinations. A pilot study was conducted with 277 patients and caregivers randomly assigned to receive either OAs or LASs and complete comprehension and perception assessments.

Results: Mean GAI-generated LAS generation time was <10 seconds. Across 600 sections generated, readability and quality metrics were consistent (P > .05). Quality scores ranged from 85% to 100%, with hallucinations in 1% of sections. The best test showed significantly better readability (68.9 v 25.3; P < .001), grade level, and text metrics compared with the OA. Methods sections had slightly lower accuracy (85% v 100%; P = .03) and trifecta achievement (82.5% v 100%; P = .01), but other sections retained high quality (≥92.5%; P > .05). GAI-generated LAS recipients scored significantly better in comprehension and most perception-based questions (P < .001) with LAS being the only consistently significant predictor (P < .001).

Conclusion: GAI-generated LASs for urologic oncology research are highly readable and generally preserve the quality of the OAs. Patients and caregivers demonstrated improved comprehension and more favorable perceptions of LASs compared with OAs. Human oversight remains essential to ensure the accurate, complete, and clear representations of the original research.

目的:评估一种生成式人工智能(GAI)框架,用于创建可读的泌尿外科肿瘤学研究摘要和摘要(LASs),同时保持准确性、完整性和清晰度,目的是评估患者和护理人员对其的理解和感知。方法:选取40篇主要期刊上关于前列腺癌、膀胱癌、肾癌和睾丸癌的原始摘要。LASs是使用免费的GAI工具生成的,为了一致性,每个摘要有三个版本。使用经过验证的指标将可读性与oa进行比较。两名独立评审员评估了准确性、完整性和清晰度,并确定了人工智能幻觉。一项试点研究对277名患者和护理人员进行了随机分配,接受oa或LASs,并完成理解和感知评估。结果:gai生成LAS的平均生成时间P < 0.05)。质量分数从85%到100%不等,有1%的部分出现幻觉。与OA相比,最佳测试显示出更好的可读性(68.9 v 25.3; P < 0.001)、年级水平和文本指标。方法切片准确度略低(85% v 100%, P = 0.03),三联片准确度略低(82.5% v 100%, P = 0.01),但其他切片质量较高(≥92.5%,P = 0.05)。ai生成的LAS接收者在理解和大多数基于感知的问题上得分明显更好(P < .001), LAS是唯一持续显著的预测因子(P < .001)。结论:人工智能生成的用于泌尿肿瘤研究的LASs具有很高的可读性,并且总体上保持了oa的质量。与oa相比,患者和护理人员表现出更好的理解和更有利的认知。人为的监督对于确保原始研究的准确、完整和清晰的表述仍然是必不可少的。
{"title":"Enhancing Readability of Lay Abstracts and Summaries for Urologic Oncology Literature Using Generative Artificial Intelligence: BRIDGE-AI 6 Randomized Controlled Trial.","authors":"Conner Ganjavi, Ethan Layne, Francesco Cei, Karanvir Gill, Vasileios Magoulianitis, Andre Abreu, Mitchell Goldenberg, Mihir M Desai, Inderbir Gill, Giovanni E Cacciamani","doi":"10.1200/CCI-25-00042","DOIUrl":"10.1200/CCI-25-00042","url":null,"abstract":"<p><strong>Purpose: </strong>To evaluate a generative artificial intelligence (GAI) framework for creating readable lay abstracts and summaries (LASs) of urologic oncology research, while maintaining accuracy, completeness, and clarity, for the purpose of assessing their comprehension and perception among patients and caregivers.</p><p><strong>Methods: </strong>Forty original abstracts (OAs) on prostate, bladder, kidney, and testis cancers from leading journals were selected. LASs were generated using a free GAI tool, with three versions per abstract for consistency. Readability was compared with OAs using validated metrics. Two independent reviewers assessed accuracy, completeness, and clarity and identified AI hallucinations. A pilot study was conducted with 277 patients and caregivers randomly assigned to receive either OAs or LASs and complete comprehension and perception assessments.</p><p><strong>Results: </strong>Mean GAI-generated LAS generation time was <10 seconds. Across 600 sections generated, readability and quality metrics were consistent (<i>P</i> > .05). Quality scores ranged from 85% to 100%, with hallucinations in 1% of sections. The best test showed significantly better readability (68.9 <i>v</i> 25.3; <i>P</i> < .001), grade level, and text metrics compared with the OA. Methods sections had slightly lower accuracy (85% <i>v</i> 100%; <i>P</i> = .03) and trifecta achievement (82.5% <i>v</i> 100%; <i>P</i> = .01), but other sections retained high quality (≥92.5%; <i>P</i> > .05). GAI-generated LAS recipients scored significantly better in comprehension and most perception-based questions (<i>P</i> < .001) with LAS being the only consistently significant predictor (<i>P</i> < .001).</p><p><strong>Conclusion: </strong>GAI-generated LASs for urologic oncology research are highly readable and generally preserve the quality of the OAs. Patients and caregivers demonstrated improved comprehension and more favorable perceptions of LASs compared with OAs. Human oversight remains essential to ensure the accurate, complete, and clear representations of the original research.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500042"},"PeriodicalIF":2.8,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145034690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JCO Clinical Cancer Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1