首页 > 最新文献

JCO Clinical Cancer Informatics最新文献

英文 中文
Development and Validation of Emoji Response Scales for Assessing Patient-Reported Outcomes. 用于评估患者报告结果的表情符号反应量表的开发和验证。
IF 3.3 Q2 ONCOLOGY Pub Date : 2025-01-01 Epub Date: 2025-01-07 DOI: 10.1200/CCI-24-00148
Carrie A Thompson, Paul J Novotny, Kathleen Yost, Alicia C Bartz, Lauren Rogak, Amylou C Dueck

Purpose: Emoji are digital images or icons used to express an idea or emotion in electronic communication. The purpose of this study was to develop and evaluate the psychometric properties of two patient-reported scales that incorporate emoji.

Methods: The Emoji Response Scale developed for this study has two parts: the Emoji-Ordinal and Emoji-Mood scales. A pilot study was designed to validate the ordinal nature of the Emoji-Ordinal Scale. Twenty patients were shown all possible pairs of five emoji and asked to select the most positive from each pair. The psychometric ordering was assessed using Coombs unfolding and Thurstone scaling. A separate pilot study was designed to determine which emoji to include in the Emoji-Mood Scale. Ten common feelings experienced by patients with cancer were chosen by the study team. Patients and providers were asked to select the one emoji that best represented each feeling from the selection. The most commonly selected emotions and representative emoji were chosen for the Emoji-Mood Scale. In a randomized study of 294 patients, Spearman correlations, Wilcoxon tests, and Bland-Altman analyses determined the construct validity of the scales compared with Linear Analog Scale Assessments (LASA) and Patient-Reported Outcomes Measurement Information System (PROMIS) scores.

Results: Ninety-five percent of patients selected the same ordering among the ordinal emoji, and Thurstone scaling confirmed the ordinal nature of the response scale. The construct validity of the scales was high with correlations between the Emoji-Ordinal Scale and the LASA scale of 0.70 for emotional well-being, 0.72 for physical well-being, 0.74 for overall quality of life, and -0.81 for fatigue. Emoji-Mood Scale ratings were strongly related to PROMIS global mental, global physical, fatigue, anxiety, sleep disturbance, and social activity scales (P < .0001).

Conclusion: This study provides evidence that scales incorporating emoji are valid for collecting patient-reported outcomes.

目的:表情符号是在电子交流中用来表达思想或情感的数字图像或图标。本研究的目的是开发和评估两种包含表情符号的患者报告量表的心理测量特性。方法:本研究开发的表情符号反应量表分为Emoji- ordinal和Emoji- mood两部分。一项初步研究旨在验证表情符号-顺序量表的序性。研究人员向20名患者展示了5个表情符号的所有可能组合,并要求他们从每对表情符号中选出最积极的一个。采用Coombs展开法和Thurstone量表评估心理测量的排序。另一项独立的试点研究旨在确定表情表情情绪量表中包含哪些表情符号。研究小组选择了癌症患者的十种常见感受。患者和医疗服务提供者被要求从选择中选择一个最能代表每种感觉的表情符号。最常被选择的情绪和代表性的表情符号被选为表情-情绪量表。在一项294例患者的随机研究中,Spearman相关性、Wilcoxon检验和Bland-Altman分析确定了量表与线性模拟量表评估(LASA)和患者报告结果测量信息系统(PROMIS)评分相比的结构效度。结果:95%的患者在有序表情符号中选择了相同的顺序,Thurstone量表证实了反应量表的序数性质。Emoji-Ordinal量表与LASA量表的结构效度较高,情绪幸福感为0.70,身体幸福感为0.72,整体生活质量为0.74,疲劳度为-0.81。Emoji-Mood量表评分与PROMIS整体心理、整体身体、疲劳、焦虑、睡眠障碍和社会活动量表密切相关(P < 0.0001)。结论:本研究提供的证据表明,包含表情符号的量表对于收集患者报告的结果是有效的。
{"title":"Development and Validation of Emoji Response Scales for Assessing Patient-Reported Outcomes.","authors":"Carrie A Thompson, Paul J Novotny, Kathleen Yost, Alicia C Bartz, Lauren Rogak, Amylou C Dueck","doi":"10.1200/CCI-24-00148","DOIUrl":"https://doi.org/10.1200/CCI-24-00148","url":null,"abstract":"<p><strong>Purpose: </strong>Emoji are digital images or icons used to express an idea or emotion in electronic communication. The purpose of this study was to develop and evaluate the psychometric properties of two patient-reported scales that incorporate emoji.</p><p><strong>Methods: </strong>The Emoji Response Scale developed for this study has two parts: the Emoji-Ordinal and Emoji-Mood scales. A pilot study was designed to validate the ordinal nature of the Emoji-Ordinal Scale. Twenty patients were shown all possible pairs of five emoji and asked to select the most positive from each pair. The psychometric ordering was assessed using Coombs unfolding and Thurstone scaling. A separate pilot study was designed to determine which emoji to include in the Emoji-Mood Scale. Ten common feelings experienced by patients with cancer were chosen by the study team. Patients and providers were asked to select the one emoji that best represented each feeling from the selection. The most commonly selected emotions and representative emoji were chosen for the Emoji-Mood Scale. In a randomized study of 294 patients, Spearman correlations, Wilcoxon tests, and Bland-Altman analyses determined the construct validity of the scales compared with Linear Analog Scale Assessments (LASA) and Patient-Reported Outcomes Measurement Information System (PROMIS) scores.</p><p><strong>Results: </strong>Ninety-five percent of patients selected the same ordering among the ordinal emoji, and Thurstone scaling confirmed the ordinal nature of the response scale. The construct validity of the scales was high with correlations between the Emoji-Ordinal Scale and the LASA scale of 0.70 for emotional well-being, 0.72 for physical well-being, 0.74 for overall quality of life, and -0.81 for fatigue. Emoji-Mood Scale ratings were strongly related to PROMIS global mental, global physical, fatigue, anxiety, sleep disturbance, and social activity scales (<i>P</i> < .0001).</p><p><strong>Conclusion: </strong>This study provides evidence that scales incorporating emoji are valid for collecting patient-reported outcomes.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2400148"},"PeriodicalIF":3.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142958610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feasibility and Acceptability of Collecting Passive Smartphone Data for Potential Use in Digital Phenotyping Among Family Caregivers and Patients With Advanced Cancer. 收集被动智能手机数据用于家庭护理人员和晚期癌症患者数字表型分析的可行性和可接受性。
IF 3.3 Q2 ONCOLOGY Pub Date : 2025-01-01 Epub Date: 2025-01-02 DOI: 10.1200/CCI-24-00201
J Nicholas Odom, Kyungmi Lee, Erin R Currie, Kristen Allen-Watts, Erin R Harrell, Avery C Bechthold, Sally Engler, Kayleigh Curry, Arif H Kamal, Christine S Ritchie, George Demiris, Alexi A Wright, Marie A Bakitas, Andres Azuero

Purpose: Modeling passively collected smartphone sensor data (called digital phenotyping) has the potential to detect distress among family caregivers and patients with advanced cancer and could lead to novel clinical models of cancer care. The purpose of this study was to assess the feasibility and acceptability of collecting passive smartphone data from family caregivers and their care recipients with advanced cancer over 24 weeks.

Methods: This was an observational feasibility study. Family caregivers and patients with advanced cancer were recruited through clinic or via social media and downloaded a digital phenotyping application (Beiwe) to their smartphones that passively collected sensor data over 24 weeks. Feasibility was evaluated by quantifying enrollment and retention and the quantity of acquired data. Acceptability was assessed through post-24 week qualitative interviews.

Results: Of 178 caregiver and patient dyads approached, 22.5% of caregivers (n = 40) and 10.1% of patients (n = 18) both consented to the study and successfully downloaded the application, with most recruited through social media (93%). Of 24 weeks (168 days), the median number of days that data were received was 141 days. Interviews yielded three themes: (1) experiences with study procedures were generally positive despite some technical challenges; (2) security and privacy concerns were minimal, mitigated by clear explanations, trust in the health care system, and privacy norms; and (3) a clinical model that used passive smartphone monitoring to automatically trigger assistance could be beneficial but with concern about false alarms.

Conclusion: This pilot study of collecting passive smartphone data found mixed indicators of feasibility, with suboptimal enrollment rates, particularly via clinic, but positive retention and data collection rates for those who did enroll. Participants had generally positive views of passive monitoring.

目的:对被动收集的智能手机传感器数据(称为数字表型)进行建模,有可能发现家庭照顾者和晚期癌症患者的痛苦,并可能导致癌症护理的新型临床模型。本研究的目的是评估从患有晚期癌症的家庭照顾者及其照顾者收集24周以上被动智能手机数据的可行性和可接受性。方法:观察性可行性研究。通过诊所或社交媒体招募家庭护理人员和晚期癌症患者,并将数字表型应用程序(Beiwe)下载到他们的智能手机中,该应用程序在24周内被动收集传感器数据。通过量化入学人数和保留率以及获得的数据量来评估可行性。通过24周后的定性访谈评估可接受性。结果:在接触的178对护理人员和患者中,22.5%的护理人员(n = 40)和10.1%的患者(n = 18)都同意这项研究并成功下载了应用程序,其中大多数是通过社交媒体招募的(93%)。在24周(168天)中,收到数据的中位数天数为141天。访谈产生了三个主题:(1)尽管存在一些技术挑战,但研究过程的经验总体上是积极的;(2)安全和隐私问题最小,通过清晰的解释、对医疗保健系统的信任和隐私规范来缓解;(3)使用被动智能手机监测自动触发援助的临床模型可能是有益的,但要注意虚报。结论:这项收集被动智能手机数据的试点研究发现,可行性指标好坏参半,注册率不理想,特别是通过诊所,但注册者的留存率和数据收集率是积极的。参与者普遍对被动监控持积极态度。
{"title":"Feasibility and Acceptability of Collecting Passive Smartphone Data for Potential Use in Digital Phenotyping Among Family Caregivers and Patients With Advanced Cancer.","authors":"J Nicholas Odom, Kyungmi Lee, Erin R Currie, Kristen Allen-Watts, Erin R Harrell, Avery C Bechthold, Sally Engler, Kayleigh Curry, Arif H Kamal, Christine S Ritchie, George Demiris, Alexi A Wright, Marie A Bakitas, Andres Azuero","doi":"10.1200/CCI-24-00201","DOIUrl":"10.1200/CCI-24-00201","url":null,"abstract":"<p><strong>Purpose: </strong>Modeling passively collected smartphone sensor data (called digital phenotyping) has the potential to detect distress among family caregivers and patients with advanced cancer and could lead to novel clinical models of cancer care. The purpose of this study was to assess the feasibility and acceptability of collecting passive smartphone data from family caregivers and their care recipients with advanced cancer over 24 weeks.</p><p><strong>Methods: </strong>This was an observational feasibility study. Family caregivers and patients with advanced cancer were recruited through clinic or via social media and downloaded a digital phenotyping application (Beiwe) to their smartphones that passively collected sensor data over 24 weeks. Feasibility was evaluated by quantifying enrollment and retention and the quantity of acquired data. Acceptability was assessed through post-24 week qualitative interviews.</p><p><strong>Results: </strong>Of 178 caregiver and patient dyads approached, 22.5% of caregivers (n = 40) and 10.1% of patients (n = 18) both consented to the study and successfully downloaded the application, with most recruited through social media (93%). Of 24 weeks (168 days), the median number of days that data were received was 141 days. Interviews yielded three themes: (1) experiences with study procedures were generally positive despite some technical challenges; (2) security and privacy concerns were minimal, mitigated by clear explanations, trust in the health care system, and privacy norms; and (3) a clinical model that used passive smartphone monitoring to automatically trigger assistance could be beneficial but with concern about false alarms.</p><p><strong>Conclusion: </strong>This pilot study of collecting passive smartphone data found mixed indicators of feasibility, with suboptimal enrollment rates, particularly via clinic, but positive retention and data collection rates for those who did enroll. Participants had generally positive views of passive monitoring.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2400201"},"PeriodicalIF":3.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11706344/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142923994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable Machine Learning to Predict Treatment Response in Advanced Non-Small Cell Lung Cancer. 可解释的机器学习预测晚期非小细胞肺癌的治疗反应。
IF 3.3 Q2 ONCOLOGY Pub Date : 2025-01-01 Epub Date: 2025-01-03 DOI: 10.1200/CCI-24-00157
Vinayak S Ahluwalia, Ravi B Parikh

Purpose: Immune checkpoint inhibitors (ICIs) have demonstrated promise in the treatment of various cancers. Single-drug ICI therapy (immuno-oncology [IO] monotherapy) that targets PD-L1 is the standard of care in patients with advanced non-small cell lung cancer (NSCLC) with PD-L1 expression ≥50%. We sought to find out if a machine learning (ML) algorithm can perform better as a predictive biomarker than PD-L1 alone.

Methods: Using a real-world, nationwide electronic health record-derived deidentified database of 38,048 patients with advanced NSCLC, we trained binary prediction algorithms to predict likelihood of 12-month progression-free survival (PFS; 12-month PFS) and 12-month overall survival (OS; 12-month OS) from initiation of first-line therapy. We evaluated the algorithms by calculating the AUC on the test set. We plotted Kaplan-Meier curves and fit Cox survival models comparing survival between patients who were classified as low-risk (LR) for 12-month disease progression or 12-month mortality versus those classified as high-risk.

Results: The ML algorithms achieved an AUC of 0.701 (95% CI, 0.689 to 0.714) and 0.718 (95% CI, 0.707 to 0.730) for 12-month PFS and 12-month OS, respectively. Patients in the LR group had lower 12-month disease progression (hazard ratio [HR], 0.47 [95% CI, 0.45 to 0.50]; P < .001) and 12-month all-cause mortality (HR, 0.31 [95% CI, 0.29 to 0.34]; P < .0001) compared with the high-risk group. Patients deemed LR for disease progression and mortality on IO monotherapy were less likely to progress (HR, 0.53 [95% CI, 0.46 to 0.61]; P < .0001) or die (HR, 0.30 [95% CI, 0.24 to 0.37]; P < .001) compared with the high-risk group.

Conclusion: An ML algorithm can more accurately predict response to first-line therapy, including IO monotherapy, in patients with advanced NSCLC, compared with PD-L1 alone. ML may better aid clinical decision making in oncology than a single biomarker.

目的:免疫检查点抑制剂(ICIs)在治疗各种癌症方面已经显示出前景。靶向PD-L1的单药ICI治疗(免疫肿瘤学[IO]单药治疗)是PD-L1表达≥50%的晚期非小细胞肺癌(NSCLC)患者的标准治疗方案。我们试图找出机器学习(ML)算法是否可以比单独使用PD-L1更好地作为预测性生物标志物。方法:使用真实世界的全国性电子健康记录衍生的未识别数据库,包括38,048例晚期非小细胞肺癌患者,我们训练二元预测算法来预测12个月无进展生存(PFS;12个月PFS)和12个月总生存期(OS;开始一线治疗后12个月(OS)。我们通过计算测试集上的AUC来评估算法。我们绘制Kaplan-Meier曲线并拟合Cox生存模型,比较低危(LR)患者12个月疾病进展或12个月死亡率与高危患者的生存率。结果:ML算法的12个月PFS和12个月OS的AUC分别为0.701 (95% CI, 0.689至0.714)和0.718 (95% CI, 0.707至0.730)。LR组患者12个月的疾病进展较低(风险比[HR], 0.47 [95% CI, 0.45 ~ 0.50];P < 0.001)和12个月全因死亡率(HR, 0.31 [95% CI, 0.29 ~ 0.34];P < 0.0001)。经IO单药治疗认为疾病进展为LR的患者和死亡率进展的可能性较小(HR, 0.53 [95% CI, 0.46至0.61];P < 0.0001)或死亡(HR, 0.30 [95% CI, 0.24 ~ 0.37];P < 0.001)。结论:与单独使用PD-L1相比,ML算法可以更准确地预测晚期NSCLC患者对一线治疗(包括IO单药治疗)的反应。ML可能比单一的生物标志物更有助于肿瘤学的临床决策。
{"title":"Explainable Machine Learning to Predict Treatment Response in Advanced Non-Small Cell Lung Cancer.","authors":"Vinayak S Ahluwalia, Ravi B Parikh","doi":"10.1200/CCI-24-00157","DOIUrl":"10.1200/CCI-24-00157","url":null,"abstract":"<p><strong>Purpose: </strong>Immune checkpoint inhibitors (ICIs) have demonstrated promise in the treatment of various cancers. Single-drug ICI therapy (immuno-oncology [IO] monotherapy) that targets PD-L1 is the standard of care in patients with advanced non-small cell lung cancer (NSCLC) with PD-L1 expression ≥50%. We sought to find out if a machine learning (ML) algorithm can perform better as a predictive biomarker than PD-L1 alone.</p><p><strong>Methods: </strong>Using a real-world, nationwide electronic health record-derived deidentified database of 38,048 patients with advanced NSCLC, we trained binary prediction algorithms to predict likelihood of 12-month progression-free survival (PFS; 12-month PFS) and 12-month overall survival (OS; 12-month OS) from initiation of first-line therapy. We evaluated the algorithms by calculating the AUC on the test set. We plotted Kaplan-Meier curves and fit Cox survival models comparing survival between patients who were classified as low-risk (LR) for 12-month disease progression or 12-month mortality versus those classified as high-risk.</p><p><strong>Results: </strong>The ML algorithms achieved an AUC of 0.701 (95% CI, 0.689 to 0.714) and 0.718 (95% CI, 0.707 to 0.730) for 12-month PFS and 12-month OS, respectively. Patients in the LR group had lower 12-month disease progression (hazard ratio [HR], 0.47 [95% CI, 0.45 to 0.50]; <i>P</i> < .001) and 12-month all-cause mortality (HR, 0.31 [95% CI, 0.29 to 0.34]; <i>P</i> < .0001) compared with the high-risk group. Patients deemed LR for disease progression and mortality on IO monotherapy were less likely to progress (HR, 0.53 [95% CI, 0.46 to 0.61]; <i>P</i> < .0001) or die (HR, 0.30 [95% CI, 0.24 to 0.37]; <i>P</i> < .001) compared with the high-risk group.</p><p><strong>Conclusion: </strong>An ML algorithm can more accurately predict response to first-line therapy, including IO monotherapy, in patients with advanced NSCLC, compared with PD-L1 alone. ML may better aid clinical decision making in oncology than a single biomarker.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2400157"},"PeriodicalIF":3.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11706357/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142928576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Provision of Radiology Reports Simplified With Large Language Models to Patients With Cancer: Impact on Patient Satisfaction.
IF 3.3 Q2 ONCOLOGY Pub Date : 2025-01-01 Epub Date: 2025-01-29 DOI: 10.1200/CCI-24-00166
Amit Gupta, Swarndeep Singh, Hema Malhotra, Himanshu Pruthi, Aparna Sharma, Amit K Garg, Mukesh Yadav, Devasenathipathy Kandasamy, Atul Batra, Krithika Rangarajan

Purpose: To explore the perceived utility and effect of simplified radiology reports on oncology patients' knowledge and feasibility of large language models (LLMs) to generate such reports.

Materials and methods: This study was approved by the Institute Ethics Committee. In phase I, five state-of-the-art LLMs (Generative Pre-Trained Transformer-4o [GPT-4o], Google Gemini, Claude Opus, Llama-3.1-8B, and Phi-3.5-mini) were tested to simplify 50 oncology computed tomography (CT) report impressions using five distinct prompts with each LLM. The outputs were evaluated quantitatively using readability indices. Five LLM-prompt combinations with best average readability scores were also assessed qualitatively, and the best LLM-prompt combination was selected. In phase II, 100 consecutive oncology patients were randomly assigned into two groups: original report (received original report impression) and simplified report (received LLM-generated simplified versions of their CT report impressions under the supervision of a radiologist). A questionnaire assessed the impact of these reports on patients' knowledge and perceived utility.

Results: In phase I, Claude Opus-Prompt 3 (explain to a 15-year-old) performed slightly better than other LLMs, although scores for GPT-4o, Gemini, Claude Opus, and Llama-3.1 were not significantly different (P > .0033 on Wilcoxon signed-rank test with Bonferroni correction). In phase II, simplified report group patients demonstrated significantly better knowledge of primary site and extent of their disease as well as showed significantly higher confidence and understanding of the report (P < .05 for all). Only three (of 50) simplified reports required corrections by the radiologist.

Conclusion: Simplified radiology reports significantly enhanced patients' understanding and confidence in comprehending their medical condition. LLMs performed very well at this simplification task; therefore, they can be potentially used for this purpose, although there remains a need for human oversight.

{"title":"Provision of Radiology Reports Simplified With Large Language Models to Patients With Cancer: Impact on Patient Satisfaction.","authors":"Amit Gupta, Swarndeep Singh, Hema Malhotra, Himanshu Pruthi, Aparna Sharma, Amit K Garg, Mukesh Yadav, Devasenathipathy Kandasamy, Atul Batra, Krithika Rangarajan","doi":"10.1200/CCI-24-00166","DOIUrl":"https://doi.org/10.1200/CCI-24-00166","url":null,"abstract":"<p><strong>Purpose: </strong>To explore the perceived utility and effect of simplified radiology reports on oncology patients' knowledge and feasibility of large language models (LLMs) to generate such reports.</p><p><strong>Materials and methods: </strong>This study was approved by the Institute Ethics Committee. In phase I, five state-of-the-art LLMs (Generative Pre-Trained Transformer-4o [GPT-4o], Google Gemini, Claude Opus, Llama-3.1-8B, and Phi-3.5-mini) were tested to simplify 50 oncology computed tomography (CT) report impressions using five distinct prompts with each LLM. The outputs were evaluated quantitatively using readability indices. Five LLM-prompt combinations with best average readability scores were also assessed qualitatively, and the best LLM-prompt combination was selected. In phase II, 100 consecutive oncology patients were randomly assigned into two groups: original report (received original report impression) and simplified report (received LLM-generated simplified versions of their CT report impressions under the supervision of a radiologist). A questionnaire assessed the impact of these reports on patients' knowledge and perceived utility.</p><p><strong>Results: </strong>In phase I, Claude Opus-Prompt 3 (explain to a 15-year-old) performed slightly better than other LLMs, although scores for GPT-4o, Gemini, Claude Opus, and Llama-3.1 were not significantly different (<i>P</i> > .0033 on Wilcoxon signed-rank test with Bonferroni correction). In phase II, simplified report group patients demonstrated significantly better knowledge of primary site and extent of their disease as well as showed significantly higher confidence and understanding of the report (<i>P</i> < .05 for all). Only three (of 50) simplified reports required corrections by the radiologist.</p><p><strong>Conclusion: </strong>Simplified radiology reports significantly enhanced patients' understanding and confidence in comprehending their medical condition. LLMs performed very well at this simplification task; therefore, they can be potentially used for this purpose, although there remains a need for human oversight.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2400166"},"PeriodicalIF":3.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143069590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward a Computable Phenotype for Determining Eligibility of Lung Cancer Screening Using Electronic Health Records. 利用电子健康记录确定肺癌筛查合格性的可计算表型
IF 3.3 Q2 ONCOLOGY Pub Date : 2025-01-01 Epub Date: 2025-01-16 DOI: 10.1200/CCI.24.00139
Shuang Yang, Yu Huang, Xiwei Lou, Tianchen Lyu, Ruoqi Wei, Hiren J Mehta, Yonghui Wu, Michelle Alvarado, Ramzi G Salloum, Dejana Braithwaite, Jinhai Huo, Ya-Chen Tina Shih, Yi Guo, Jiang Bian

Purpose: Lung cancer screening (LCS) has the potential to reduce mortality and detect lung cancer at its early stages, but the high false-positive rate associated with low-dose computed tomography (LDCT) for LCS acts as a barrier to its widespread adoption. This study aims to develop computable phenotype (CP) algorithms on the basis of electronic health records (EHRs) to identify individual's eligibility for LCS, thereby enhancing LCS utilization in real-world settings.

Materials and methods: The study cohort included 5,778 individuals who underwent LDCT for LCS from 2012 to 2022, as recorded in the University of Florida Health Integrated Data Repository. CP rules derived from LCS guidelines were used to identify potential candidates, incorporating both structured EHR and clinical notes analyzed via natural language processing. We then conducted manual reviews of 453 randomly selected charts to refine and validate these rules, assessing CP performance using metrics, for example, F1 score, specificity, and sensitivity.

Results: We developed an optimal CP rule that integrates both structured and unstructured data, adhering to the US Preventive Services Task Force 2013 and 2020 guidelines. This rule focuses on age (55-80 years for 2013 and 50-80 years for 2020), smoking status (current, former, and others), and pack-years (≥30 for 2013 and ≥20 for 2020), achieving F1 scores of 0.75 and 0.84 for the respective guidelines. Including unstructured data improved the F1 score performance by up to 9.2% for 2013 and 12.9% for 2020, compared with using structured data alone.

Conclusion: Our findings underscore the critical need for improved documentation of smoking information in EHRs, demonstrate the value of artificial intelligence techniques in enhancing CP performance, and confirm the effectiveness of EHR-based CP in identifying LCS-eligible individuals. This supports its potential to aid clinical decision making and optimize patient care.

目的:肺癌筛查(LCS)具有降低死亡率和早期发现肺癌的潜力,但低剂量计算机断层扫描(LDCT)对LCS的高假阳性率是其广泛采用的障碍。本研究旨在开发基于电子健康记录(EHRs)的可计算表型(CP)算法,以确定个人是否有资格使用LCS,从而提高LCS在现实环境中的应用。材料和方法:研究队列包括5,778名在2012年至2022年期间接受LDCT治疗LCS的个体,记录在佛罗里达大学健康综合数据库中。来自LCS指南的CP规则用于识别潜在的候选人,结合结构化的电子病历和通过自然语言处理分析的临床记录。然后,我们对453个随机选择的图表进行了人工审查,以完善和验证这些规则,使用指标评估CP的表现,例如F1评分、特异性和敏感性。结果:我们开发了一个优化的CP规则,集成了结构化和非结构化数据,符合美国预防服务工作组2013年和2020年的指南。该规则侧重于年龄(2013年55-80岁,2020年50-80岁),吸烟状况(现在,以前和其他)和包龄(2013年≥30岁,2020年≥20岁),分别达到0.75和0.84的F1分数。与单独使用结构化数据相比,包含非结构化数据的F1成绩在2013年和2020年分别提高了9.2%和12.9%。结论:我们的研究结果强调了改善电子病历中吸烟信息记录的迫切需要,证明了人工智能技术在提高CP绩效方面的价值,并证实了基于电子病历的CP在识别lcs合格个体方面的有效性。这支持了它帮助临床决策和优化患者护理的潜力。
{"title":"Toward a Computable Phenotype for Determining Eligibility of Lung Cancer Screening Using Electronic Health Records.","authors":"Shuang Yang, Yu Huang, Xiwei Lou, Tianchen Lyu, Ruoqi Wei, Hiren J Mehta, Yonghui Wu, Michelle Alvarado, Ramzi G Salloum, Dejana Braithwaite, Jinhai Huo, Ya-Chen Tina Shih, Yi Guo, Jiang Bian","doi":"10.1200/CCI.24.00139","DOIUrl":"10.1200/CCI.24.00139","url":null,"abstract":"<p><strong>Purpose: </strong>Lung cancer screening (LCS) has the potential to reduce mortality and detect lung cancer at its early stages, but the high false-positive rate associated with low-dose computed tomography (LDCT) for LCS acts as a barrier to its widespread adoption. This study aims to develop computable phenotype (CP) algorithms on the basis of electronic health records (EHRs) to identify individual's eligibility for LCS, thereby enhancing LCS utilization in real-world settings.</p><p><strong>Materials and methods: </strong>The study cohort included 5,778 individuals who underwent LDCT for LCS from 2012 to 2022, as recorded in the University of Florida Health Integrated Data Repository. CP rules derived from LCS guidelines were used to identify potential candidates, incorporating both structured EHR and clinical notes analyzed via natural language processing. We then conducted manual reviews of 453 randomly selected charts to refine and validate these rules, assessing CP performance using metrics, for example, F1 score, specificity, and sensitivity.</p><p><strong>Results: </strong>We developed an optimal CP rule that integrates both structured and unstructured data, adhering to the US Preventive Services Task Force 2013 and 2020 guidelines. This rule focuses on age (55-80 years for 2013 and 50-80 years for 2020), smoking status (current, former, and others), and pack-years (≥30 for 2013 and ≥20 for 2020), achieving F1 scores of 0.75 and 0.84 for the respective guidelines. Including unstructured data improved the F1 score performance by up to 9.2% for 2013 and 12.9% for 2020, compared with using structured data alone.</p><p><strong>Conclusion: </strong>Our findings underscore the critical need for improved documentation of smoking information in EHRs, demonstrate the value of artificial intelligence techniques in enhancing CP performance, and confirm the effectiveness of EHR-based CP in identifying LCS-eligible individuals. This supports its potential to aid clinical decision making and optimize patient care.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2400139"},"PeriodicalIF":3.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11748906/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143016055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Errata: Waiting to Exhale: The Feasibility and Appropriateness of Home Blood Oxygen Monitoring in Oncology Patients Post-Hospital Discharge. 勘误:等待呼气:肿瘤患者出院后家庭血氧监测的可行性和适宜性。
IF 3.3 Q2 ONCOLOGY Pub Date : 2025-01-01 Epub Date: 2025-01-10 DOI: 10.1200/CCI-24-00300
{"title":"Errata: Waiting to Exhale: The Feasibility and Appropriateness of Home Blood Oxygen Monitoring in Oncology Patients Post-Hospital Discharge.","authors":"","doi":"10.1200/CCI-24-00300","DOIUrl":"https://doi.org/10.1200/CCI-24-00300","url":null,"abstract":"","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2400300"},"PeriodicalIF":3.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142958612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Novel Use and Value of Contrast-Enhanced Susceptibility-Weighted Imaging Morphologic and Radiomic Features in Predicting Extremity Soft Tissue Undifferentiated Pleomorphic Sarcoma Treatment Response.
IF 3.3 Q2 ONCOLOGY Pub Date : 2025-01-01 Epub Date: 2025-01-22 DOI: 10.1200/CCI.24.00042
Raul F Valenzuela, Elvis de Jesus Duran Sierra, Mathew A Canjirathinkal, Behrang Amini, Ken-Pin Hwang, Jingfei Ma, Keila E Torres, R Jason Stafford, Wei-Lien Wang, Robert S Benjamin, Andrew J Bishop, John E Madewell, William A Murphy, Colleen M Costelloe

Purpose: Undifferentiated pleomorphic sarcomas (UPSs) demonstrate therapy-induced hemosiderin deposition, granulation tissue formation, fibrosis, and calcification. We aimed to determine the treatment-assessment value of morphologic tumoral hemorrhage patterns and first- and high-order radiomic features extracted from contrast-enhanced susceptibility-weighted imaging (CE-SWI).

Materials and methods: This retrospective institutional review board-authorized study included 33 patients with extremity UPS with magnetic resonance imaging and resection performed from February 2021 to May 2023. Volumetric tumor segmentation was obtained at baseline, postsystemic chemotherapy (PC), and postradiation therapy (PRT). The pathology-assessed treatment effect (PATE) in surgical specimens separated patients into responders (R; ≥90%, n = 16), partial responders (PR; 89%-31%, n = 10), and nonresponders (NR; ≤30%, n = 7). RECIST, WHO, and volume were assessed for all time points. CE-SWI T2* morphologic patterns and 107 radiomic features were analyzed.

Results: A Complete-Ring (CR) pattern was observed in PRT in 71.4% of R (P = 7.71 × 10-6), an Incomplete-Ring pattern in 33.3% of PR (P = .2751), and a Globular pattern in 50% of NR (P = .1562). The first-order radiomic analysis from the CE-SWI intensity histogram outlined the values of the 10th and 90th percentiles and their skewness. R showed a 280% increase in 10th percentile voxels (P = .061) and a 241% increase in skewness (P = .0449) at PC. PR/NR showed a 690% increase in the 90th percentile voxels (P = .03) at PC. Multiple high-order radiomic texture features observed at PRT discriminated better R versus PR/NR than the first-order features.

Conclusion: CE-SWI morphologic patterns strongly correlate with PATE. The CR morphology pattern was the most frequent in R and had the highest statistical association predicting response at PRT, easily recognized by a radiologist not requiring postprocessing software. It can potentially outperform size-based metrics, such as RECIST. The first- and high-order radiomic analysis found several features separating R versus PR/NR.

{"title":"Novel Use and Value of Contrast-Enhanced Susceptibility-Weighted Imaging Morphologic and Radiomic Features in Predicting Extremity Soft Tissue Undifferentiated Pleomorphic Sarcoma Treatment Response.","authors":"Raul F Valenzuela, Elvis de Jesus Duran Sierra, Mathew A Canjirathinkal, Behrang Amini, Ken-Pin Hwang, Jingfei Ma, Keila E Torres, R Jason Stafford, Wei-Lien Wang, Robert S Benjamin, Andrew J Bishop, John E Madewell, William A Murphy, Colleen M Costelloe","doi":"10.1200/CCI.24.00042","DOIUrl":"https://doi.org/10.1200/CCI.24.00042","url":null,"abstract":"<p><strong>Purpose: </strong>Undifferentiated pleomorphic sarcomas (UPSs) demonstrate therapy-induced hemosiderin deposition, granulation tissue formation, fibrosis, and calcification. We aimed to determine the treatment-assessment value of morphologic tumoral hemorrhage patterns and first- and high-order radiomic features extracted from contrast-enhanced susceptibility-weighted imaging (CE-SWI).</p><p><strong>Materials and methods: </strong>This retrospective institutional review board-authorized study included 33 patients with extremity UPS with magnetic resonance imaging and resection performed from February 2021 to May 2023. Volumetric tumor segmentation was obtained at baseline, postsystemic chemotherapy (PC), and postradiation therapy (PRT). The pathology-assessed treatment effect (PATE) in surgical specimens separated patients into responders (R; ≥90%, n = 16), partial responders (PR; 89%-31%, n = 10), and nonresponders (NR; ≤30%, n = 7). RECIST, WHO, and volume were assessed for all time points. CE-SWI T2* morphologic patterns and 107 radiomic features were analyzed.</p><p><strong>Results: </strong>A Complete-Ring (CR) pattern was observed in PRT in 71.4% of R (<i>P</i> = 7.71 × 10<sup>-6</sup>), an Incomplete-Ring pattern in 33.3% of PR (<i>P</i> = .2751), and a Globular pattern in 50% of NR (<i>P</i> = .1562). The first-order radiomic analysis from the CE-SWI intensity histogram outlined the values of the 10th and 90th percentiles and their skewness. R showed a 280% increase in 10th percentile voxels (<i>P</i> = .061) and a 241% increase in skewness (<i>P</i> = .0449) at PC. PR/NR showed a 690% increase in the 90th percentile voxels (<i>P</i> = .03) at PC. Multiple high-order radiomic texture features observed at PRT discriminated better R versus PR/NR than the first-order features.</p><p><strong>Conclusion: </strong>CE-SWI morphologic patterns strongly correlate with PATE. The CR morphology pattern was the most frequent in R and had the highest statistical association predicting response at PRT, easily recognized by a radiologist not requiring postprocessing software. It can potentially outperform size-based metrics, such as RECIST. The first- and high-order radiomic analysis found several features separating R versus PR/NR.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2400042"},"PeriodicalIF":3.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143025544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development and Validation of Dynamic 5-Year Breast Cancer Risk Model Using Repeated Mammograms. 基于重复乳房x光检查的动态5年乳腺癌风险模型的建立与验证。
IF 3.3 Q2 ONCOLOGY Pub Date : 2024-12-01 Epub Date: 2024-12-05 DOI: 10.1200/CCI-24-00200
Shu Jiang, Debbie L Bennett, Bernard A Rosner, Rulla M Tamimi, Graham A Colditz

Purpose: Current image-based long-term risk prediction models do not fully use previous screening mammogram images. Dynamic prediction models have not been investigated for use in routine care.

Methods: We analyzed a prospective WashU clinic-based cohort of 10,099 cancer-free women at entry (between November 3, 2008 and February 2012). Follow-up through 2020 identified 478 pathology-confirmed breast cancers (BCs). The cohort included 27% Black women. An external validation cohort (Emory) included 18,360 women screened from 2013, followed through 2020. This included 42% Black women and 332 pathology-confirmed BC excluding those diagnosed within 6 months of screening. We trained a dynamic model using repeated screening mammograms at WashU to predict 5-year risk. This opportunistic screening service presented a range of mammogram images for each woman. We applied the model to the external validation data to evaluate discrimination performance (AUC) and calibrated to US SEER.

Results: Using 3 years of previous mammogram images available at the current screening visit, we obtained a 5-year AUC of 0.80 (95% CI, 0.78 to 0.83) in the external validation. This represents a significant improvement over the current visit mammogram AUC 0.74 (95% CI, 0.71 to 0.77; P < .01) in the same women. When calibrated, a risk ratio of 21.1 was observed comparing high (>4%) to very low (<0.3%) 5-year risk. The dynamic model classified 16% of the cohort as high risk among whom 61% of all BCs were diagnosed. The dynamic model performed comparably in Black and White women.

Conclusion: Adding previous screening mammogram images improves 5-year BC risk prediction beyond static models. It can identify women at high risk who might benefit from supplemental screening or risk-reduction strategies.

目的:目前基于图像的长期风险预测模型没有充分利用以往的筛查性乳房x线照片。动态预测模型尚未被研究用于常规护理。方法:在2008年11月3日至2012年2月期间,我们分析了10099名无癌妇女的前瞻性WashU临床队列。到2020年的随访发现478例病理证实的乳腺癌(bc)。该队列包括27%的黑人女性。外部验证队列(Emory)包括从2013年到2020年筛查的18360名女性。其中包括42%的黑人女性和332例病理证实的BC,不包括6个月内筛查出的患者。我们训练了一个动态模型,使用WashU反复筛查乳房x线照片来预测5年的风险。这种机会性筛查服务为每位妇女提供了一系列乳房x光照片。我们将该模型应用于外部验证数据来评估识别性能(AUC),并校准为美国SEER。结果:使用当前筛查访问时可获得的3年既往乳房x线照片,我们在外部验证中获得了0.80 (95% CI, 0.78至0.83)的5年AUC。与目前的乳腺x线检查相比,这是一个显著的改善,AUC为0.74 (95% CI, 0.71至0.77;P < 0.01)。校正后,观察到高风险比为21.1 (b> 4%)与极低风险比(结论:与静态模型相比,添加先前筛查乳房x光片可改善5年BC风险预测。它可以识别出可能从补充筛查或降低风险策略中受益的高风险妇女。
{"title":"Development and Validation of Dynamic 5-Year Breast Cancer Risk Model Using Repeated Mammograms.","authors":"Shu Jiang, Debbie L Bennett, Bernard A Rosner, Rulla M Tamimi, Graham A Colditz","doi":"10.1200/CCI-24-00200","DOIUrl":"10.1200/CCI-24-00200","url":null,"abstract":"<p><strong>Purpose: </strong>Current image-based long-term risk prediction models do not fully use previous screening mammogram images. Dynamic prediction models have not been investigated for use in routine care.</p><p><strong>Methods: </strong>We analyzed a prospective WashU clinic-based cohort of 10,099 cancer-free women at entry (between November 3, 2008 and February 2012). Follow-up through 2020 identified 478 pathology-confirmed breast cancers (BCs). The cohort included 27% Black women. An external validation cohort (Emory) included 18,360 women screened from 2013, followed through 2020. This included 42% Black women and 332 pathology-confirmed BC excluding those diagnosed within 6 months of screening. We trained a dynamic model using repeated screening mammograms at WashU to predict 5-year risk. This opportunistic screening service presented a range of mammogram images for each woman. We applied the model to the external validation data to evaluate discrimination performance (AUC) and calibrated to US SEER.</p><p><strong>Results: </strong>Using 3 years of previous mammogram images available at the current screening visit, we obtained a 5-year AUC of 0.80 (95% CI, 0.78 to 0.83) in the external validation. This represents a significant improvement over the current visit mammogram AUC 0.74 (95% CI, 0.71 to 0.77; <i>P</i> < .01) in the same women. When calibrated, a risk ratio of 21.1 was observed comparing high (>4%) to very low (<0.3%) 5-year risk. The dynamic model classified 16% of the cohort as high risk among whom 61% of all BCs were diagnosed. The dynamic model performed comparably in Black and White women.</p><p><strong>Conclusion: </strong>Adding previous screening mammogram images improves 5-year BC risk prediction beyond static models. It can identify women at high risk who might benefit from supplemental screening or risk-reduction strategies.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2400200"},"PeriodicalIF":3.3,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11634085/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142786942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Identification of Breast Cancer Relapse in Computed Tomography Reports Using Natural Language Processing. 使用自然语言处理的计算机断层扫描报告中乳腺癌复发的自动识别。
IF 3.3 Q2 ONCOLOGY Pub Date : 2024-12-01 Epub Date: 2024-12-20 DOI: 10.1200/CCI.24.00107
Jaimie J Lee, Andres Zepeda, Gregory Arbour, Kathryn V Isaac, Raymond T Ng, Alan M Nichol

Purpose: Breast cancer relapses are rarely collected by cancer registries because of logistical and financial constraints. Hence, we investigated natural language processing (NLP), enhanced with state-of-the-art deep learning transformer tools and large language models, to automate relapse identification in the text of computed tomography (CT) reports.

Methods: We analyzed follow-up CT reports from patients diagnosed with breast cancer between January 1, 2005, and December 31, 2014. The reports were curated and annotated for the presence or absence of local, regional, and distant breast cancer relapses. We performed 10-fold cross-validation to evaluate models identifying different types of relapses in CT reports. Model performance was assessed with classification metrics, reported with 95% confidence intervals.

Results: In our data set of 1,445 CT reports, 799 (55.3%) described any relapse, 72 (5.0%) local relapses, 97 (6.7%) regional relapses, and 743 (51.4%) distant relapses. The any-relapse model achieved an accuracy of 89.6% (87.8-91.1), with a sensitivity of 93.2% (91.4-94.9) and a specificity of 84.2% (80.9-87.1). The local relapse model achieved an accuracy of 94.6% (93.3-95.7), a sensitivity of 44.4% (32.8-56.3), and a specificity of 97.2% (96.2-98.0). The regional relapse model showed an accuracy of 93.6% (92.3-94.9), a sensitivity of 70.1% (60.0-79.1), and a specificity of 95.3% (94.2-96.5). Finally, the distant relapse model demonstrated an accuracy of 88.1% (86.2-89.7), a sensitivity of 91.8% (89.9-93.8), and a specificity of 83.7% (80.5-86.4).

Conclusion: We developed NLP models to identify local, regional, and distant breast cancer relapses from CT reports. Automating the identification of breast cancer relapses can enhance data collection about patient outcomes.

目的:由于后勤和财政限制,乳腺癌复发很少被癌症登记处收集。因此,我们研究了自然语言处理(NLP),辅以最先进的深度学习转换工具和大型语言模型,以自动识别计算机断层扫描(CT)报告文本中的复发。方法:我们分析2005年1月1日至2014年12月31日诊断为乳腺癌的患者的随访CT报告。这些报告是针对局部、区域和远处乳腺癌复发的存在与否进行整理和注释的。我们进行了10倍交叉验证,以评估CT报告中识别不同类型复发的模型。用分类指标评估模型性能,报告的置信区间为95%。结果:在1445例CT报告中,799例(55.3%)复发,72例(5.0%)局部复发,97例(6.7%)局部复发,743例(51.4%)远处复发。任意复发模型的准确率为89.6%(87.8-91.1),敏感性为93.2%(91.4-94.9),特异性为84.2%(80.9-87.1)。局部复发模型准确率为94.6%(93.3 ~ 95.7),敏感性为44.4%(32.8 ~ 56.3),特异性为97.2%(96.2 ~ 98.0)。区域复发模型准确率为93.6%(92.3 ~ 94.9),敏感性为70.1%(60.0 ~ 79.1),特异性为95.3%(94.2 ~ 96.5)。最后,远端复发模型的准确率为88.1%(86.2-89.7),敏感性为91.8%(89.9-93.8),特异性为83.7%(80.5-86.4)。结论:我们开发了NLP模型,从CT报告中识别局部、区域和远处乳腺癌复发。乳腺癌复发的自动化识别可以增强对患者预后的数据收集。
{"title":"Automated Identification of Breast Cancer Relapse in Computed Tomography Reports Using Natural Language Processing.","authors":"Jaimie J Lee, Andres Zepeda, Gregory Arbour, Kathryn V Isaac, Raymond T Ng, Alan M Nichol","doi":"10.1200/CCI.24.00107","DOIUrl":"10.1200/CCI.24.00107","url":null,"abstract":"<p><strong>Purpose: </strong>Breast cancer relapses are rarely collected by cancer registries because of logistical and financial constraints. Hence, we investigated natural language processing (NLP), enhanced with state-of-the-art deep learning transformer tools and large language models, to automate relapse identification in the text of computed tomography (CT) reports.</p><p><strong>Methods: </strong>We analyzed follow-up CT reports from patients diagnosed with breast cancer between January 1, 2005, and December 31, 2014. The reports were curated and annotated for the presence or absence of local, regional, and distant breast cancer relapses. We performed 10-fold cross-validation to evaluate models identifying different types of relapses in CT reports. Model performance was assessed with classification metrics, reported with 95% confidence intervals.</p><p><strong>Results: </strong>In our data set of 1,445 CT reports, 799 (55.3%) described any relapse, 72 (5.0%) local relapses, 97 (6.7%) regional relapses, and 743 (51.4%) distant relapses. The any-relapse model achieved an accuracy of 89.6% (87.8-91.1), with a sensitivity of 93.2% (91.4-94.9) and a specificity of 84.2% (80.9-87.1). The local relapse model achieved an accuracy of 94.6% (93.3-95.7), a sensitivity of 44.4% (32.8-56.3), and a specificity of 97.2% (96.2-98.0). The regional relapse model showed an accuracy of 93.6% (92.3-94.9), a sensitivity of 70.1% (60.0-79.1), and a specificity of 95.3% (94.2-96.5). Finally, the distant relapse model demonstrated an accuracy of 88.1% (86.2-89.7), a sensitivity of 91.8% (89.9-93.8), and a specificity of 83.7% (80.5-86.4).</p><p><strong>Conclusion: </strong>We developed NLP models to identify local, regional, and distant breast cancer relapses from CT reports. Automating the identification of breast cancer relapses can enhance data collection about patient outcomes.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2400107"},"PeriodicalIF":3.3,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11670918/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142869773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implementation Strategy for Artificial Intelligence in Radiotherapy: Can Implementation Science Help? 人工智能在放射治疗中的实施策略:实施科学有帮助吗?
IF 3.3 Q2 ONCOLOGY Pub Date : 2024-12-01 Epub Date: 2024-12-20 DOI: 10.1200/CCI.24.00101
Rachelle Swart, Liesbeth Boersma, Rianne Fijten, Wouter van Elmpt, Paul Cremers, Maria J G Jacobs

Purpose: Artificial intelligence (AI) applications in radiotherapy (RT) are expected to save time and improve quality, but implementation remains limited. Therefore, we used implementation science to develop a format for designing an implementation strategy for AI. This study aimed to (1) apply this format to develop an AI implementation strategy for our center; (2) identify insights gained to enhance AI implementation using this format; and (3) assess the feasibility and acceptability of this format to design a center-specific implementation strategy for departments aiming to implement AI.

Methods: We created an AI-implementation strategy for our own center using implementation science methods. This included a stakeholder analysis, literature review, and interviews to identify facilitators and barriers, and designed strategies to overcome the barriers. These methods were subsequently used in a workshop with teams from seven Dutch RT centers to develop their own AI-implementation plans. The applicability, appropriateness, and feasibility were evaluated by the workshop participants, and relevant insights for AI implementation were summarized.

Results: The stakeholder analysis identified internal (physicians, physicists, RT technicians, information technology, and education) and external (patients and representatives) stakeholders. Barriers and facilitators included concerns about opacity, privacy, data quality, legal aspects, knowledge, trust, stakeholder involvement, ethics, and multidisciplinary collaboration, all integrated into our implementation strategy. The workshop evaluation showed high acceptability (18 participants [90%]), appropriateness (17 participants [85%]), and feasibility (15 participants [75%]) of the implementation strategy. Sixteen participants fully agreed with the format.

Conclusion: Our study highlights the need for a collaborative approach to implement AI in RT. We designed a strategy to overcome organizational challenges, improve AI integration, and enhance patient care. Workshop feedback indicates the proposed methods are useful for multiple RT centers. Insights gained by applying the methods highlight the importance of multidisciplinary collaboration in the development and implementation of AI.

目的:人工智能(AI)在放射治疗(RT)中的应用有望节省时间和提高质量,但实施仍然有限。因此,我们使用实现科学来开发设计人工智能实现策略的格式。本研究旨在(1)应用此格式为我们的中心制定人工智能实施策略;(2)识别使用此格式增强人工智能实施所获得的见解;(3)评估该格式的可行性和可接受性,为旨在实施人工智能的部门设计特定于中心的实施策略。方法:运用实施科学的方法为我们自己的中心制定了人工智能实施策略。这包括利益相关者分析、文献回顾和访谈,以确定促进因素和障碍,并设计策略来克服障碍。这些方法随后在荷兰七个RT中心的团队的研讨会上使用,以制定他们自己的人工智能实施计划。研讨会参与者评估了适用性、适当性和可行性,并总结了人工智能实施的相关见解。结果:利益相关者分析确定了内部利益相关者(医生、物理学家、RT技术人员、信息技术和教育)和外部利益相关者(患者和代表)。障碍和促进因素包括对不透明、隐私、数据质量、法律方面、知识、信任、利益相关者参与、道德和多学科合作的担忧,这些都纳入了我们的实施战略。工作坊评估显示实施策略的可接受性(18人[90%])、适宜性(17人[85%])和可行性(15人[75%])较高。16位与会者完全同意会议形式。结论:我们的研究强调了在rt中实施人工智能的协作方法的必要性。我们设计了一种策略来克服组织挑战,改善人工智能集成,并加强患者护理。研讨会反馈表明,所提出的方法适用于多个RT中心。通过应用这些方法获得的见解强调了在人工智能的开发和实施中多学科合作的重要性。
{"title":"Implementation Strategy for Artificial Intelligence in Radiotherapy: Can Implementation Science Help?","authors":"Rachelle Swart, Liesbeth Boersma, Rianne Fijten, Wouter van Elmpt, Paul Cremers, Maria J G Jacobs","doi":"10.1200/CCI.24.00101","DOIUrl":"10.1200/CCI.24.00101","url":null,"abstract":"<p><strong>Purpose: </strong>Artificial intelligence (AI) applications in radiotherapy (RT) are expected to save time and improve quality, but implementation remains limited. Therefore, we used implementation science to develop a format for designing an implementation strategy for AI. This study aimed to (1) apply this format to develop an AI implementation strategy for our center; (2) identify insights gained to enhance AI implementation using this format; and (3) assess the feasibility and acceptability of this format to design a center-specific implementation strategy for departments aiming to implement AI.</p><p><strong>Methods: </strong>We created an AI-implementation strategy for our own center using implementation science methods. This included a stakeholder analysis, literature review, and interviews to identify facilitators and barriers, and designed strategies to overcome the barriers. These methods were subsequently used in a workshop with teams from seven Dutch RT centers to develop their own AI-implementation plans. The applicability, appropriateness, and feasibility were evaluated by the workshop participants, and relevant insights for AI implementation were summarized.</p><p><strong>Results: </strong>The stakeholder analysis identified internal (physicians, physicists, RT technicians, information technology, and education) and external (patients and representatives) stakeholders. Barriers and facilitators included concerns about opacity, privacy, data quality, legal aspects, knowledge, trust, stakeholder involvement, ethics, and multidisciplinary collaboration, all integrated into our implementation strategy. The workshop evaluation showed high acceptability (18 participants [90%]), appropriateness (17 participants [85%]), and feasibility (15 participants [75%]) of the implementation strategy. Sixteen participants fully agreed with the format.</p><p><strong>Conclusion: </strong>Our study highlights the need for a collaborative approach to implement AI in RT. We designed a strategy to overcome organizational challenges, improve AI integration, and enhance patient care. Workshop feedback indicates the proposed methods are useful for multiple RT centers. Insights gained by applying the methods highlight the importance of multidisciplinary collaboration in the development and implementation of AI.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2400101"},"PeriodicalIF":3.3,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11670909/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142869774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JCO Clinical Cancer Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1