首页 > 最新文献

JCO Clinical Cancer Informatics最新文献

英文 中文
Erratum: Development of an Automatic Rule-Based Algorithm for the Detection of Ovarian Cancer Recurrence From Electronic Health Records. 勘误:开发基于规则的自动算法,从电子健康记录中检测卵巢癌复发。
IF 4.2 Q2 Medicine Pub Date : 2024-04-01 DOI: 10.1200/CCI.24.00062
{"title":"Erratum: Development of an Automatic Rule-Based Algorithm for the Detection of Ovarian Cancer Recurrence From Electronic Health Records.","authors":"","doi":"10.1200/CCI.24.00062","DOIUrl":"https://doi.org/10.1200/CCI.24.00062","url":null,"abstract":"","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140768639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable Machine Learning Model to Preoperatively Predict Postoperative Complications in Inpatients With Cancer Undergoing Major Operations. 可解释的机器学习模型,用于术前预测接受大手术的癌症住院患者的术后并发症。
IF 4.2 Q2 Medicine Pub Date : 2024-04-01 DOI: 10.1200/cci.23.00247
Matthew C Hernandez, Chen Chen, Andrew Nguyen, Kevin Choong, Cameron Carlin, Rebecca A. Nelson, Lorenzo A. Rossi, Naini S. Seth, Kathy McNeese, Bertram Yuh, Z. Eftekhari, Lily L. Lai
PURPOSEPreoperative prediction of postoperative complications (PCs) in inpatients with cancer is challenging. We developed an explainable machine learning (ML) model to predict PCs in a heterogenous population of inpatients with cancer undergoing same-hospitalization major operations.METHODSConsecutive inpatients who underwent same-hospitalization operations from December 2017 to June 2021 at a single institution were retrospectively reviewed. The ML model was developed and tested using electronic health record (EHR) data to predict 30-day PCs for patients with Clavien-Dindo grade 3 or higher (CD 3+) per the CD classification system. Model performance was assessed using area under the receiver operating characteristic curve (AUROC), area under the precision recall curve (AUPRC), and calibration plots. Model explanation was performed using the Shapley additive explanations (SHAP) method at cohort and individual operation levels.RESULTSA total of 988 operations in 827 inpatients were included. The ML model was trained using 788 operations and tested using a holdout set of 200 operations. The CD 3+ complication rates were 28.6% and 27.5% in the training and holdout test sets, respectively. Training and holdout test sets' model performance in predicting CD 3+ complications yielded an AUROC of 0.77 and 0.73 and an AUPRC of 0.56 and 0.52, respectively. Calibration plots demonstrated good reliability. The SHAP method identified features and the contributions of the features to the risk of PCs.CONCLUSIONWe trained and tested an explainable ML model to predict the risk of developing PCs in patients with cancer. Using patient-specific EHR data, the ML model accurately discriminated the risk of developing CD 3+ complications and displayed top features at the individual operation and cohort level.
目的对癌症患者的术后并发症(PCs)进行术前预测是一项挑战。我们开发了一种可解释的机器学习(ML)模型,用于预测接受同院大手术的异质癌症住院患者的PC。方法对2017年12月至2021年6月期间在一家机构接受同院手术的连续住院患者进行了回顾性回顾。使用电子健康记录(EHR)数据开发并测试了ML模型,以预测根据CD分类系统Clavien-Dindo 3级或以上(CD 3+)患者的30天PCs。使用接收者操作特征曲线下面积(AUROC)、精确召回曲线下面积(AUPRC)和校准图评估模型性能。使用沙普利加法解释(SHAP)方法在队列和单个手术水平上对模型进行了解释。使用 788 例手术对 ML 模型进行了训练,并使用 200 例手术的保留集进行了测试。在训练集和暂缓测试集中,CD 3+ 并发症发生率分别为 28.6% 和 27.5%。训练集和保留测试集的模型在预测 CD 3+ 并发症方面的表现分别为 AUROC 0.77 和 0.73,AUPRC 0.56 和 0.52。校准图显示了良好的可靠性。结论我们训练并测试了一个可解释的 ML 模型,用于预测癌症患者罹患多发性硬化症的风险。通过使用患者特定的电子病历数据,ML 模型准确区分了 CD 3+ 并发症的发病风险,并显示了单个手术和队列水平的首要特征。
{"title":"Explainable Machine Learning Model to Preoperatively Predict Postoperative Complications in Inpatients With Cancer Undergoing Major Operations.","authors":"Matthew C Hernandez, Chen Chen, Andrew Nguyen, Kevin Choong, Cameron Carlin, Rebecca A. Nelson, Lorenzo A. Rossi, Naini S. Seth, Kathy McNeese, Bertram Yuh, Z. Eftekhari, Lily L. Lai","doi":"10.1200/cci.23.00247","DOIUrl":"https://doi.org/10.1200/cci.23.00247","url":null,"abstract":"PURPOSE\u0000Preoperative prediction of postoperative complications (PCs) in inpatients with cancer is challenging. We developed an explainable machine learning (ML) model to predict PCs in a heterogenous population of inpatients with cancer undergoing same-hospitalization major operations.\u0000\u0000\u0000METHODS\u0000Consecutive inpatients who underwent same-hospitalization operations from December 2017 to June 2021 at a single institution were retrospectively reviewed. The ML model was developed and tested using electronic health record (EHR) data to predict 30-day PCs for patients with Clavien-Dindo grade 3 or higher (CD 3+) per the CD classification system. Model performance was assessed using area under the receiver operating characteristic curve (AUROC), area under the precision recall curve (AUPRC), and calibration plots. Model explanation was performed using the Shapley additive explanations (SHAP) method at cohort and individual operation levels.\u0000\u0000\u0000RESULTS\u0000A total of 988 operations in 827 inpatients were included. The ML model was trained using 788 operations and tested using a holdout set of 200 operations. The CD 3+ complication rates were 28.6% and 27.5% in the training and holdout test sets, respectively. Training and holdout test sets' model performance in predicting CD 3+ complications yielded an AUROC of 0.77 and 0.73 and an AUPRC of 0.56 and 0.52, respectively. Calibration plots demonstrated good reliability. The SHAP method identified features and the contributions of the features to the risk of PCs.\u0000\u0000\u0000CONCLUSION\u0000We trained and tested an explainable ML model to predict the risk of developing PCs in patients with cancer. Using patient-specific EHR data, the ML model accurately discriminated the risk of developing CD 3+ complications and displayed top features at the individual operation and cohort level.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140774755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Organizational Breast Cancer Data Mart: A Solution for Assessing Outcomes of Imaging and Treatment. 组织乳腺癌数据集市:评估成像和治疗结果的解决方案。
IF 4.2 Q2 Medicine Pub Date : 2024-04-01 DOI: 10.1200/CCI.23.00193
Margarita L Zuley, Jonathan Silverstein, Durwin Logue, Richard S Morgan, Rohit Bhargava, Priscilla F. McAuliffe, A. Brufsky, Andriy I Bandos, Robert M. Nishikawa
PURPOSEIn the United States, a comprehensive national breast cancer registry (CR) does not exist. Thus, care and coverage decisions are based on data from population subsets, other countries, or models. We report a prototype real-world research data mart to assess mortality, morbidity, and costs for breast cancer diagnosis and treatment.METHODSWith institutional review board approval and Health Insurance Portability and Accountability Act (HIPPA) compliance, a multidisciplinary clinical and research data warehouse (RDW) expert group curated demographic, risk, imaging, pathology, treatment, and outcome data from the electronic health records (EHR), radiology (RIS), and CR for patients having breast imaging and/or a diagnosis of breast cancer in our institution from January 1, 2004, to December 31, 2020. Domains were defined by prebuilt views to extract data denormalized according to requirements from the existing RDW using an export, transform, load pattern. Data dictionaries were included. Structured query language was used for data cleaning.RESULTSFive-hundred eighty-nine elements (EHR 311, RIS 211, and CR 67) were mapped to 27 domains; all, except one containing CR elements, had cancer and noncancer cohort views, resulting in a total of 53 views (average 12 elements/view; range, 4-67). EHR and RIS queries returned 497,218 patients with 2,967,364 imaging examinations and associated visit details. Cancer biology, treatment, and outcome details for 15,619 breast cancer cases were imported from the CR of our primary breast care facility for this prototype mart.CONCLUSIONInstitutional real-world data marts enable comprehensive understanding of care outcomes within an organization. As clinical data sources become increasingly structured, such marts may be an important source for future interinstitution analysis and potentially an opportunity to create robust real-world results that could be used to support evidence-based national policy and care decisions for breast cancer.
目的 在美国,还没有一个全面的全国性乳腺癌登记处(CR)。因此,护理和保险决策都是基于来自人口子集、其他国家或模型的数据。我们报告了一个真实世界研究数据集市原型,用于评估乳腺癌诊断和治疗的死亡率、发病率和成本。方法经机构审查委员会批准并遵守《健康保险可携性和责任法案》(HIPPA),一个多学科临床和研究数据仓库(RDW)专家组从电子健康记录(EHR)、放射学(RIS)和 CR 中收集了本机构 2004 年 1 月 1 日至 2020 年 12 月 31 日期间乳腺成像和/或乳腺癌诊断患者的人口统计学、风险、成像、病理学、治疗和结果数据。域由预建视图定义,以便使用导出、转换、加载模式从现有 RDW 中提取符合要求的去规范化数据。数据字典也包括在内。结果589个元素(EHR 311个、RIS 211个和CR 67个)被映射到27个域;除一个包含CR元素的域外,所有域都有癌症和非癌症队列视图,因此共有53个视图(平均12个元素/视图;范围4-67)。EHR 和 RIS 查询返回了 497,218 名患者的 2,967,364 次成像检查和相关就诊详情。该原型市场从我们主要乳腺医疗机构的 CR 中导入了 15619 个乳腺癌病例的癌症生物学、治疗和结果详情。随着临床数据源的结构化程度越来越高,此类数据集市可能会成为未来机构间分析的重要来源,并有可能成为创建强大的真实世界结果的机会,这些结果可用于支持以证据为基础的乳腺癌国家政策和护理决策。
{"title":"Organizational Breast Cancer Data Mart: A Solution for Assessing Outcomes of Imaging and Treatment.","authors":"Margarita L Zuley, Jonathan Silverstein, Durwin Logue, Richard S Morgan, Rohit Bhargava, Priscilla F. McAuliffe, A. Brufsky, Andriy I Bandos, Robert M. Nishikawa","doi":"10.1200/CCI.23.00193","DOIUrl":"https://doi.org/10.1200/CCI.23.00193","url":null,"abstract":"PURPOSE\u0000In the United States, a comprehensive national breast cancer registry (CR) does not exist. Thus, care and coverage decisions are based on data from population subsets, other countries, or models. We report a prototype real-world research data mart to assess mortality, morbidity, and costs for breast cancer diagnosis and treatment.\u0000\u0000\u0000METHODS\u0000With institutional review board approval and Health Insurance Portability and Accountability Act (HIPPA) compliance, a multidisciplinary clinical and research data warehouse (RDW) expert group curated demographic, risk, imaging, pathology, treatment, and outcome data from the electronic health records (EHR), radiology (RIS), and CR for patients having breast imaging and/or a diagnosis of breast cancer in our institution from January 1, 2004, to December 31, 2020. Domains were defined by prebuilt views to extract data denormalized according to requirements from the existing RDW using an export, transform, load pattern. Data dictionaries were included. Structured query language was used for data cleaning.\u0000\u0000\u0000RESULTS\u0000Five-hundred eighty-nine elements (EHR 311, RIS 211, and CR 67) were mapped to 27 domains; all, except one containing CR elements, had cancer and noncancer cohort views, resulting in a total of 53 views (average 12 elements/view; range, 4-67). EHR and RIS queries returned 497,218 patients with 2,967,364 imaging examinations and associated visit details. Cancer biology, treatment, and outcome details for 15,619 breast cancer cases were imported from the CR of our primary breast care facility for this prototype mart.\u0000\u0000\u0000CONCLUSION\u0000Institutional real-world data marts enable comprehensive understanding of care outcomes within an organization. As clinical data sources become increasingly structured, such marts may be an important source for future interinstitution analysis and potentially an opportunity to create robust real-world results that could be used to support evidence-based national policy and care decisions for breast cancer.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140788135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning-Based Survival Prediction Models for Progression-Free and Overall Survival in Advanced-Stage Hodgkin Lymphoma. 基于机器学习的晚期霍奇金淋巴瘤无进展生存期和总生存期预测模型
IF 4.2 Q2 Medicine Pub Date : 2024-04-01 DOI: 10.1200/CCI.23.00255
R. Rask Kragh Jørgensen, Fanny Bergström, S. Eloranta, M. Tang Severinsen, K. Bjøro Smeland, Alexander Fosså, J. Haaber Christensen, Martin Hutchings, Rasmus Bo Dahl-Sørensen, P. Kamper, I. Glimelius, Karin E Smedby, Susan K Parsons, Angie Mae Rodday, Matthew J Maurer, Andrew M Evens, Tarec C El-Galaly, L. Hjort Jakobsen
PURPOSEPatients diagnosed with advanced-stage Hodgkin lymphoma (aHL) have historically been risk-stratified using the International Prognostic Score (IPS). This study investigated if a machine learning (ML) approach could outperform existing models when it comes to predicting overall survival (OS) and progression-free survival (PFS).PATIENTS AND METHODSThis study used patient data from the Danish National Lymphoma Register for model development (development cohort). The ML model was developed using stacking, which combines several predictive survival models (Cox proportional hazard, flexible parametric model, IPS, principal component, penalized regression) into a single model, and was compared with two versions of IPS (IPS-3 and IPS-7) and the newly developed aHL international prognostic index (A-HIPI). Internal model validation was performed using nested cross-validation, and external validation was performed using patient data from the Swedish Lymphoma Register and Cancer Registry of Norway (validation cohort).RESULTSIn total, 707 and 760 patients with aHL were included in the development and validation cohorts, respectively. Examining model performance for OS in the development cohort, the concordance index (C-index) for the ML model, IPS-7, IPS-3, and A-HIPI was found to be 0.789, 0.608, 0.650, and 0.768, respectively. The corresponding estimates in the validation cohort were 0.749, 0.700, 0.663, and 0.741. For PFS, the ML model achieved the highest C-index in both cohorts (0.665 in the development cohort and 0.691 in the validation cohort). The time-varying AUCs for both the ML model and the A-HIPI were consistently higher in both cohorts compared with the IPS models within the first 5 years after diagnosis.CONCLUSIONThe new prognostic model for aHL on the basis of ML techniques demonstrated a substantial improvement compared with the IPS models, but yielded a limited improvement in predictive performance compared with the A-HIPI.
目的诊断为晚期霍奇金淋巴瘤(aHL)的患者历来使用国际预后评分(IPS)进行风险分级。本研究调查了机器学习(ML)方法在预测总生存期(OS)和无进展生存期(PFS)方面是否优于现有模型。ML 模型采用堆叠法开发,将多个预测生存模型(Cox 比例危险模型、灵活参数模型、IPS、主成分、惩罚回归)合并为一个模型,并与两个版本的 IPS(IPS-3 和 IPS-7)和新开发的 aHL 国际预后指数(A-HIPI)进行比较。模型内部验证采用嵌套交叉验证法,外部验证采用瑞典淋巴瘤登记处和挪威癌症登记处的患者数据(验证队列)。在对开发队列中的 OS 模型性能进行检查时发现,ML 模型、IPS-7、IPS-3 和 A-HIPI 的一致性指数(C-index)分别为 0.789、0.608、0.650 和 0.768。验证队列中的相应估计值分别为 0.749、0.700、0.663 和 0.741。就 PFS 而言,ML 模型在两个队列中都获得了最高的 C 指数(开发队列为 0.665,验证队列为 0.691)。结论与 IPS 模型相比,基于 ML 技术的 aHL 新预后模型有了很大改进,但与 A-HIPI 相比,其预测性能的改进有限。
{"title":"Machine Learning-Based Survival Prediction Models for Progression-Free and Overall Survival in Advanced-Stage Hodgkin Lymphoma.","authors":"R. Rask Kragh Jørgensen, Fanny Bergström, S. Eloranta, M. Tang Severinsen, K. Bjøro Smeland, Alexander Fosså, J. Haaber Christensen, Martin Hutchings, Rasmus Bo Dahl-Sørensen, P. Kamper, I. Glimelius, Karin E Smedby, Susan K Parsons, Angie Mae Rodday, Matthew J Maurer, Andrew M Evens, Tarec C El-Galaly, L. Hjort Jakobsen","doi":"10.1200/CCI.23.00255","DOIUrl":"https://doi.org/10.1200/CCI.23.00255","url":null,"abstract":"PURPOSE\u0000Patients diagnosed with advanced-stage Hodgkin lymphoma (aHL) have historically been risk-stratified using the International Prognostic Score (IPS). This study investigated if a machine learning (ML) approach could outperform existing models when it comes to predicting overall survival (OS) and progression-free survival (PFS).\u0000\u0000\u0000PATIENTS AND METHODS\u0000This study used patient data from the Danish National Lymphoma Register for model development (development cohort). The ML model was developed using stacking, which combines several predictive survival models (Cox proportional hazard, flexible parametric model, IPS, principal component, penalized regression) into a single model, and was compared with two versions of IPS (IPS-3 and IPS-7) and the newly developed aHL international prognostic index (A-HIPI). Internal model validation was performed using nested cross-validation, and external validation was performed using patient data from the Swedish Lymphoma Register and Cancer Registry of Norway (validation cohort).\u0000\u0000\u0000RESULTS\u0000In total, 707 and 760 patients with aHL were included in the development and validation cohorts, respectively. Examining model performance for OS in the development cohort, the concordance index (C-index) for the ML model, IPS-7, IPS-3, and A-HIPI was found to be 0.789, 0.608, 0.650, and 0.768, respectively. The corresponding estimates in the validation cohort were 0.749, 0.700, 0.663, and 0.741. For PFS, the ML model achieved the highest C-index in both cohorts (0.665 in the development cohort and 0.691 in the validation cohort). The time-varying AUCs for both the ML model and the A-HIPI were consistently higher in both cohorts compared with the IPS models within the first 5 years after diagnosis.\u0000\u0000\u0000CONCLUSION\u0000The new prognostic model for aHL on the basis of ML techniques demonstrated a substantial improvement compared with the IPS models, but yielded a limited improvement in predictive performance compared with the A-HIPI.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140785070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification and Characterization of Immune Checkpoint Inhibitor-Induced Toxicities From Electronic Health Records Using Natural Language Processing. 利用自然语言处理技术从电子健康记录中识别和描述免疫检查点抑制剂引发的毒性。
IF 4.2 Q2 Medicine Pub Date : 2024-04-01 DOI: 10.1200/CCI.23.00151
Hannah Barman, Sriram Venkateswaran, Antonio Del Santo, Unice Yoo, Eli Silvert, Krishna Rao, Bharathwaj Raghunathan, Lisa A Kottschade, Matthew S Block, G Scott Chandler, Joshua Zalis, Tyler E Wagner, Rajat Mohindra

Purpose: Immune checkpoint inhibitors (ICIs) have revolutionized cancer treatment, yet their use is associated with immune-related adverse events (irAEs). Estimating the prevalence and patient impact of these irAEs in the real-world data setting is critical for characterizing the benefit/risk profile of ICI therapies beyond the clinical trial population. Diagnosis codes, such as International Classification of Diseases codes, do not comprehensively illustrate a patient's care journey and offer no insight into drug-irAE causality. This study aims to capture the relationship between ICIs and irAEs more accurately by using augmented curation (AC), a natural language processing-based innovation, on unstructured data in electronic health records.

Methods: In a cohort of 9,290 patients treated with ICIs at Mayo Clinic from 2005 to 2021, we compared the prevalence of irAEs using diagnosis codes and AC models, which classify drug-irAE pairs in clinical notes with implied textual causality. Four illustrative irAEs with high patient impact-myocarditis, encephalitis, pneumonitis, and severe cutaneous adverse reactions, abbreviated as MEPS-were analyzed using corticosteroid administration and ICI discontinuation as proxies of severity.

Results: For MEPS, only 70% (n = 118) of patients found by AC were also identified by diagnosis codes. Using AC models, patients with MEPS received corticosteroids for their respective irAE 82% of the time and permanently discontinued the ICI because of the irAE 35.9% (n = 115) of the time.

Conclusion: Overall, AC models enabled more accurate identification and assessment of patient impact of ICI-induced irAEs not found using diagnosis codes, demonstrating a novel and more efficient strategy to assess real-world clinical outcomes in patients treated with ICIs.

目的:免疫检查点抑制剂(ICIs)给癌症治疗带来了革命性的变化,但其使用与免疫相关不良事件(irAEs)有关。在真实世界的数据环境中估计这些 irAEs 的发生率和对患者的影响对于描述 ICI 疗法在临床试验人群之外的收益/风险概况至关重要。诊断代码(如国际疾病分类代码)无法全面说明患者的治疗过程,也无法深入了解药物与 irAE 的因果关系。本研究旨在通过对电子健康记录中的非结构化数据使用基于自然语言处理的创新技术--增强策展(AC),更准确地捕捉 ICIs 和 irAEs 之间的关系:在梅奥诊所 2005 年至 2021 年接受 ICIs 治疗的 9290 名患者队列中,我们使用诊断代码和 AC 模型比较了 irAEs 的发生率。我们使用皮质类固醇给药和 ICI 停药作为严重程度的代用指标,分析了四种对患者影响较大的示例性 irAE--心肌炎、脑炎、肺炎和严重皮肤不良反应(简称 MEPS):就 MEPS 而言,只有 70% 的 AC 患者(n = 118)还能通过诊断代码进行识别。使用 AC 模型,82% 的 MEPS 患者因各自的虹膜急性心动过速而接受皮质类固醇治疗,35.9% 的患者(n = 115)因虹膜急性心动过速而永久停用 ICI:总之,AC 模型能够更准确地识别和评估 ICI 引起的 irAEs 对患者的影响,而诊断代码则无法识别和评估 ICI 引起的 irAEs 对患者的影响。
{"title":"Identification and Characterization of Immune Checkpoint Inhibitor-Induced Toxicities From Electronic Health Records Using Natural Language Processing.","authors":"Hannah Barman, Sriram Venkateswaran, Antonio Del Santo, Unice Yoo, Eli Silvert, Krishna Rao, Bharathwaj Raghunathan, Lisa A Kottschade, Matthew S Block, G Scott Chandler, Joshua Zalis, Tyler E Wagner, Rajat Mohindra","doi":"10.1200/CCI.23.00151","DOIUrl":"10.1200/CCI.23.00151","url":null,"abstract":"<p><strong>Purpose: </strong>Immune checkpoint inhibitors (ICIs) have revolutionized cancer treatment, yet their use is associated with immune-related adverse events (irAEs). Estimating the prevalence and patient impact of these irAEs in the real-world data setting is critical for characterizing the benefit/risk profile of ICI therapies beyond the clinical trial population. Diagnosis codes, such as International Classification of Diseases codes, do not comprehensively illustrate a patient's care journey and offer no insight into drug-irAE causality. This study aims to capture the relationship between ICIs and irAEs more accurately by using augmented curation (AC), a natural language processing-based innovation, on unstructured data in electronic health records.</p><p><strong>Methods: </strong>In a cohort of 9,290 patients treated with ICIs at Mayo Clinic from 2005 to 2021, we compared the prevalence of irAEs using diagnosis codes and AC models, which classify drug-irAE pairs in clinical notes with implied textual causality. Four illustrative irAEs with high patient impact-myocarditis, encephalitis, pneumonitis, and severe cutaneous adverse reactions, abbreviated as MEPS-were analyzed using corticosteroid administration and ICI discontinuation as proxies of severity.</p><p><strong>Results: </strong>For MEPS, only 70% (n = 118) of patients found by AC were also identified by diagnosis codes. Using AC models, patients with MEPS received corticosteroids for their respective irAE 82% of the time and permanently discontinued the ICI because of the irAE 35.9% (n = 115) of the time.</p><p><strong>Conclusion: </strong>Overall, AC models enabled more accurate identification and assessment of patient impact of ICI-induced irAEs not found using diagnosis codes, demonstrating a novel and more efficient strategy to assess real-world clinical outcomes in patients treated with ICIs.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11161244/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140874915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Service Evaluation of MyChristie-MyHealth, an Electronic Patient-Reported Outcome Measure Integrated Into Clinical Cancer Care. MyChristie-MyHealth 的服务评估--一种整合到临床癌症护理中的患者报告结果电子测量方法。
IF 4.2 Q2 Medicine Pub Date : 2024-04-01 DOI: 10.1200/CCI.23.00162
Lee A Shipman, James Price, D. Abdulwahid, N. Bayman, Fiona H Blackhall, Raffaele Califano, C. Chan, J. Coote, Marie Eaton, Jacqueline Fenemore, Fabio Gomes, Margaret Harris, E. Halkyard, Colin Lindsay, H. Neal, D. Mcentee, H. Sheikh, Y. Summers, Paul Taylor, David Woolf, Janelle Yorke, Corinne Faivre-Finn
PURPOSEElectronic patient-reported outcome measures (ePROMs) are digitalized health questionnaires used to gauge patients' subjective experience of health and disease. They are becoming prevalent in cancer care and have been linked to a host of benefits including improved survival. MyChristie-MyHealth is the ePROM established at the Christie NHS Foundation Trust in 2019. We conducted an evaluation of this service to understand user experiences, as well as strategies to improve its functioning.METHODSData collection: Patients who had opted never to complete MyChristie-MyHealth (n = 87), and those who had completed at least one (n = 87) were identified. Demographic data included age, sex, ethnicity, postcode, diagnosis, treatment intent, and trial status. Semistructured interviews were held with noncompleters (n = 30) and completers (n = 31) of MyChristie-MyHealth, as well as clinician users (n = 6), covering themes such as accessibility, acceptability and usefulness, and open discourse on ways in which the service could be improved.RESULTSNoncompleters of MyChristie-MyHealth were older (median age 72 v 66 years, P = .005), receiving treatment with curative rather than palliative intent (odds ratio [OR], 1.45; P = .045), and less likely to be enrolled on a clinical trial (OR, 0.531; P = .011). They were less likely to own a smartphone (33% v 97%) or have reliable Internet access (45% v 100%). Satisfaction with MyChristie-MyHealth was high in both groups: 93% (n = 29) of completers and 87% (n = 26) noncompleters felt generally happy to complete. Completers of MyChristie-MyHealth wanted their results to be acknowledged by their clinicians. Clinicians wanted results to be displayed in a more user-friendly way.CONCLUSIONWe have broadly characterized noncompleters of the Christie ePROM to identify those in need of extra support or encouragement in the clinic. An action plan resulting from this review has been compiled and will inform the future development of MyChristie-MyHealth.
目的电子患者报告结果测量法(ePROM)是一种数字化健康问卷,用于评估患者对健康和疾病的主观感受。它们在癌症护理中越来越普遍,并与包括提高存活率在内的一系列益处相关联。MyChristie-MyHealth 是克里斯蒂国家医疗服务系统基金会信托基金于 2019 年建立的 ePROM。我们对这项服务进行了评估,以了解用户体验以及改善其功能的策略:对选择从未完成 MyChristie-MyHealth 的患者(87 人)和至少完成一次的患者(87 人)进行了识别。人口统计学数据包括年龄、性别、种族、邮编、诊断、治疗意向和试验状态。对 "我的基督徒-我的健康 "的未完成者(30 人)和完成者(31 人)以及临床医生用户(6 人)进行了半结构式访谈,访谈主题包括可访问性、可接受性和实用性,并就改进服务的方法进行了公开讨论。结果未使用 MyChristie-MyHealth 的患者年龄较大(中位年龄为 72 岁对 66 岁,P = .005),接受的是治疗性而非姑息性治疗(几率比 [OR],1.45;P = .045),并且不太可能参加临床试验(OR,0.531;P = .011)。他们拥有智能手机(33% 对 97%)或能可靠上网(45% 对 100%)的可能性较低。两组人对 MyChristie-MyHealth 的满意度都很高:93%(n = 29)的完成者和 87%(n = 26)的未完成者对完成 MyChristie-MyHealth 总体感到满意。MyChristie-MyHealth 的完成者希望他们的结果能得到临床医生的认可。结论我们对未完成克里斯蒂电子PROM者的特征进行了概括,以确定那些在临床上需要额外支持或鼓励的人。根据此次审查制定的行动计划将为 "我的克里斯蒂-我的健康 "的未来发展提供参考。
{"title":"Service Evaluation of MyChristie-MyHealth, an Electronic Patient-Reported Outcome Measure Integrated Into Clinical Cancer Care.","authors":"Lee A Shipman, James Price, D. Abdulwahid, N. Bayman, Fiona H Blackhall, Raffaele Califano, C. Chan, J. Coote, Marie Eaton, Jacqueline Fenemore, Fabio Gomes, Margaret Harris, E. Halkyard, Colin Lindsay, H. Neal, D. Mcentee, H. Sheikh, Y. Summers, Paul Taylor, David Woolf, Janelle Yorke, Corinne Faivre-Finn","doi":"10.1200/CCI.23.00162","DOIUrl":"https://doi.org/10.1200/CCI.23.00162","url":null,"abstract":"PURPOSE\u0000Electronic patient-reported outcome measures (ePROMs) are digitalized health questionnaires used to gauge patients' subjective experience of health and disease. They are becoming prevalent in cancer care and have been linked to a host of benefits including improved survival. MyChristie-MyHealth is the ePROM established at the Christie NHS Foundation Trust in 2019. We conducted an evaluation of this service to understand user experiences, as well as strategies to improve its functioning.\u0000\u0000\u0000METHODS\u0000Data collection: Patients who had opted never to complete MyChristie-MyHealth (n = 87), and those who had completed at least one (n = 87) were identified. Demographic data included age, sex, ethnicity, postcode, diagnosis, treatment intent, and trial status. Semistructured interviews were held with noncompleters (n = 30) and completers (n = 31) of MyChristie-MyHealth, as well as clinician users (n = 6), covering themes such as accessibility, acceptability and usefulness, and open discourse on ways in which the service could be improved.\u0000\u0000\u0000RESULTS\u0000Noncompleters of MyChristie-MyHealth were older (median age 72 v 66 years, P = .005), receiving treatment with curative rather than palliative intent (odds ratio [OR], 1.45; P = .045), and less likely to be enrolled on a clinical trial (OR, 0.531; P = .011). They were less likely to own a smartphone (33% v 97%) or have reliable Internet access (45% v 100%). Satisfaction with MyChristie-MyHealth was high in both groups: 93% (n = 29) of completers and 87% (n = 26) noncompleters felt generally happy to complete. Completers of MyChristie-MyHealth wanted their results to be acknowledged by their clinicians. Clinicians wanted results to be displayed in a more user-friendly way.\u0000\u0000\u0000CONCLUSION\u0000We have broadly characterized noncompleters of the Christie ePROM to identify those in need of extra support or encouragement in the clinic. An action plan resulting from this review has been compiled and will inform the future development of MyChristie-MyHealth.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140789211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Association Between Body Composition and Survival in Patients With Gastroesophageal Adenocarcinoma: An Automated Deep Learning Approach. 胃食管腺癌患者身体成分与存活率之间的关系:一种自动深度学习方法
IF 4.2 Q2 Medicine Pub Date : 2024-04-01 DOI: 10.1200/CCI.23.00231
M. Jung, T. Diallo, Tobias Scheef, Marco Reisert, Alexander Rau, Maximilan F Russe, Fabian Bamberg, Stefan Fichtner-Feigl, M. Quante, Jakob Weiss
PURPOSEBody composition (BC) may play a role in outcome prognostication in patients with gastroesophageal adenocarcinoma (GEAC). Artificial intelligence provides new possibilities to opportunistically quantify BC from computed tomography (CT) scans. We developed a deep learning (DL) model for fully automatic BC quantification on routine staging CTs and determined its prognostic role in a clinical cohort of patients with GEAC.MATERIALS AND METHODSWe developed and tested a DL model to quantify BC measures defined as subcutaneous and visceral adipose tissue (VAT) and skeletal muscle on routine CT and investigated their prognostic value in a cohort of patients with GEAC using baseline, 3-6-month, and 6-12-month postoperative CTs. Primary outcome was all-cause mortality, and secondary outcome was disease-free survival (DFS). Cox regression assessed the association between (1) BC at baseline and mortality and (2) the decrease in BC between baseline and follow-up scans and mortality/DFS.RESULTSModel performance was high with Dice coefficients ≥0.94 ± 0.06. Among 299 patients with GEAC (age 63.0 ± 10.7 years; 19.4% female), 140 deaths (47%) occurred over a median follow-up of 31.3 months. At baseline, no BC measure was associated with DFS. Only a substantial decrease in VAT >70% after a 6- to 12-month follow-up was associated with mortality (hazard ratio [HR], 1.99 [95% CI, 1.18 to 3.34]; P = .009) and DFS (HR, 1.73 [95% CI, 1.01 to 2.95]; P = .045) independent of age, sex, BMI, Union for International Cancer Control stage, histologic grading, resection status, neoadjuvant therapy, and time between surgery and follow-up CT.CONCLUSIONDL enables opportunistic estimation of BC from routine staging CT to quantify prognostic information. In patients with GEAC, only a substantial decrease of VAT 6-12 months postsurgery was an independent predictor for DFS beyond traditional risk factors, which may help to identify individuals at high risk who go otherwise unnoticed.
目的身体成分(BC)可能会对胃食管腺癌(GEAC)患者的预后结果产生影响。人工智能为从计算机断层扫描(CT)扫描中适时量化BC提供了新的可能性。我们开发并测试了一种深度学习(DL)模型,用于在常规分期 CT 上全自动量化 BC,并利用基线、3-6 个月和术后 6-12 个月的 CT 在 GEAC 患者队列中确定其预后作用。主要结果是全因死亡率,次要结果是无病生存期(DFS)。Cox回归评估了(1)基线BC与死亡率之间的关系;(2)基线与随访扫描之间BC的下降与死亡率/DFS之间的关系。在 299 名 GEAC 患者(年龄为 63.0 ± 10.7 岁;19.4% 为女性)中,有 140 人(47%)在中位 31.3 个月的随访期间死亡。基线时,没有任何BC指标与DFS相关。只有在随访 6 至 12 个月后 VAT 大幅下降 >70% 才与死亡率(危险比 [HR],1.99 [95% CI,1.18 至 3.34];P = .009)和 DFS(HR,1.73 [95% CI,1.01 至 2.95];P = .结论DL能根据常规分期CT对BC进行机会性估计,量化预后信息。在GEAC患者中,只有术后6-12个月VAT的大幅下降才是超越传统风险因素的DFS独立预测因素,这可能有助于识别那些未被注意的高危人群。
{"title":"Association Between Body Composition and Survival in Patients With Gastroesophageal Adenocarcinoma: An Automated Deep Learning Approach.","authors":"M. Jung, T. Diallo, Tobias Scheef, Marco Reisert, Alexander Rau, Maximilan F Russe, Fabian Bamberg, Stefan Fichtner-Feigl, M. Quante, Jakob Weiss","doi":"10.1200/CCI.23.00231","DOIUrl":"https://doi.org/10.1200/CCI.23.00231","url":null,"abstract":"PURPOSE\u0000Body composition (BC) may play a role in outcome prognostication in patients with gastroesophageal adenocarcinoma (GEAC). Artificial intelligence provides new possibilities to opportunistically quantify BC from computed tomography (CT) scans. We developed a deep learning (DL) model for fully automatic BC quantification on routine staging CTs and determined its prognostic role in a clinical cohort of patients with GEAC.\u0000\u0000\u0000MATERIALS AND METHODS\u0000We developed and tested a DL model to quantify BC measures defined as subcutaneous and visceral adipose tissue (VAT) and skeletal muscle on routine CT and investigated their prognostic value in a cohort of patients with GEAC using baseline, 3-6-month, and 6-12-month postoperative CTs. Primary outcome was all-cause mortality, and secondary outcome was disease-free survival (DFS). Cox regression assessed the association between (1) BC at baseline and mortality and (2) the decrease in BC between baseline and follow-up scans and mortality/DFS.\u0000\u0000\u0000RESULTS\u0000Model performance was high with Dice coefficients ≥0.94 ± 0.06. Among 299 patients with GEAC (age 63.0 ± 10.7 years; 19.4% female), 140 deaths (47%) occurred over a median follow-up of 31.3 months. At baseline, no BC measure was associated with DFS. Only a substantial decrease in VAT >70% after a 6- to 12-month follow-up was associated with mortality (hazard ratio [HR], 1.99 [95% CI, 1.18 to 3.34]; P = .009) and DFS (HR, 1.73 [95% CI, 1.01 to 2.95]; P = .045) independent of age, sex, BMI, Union for International Cancer Control stage, histologic grading, resection status, neoadjuvant therapy, and time between surgery and follow-up CT.\u0000\u0000\u0000CONCLUSION\u0000DL enables opportunistic estimation of BC from routine staging CT to quantify prognostic information. In patients with GEAC, only a substantial decrease of VAT 6-12 months postsurgery was an independent predictor for DFS beyond traditional risk factors, which may help to identify individuals at high risk who go otherwise unnoticed.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140766512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integration of Telemedicine Consultation Into a Tertiary Radiation Oncology Department: Predictors of Use, Treatment Yield, and Effects on Patient Population. 将远程医疗咨询纳入三级放射肿瘤科:使用预测因素、治疗效果及对患者群体的影响。
IF 4.2 Q2 Medicine Pub Date : 2024-04-01 DOI: 10.1200/CCI.23.00239
Y. Sharifzadeh, William G Breen, W. S. Harmsen, A. Amundson, Allison E Garda, D. Routman, M. Waddle, Kenneth W Merrell, C. Hallemeier, Nadia N. Laack, Anantha Kollengode, Kimberly S Corbin
PURPOSEThe COVID-19 pandemic led to rapid expansion of telemedicine. The implications of telemedicine have not been rigorously studied in radiation oncology, a procedural specialty. This study aimed to evaluate the characteristics of in-person patients (IPPs) and virtual patients (VPs) who presented to a large cancer center before and during the pandemic and to understand variables affecting likelihood of receiving radiotherapy (yield) at our institution.METHODSA total of 17,915 patients presenting for new consultation between 2019 and 2021 were included, stratified by prepandemic and pandemic periods starting March 24, 2020. Telemedicine visits included video and telephone calls. Area deprivation indices (ADIs) were also compared.RESULTSThe overall population was 56% male and 93% White with mean age of 63 years. During the pandemic, VPs accounted for 21% of visits, were on average younger than their in-person (IP) counterparts (63.3 years IP v 62.4 VP), and lived further away from clinic (215 miles IP v 402 VP). Among treated VPs, living closer to clinic was associated with higher yield (odds ratio [OR], 0.95; P < .001). This was also seen among IPPs who received treatment (OR, 0.96; P < .001); however, the average distance from clinic was significantly lower for IPPs than VPs (205 miles IP v 349 VP). Specialized radiotherapy (proton and brachytherapy) was used more in VPs. IPPs had higher ADI than VPs. Among VPs, those treated had higher ADI (P < .001).CONCLUSIONPatient characteristics and yield were significantly different between IPPs and VPs. Telemedicine increased reach to patients further away from clinic, including from rural or health care-deprived areas, allowing access to specialized radiation oncology care. Telemedicine has the potential to increase the reach of other technical and procedural specialties.
目的COVID-19 大流行导致远程医疗迅速发展。远程医疗对放射肿瘤学这一程序性专科的影响尚未得到严格研究。本研究旨在评估在大流行之前和期间到一家大型癌症中心就诊的亲诊患者(IPPs)和虚拟患者(VPs)的特征,并了解影响在本机构接受放射治疗(产量)的可能性的变量。方法纳入了2019年至2021年期间新就诊的17915名患者,按大流行前和2020年3月24日开始的大流行期间进行分层。远程医疗就诊包括视频和电话。结果总体人群中 56% 为男性,93% 为白人,平均年龄为 63 岁。在大流行期间,自愿者占就诊人数的 21%,平均年龄比亲自就诊者(IP)更年轻(IP 为 63.3 岁,自愿者为 62.4 岁),居住地离诊所更远(IP 为 215 英里,自愿者为 402 英里)。在接受治疗的自愿者中,居住地离诊所越近,收益率越高(几率比 [OR],0.95;P < .001)。在接受治疗的 IPP 患者中也出现了这种情况(OR,0.96;P < .001);但 IPP 患者的平均诊所距离明显低于 VP 患者(IP 205 英里对 VP 349 英里)。专业放疗(质子和近距离放射治疗)在自愿接受治疗者中使用较多。IPP 的 ADI 比 VP 高。在 VPs 中,接受治疗者的 ADI 较高(P < .001)。远程医疗扩大了对距离诊所较远的患者的覆盖范围,包括农村或医疗条件较差地区的患者,使他们能够获得专业的肿瘤放射治疗。远程医疗有可能扩大其他技术和程序专科的覆盖范围。
{"title":"Integration of Telemedicine Consultation Into a Tertiary Radiation Oncology Department: Predictors of Use, Treatment Yield, and Effects on Patient Population.","authors":"Y. Sharifzadeh, William G Breen, W. S. Harmsen, A. Amundson, Allison E Garda, D. Routman, M. Waddle, Kenneth W Merrell, C. Hallemeier, Nadia N. Laack, Anantha Kollengode, Kimberly S Corbin","doi":"10.1200/CCI.23.00239","DOIUrl":"https://doi.org/10.1200/CCI.23.00239","url":null,"abstract":"PURPOSE\u0000The COVID-19 pandemic led to rapid expansion of telemedicine. The implications of telemedicine have not been rigorously studied in radiation oncology, a procedural specialty. This study aimed to evaluate the characteristics of in-person patients (IPPs) and virtual patients (VPs) who presented to a large cancer center before and during the pandemic and to understand variables affecting likelihood of receiving radiotherapy (yield) at our institution.\u0000\u0000\u0000METHODS\u0000A total of 17,915 patients presenting for new consultation between 2019 and 2021 were included, stratified by prepandemic and pandemic periods starting March 24, 2020. Telemedicine visits included video and telephone calls. Area deprivation indices (ADIs) were also compared.\u0000\u0000\u0000RESULTS\u0000The overall population was 56% male and 93% White with mean age of 63 years. During the pandemic, VPs accounted for 21% of visits, were on average younger than their in-person (IP) counterparts (63.3 years IP v 62.4 VP), and lived further away from clinic (215 miles IP v 402 VP). Among treated VPs, living closer to clinic was associated with higher yield (odds ratio [OR], 0.95; P < .001). This was also seen among IPPs who received treatment (OR, 0.96; P < .001); however, the average distance from clinic was significantly lower for IPPs than VPs (205 miles IP v 349 VP). Specialized radiotherapy (proton and brachytherapy) was used more in VPs. IPPs had higher ADI than VPs. Among VPs, those treated had higher ADI (P < .001).\u0000\u0000\u0000CONCLUSION\u0000Patient characteristics and yield were significantly different between IPPs and VPs. Telemedicine increased reach to patients further away from clinic, including from rural or health care-deprived areas, allowing access to specialized radiation oncology care. Telemedicine has the potential to increase the reach of other technical and procedural specialties.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140763654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Methodology for Using Real-World Data From Electronic Health Records to Assess Chemotherapy Administration in Women With Breast Cancer. 使用电子健康记录真实数据评估乳腺癌妇女化疗管理的方法。
IF 4.2 Q2 Medicine Pub Date : 2024-04-01 DOI: 10.1200/CCI.23.00209
Jenna Bhimani, K. O'Connell, I. Ergas, Marilyn J. Foley, Grace B Gallagher, Jennifer J Griggs, Narre Heon, Tatjana Kolevska, Yuriy Kotsurovskyy, Candyce H Kroenke, Cecile A Laurent, Raymond Liu, Kanichi G Nakata, Sonia Persaud, Donna R Rivera, Janise M. Roh, Sara M. Tabatabai, Emily Valice, Erin J A Bowles, Elisa V Bandera, Lawrence H Kushi, Elizabeth D Kantor
PURPOSEIdentification of patients' intended chemotherapy regimens is critical to most research questions conducted in the real-world setting of cancer care. Yet, these data are not routinely available in electronic health records (EHRs) at the specificity required to address these questions. We developed a methodology to identify patients' intended regimens from EHR data in the Optimal Breast Cancer Chemotherapy Dosing (OBCD) study.METHODSIn women older than 18 years, diagnosed with primary stage I-IIIA breast cancer at Kaiser Permanente Northern California (2006-2019), we categorized participants into 24 drug combinations described in National Comprehensive Cancer Network guidelines for breast cancer treatment. Participants were categorized into 50 guideline chemotherapy administration schedules within these combinations using an iterative algorithm process, followed by chart abstraction where necessary. We also identified patients intended to receive nonguideline administration schedules within guideline drug combinations and nonguideline drug combinations. This process was adapted at Kaiser Permanente Washington using abstracted data (2004-2015).RESULTSIn the OBCD cohort, 13,231 women received adjuvant or neoadjuvant chemotherapy, of whom 10,213 (77%) had their intended regimen identified via the algorithm, 2,416 (18%) had their intended regimen identified via abstraction, and 602 (4.5%) could not be identified. Across guideline drug combinations, 111 nonguideline dosing schedules were used, alongside 61 nonguideline drug combinations. A number of factors were associated with requiring abstraction for regimen determination, including: decreasing neighborhood household income, earlier diagnosis year, later stage, nodal status, and human epidermal growth factor receptor 2 (HER2)+ status.CONCLUSIONWe describe the challenges and approaches to operationalize complex, real-world data to identify intended chemotherapy regimens in large, observational studies. This methodology can improve efficiency of use of large-scale clinical data in real-world populations, helping answer critical questions to improve care delivery and patient outcomes.
目的确定患者的预期化疗方案对于在癌症治疗的现实环境中开展的大多数研究问题至关重要。然而,电子健康记录(EHR)中的这些数据并不具备解决这些问题所需的常规特异性。我们开发了一种方法,从最佳乳腺癌化疗剂量(OBCD)研究中的电子病历数据中确定患者的预期治疗方案。方法在北加州凯撒医疗中心(Kaiser Permanente Northern California,2006-2019 年)被诊断为原发性 I-IIIA 期乳腺癌的 18 岁以上女性中,我们将参与者分为国家综合癌症网络乳腺癌治疗指南中描述的 24 种药物组合。在这些药物组合中,我们采用迭代算法将参与者分为 50 个指南化疗给药计划,必要时还进行了病历摘录。我们还在指南药物组合和非指南药物组合中确定了打算接受非指南给药方案的患者。结果 在 OBCD 队列中,13231 名女性接受了辅助化疗或新辅助化疗,其中 10213 人(77%)通过算法确定了预定方案,2416 人(18%)通过病历摘要确定了预定方案,602 人(4.5%)无法确定。在指南药物组合中,使用了 111 种非指南剂量表,以及 61 种非指南药物组合。一些因素与需要抽取数据以确定治疗方案有关,其中包括:社区家庭收入减少、诊断年份较早、分期较晚、结节状态和人类表皮生长因子受体 2 (HER2)+ 状态。这种方法可以提高真实世界人群中大规模临床数据的使用效率,帮助回答关键问题,改善医疗服务和患者预后。
{"title":"Methodology for Using Real-World Data From Electronic Health Records to Assess Chemotherapy Administration in Women With Breast Cancer.","authors":"Jenna Bhimani, K. O'Connell, I. Ergas, Marilyn J. Foley, Grace B Gallagher, Jennifer J Griggs, Narre Heon, Tatjana Kolevska, Yuriy Kotsurovskyy, Candyce H Kroenke, Cecile A Laurent, Raymond Liu, Kanichi G Nakata, Sonia Persaud, Donna R Rivera, Janise M. Roh, Sara M. Tabatabai, Emily Valice, Erin J A Bowles, Elisa V Bandera, Lawrence H Kushi, Elizabeth D Kantor","doi":"10.1200/CCI.23.00209","DOIUrl":"https://doi.org/10.1200/CCI.23.00209","url":null,"abstract":"PURPOSE\u0000Identification of patients' intended chemotherapy regimens is critical to most research questions conducted in the real-world setting of cancer care. Yet, these data are not routinely available in electronic health records (EHRs) at the specificity required to address these questions. We developed a methodology to identify patients' intended regimens from EHR data in the Optimal Breast Cancer Chemotherapy Dosing (OBCD) study.\u0000\u0000\u0000METHODS\u0000In women older than 18 years, diagnosed with primary stage I-IIIA breast cancer at Kaiser Permanente Northern California (2006-2019), we categorized participants into 24 drug combinations described in National Comprehensive Cancer Network guidelines for breast cancer treatment. Participants were categorized into 50 guideline chemotherapy administration schedules within these combinations using an iterative algorithm process, followed by chart abstraction where necessary. We also identified patients intended to receive nonguideline administration schedules within guideline drug combinations and nonguideline drug combinations. This process was adapted at Kaiser Permanente Washington using abstracted data (2004-2015).\u0000\u0000\u0000RESULTS\u0000In the OBCD cohort, 13,231 women received adjuvant or neoadjuvant chemotherapy, of whom 10,213 (77%) had their intended regimen identified via the algorithm, 2,416 (18%) had their intended regimen identified via abstraction, and 602 (4.5%) could not be identified. Across guideline drug combinations, 111 nonguideline dosing schedules were used, alongside 61 nonguideline drug combinations. A number of factors were associated with requiring abstraction for regimen determination, including: decreasing neighborhood household income, earlier diagnosis year, later stage, nodal status, and human epidermal growth factor receptor 2 (HER2)+ status.\u0000\u0000\u0000CONCLUSION\u0000We describe the challenges and approaches to operationalize complex, real-world data to identify intended chemotherapy regimens in large, observational studies. This methodology can improve efficiency of use of large-scale clinical data in real-world populations, helping answer critical questions to improve care delivery and patient outcomes.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140787228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Machine Learning to Predict Unplanned Hospital Utilization and Chemotherapy Management From Patient-Reported Outcome Measures. 利用机器学习从患者报告的结果指标预测计划外住院费用和化疗管理。
IF 4.2 Q2 Medicine Pub Date : 2024-04-01 DOI: 10.1200/CCI.23.00264
Zuzanna Wójcik, Vania Dimitrova, Lorraine Warrington, Galina Velikova, Kate Absolom

Purpose: Adverse effects of chemotherapy often require hospital admissions or treatment management. Identifying factors contributing to unplanned hospital utilization may improve health care quality and patients' well-being. This study aimed to assess if patient-reported outcome measures (PROMs) improve performance of machine learning (ML) models predicting hospital admissions, triage events (contacting helpline or attending hospital), and changes to chemotherapy.

Materials and methods: Clinical trial data were used and contained responses to three PROMs (European Organisation for Research and Treatment of Cancer Core Quality of Life Questionnaire [QLQ-C30], EuroQol Five-Dimensional Visual Analogue Scale [EQ-5D], and Functional Assessment of Cancer Therapy-General [FACT-G]) and clinical information on 508 participants undergoing chemotherapy. Six feature sets (with following variables: [1] all available; [2] clinical; [3] PROMs; [4] clinical and QLQ-C30; [5] clinical and EQ-5D; [6] clinical and FACT-G) were applied in six ML models (logistic regression [LR], decision tree, adaptive boosting, random forest [RF], support vector machines [SVMs], and neural network) to predict admissions, triage events, and chemotherapy changes.

Results: The comprehensive analysis of predictive performances of the six ML models for each feature set in three different methods for handling class imbalance indicated that PROMs improved predictions of all outcomes. RF and SVMs had the highest performance for predicting admissions and changes to chemotherapy in balanced data sets, and LR in imbalanced data set. Balancing data led to the best performance compared with imbalanced data set or data set with balanced train set only.

Conclusion: These results endorsed the view that ML can be applied on PROM data to predict hospital utilization and chemotherapy management. If further explored, this study may contribute to health care planning and treatment personalization. Rigorous comparison of model performance affected by different imbalanced data handling methods shows best practice in ML research.

目的:化疗的不良反应往往需要入院治疗或治疗管理。识别导致非计划住院的因素可提高医疗质量和患者的福利。本研究旨在评估患者报告的结果测量(PROMs)是否能提高机器学习(ML)模型预测入院、分流事件(联系帮助热线或到医院就诊)和化疗改变的性能:使用的临床试验数据包含对三种PROMs(欧洲癌症研究和治疗组织核心生活质量问卷[QLQ-C30]、EuroQol五维视觉模拟量表[EQ-5D]和癌症治疗功能评估总表[FACT-G])的回答以及508名接受化疗者的临床信息。六个特征集(包含以下变量:[1] 所有可用变量;[2] 临床变量;[3] PROMs;[4] 临床变量和 QLQ-C30;[5] 临床变量和 EQ-5D;[6] 临床变量和 FACT-G)应用于六个 ML 模型(逻辑回归[LR]、决策树、自适应提升、随机森林[RF]、支持向量机[SVMs]和神经网络),以预测入院、分诊事件和化疗变化:通过对六种 ML 模型在每种特征集上的预测性能进行综合分析,并采用三种不同的方法来处理类别不平衡,结果表明 PROMs 提高了对所有结果的预测。在平衡数据集中,RF 和 SVM 预测入院和化疗变化的性能最高,而在不平衡数据集中,LR 预测入院和化疗变化的性能最高。与不平衡数据集或仅有平衡训练集的数据集相比,平衡数据的性能最佳:这些结果证实了一种观点,即可以将 ML 应用于 PROM 数据,以预测医院使用情况和化疗管理。如果进一步探索,这项研究可能有助于医疗保健规划和个性化治疗。对不同不平衡数据处理方法所影响的模型性能进行严格比较,显示了 ML 研究的最佳实践。
{"title":"Using Machine Learning to Predict Unplanned Hospital Utilization and Chemotherapy Management From Patient-Reported Outcome Measures.","authors":"Zuzanna Wójcik, Vania Dimitrova, Lorraine Warrington, Galina Velikova, Kate Absolom","doi":"10.1200/CCI.23.00264","DOIUrl":"10.1200/CCI.23.00264","url":null,"abstract":"<p><strong>Purpose: </strong>Adverse effects of chemotherapy often require hospital admissions or treatment management. Identifying factors contributing to unplanned hospital utilization may improve health care quality and patients' well-being. This study aimed to assess if patient-reported outcome measures (PROMs) improve performance of machine learning (ML) models predicting hospital admissions, triage events (contacting helpline or attending hospital), and changes to chemotherapy.</p><p><strong>Materials and methods: </strong>Clinical trial data were used and contained responses to three PROMs (European Organisation for Research and Treatment of Cancer Core Quality of Life Questionnaire [QLQ-C30], EuroQol Five-Dimensional Visual Analogue Scale [EQ-5D], and Functional Assessment of Cancer Therapy-General [FACT-G]) and clinical information on 508 participants undergoing chemotherapy. Six feature sets (with following variables: [1] all available; [2] clinical; [3] PROMs; [4] clinical and QLQ-C30; [5] clinical and EQ-5D; [6] clinical and FACT-G) were applied in six ML models (logistic regression [LR], decision tree, adaptive boosting, random forest [RF], support vector machines [SVMs], and neural network) to predict admissions, triage events, and chemotherapy changes.</p><p><strong>Results: </strong>The comprehensive analysis of predictive performances of the six ML models for each feature set in three different methods for handling class imbalance indicated that PROMs improved predictions of all outcomes. RF and SVMs had the highest performance for predicting admissions and changes to chemotherapy in balanced data sets, and LR in imbalanced data set. Balancing data led to the best performance compared with imbalanced data set or data set with balanced train set only.</p><p><strong>Conclusion: </strong>These results endorsed the view that ML can be applied on PROM data to predict hospital utilization and chemotherapy management. If further explored, this study may contribute to health care planning and treatment personalization. Rigorous comparison of model performance affected by different imbalanced data handling methods shows best practice in ML research.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11161248/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140871744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JCO Clinical Cancer Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1