Extraction of Unstructured Electronic Health Records to Evaluate Glioblastoma Treatment Patterns.

IF 2.8 Q2 ONCOLOGY JCO Clinical Cancer Informatics Pub Date : 2024-06-01 DOI:10.1200/CCI.23.00091

Akshay Swaminathan, Alexander L Ren, Janet Y Wu, Aarohi Bhargava-Shah, Ivan Lopez, Ujwal Srivastava, Vassilis Alexopoulos, Rebecca Pizzitola, Brandon Bui, Layth Alkhani, Susan Lee, Nathan Mohit, Noel Seo, Nicholas Macedo, Winson Cheng, William Wang, Edward Tran, Reena Thomas, Olivier Gevaert

{"title":"Extraction of Unstructured Electronic Health Records to Evaluate Glioblastoma Treatment Patterns.","authors":"Akshay Swaminathan, Alexander L Ren, Janet Y Wu, Aarohi Bhargava-Shah, Ivan Lopez, Ujwal Srivastava, Vassilis Alexopoulos, Rebecca Pizzitola, Brandon Bui, Layth Alkhani, Susan Lee, Nathan Mohit, Noel Seo, Nicholas Macedo, Winson Cheng, William Wang, Edward Tran, Reena Thomas, Olivier Gevaert","doi":"10.1200/CCI.23.00091","DOIUrl":null,"url":null,"abstract":"Purpose: Data on lines of therapy (LOTs) for cancer treatment are important for clinical oncology research, but LOTs are not explicitly recorded in electronic health records (EHRs). We present an efficient approach for clinical data abstraction and a flexible algorithm to derive LOTs from EHR-based medication data on patients with glioblastoma multiforme (GBM).Methods: Nonclinicians were trained to abstract the diagnosis of GBM from EHRs, and their accuracy was compared with abstraction performed by clinicians. The resulting data were used to build a cohort of patients with confirmed GBM diagnosis. An algorithm was developed to derive LOTs using structured medication data, accounting for the addition and discontinuation of therapies and drug class. Descriptive statistics were calculated and time-to-next-treatment (TTNT) analysis was performed using the Kaplan-Meier method.Results: Treating clinicians as the gold standard, nonclinicians abstracted GBM diagnosis with a sensitivity of 0.98, specificity 1.00, positive predictive value 1.00, and negative predictive value 0.90, suggesting that nonclinician abstraction of GBM diagnosis was comparable with clinician abstraction. Of 693 patients with a confirmed diagnosis of GBM, 246 patients contained structured information about the types of medications received. Of them, 165 (67.1%) received a first-line therapy (1L) of temozolomide, and the median TTNT from the start of 1L was 179 days.Conclusion: We described a workflow for extracting diagnosis of GBM and LOT from EHR data that combines nonclinician abstraction with algorithmic processing, demonstrating comparable accuracy with clinician abstraction and highlighting the potential for scalable and efficient EHR-based oncology research.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2300091"},"PeriodicalIF":2.8000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11371099/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI.23.00091","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Data on lines of therapy (LOTs) for cancer treatment are important for clinical oncology research, but LOTs are not explicitly recorded in electronic health records (EHRs). We present an efficient approach for clinical data abstraction and a flexible algorithm to derive LOTs from EHR-based medication data on patients with glioblastoma multiforme (GBM).

Methods: Nonclinicians were trained to abstract the diagnosis of GBM from EHRs, and their accuracy was compared with abstraction performed by clinicians. The resulting data were used to build a cohort of patients with confirmed GBM diagnosis. An algorithm was developed to derive LOTs using structured medication data, accounting for the addition and discontinuation of therapies and drug class. Descriptive statistics were calculated and time-to-next-treatment (TTNT) analysis was performed using the Kaplan-Meier method.

Results: Treating clinicians as the gold standard, nonclinicians abstracted GBM diagnosis with a sensitivity of 0.98, specificity 1.00, positive predictive value 1.00, and negative predictive value 0.90, suggesting that nonclinician abstraction of GBM diagnosis was comparable with clinician abstraction. Of 693 patients with a confirmed diagnosis of GBM, 246 patients contained structured information about the types of medications received. Of them, 165 (67.1%) received a first-line therapy (1L) of temozolomide, and the median TTNT from the start of 1L was 179 days.

Conclusion: We described a workflow for extracting diagnosis of GBM and LOT from EHR data that combines nonclinician abstraction with algorithmic processing, demonstrating comparable accuracy with clinician abstraction and highlighting the potential for scalable and efficient EHR-based oncology research.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

提取非结构化电子健康记录，评估胶质母细胞瘤治疗模式。

目的：癌症治疗线（LOT）数据对于临床肿瘤学研究非常重要，但电子健康记录（EHR）中并没有明确记录治疗线。我们提出了一种高效的临床数据抽取方法和一种灵活的算法，可从基于电子病历的多形性胶质母细胞瘤（GBM）患者用药数据中得出治疗方案：方法：培训非临床医生从电子病历中抽取 GBM 诊断数据，并将其准确性与临床医生抽取数据的准确性进行比较。所得数据用于建立确诊为 GBM 的患者队列。我们开发了一种算法，利用结构化的用药数据推导出 LOTs，并考虑到疗法和药物类别的增加和中断。计算了描述性统计数字，并采用卡普兰-梅耶法进行了下次治疗时间（TTNT）分析：将临床医生作为金标准，非临床医生抽取 GBM 诊断的敏感性为 0.98，特异性为 1.00，阳性预测值为 1.00，阴性预测值为 0.90，表明非临床医生抽取 GBM 诊断与临床医生抽取相当。在 693 名确诊为 GBM 的患者中，有 246 名患者提供了有关所接受药物类型的结构化信息。其中，165 人（67.1%）接受了替莫唑胺一线治疗（1L），从 1L 开始的中位 TTNT 为 179 天：我们描述了一种从电子病历数据中提取 GBM 和 LOT 诊断的工作流程，该流程将非临床医生抽取与算法处理相结合，显示出与临床医生抽取相当的准确性，并突出了基于电子病历的可扩展、高效的肿瘤学研究的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊