Extraction of Unstructured Electronic Health Records to Evaluate Glioblastoma Treatment Patterns.

IF 3.3 Q2 ONCOLOGY JCO Clinical Cancer Informatics Pub Date : 2024-06-01 DOI:10.1200/CCI.23.00091
Akshay Swaminathan, Alexander L Ren, Janet Y Wu, Aarohi Bhargava-Shah, Ivan Lopez, Ujwal Srivastava, Vassilis Alexopoulos, Rebecca Pizzitola, Brandon Bui, Layth Alkhani, Susan Lee, Nathan Mohit, Noel Seo, Nicholas Macedo, Winson Cheng, William Wang, Edward Tran, Reena Thomas, Olivier Gevaert
{"title":"Extraction of Unstructured Electronic Health Records to Evaluate Glioblastoma Treatment Patterns.","authors":"Akshay Swaminathan, Alexander L Ren, Janet Y Wu, Aarohi Bhargava-Shah, Ivan Lopez, Ujwal Srivastava, Vassilis Alexopoulos, Rebecca Pizzitola, Brandon Bui, Layth Alkhani, Susan Lee, Nathan Mohit, Noel Seo, Nicholas Macedo, Winson Cheng, William Wang, Edward Tran, Reena Thomas, Olivier Gevaert","doi":"10.1200/CCI.23.00091","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Data on lines of therapy (LOTs) for cancer treatment are important for clinical oncology research, but LOTs are not explicitly recorded in electronic health records (EHRs). We present an efficient approach for clinical data abstraction and a flexible algorithm to derive LOTs from EHR-based medication data on patients with glioblastoma multiforme (GBM).</p><p><strong>Methods: </strong>Nonclinicians were trained to abstract the diagnosis of GBM from EHRs, and their accuracy was compared with abstraction performed by clinicians. The resulting data were used to build a cohort of patients with confirmed GBM diagnosis. An algorithm was developed to derive LOTs using structured medication data, accounting for the addition and discontinuation of therapies and drug class. Descriptive statistics were calculated and time-to-next-treatment (TTNT) analysis was performed using the Kaplan-Meier method.</p><p><strong>Results: </strong>Treating clinicians as the gold standard, nonclinicians abstracted GBM diagnosis with a sensitivity of 0.98, specificity 1.00, positive predictive value 1.00, and negative predictive value 0.90, suggesting that nonclinician abstraction of GBM diagnosis was comparable with clinician abstraction. Of 693 patients with a confirmed diagnosis of GBM, 246 patients contained structured information about the types of medications received. Of them, 165 (67.1%) received a first-line therapy (1L) of temozolomide, and the median TTNT from the start of 1L was 179 days.</p><p><strong>Conclusion: </strong>We described a workflow for extracting diagnosis of GBM and LOT from EHR data that combines nonclinician abstraction with algorithmic processing, demonstrating comparable accuracy with clinician abstraction and highlighting the potential for scalable and efficient EHR-based oncology research.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2300091"},"PeriodicalIF":3.3000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11371099/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI.23.00091","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: Data on lines of therapy (LOTs) for cancer treatment are important for clinical oncology research, but LOTs are not explicitly recorded in electronic health records (EHRs). We present an efficient approach for clinical data abstraction and a flexible algorithm to derive LOTs from EHR-based medication data on patients with glioblastoma multiforme (GBM).

Methods: Nonclinicians were trained to abstract the diagnosis of GBM from EHRs, and their accuracy was compared with abstraction performed by clinicians. The resulting data were used to build a cohort of patients with confirmed GBM diagnosis. An algorithm was developed to derive LOTs using structured medication data, accounting for the addition and discontinuation of therapies and drug class. Descriptive statistics were calculated and time-to-next-treatment (TTNT) analysis was performed using the Kaplan-Meier method.

Results: Treating clinicians as the gold standard, nonclinicians abstracted GBM diagnosis with a sensitivity of 0.98, specificity 1.00, positive predictive value 1.00, and negative predictive value 0.90, suggesting that nonclinician abstraction of GBM diagnosis was comparable with clinician abstraction. Of 693 patients with a confirmed diagnosis of GBM, 246 patients contained structured information about the types of medications received. Of them, 165 (67.1%) received a first-line therapy (1L) of temozolomide, and the median TTNT from the start of 1L was 179 days.

Conclusion: We described a workflow for extracting diagnosis of GBM and LOT from EHR data that combines nonclinician abstraction with algorithmic processing, demonstrating comparable accuracy with clinician abstraction and highlighting the potential for scalable and efficient EHR-based oncology research.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
提取非结构化电子健康记录,评估胶质母细胞瘤治疗模式。
目的:癌症治疗线(LOT)数据对于临床肿瘤学研究非常重要,但电子健康记录(EHR)中并没有明确记录治疗线。我们提出了一种高效的临床数据抽取方法和一种灵活的算法,可从基于电子病历的多形性胶质母细胞瘤(GBM)患者用药数据中得出治疗方案:方法: 培训非临床医生从电子病历中抽取 GBM 诊断数据,并将其准确性与临床医生抽取数据的准确性进行比较。所得数据用于建立确诊为 GBM 的患者队列。我们开发了一种算法,利用结构化的用药数据推导出 LOTs,并考虑到疗法和药物类别的增加和中断。计算了描述性统计数字,并采用卡普兰-梅耶法进行了下次治疗时间(TTNT)分析:将临床医生作为金标准,非临床医生抽取 GBM 诊断的敏感性为 0.98,特异性为 1.00,阳性预测值为 1.00,阴性预测值为 0.90,表明非临床医生抽取 GBM 诊断与临床医生抽取相当。在 693 名确诊为 GBM 的患者中,有 246 名患者提供了有关所接受药物类型的结构化信息。其中,165 人(67.1%)接受了替莫唑胺一线治疗(1L),从 1L 开始的中位 TTNT 为 179 天:我们描述了一种从电子病历数据中提取 GBM 和 LOT 诊断的工作流程,该流程将非临床医生抽取与算法处理相结合,显示出与临床医生抽取相当的准确性,并突出了基于电子病历的可扩展、高效的肿瘤学研究的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.20
自引率
4.80%
发文量
190
期刊最新文献
Development, Validation, and Clinical Utility of Electronic Patient-Reported Outcome Measure-Enhanced Prediction Models for Overall Survival in Patients With Advanced Non-Small Cell Lung Cancer Receiving Immunotherapy. Metastatic Versus Localized Disease as Inclusion Criteria That Can Be Automatically Extracted From Randomized Controlled Trials Using Natural Language Processing. Identifying Oncology Patients at High Risk for Potentially Preventable Emergency Department Visits Using a Novel Definition. Use of Patient-Reported Outcomes in Risk Prediction Model Development to Support Cancer Care Delivery: A Scoping Review. Optimizing End Points for Phase III Cancer Trials.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1