Sergey D Goryachev, Cenk Yildirim, Clark DuMontier, Jennifer La, Mayuri Dharne, J Michael Gaziano, Mary T Brophy, Nikhil C Munshi, Jane A Driver, Nhan V Do, Nathanael R Fillmore
{"title":"Natural Language Processing Algorithm to Extract Multiple Myeloma Stage From Oncology Notes in the Veterans Affairs Healthcare System.","authors":"Sergey D Goryachev, Cenk Yildirim, Clark DuMontier, Jennifer La, Mayuri Dharne, J Michael Gaziano, Mary T Brophy, Nikhil C Munshi, Jane A Driver, Nhan V Do, Nathanael R Fillmore","doi":"10.1200/CCI.23.00197","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Stage in multiple myeloma (MM) is an essential measure of disease risk, but its measurement in large databases is often lacking. We aimed to develop and validate a natural language processing (NLP) algorithm to extract oncologists' documentation of stage in the national Veterans Affairs (VA) Healthcare System.</p><p><strong>Methods: </strong>Using nationwide electronic health record (EHR) and cancer registry data from the VA Corporate Data Warehouse, we developed and validated a rule-based NLP algorithm to extract oncologist-determined MM stage. To that end, a clinician annotated MM stage within over 5,000 short snippets of clinical notes, and annotated MM stage at MM treatment initiation for 200 patients. These were allocated into snippet- and patient-level development and validation sets. We developed MM stage extraction and roll-up algorithms within the development sets. After the algorithms were finalized, we validated them using standard measures in held-out validation sets.</p><p><strong>Results: </strong>We developed algorithms for three different MM staging systems that have been in widespread use (Revised International Staging System [R-ISS], International Staging System [ISS], and Durie-Salmon [DS]) and for stage reported without a clearly defined system. Precision and recall were uniformly high for MM stage at the snippet level, ranging from 0.92 to 0.99 for the different MM staging systems. Performance in identifying for MM stage at treatment initiation at the patient level was also excellent, with precision of 0.92, 0.96, 0.90, and 0.86 and recall of 0.99, 0.98, 0.94, and 0.92 for R-ISS, ISS, DS, and unclear stage, respectively.</p><p><strong>Conclusion: </strong>Our MM stage extraction algorithm uses rule-based NLP and data aggregation to accurately measure MM stage documented in oncology notes and pathology reports in VA's national EHR system. It may be adapted to other systems where MM stage is recorded in clinical notes.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":3.3000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11371094/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI.23.00197","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: Stage in multiple myeloma (MM) is an essential measure of disease risk, but its measurement in large databases is often lacking. We aimed to develop and validate a natural language processing (NLP) algorithm to extract oncologists' documentation of stage in the national Veterans Affairs (VA) Healthcare System.
Methods: Using nationwide electronic health record (EHR) and cancer registry data from the VA Corporate Data Warehouse, we developed and validated a rule-based NLP algorithm to extract oncologist-determined MM stage. To that end, a clinician annotated MM stage within over 5,000 short snippets of clinical notes, and annotated MM stage at MM treatment initiation for 200 patients. These were allocated into snippet- and patient-level development and validation sets. We developed MM stage extraction and roll-up algorithms within the development sets. After the algorithms were finalized, we validated them using standard measures in held-out validation sets.
Results: We developed algorithms for three different MM staging systems that have been in widespread use (Revised International Staging System [R-ISS], International Staging System [ISS], and Durie-Salmon [DS]) and for stage reported without a clearly defined system. Precision and recall were uniformly high for MM stage at the snippet level, ranging from 0.92 to 0.99 for the different MM staging systems. Performance in identifying for MM stage at treatment initiation at the patient level was also excellent, with precision of 0.92, 0.96, 0.90, and 0.86 and recall of 0.99, 0.98, 0.94, and 0.92 for R-ISS, ISS, DS, and unclear stage, respectively.
Conclusion: Our MM stage extraction algorithm uses rule-based NLP and data aggregation to accurately measure MM stage documented in oncology notes and pathology reports in VA's national EHR system. It may be adapted to other systems where MM stage is recorded in clinical notes.
目的:多发性骨髓瘤(MM)的分期是衡量疾病风险的一个重要指标,但在大型数据库中往往缺乏对分期的测量。我们旨在开发并验证一种自然语言处理(NLP)算法,以提取全国退伍军人事务(VA)医疗保健系统中肿瘤学家对分期的记录:利用退伍军人事务部企业数据仓库(VA Corporate Data Warehouse)中的全国电子健康记录(EHR)和癌症登记数据,我们开发并验证了一种基于规则的 NLP 算法,用于提取肿瘤学家确定的 MM 分期。为此,一名临床医生在 5000 多份简短的临床笔记片段中注释了 MM 分期,并在 200 名患者开始 MM 治疗时注释了 MM 分期。这些数据被分配到片段级和患者级的开发集和验证集。我们在开发集内开发了 MM 阶段提取和卷积算法。算法确定后,我们在保留的验证集中使用标准测量方法对其进行了验证:我们为三种广泛使用的不同 MM 分期系统(修订版国际分期系统 [R-ISS]、国际分期系统 [ISS] 和 Durie-Salmon [DS])以及没有明确定义系统的分期报告开发了算法。在片段水平上,MM 分期的精确度和召回率都很高,不同 MM 分期系统的精确度和召回率从 0.92 到 0.99 不等。在患者层面识别开始治疗时的 MM 分期也非常出色,R-ISS、ISS、DS 和不明确分期的精确度分别为 0.92、0.96、0.90 和 0.86,召回率分别为 0.99、0.98、0.94 和 0.92:我们的MM分期提取算法使用基于规则的NLP和数据聚合来准确测量退伍军人事务部国家电子病历系统中肿瘤笔记和病理报告中记录的MM分期。该算法可适用于在临床笔记中记录 MM 分期的其他系统。