Keith Harrigian MS , Diep Tran MSc , Tina Tang MD , Anthony Gonzales OD , Paul Nagy PhD , Hadi Kharrazi MD, PhD , Mark Dredze PhD , Cindy X. Cai MD, MS
{"title":"Improving the Identification of Diabetic Retinopathy and Related Conditions in the Electronic Health Record Using Natural Language Processing Methods","authors":"Keith Harrigian MS , Diep Tran MSc , Tina Tang MD , Anthony Gonzales OD , Paul Nagy PhD , Hadi Kharrazi MD, PhD , Mark Dredze PhD , Cindy X. Cai MD, MS","doi":"10.1016/j.xops.2024.100578","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><p>To compare the performance of 3 phenotyping methods in identifying diabetic retinopathy (DR) and related clinical conditions.</p></div><div><h3>Design</h3><p>Three phenotyping methods were used to identify clinical conditions including unspecified DR, nonproliferative DR (NPDR) (mild, moderate, severe), consolidated NPDR (unspecified DR or any NPDR), proliferative DR, diabetic macular edema (DME), vitreous hemorrhage, retinal detachment (RD) (tractional RD or combined tractional and rhegmatogenous RD), and neovascular glaucoma (NVG). The first method used only International Classification of Diseases, 10th Revision (ICD-10) diagnosis codes (<em>ICD-10 Lookup System</em>). The next 2 methods used a Bidirectional Encoder Representations from Transformers with a dense Multilayer Perceptron output layer natural language processing (NLP) framework. The NLP framework was applied either to free-text of provider notes (<em>Text-Only NLP System</em>) or both free-text and ICD-10 diagnosis codes (<em>Text-and-International Classification of Diseases</em> [<em>ICD</em>] <em>NLP System</em>).</p></div><div><h3>Subjects</h3><p>Adults ≥18 years with diabetes mellitus seen at the Wilmer Eye Institute.</p></div><div><h3>Methods</h3><p>We compared the performance of the 3 phenotyping methods in identifying the DR related conditions with gold standard chart review. We also compared the estimated disease prevalence using each method.</p></div><div><h3>Main Outcome Measures</h3><p>Performance of each method was reported as the macro F1 score. The agreement between the methods was calculated using the kappa statistic. Prevalence estimates were also calculated for each method.</p></div><div><h3>Results</h3><p>A total of 91 097 patients and 692 486 office visits were included in the study. Compared with the gold standard, the <em>Text-and-ICD NLP System</em> had the highest F1 score for most clinical conditions (range 0.39–0.64). The agreement between the <em>ICD-10 Lookup System</em> and <em>Text-Only NLP System</em> varied (kappa of 0.21–0.81). The prevalence of DR and related conditions ranged from 1.1% for NVG to 17.9% for DME (using the <em>Text-and-ICD NLP System</em>).</p></div><div><h3>Conclusions</h3><p>The prevalence of DR and related conditions varied significantly depending on the methodology of identifying cases. The best performing phenotyping method was the <em>Text-and-ICD NLP System</em> that used information in both diagnosis codes as well as free-text notes.</p></div><div><h3>Financial Disclosures</h3><p>Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.</p></div>","PeriodicalId":74363,"journal":{"name":"Ophthalmology science","volume":"4 6","pages":"Article 100578"},"PeriodicalIF":3.2000,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666914524001143/pdfft?md5=aee0aca9014224fef1aa919db24f5c88&pid=1-s2.0-S2666914524001143-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ophthalmology science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666914524001143","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose
To compare the performance of 3 phenotyping methods in identifying diabetic retinopathy (DR) and related clinical conditions.
Design
Three phenotyping methods were used to identify clinical conditions including unspecified DR, nonproliferative DR (NPDR) (mild, moderate, severe), consolidated NPDR (unspecified DR or any NPDR), proliferative DR, diabetic macular edema (DME), vitreous hemorrhage, retinal detachment (RD) (tractional RD or combined tractional and rhegmatogenous RD), and neovascular glaucoma (NVG). The first method used only International Classification of Diseases, 10th Revision (ICD-10) diagnosis codes (ICD-10 Lookup System). The next 2 methods used a Bidirectional Encoder Representations from Transformers with a dense Multilayer Perceptron output layer natural language processing (NLP) framework. The NLP framework was applied either to free-text of provider notes (Text-Only NLP System) or both free-text and ICD-10 diagnosis codes (Text-and-International Classification of Diseases [ICD] NLP System).
Subjects
Adults ≥18 years with diabetes mellitus seen at the Wilmer Eye Institute.
Methods
We compared the performance of the 3 phenotyping methods in identifying the DR related conditions with gold standard chart review. We also compared the estimated disease prevalence using each method.
Main Outcome Measures
Performance of each method was reported as the macro F1 score. The agreement between the methods was calculated using the kappa statistic. Prevalence estimates were also calculated for each method.
Results
A total of 91 097 patients and 692 486 office visits were included in the study. Compared with the gold standard, the Text-and-ICD NLP System had the highest F1 score for most clinical conditions (range 0.39–0.64). The agreement between the ICD-10 Lookup System and Text-Only NLP System varied (kappa of 0.21–0.81). The prevalence of DR and related conditions ranged from 1.1% for NVG to 17.9% for DME (using the Text-and-ICD NLP System).
Conclusions
The prevalence of DR and related conditions varied significantly depending on the methodology of identifying cases. The best performing phenotyping method was the Text-and-ICD NLP System that used information in both diagnosis codes as well as free-text notes.
Financial Disclosures
Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.