Yaguang Zheng, Victoria Vaughan Dickson, Saul Blecker, Jason M Ng, Brynne Campbell Rice, Gail D'Eramo Melkus, Liat Shenkar, Marie Claire R Mortejo, Stephen B Johnson
{"title":"用自然语言处理识别低血糖患者:系统文献综述。","authors":"Yaguang Zheng, Victoria Vaughan Dickson, Saul Blecker, Jason M Ng, Brynne Campbell Rice, Gail D'Eramo Melkus, Liat Shenkar, Marie Claire R Mortejo, Stephen B Johnson","doi":"10.2196/34681","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Accurately identifying patients with hypoglycemia is key to preventing adverse events and mortality. Natural language processing (NLP), a form of artificial intelligence, uses computational algorithms to extract information from text data. NLP is a scalable, efficient, and quick method to extract hypoglycemia-related information when using electronic health record data sources from a large population.</p><p><strong>Objective: </strong>The objective of this systematic review was to synthesize the literature on the application of NLP to extract hypoglycemia from electronic health record clinical notes.</p><p><strong>Methods: </strong>Literature searches were conducted electronically in PubMed, Web of Science Core Collection, CINAHL (EBSCO), PsycINFO (Ovid), IEEE Xplore, Google Scholar, and ACL Anthology. Keywords included hypoglycemia, low blood glucose, NLP, and machine learning. Inclusion criteria included studies that applied NLP to identify hypoglycemia, reported the outcomes related to hypoglycemia, and were published in English as full papers.</p><p><strong>Results: </strong>This review (n=8 studies) revealed heterogeneity of the reported results related to hypoglycemia. Of the 8 included studies, 4 (50%) reported that the prevalence rate of any level of hypoglycemia was 3.4% to 46.2%. The use of NLP to analyze clinical notes improved the capture of undocumented or missed hypoglycemic events using International Classification of Diseases, Ninth Revision (ICD-9), and International Classification of Diseases, Tenth Revision (ICD-10), and laboratory testing. The combination of NLP and ICD-9 or ICD-10 codes significantly increased the identification of hypoglycemic events compared with individual methods; for example, the prevalence rates of hypoglycemia were 12.4% for International Classification of Diseases codes, 25.1% for an NLP algorithm, and 32.2% for combined algorithms. All the reviewed studies applied rule-based NLP algorithms to identify hypoglycemia.</p><p><strong>Conclusions: </strong>The findings provided evidence that the application of NLP to analyze clinical notes improved the capture of hypoglycemic events, particularly when combined with the ICD-9 or ICD-10 codes and laboratory testing.</p>","PeriodicalId":52371,"journal":{"name":"JMIR Diabetes","volume":"7 2","pages":"e34681"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9152713/pdf/","citationCount":"3","resultStr":"{\"title\":\"Identifying Patients With Hypoglycemia Using Natural Language Processing: Systematic Literature Review.\",\"authors\":\"Yaguang Zheng, Victoria Vaughan Dickson, Saul Blecker, Jason M Ng, Brynne Campbell Rice, Gail D'Eramo Melkus, Liat Shenkar, Marie Claire R Mortejo, Stephen B Johnson\",\"doi\":\"10.2196/34681\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Accurately identifying patients with hypoglycemia is key to preventing adverse events and mortality. Natural language processing (NLP), a form of artificial intelligence, uses computational algorithms to extract information from text data. NLP is a scalable, efficient, and quick method to extract hypoglycemia-related information when using electronic health record data sources from a large population.</p><p><strong>Objective: </strong>The objective of this systematic review was to synthesize the literature on the application of NLP to extract hypoglycemia from electronic health record clinical notes.</p><p><strong>Methods: </strong>Literature searches were conducted electronically in PubMed, Web of Science Core Collection, CINAHL (EBSCO), PsycINFO (Ovid), IEEE Xplore, Google Scholar, and ACL Anthology. Keywords included hypoglycemia, low blood glucose, NLP, and machine learning. Inclusion criteria included studies that applied NLP to identify hypoglycemia, reported the outcomes related to hypoglycemia, and were published in English as full papers.</p><p><strong>Results: </strong>This review (n=8 studies) revealed heterogeneity of the reported results related to hypoglycemia. Of the 8 included studies, 4 (50%) reported that the prevalence rate of any level of hypoglycemia was 3.4% to 46.2%. The use of NLP to analyze clinical notes improved the capture of undocumented or missed hypoglycemic events using International Classification of Diseases, Ninth Revision (ICD-9), and International Classification of Diseases, Tenth Revision (ICD-10), and laboratory testing. The combination of NLP and ICD-9 or ICD-10 codes significantly increased the identification of hypoglycemic events compared with individual methods; for example, the prevalence rates of hypoglycemia were 12.4% for International Classification of Diseases codes, 25.1% for an NLP algorithm, and 32.2% for combined algorithms. All the reviewed studies applied rule-based NLP algorithms to identify hypoglycemia.</p><p><strong>Conclusions: </strong>The findings provided evidence that the application of NLP to analyze clinical notes improved the capture of hypoglycemic events, particularly when combined with the ICD-9 or ICD-10 codes and laboratory testing.</p>\",\"PeriodicalId\":52371,\"journal\":{\"name\":\"JMIR Diabetes\",\"volume\":\"7 2\",\"pages\":\"e34681\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9152713/pdf/\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR Diabetes\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2196/34681\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Diabetes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/34681","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 3
摘要
背景:准确识别低血糖患者是预防不良事件和死亡率的关键。自然语言处理(NLP)是人工智能的一种形式,它使用计算算法从文本数据中提取信息。当使用来自大量人群的电子健康记录数据源时,NLP是一种可扩展、高效和快速的方法,可以提取与低血糖相关的信息。目的:本系统综述的目的是综合应用NLP从电子病历临床记录中提取低血糖的文献。方法:在PubMed、Web of Science Core Collection、CINAHL (EBSCO)、PsycINFO (Ovid)、IEEE Xplore、Google Scholar和ACL Anthology中进行电子文献检索。关键词:低血糖、低血糖、NLP、机器学习。纳入标准包括应用NLP识别低血糖,报告与低血糖相关的结局,并以英文全文发表的研究。结果:本综述(n=8项研究)揭示了与低血糖相关的报告结果的异质性。在纳入的8项研究中,4项(50%)报告了任何水平的低血糖患病率为3.4%至46.2%。使用NLP分析临床记录,提高了使用《国际疾病分类》第九版(ICD-9)、《国际疾病分类》第十版(ICD-10)和实验室检测对未记录或遗漏的低血糖事件的捕捉。与单独的方法相比,NLP与ICD-9或ICD-10代码的结合显著提高了低血糖事件的识别率;例如,国际疾病分类代码的低血糖患病率为12.4%,NLP算法的患病率为25.1%,组合算法的患病率为32.2%。所有回顾的研究都应用基于规则的NLP算法来识别低血糖。结论:研究结果证明,应用NLP分析临床记录可以改善低血糖事件的捕获,特别是当与ICD-9或ICD-10代码和实验室测试相结合时。
Identifying Patients With Hypoglycemia Using Natural Language Processing: Systematic Literature Review.
Background: Accurately identifying patients with hypoglycemia is key to preventing adverse events and mortality. Natural language processing (NLP), a form of artificial intelligence, uses computational algorithms to extract information from text data. NLP is a scalable, efficient, and quick method to extract hypoglycemia-related information when using electronic health record data sources from a large population.
Objective: The objective of this systematic review was to synthesize the literature on the application of NLP to extract hypoglycemia from electronic health record clinical notes.
Methods: Literature searches were conducted electronically in PubMed, Web of Science Core Collection, CINAHL (EBSCO), PsycINFO (Ovid), IEEE Xplore, Google Scholar, and ACL Anthology. Keywords included hypoglycemia, low blood glucose, NLP, and machine learning. Inclusion criteria included studies that applied NLP to identify hypoglycemia, reported the outcomes related to hypoglycemia, and were published in English as full papers.
Results: This review (n=8 studies) revealed heterogeneity of the reported results related to hypoglycemia. Of the 8 included studies, 4 (50%) reported that the prevalence rate of any level of hypoglycemia was 3.4% to 46.2%. The use of NLP to analyze clinical notes improved the capture of undocumented or missed hypoglycemic events using International Classification of Diseases, Ninth Revision (ICD-9), and International Classification of Diseases, Tenth Revision (ICD-10), and laboratory testing. The combination of NLP and ICD-9 or ICD-10 codes significantly increased the identification of hypoglycemic events compared with individual methods; for example, the prevalence rates of hypoglycemia were 12.4% for International Classification of Diseases codes, 25.1% for an NLP algorithm, and 32.2% for combined algorithms. All the reviewed studies applied rule-based NLP algorithms to identify hypoglycemia.
Conclusions: The findings provided evidence that the application of NLP to analyze clinical notes improved the capture of hypoglycemic events, particularly when combined with the ICD-9 or ICD-10 codes and laboratory testing.