使用自然语言处理的压力超声心动图报告的自动解释。

IF 4.4 Q1 CARDIAC & CARDIOVASCULAR SYSTEMS European heart journal. Digital health Pub Date : 2022-12-01 DOI:10.1093/ehjdh/ztac047

Chengyi Zheng, Benjamin C Sun, Yi-Lin Wu, Maros Ferencik, Ming-Sum Lee, Rita F Redberg, Aniket A Kawatkar, Visanee V Musigdilok, Adam L Sharp

{"title":"使用自然语言处理的压力超声心动图报告的自动解释。","authors":"Chengyi Zheng, Benjamin C Sun, Yi-Lin Wu, Maros Ferencik, Ming-Sum Lee, Rita F Redberg, Aniket A Kawatkar, Visanee V Musigdilok, Adam L Sharp","doi":"10.1093/ehjdh/ztac047","DOIUrl":null,"url":null,"abstract":"Aims: Stress echocardiography (SE) findings and interpretations are commonly documented in free-text reports. Reusing SE results requires laborious manual reviews. This study aimed to develop and validate an automated method for abstracting SE reports in a large cohort.Methods and results: This study included adult patients who had SE within 30 days of their emergency department visit for suspected acute coronary syndrome in a large integrated healthcare system. An automated natural language processing (NLP) algorithm was developed to abstract SE reports and classify overall SE results into normal, non-diagnostic, infarction, and ischaemia categories. Randomly selected reports (n = 140) were double-blindly reviewed by cardiologists to perform criterion validity of the NLP algorithm. Construct validity was tested on the entire cohort using abstracted SE data and additional clinical variables. The NLP algorithm abstracted 6346 consecutive SE reports. Cardiologists had good agreements on the overall SE results on the 140 reports: Kappa (0.83) and intraclass correlation coefficient (0.89). The NLP algorithm achieved 98.6% specificity and negative predictive value, 95.7% sensitivity, positive predictive value, and F-score on the overall SE results and near-perfect scores on ischaemia findings. The 30-day acute myocardial infarction or death outcomes were highest among patients with ischaemia (5.0%), followed by infarction (1.4%), non-diagnostic (0.8%), and normal (0.3%) results. We found substantial variations in the format and quality of SE reports, even within the same institution.Conclusions: Natural language processing is an accurate and efficient method for abstracting unstructured SE reports. This approach creates new opportunities for research, public health measures, and care improvement.","PeriodicalId":72965,"journal":{"name":"European heart journal. Digital health","volume":"3 4","pages":"626-637"},"PeriodicalIF":4.4000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/97/ff/ztac047.PMC9779789.pdf","citationCount":"1","resultStr":"{\"title\":\"Automated interpretation of stress echocardiography reports using natural language processing.\",\"authors\":\"Chengyi Zheng, Benjamin C Sun, Yi-Lin Wu, Maros Ferencik, Ming-Sum Lee, Rita F Redberg, Aniket A Kawatkar, Visanee V Musigdilok, Adam L Sharp\",\"doi\":\"10.1093/ehjdh/ztac047\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aims: Stress echocardiography (SE) findings and interpretations are commonly documented in free-text reports. Reusing SE results requires laborious manual reviews. This study aimed to develop and validate an automated method for abstracting SE reports in a large cohort.Methods and results: This study included adult patients who had SE within 30 days of their emergency department visit for suspected acute coronary syndrome in a large integrated healthcare system. An automated natural language processing (NLP) algorithm was developed to abstract SE reports and classify overall SE results into normal, non-diagnostic, infarction, and ischaemia categories. Randomly selected reports (n = 140) were double-blindly reviewed by cardiologists to perform criterion validity of the NLP algorithm. Construct validity was tested on the entire cohort using abstracted SE data and additional clinical variables. The NLP algorithm abstracted 6346 consecutive SE reports. Cardiologists had good agreements on the overall SE results on the 140 reports: Kappa (0.83) and intraclass correlation coefficient (0.89). The NLP algorithm achieved 98.6% specificity and negative predictive value, 95.7% sensitivity, positive predictive value, and F-score on the overall SE results and near-perfect scores on ischaemia findings. The 30-day acute myocardial infarction or death outcomes were highest among patients with ischaemia (5.0%), followed by infarction (1.4%), non-diagnostic (0.8%), and normal (0.3%) results. We found substantial variations in the format and quality of SE reports, even within the same institution.Conclusions: Natural language processing is an accurate and efficient method for abstracting unstructured SE reports. This approach creates new opportunities for research, public health measures, and care improvement.\",\"PeriodicalId\":72965,\"journal\":{\"name\":\"European heart journal. Digital health\",\"volume\":\"3 4\",\"pages\":\"626-637\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/97/ff/ztac047.PMC9779789.pdf\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European heart journal. Digital health\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/ehjdh/ztac047\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CARDIAC & CARDIOVASCULAR SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European heart journal. Digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/ehjdh/ztac047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}

引用次数: 1

摘要

目的:压力超声心动图(SE)的结果和解释通常记录在自由文本报告中。重用SE结果需要费力的手工审查。本研究旨在开发和验证一种在大型队列中提取SE报告的自动化方法。方法和结果:本研究纳入了在大型综合医疗保健系统中因疑似急性冠状动脉综合征就诊的急诊30天内患有SE的成年患者。开发了一种自动自然语言处理(NLP)算法来提取SE报告，并将总体SE结果分为正常、非诊断性、梗死和缺血类别。随机选择的报告(n = 140)由心脏病专家进行双盲审查，以执行NLP算法的标准有效性。使用抽象的SE数据和其他临床变量对整个队列进行结构效度测试。NLP算法提取了6346个连续的SE报告。心脏病专家对140份报告的总体SE结果有很好的一致性:Kappa(0.83)和类内相关系数(0.89)。NLP算法的特异性为98.6%，阴性预测值为95.7%，敏感性为95.7%，阳性预测值为95.7%，总体SE结果为f分，缺血结果为接近完美分。30天急性心肌梗死或死亡结果在缺血患者中最高(5.0%)，其次是梗死(1.4%)、非诊断性(0.8%)和正常(0.3%)结果。我们发现，即使在同一机构内，SE报告的格式和质量也存在很大差异。结论:自然语言处理是一种准确、高效的非结构化SE报告提取方法。这种方法为研究、公共卫生措施和改善护理创造了新的机会。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Automated interpretation of stress echocardiography reports using natural language processing.

Aims: Stress echocardiography (SE) findings and interpretations are commonly documented in free-text reports. Reusing SE results requires laborious manual reviews. This study aimed to develop and validate an automated method for abstracting SE reports in a large cohort.

Methods and results: This study included adult patients who had SE within 30 days of their emergency department visit for suspected acute coronary syndrome in a large integrated healthcare system. An automated natural language processing (NLP) algorithm was developed to abstract SE reports and classify overall SE results into normal, non-diagnostic, infarction, and ischaemia categories. Randomly selected reports (n = 140) were double-blindly reviewed by cardiologists to perform criterion validity of the NLP algorithm. Construct validity was tested on the entire cohort using abstracted SE data and additional clinical variables. The NLP algorithm abstracted 6346 consecutive SE reports. Cardiologists had good agreements on the overall SE results on the 140 reports: Kappa (0.83) and intraclass correlation coefficient (0.89). The NLP algorithm achieved 98.6% specificity and negative predictive value, 95.7% sensitivity, positive predictive value, and F-score on the overall SE results and near-perfect scores on ischaemia findings. The 30-day acute myocardial infarction or death outcomes were highest among patients with ischaemia (5.0%), followed by infarction (1.4%), non-diagnostic (0.8%), and normal (0.3%) results. We found substantial variations in the format and quality of SE reports, even within the same institution.

Conclusions: Natural language processing is an accurate and efficient method for abstracting unstructured SE reports. This approach creates new opportunities for research, public health measures, and care improvement.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

European heart journal. Digital health

CiteScore

5.00

自引率

0.00%

发文量