Automated interpretation of stress echocardiography reports using natural language processing.

IF 3.9 | Q1, Cardiac & Cardiovascular Systems | European heart journal. Digital health | Pub Date: 2022-12-01 | DOI: 10.1093/ehjdh/ztac047
Chengyi Zheng, Benjamin C Sun, Yi-Lin Wu, Maros Ferencik, Ming-Sum Lee, Rita F Redberg, Aniket A Kawatkar, Visanee V Musigdilok, Adam L Sharp
Citations: 1

Abstract

Aims: Stress echocardiography (SE) findings and interpretations are commonly documented in free-text reports. Reusing SE results requires laborious manual reviews. This study aimed to develop and validate an automated method for abstracting SE reports in a large cohort.

Methods and results: This study included adult patients who underwent SE within 30 days of an emergency department visit for suspected acute coronary syndrome in a large integrated healthcare system. An automated natural language processing (NLP) algorithm was developed to abstract SE reports and classify overall SE results into normal, non-diagnostic, infarction, and ischaemia categories. To assess the criterion validity of the NLP algorithm, randomly selected reports (n = 140) were reviewed by cardiologists in a double-blind fashion. Construct validity was tested on the entire cohort using abstracted SE data and additional clinical variables. The NLP algorithm abstracted 6346 consecutive SE reports. Cardiologists showed good agreement on the overall SE results for the 140 reports: Kappa 0.83 and intraclass correlation coefficient 0.89. On the overall SE results, the NLP algorithm achieved 98.6% for both specificity and negative predictive value and 95.7% for sensitivity, positive predictive value, and F-score; it achieved near-perfect scores on ischaemia findings. The 30-day rate of acute myocardial infarction or death was highest among patients with ischaemia (5.0%), followed by infarction (1.4%), non-diagnostic (0.8%), and normal (0.3%) results. We found substantial variations in the format and quality of SE reports, even within the same institution.
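The abstract does not describe the internals of the NLP algorithm, so the following is only a minimal illustrative sketch of how free-text SE reports might be mapped to the four overall categories with keyword rules and simple negation handling. The patterns, the priority order, the `classify_report` function, and the example sentences are all hypothetical and are not taken from the study.

```python
import re

# Hypothetical keyword rules, checked in priority order (NOT the study's rules).
RULES = [
    ("ischaemia", re.compile(r"\bischa?emi[ac]\b", re.I)),
    ("infarction", re.compile(r"\binfarct(?:ion|s)?\b|\bscar\b", re.I)),
    ("non-diagnostic", re.compile(r"\bnon-?diagnostic\b|\binconclusive\b", re.I)),
]

# A finding is treated as negated if one of these cues precedes it
# within the same sentence (a deliberately crude assumption).
NEGATION = re.compile(r"\b(?:no|not|without|negative for)\b", re.I)


def classify_report(text: str) -> str:
    """Return one of: ischaemia, infarction, non-diagnostic, normal."""
    sentences = [s.strip() for s in re.split(r"[.;\n]+", text) if s.strip()]
    for label, pattern in RULES:
        for sentence in sentences:
            match = pattern.search(sentence)
            # Accept the finding only if no negation cue appears before it.
            if match and not NEGATION.search(sentence[: match.start()]):
                return label
    # No positive finding of any kind: fall back to "normal".
    return "normal"


if __name__ == "__main__":
    examples = [
        "No evidence of ischemia. Normal wall motion at rest and stress.",
        "Stress-induced ischaemia in the LAD territory.",
        "Findings consistent with prior infarction; no ischemia.",
        "Test non-diagnostic: target heart rate not achieved.",
    ]
    for report in examples:
        print(classify_report(report), "<-", report)
```

A production system would need far richer handling of report sections, abbreviations, and uncertainty phrases, particularly given the variation in report format and quality noted in the results.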

Conclusions: Natural language processing is an accurate and efficient method for abstracting unstructured SE reports. This approach creates new opportunities for research, public health measures, and care improvement.
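The agreement and accuracy measures named in the results (Kappa, sensitivity, specificity, positive and negative predictive value, F-score) can be computed from paired labels as sketched below. The abstract does not state how the multi-class figures were averaged; this sketch assumes micro-averaging over one-vs-rest comparisons, which is one setting in which sensitivity, positive predictive value, and F-score coincide. The function names and the toy labels are hypothetical.

```python
from collections import Counter

LABELS = ["normal", "non-diagnostic", "infarction", "ischaemia"]


def micro_metrics(reference, predicted, labels=LABELS):
    """Micro-averaged one-vs-rest metrics (assumed averaging scheme)."""
    tp = fp = fn = tn = 0
    for label in labels:
        for ref, pred in zip(reference, predicted):
            if pred == label and ref == label:
                tp += 1
            elif pred == label:
                fp += 1
            elif ref == label:
                fn += 1
            else:
                tn += 1
    sensitivity = tp / (tp + fn)
    ppv = tp / (tp + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": tn / (tn + fp),
        "ppv": ppv,
        "npv": tn / (tn + fn),
        "f_score": 2 * ppv * sensitivity / (ppv + sensitivity),
    }


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa: chance-corrected agreement between two raters."""
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    return (observed - expected) / (1 - expected)


if __name__ == "__main__":
    # Toy labels for illustration only; not data from the study.
    reference = ["normal"] * 6 + ["ischaemia"] * 2 + ["infarction", "non-diagnostic"]
    predicted = ["normal"] * 6 + ["ischaemia", "normal"] + ["infarction", "non-diagnostic"]
    print(micro_metrics(reference, predicted))
    print(cohen_kappa(reference, predicted))
```

Under micro-averaging, each misclassified report adds exactly one false positive and one false negative, so sensitivity, positive predictive value, and F-score all equal the overall accuracy.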
