从电子健康记录中提取罕见不良事件的主动学习:一项儿科心脏病学研究。

IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS International Journal of Medical Informatics Pub Date : 2024-12-12 DOI:10.1016/j.ijmedinf.2024.105761
Sophie Quennelle , Sophie Malekzadeh-Milani , Nicolas Garcelon , Hassan Faour , Anita Burgun , Carole Faviez , Rosy Tsopra , Damien Bonnet , Antoine Neuraz
{"title":"从电子健康记录中提取罕见不良事件的主动学习:一项儿科心脏病学研究。","authors":"Sophie Quennelle ,&nbsp;Sophie Malekzadeh-Milani ,&nbsp;Nicolas Garcelon ,&nbsp;Hassan Faour ,&nbsp;Anita Burgun ,&nbsp;Carole Faviez ,&nbsp;Rosy Tsopra ,&nbsp;Damien Bonnet ,&nbsp;Antoine Neuraz","doi":"10.1016/j.ijmedinf.2024.105761","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>Automate the extraction of adverse events from the text of electronic medical records of patients hospitalized for cardiac catheterization.</div></div><div><h3>Methods</h3><div>We focused on events related to cardiac catheterization as defined by the NCDR-IMPACT registry. These events were extracted from the Necker Children’s Hospital data warehouse. Electronic health records were pre-screened using regular expressions. The resulting datasets contained numerous false positives sentences that were annotated by a cardiologist using an active learning process. A deep learning text classifier was then trained on this active learning-annotated dataset to accurately identify patients who have suffered a serious adverse event.</div></div><div><h3>Results</h3><div>The dataset included 2,980 patients. Regular expression based extraction of adverse events related to cardiac catheterization achieved a perfect recall. Due to the rarity of adverse events, the dataset obtained from this initial pre-screening step was imbalanced, containing a significant number of false positives. The active learning annotation enabled the acquisition of a representative dataset suitable for training a deep learning model. The deep learning text-classifier identified patients who underwent adverse events after cardiac catheterization with a recall of 0.78 and a specificity of 0.94.</div></div><div><h3>Conclusion</h3><div>Our model effectively identified patients who experienced adverse events related to cardiac catheterization using real clinical data. Enabled by an active learning annotation process, it shows promise for large language model applications in clinical research, especially for rare diseases with limited annotated databases. Our model’s strength lies in its development by physicians for physicians, ensuring its relevance and applicability in clinical practice.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"195 ","pages":"Article 105761"},"PeriodicalIF":3.7000,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Active learning for extracting rare adverse events from electronic health records: A study in pediatric cardiology\",\"authors\":\"Sophie Quennelle ,&nbsp;Sophie Malekzadeh-Milani ,&nbsp;Nicolas Garcelon ,&nbsp;Hassan Faour ,&nbsp;Anita Burgun ,&nbsp;Carole Faviez ,&nbsp;Rosy Tsopra ,&nbsp;Damien Bonnet ,&nbsp;Antoine Neuraz\",\"doi\":\"10.1016/j.ijmedinf.2024.105761\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><div>Automate the extraction of adverse events from the text of electronic medical records of patients hospitalized for cardiac catheterization.</div></div><div><h3>Methods</h3><div>We focused on events related to cardiac catheterization as defined by the NCDR-IMPACT registry. These events were extracted from the Necker Children’s Hospital data warehouse. Electronic health records were pre-screened using regular expressions. The resulting datasets contained numerous false positives sentences that were annotated by a cardiologist using an active learning process. A deep learning text classifier was then trained on this active learning-annotated dataset to accurately identify patients who have suffered a serious adverse event.</div></div><div><h3>Results</h3><div>The dataset included 2,980 patients. Regular expression based extraction of adverse events related to cardiac catheterization achieved a perfect recall. Due to the rarity of adverse events, the dataset obtained from this initial pre-screening step was imbalanced, containing a significant number of false positives. The active learning annotation enabled the acquisition of a representative dataset suitable for training a deep learning model. The deep learning text-classifier identified patients who underwent adverse events after cardiac catheterization with a recall of 0.78 and a specificity of 0.94.</div></div><div><h3>Conclusion</h3><div>Our model effectively identified patients who experienced adverse events related to cardiac catheterization using real clinical data. Enabled by an active learning annotation process, it shows promise for large language model applications in clinical research, especially for rare diseases with limited annotated databases. Our model’s strength lies in its development by physicians for physicians, ensuring its relevance and applicability in clinical practice.</div></div>\",\"PeriodicalId\":54950,\"journal\":{\"name\":\"International Journal of Medical Informatics\",\"volume\":\"195 \",\"pages\":\"Article 105761\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-12-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Medical Informatics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1386505624004246\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505624004246","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

目的:从心导管住院患者的电子病历文本中自动提取不良事件。方法:我们关注ndr - impact注册表中定义的与心导管术相关的事件。这些事件是从Necker儿童医院的数据仓库中提取的。使用正则表达式对电子健康记录进行预筛选。结果数据集包含许多假阳性句子,由心脏病专家使用主动学习过程注释。然后在这个主动学习注释数据集上训练深度学习文本分类器,以准确识别遭受严重不良事件的患者。结果:数据集包括2980例患者。基于正则表达式的心导管相关不良事件提取达到了完美的召回率。由于不良事件的罕见性,从最初的预筛选步骤获得的数据集是不平衡的,包含大量的假阳性。主动学习标注使得获取适合训练深度学习模型的代表性数据集成为可能。深度学习文本分类器识别出心导管插入术后发生不良事件的患者,召回率为0.78,特异性为0.94。结论:我们的模型使用真实的临床数据有效地识别了经历心导管相关不良事件的患者。通过主动学习注释过程,它显示了在临床研究中大型语言模型应用的前景,特别是对于具有有限注释数据库的罕见疾病。我们的模型的优势在于它是由医生为医生开发的,保证了它在临床实践中的相关性和适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Active learning for extracting rare adverse events from electronic health records: A study in pediatric cardiology

Objective

Automate the extraction of adverse events from the text of electronic medical records of patients hospitalized for cardiac catheterization.

Methods

We focused on events related to cardiac catheterization as defined by the NCDR-IMPACT registry. These events were extracted from the Necker Children’s Hospital data warehouse. Electronic health records were pre-screened using regular expressions. The resulting datasets contained numerous false positives sentences that were annotated by a cardiologist using an active learning process. A deep learning text classifier was then trained on this active learning-annotated dataset to accurately identify patients who have suffered a serious adverse event.

Results

The dataset included 2,980 patients. Regular expression based extraction of adverse events related to cardiac catheterization achieved a perfect recall. Due to the rarity of adverse events, the dataset obtained from this initial pre-screening step was imbalanced, containing a significant number of false positives. The active learning annotation enabled the acquisition of a representative dataset suitable for training a deep learning model. The deep learning text-classifier identified patients who underwent adverse events after cardiac catheterization with a recall of 0.78 and a specificity of 0.94.

Conclusion

Our model effectively identified patients who experienced adverse events related to cardiac catheterization using real clinical data. Enabled by an active learning annotation process, it shows promise for large language model applications in clinical research, especially for rare diseases with limited annotated databases. Our model’s strength lies in its development by physicians for physicians, ensuring its relevance and applicability in clinical practice.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Medical Informatics
International Journal of Medical Informatics 医学-计算机:信息系统
CiteScore
8.90
自引率
4.10%
发文量
217
审稿时长
42 days
期刊介绍: International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings. The scope of journal covers: Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.; Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc. Educational computer based programs pertaining to medical informatics or medicine in general; Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.
期刊最新文献
Machine learning for predicting outcomes of transcatheter aortic valve implantation: A systematic review AI-driven triage in emergency departments: A review of benefits, challenges, and future directions Predicting cancer survival at different stages: Insights from fair and explainable machine learning approaches The fading structural prominence of explanations in clinical studies Utilization, challenges, and training needs of digital health technologies: Perspectives from healthcare professionals
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1