基于电子病历识别新诊断的克罗恩病病例

IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Artificial Intelligence in Medicine Pub Date : 2024-11-21 DOI:10.1016/j.artmed.2024.103032
Susanne Ibing , Julian Hugo , Florian Borchert , Linea Schmidt , Caroline Benson , Allison A. Marshall , Colleen Chasteau , Ujunwa Korie , Diana Paguay , Jan Philipp Sachs , Bernhard Y. Renard , Judy H. Cho , Erwin P. Böttinger , Ryan C. Ungaro
{"title":"基于电子病历识别新诊断的克罗恩病病例","authors":"Susanne Ibing ,&nbsp;Julian Hugo ,&nbsp;Florian Borchert ,&nbsp;Linea Schmidt ,&nbsp;Caroline Benson ,&nbsp;Allison A. Marshall ,&nbsp;Colleen Chasteau ,&nbsp;Ujunwa Korie ,&nbsp;Diana Paguay ,&nbsp;Jan Philipp Sachs ,&nbsp;Bernhard Y. Renard ,&nbsp;Judy H. Cho ,&nbsp;Erwin P. Böttinger ,&nbsp;Ryan C. Ungaro","doi":"10.1016/j.artmed.2024.103032","DOIUrl":null,"url":null,"abstract":"<div><h3>Background:</h3><div>Early diagnosis and treatment of Crohn’s Disease are associated with decreased risk of surgery and complications. However, diagnostic delay is frequently seen in clinical practice. To better understand Crohn’s Disease risk factors and disease indicators, we identified, described, and predicted incident Crohn’s Disease patients based on the Electronic Health Record data of the Mount Sinai Health System.</div></div><div><h3>Methods:</h3><div>We developed two phenotyping algorithms based on structured Electronic Health Record data (i.e., coded diagnosis, medication prescription, and healthcare utilization), and a more simple and advanced approach of information extraction from clinical notes, including data between 2011 and 2023. We conducted an ablation study for the classification task using different models, prediction time points, data inputs, text encoding methods, and case-control matching variables.</div></div><div><h3>Results:</h3><div>We identified 247 incident Crohn’s Disease cases and 1221 matched controls and validated our cohorts through manual chart review. A second control cohort (n = 1235) was created without matching on race. Gastrointestinal symptoms were significantly overrepresented in cases at least 180 days before the first coded Crohn’s Disease diagnosis. Adding text-based features to the clinical prediction models increased their overall performances. However, adding race as a matching variable had more effects on the model performance than the choice of modeling algorithm or input data, with an area under the receiver operating characteristic difference of 0.09 between the best-performing models.</div></div><div><h3>Conclusion:</h3><div>We demonstrate the feasibility of identifying newly diagnosed Crohn’s Disease patients within a United States health system using Electronic Health Records. For the predictive modeling task, cases and controls were distinguished only with modest performance, even though various state-of-the-art methods were applied based on features from structured and unstructured data. Our findings suggest the benefit of adding information from clinical notes in a supervised or unsupervised manner for cohort creation and predictive modeling.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"159 ","pages":"Article 103032"},"PeriodicalIF":6.1000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Electronic Health Records-based identification of newly diagnosed Crohn’s Disease cases\",\"authors\":\"Susanne Ibing ,&nbsp;Julian Hugo ,&nbsp;Florian Borchert ,&nbsp;Linea Schmidt ,&nbsp;Caroline Benson ,&nbsp;Allison A. Marshall ,&nbsp;Colleen Chasteau ,&nbsp;Ujunwa Korie ,&nbsp;Diana Paguay ,&nbsp;Jan Philipp Sachs ,&nbsp;Bernhard Y. Renard ,&nbsp;Judy H. Cho ,&nbsp;Erwin P. Böttinger ,&nbsp;Ryan C. Ungaro\",\"doi\":\"10.1016/j.artmed.2024.103032\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background:</h3><div>Early diagnosis and treatment of Crohn’s Disease are associated with decreased risk of surgery and complications. However, diagnostic delay is frequently seen in clinical practice. To better understand Crohn’s Disease risk factors and disease indicators, we identified, described, and predicted incident Crohn’s Disease patients based on the Electronic Health Record data of the Mount Sinai Health System.</div></div><div><h3>Methods:</h3><div>We developed two phenotyping algorithms based on structured Electronic Health Record data (i.e., coded diagnosis, medication prescription, and healthcare utilization), and a more simple and advanced approach of information extraction from clinical notes, including data between 2011 and 2023. We conducted an ablation study for the classification task using different models, prediction time points, data inputs, text encoding methods, and case-control matching variables.</div></div><div><h3>Results:</h3><div>We identified 247 incident Crohn’s Disease cases and 1221 matched controls and validated our cohorts through manual chart review. A second control cohort (n = 1235) was created without matching on race. Gastrointestinal symptoms were significantly overrepresented in cases at least 180 days before the first coded Crohn’s Disease diagnosis. Adding text-based features to the clinical prediction models increased their overall performances. However, adding race as a matching variable had more effects on the model performance than the choice of modeling algorithm or input data, with an area under the receiver operating characteristic difference of 0.09 between the best-performing models.</div></div><div><h3>Conclusion:</h3><div>We demonstrate the feasibility of identifying newly diagnosed Crohn’s Disease patients within a United States health system using Electronic Health Records. For the predictive modeling task, cases and controls were distinguished only with modest performance, even though various state-of-the-art methods were applied based on features from structured and unstructured data. Our findings suggest the benefit of adding information from clinical notes in a supervised or unsupervised manner for cohort creation and predictive modeling.</div></div>\",\"PeriodicalId\":55458,\"journal\":{\"name\":\"Artificial Intelligence in Medicine\",\"volume\":\"159 \",\"pages\":\"Article 103032\"},\"PeriodicalIF\":6.1000,\"publicationDate\":\"2024-11-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence in Medicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0933365724002744\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0933365724002744","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

背景:克罗恩病的早期诊断和治疗可降低手术风险和并发症。然而,临床实践中却经常出现诊断延误的情况。为了更好地了解克罗恩病的风险因素和疾病指标,我们根据西奈山医疗系统的电子病历数据对克罗恩病患者进行了识别、描述和预测。方法:我们开发了两种基于结构化电子病历数据(即编码诊断、药物处方和医疗保健使用)的表型算法,以及一种从临床笔记(包括 2011 年至 2023 年的数据)中提取信息的更简单、更先进的方法。我们使用不同的模型、预测时间点、数据输入、文本编码方法和病例对照匹配变量对分类任务进行了消融研究。结果:我们确定了 247 例克罗恩病病例和 1221 例匹配对照,并通过人工病历审查验证了我们的队列。第二个对照组(n = 1235)是在没有种族匹配的情况下建立的。在首次编码克罗恩病诊断前至少 180 天的病例中,胃肠道症状明显偏多。在临床预测模型中添加基于文本的特征可提高模型的整体性能。然而,与建模算法或输入数据的选择相比,添加种族作为匹配变量对模型性能的影响更大,表现最好的模型之间的接受者操作特征下面积差为 0.09。在预测建模任务中,尽管采用了基于结构化和非结构化数据特征的各种先进方法,但病例和对照组的区分效果一般。我们的研究结果表明,在队列创建和预测建模中,以监督或无监督的方式添加临床笔记中的信息是有益的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Electronic Health Records-based identification of newly diagnosed Crohn’s Disease cases

Background:

Early diagnosis and treatment of Crohn’s Disease are associated with decreased risk of surgery and complications. However, diagnostic delay is frequently seen in clinical practice. To better understand Crohn’s Disease risk factors and disease indicators, we identified, described, and predicted incident Crohn’s Disease patients based on the Electronic Health Record data of the Mount Sinai Health System.

Methods:

We developed two phenotyping algorithms based on structured Electronic Health Record data (i.e., coded diagnosis, medication prescription, and healthcare utilization), and a more simple and advanced approach of information extraction from clinical notes, including data between 2011 and 2023. We conducted an ablation study for the classification task using different models, prediction time points, data inputs, text encoding methods, and case-control matching variables.

Results:

We identified 247 incident Crohn’s Disease cases and 1221 matched controls and validated our cohorts through manual chart review. A second control cohort (n = 1235) was created without matching on race. Gastrointestinal symptoms were significantly overrepresented in cases at least 180 days before the first coded Crohn’s Disease diagnosis. Adding text-based features to the clinical prediction models increased their overall performances. However, adding race as a matching variable had more effects on the model performance than the choice of modeling algorithm or input data, with an area under the receiver operating characteristic difference of 0.09 between the best-performing models.

Conclusion:

We demonstrate the feasibility of identifying newly diagnosed Crohn’s Disease patients within a United States health system using Electronic Health Records. For the predictive modeling task, cases and controls were distinguished only with modest performance, even though various state-of-the-art methods were applied based on features from structured and unstructured data. Our findings suggest the benefit of adding information from clinical notes in a supervised or unsupervised manner for cohort creation and predictive modeling.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Artificial Intelligence in Medicine
Artificial Intelligence in Medicine 工程技术-工程:生物医学
CiteScore
15.00
自引率
2.70%
发文量
143
审稿时长
6.3 months
期刊介绍: Artificial Intelligence in Medicine publishes original articles from a wide variety of interdisciplinary perspectives concerning the theory and practice of artificial intelligence (AI) in medicine, medically-oriented human biology, and health care. Artificial intelligence in medicine may be characterized as the scientific discipline pertaining to research studies, projects, and applications that aim at supporting decision-based medical tasks through knowledge- and/or data-intensive computer-based solutions that ultimately support and improve the performance of a human care provider.
期刊最新文献
Hyperbolic multivariate feature learning in higher-order heterogeneous networks for drug–disease prediction Editorial Board BDFormer: Boundary-aware dual-decoder transformer for skin lesion segmentation Finger-aware Artificial Neural Network for predicting arthritis in Patients with hand pain Artificial intelligence-driven approaches in antibiotic stewardship programs and optimizing prescription practices: A systematic review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1