Extracting social determinants of health from inpatient electronic medical records using natural language processing.

Elliot A Martin, Adam G D'Souza, Vineet Saini, Karen Tang, Hude Quan, Cathy A Eastwood
{"title":"Extracting social determinants of health from inpatient electronic medical records using natural language processing.","authors":"Elliot A Martin, Adam G D'Souza, Vineet Saini, Karen Tang, Hude Quan, Cathy A Eastwood","doi":"10.1016/j.jeph.2024.202791","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Social determinants of health (SDOH) have been shown to be important predictors of health outcomes. Here we developed methods to extract them from inpatient electronic medical record (EMR) data using techniques compatible with current EMR systems.</p><p><strong>Methods: </strong>Four social determinants were targeted: patient language barriers, employment status, education, and whether the patient lives alone. Inpatients aged 18 and older with records in the Calgary-wide EMR system were studied. Algorithms were developed on the January 2019 hospital admissions (n=8,999) and validated on the January 2018 hospital admissions (n=8,839). SDOH documented as structured data were compared against those extracted from unstructured free-text notes.</p><p><strong>Results: </strong>More than twice as many patients had a note documenting a language barrier in EMR data than in structured data; 12 % of patients indicated by EMR notes to be living alone had a partner noted in their structured marital status. The Positive Predictive Value (PPV) of the elements extracted from notes was high, at 99 % (95 % CI 94.0 %-100.0 %) for language barriers, 98 % (95 % CI 92.6 %-99.9 %) for living alone, 96 % (95 % CI 89.8 %-98.8 %) for unemployment, and 88 % (95 % CI 80.0 %-93.1 %) for retirement.</p><p><strong>Conclusions: </strong>All SDOH elements were extracted with high PPV. SDOH documentation was largely missing in structured data and sometimes misleading.</p>","PeriodicalId":517428,"journal":{"name":"Journal of epidemiology and population health","volume":"72 6","pages":"202791"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of epidemiology and population health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.jeph.2024.202791","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/14 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Social determinants of health (SDOH) have been shown to be important predictors of health outcomes. Here we developed methods to extract them from inpatient electronic medical record (EMR) data using techniques compatible with current EMR systems.

Methods: Four social determinants were targeted: patient language barriers, employment status, education, and whether the patient lives alone. Inpatients aged 18 and older with records in the Calgary-wide EMR system were studied. Algorithms were developed on the January 2019 hospital admissions (n=8,999) and validated on the January 2018 hospital admissions (n=8,839). SDOH documented as structured data were compared against those extracted from unstructured free-text notes.

Results: More than twice as many patients had a note documenting a language barrier in EMR data than in structured data; 12 % of patients indicated by EMR notes to be living alone had a partner noted in their structured marital status. The Positive Predictive Value (PPV) of the elements extracted from notes was high, at 99 % (95 % CI 94.0 %-100.0 %) for language barriers, 98 % (95 % CI 92.6 %-99.9 %) for living alone, 96 % (95 % CI 89.8 %-98.8 %) for unemployment, and 88 % (95 % CI 80.0 %-93.1 %) for retirement.

Conclusions: All SDOH elements were extracted with high PPV. SDOH documentation was largely missing in structured data and sometimes misleading.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用自然语言处理技术从住院病人电子病历中提取健康的社会决定因素。
背景:健康的社会决定因素(SDOH)已被证明是健康结果的重要预测因素。在此,我们开发了从住院病人电子病历(EMR)数据中提取这些因素的方法,这些方法使用的技术与当前的 EMR 系统兼容:方法:我们针对四个社会决定因素进行了研究:患者的语言障碍、就业状况、教育程度以及是否独居。研究对象为在整个卡尔加里 EMR 系统中有记录的 18 岁及以上住院患者。根据 2019 年 1 月入院患者(人数=8999)制定了算法,并根据 2018 年 1 月入院患者(人数=8839)进行了验证。将记录为结构化数据的 SDOH 与从非结构化自由文本笔记中提取的 SDOH 进行了比较:EMR数据中记录语言障碍的患者人数是结构化数据的两倍多;EMR记录显示独居的患者中有12%在结构化婚姻状况中记录有伴侣。从笔记中提取的要素的阳性预测值(PPV)很高,语言障碍为 99 %(95 % CI 94.0 %-100.0%),独居为 98 %(95 % CI 92.6 %-99.9%),失业为 96 %(95 % CI 89.8 %-98.8%),退休为 88 %(95 % CI 80.0 %-93.1%):所有 SDOH 要素的提取均具有较高的 PPV。结构化数据中大多缺少 SDOH 文件,有时会产生误导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Needs and expectations of healthcare professionals regarding the topic of alcohology: Results of a survey conducted in France in 2022. The hidden crisis: Moral injury among French healthcare workers. A community-based peer-facilitated psychological and social support model to improve retention in care among Cameroonian adolescents perinatally infected with human immunodeficiency virus: A randomized controlled trial. Fostering patients' participation in oncology research by developing psychological empowerment and a sense of community. Analysis of pediatric ophthalmology surgical activity in metropolitan France in 2016: Its impact on training capacities.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1