Identifying social determinants of health from clinical narratives: A study of performance, documentation ratio, and potential bias

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Journal of Biomedical Informatics Pub Date : 2024-04-14 DOI:10.1016/j.jbi.2024.104642

Zehao Yu , Cheng Peng , Xi Yang , Chong Dang , Prakash Adekkanattu , Braja Gopal Patra , Yifan Peng , Jyotishman Pathak , Debbie L. Wilson , Ching-Yuan Chang , Wei-Hsuan Lo-Ciganic , Thomas J. George , William R. Hogan , Yi Guo , Jiang Bian , Yonghui Wu

{"title":"Identifying social determinants of health from clinical narratives: A study of performance, documentation ratio, and potential bias","authors":"Zehao Yu , Cheng Peng , Xi Yang , Chong Dang , Prakash Adekkanattu , Braja Gopal Patra , Yifan Peng , Jyotishman Pathak , Debbie L. Wilson , Ching-Yuan Chang , Wei-Hsuan Lo-Ciganic , Thomas J. George , William R. Hogan , Yi Guo , Jiang Bian , Yonghui Wu","doi":"10.1016/j.jbi.2024.104642","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><p>To develop a natural language processing (NLP) package to extract social determinants of health (SDoH) from clinical narratives, examine the bias among race and gender groups, test the generalizability of extracting SDoH for different disease groups, and examine population-level extraction ratio.</p></div><div><h3>Methods</h3><p>We developed SDoH corpora using clinical notes identified at the University of Florida (UF) Health. We systematically compared 7 transformer-based large language models (LLMs) and developed an open-source package – SODA (i.e., SOcial DeterminAnts) to facilitate SDoH extraction from clinical narratives. We examined the performance and potential bias of SODA for different race and gender groups, tested the generalizability of SODA using two disease domains including cancer and opioid use, and explored strategies for improvement. We applied SODA to extract 19 categories of SDoH from the breast (n = 7,971), lung (n = 11,804), and colorectal cancer (n = 6,240) cohorts to assess patient-level extraction ratio and examine the differences among race and gender groups.</p></div><div><h3>Results</h3><p>We developed an SDoH corpus using 629 clinical notes of cancer patients with annotations of 13,193 SDoH concepts/attributes from 19 categories of SDoH, and another cross-disease validation corpus using 200 notes from opioid use patients with 4,342 SDoH concepts/attributes. We compared 7 transformer models and the GatorTron model achieved the best mean average strict/lenient F1 scores of 0.9122 and 0.9367 for SDoH concept extraction and 0.9584 and 0.9593 for linking attributes to SDoH concepts. There is a small performance gap (∼4%) between Males and Females, but a large performance gap (>16 %) among race groups. The performance dropped when we applied the cancer SDoH model to the opioid cohort; fine-tuning using a smaller opioid SDoH corpus improved the performance. The extraction ratio varied in the three cancer cohorts, in which 10 SDoH could be extracted from over 70 % of cancer patients, but 9 SDoH could be extracted from less than 70 % of cancer patients. Individuals from the White and Black groups have a higher extraction ratio than other minority race groups.</p></div><div><h3>Conclusions</h3><p>Our SODA package achieved good performance in extracting 19 categories of SDoH from clinical narratives. The SODA package with pre-trained transformer models is available at <span>https://github.com/uf-hobi-informatics-lab/SODA_Docker</span><svg><path></path></svg>.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":null,"pages":null},"PeriodicalIF":4.0000,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biomedical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1532046424000601","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Objective

To develop a natural language processing (NLP) package to extract social determinants of health (SDoH) from clinical narratives, examine the bias among race and gender groups, test the generalizability of extracting SDoH for different disease groups, and examine population-level extraction ratio.

Methods

We developed SDoH corpora using clinical notes identified at the University of Florida (UF) Health. We systematically compared 7 transformer-based large language models (LLMs) and developed an open-source package – SODA (i.e., SOcial DeterminAnts) to facilitate SDoH extraction from clinical narratives. We examined the performance and potential bias of SODA for different race and gender groups, tested the generalizability of SODA using two disease domains including cancer and opioid use, and explored strategies for improvement. We applied SODA to extract 19 categories of SDoH from the breast (n = 7,971), lung (n = 11,804), and colorectal cancer (n = 6,240) cohorts to assess patient-level extraction ratio and examine the differences among race and gender groups.

Results

We developed an SDoH corpus using 629 clinical notes of cancer patients with annotations of 13,193 SDoH concepts/attributes from 19 categories of SDoH, and another cross-disease validation corpus using 200 notes from opioid use patients with 4,342 SDoH concepts/attributes. We compared 7 transformer models and the GatorTron model achieved the best mean average strict/lenient F1 scores of 0.9122 and 0.9367 for SDoH concept extraction and 0.9584 and 0.9593 for linking attributes to SDoH concepts. There is a small performance gap (∼4%) between Males and Females, but a large performance gap (>16 %) among race groups. The performance dropped when we applied the cancer SDoH model to the opioid cohort; fine-tuning using a smaller opioid SDoH corpus improved the performance. The extraction ratio varied in the three cancer cohorts, in which 10 SDoH could be extracted from over 70 % of cancer patients, but 9 SDoH could be extracted from less than 70 % of cancer patients. Individuals from the White and Black groups have a higher extraction ratio than other minority race groups.

Conclusions

Our SODA package achieved good performance in extracting 19 categories of SDoH from clinical narratives. The SODA package with pre-trained transformer models is available at https://github.com/uf-hobi-informatics-lab/SODA_Docker.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

从临床叙述中识别健康的社会决定因素：绩效、文件比率和潜在偏差研究

目标开发一种自然语言处理（NLP）软件包，用于从临床叙述中提取健康的社会决定因素（SDoH），研究种族和性别群体之间的偏差，测试不同疾病群体提取 SDoH 的通用性，并研究人口层面的提取比例。方法我们利用佛罗里达大学（UF）卫生部的临床记录开发了 SDoH 语料库。我们系统比较了 7 种基于转换器的大语言模型（LLM），并开发了一个开源软件包--SODA（即 SOcial DeterminAnts），以方便从临床叙述中提取 SDoH。我们检查了 SODA 在不同种族和性别群体中的性能和潜在偏差，使用包括癌症和阿片类药物使用在内的两个疾病领域测试了 SODA 的通用性，并探讨了改进策略。我们应用 SODA 从乳腺癌（n = 7971）、肺癌（n = 11804）和结直肠癌（n = 6240）队列中提取了 19 个 SDoH 类别，以评估患者级别的提取率，并检查不同种族和性别群体之间的差异。结果我们开发了一个 SDoH 语料库，该语料库使用了 629 份癌症患者的临床笔记，注释了 19 个 SDoH 类别中的 13,193 个 SDoH 概念/属性；我们还开发了另一个跨疾病验证语料库，该语料库使用了 200 份阿片类药物使用患者的笔记，注释了 4,342 个 SDoH 概念/属性。我们对 7 个转换器模型进行了比较，GatorTron 模型在 SDoH 概念提取方面取得了 0.9122 和 0.9367 的最佳平均严格/简化 F1 分数，在将属性链接到 SDoH 概念方面取得了 0.9584 和 0.9593 的最佳平均严格/简化 F1 分数。男性和女性之间的性能差距较小（∼4%），但种族群体之间的性能差距较大（>16%）。将癌症 SDoH 模型应用于阿片类药物队列时，性能有所下降；使用较小的阿片类药物 SDoH 语料库进行微调后，性能有所提高。在三个癌症队列中，提取比例各不相同，其中可以从 70% 以上的癌症患者中提取出 10 个 SDoH，但只能从不到 70% 的癌症患者中提取出 9 个 SDoH。白人和黑人群体的 SDoH 提取率高于其他少数种族群体。带有预训练转换器模型的 SODA 软件包可在 https://github.com/uf-hobi-informatics-lab/SODA_Docker 上获取。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Biomedical Informatics 医学-计算机：跨学科应用

CiteScore

8.90

自引率

6.70%

发文量

243

审稿时长

32 days

期刊介绍： The Journal of Biomedical Informatics reflects a commitment to high-quality original research papers, reviews, and commentaries in the area of biomedical informatics methodology. Although we publish articles motivated by applications in the biomedical sciences (for example, clinical medicine, health care, population health, and translational bioinformatics), the journal emphasizes reports of new methodologies and techniques that have general applicability and that form the basis for the evolving science of biomedical informatics. Articles on medical devices; evaluations of implemented systems (including clinical trials of information technologies); or papers that provide insight into a biological process, a specific disease, or treatment options would generally be more suitable for publication in other venues. Papers on applications of signal processing and image analysis are often more suitable for biomedical engineering journals or other informatics journals, although we do publish papers that emphasize the information management and knowledge representation/modeling issues that arise in the storage and use of biological signals and images. System descriptions are welcome if they illustrate and substantiate the underlying methodology that is the principal focus of the report and an effort is made to address the generalizability and/or range of application of that methodology. Note also that, given the international nature of JBI, papers that deal with specific languages other than English, or with country-specific health systems or approaches, are acceptable for JBI only if they offer generalizable lessons that are relevant to the broad JBI readership, regardless of their country, language, culture, or health system.