Extraction of Substance Use Information From Clinical Notes: Generative Pretrained Transformer-Based Investigation.

IF 3.1 3区医学 Q2 MEDICAL INFORMATICS JMIR Medical Informatics Pub Date : 2024-08-19 DOI:10.2196/56243

Fatemeh Shah-Mohammadi, Joseph Finkelstein

{"title":"Extraction of Substance Use Information From Clinical Notes: Generative Pretrained Transformer-Based Investigation.","authors":"Fatemeh Shah-Mohammadi, Joseph Finkelstein","doi":"10.2196/56243","DOIUrl":null,"url":null,"abstract":"Background: Understanding the multifaceted nature of health outcomes requires a comprehensive examination of the social, economic, and environmental determinants that shape individual well-being. Among these determinants, behavioral factors play a crucial role, particularly the consumption patterns of psychoactive substances, which have important implications on public health. The Global Burden of Disease Study shows a growing impact in disability-adjusted life years due to substance use. The successful identification of patients' substance use information equips clinical care teams to address substance-related issues more effectively, enabling targeted support and ultimately improving patient outcomes.Objective: Traditional natural language processing methods face limitations in accurately parsing diverse clinical language associated with substance use. Large language models offer promise in overcoming these challenges by adapting to diverse language patterns. This study investigates the application of the generative pretrained transformer (GPT) model in specific GPT-3.5 for extracting tobacco, alcohol, and substance use information from patient discharge summaries in zero-shot and few-shot learning settings. This study contributes to the evolving landscape of health care informatics by showcasing the potential of advanced language models in extracting nuanced information critical for enhancing patient care.Methods: The main data source for analysis in this paper is Medical Information Mart for Intensive Care III data set. Among all notes in this data set, we focused on discharge summaries. Prompt engineering was undertaken, involving an iterative exploration of diverse prompts. Leveraging carefully curated examples and refined prompts, we investigate the model's proficiency through zero-shot as well as few-shot prompting strategies.Results: Results show GPT's varying effectiveness in identifying mentions of tobacco, alcohol, and substance use across learning scenarios. Zero-shot learning showed high accuracy in identifying substance use, whereas few-shot learning reduced accuracy but improved in identifying substance use status, enhancing recall and F1-score at the expense of lower precision.Conclusions: Excellence of zero-shot learning in precisely extracting text span mentioning substance use demonstrates its effectiveness in situations in which comprehensive recall is important. Conversely, few-shot learning offers advantages when accurately determining the status of substance use is the primary focus, even if it involves a trade-off in precision. The results contribute to enhancement of early detection and intervention strategies, tailor treatment plans with greater precision, and ultimately, contribute to a holistic understanding of patient health profiles. By integrating these artificial intelligence-driven methods into electronic health record systems, clinicians can gain immediate, comprehensive insights into substance use that results in shaping interventions that are not only timely but also more personalized and effective.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":" ","pages":"e56243"},"PeriodicalIF":3.1000,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11369538/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/56243","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Understanding the multifaceted nature of health outcomes requires a comprehensive examination of the social, economic, and environmental determinants that shape individual well-being. Among these determinants, behavioral factors play a crucial role, particularly the consumption patterns of psychoactive substances, which have important implications on public health. The Global Burden of Disease Study shows a growing impact in disability-adjusted life years due to substance use. The successful identification of patients' substance use information equips clinical care teams to address substance-related issues more effectively, enabling targeted support and ultimately improving patient outcomes.

Objective: Traditional natural language processing methods face limitations in accurately parsing diverse clinical language associated with substance use. Large language models offer promise in overcoming these challenges by adapting to diverse language patterns. This study investigates the application of the generative pretrained transformer (GPT) model in specific GPT-3.5 for extracting tobacco, alcohol, and substance use information from patient discharge summaries in zero-shot and few-shot learning settings. This study contributes to the evolving landscape of health care informatics by showcasing the potential of advanced language models in extracting nuanced information critical for enhancing patient care.

Methods: The main data source for analysis in this paper is Medical Information Mart for Intensive Care III data set. Among all notes in this data set, we focused on discharge summaries. Prompt engineering was undertaken, involving an iterative exploration of diverse prompts. Leveraging carefully curated examples and refined prompts, we investigate the model's proficiency through zero-shot as well as few-shot prompting strategies.

Results: Results show GPT's varying effectiveness in identifying mentions of tobacco, alcohol, and substance use across learning scenarios. Zero-shot learning showed high accuracy in identifying substance use, whereas few-shot learning reduced accuracy but improved in identifying substance use status, enhancing recall and F₁-score at the expense of lower precision.

Conclusions: Excellence of zero-shot learning in precisely extracting text span mentioning substance use demonstrates its effectiveness in situations in which comprehensive recall is important. Conversely, few-shot learning offers advantages when accurately determining the status of substance use is the primary focus, even if it involves a trade-off in precision. The results contribute to enhancement of early detection and intervention strategies, tailor treatment plans with greater precision, and ultimately, contribute to a holistic understanding of patient health profiles. By integrating these artificial intelligence-driven methods into electronic health record systems, clinicians can gain immediate, comprehensive insights into substance use that results in shaping interventions that are not only timely but also more personalized and effective.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

从临床笔记中提取药物使用信息：基于 GPT 的研究。

背景：要了解健康结果的多面性，就必须全面研究影响个人福祉的社会、经济和环境决定因素。在这些决定因素中，行为因素起着至关重要的作用，尤其是精神活性物质的消费模式，对公共卫生有着重要影响。全球疾病负担研究》显示，药物使用对残疾调整生命年的影响越来越大。成功识别患者的药物使用信息能让临床护理团队更有效地解决药物相关问题，从而提供有针对性的支持，最终改善患者的治疗效果：传统的自然语言处理（NLP）方法在准确解析与药物使用相关的各种临床语言方面存在局限性。大型语言模型（LLM）通过适应不同的语言模式，有望克服这些挑战。本研究调查了生成式预训练转换器（GPT）模型的应用，特别是 GPT-3.5- 在零镜头和少镜头学习设置中从患者出院摘要中提取烟草、酒精和药物使用信息的应用。这项研究通过展示高级语言模型在提取对加强患者护理至关重要的细微信息方面的潜力，为不断发展的医疗保健信息学做出了贡献：本文分析的主要数据源是重症监护医学信息市场 III（MIMIC-III）数据集。在该数据集中的所有笔记中，我们重点关注出院摘要。我们进行了提示工程，包括对各种提示的反复探索。利用精心策划的示例和改进的提示，我们研究了该模型在零次和少量提示策略下的能力：所展示的结果凸显了 GPT 在提取提及烟草、酒精和药物使用的文本跨度时，在 "零 "和 "少 "两种学习场景下的截然不同的表现。在零次学习场景中，提取烟草、酒精和药物使用信息的准确率明显较高。然而，在少次学习的情况下，准确率则明显下降。相反，与零次学习相比，少次学习在设计物质使用状况方面有显著提高，召回率和 F1 分数也有显著提高。然而，这种提高的代价是，不仅在提取提及使用情况的文本跨度方面，而且在提取使用情况的精确度方面都有所下降：结论：零点学习在精确提取提及药物使用的文本跨度方面的卓越表现，证明了它在全面召回率非常重要的情况下的有效性。相反，当准确判断药物使用状况是主要重点时，即使需要在精确度上做出权衡，零点学习也具有优势。这些结果有助于加强早期检测和干预策略，更精确地定制治疗计划，并最终有助于全面了解患者的健康状况。通过将这些人工智能驱动的方法整合到电子健康记录系统中，临床医生可以即时、全面地了解药物使用情况，从而制定出不仅及时，而且更加个性化和有效的干预措施：

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

JMIR Medical Informatics Medicine-Health Informatics

CiteScore

7.90

自引率

3.10%

发文量

173

审稿时长

12 weeks

期刊介绍： JMIR Medical Informatics (JMI, ISSN 2291-9694) is a top-rated, tier A journal which focuses on clinical informatics, big data in health and health care, decision support for health professionals, electronic health records, ehealth infrastructures and implementation. It has a focus on applied, translational research, with a broad readership including clinicians, CIOs, engineers, industry and health informatics professionals. Published by JMIR Publications, publisher of the Journal of Medical Internet Research (JMIR), the leading eHealth/mHealth journal (Impact Factor 2016: 5.175), JMIR Med Inform has a slightly different scope (emphasizing more on applications for clinicians and health professionals rather than consumers/citizens, which is the focus of JMIR), publishes even faster, and also allows papers which are more technical or more formative than what would be published in the Journal of Medical Internet Research.