Clinical Text Generation: Are We There Yet?

IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Annual Review of Biomedical Data Science Pub Date : 2025-08-01 Epub Date: 2025-03-18 DOI:10.1146/annurev-biodatasci-103123-095202
Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol
{"title":"Clinical Text Generation: Are We There Yet?","authors":"Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol","doi":"10.1146/annurev-biodatasci-103123-095202","DOIUrl":null,"url":null,"abstract":"<p><p>Generative artificial intelligence (AI), operationalized as large language models, is increasingly used in the biomedical field to assist with a range of text processing tasks including text classification, information extraction, and decision support. In this article, we focus on the primary purpose of generative language models, namely the production of unstructured text. We review past and current methods used to generate text as well as methods for evaluating open text generation, i.e., in contexts where no reference text is available for comparison. We discuss clinical applications that can benefit from high quality, ethically designed text generation, such as clinical note generation and synthetic text generation in support of secondary use of health data. We also raise awareness of the risks involved with generative AI such as overconfidence in outputs due to anthropomorphism and the risk of representational and allocation harms due to biases.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"173-198"},"PeriodicalIF":6.0000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annual Review of Biomedical Data Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1146/annurev-biodatasci-103123-095202","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/18 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Generative artificial intelligence (AI), operationalized as large language models, is increasingly used in the biomedical field to assist with a range of text processing tasks including text classification, information extraction, and decision support. In this article, we focus on the primary purpose of generative language models, namely the production of unstructured text. We review past and current methods used to generate text as well as methods for evaluating open text generation, i.e., in contexts where no reference text is available for comparison. We discuss clinical applications that can benefit from high quality, ethically designed text generation, such as clinical note generation and synthetic text generation in support of secondary use of health data. We also raise awareness of the risks involved with generative AI such as overconfidence in outputs due to anthropomorphism and the risk of representational and allocation harms due to biases.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
临床文本生成:我们做到了吗?
生成式人工智能(AI)作为大型语言模型,越来越多地用于生物医学领域,以协助完成一系列文本处理任务,包括文本分类、信息提取和决策支持。在本文中,我们将重点讨论生成语言模型的主要目的,即生成非结构化文本。我们回顾了过去和当前用于生成文本的方法,以及评估开放文本生成的方法,即在没有参考文本可用于比较的上下文中。我们讨论了可以从高质量、合乎伦理设计的文本生成中受益的临床应用,例如临床记录生成和合成文本生成,以支持健康数据的二次使用。我们还提高了对生成式人工智能所涉及的风险的认识,例如由于拟人化而对产出的过度自信,以及由于偏见而导致的代表性和分配损害的风险。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
11.10
自引率
1.70%
发文量
0
期刊介绍: The Annual Review of Biomedical Data Science provides comprehensive expert reviews in biomedical data science, focusing on advanced methods to store, retrieve, analyze, and organize biomedical data and knowledge. The scope of the journal encompasses informatics, computational, artificial intelligence (AI), and statistical approaches to biomedical data, including the sub-fields of bioinformatics, computational biology, biomedical informatics, clinical and clinical research informatics, biostatistics, and imaging informatics. The mission of the journal is to identify both emerging and established areas of biomedical data science, and the leaders in these fields.
期刊最新文献
Statistical Methods for Understanding Trajectories in Genetic Epidemiology. Privacy and Security Throughout the Health Data Life Cycle: From Primary Care to Research Networks. Electronic Health Record-Linked Biobank Expansion Reveals Global Health Inequities. Precision Oncology: Multimodal and Multiscale Methods to Promote Mechanistic Understanding. Spatial Transcriptomics Brings New Challenges and Opportunities for Trajectory Inference.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1