NLP modeling recommendations for restricted data availability in clinical settings.

IF 5.5 3区 医学 Q2 MEDICAL INFORMATICS BMC Medical Informatics and Decision Making Pub Date : 2025-03-07 DOI:10.1186/s12911-025-02948-2
Fabián Villena, Felipe Bravo-Marquez, Jocelyn Dunstan
{"title":"NLP modeling recommendations for restricted data availability in clinical settings.","authors":"Fabián Villena, Felipe Bravo-Marquez, Jocelyn Dunstan","doi":"10.1186/s12911-025-02948-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Clinical decision-making in healthcare often relies on unstructured text data, which can be challenging to analyze using traditional methods. Natural Language Processing (NLP) has emerged as a promising solution, but its application in clinical settings is hindered by restricted data availability and the need for domain-specific knowledge.</p><p><strong>Methods: </strong>We conducted an experimental analysis to evaluate the performance of various NLP modeling paradigms on multiple clinical NLP tasks in Spanish. These tasks included referral prioritization and referral specialty classification. We simulated three clinical settings with varying levels of data availability and evaluated the performance of four foundation models.</p><p><strong>Results: </strong>Clinical-specific pre-trained language models (PLMs) achieved the highest performance across tasks. For referral prioritization, Clinical PLMs attained an 88.85 % macro F1 score when fine-tuned. In referral specialty classification, the same models achieved a 53.79 % macro F1 score, surpassing domain-agnostic models. Continuing pre-training with environment-specific data improved model performance, but the gains were marginal compared to the computational resources required. Few-shot learning with large language models (LLMs) demonstrated lower performance but showed potential in data-scarce scenarios.</p><p><strong>Conclusions: </strong>Our study provides evidence-based recommendations for clinical NLP practitioners on selecting modeling paradigms based on data availability. We highlight the importance of considering data availability, task complexity, and institutional maturity when designing and training clinical NLP models. Our findings can inform the development of effective clinical NLP solutions in real-world settings.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"116"},"PeriodicalIF":5.5000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11889813/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-02948-2","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Clinical decision-making in healthcare often relies on unstructured text data, which can be challenging to analyze using traditional methods. Natural Language Processing (NLP) has emerged as a promising solution, but its application in clinical settings is hindered by restricted data availability and the need for domain-specific knowledge.

Methods: We conducted an experimental analysis to evaluate the performance of various NLP modeling paradigms on multiple clinical NLP tasks in Spanish. These tasks included referral prioritization and referral specialty classification. We simulated three clinical settings with varying levels of data availability and evaluated the performance of four foundation models.

Results: Clinical-specific pre-trained language models (PLMs) achieved the highest performance across tasks. For referral prioritization, Clinical PLMs attained an 88.85 % macro F1 score when fine-tuned. In referral specialty classification, the same models achieved a 53.79 % macro F1 score, surpassing domain-agnostic models. Continuing pre-training with environment-specific data improved model performance, but the gains were marginal compared to the computational resources required. Few-shot learning with large language models (LLMs) demonstrated lower performance but showed potential in data-scarce scenarios.

Conclusions: Our study provides evidence-based recommendations for clinical NLP practitioners on selecting modeling paradigms based on data availability. We highlight the importance of considering data availability, task complexity, and institutional maturity when designing and training clinical NLP models. Our findings can inform the development of effective clinical NLP solutions in real-world settings.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
NLP建模建议限制数据可用性在临床设置。
背景:医疗保健中的临床决策通常依赖于非结构化文本数据,使用传统方法分析这些数据可能具有挑战性。自然语言处理(NLP)已成为一种很有前途的解决方案,但其在临床环境中的应用受到有限的数据可用性和对特定领域知识的需求的阻碍。方法:通过实验分析,评估各种NLP建模范式在西班牙语多项临床NLP任务中的表现。这些任务包括转诊优先级和转诊专业分类。我们模拟了三种具有不同数据可用性水平的临床环境,并评估了四种基础模型的性能。结果:临床特异性预训练语言模型(PLMs)在跨任务中取得了最高的性能。对于转诊优先级,临床PLMs在微调后达到88.85%的宏观F1评分。在转诊专科分类中,相同模型的宏观F1得分达到53.79%,超过了领域不可知模型。继续使用特定于环境的数据进行预训练可以提高模型的性能,但与所需的计算资源相比,这种增益是微不足道的。使用大型语言模型(llm)的少量学习表现出较低的性能,但在数据稀缺的场景中显示出潜力。结论:我们的研究为临床NLP从业者提供了基于数据可用性选择建模范式的循证建议。我们强调在设计和训练临床NLP模型时考虑数据可用性、任务复杂性和制度成熟度的重要性。我们的研究结果可以为现实世界中有效的临床NLP解决方案的发展提供信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
期刊最新文献
Risk stratification for reoperation outcomes in recurrent or persistent papillary thyroid carcinoma: development and validation of a machine learning model. Artificial intelligence and medical futility. Automated classification of clinical T staging pulmonary oncology from chest CT reports: a natural language processing framework for real-world data utilization. Transformer-based models in dentistry: a systematic review. Predicting coronary artery lesions in Kawasaki disease: a nomogram based on LASSO regression feature selection.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1