Multi-task transfer learning for the prediction of entity modifiers in clinical text: application to opioid use disorder case detection.

IF 1.6 3区工程技术 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Journal of Biomedical Semantics Pub Date : 2024-06-07 DOI:10.1186/s13326-024-00311-4

Abdullateef I Almudaifer, Whitney Covington, JaMor Hairston, Zachary Deitch, Ankit Anand, Caleb M Carroll, Estera Crisan, William Bradford, Lauren A Walter, Ellen F Eaton, Sue S Feldman, John D Osborne

{"title":"Multi-task transfer learning for the prediction of entity modifiers in clinical text: application to opioid use disorder case detection.","authors":"Abdullateef I Almudaifer, Whitney Covington, JaMor Hairston, Zachary Deitch, Ankit Anand, Caleb M Carroll, Estera Crisan, William Bradford, Lauren A Walter, Ellen F Eaton, Sue S Feldman, John D Osborne","doi":"10.1186/s13326-024-00311-4","DOIUrl":null,"url":null,"abstract":"Background: The semantics of entities extracted from a clinical text can be dramatically altered by modifiers, including entity negation, uncertainty, conditionality, severity, and subject. Existing models for determining modifiers of clinical entities involve regular expression or features weights that are trained independently for each modifier.Methods: We develop and evaluate a multi-task transformer architecture design where modifiers are learned and predicted jointly using the publicly available SemEval 2015 Task 14 corpus and a new Opioid Use Disorder (OUD) data set that contains modifiers shared with SemEval as well as novel modifiers specific for OUD. We evaluate the effectiveness of our multi-task learning approach versus previously published systems and assess the feasibility of transfer learning for clinical entity modifiers when only a portion of clinical modifiers are shared.Results: Our approach achieved state-of-the-art results on the ShARe corpus from SemEval 2015 Task 14, showing an increase of 1.1% on weighted accuracy, 1.7% on unweighted accuracy, and 10% on micro F1 scores.Conclusions: We show that learned weights from our shared model can be effectively transferred to a new partially matched data set, validating the use of transfer learning for clinical text modifiers.","PeriodicalId":15055,"journal":{"name":"Journal of Biomedical Semantics","volume":"15 1","pages":"11"},"PeriodicalIF":1.6000,"publicationDate":"2024-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11157899/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biomedical Semantics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1186/s13326-024-00311-4","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Background: The semantics of entities extracted from a clinical text can be dramatically altered by modifiers, including entity negation, uncertainty, conditionality, severity, and subject. Existing models for determining modifiers of clinical entities involve regular expression or features weights that are trained independently for each modifier.

Methods: We develop and evaluate a multi-task transformer architecture design where modifiers are learned and predicted jointly using the publicly available SemEval 2015 Task 14 corpus and a new Opioid Use Disorder (OUD) data set that contains modifiers shared with SemEval as well as novel modifiers specific for OUD. We evaluate the effectiveness of our multi-task learning approach versus previously published systems and assess the feasibility of transfer learning for clinical entity modifiers when only a portion of clinical modifiers are shared.

Results: Our approach achieved state-of-the-art results on the ShARe corpus from SemEval 2015 Task 14, showing an increase of 1.1% on weighted accuracy, 1.7% on unweighted accuracy, and 10% on micro F1 scores.

Conclusions: We show that learned weights from our shared model can be effectively transferred to a new partially matched data set, validating the use of transfer learning for clinical text modifiers.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于预测临床文本中实体修饰词的多任务迁移学习：应用于阿片类药物使用障碍病例检测。

背景：从临床文本中提取的实体语义可能会因修饰词（包括实体否定、不确定性、条件性、严重性和主题）而发生巨大变化。确定临床实体修饰词的现有模型涉及正则表达式或特征权重，这些模型针对每个修饰词进行独立训练：我们开发并评估了一种多任务转换器架构设计，在这种架构设计中，修饰词是利用公开的 SemEval 2015 Task 14 语料库和新的阿片类药物使用障碍（OUD）数据集共同学习和预测的。我们评估了我们的多任务学习方法与之前发布的系统相比的有效性，并评估了在只有部分临床修饰词共享的情况下对临床实体修饰词进行迁移学习的可行性：我们的方法在 SemEval 2015 Task 14 的 ShARe 语料库上取得了最先进的结果，加权准确率提高了 1.1%，非加权准确率提高了 1.7%，微 F1 分数提高了 10%：我们的研究表明，从我们的共享模型中学到的权重可以有效地转移到新的部分匹配数据集上，从而验证了转移学习在临床文本修饰符中的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Biomedical Semantics MATHEMATICAL & COMPUTATIONAL BIOLOGY-

CiteScore

4.20

自引率

5.30%

发文量

审稿时长

30 weeks

期刊介绍： Journal of Biomedical Semantics addresses issues of semantic enrichment and semantic processing in the biomedical domain. The scope of the journal covers two main areas: Infrastructure for biomedical semantics: focusing on semantic resources and repositories, meta-data management and resource description, knowledge representation and semantic frameworks, the Biomedical Semantic Web, and semantic interoperability. Semantic mining, annotation, and analysis: focusing on approaches and applications of semantic resources; and tools for investigation, reasoning, prediction, and discoveries in biomedicine.