Robust Chinese Clinical Named Entity Recognition with information bottleneck and adversarial training

IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Applied Soft Computing Pub Date : 2024-10-30 DOI:10.1016/j.asoc.2024.112409
Yunfei He , Zhiqiang Zhang , Jinlong Shen , Yuling Li , Yiwen Zhang , Weiping Ding , Fei Yang
{"title":"Robust Chinese Clinical Named Entity Recognition with information bottleneck and adversarial training","authors":"Yunfei He ,&nbsp;Zhiqiang Zhang ,&nbsp;Jinlong Shen ,&nbsp;Yuling Li ,&nbsp;Yiwen Zhang ,&nbsp;Weiping Ding ,&nbsp;Fei Yang","doi":"10.1016/j.asoc.2024.112409","DOIUrl":null,"url":null,"abstract":"<div><div>Chinese Clinical Named Entity Recognition (CCNER) aims to extract entities with specific medical significance from Chinese clinical texts, which is an important part of medical data mining. Some existing CCNER models may assume perfect text data and design complex models to improve their accuracy. However, due to the complexity of Chinese clinical entity semantics and the professionalism of annotation, Chinese clinical texts are prone to contain irregular misrepresentations and sparse entity labeling. That would lead to noisy or incomplete text features extracted by CCNER, seriously threatening the robustness of recognition in real-world scenarios. To address these problems, we propose the Robust Chinese Clinical Named Entity Recognition model (RCCNER). RCCNER comprises three essential components: multifaceted text representation, robust feature extraction, and robust model training. For multifaceted text representation, the model enhances consistency and collaboration between feature representations by integrating word embedding, radical embedding, and dictionary embedding to help withstand textual noise. Then, guided by the information bottleneck and the Hilbert–Schmidt independence criterion, robust feature extraction compresses the dependency between text representation and extracted features, while enhancing the dependency between extracted features and labels, which consequently provides reliable text features for robust recognition. The robust model training aspect leverages adversarial training to diminish RCCNER’s sensitivity to noise disturbances and sparse entity labeling, thereby reinforcing its robustness in entity recognition. RCCNER collaboratively enhances the noise immunity through text representation, text feature extraction and model training. Several experiments on two popular public datasets validate the effectiveness and robustness of RCCNER.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"167 ","pages":"Article 112409"},"PeriodicalIF":7.2000,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494624011839","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Chinese Clinical Named Entity Recognition (CCNER) aims to extract entities with specific medical significance from Chinese clinical texts, which is an important part of medical data mining. Some existing CCNER models may assume perfect text data and design complex models to improve their accuracy. However, due to the complexity of Chinese clinical entity semantics and the professionalism of annotation, Chinese clinical texts are prone to contain irregular misrepresentations and sparse entity labeling. That would lead to noisy or incomplete text features extracted by CCNER, seriously threatening the robustness of recognition in real-world scenarios. To address these problems, we propose the Robust Chinese Clinical Named Entity Recognition model (RCCNER). RCCNER comprises three essential components: multifaceted text representation, robust feature extraction, and robust model training. For multifaceted text representation, the model enhances consistency and collaboration between feature representations by integrating word embedding, radical embedding, and dictionary embedding to help withstand textual noise. Then, guided by the information bottleneck and the Hilbert–Schmidt independence criterion, robust feature extraction compresses the dependency between text representation and extracted features, while enhancing the dependency between extracted features and labels, which consequently provides reliable text features for robust recognition. The robust model training aspect leverages adversarial training to diminish RCCNER’s sensitivity to noise disturbances and sparse entity labeling, thereby reinforcing its robustness in entity recognition. RCCNER collaboratively enhances the noise immunity through text representation, text feature extraction and model training. Several experiments on two popular public datasets validate the effectiveness and robustness of RCCNER.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用信息瓶颈和对抗训练实现鲁棒的中文临床命名实体识别
中文临床命名实体识别(CCNER)旨在从中文临床文本中提取具有特定医学意义的实体,是医学数据挖掘的重要组成部分。现有的一些 CCNER 模型可能会假定文本数据完美无缺,并设计复杂的模型来提高其准确性。然而,由于中文临床实体语义的复杂性和标注的专业性,中文临床文本容易包含不规范的错误表述和稀疏的实体标注。这将导致 CCNER 提取的文本特征有噪声或不完整,严重威胁实际场景中识别的鲁棒性。针对这些问题,我们提出了鲁棒中文临床命名实体识别模型(RCCNER)。RCCNER 包括三个基本组成部分:多元文本表示、鲁棒性特征提取和鲁棒性模型训练。在多元文本表示方面,该模型通过整合词嵌入、部首嵌入和字典嵌入来增强特征表示之间的一致性和协作性,以帮助抵御文本噪声。然后,在信息瓶颈和希尔伯特-施密特独立性准则的指导下,鲁棒特征提取压缩了文本表示与提取特征之间的依赖性,同时增强了提取特征与标签之间的依赖性,从而为鲁棒性识别提供了可靠的文本特征。鲁棒模型训练方面利用对抗训练来降低 RCCNER 对噪声干扰和稀疏实体标签的敏感性,从而增强其在实体识别中的鲁棒性。RCCNER 通过文本表示、文本特征提取和模型训练协同增强了抗噪能力。在两个流行的公共数据集上进行的多项实验验证了 RCCNER 的有效性和鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Applied Soft Computing
Applied Soft Computing 工程技术-计算机:跨学科应用
CiteScore
15.80
自引率
6.90%
发文量
874
审稿时长
10.9 months
期刊介绍: Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.
期刊最新文献
Optimized hybrid XGBoost-CatBoost model for enhanced prediction of concrete strength and reliability analysis using Monte Carlo simulations A Z-number-based three-way decision method with classification-based state determination for the evaluation of new energy enterprises Deep supervision network with contrastive learning for zero-shot sketch-based image retrieval Adaptive deep learning models for efficient multivariate anomaly detection in IoT infrastructures A robust rank aggregation method for malicious disturbance based on objective credit
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1