Traditional chinese medicine synonymous term conversion: A bidirectional encoder representations from transformers-based model for converting synonymous terms in traditional chinese medicine

IF 4.3 3区 医学 Q1 INTEGRATIVE & COMPLEMENTARY MEDICINE World Journal of Traditional Chinese Medicine Pub Date : 2023-04-01 DOI:10.4103/2311-8571.378171
Lu Zhou, Chaohong Wu, Xi-Ting Wang, Shuangqiao Liu, Yizhuo Zhang, Yue-Meng Sun, Jian Cui, Caiyan Li, Hui-Min Yuan, Yan Sun, Feng-jie Zheng, Feng-qin Xu, Yuhang Li
{"title":"Traditional chinese medicine synonymous term conversion: A bidirectional encoder representations from transformers-based model for converting synonymous terms in traditional chinese medicine","authors":"Lu Zhou, Chaohong Wu, Xi-Ting Wang, Shuangqiao Liu, Yizhuo Zhang, Yue-Meng Sun, Jian Cui, Caiyan Li, Hui-Min Yuan, Yan Sun, Feng-jie Zheng, Feng-qin Xu, Yuhang Li","doi":"10.4103/2311-8571.378171","DOIUrl":null,"url":null,"abstract":"Background: The medical records of traditional Chinese medicine (TCM) contain numerous synonymous terms with different descriptions, which is not conducive to computer-aided data mining of TCM. However, there is a lack of models available to normalize synonymous TCM terms. Therefore, construction of a synonymous term conversion (STC) model for normalizing synonymous TCM terms is necessary. Methods: Based on the neural networks of bidirectional encoder representations from transformers (BERT), four types of TCM STC models were designed: Models based on BERT and text classification, text sequence generation, named entity recognition, and text matching. The superior STC model was selected on the basis of its performance in converting synonymous terms. Moreover, three misjudgment inspection methods for the conversion results of the STC model based on inconsistency were proposed to find incorrect term conversion: Neuron random deactivation, output comparison of multiple isomorphic models, and output comparison of multiple heterogeneous models (OCMH). Results: The classification-based STC model outperformed the other STC task models. It achieved F1 scores of 0.91, 0.91, and 0.83 for performing symptoms, patterns, and treatments STC tasks, respectively. The OCMH method showed the best performance in misjudgment inspection, with wrong detection rates of 0.80, 0.84, and 0.90 in the term conversion results for symptoms, patterns, and treatments, respectively. Conclusion: The TCM STC model based on classification achieved superior performance in converting synonymous terms for symptoms, patterns, and treatments. The misjudgment inspection method based on OCMH showed superior performance in identifying incorrect outputs.","PeriodicalId":23692,"journal":{"name":"World Journal of Traditional Chinese Medicine","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Journal of Traditional Chinese Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.4103/2311-8571.378171","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"INTEGRATIVE & COMPLEMENTARY MEDICINE","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The medical records of traditional Chinese medicine (TCM) contain numerous synonymous terms with different descriptions, which is not conducive to computer-aided data mining of TCM. However, there is a lack of models available to normalize synonymous TCM terms. Therefore, construction of a synonymous term conversion (STC) model for normalizing synonymous TCM terms is necessary. Methods: Based on the neural networks of bidirectional encoder representations from transformers (BERT), four types of TCM STC models were designed: Models based on BERT and text classification, text sequence generation, named entity recognition, and text matching. The superior STC model was selected on the basis of its performance in converting synonymous terms. Moreover, three misjudgment inspection methods for the conversion results of the STC model based on inconsistency were proposed to find incorrect term conversion: Neuron random deactivation, output comparison of multiple isomorphic models, and output comparison of multiple heterogeneous models (OCMH). Results: The classification-based STC model outperformed the other STC task models. It achieved F1 scores of 0.91, 0.91, and 0.83 for performing symptoms, patterns, and treatments STC tasks, respectively. The OCMH method showed the best performance in misjudgment inspection, with wrong detection rates of 0.80, 0.84, and 0.90 in the term conversion results for symptoms, patterns, and treatments, respectively. Conclusion: The TCM STC model based on classification achieved superior performance in converting synonymous terms for symptoms, patterns, and treatments. The misjudgment inspection method based on OCMH showed superior performance in identifying incorrect outputs.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
中药同义术语转换:一种基于变压器的双向编码器表示模型,用于中药同义术语转换
背景:中医病案中存在大量同义术语,且描述不一,不利于中医计算机辅助数据挖掘。然而,目前缺乏对同义中医术语进行规范化的模型。因此,有必要构建同义术语转换(STC)模型对同义中医术语进行规范化。方法:基于BERT(双向编码器表示)神经网络,设计了基于BERT和文本分类、文本序列生成、命名实体识别和文本匹配的四种TCM STC模型。基于同义词转换的性能,选择了较优的STC模型。针对STC模型基于不一致性的转换结果,提出了神经元随机失活、多个同构模型输出比较和多个异构模型输出比较(OCMH)三种误判检查方法,以发现不正确的项转换。结果:基于分类的STC任务模型优于其他STC任务模型。在执行症状、模式和治疗STC任务时,其F1得分分别为0.91、0.91和0.83。OCMH法在误判检验中表现最好,对症状、模式和治疗的术语转换结果的错误率分别为0.80、0.84和0.90。结论:基于分类的中医STC模型在症状、模式和治疗同义术语转换方面具有较好的效果。基于OCMH的误判检测方法在识别错误输出方面表现出优异的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
World Journal of Traditional Chinese Medicine
World Journal of Traditional Chinese Medicine Medicine-Complementary and Alternative Medicine
CiteScore
5.40
自引率
2.30%
发文量
259
审稿时长
24 weeks
期刊最新文献
Metaphorical Thinking of Traditional Chinese Medicine and Its Features Investigation on the Therapeutic Effect of Polygonum multiflorum Thunb. in Chronic Stress-induced Hair Loss in Mice Coupled with Metabolomics and Proteomics Inhibiting Effect and Mechanism of Aconitum tanguticum (Maxim.) Stapf on Intestinal Fibrosis of CCD-18Co Cells Rapid Classification and Identification of Chemical Compounds and Semi-Quantitative Metabolism of Huangkui Capsules and the Protective Effects of Its Quercetin Derivatives against Tacrolimus-induced HK-cell Reduction Protective Effect of Shenfu Injection against Sepsis-induced Acute Lung Injury by Suppressing Inflammation and Apoptosis Through the Regulation of the Janus Kinase 2/Signal Transducer and Activator of Transcription 3 Pathway
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1