一种改进的基于词向量的病案症状提取方法

Zhongmin Liu, Zhiming Luo, Jiajun Xu, Shaozi Li
{"title":"一种改进的基于词向量的病案症状提取方法","authors":"Zhongmin Liu, Zhiming Luo, Jiajun Xu, Shaozi Li","doi":"10.1109/ITME53901.2021.00082","DOIUrl":null,"url":null,"abstract":"Extracting and standardizing symptoms from traditional Chinese medical records plays an important role in intelligent diagnosis. Recently, abundant word vector models have been developed and used in natural language processing tasks due to their powerful performance. However, simply using a word vector model as core to analysis text is hard to satisfy both time and precision requirements. To improve this situation, we introduce an improved word vector-based symptom extraction method for traditional Chinese medicine which can extract and standardize symptoms in original medical texts written in Chinese. We design this method into three parts, Word Segmentation, Word Vector Generation, and Term Substitution. Experimental results on our dataset show that our method has a good effect in extracting medical symptoms and discarding redundant words. Compared to other baseline models of word vector representation, our method performs well in general performance of efficiency and accuracy.","PeriodicalId":6774,"journal":{"name":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","volume":"17 1","pages":"379-384"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An Improved Word Vector-Based Symptom Extraction Method for Traditional Chinese Medical Record Analysis\",\"authors\":\"Zhongmin Liu, Zhiming Luo, Jiajun Xu, Shaozi Li\",\"doi\":\"10.1109/ITME53901.2021.00082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Extracting and standardizing symptoms from traditional Chinese medical records plays an important role in intelligent diagnosis. Recently, abundant word vector models have been developed and used in natural language processing tasks due to their powerful performance. However, simply using a word vector model as core to analysis text is hard to satisfy both time and precision requirements. To improve this situation, we introduce an improved word vector-based symptom extraction method for traditional Chinese medicine which can extract and standardize symptoms in original medical texts written in Chinese. We design this method into three parts, Word Segmentation, Word Vector Generation, and Term Substitution. Experimental results on our dataset show that our method has a good effect in extracting medical symptoms and discarding redundant words. Compared to other baseline models of word vector representation, our method performs well in general performance of efficiency and accuracy.\",\"PeriodicalId\":6774,\"journal\":{\"name\":\"2021 11th International Conference on Information Technology in Medicine and Education (ITME)\",\"volume\":\"17 1\",\"pages\":\"379-384\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 11th International Conference on Information Technology in Medicine and Education (ITME)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITME53901.2021.00082\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITME53901.2021.00082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

中医病案症状提取与规范在智能诊断中具有重要作用。近年来,由于词向量模型具有强大的性能,在自然语言处理任务中得到了广泛的应用。然而,单纯以词向量模型为核心进行文本分析很难同时满足时间和精度要求。为了改善这种情况,我们引入了一种改进的基于词向量的中医症状提取方法,该方法可以提取和规范中文原始医学文本中的症状。我们将该方法设计为三个部分:分词、词向量生成和术语替换。在我们的数据集上的实验结果表明,我们的方法在医学症状提取和去除冗余词方面有很好的效果。与其他基线词向量表示模型相比,我们的方法在效率和准确性方面表现良好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
An Improved Word Vector-Based Symptom Extraction Method for Traditional Chinese Medical Record Analysis
Extracting and standardizing symptoms from traditional Chinese medical records plays an important role in intelligent diagnosis. Recently, abundant word vector models have been developed and used in natural language processing tasks due to their powerful performance. However, simply using a word vector model as core to analysis text is hard to satisfy both time and precision requirements. To improve this situation, we introduce an improved word vector-based symptom extraction method for traditional Chinese medicine which can extract and standardize symptoms in original medical texts written in Chinese. We design this method into three parts, Word Segmentation, Word Vector Generation, and Term Substitution. Experimental results on our dataset show that our method has a good effect in extracting medical symptoms and discarding redundant words. Compared to other baseline models of word vector representation, our method performs well in general performance of efficiency and accuracy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Committees ITME 2021 Conference Organization Research on Assistant Diagnostic Method of TCM Based on BERT Drug-Drug Adverse Reactions Prediction Based On Signed Network Java Curriculum Design Concept that Integrates Design Thinking and Heuristic Teaching Keyword-based Data Augmentation Guided Chinese Medical Questions Classification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1