基于特征扩展和文档一致性的中医事件检测

Chen Wang, Pengjun Zhai, Yu Fang
{"title":"基于特征扩展和文档一致性的中医事件检测","authors":"Chen Wang, Pengjun Zhai, Yu Fang","doi":"10.1109/CACRE50138.2020.9230246","DOIUrl":null,"url":null,"abstract":"Extracting valuable medical events from Chinese electronic medical records has important practical significance and application value for electronic medical record text mining, and event detection is a critical step in the event extraction task. Existing research methods for medical event detection in Chinese are mainly based on pattern matching and clustering, and they have two problems $:(1)$ None of them consider the named entities distribution characteristics of medical events. (2) Ignore the distribution of document consistency between medical events in each document. Therefore, this paper proposes an event detection method based on feature extension and document consistency. Firstly, design the medical event representation template and construct the event trigger dictionary according to the ACE standard. Secondly, use semiautomatic corpus labeling method to label entities and events. Then, based on the basic features, according to the distribution characteristics of the entities in the event, different entity information features are selected as extension features. Finally, use the consistency of the distribution of medical documents to improve the final result of event detection. The experimental results show that our method is significantly superior to the baseline.","PeriodicalId":325195,"journal":{"name":"2020 5th International Conference on Automation, Control and Robotics Engineering (CACRE)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Chinese medical event detection based on feature extension and document consistency\",\"authors\":\"Chen Wang, Pengjun Zhai, Yu Fang\",\"doi\":\"10.1109/CACRE50138.2020.9230246\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Extracting valuable medical events from Chinese electronic medical records has important practical significance and application value for electronic medical record text mining, and event detection is a critical step in the event extraction task. Existing research methods for medical event detection in Chinese are mainly based on pattern matching and clustering, and they have two problems $:(1)$ None of them consider the named entities distribution characteristics of medical events. (2) Ignore the distribution of document consistency between medical events in each document. Therefore, this paper proposes an event detection method based on feature extension and document consistency. Firstly, design the medical event representation template and construct the event trigger dictionary according to the ACE standard. Secondly, use semiautomatic corpus labeling method to label entities and events. Then, based on the basic features, according to the distribution characteristics of the entities in the event, different entity information features are selected as extension features. Finally, use the consistency of the distribution of medical documents to improve the final result of event detection. The experimental results show that our method is significantly superior to the baseline.\",\"PeriodicalId\":325195,\"journal\":{\"name\":\"2020 5th International Conference on Automation, Control and Robotics Engineering (CACRE)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 5th International Conference on Automation, Control and Robotics Engineering (CACRE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CACRE50138.2020.9230246\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Automation, Control and Robotics Engineering (CACRE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CACRE50138.2020.9230246","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

从中文电子病历中提取有价值的医疗事件对于电子病历文本挖掘具有重要的现实意义和应用价值,而事件检测是事件提取任务中的关键步骤。现有的中文医学事件检测研究方法主要基于模式匹配和聚类,存在两个问题:(1)没有考虑医学事件的命名实体分布特征。(2)忽略每个文档中医疗事件之间的文档一致性分布。为此,本文提出了一种基于特征扩展和文档一致性的事件检测方法。首先,根据ACE标准设计医疗事件表示模板,构建事件触发字典。其次,采用半自动语料库标注方法对实体和事件进行标注。然后,在基本特征的基础上,根据实体在事件中的分布特征,选择不同的实体信息特征作为扩展特征。最后,利用医疗文件分布的一致性来改进事件检测的最终结果。实验结果表明,该方法明显优于基线方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Chinese medical event detection based on feature extension and document consistency
Extracting valuable medical events from Chinese electronic medical records has important practical significance and application value for electronic medical record text mining, and event detection is a critical step in the event extraction task. Existing research methods for medical event detection in Chinese are mainly based on pattern matching and clustering, and they have two problems $:(1)$ None of them consider the named entities distribution characteristics of medical events. (2) Ignore the distribution of document consistency between medical events in each document. Therefore, this paper proposes an event detection method based on feature extension and document consistency. Firstly, design the medical event representation template and construct the event trigger dictionary according to the ACE standard. Secondly, use semiautomatic corpus labeling method to label entities and events. Then, based on the basic features, according to the distribution characteristics of the entities in the event, different entity information features are selected as extension features. Finally, use the consistency of the distribution of medical documents to improve the final result of event detection. The experimental results show that our method is significantly superior to the baseline.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Model Establishment of Decision Tree Algorithm and Its Application in Vehicle Fault Prediction Analysis Cooperative Level Curve Tracking in Advection-Diffusion Fields Spatial Pooling Network For Lane Line Segmentation Filters navigation and positioning based on mining vehicle motion model Dynamic Optimal Scheduling of Microgrid Based on ε constraint multi-objective Biogeography-based Optimization Algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1