基于形式概念连通性距离的中文实体关系抽取策略

Chun-ming. Cheng
{"title":"基于形式概念连通性距离的中文实体关系抽取策略","authors":"Chun-ming. Cheng","doi":"10.1109/IALP.2013.12","DOIUrl":null,"url":null,"abstract":"As Chinese expression diversity, there are some shortcomings in traditional algorithms of Chinese entity relationship extraction. For example, workload of labeling by hand on training corpus is too large, the generated relationship schemas usually have poor versatility, and it is difficult to select or integrate high quality domain ontology for extraction task. Moreover, these algorithms don't consider the fact that the entity relationship usually has different meanings with the different topic backgrounds or with the various concept granularities. The paper, utilizing statistical method and linguistics knowledge, carries out the work of crawling, parsing, filling, builds the relational formal concept lattice with Chinese entities context, and acquires entity relationship schemas described by relational formal concept. With these relational schemas and concept built above, we carry out the entry concept correlation computing and the predicate text flexible matching, and get the concept connectivity distance between entities to achieve the non-single and indirect entity relation extraction. The granularities of concept in relation extraction are more flexible, and the relational schema described by formal concept is more versatile and robust. The method in this paper provides a better semantic description for the extracted relationship, and obtains a good relation extraction performance.","PeriodicalId":413833,"journal":{"name":"2013 International Conference on Asian Language Processing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Relationship Extraction Tactics of Chinese Entity Based on Formal Concept Connectivity Distance\",\"authors\":\"Chun-ming. Cheng\",\"doi\":\"10.1109/IALP.2013.12\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As Chinese expression diversity, there are some shortcomings in traditional algorithms of Chinese entity relationship extraction. For example, workload of labeling by hand on training corpus is too large, the generated relationship schemas usually have poor versatility, and it is difficult to select or integrate high quality domain ontology for extraction task. Moreover, these algorithms don't consider the fact that the entity relationship usually has different meanings with the different topic backgrounds or with the various concept granularities. The paper, utilizing statistical method and linguistics knowledge, carries out the work of crawling, parsing, filling, builds the relational formal concept lattice with Chinese entities context, and acquires entity relationship schemas described by relational formal concept. With these relational schemas and concept built above, we carry out the entry concept correlation computing and the predicate text flexible matching, and get the concept connectivity distance between entities to achieve the non-single and indirect entity relation extraction. The granularities of concept in relation extraction are more flexible, and the relational schema described by formal concept is more versatile and robust. The method in this paper provides a better semantic description for the extracted relationship, and obtains a good relation extraction performance.\",\"PeriodicalId\":413833,\"journal\":{\"name\":\"2013 International Conference on Asian Language Processing\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-08-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 International Conference on Asian Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IALP.2013.12\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Asian Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IALP.2013.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

由于中文表达的多样性,传统的中文实体关系提取算法存在一定的不足。例如,手工标注训练语料库的工作量太大,生成的关系模式通常通用性差,难以选择或集成高质量的领域本体进行抽取任务。而且,这些算法没有考虑到实体关系在不同的主题背景或不同的概念粒度下通常具有不同的含义。本文利用统计学方法和语言学知识,进行了抓取、解析、填充等工作,构建了具有中文实体语境的关系形式概念格,获得了用关系形式概念描述的实体关系图式。利用上述构建的关系模式和概念,进行条目概念关联计算和谓词文本灵活匹配,获得实体之间的概念连通性距离,实现非单一、间接的实体关系提取。关系提取中的概念粒度更加灵活,形式概念描述的关系模式更加通用和健壮。该方法为提取的关系提供了更好的语义描述,获得了良好的关系提取性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Relationship Extraction Tactics of Chinese Entity Based on Formal Concept Connectivity Distance
As Chinese expression diversity, there are some shortcomings in traditional algorithms of Chinese entity relationship extraction. For example, workload of labeling by hand on training corpus is too large, the generated relationship schemas usually have poor versatility, and it is difficult to select or integrate high quality domain ontology for extraction task. Moreover, these algorithms don't consider the fact that the entity relationship usually has different meanings with the different topic backgrounds or with the various concept granularities. The paper, utilizing statistical method and linguistics knowledge, carries out the work of crawling, parsing, filling, builds the relational formal concept lattice with Chinese entities context, and acquires entity relationship schemas described by relational formal concept. With these relational schemas and concept built above, we carry out the entry concept correlation computing and the predicate text flexible matching, and get the concept connectivity distance between entities to achieve the non-single and indirect entity relation extraction. The granularities of concept in relation extraction are more flexible, and the relational schema described by formal concept is more versatile and robust. The method in this paper provides a better semantic description for the extracted relationship, and obtains a good relation extraction performance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Judgment, Extraction and Selective Restriction of Chinese Eventive Verb Categorization and Identification of Fragments with Shi Plus Punctuation Feature Abstraction for Lightweight and Accurate Chinese Word Segmentation The Comparative Research on the Segmentation Strategies of Tibetan Bounded-Variant Forms An Empirical Evaluation of Dimensionality Reduction Using Latent Semantic Analysis on Hindi Text
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1