Role Conversion Considering Its Context and Syntactic Property

Young-Shing Youn, Hye-Jeong Song, Chan-Young Park, Jong-Dae Kim, Yu-Seop Kim
International Journal of Database Theory and Application, pp. 31-42. Published 2017-08-31. DOI: https://doi.org/10.14257/ijdta.2017.10.8.04

Abstract

Semantic role labeling (SRL) determines the relationships between predicates and their arguments in a sentence. Determining semantic roles requires a large corpus annotated with semantic roles. The most widely used semantic corpus today is the Proposition Bank (PropBank), which is semantically annotated over predicate-argument structure. The Korean version of PropBank, however, has not been widely used because the corpus is limited in size and differs from the original English version in usability. To address these problems, we also use another semantically tagged corpus, built by the Sejong Plan, a nationwide Korean corpus construction project. However, constructing corpora with the semantic roles defined in PropBank and Sejong is very time-consuming, and each corpus uses its own role set. A way of converting a role in one set to the corresponding role(s) in the other is therefore needed. In this paper, we propose a method for converting the roles automatically. First, we use the similarity between the noun argument whose new role is sought and the nouns appearing in the example sentences of the candidate roles. Second, we extract the suffix of the argument word and estimate the closeness between the suffix and the candidate roles. Finally, the predicate itself is used for selection: we calculate the closeness between the predicate and the candidate roles. With these three measures, a role is chosen from among the multiple candidate roles. In the experiment, we automatically converted 491 arguments, and about 78% of them agreed with the manually annotated arguments.
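The three signals named in the abstract (noun similarity, suffix closeness, predicate closeness) can be illustrated with a small sketch. The abstract does not say how similarity or closeness is computed or how the three scores are combined, so everything concrete here is an assumption made for illustration: character-bigram Jaccard overlap stands in for noun similarity, relative frequency stands in for closeness, the scores are simply summed, and the role labels (`LOC`, `THM`, `AGT`), the romanized words, and all function names are hypothetical toy data rather than the authors' implementation.

```python
from collections import Counter, defaultdict


def closeness_table(pairs):
    """Relative frequency of each role given a key (a suffix or a predicate).

    Assumption: closeness is approximated as count(key, role) / count(key).
    """
    counts = defaultdict(Counter)
    for key, role in pairs:
        counts[key][role] += 1
    return {
        key: {role: n / sum(c.values()) for role, n in c.items()}
        for key, c in counts.items()
    }


def noun_similarity(a, b):
    """Stand-in similarity: Jaccard overlap of character bigrams."""
    def bigrams(w):
        return {w[i:i + 2] for i in range(len(w) - 1)} or {w}
    x, y = bigrams(a), bigrams(b)
    return len(x & y) / len(x | y)


def convert_role(noun, suffix, predicate, candidates,
                 example_nouns, suffix_table, predicate_table):
    """Score each candidate role with the three signals and pick the best."""
    scores = {}
    for role in candidates:
        # 1) best similarity to any noun in the role's example sentences
        sim = max((noun_similarity(noun, n)
                   for n in example_nouns.get(role, [])), default=0.0)
        # 2) closeness between the argument's suffix and the role
        suf = suffix_table.get(suffix, {}).get(role, 0.0)
        # 3) closeness between the predicate and the role
        pred = predicate_table.get(predicate, {}).get(role, 0.0)
        scores[role] = sim + suf + pred  # assumed unweighted sum
    return max(scores, key=scores.get), scores


# Hypothetical toy data (romanized Korean, invented counts).
example_nouns = {"LOC": ["hakgyo", "jip"], "THM": ["chaek"]}
suffix_table = closeness_table(
    [("e", "LOC"), ("e", "LOC"), ("e", "THM"), ("reul", "THM")])
predicate_table = closeness_table([("gada", "LOC"), ("gada", "AGT")])

best, scores = convert_role("hakgyo", "e", "gada", ["LOC", "THM"],
                            example_nouns, suffix_table, predicate_table)
print(best, scores)
```

In this toy run the locative-like candidate wins because all three signals point the same way. A real system would need corpus-derived statistics and some principled weighting of the three scores.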