Learning Verb Argument Structure from Minimally Annotated Corpora

Anoop Sarkar, Woottiporn Tripasai
{"title":"Learning Verb Argument Structure from Minimally Annotated Corpora","authors":"Anoop Sarkar, Woottiporn Tripasai","doi":"10.3115/1072228.1072268","DOIUrl":null,"url":null,"abstract":"In this paper we investigate the task of automatically identifying the correct argument structure for a set of verbs. The argument structure of a verb allows us to predict the relationship between the syntactic arguments of a verb and their role in the underlying lexical semantics of the verb. Following the method described in (Merlo and Stevenson, 2001), we exploit the distributions of some selected features from the local context of a verb. These features were extracted from a 23M word WSJ corpus based on part-of-speech tags and phrasal chunks alone. We constructed several decision tree classifiers trained on this data. The best performing classifier achieved an error rate of 33.4%. This work shows that a subcategorization frame (SF) learning algorithm previously applied to Czech (Sarkar and Zeman, 2000) is used to extract SFs in English. The extracted SFs are evaluated by classifying verbs into verb alternation classes.","PeriodicalId":437823,"journal":{"name":"Proceedings of the 19th international conference on Computational linguistics -","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 19th international conference on Computational linguistics -","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3115/1072228.1072268","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

Abstract

In this paper we investigate the task of automatically identifying the correct argument structure for a set of verbs. The argument structure of a verb allows us to predict the relationship between the syntactic arguments of a verb and their role in the underlying lexical semantics of the verb. Following the method described in (Merlo and Stevenson, 2001), we exploit the distributions of some selected features from the local context of a verb. These features were extracted from a 23M word WSJ corpus based on part-of-speech tags and phrasal chunks alone. We constructed several decision tree classifiers trained on this data. The best performing classifier achieved an error rate of 33.4%. This work shows that a subcategorization frame (SF) learning algorithm previously applied to Czech (Sarkar and Zeman, 2000) is used to extract SFs in English. The extracted SFs are evaluated by classifying verbs into verb alternation classes.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从最小标注语料库中学习动词论点结构
在本文中,我们研究了自动识别一组动词的正确论点结构的任务。动词的实参结构使我们能够预测动词的句法实参与它们在动词的基础词汇语义中的作用之间的关系。按照(Merlo和Stevenson, 2001)中描述的方法,我们从动词的局部上下文中利用一些选定特征的分布。这些特征是从23M个基于词性标签和短语块的WSJ语料库中提取出来的。我们根据这些数据构建了几个决策树分类器。表现最好的分类器错误率为33.4%。这项工作表明,以前应用于捷克语的子分类框架(SF)学习算法(Sarkar和Zeman, 2000)被用于提取英语中的子分类框架。通过将动词分类为动词交替类来评估提取的SFs。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Morphological Analysis of the Spontaneous Speech Corpus An Agent-based Approach to Chinese Named Entity Recognition Natural Language and Inference in a Computer Game Meta-evaluation of Summaries in a Cross-lingual Environment using Content-based Metrics Learning Verb Argument Structure from Minimally Annotated Corpora
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1