Scoring Coreference Chains with Split-Antecedent Anaphors

Q1 Arts and Humanities Dialogue and Discourse Pub Date : 2022-05-24 DOI:10.48550/arXiv.2205.12323

Silviu Paun, Juntao Yu, N. Moosavi, Massimo Poesio

{"title":"Scoring Coreference Chains with Split-Antecedent Anaphors","authors":"Silviu Paun, Juntao Yu, N. Moosavi, Massimo Poesio","doi":"10.48550/arXiv.2205.12323","DOIUrl":null,"url":null,"abstract":"Anaphoric reference is an aspect of language interpretation covering a variety of types of interpretation beyond the simple case of identity reference to entities introduced via nominal expressions covered by the traditional coreference task in its most recent incarnation in ONTONOTES and similar datasets. One of these cases that go beyond simple coreference is anaphoric reference to entities that must be added to the discourse model via accommodation, and in particular split-antecedent references to entities constructed out of other entities, as in split-antecedent plurals and in some cases of discourse deixis. Although this type of anaphoric reference is now annotated in many datasets, systems interpreting such references cannot be evaluated using the Reference coreference scorer Pradhan et al. (2014). As part of the work towards a new scorer for anaphoric reference able to evaluate all aspects of anaphoric interpretation in the coverage of the Universal Anaphora initiative, we propose in this paper a solution to the technical problem of generalizing existing metrics for identity anaphora so that they can also be used to score cases of split-antecedents. This is the first such proposal in the literature on anaphora or coreference, and has been successfully used to score both split-antecedent plural references and discourse deixis in the recent CODI/CRAC anaphora resolution in dialogue shared tasks.","PeriodicalId":37604,"journal":{"name":"Dialogue and Discourse","volume":"314 1","pages":"1-48"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Dialogue and Discourse","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2205.12323","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Arts and Humanities","Score":null,"Total":0}

引用次数: 7

Abstract

Anaphoric reference is an aspect of language interpretation covering a variety of types of interpretation beyond the simple case of identity reference to entities introduced via nominal expressions covered by the traditional coreference task in its most recent incarnation in ONTONOTES and similar datasets. One of these cases that go beyond simple coreference is anaphoric reference to entities that must be added to the discourse model via accommodation, and in particular split-antecedent references to entities constructed out of other entities, as in split-antecedent plurals and in some cases of discourse deixis. Although this type of anaphoric reference is now annotated in many datasets, systems interpreting such references cannot be evaluated using the Reference coreference scorer Pradhan et al. (2014). As part of the work towards a new scorer for anaphoric reference able to evaluate all aspects of anaphoric interpretation in the coverage of the Universal Anaphora initiative, we propose in this paper a solution to the technical problem of generalizing existing metrics for identity anaphora so that they can also be used to score cases of split-antecedents. This is the first such proposal in the literature on anaphora or coreference, and has been successfully used to score both split-antecedent plural references and discourse deixis in the recent CODI/CRAC anaphora resolution in dialogue shared tasks.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

分词前指的共参照链评分

回指指是语言解释的一个方面，涵盖了多种类型的解释，而不仅仅是通过名义表达引入实体的简单情况，在ONTONOTES和类似数据集中，传统的共指任务涵盖了这种情况。其中一种超越简单的共指的情况是对实体的回指，这些实体必须通过调节添加到话语模型中，特别是对由其他实体构建的实体的分离先行引用，如在分离先行复数和话语指示的某些情况下。尽管这种类型的回指参考现在在许多数据集中都有注释，但无法使用参考共参考评分器Pradhan等人(2014)来评估解释此类参考的系统。作为一个新的回指参照评分者的工作的一部分，该评分者能够评估通用回指倡议覆盖范围内的回指解释的各个方面，我们在本文中提出了一个解决现有身份回指度量的技术问题的解决方案，以便它们也可以用于对分裂先行词的情况进行评分。这是关于回指或共指的文献中第一次提出这样的建议，并在最近的CODI/CRAC对话共享任务的回指解析中成功地用于分词复数指和语篇指示的得分。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Dialogue and Discourse Arts and Humanities-Language and Linguistics

CiteScore

1.90

自引率

0.00%

发文量

审稿时长

12 weeks

期刊介绍： D&D seeks previously unpublished, high quality articles on the analysis of discourse and dialogue that contain -experimental and/or theoretical studies related to the construction, representation, and maintenance of (linguistic) context -linguistic analysis of phenomena characteristic of discourse and/or dialogue (including, but not limited to: reference and anaphora, presupposition and accommodation, topicality and salience, implicature, ---discourse structure and rhetorical relations, discourse markers and particles, the semantics and -pragmatics of dialogue acts, questions, imperatives, non-sentential utterances, intonation, and meta--communicative phenomena such as repair and grounding) -experimental and/or theoretical studies of agents'' information states and their dynamics in conversational interaction -new analytical frameworks that advance theoretical studies of discourse and dialogue -research on systems performing coreference resolution, discourse structure parsing, event and temporal -structure, and reference resolution in multimodal communication -experimental and/or theoretical results yielding new insight into non-linguistic interaction in -communication -work on natural language understanding (including spoken language understanding), dialogue management, -reasoning, and natural language generation (including text-to-speech) in dialogue systems -work related to the design and engineering of dialogue systems (including, but not limited to: -evaluation, usability design and testing, rapid application deployment, embodied agents, affect detection, -mixed-initiative, adaptation, and user modeling). -extremely well-written surveys of existing work. Highest priority is given to research reports that are specifically written for a multidisciplinary audience. The audience is primarily researchers on discourse and dialogue and its associated fields, including computer scientists, linguists, psychologists, philosophers, roboticists, sociologists.

期刊最新文献

The Conversational Discourse Unit: Identification and Its Role in Conversational Turn-taking Management Exploring the Sensitivity to Alternative Signals of Coherence Relations Scoring Coreference Chains with Split-Antecedent Anaphors Form and Function of Connectives in Chinese Conversational Speech Bullshit, Pragmatic Deception, and Natural Language Processing