Dialogue and Discourse: Latest Articles

The Conversational Discourse Unit: Identification and Its Role in Conversational Turn-taking Management

Q1 Arts and Humanities

Dialogue and Discourse

Pub Date : 2023-11-04 DOI: 10.5210/dad.2023.203

Junfei Hu, Liesbeth Degand

This study investigates how discourse segmentation and turn-taking interact. Mapping syntactic, prosodic and pragmatic units, five types of conversational discourse units (CDU) were identified. Based on this segmentation, associations were examined between the syntactic, prosodic and pragmatic boundaries and turn-taking, as well as the transition speed after each type of CDU. Results show: 1) The relationships between the three linguistic boundaries and the occurrence of turn-taking were significant, and the association was the strongest for the pragmatic boundaries; it was weaker for prosodic boundaries and the weakest for the syntactic boundaries. 2) The type of CDU influenced the transition speed, with the pragmatic-syntax-bound CDU being fastest. The study highlights the importance of meaning-connection and earlier emergence of the utterance gist in timing turn-taking.
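The abstract does not say which statistic was used to measure the association between a boundary type and turn-taking. As a rough illustration only, one common choice for a 2x2 table (boundary present or absent, turn change or no turn change) is the phi coefficient. The function name and the cell layout below are assumptions made for this sketch, not the authors' method.

```python
import math


def phi_coefficient(a, b, c, d):
    """Phi coefficient for a 2x2 contingency table.

    Hypothetical cell layout (an assumption for this sketch):
      a = boundary present, turn change
      b = boundary present, no turn change
      c = boundary absent,  turn change
      d = boundary absent,  no turn change
    """
    # Product of the four marginal totals (two row sums, two column sums).
    denom = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denom == 0:
        raise ValueError("a row or column of the table is empty")
    return (a * d - b * c) / denom
```

Computing phi separately for syntactic, prosodic and pragmatic boundaries would give three values that can be compared in strength, which is the kind of ranking the abstract reports.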

Citations: 0

Exploring the Sensitivity to Alternative Signals of Coherence Relations

Q1 Arts and Humanities

Dialogue and Discourse

Pub Date : 2023-10-04 DOI: 10.5210/dad.2023.202

Ekaterina Tskhovrebova, Sandrine Zufferey, Pascal Gygax

Coherence relations between elements of discourse can be signaled by linguistic devices such as connectives and/or alternative signals. While the use and comprehension of connectives have been studied in different categories of speakers, less is known about the functioning of alternative signals of coherence relations, especially in younger populations. In the current study, we aim to examine the sensitivity of French-speaking teenagers to the alternative signals of list relation (words such as plusieurs ‘several’ and différents ‘various’), combined with connectives varying in frequency and signaling two types of coherence relations (addition: en plus, en outre; consequence: donc, ainsi). Our results reveal that, as early as in teenage years, speakers are sensitive (i.e., they produce list continuation sentences) to alternative signals of list relation. Furthermore, the inference of list relation is not significantly changed when an alternative signal is combined with the more frequent additive connective en plus. However, this inference is inhibited by the less frequent additive connective en outre, and is almost completely hindered by the consequence connectives donc and ainsi. Overall, these results show that alternative list signals are an important source for the inference of the list relation, even in the presence of more salient signals of coherence such as connectives.

Citations: 0

Scoring Coreference Chains with Split-Antecedent Anaphors

Q1 Arts and Humanities

Dialogue and Discourse

Pub Date : 2023-09-28 DOI: 10.5210/dad.2023.201

Silviu Paun, Juntao Yu, Nafise Sadat Moosavi, Massimo Poesio

Anaphoric reference is an aspect of language interpretation that covers many types of interpretation beyond simple identity reference to entities introduced via nominal expressions, the case covered by the traditional coreference task in its most recent incarnation in ONTONOTES and similar datasets. One case that goes beyond simple coreference is anaphoric reference to entities that must be added to the discourse model via accommodation, in particular split-antecedent reference to entities constructed out of other entities, as in split-antecedent plurals and some cases of discourse deixis. Although this type of anaphoric reference is now annotated in many datasets, systems interpreting such references cannot be evaluated using the Reference coreference scorer (Pradhan et al., 2014). As part of the work towards a new scorer for anaphoric reference, able to evaluate all aspects of anaphoric interpretation within the coverage of the Universal Anaphora initiative, we propose in this paper a solution to the technical problem of generalizing existing metrics for identity anaphora so that they can also be used to score cases of split antecedents. This is the first such proposal in the literature on anaphora or coreference, and it has been successfully used to score both split-antecedent plural references and discourse deixis in the recent CODI/CRAC shared tasks on anaphora resolution in dialogue.

Citations: 0

Form and Function of Connectives in Chinese Conversational Speech

Q1 Arts and Humanities

Dialogue and Discourse

Pub Date : 2023-06-02 DOI: 10.5210/dad.2023.104

Nien-Heng Wu, S. Tseng

Connectives convey discourse functions that provide textual and pragmatic information in speech communication, on top of their canonical, sentential use. This paper proposes an applicable scheme, with illustrative examples, for distinguishing Sentential, Conclusion, Disfluency, Elaboration, and Resumption uses of Mandarin connectives, including conjunctions and adverbs. Quantitative results of our annotation work are presented to give an overview of connectives in a Mandarin conversational speech corpus. A fine-grained taxonomy is also discussed, but more empirical data are required to confirm its applicability. Using a multinomial logistic regression model, we show that connectives exhibit consistent patterns in positional, phonetic, and contextual features oriented to the associated discourse functions. Our results confirm that the position of Conclusion and Resumption connectives orients more to positions in semantically, rather than prosodically, determined units. We also found that connectives used for all four discourse functions tend to have a higher initial F0 value than those in sentential use. Resumption and Disfluency uses are expected to have the largest increase in initial F0 value, followed by Conclusion and Elaboration uses. Durational cues of the preceding context make it possible to distinguish Sentential use from the Conclusion, Elaboration, and Resumption discourse uses of connectives.

Citations: 0

Bullshit, Pragmatic Deception, and Natural Language Processing

Q1 Arts and Humanities

Dialogue and Discourse

Pub Date : 2023-05-24 DOI: 10.5210/dad.2023.103

Oliver Deck

Fact checking and fake news detection have garnered increasing interest within the natural language processing (NLP) community in recent years, yet other aspects of misinformation remain unexplored. One such phenomenon is 'bullshit', which different disciplines have tried to define since it first entered academic discussion nearly four decades ago. Fact checking bullshitters is useless, because factual reality typically plays no part in their assertions: where liars deceive about content, bullshitters deceive about their goals. Bullshitting is misleading about language itself, which necessitates identifying the points at which pragmatic conventions are broken with deceptive intent. This paper aims to introduce bullshitology into the field of NLP by tying it to questions in a QUD-based definition, providing two approaches to bullshit annotation, and finally outlining which combinations of NLP methods will be helpful for classifying which kinds of linguistic bullshit.

Citations: 0

Attribution and the discourse structure of reports

Q1 Arts and Humanities

Dialogue and Discourse

Pub Date : 2023-04-13 DOI: 10.5210/dad.2023.102

E. Maier

I propose a discourse-level analysis of report constructions. Indirect discourse, mixed and direct quotation, free indirect discourse, and attitude ascriptions are all analyzed in terms of a discourse relation of ATTRIBUTION, connecting two propositional discourse units corresponding to (i) a frame segment (he said, she dreamed) and (ii) a (possibly complex, multi-sentence) report (“I’m an idiot”, (that) she was president). I provide a unified semantics for the discourse relation of ATTRIBUTION that invokes a flexible notion of ‘characterization’. A discourse unit may characterize a speech event by reproducing its linguistic surface form (as in quotation), its propositional content (as in indirect speech and attitude reports), or some mixture of both (as in mixed quotation or free indirect discourse). I formalize this unified discourse-level ATTRIBUTION approach to reporting within the general framework of SDRT, and apply it to direct, indirect, and free indirect reports that extend beyond the single embedded or quoted clause. The resulting account is the first to do justice to the complex internal dependencies within stretches of reported discourse.

Citations: 0

Characterizing the Response Space of Questions: data and theory

Q1 Arts and Humanities

Dialogue and Discourse

Pub Date : 2022-12-20 DOI: 10.5210/dad.2022.203

J. Ginzburg, Zulipiye Yusupujiang, Chuyuan Li, Kexin Ren, A. Kucharska, P. Lupkowski

The main aim of this paper is to provide a characterization of the response space for questions using a taxonomy grounded in a dialogical formal semantics. As a starting point we take the typology for responses in the form of questions provided in [lupginz-jlm]. That work develops a wide-coverage taxonomy for question/question sequences observable in corpora including the BNC, CHILDES, and BEE, as well as formal modeling of all the postulated classes. Our aim is to extend this work to cover all responses to questions. We present the extended typology of responses to questions based on corpus studies of BNC, BEE, Maptask and CornellMovie, which include 506, 262, 467, and 678 question/response pairs respectively. We compare the data for English with data from Polish using the Spokes corpus (694 question/response pairs). We discuss annotation reliability and disagreement analysis. We sketch how each class can be formalized using a dialogical semantics appropriate for dialogue management.
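The abstract mentions annotation reliability without naming a measure. A widely used one for two annotators assigning categorical labels is Cohen's kappa, sketched below as an illustration. Whether the authors used kappa is an assumption, and the function name and any label values passed to it are hypothetical.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items.

    labels_a and labels_b are equal-length sequences of category labels,
    for example the response class each annotator assigned to a
    question/response pair (the label set is hypothetical here).
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(labels_a)
    # Observed agreement: share of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: computed from each annotator's label distribution.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        # Both annotators used one identical label throughout.
        return 1.0
    return (observed - expected) / (1 - expected)
```

Kappa corrects raw agreement for the agreement expected by chance, so a large taxonomy with a few dominant classes does not look more reliable than it is.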

Citations: 4

The effect of domain knowledge and implicitation on discourse relation inferences

Q1 Arts and Humanities

Dialogue and Discourse

Pub Date : 2022-09-06 DOI: 10.5210/dad.2022.202

Marian Marchal, Merel C. J. Scholman, Vera Demberg

Readers draw on their domain knowledge to make inferences about information that is left implicit in the text. The present research investigates the role of domain knowledge in discourse relation interpretation, which has not been examined experimentally in previous work. We compare the interpretations of experts from the fields of economics and biomedical sciences for texts from within and outside their domain of expertise. The results show that high-knowledge readers are better at inferring the correct relation interpretation than low-knowledge readers. This effect was stronger in relations that contained a connective in the original text than in relations that were originally implicit. The study provides insight into the impact of background knowledge on discourse relation inferencing and into how readers interpret discourse relations when they lack the required domain knowledge.

Citations: 0

Lexical Acquisition during Dialogues through Implicit Confirmation

Q1 Arts and Humanities

Dialogue and Discourse

Pub Date : 2022-06-21 DOI: 10.5210/dad.2022.104

Kazunori Komatani, Kohei Ono, Ryu Takeda, Eric Nichols, Mikio Nakano

We have been addressing the problem of acquiring attributes of unknown terms through dialogues and previously proposed an approach using the implicit confirmation process. It is crucial for dialogue systems to ask questions that do not diminish the user’s willingness to talk. In this paper, we conducted a user study to investigate user impressions of several question types, including explicit and implicit ones, for acquiring lexical knowledge. We clarified the order among the types and found that repeating the same question type annoys the user and degrades the user's impression even when the content of the questions is correct. We also propose a method for determining whether an estimated attribute included in an implicit question is correct. The method exploits multiple-user responses to implicit questions about the attribute of the same unknown term. Experimental results revealed that the proposed method achieved higher precision in identifying correctly estimated attributes than when only single-user responses were considered.
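The abstract does not describe how responses from multiple users are combined. As a minimal sketch of the general idea only, the function below pools accept/reject signals from several users for one estimated attribute and applies a simple vote. The function name, the boolean encoding of a response, and both thresholds are assumptions for illustration and are not taken from the paper.

```python
def attribute_confirmed(responses, min_responses=3, threshold=0.5):
    """Decide whether an estimated attribute of an unknown term looks correct.

    responses: list of bools, one per user, True when that user's reply to
    the implicit question was taken as accepting the implied attribute.
    Returns True or False once enough responses exist, otherwise None.
    min_responses and threshold are hypothetical tuning parameters.
    """
    if len(responses) < min_responses:
        # Too little evidence: keep asking instead of committing.
        return None
    accept_rate = sum(responses) / len(responses)
    return accept_rate > threshold
```

Waiting for several responses before deciding trades acquisition speed for precision, which matches the abstract's comparison against using single-user responses alone.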

Citations: 0

Scoring Coreference Chains with Split-Antecedent Anaphors

Q1 Arts and Humanities

Dialogue and Discourse

Pub Date : 2022-05-24 DOI: 10.48550/arXiv.2205.12323

Silviu Paun, Juntao Yu, N. Moosavi, Massimo Poesio

Anaphoric reference is an aspect of language interpretation that covers many types of interpretation beyond simple identity reference to entities introduced via nominal expressions, the case covered by the traditional coreference task in its most recent incarnation in ONTONOTES and similar datasets. One case that goes beyond simple coreference is anaphoric reference to entities that must be added to the discourse model via accommodation, in particular split-antecedent reference to entities constructed out of other entities, as in split-antecedent plurals and some cases of discourse deixis. Although this type of anaphoric reference is now annotated in many datasets, systems interpreting such references cannot be evaluated using the Reference coreference scorer (Pradhan et al., 2014). As part of the work towards a new scorer for anaphoric reference, able to evaluate all aspects of anaphoric interpretation within the coverage of the Universal Anaphora initiative, we propose in this paper a solution to the technical problem of generalizing existing metrics for identity anaphora so that they can also be used to score cases of split antecedents. This is the first such proposal in the literature on anaphora or coreference, and it has been successfully used to score both split-antecedent plural references and discourse deixis in the recent CODI/CRAC shared tasks on anaphora resolution in dialogue.

Citations: 7