What GPT Knows About Who is Who

First Workshop on Insights from Negative Results in NLP Pub Date : 2022-05-16 DOI:10.48550/arXiv.2205.07407

Xiaohan Yang, Eduardo Peynetti, Vasco Meerman, Christy Tanner

引用次数: 6

Abstract

Coreference resolution – which is a crucial task for understanding discourse and language at large – has yet to witness widespread benefits from large language models (LLMs). Moreover, coreference resolution systems largely rely on supervised labels, which are highly expensive and difficult to annotate, thus making it ripe for prompt engineering. In this paper, we introduce a QA-based prompt-engineering method and discern generative, pre-trained LLMs’ abilities and limitations toward the task of coreference resolution. Our experiments show that GPT-2 and GPT-Neo can return valid answers, but that their capabilities to identify coreferent mentions are limited and prompt-sensitive, leading to inconsistent results.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

GPT知道谁是谁

共同参考解析是理解话语和语言的关键任务，但它尚未从大型语言模型(llm)中得到广泛的应用。此外，共参解析系统很大程度上依赖于监督标签，这是非常昂贵和难以注释的，从而使其成熟的快速工程。在本文中，我们引入了一种基于问答的提示工程方法，并识别出生成的、预先训练的法学硕士在共同参考解析任务中的能力和局限性。我们的实验表明，GPT-2和GPT-Neo可以返回有效的答案，但它们识别共同提及的能力有限且对时间敏感，导致结果不一致。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

First Workshop on Insights from Negative Results in NLP

自引率

0.00%

发文量

期刊最新文献

What GPT Knows About Who is Who Pathologies of Pre-trained Language Models in Few-shot Fine-tuning Can Question Rewriting Help Conversational Question Answering? Extending the Scope of Out-of-Domain: Examining QA models in multiple subdomains Do Data-based Curricula Work?