Revisiting the Practical Effectiveness of Constituency Parse Extraction from Pre-trained Language Models

Proceedings of COLING. International Conference on Computational Linguistics Pub Date : 2022-09-15 DOI:10.48550/arXiv.2211.00479

Taeuk Kim

引用次数: 1

Abstract

Constituency Parse Extraction from Pre-trained Language Models (CPE-PLM) is a recent paradigm that attempts to induce constituency parse trees relying only on the internal knowledge of pre-trained language models. While attractive in the perspective that similar to in-context learning, it does not require task-specific fine-tuning, the practical effectiveness of such an approach still remains unclear, except that it can function as a probe for investigating language models’ inner workings. In this work, we mathematically reformulate CPE-PLM and propose two advanced ensemble methods tailored for it, demonstrating that the new parsing paradigm can be competitive with common unsupervised parsers by introducing a set of heterogeneous PLMs combined using our techniques. Furthermore, we explore some scenarios where the trees generated by CPE-PLM are practically useful. Specifically, we show that CPE-PLM is more effective than typical supervised parsers in few-shot settings.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

从预训练语言模型中重新审视选区解析提取的实际有效性

从预训练语言模型中提取选区解析(CPE-PLM)是一种最新的范式，它试图仅依靠预训练语言模型的内部知识来诱导选区解析树。虽然从类似于上下文学习的角度来看很有吸引力，但它不需要特定于任务的微调，这种方法的实际有效性仍然不清楚，除了它可以作为调查语言模型内部工作的探针。在这项工作中，我们在数学上重新制定了CPE-PLM，并提出了为其量身定制的两种高级集成方法，通过引入一组使用我们技术组合的异构plm，证明了新的解析范式可以与常见的无监督解析器竞争。此外，我们还探讨了由CPE-PLM生成的树在实际应用中的一些场景。具体来说，我们证明了CPE-PLM在少量射击设置中比典型的监督解析器更有效。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of COLING. International Conference on Computational Linguistics

自引率

0.00%

发文量