Transparency Helps Reveal When Language Models Learn Meaning

IF 4.2 1区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Transactions of the Association for Computational Linguistics Pub Date : 2022-10-14 DOI:10.1162/tacl_a_00565

Zhaofeng Wu, Will Merrill, Hao Peng, Iz Beltagy, Noah A. Smith

引用次数: 2

Abstract

Many current NLP systems are built from language models trained to optimize unsupervised objectives on large amounts of raw text. Under what conditions might such a procedure acquire meaning? Our systematic experiments with synthetic data reveal that, with languages where all expressions have context-independent denotations (i.e., languages with strong transparency), both autoregressive and masked language models successfully learn to emulate semantic relations between expressions. However, when denotations are changed to be context-dependent with the language otherwise unmodified, this ability degrades. Turning to natural language, our experiments with a specific phenomenon—referential opacity—add to the growing body of evidence that current language models do not represent natural language semantics well. We show this failure relates to the context-dependent nature of natural language form-meaning mappings.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

透明度有助于揭示语言模型何时学习意义

许多当前的NLP系统都是由经过训练的语言模型构建的，这些模型用于优化大量原始文本上的无监督目标。在什么条件下，这样的程序才能获得意义？我们对合成数据的系统实验表明，在所有表达式都具有上下文无关指称的语言（即具有强透明度的语言）中，自回归和掩蔽语言模型都成功地学习了模拟表达式之间的语义关系。然而，当指称被更改为上下文相关，而语言在其他方面没有修改时，这种能力就会下降。谈到自然语言，我们对一种特定现象——指称不透明度——的实验增加了越来越多的证据，证明当前的语言模型不能很好地代表自然语言语义。我们表明，这种失败与自然语言形式-意义映射的上下文依赖性有关。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Transactions of the Association for Computational Linguistics Multiple-

CiteScore

32.60

自引率

4.60%

发文量

审稿时长

8 weeks

期刊介绍： The highly regarded quarterly journal Computational Linguistics has a companion journal called Transactions of the Association for Computational Linguistics. This open access journal publishes articles in all areas of natural language processing and is an important resource for academic and industry computational linguists, natural language processing experts, artificial intelligence and machine learning investigators, cognitive scientists, speech specialists, as well as linguists and philosophers. The journal disseminates work of vital relevance to these professionals on an annual basis.