{"title":"Symbolic Semantic Memory in Transformer Language Models","authors":"Robert Morain, Kenneth Vargas, Dan Ventura","doi":"10.1109/ICMLA55696.2022.00166","DOIUrl":null,"url":null,"abstract":"This paper demonstrates how transformer language models can be improved by giving them access to relevant structured data extracted from a knowledge base. The methods for doing so include identifying entities in a text corpus, sorting the entities using a novel attention-based approach, linking entities to a knowledge base, then extracting and filtering the knowledge to create a knowledge-augmented dataset. We evaluate these methods with the WikiText-103 corpus using standard language modeling objectives. These results show that even simple additional knowledge augmentation leads to a reduction in validation perplexity by 81.04%. These methods also significantly outperform common ways of improving language models such as increasing the model size or adding more data.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA55696.2022.00166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
This paper demonstrates how transformer language models can be improved by giving them access to relevant structured data extracted from a knowledge base. The method identifies entities in a text corpus, sorts them using a novel attention-based approach, links them to a knowledge base, and then extracts and filters the linked knowledge to create a knowledge-augmented dataset. We evaluate these methods on the WikiText-103 corpus using standard language modeling objectives. The results show that even simple knowledge augmentation yields an 81.04% reduction in validation perplexity. These methods also significantly outperform common ways of improving language models, such as increasing model size or adding more training data.
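To make the described pipeline concrete, the following is a minimal Python sketch of the kind of knowledge-augmentation flow the abstract outlines: identify entities, rank them by attention, link them to a knowledge base, and prepend the retrieved facts to the training text. Every name here (extract_entities, rank_entities, lookup_knowledge, augment) is a hypothetical placeholder standing in for the paper's components, not the authors' actual code or API.

```python
# Minimal sketch of a knowledge-augmentation pipeline of the kind the
# abstract describes. All helpers are hypothetical placeholders, not
# the authors' implementation.

from typing import Dict, List


def extract_entities(text: str) -> List[str]:
    """Placeholder: identify entity mentions in the text.
    (In practice a NER model would be used here.)"""
    return [tok for tok in text.split() if tok.istitle()]


def rank_entities(entities: List[str],
                  attention: Dict[str, float]) -> List[str]:
    """Placeholder for the paper's attention-based sorting: order
    entities by an attention score assigned by the language model."""
    return sorted(entities, key=lambda e: attention.get(e, 0.0), reverse=True)


def lookup_knowledge(entity: str, kb: Dict[str, List[str]]) -> List[str]:
    """Placeholder: link an entity to a knowledge base and return its
    associated facts (e.g., triples rendered as text)."""
    return kb.get(entity, [])


def augment(text: str, kb: Dict[str, List[str]],
            attention: Dict[str, float], max_facts: int = 3) -> str:
    """Prepend the top-ranked entities' facts to the text, producing
    one example of a knowledge-augmented dataset."""
    entities = rank_entities(extract_entities(text), attention)
    facts: List[str] = []
    for entity in entities:
        for fact in lookup_knowledge(entity, kb):
            if len(facts) < max_facts:
                facts.append(fact)
    return " ".join(facts + [text])


# Toy usage with a hand-built knowledge base and attention scores.
kb = {"Paris": ["Paris is the capital of France."]}
attention = {"Paris": 0.9}
print(augment("Paris hosted the summit.", kb, attention))
```

The design point the sketch illustrates is that augmentation happens at the dataset level: filtered knowledge is serialized into the training text itself, so the language model can be trained with standard objectives and no architectural changes.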