Terminating Differentiable Tree Experts

arXiv - CS - Symbolic Computation Pub Date : 2024-07-02 DOI:arxiv-2407.02060

Jonathan Thomm, Michael Hersche, Giacomo Camposampiero, Aleksandar Terzić, Bernhard Schölkopf, Abbas Rahimi

引用次数: 0

Abstract

We advance the recently proposed neuro-symbolic Differentiable Tree Machine, which learns tree operations using a combination of transformers and Tensor Product Representations. We investigate the architecture and propose two key components. We first remove a series of different transformer layers that are used in every step by introducing a mixture of experts. This results in a Differentiable Tree Experts model with a constant number of parameters for any arbitrary number of steps in the computation, compared to the previous method in the Differentiable Tree Machine with a linear growth. Given this flexibility in the number of steps, we additionally propose a new termination algorithm to provide the model the power to choose how many steps to make automatically. The resulting Terminating Differentiable Tree Experts model sluggishly learns to predict the number of steps without an oracle. It can do so while maintaining the learning capabilities of the model, converging to the optimal amount of steps.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

终止可微分树专家

我们推进了最近提出的神经符号可微分树机，它利用变换器和张量乘积表示的组合来学习树操作。我们对该架构进行了研究，并提出了两个关键组件。首先，我们通过引入专家混合物，移除了每一步中使用的一系列不同变换器层。这就产生了可微分树专家模型，该模型在计算的任意步数下参数数量恒定，而之前的可微分树机器中的方法则是线性增长。考虑到计算步数的灵活性，我们还提出了一种新的终止算法，让模型能够自动选择计算步数。由此产生的终结可微分树专家模型可以在不使用神谕的情况下，缓慢地学习预测步数。它可以在保持模型学习能力的同时，收敛到最佳步数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

arXiv - CS - Symbolic Computation

自引率

0.00%

发文量

期刊最新文献

Synthesizing Evolving Symbolic Representations for Autonomous Systems Introducing Quantification into a Hierarchical Graph Rewriting Language Towards Verified Polynomial Factorisation Symbolic Regression with a Learned Concept Library Active Symbolic Discovery of Ordinary Differential Equations via Phase Portrait Sketching