Invariant neural architecture for learning term synthesis in instantiation proving

IF 0.6 4区数学 Q4 COMPUTER SCIENCE, THEORY & METHODS Journal of Symbolic Computation Pub Date : 2024-08-28 DOI:10.1016/j.jsc.2024.102375

Jelle Piepenbrock , Josef Urban , Konstantin Korovin , Miroslav Olšák , Tom Heskes , Mikoláš Janota

{"title":"Invariant neural architecture for learning term synthesis in instantiation proving","authors":"Jelle Piepenbrock , Josef Urban , Konstantin Korovin , Miroslav Olšák , Tom Heskes , Mikoláš Janota","doi":"10.1016/j.jsc.2024.102375","DOIUrl":null,"url":null,"abstract":"<div><p>The development of strong CDCL-based propositional (SAT) solvers has greatly advanced several areas of automated reasoning (AR). One of the directions in AR is therefore to make use of SAT solvers in expressive formalisms such as first-order logic, for which large corpora of general mathematical problems exist today. This is possible due to Herbrand's theorem, which allows reduction of first-order problems to propositional problems by instantiation. The core challenge is synthesizing the appropriate instances from the typically infinite Herbrand universe.</p><p>In this work, we develop a machine learning system targeting this task, addressing its combinatorial and invariance properties. In particular, we develop a GNN2RNN architecture based on a graph neural network (GNN) that learns from problems and their solutions independently of many symmetries and symbol names (addressing the abundance of Skolems), combined with a recurrent neural network (RNN) that proposes for each clause its instantiations. The architecture is then combined with an efficient ground solver and, starting with zero knowledge, iteratively trained on a large corpus of mathematical problems. We show that the system is capable of solving many problems by such educated guessing, finding proofs for 32.12% of the training set. The final trained system solves 19.74% of the unseen test data on its own. We also observe that the trained system finds solutions that the iProver and CVC5 systems did not find.</p></div>","PeriodicalId":50031,"journal":{"name":"Journal of Symbolic Computation","volume":null,"pages":null},"PeriodicalIF":0.6000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0747717124000798/pdfft?md5=03f2c9a993930436ebd44dced50d3406&pid=1-s2.0-S0747717124000798-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Symbolic Computation","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0747717124000798","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

The development of strong CDCL-based propositional (SAT) solvers has greatly advanced several areas of automated reasoning (AR). One of the directions in AR is therefore to make use of SAT solvers in expressive formalisms such as first-order logic, for which large corpora of general mathematical problems exist today. This is possible due to Herbrand's theorem, which allows reduction of first-order problems to propositional problems by instantiation. The core challenge is synthesizing the appropriate instances from the typically infinite Herbrand universe.

In this work, we develop a machine learning system targeting this task, addressing its combinatorial and invariance properties. In particular, we develop a GNN2RNN architecture based on a graph neural network (GNN) that learns from problems and their solutions independently of many symmetries and symbol names (addressing the abundance of Skolems), combined with a recurrent neural network (RNN) that proposes for each clause its instantiations. The architecture is then combined with an efficient ground solver and, starting with zero knowledge, iteratively trained on a large corpus of mathematical problems. We show that the system is capable of solving many problems by such educated guessing, finding proofs for 32.12% of the training set. The final trained system solves 19.74% of the unseen test data on its own. We also observe that the trained system finds solutions that the iProver and CVC5 systems did not find.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

在实例化证明中学习术语合成的不变神经架构

基于 CDCL 的强命题（SAT）求解器的开发极大地推动了自动推理（AR）的多个领域。因此，自动推理（AR）的一个发展方向是在一阶逻辑等具有表现力的形式主义中使用 SAT 求解器。Herbrand 定理允许通过实例化将一阶问题还原为命题问题，这使 SAT 成为可能。在这项工作中，我们针对这一任务开发了一个机器学习系统，解决了其组合性和不变性问题。特别是，我们开发了一种基于图神经网络（GNN）的 GNN2RNN 架构，该架构可独立于许多对称性和符号名称（解决 Skolems 的丰富性）从问题及其解决方案中学习，并与为每个条款提出实例的递归神经网络（RNN）相结合。然后，该架构与高效的地面求解器相结合，从零知识开始，在大量数学问题的语料库中进行迭代训练。我们的研究表明，该系统能够通过这种有根据的猜测解决许多问题，为 32.12% 的训练集找到了证明。最终训练有素的系统能独立解决 19.74% 的未见测试数据。我们还发现，经过训练的系统找到了 iProver 和 CVC5 系统没有找到的解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Symbolic Computation 工程技术-计算机：理论方法

CiteScore

2.10

自引率

14.30%

发文量

审稿时长

142 days

期刊介绍： An international journal, the Journal of Symbolic Computation, founded by Bruno Buchberger in 1985, is directed to mathematicians and computer scientists who have a particular interest in symbolic computation. The journal provides a forum for research in the algorithmic treatment of all types of symbolic objects: objects in formal languages (terms, formulas, programs); algebraic objects (elements in basic number domains, polynomials, residue classes, etc.); and geometrical objects. It is the explicit goal of the journal to promote the integration of symbolic computation by establishing one common avenue of communication for researchers working in the different subareas. It is also important that the algorithmic achievements of these areas should be made available to the human problem-solver in integrated software systems for symbolic computation. To help this integration, the journal publishes invited tutorial surveys as well as Applications Letters and System Descriptions.

期刊最新文献

Persistent components in Canny's generalized characteristic polynomial Coupled cluster degree of the Grassmannian Mixed volumes of networks with binomial steady-states Creative telescoping for hypergeometric double sums On nonnegative invariant quartics in type A