Learning Efficient Recursive Numeral Systems via Reinforcement Learning

Jonathan D. Thomas, Andrea Silvi, Devdatt Dubhashi, Emil Carlsson, Moa Johansson
{"title":"Learning Efficient Recursive Numeral Systems via Reinforcement Learning","authors":"Jonathan D. Thomas, Andrea Silvi, Devdatt Dubhashi, Emil Carlsson, Moa Johansson","doi":"arxiv-2409.07170","DOIUrl":null,"url":null,"abstract":"The emergence of mathematical concepts, such as number systems, is an\nunderstudied area in AI for mathematics and reasoning. It has previously been\nshown Carlsson et al. (2021) that by using reinforcement learning (RL), agents\ncan derive simple approximate and exact-restricted numeral systems. However, it\nis a major challenge to show how more complex recursive numeral systems,\nsimilar to the one utilised in English, could arise via a simple learning\nmechanism such as RL. Here, we introduce an approach towards deriving a\nmechanistic explanation of the emergence of recursive number systems where we\nconsider an RL agent which directly optimizes a lexicon under a given\nmeta-grammar. Utilising a slightly modified version of the seminal meta-grammar\nof Hurford (1975), we demonstrate that our RL agent can effectively modify the\nlexicon towards Pareto-optimal configurations which are comparable to those\nobserved within human numeral systems.","PeriodicalId":501030,"journal":{"name":"arXiv - CS - Computation and Language","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computation and Language","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07170","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The emergence of mathematical concepts, such as number systems, is an understudied area in AI for mathematics and reasoning. It has previously been shown Carlsson et al. (2021) that by using reinforcement learning (RL), agents can derive simple approximate and exact-restricted numeral systems. However, it is a major challenge to show how more complex recursive numeral systems, similar to the one utilised in English, could arise via a simple learning mechanism such as RL. Here, we introduce an approach towards deriving a mechanistic explanation of the emergence of recursive number systems where we consider an RL agent which directly optimizes a lexicon under a given meta-grammar. Utilising a slightly modified version of the seminal meta-grammar of Hurford (1975), we demonstrate that our RL agent can effectively modify the lexicon towards Pareto-optimal configurations which are comparable to those observed within human numeral systems.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过强化学习学习高效递归数字系统
数学概念(如数字系统)的出现是人工智能数学与推理中一个研究不足的领域。Carlsson 等人(2021 年)曾指出,通过强化学习(RL),代理可以推导出简单的近似和精确受限的数字系统。然而,如何通过 RL 这种简单的学习机制来展示类似英语中使用的更复杂的递归数字系统是一个重大挑战。在这里,我们引入了一种方法,旨在从机制上解释递归数字系统的出现,即我们考虑在给定元语法下直接优化词典的 RL 代理。利用赫尔福德(Hurford,1975 年)开创性元语法的略微修改版本,我们证明了我们的 RL 代理可以有效地修改词库,使其达到帕累托最优配置,这与人类数字系统中观察到的配置相当。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
LLMs + Persona-Plug = Personalized LLMs MEOW: MEMOry Supervised LLM Unlearning Via Inverted Facts Extract-and-Abstract: Unifying Extractive and Abstractive Summarization within Single Encoder-Decoder Framework Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources Human-like Affective Cognition in Foundation Models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1