大型语言模型和扩展的丘奇-图灵理论

Jiří Wiedermann, Jan van Leeuwen
{"title":"大型语言模型和扩展的丘奇-图灵理论","authors":"Jiří Wiedermann, Jan van Leeuwen","doi":"arxiv-2409.06978","DOIUrl":null,"url":null,"abstract":"The Extended Church-Turing Thesis (ECTT) posits that all effective\ninformation processing, including unbounded and non-uniform interactive\ncomputations, can be described in terms of interactive Turing machines with\nadvice. Does this assertion also apply to the abilities of contemporary large\nlanguage models (LLMs)? From a broader perspective, this question calls for an\ninvestigation of the computational power of LLMs by the classical means of\ncomputability and computational complexity theory, especially the theory of\nautomata. Along these lines, we establish a number of fundamental results.\nFirstly, we argue that any fixed (non-adaptive) LLM is computationally\nequivalent to a, possibly very large, deterministic finite-state transducer.\nThis characterizes the base level of LLMs. We extend this to a key result\nconcerning the simulation of space-bounded Turing machines by LLMs. Secondly,\nwe show that lineages of evolving LLMs are computationally equivalent to\ninteractive Turing machines with advice. The latter finding confirms the\nvalidity of the ECTT for lineages of LLMs. From a computability viewpoint, it\nalso suggests that lineages of LLMs possess super-Turing computational power.\nConsequently, in our computational model knowledge generation is in general a\nnon-algorithmic process realized by lineages of LLMs. Finally, we discuss the\nmerits of our findings in the broader context of several related disciplines\nand philosophies.","PeriodicalId":501124,"journal":{"name":"arXiv - CS - Formal Languages and Automata Theory","volume":"14 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Large Language Models and the Extended Church-Turing Thesis\",\"authors\":\"Jiří Wiedermann, Jan van Leeuwen\",\"doi\":\"arxiv-2409.06978\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Extended Church-Turing Thesis (ECTT) posits that all effective\\ninformation processing, including unbounded and non-uniform interactive\\ncomputations, can be described in terms of interactive Turing machines with\\nadvice. Does this assertion also apply to the abilities of contemporary large\\nlanguage models (LLMs)? From a broader perspective, this question calls for an\\ninvestigation of the computational power of LLMs by the classical means of\\ncomputability and computational complexity theory, especially the theory of\\nautomata. Along these lines, we establish a number of fundamental results.\\nFirstly, we argue that any fixed (non-adaptive) LLM is computationally\\nequivalent to a, possibly very large, deterministic finite-state transducer.\\nThis characterizes the base level of LLMs. We extend this to a key result\\nconcerning the simulation of space-bounded Turing machines by LLMs. Secondly,\\nwe show that lineages of evolving LLMs are computationally equivalent to\\ninteractive Turing machines with advice. The latter finding confirms the\\nvalidity of the ECTT for lineages of LLMs. From a computability viewpoint, it\\nalso suggests that lineages of LLMs possess super-Turing computational power.\\nConsequently, in our computational model knowledge generation is in general a\\nnon-algorithmic process realized by lineages of LLMs. Finally, we discuss the\\nmerits of our findings in the broader context of several related disciplines\\nand philosophies.\",\"PeriodicalId\":501124,\"journal\":{\"name\":\"arXiv - CS - Formal Languages and Automata Theory\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Formal Languages and Automata Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.06978\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Formal Languages and Automata Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06978","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

扩展丘奇-图灵论(ECTT)认为,所有有效的信息处理,包括无边界和非统一的交互式计算,都可以用带建议的交互式图灵机来描述。这一论断是否也适用于当代大型语言模型(LLM)的能力?从更广阔的视角来看,这个问题要求我们用可计算性和计算复杂性理论的经典方法,特别是同构理论,来研究 LLMs 的计算能力。首先,我们认为任何固定的(非自适应的)LLM在计算上等同于一个可能非常大的确定性有限状态变换器。我们将其扩展到一个关键结果,即用 LLM 模拟空间边界图灵机。其次,我们证明了不断演化的 LLM 在计算上等同于带建议的交互式图灵机。后一个发现证实了ECTT对LLMs行列的有效性。因此,在我们的计算模型中,知识生成一般是由LLMs的线粒体实现的非算法过程。最后,我们将在几个相关学科和哲学的更广阔背景下讨论我们研究发现的优点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Large Language Models and the Extended Church-Turing Thesis
The Extended Church-Turing Thesis (ECTT) posits that all effective information processing, including unbounded and non-uniform interactive computations, can be described in terms of interactive Turing machines with advice. Does this assertion also apply to the abilities of contemporary large language models (LLMs)? From a broader perspective, this question calls for an investigation of the computational power of LLMs by the classical means of computability and computational complexity theory, especially the theory of automata. Along these lines, we establish a number of fundamental results. Firstly, we argue that any fixed (non-adaptive) LLM is computationally equivalent to a, possibly very large, deterministic finite-state transducer. This characterizes the base level of LLMs. We extend this to a key result concerning the simulation of space-bounded Turing machines by LLMs. Secondly, we show that lineages of evolving LLMs are computationally equivalent to interactive Turing machines with advice. The latter finding confirms the validity of the ECTT for lineages of LLMs. From a computability viewpoint, it also suggests that lineages of LLMs possess super-Turing computational power. Consequently, in our computational model knowledge generation is in general a non-algorithmic process realized by lineages of LLMs. Finally, we discuss the merits of our findings in the broader context of several related disciplines and philosophies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Query Learning of Advice and Nominal Automata Well-Behaved (Co)algebraic Semantics of Regular Expressions in Dafny Run supports and initial algebra supports of weighted automata Alternating hierarchy of sushifts defined by nondeterministic plane-walking automata $\mathbb{N}$-polyregular functions arise from well-quasi-orderings
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1