大型语言模型和扩展的丘奇-图灵理论

arXiv - CS - Formal Languages and Automata Theory Pub Date : 2024-09-11 DOI:arxiv-2409.06978

Jiří Wiedermann, Jan van Leeuwen

{"title":"大型语言模型和扩展的丘奇-图灵理论","authors":"Jiří Wiedermann, Jan van Leeuwen","doi":"arxiv-2409.06978","DOIUrl":null,"url":null,"abstract":"The Extended Church-Turing Thesis (ECTT) posits that all effective\ninformation processing, including unbounded and non-uniform interactive\ncomputations, can be described in terms of interactive Turing machines with\nadvice. Does this assertion also apply to the abilities of contemporary large\nlanguage models (LLMs)? From a broader perspective, this question calls for an\ninvestigation of the computational power of LLMs by the classical means of\ncomputability and computational complexity theory, especially the theory of\nautomata. Along these lines, we establish a number of fundamental results.\nFirstly, we argue that any fixed (non-adaptive) LLM is computationally\nequivalent to a, possibly very large, deterministic finite-state transducer.\nThis characterizes the base level of LLMs. We extend this to a key result\nconcerning the simulation of space-bounded Turing machines by LLMs. Secondly,\nwe show that lineages of evolving LLMs are computationally equivalent to\ninteractive Turing machines with advice. The latter finding confirms the\nvalidity of the ECTT for lineages of LLMs. From a computability viewpoint, it\nalso suggests that lineages of LLMs possess super-Turing computational power.\nConsequently, in our computational model knowledge generation is in general a\nnon-algorithmic process realized by lineages of LLMs. Finally, we discuss the\nmerits of our findings in the broader context of several related disciplines\nand philosophies.","PeriodicalId":501124,"journal":{"name":"arXiv - CS - Formal Languages and Automata Theory","volume":"14 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Large Language Models and the Extended Church-Turing Thesis\",\"authors\":\"Jiří Wiedermann, Jan van Leeuwen\",\"doi\":\"arxiv-2409.06978\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Extended Church-Turing Thesis (ECTT) posits that all effective\\ninformation processing, including unbounded and non-uniform interactive\\ncomputations, can be described in terms of interactive Turing machines with\\nadvice. Does this assertion also apply to the abilities of contemporary large\\nlanguage models (LLMs)? From a broader perspective, this question calls for an\\ninvestigation of the computational power of LLMs by the classical means of\\ncomputability and computational complexity theory, especially the theory of\\nautomata. Along these lines, we establish a number of fundamental results.\\nFirstly, we argue that any fixed (non-adaptive) LLM is computationally\\nequivalent to a, possibly very large, deterministic finite-state transducer.\\nThis characterizes the base level of LLMs. We extend this to a key result\\nconcerning the simulation of space-bounded Turing machines by LLMs. Secondly,\\nwe show that lineages of evolving LLMs are computationally equivalent to\\ninteractive Turing machines with advice. The latter finding confirms the\\nvalidity of the ECTT for lineages of LLMs. From a computability viewpoint, it\\nalso suggests that lineages of LLMs possess super-Turing computational power.\\nConsequently, in our computational model knowledge generation is in general a\\nnon-algorithmic process realized by lineages of LLMs. Finally, we discuss the\\nmerits of our findings in the broader context of several related disciplines\\nand philosophies.\",\"PeriodicalId\":501124,\"journal\":{\"name\":\"arXiv - CS - Formal Languages and Automata Theory\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Formal Languages and Automata Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.06978\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Formal Languages and Automata Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06978","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

扩展丘奇-图灵论（ECTT）认为，所有有效的信息处理，包括无边界和非统一的交互式计算，都可以用带建议的交互式图灵机来描述。这一论断是否也适用于当代大型语言模型（LLM）的能力？从更广阔的视角来看，这个问题要求我们用可计算性和计算复杂性理论的经典方法，特别是同构理论，来研究 LLMs 的计算能力。首先，我们认为任何固定的（非自适应的）LLM在计算上等同于一个可能非常大的确定性有限状态变换器。我们将其扩展到一个关键结果，即用 LLM 模拟空间边界图灵机。其次，我们证明了不断演化的 LLM 在计算上等同于带建议的交互式图灵机。后一个发现证实了ECTT对LLMs行列的有效性。因此，在我们的计算模型中，知识生成一般是由LLMs的线粒体实现的非算法过程。最后，我们将在几个相关学科和哲学的更广阔背景下讨论我们研究发现的优点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Large Language Models and the Extended Church-Turing Thesis

The Extended Church-Turing Thesis (ECTT) posits that all effective information processing, including unbounded and non-uniform interactive computations, can be described in terms of interactive Turing machines with advice. Does this assertion also apply to the abilities of contemporary large language models (LLMs)? From a broader perspective, this question calls for an investigation of the computational power of LLMs by the classical means of computability and computational complexity theory, especially the theory of automata. Along these lines, we establish a number of fundamental results. Firstly, we argue that any fixed (non-adaptive) LLM is computationally equivalent to a, possibly very large, deterministic finite-state transducer. This characterizes the base level of LLMs. We extend this to a key result concerning the simulation of space-bounded Turing machines by LLMs. Secondly, we show that lineages of evolving LLMs are computationally equivalent to interactive Turing machines with advice. The latter finding confirms the validity of the ECTT for lineages of LLMs. From a computability viewpoint, it also suggests that lineages of LLMs possess super-Turing computational power. Consequently, in our computational model knowledge generation is in general a non-algorithmic process realized by lineages of LLMs. Finally, we discuss the merits of our findings in the broader context of several related disciplines and philosophies.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - CS - Formal Languages and Automata Theory

自引率

0.00%

发文量

期刊最新文献

Query Learning of Advice and Nominal Automata Well-Behaved (Co)algebraic Semantics of Regular Expressions in Dafny Run supports and initial algebra supports of weighted automata Alternating hierarchy of sushifts defined by nondeterministic plane-walking automata $\mathbb{N}$-polyregular functions arise from well-quasi-orderings