{"title":"大型语言模型和扩展的丘奇-图灵理论","authors":"Jiří Wiedermann, Jan van Leeuwen","doi":"arxiv-2409.06978","DOIUrl":null,"url":null,"abstract":"The Extended Church-Turing Thesis (ECTT) posits that all effective\ninformation processing, including unbounded and non-uniform interactive\ncomputations, can be described in terms of interactive Turing machines with\nadvice. Does this assertion also apply to the abilities of contemporary large\nlanguage models (LLMs)? From a broader perspective, this question calls for an\ninvestigation of the computational power of LLMs by the classical means of\ncomputability and computational complexity theory, especially the theory of\nautomata. Along these lines, we establish a number of fundamental results.\nFirstly, we argue that any fixed (non-adaptive) LLM is computationally\nequivalent to a, possibly very large, deterministic finite-state transducer.\nThis characterizes the base level of LLMs. We extend this to a key result\nconcerning the simulation of space-bounded Turing machines by LLMs. Secondly,\nwe show that lineages of evolving LLMs are computationally equivalent to\ninteractive Turing machines with advice. The latter finding confirms the\nvalidity of the ECTT for lineages of LLMs. From a computability viewpoint, it\nalso suggests that lineages of LLMs possess super-Turing computational power.\nConsequently, in our computational model knowledge generation is in general a\nnon-algorithmic process realized by lineages of LLMs. Finally, we discuss the\nmerits of our findings in the broader context of several related disciplines\nand philosophies.","PeriodicalId":501124,"journal":{"name":"arXiv - CS - Formal Languages and Automata Theory","volume":"14 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Large Language Models and the Extended Church-Turing Thesis\",\"authors\":\"Jiří Wiedermann, Jan van Leeuwen\",\"doi\":\"arxiv-2409.06978\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Extended Church-Turing Thesis (ECTT) posits that all effective\\ninformation processing, including unbounded and non-uniform interactive\\ncomputations, can be described in terms of interactive Turing machines with\\nadvice. Does this assertion also apply to the abilities of contemporary large\\nlanguage models (LLMs)? From a broader perspective, this question calls for an\\ninvestigation of the computational power of LLMs by the classical means of\\ncomputability and computational complexity theory, especially the theory of\\nautomata. Along these lines, we establish a number of fundamental results.\\nFirstly, we argue that any fixed (non-adaptive) LLM is computationally\\nequivalent to a, possibly very large, deterministic finite-state transducer.\\nThis characterizes the base level of LLMs. We extend this to a key result\\nconcerning the simulation of space-bounded Turing machines by LLMs. Secondly,\\nwe show that lineages of evolving LLMs are computationally equivalent to\\ninteractive Turing machines with advice. The latter finding confirms the\\nvalidity of the ECTT for lineages of LLMs. From a computability viewpoint, it\\nalso suggests that lineages of LLMs possess super-Turing computational power.\\nConsequently, in our computational model knowledge generation is in general a\\nnon-algorithmic process realized by lineages of LLMs. Finally, we discuss the\\nmerits of our findings in the broader context of several related disciplines\\nand philosophies.\",\"PeriodicalId\":501124,\"journal\":{\"name\":\"arXiv - CS - Formal Languages and Automata Theory\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Formal Languages and Automata Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.06978\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Formal Languages and Automata Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06978","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Large Language Models and the Extended Church-Turing Thesis
The Extended Church-Turing Thesis (ECTT) posits that all effective
information processing, including unbounded and non-uniform interactive
computations, can be described in terms of interactive Turing machines with
advice. Does this assertion also apply to the abilities of contemporary large
language models (LLMs)? From a broader perspective, this question calls for an
investigation of the computational power of LLMs by the classical means of
computability and computational complexity theory, especially the theory of
automata. Along these lines, we establish a number of fundamental results.
Firstly, we argue that any fixed (non-adaptive) LLM is computationally
equivalent to a, possibly very large, deterministic finite-state transducer.
This characterizes the base level of LLMs. We extend this to a key result
concerning the simulation of space-bounded Turing machines by LLMs. Secondly,
we show that lineages of evolving LLMs are computationally equivalent to
interactive Turing machines with advice. The latter finding confirms the
validity of the ECTT for lineages of LLMs. From a computability viewpoint, it
also suggests that lineages of LLMs possess super-Turing computational power.
Consequently, in our computational model knowledge generation is in general a
non-algorithmic process realized by lineages of LLMs. Finally, we discuss the
merits of our findings in the broader context of several related disciplines
and philosophies.