{"title":"LLMs Will Always Hallucinate, and We Need to Live With This","authors":"Sourav Banerjee, Ayushi Agarwal, Saloni Singla","doi":"arxiv-2409.05746","DOIUrl":null,"url":null,"abstract":"As Large Language Models become more ubiquitous across domains, it becomes\nimportant to examine their inherent limitations critically. This work argues\nthat hallucinations in language models are not just occasional errors but an\ninevitable feature of these systems. We demonstrate that hallucinations stem\nfrom the fundamental mathematical and logical structure of LLMs. It is,\ntherefore, impossible to eliminate them through architectural improvements,\ndataset enhancements, or fact-checking mechanisms. Our analysis draws on\ncomputational theory and Godel's First Incompleteness Theorem, which references\nthe undecidability of problems like the Halting, Emptiness, and Acceptance\nProblems. We demonstrate that every stage of the LLM process-from training data\ncompilation to fact retrieval, intent classification, and text generation-will\nhave a non-zero probability of producing hallucinations. This work introduces\nthe concept of Structural Hallucination as an intrinsic nature of these\nsystems. By establishing the mathematical certainty of hallucinations, we\nchallenge the prevailing notion that they can be fully mitigated.","PeriodicalId":501340,"journal":{"name":"arXiv - STAT - Machine Learning","volume":"32 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.05746","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
As Large Language Models become more ubiquitous across domains, it becomes important to examine their inherent limitations critically. This work argues that hallucinations in language models are not just occasional errors but an inevitable feature of these systems. We demonstrate that hallucinations stem from the fundamental mathematical and logical structure of LLMs. It is, therefore, impossible to eliminate them through architectural improvements, dataset enhancements, or fact-checking mechanisms. Our analysis draws on computational theory and Gödel's First Incompleteness Theorem, invoking the undecidability of problems such as the Halting, Emptiness, and Acceptance Problems. We demonstrate that every stage of the LLM process, from training data compilation to fact retrieval, intent classification, and text generation, has a non-zero probability of producing hallucinations. This work introduces the concept of Structural Hallucination as an intrinsic property of these systems. By establishing the mathematical certainty of hallucinations, we challenge the prevailing notion that they can be fully mitigated.
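
As a minimal sketch of the compounding argument the abstract gestures at (the notation and independence assumption are illustrative, not taken from the paper): suppose, hypothetically, that each of the n pipeline stages named above fails independently with some probability p_i > 0. Then

\[
\Pr[\text{hallucination}] \;\ge\; 1 - \prod_{i=1}^{n} \bigl(1 - p_i\bigr) \;>\; 0,
\]

so the end-to-end hallucination probability is bounded away from zero whenever any single stage has a positive error rate. This toy bound only shows that the probability cannot reach zero under these assumptions; the paper's stronger, structural claim rests on the undecidability results (Halting, Emptiness, Acceptance) it cites.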