LLMs hallucinate graphs too: a structural perspective

arXiv - CS - Social and Information Networks. Pub Date: 2024-08-30. DOI: arxiv-2409.00159

Erwan Le Merrer, Gilles Tredan

Abstract

LLMs are known to hallucinate, that is, to return incorrect information as fact. In this paper, we introduce the possibility of studying these hallucinations in a structured form: graphs. Hallucinations in this context are incorrect outputs returned when an LLM is prompted for well-known graphs from the literature (e.g., Karate club, Les Misérables, graph atlas). These hallucinated graphs have the advantage of being much richer than the mere accuracy, or inaccuracy, of a single fact; this paper thus argues that such rich hallucinations can be used to characterize the outputs of LLMs. Our first contribution is to observe the diversity of topological hallucinations produced by major modern LLMs. Our second contribution is a metric for the amplitude of such hallucinations: the Graph Atlas Distance, that is, the average graph edit distance from several graphs in the graph atlas set. We compare this metric to the Hallucination Leaderboard, a hallucination ranking that relies on 10,000 times more prompts.
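The Graph Atlas Distance described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes unit-cost edits (each node or edge insertion or deletion costs 1), uses a brute-force matcher that is only practical for very small graphs, and reads "average graph edit distance" as the mean distance between each reference graph and the graph an LLM returned for it. The names `edit_distance` and `graph_atlas_distance` are hypothetical, and the toy graphs stand in for real atlas entries and real LLM outputs.

```python
from itertools import permutations
from statistics import mean


def edit_distance(n1, edges1, n2, edges2):
    """Exact unit-cost edit distance between two small unlabeled graphs.

    Each graph is given as a node count (nodes are 0..n-1) and a list of
    undirected edges. Assumption: every node or edge insertion/deletion
    costs 1. Brute force, so only suitable for a handful of nodes.
    """
    # Make graph 1 the smaller one; unit costs make the distance symmetric.
    if n1 > n2:
        n1, edges1, n2, edges2 = n2, edges2, n1, edges1
    e1 = {frozenset(e) for e in edges1}
    e2 = {frozenset(e) for e in edges2}
    best = None
    # Try every injective mapping of graph 1's nodes into graph 2's nodes.
    for image in permutations(range(n2), n1):
        mapped = {frozenset(image[u] for u in e) for e in e1}
        cost = len(mapped ^ e2)  # edges to delete plus edges to insert
        if best is None or cost < best:
            best = cost
    # Unmatched nodes of the larger graph must be inserted (or deleted).
    return (n2 - n1) + best


def graph_atlas_distance(pairs):
    """Mean edit distance over (reference_graph, llm_graph) pairs."""
    return mean(edit_distance(*ref, *out) for ref, out in pairs)


# Toy stand-ins (not actual atlas entries or real LLM outputs).
triangle = (3, [(0, 1), (1, 2), (0, 2)])
path3 = (3, [(0, 1), (1, 2)])

# The hypothetical LLM returned a path when asked for the triangle,
# and returned the path correctly when asked for the path.
score = graph_atlas_distance([(triangle, path3), (path3, path3)])
print(score)
```

For graphs of realistic size, a library routine such as NetworkX's `graph_edit_distance` would be a more practical choice than this exhaustive search.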
