{"title":"LLMs hallucinate graphs too: a structural perspective","authors":"Erwan Le Merrer, Gilles Tredan","doi":"arxiv-2409.00159","DOIUrl":null,"url":null,"abstract":"It is known that LLMs do hallucinate, that is, they return incorrect\ninformation as facts. In this paper, we introduce the possibility to study\nthese hallucinations under a structured form: graphs. Hallucinations in this\ncontext are incorrect outputs when prompted for well known graphs from the\nliterature (e.g. Karate club, Les Mis\\'erables, graph atlas). These\nhallucinated graphs have the advantage of being much richer than the factual\naccuracy -- or not -- of a fact; this paper thus argues that such rich\nhallucinations can be used to characterize the outputs of LLMs. Our first\ncontribution observes the diversity of topological hallucinations from major\nmodern LLMs. Our second contribution is the proposal of a metric for the\namplitude of such hallucinations: the Graph Atlas Distance, that is the average\ngraph edit distance from several graphs in the graph atlas set. We compare this\nmetric to the Hallucination Leaderboard, a hallucination rank that leverages\n10,000 times more prompts to obtain its ranking.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Social and Information Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.00159","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
It is known that LLMs do hallucinate, that is, they return incorrect
information as facts. In this paper, we introduce the possibility to study
these hallucinations under a structured form: graphs. Hallucinations in this
context are incorrect outputs when prompted for well known graphs from the
literature (e.g. Karate club, Les Mis\'erables, graph atlas). These
hallucinated graphs have the advantage of being much richer than the factual
accuracy -- or not -- of a fact; this paper thus argues that such rich
hallucinations can be used to characterize the outputs of LLMs. Our first
contribution observes the diversity of topological hallucinations from major
modern LLMs. Our second contribution is the proposal of a metric for the
amplitude of such hallucinations: the Graph Atlas Distance, that is the average
graph edit distance from several graphs in the graph atlas set. We compare this
metric to the Hallucination Leaderboard, a hallucination rank that leverages
10,000 times more prompts to obtain its ranking.