{"title":"Pairing Analogy-Augmented Generation with Procedural Memory for Procedural Q&A","authors":"K Roth, Rushil Gupta, Simon Halle, Bang Liu","doi":"arxiv-2409.01344","DOIUrl":null,"url":null,"abstract":"While LLMs in the RAG paradigm have shown remarkable performance on a variety\nof tasks, they still under-perform on unseen domains, especially on complex\ntasks like procedural question answering. In this work, we introduce a novel\nformalism and structure for manipulating text-based procedures. Based on this\nformalism, we further present a novel dataset called LCStep, scraped from the\nLangChain Python docs. Moreover, we extend the traditional RAG system to\npropose a novel system called analogy-augmented generation (AAG), that draws\ninspiration from human analogical reasoning and ability to assimilate past\nexperiences to solve unseen problems. The proposed method uses a frozen\nlanguage model with a custom procedure memory store to adapt to specialized\nknowledge. We demonstrate that AAG outperforms few-shot and RAG baselines on\nLCStep, RecipeNLG, and CHAMP datasets under a pairwise LLM-based evaluation,\ncorroborated by human evaluation in the case of RecipeNLG.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"25 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01344","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
While LLMs in the RAG paradigm have shown remarkable performance on a variety
of tasks, they still under-perform on unseen domains, especially on complex
tasks like procedural question answering. In this work, we introduce a novel
formalism and structure for manipulating text-based procedures. Based on this
formalism, we further present a novel dataset called LCStep, scraped from the
LangChain Python docs. Moreover, we extend the traditional RAG system to
propose a novel system called analogy-augmented generation (AAG), that draws
inspiration from human analogical reasoning and ability to assimilate past
experiences to solve unseen problems. The proposed method uses a frozen
language model with a custom procedure memory store to adapt to specialized
knowledge. We demonstrate that AAG outperforms few-shot and RAG baselines on
LCStep, RecipeNLG, and CHAMP datasets under a pairwise LLM-based evaluation,
corroborated by human evaluation in the case of RecipeNLG.