优化汽车行业 PDF 聊天机器人的 RAG 技术：本地部署的 Ollama 模型案例研究

arXiv - CS - Multiagent Systems Pub Date : 2024-08-12 DOI:arxiv-2408.05933

Fei Liu, Zejun Kang, Xing Han

{"title":"优化汽车行业 PDF 聊天机器人的 RAG 技术：本地部署的 Ollama 模型案例研究","authors":"Fei Liu, Zejun Kang, Xing Han","doi":"arxiv-2408.05933","DOIUrl":null,"url":null,"abstract":"With the growing demand for offline PDF chatbots in automotive industrial\nproduction environments, optimizing the deployment of large language models\n(LLMs) in local, low-performance settings has become increasingly important.\nThis study focuses on enhancing Retrieval-Augmented Generation (RAG) techniques\nfor processing complex automotive industry documents using locally deployed\nOllama models. Based on the Langchain framework, we propose a multi-dimensional\noptimization approach for Ollama's local RAG implementation. Our method\naddresses key challenges in automotive document processing, including\nmulti-column layouts and technical specifications. We introduce improvements in\nPDF processing, retrieval mechanisms, and context compression, tailored to the\nunique characteristics of automotive industry documents. Additionally, we\ndesign custom classes supporting embedding pipelines and an agent supporting\nself-RAG based on LangGraph best practices. To evaluate our approach, we\nconstructed a proprietary dataset comprising typical automotive industry\ndocuments, including technical reports and corporate regulations. We compared\nour optimized RAG model and self-RAG agent against a naive RAG baseline across\nthree datasets: our automotive industry dataset, QReCC, and CoQA. Results\ndemonstrate significant improvements in context precision, context recall,\nanswer relevancy, and faithfulness, with particularly notable performance on\nthe automotive industry dataset. Our optimization scheme provides an effective\nsolution for deploying local RAG systems in the automotive sector, addressing\nthe specific needs of PDF chatbots in industrial production environments. This\nresearch has important implications for advancing information processing and\nintelligent production in the automotive industry.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"113 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing RAG Techniques for Automotive Industry PDF Chatbots: A Case Study with Locally Deployed Ollama Models\",\"authors\":\"Fei Liu, Zejun Kang, Xing Han\",\"doi\":\"arxiv-2408.05933\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the growing demand for offline PDF chatbots in automotive industrial\\nproduction environments, optimizing the deployment of large language models\\n(LLMs) in local, low-performance settings has become increasingly important.\\nThis study focuses on enhancing Retrieval-Augmented Generation (RAG) techniques\\nfor processing complex automotive industry documents using locally deployed\\nOllama models. Based on the Langchain framework, we propose a multi-dimensional\\noptimization approach for Ollama's local RAG implementation. Our method\\naddresses key challenges in automotive document processing, including\\nmulti-column layouts and technical specifications. We introduce improvements in\\nPDF processing, retrieval mechanisms, and context compression, tailored to the\\nunique characteristics of automotive industry documents. Additionally, we\\ndesign custom classes supporting embedding pipelines and an agent supporting\\nself-RAG based on LangGraph best practices. To evaluate our approach, we\\nconstructed a proprietary dataset comprising typical automotive industry\\ndocuments, including technical reports and corporate regulations. We compared\\nour optimized RAG model and self-RAG agent against a naive RAG baseline across\\nthree datasets: our automotive industry dataset, QReCC, and CoQA. Results\\ndemonstrate significant improvements in context precision, context recall,\\nanswer relevancy, and faithfulness, with particularly notable performance on\\nthe automotive industry dataset. Our optimization scheme provides an effective\\nsolution for deploying local RAG systems in the automotive sector, addressing\\nthe specific needs of PDF chatbots in industrial production environments. This\\nresearch has important implications for advancing information processing and\\nintelligent production in the automotive industry.\",\"PeriodicalId\":501315,\"journal\":{\"name\":\"arXiv - CS - Multiagent Systems\",\"volume\":\"113 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Multiagent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.05933\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Multiagent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.05933","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

随着汽车工业生产环境中对离线 PDF 聊天机器人的需求日益增长，在本地、低性能环境中优化大型语言模型（LLM）的部署变得越来越重要。本研究的重点是利用本地部署的 Ollama 模型增强检索增强生成（RAG）技术，以处理复杂的汽车行业文档。基于 Langchain 框架，我们为 Ollama 的本地 RAG 实现提出了一种多维优化方法。我们的方法解决了汽车文档处理中的关键难题，包括多列布局和技术规范。我们针对汽车行业文档的独特性，在 PDF 处理、检索机制和上下文压缩方面进行了改进。此外，我们还设计了支持嵌入管道的自定义类，以及基于 LangGraph 最佳实践的支持自 RAG 的代理。为了评估我们的方法，我们构建了一个专有数据集，其中包括典型的汽车行业文档，包括技术报告和公司法规。我们在三个数据集（汽车行业数据集、QReCC 和 CoQA）上比较了我们的优化 RAG 模型和自 RAG 代理与原始 RAG 基线。结果表明，在上下文精确度、上下文召回率、答案相关性和忠实性方面都有显著提高，在汽车行业数据集上的表现尤为突出。我们的优化方案为在汽车行业部署本地 RAG 系统提供了有效的解决方案，满足了工业生产环境中 PDF 聊天机器人的特定需求。这项研究对推动汽车行业的信息处理和智能生产具有重要意义。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Optimizing RAG Techniques for Automotive Industry PDF Chatbots: A Case Study with Locally Deployed Ollama Models

With the growing demand for offline PDF chatbots in automotive industrial production environments, optimizing the deployment of large language models (LLMs) in local, low-performance settings has become increasingly important. This study focuses on enhancing Retrieval-Augmented Generation (RAG) techniques for processing complex automotive industry documents using locally deployed Ollama models. Based on the Langchain framework, we propose a multi-dimensional optimization approach for Ollama's local RAG implementation. Our method addresses key challenges in automotive document processing, including multi-column layouts and technical specifications. We introduce improvements in PDF processing, retrieval mechanisms, and context compression, tailored to the unique characteristics of automotive industry documents. Additionally, we design custom classes supporting embedding pipelines and an agent supporting self-RAG based on LangGraph best practices. To evaluate our approach, we constructed a proprietary dataset comprising typical automotive industry documents, including technical reports and corporate regulations. We compared our optimized RAG model and self-RAG agent against a naive RAG baseline across three datasets: our automotive industry dataset, QReCC, and CoQA. Results demonstrate significant improvements in context precision, context recall, answer relevancy, and faithfulness, with particularly notable performance on the automotive industry dataset. Our optimization scheme provides an effective solution for deploying local RAG systems in the automotive sector, addressing the specific needs of PDF chatbots in industrial production environments. This research has important implications for advancing information processing and intelligent production in the automotive industry.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - CS - Multiagent Systems

自引率

0.00%

发文量