Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE transactions on neural networks and learning systems Pub Date : 2024-11-25 DOI:10.1109/TNNLS.2024.3497992

Yuji Cao;Huan Zhao;Yuheng Cheng;Ting Shu;Yue Chen;Guolong Liu;Gaoqi Liang;Junhua Zhao;Jinyue Yan;Yun Li

{"title":"Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods","authors":"Yuji Cao;Huan Zhao;Yuheng Cheng;Ting Shu;Yue Chen;Guolong Liu;Gaoqi Liang;Junhua Zhao;Jinyue Yan;Yun Li","doi":"10.1109/TNNLS.2024.3497992","DOIUrl":null,"url":null,"abstract":"With extensive pretrained knowledge and high-level general capabilities, large language models (LLMs) emerge as a promising avenue to augment reinforcement learning (RL) in aspects, such as multitask learning, sample efficiency, and high-level task planning. In this survey, we provide a comprehensive review of the existing literature in LLM-enhanced RL and summarize its characteristics compared with conventional RL methods, aiming to clarify the research scope and directions for future studies. Utilizing the classical agent-environment interaction paradigm, we propose a structured taxonomy to systematically categorize LLMs’ functionalities in RL, including four roles: information processor, reward designer, decision-maker, and generator. For each role, we summarize the methodologies, analyze the specific RL challenges that are mitigated and provide insights into future directions. Finally, the comparative analysis of each role, potential applications, prospective opportunities, and challenges of the LLM-enhanced RL are discussed. By proposing this taxonomy, we aim to provide a framework for researchers to effectively leverage LLMs in the RL field, potentially accelerating RL applications in complex applications, such as robotics, autonomous driving, and energy systems.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"36 6","pages":"9737-9757"},"PeriodicalIF":8.9000,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on neural networks and learning systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10766898/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

With extensive pretrained knowledge and high-level general capabilities, large language models (LLMs) emerge as a promising avenue to augment reinforcement learning (RL) in aspects, such as multitask learning, sample efficiency, and high-level task planning. In this survey, we provide a comprehensive review of the existing literature in LLM-enhanced RL and summarize its characteristics compared with conventional RL methods, aiming to clarify the research scope and directions for future studies. Utilizing the classical agent-environment interaction paradigm, we propose a structured taxonomy to systematically categorize LLMs’ functionalities in RL, including four roles: information processor, reward designer, decision-maker, and generator. For each role, we summarize the methodologies, analyze the specific RL challenges that are mitigated and provide insights into future directions. Finally, the comparative analysis of each role, potential applications, prospective opportunities, and challenges of the LLM-enhanced RL are discussed. By proposing this taxonomy, we aim to provide a framework for researchers to effectively leverage LLMs in the RL field, potentially accelerating RL applications in complex applications, such as robotics, autonomous driving, and energy systems.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

大型语言模型增强强化学习调查：概念、分类和方法

凭借广泛的预训练知识和高级通用能力，大型语言模型（llm）在多任务学习、样本效率和高级任务规划等方面成为增强强化学习（RL）的有前途的途径。在本文中，我们对LLM-enhanced RL的现有文献进行了全面的综述，并总结了其与传统RL方法相比的特点，旨在明确未来研究的范围和方向。利用经典的智能体-环境交互范式，我们提出了一个结构化的分类法，系统地对法学硕士在强化学习中的功能进行分类，包括四个角色：信息处理者、奖励设计者、决策者和生成器。对于每个角色，我们总结了方法，分析了缓解的具体RL挑战，并提供了对未来方向的见解。最后，对法学硕士增强的RL的作用、潜在应用、前景机遇和挑战进行了比较分析。通过提出这一分类，我们的目标是为研究人员提供一个框架，以有效地利用RL领域的法学硕士，潜在地加速RL在复杂应用中的应用，如机器人、自动驾驶和能源系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on neural networks and learning systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CiteScore

23.80

自引率

9.60%

发文量

2102

审稿时长

3-8 weeks

期刊介绍： The focus of IEEE Transactions on Neural Networks and Learning Systems is to present scholarly articles discussing the theory, design, and applications of neural networks as well as other learning systems. The journal primarily highlights technical and scientific research in this domain.