Ex-RL: Experience-based reinforcement learning

IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS Information Sciences Pub Date : 2024-09-16 DOI:10.1016/j.ins.2024.121479
{"title":"Ex-RL: Experience-based reinforcement learning","authors":"","doi":"10.1016/j.ins.2024.121479","DOIUrl":null,"url":null,"abstract":"<div><p>Reinforcement learning (RL) has achieved significant success across various tasks. However, generalizing RL for similar tasks remains a challenge. This study leverages expertise from related tasks to introduce a novel algorithm, Ex-RL, for executing transfer learning in tabular RL. The methodology concentrates on abstracting previous experiences into descriptive data and utilizing such data for similar tasks. The research focuses on classic RL solutions for balancing and anti-balancing, which improve the sample efficiency of the learning process. Studies indicate that weak learners, such as Q-learning, require fewer learning episodes, resulting in a 50% improvement and a higher success rate in the learning process. An online virtual lab was developed to facilitate the execution of the experiments. The code is available at <span><span>Github</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025524013938","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Reinforcement learning (RL) has achieved significant success across various tasks. However, generalizing RL for similar tasks remains a challenge. This study leverages expertise from related tasks to introduce a novel algorithm, Ex-RL, for executing transfer learning in tabular RL. The methodology concentrates on abstracting previous experiences into descriptive data and utilizing such data for similar tasks. The research focuses on classic RL solutions for balancing and anti-balancing, which improve the sample efficiency of the learning process. Studies indicate that weak learners, such as Q-learning, require fewer learning episodes, resulting in a 50% improvement and a higher success rate in the learning process. An online virtual lab was developed to facilitate the execution of the experiments. The code is available at Github.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Ex-RL:基于经验的强化学习
强化学习(RL)在各种任务中取得了巨大成功。然而,在类似任务中推广 RL 仍然是一项挑战。本研究利用相关任务的专业知识,引入了一种新算法 Ex-RL,用于在表格 RL 中执行迁移学习。该方法专注于将以往经验抽象为描述性数据,并将这些数据用于类似任务。研究重点是平衡和反平衡的经典 RL 解决方案,它们能提高学习过程的样本效率。研究表明,弱学习者(如 Q-learning)需要的学习集数较少,从而提高了 50%,并提高了学习过程的成功率。为了方便实验的实施,我们开发了一个在线虚拟实验室。代码可在 Github 上获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Information Sciences
Information Sciences 工程技术-计算机:信息系统
CiteScore
14.00
自引率
17.30%
发文量
1322
审稿时长
10.4 months
期刊介绍: Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions. Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.
期刊最新文献
Wavelet structure-texture-aware super-resolution for pedestrian detection HVASR: Enhancing 360-degree video delivery with viewport-aware super resolution KNEG-CL: Unveiling data patterns using a k-nearest neighbor evolutionary graph for efficient clustering Fréchet and Gateaux gH-differentiability for interval valued functions of multiple variables Detecting fuzzy-rough conditional anomalies
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1