Episodic memory transfer for multi-task reinforcement learning

Artyom Y. Sorokin, Mikhail S. Burtsev
{"title":"Episodic memory transfer for multi-task reinforcement learning","authors":"Artyom Y. Sorokin,&nbsp;Mikhail S. Burtsev","doi":"10.1016/j.bica.2018.09.003","DOIUrl":null,"url":null,"abstract":"<div><p><span><span>Episodic memory plays important role in animal </span>behavior. It allows to reuse general skills for solution of specific tasks in changing environment. This beneficial feature of biological cognitive systems is still not incorporated successfully in an artificial neural architectures. In this paper we propose a neural architecture with shared episodic memory for multi-task </span>reinforcement learning<span> (SEM-PAAC). This architecture extends Parallel Advantage Actor Critic (PAAC) with two recurrent<span> sub-networks for separate tracking of environment and task states. The first subnetwork store episodic memory and the second one allows task specific execution of policy. Experiments in the Taxi domain demonstrated that SEM-PAAC has the same performance as PAAC when subtasks are solved separately. On the other hand when subtasks are solved jointly for completing full Taxi task SEM-PAAC is significantly better due to reuse of episodic memory. Proposed architecture also successfully learned to predict task completion. This is a step towards more autonomous agents for multitask problems.</span></span></p></div>","PeriodicalId":48756,"journal":{"name":"Biologically Inspired Cognitive Architectures","volume":"26 ","pages":"Pages 91-95"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.bica.2018.09.003","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biologically Inspired Cognitive Architectures","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2212683X18300902","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Psychology","Score":null,"Total":0}
引用次数: 3

Abstract

Episodic memory plays important role in animal behavior. It allows to reuse general skills for solution of specific tasks in changing environment. This beneficial feature of biological cognitive systems is still not incorporated successfully in an artificial neural architectures. In this paper we propose a neural architecture with shared episodic memory for multi-task reinforcement learning (SEM-PAAC). This architecture extends Parallel Advantage Actor Critic (PAAC) with two recurrent sub-networks for separate tracking of environment and task states. The first subnetwork store episodic memory and the second one allows task specific execution of policy. Experiments in the Taxi domain demonstrated that SEM-PAAC has the same performance as PAAC when subtasks are solved separately. On the other hand when subtasks are solved jointly for completing full Taxi task SEM-PAAC is significantly better due to reuse of episodic memory. Proposed architecture also successfully learned to predict task completion. This is a step towards more autonomous agents for multitask problems.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
情景记忆在多任务强化学习中的迁移
情景记忆在动物行为中起着重要的作用。它允许在不断变化的环境中重用通用技能来解决特定任务。生物认知系统的这一有益特征仍未成功地纳入人工神经体系结构。本文提出了一种基于共享情景记忆的多任务强化学习(SEM-PAAC)神经结构。该体系结构扩展了并行优势参与者批评(PAAC),使用两个循环子网络分别跟踪环境和任务状态。第一子网存储情景记忆,第二子网允许特定于任务的策略执行。Taxi领域的实验表明,SEM-PAAC在单独求解子任务时具有与PAAC相同的性能。另一方面,由于情景记忆的重用,当子任务联合解决以完成完整的出租车任务时,SEM-PAAC显著更好。提出的架构也成功地学会了预测任务的完成。这是向多任务问题的更自主代理迈出的一步。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Biologically Inspired Cognitive Architectures
Biologically Inspired Cognitive Architectures COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCEN-NEUROSCIENCES
CiteScore
3.60
自引率
0.00%
发文量
0
期刊介绍: Announcing the merge of Biologically Inspired Cognitive Architectures with Cognitive Systems Research. Cognitive Systems Research is dedicated to the study of human-level cognition. As such, it welcomes papers which advance the understanding, design and applications of cognitive and intelligent systems, both natural and artificial. The journal brings together a broad community studying cognition in its many facets in vivo and in silico, across the developmental spectrum, focusing on individual capacities or on entire architectures. It aims to foster debate and integrate ideas, concepts, constructs, theories, models and techniques from across different disciplines and different perspectives on human-level cognition. The scope of interest includes the study of cognitive capacities and architectures - both brain-inspired and non-brain-inspired - and the application of cognitive systems to real-world problems as far as it offers insights relevant for the understanding of cognition. Cognitive Systems Research therefore welcomes mature and cutting-edge research approaching cognition from a systems-oriented perspective, both theoretical and empirically-informed, in the form of original manuscripts, short communications, opinion articles, systematic reviews, and topical survey articles from the fields of Cognitive Science (including Philosophy of Cognitive Science), Artificial Intelligence/Computer Science, Cognitive Robotics, Developmental Science, Psychology, and Neuroscience and Neuromorphic Engineering. Empirical studies will be considered if they are supplemented by theoretical analyses and contributions to theory development and/or computational modelling studies.
期刊最新文献
Biologically Inspired Cognitive Architectures 2019 - Proceedings of the 10th Annual Meeting of the BICA Society, BICA 2019, Seattle, WA, USA, August 15-18, 2019 How Language Processing can Shape a Common Model of Cognition Application of Registration of Human Vegetative Reactions in the Process of Functional Magnetic Resonance Imaging Reconfigurable Locomotion of Hexapod Robot Based on Inverse Kinematics Methods of Determining Errors in Open-Ended Text Questions
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1