Episodic memory transfer for multi-task reinforcement learning

Q2 Psychology Biologically Inspired Cognitive Architectures Pub Date : 2018-10-01 DOI:10.1016/j.bica.2018.09.003

Artyom Y. Sorokin, Mikhail S. Burtsev

{"title":"Episodic memory transfer for multi-task reinforcement learning","authors":"Artyom Y. Sorokin, Mikhail S. Burtsev","doi":"10.1016/j.bica.2018.09.003","DOIUrl":null,"url":null,"abstract":"<div>Episodic memory plays important role in animal behavior. It allows to reuse general skills for solution of specific tasks in changing environment. This beneficial feature of biological cognitive systems is still not incorporated successfully in an artificial neural architectures. In this paper we propose a neural architecture with shared episodic memory for multi-task reinforcement learning (SEM-PAAC). This architecture extends Parallel Advantage Actor Critic (PAAC) with two recurrent sub-networks for separate tracking of environment and task states. The first subnetwork store episodic memory and the second one allows task specific execution of policy. Experiments in the Taxi domain demonstrated that SEM-PAAC has the same performance as PAAC when subtasks are solved separately. On the other hand when subtasks are solved jointly for completing full Taxi task SEM-PAAC is significantly better due to reuse of episodic memory. Proposed architecture also successfully learned to predict task completion. This is a step towards more autonomous agents for multitask problems.</div>","PeriodicalId":48756,"journal":{"name":"Biologically Inspired Cognitive Architectures","volume":"26 ","pages":"Pages 91-95"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.bica.2018.09.003","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biologically Inspired Cognitive Architectures","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2212683X18300902","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Psychology","Score":null,"Total":0}

引用次数: 3

Abstract

Episodic memory plays important role in animal behavior. It allows to reuse general skills for solution of specific tasks in changing environment. This beneficial feature of biological cognitive systems is still not incorporated successfully in an artificial neural architectures. In this paper we propose a neural architecture with shared episodic memory for multi-task reinforcement learning (SEM-PAAC). This architecture extends Parallel Advantage Actor Critic (PAAC) with two recurrent sub-networks for separate tracking of environment and task states. The first subnetwork store episodic memory and the second one allows task specific execution of policy. Experiments in the Taxi domain demonstrated that SEM-PAAC has the same performance as PAAC when subtasks are solved separately. On the other hand when subtasks are solved jointly for completing full Taxi task SEM-PAAC is significantly better due to reuse of episodic memory. Proposed architecture also successfully learned to predict task completion. This is a step towards more autonomous agents for multitask problems.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

情景记忆在多任务强化学习中的迁移

情景记忆在动物行为中起着重要的作用。它允许在不断变化的环境中重用通用技能来解决特定任务。生物认知系统的这一有益特征仍未成功地纳入人工神经体系结构。本文提出了一种基于共享情景记忆的多任务强化学习(SEM-PAAC)神经结构。该体系结构扩展了并行优势参与者批评(PAAC)，使用两个循环子网络分别跟踪环境和任务状态。第一子网存储情景记忆，第二子网允许特定于任务的策略执行。Taxi领域的实验表明，SEM-PAAC在单独求解子任务时具有与PAAC相同的性能。另一方面，由于情景记忆的重用，当子任务联合解决以完成完整的出租车任务时，SEM-PAAC显著更好。提出的架构也成功地学会了预测任务的完成。这是向多任务问题的更自主代理迈出的一步。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Biologically Inspired Cognitive Architectures COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCEN-NEUROSCIENCES

CiteScore

3.60

自引率

0.00%

发文量

期刊介绍： Announcing the merge of Biologically Inspired Cognitive Architectures with Cognitive Systems Research. Cognitive Systems Research is dedicated to the study of human-level cognition. As such, it welcomes papers which advance the understanding, design and applications of cognitive and intelligent systems, both natural and artificial. The journal brings together a broad community studying cognition in its many facets in vivo and in silico, across the developmental spectrum, focusing on individual capacities or on entire architectures. It aims to foster debate and integrate ideas, concepts, constructs, theories, models and techniques from across different disciplines and different perspectives on human-level cognition. The scope of interest includes the study of cognitive capacities and architectures - both brain-inspired and non-brain-inspired - and the application of cognitive systems to real-world problems as far as it offers insights relevant for the understanding of cognition. Cognitive Systems Research therefore welcomes mature and cutting-edge research approaching cognition from a systems-oriented perspective, both theoretical and empirically-informed, in the form of original manuscripts, short communications, opinion articles, systematic reviews, and topical survey articles from the fields of Cognitive Science (including Philosophy of Cognitive Science), Artificial Intelligence/Computer Science, Cognitive Robotics, Developmental Science, Psychology, and Neuroscience and Neuromorphic Engineering. Empirical studies will be considered if they are supplemented by theoretical analyses and contributions to theory development and/or computational modelling studies.