A Model for Cognitively Valid Lifelong Learning

Hanne Say, E. Oztop
{"title":"A Model for Cognitively Valid Lifelong Learning","authors":"Hanne Say, E. Oztop","doi":"10.1109/ROBIO58561.2023.10355028","DOIUrl":null,"url":null,"abstract":"In continual learning, usually a sequence of tasks are given to a learning agent and the performance of the agent after learning is measured in terms of resistance to catastrophic forgetting, efficacy of knowledge transfer and overall performance on the individual tasks. On the other hand, in multi-task learning, the system is designed to simultaneously acquire knowledge in multiple tasks, often through offline batch learning. A more cognitively valid scenario for lifelong robot learning would be to have a robotic agent to autonomously decide which task to engage and disengage while leveraging many-to-many knowledge transfer ability among tasks during online learning. In this study, we propose a novel lifelong robot learning architecture to fulfill the aforementioned desiderata, and show its validity in an environment where a robot learns the effects of its actions in different task settings. To realize the proposed model, we adopt learning progress measure for task selection, and have the tasks learn by independent neural networks with special structure that allows access to the neural layers of the non-selected tasks. The experiments conducted with a simulated robot arm in an object interaction scenario show that the proposed architecture yields better knowledge transfer and facilitates faster learning compared to baselines of fixed sequence task learning and isolated task learners with no knowledge transfer.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"83 8","pages":"1-7"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROBIO58561.2023.10355028","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In continual learning, usually a sequence of tasks are given to a learning agent and the performance of the agent after learning is measured in terms of resistance to catastrophic forgetting, efficacy of knowledge transfer and overall performance on the individual tasks. On the other hand, in multi-task learning, the system is designed to simultaneously acquire knowledge in multiple tasks, often through offline batch learning. A more cognitively valid scenario for lifelong robot learning would be to have a robotic agent to autonomously decide which task to engage and disengage while leveraging many-to-many knowledge transfer ability among tasks during online learning. In this study, we propose a novel lifelong robot learning architecture to fulfill the aforementioned desiderata, and show its validity in an environment where a robot learns the effects of its actions in different task settings. To realize the proposed model, we adopt learning progress measure for task selection, and have the tasks learn by independent neural networks with special structure that allows access to the neural layers of the non-selected tasks. The experiments conducted with a simulated robot arm in an object interaction scenario show that the proposed architecture yields better knowledge transfer and facilitates faster learning compared to baselines of fixed sequence task learning and isolated task learners with no knowledge transfer.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
认知有效的终身学习模式
在持续学习中,通常会给学习代理一连串的任务,学习代理学习后的表现则根据其对灾难性遗忘的抵抗能力、知识迁移的效率以及在单个任务中的总体表现来衡量。另一方面,在多任务学习中,系统被设计为同时获取多个任务的知识,通常是通过离线批量学习。对于机器人的终身学习而言,一个更符合认知规律的方案是让机器人代理自主决定参与和脱离哪项任务,同时在在线学习过程中利用任务间多对多的知识转移能力。在本研究中,我们提出了一种新型的机器人终身学习架构,以满足上述需求,并在机器人学习其在不同任务设置中的行动效果的环境中展示了其有效性。为了实现所提出的模型,我们采用了学习进度衡量标准来进行任务选择,并让任务通过独立的神经网络进行学习,该网络具有特殊的结构,允许访问非选择任务的神经层。在模拟机械臂与物体交互场景中进行的实验表明,与固定顺序任务学习基线和无知识转移的孤立任务学习基线相比,所提出的架构能产生更好的知识转移并促进更快的学习。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Barometric Soft Tactile Sensor for Depth Independent Contact Localization Stability Margin Based Gait Design on Slopes for a Novel Reconfigurable Quadruped Robot with a Foldable Trunk Blind Walking Balance Control and Disturbance Rejection of the Bipedal Humanoid Robot Xiao-Man via Reinforcement Learning A Closed-Loop Multi-perspective Visual Servoing Approach with Reinforcement Learning Modeling and Analysis of Pipe External Surface Grinding Force using Cup-shaped Wire Brush
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1