Multimodal reinforcement learning for partner specific adaptation in robot-multi-robot interaction

M. Kirtay, V. Hafner, M. Asada, A. Kuhlen, Erhan Öztop
{"title":"Multimodal reinforcement learning for partner specific adaptation in robot-multi-robot interaction","authors":"M. Kirtay, V. Hafner, M. Asada, A. Kuhlen, Erhan Öztop","doi":"10.1109/Humanoids53995.2022.10000205","DOIUrl":null,"url":null,"abstract":"Successful and efficient teamwork requires knowledge of the individual team members' expertise. Such knowledge is typically acquired in social interaction and forms the basis for socially intelligent, partner-adapted behavior. This study aims to implement this ability in teams of multiple humanoid robots. To this end, a humanoid robot, Nao, interacted with three Pepper robots to perform a sequential audio-visual pattern recall task that required integrating multimodal information. Nao outsourced its decisions (i.e., action selections) to its robot partners to perform the task efficiently in terms of neural computational cost by applying reinforcement learning. During the interaction, Nao learned its partners' specific expertise, which allowed Nao to turn for guidance to the partner who has the expertise corresponding to the current task state. The cognitive processing of Nao included a multimodal auto-associative memory that allowed the determination of the cost of perceptual processing (i.e., cognitive load) when processing audio-visual stimuli. In turn, the processing cost is converted into a reward signal by an internal reward generation module. In this setting, the learner robot Nao aims to minimize cognitive load by turning to the partner whose expertise corresponds to a given task state. Overall, the results indicate that the learner robot discovers the expertise of partners and exploits this information to execute its task with low neural computational cost or cognitive load.","PeriodicalId":180816,"journal":{"name":"2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/Humanoids53995.2022.10000205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Successful and efficient teamwork requires knowledge of the individual team members' expertise. Such knowledge is typically acquired in social interaction and forms the basis for socially intelligent, partner-adapted behavior. This study aims to implement this ability in teams of multiple humanoid robots. To this end, a humanoid robot, Nao, interacted with three Pepper robots to perform a sequential audio-visual pattern recall task that required integrating multimodal information. Nao outsourced its decisions (i.e., action selections) to its robot partners to perform the task efficiently in terms of neural computational cost by applying reinforcement learning. During the interaction, Nao learned its partners' specific expertise, which allowed Nao to turn for guidance to the partner who has the expertise corresponding to the current task state. The cognitive processing of Nao included a multimodal auto-associative memory that allowed the determination of the cost of perceptual processing (i.e., cognitive load) when processing audio-visual stimuli. In turn, the processing cost is converted into a reward signal by an internal reward generation module. In this setting, the learner robot Nao aims to minimize cognitive load by turning to the partner whose expertise corresponds to a given task state. Overall, the results indicate that the learner robot discovers the expertise of partners and exploits this information to execute its task with low neural computational cost or cognitive load.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
机器人-多机器人交互中伙伴特定适应的多模态强化学习
成功和高效的团队合作需要了解每个团队成员的专业知识。这些知识通常是在社会交往中获得的,并形成了社会智能和伴侣适应行为的基础。本研究旨在在多个类人机器人组成的团队中实现这种能力。为此,一个人形机器人Nao与三个Pepper机器人相互作用,执行需要整合多模态信息的顺序视听模式回忆任务。Nao将其决策(即行动选择)外包给其机器人伙伴,通过应用强化学习,在神经计算成本方面有效地执行任务。在交互过程中,Nao了解到其合作伙伴的特定专业知识,这使得Nao能够向具有与当前任务状态相对应的专业知识的合作伙伴寻求指导。Nao的认知加工包括多模态自联想记忆,它允许在加工视听刺激时确定知觉加工的成本(即认知负荷)。然后,通过内部奖励生成模块将处理成本转换为奖励信号。在这种情况下,学习机器人Nao的目标是通过转向与给定任务状态相对应的专业知识的伙伴来最小化认知负荷。总体而言,研究结果表明,学习机器人能够发现合作伙伴的专业知识,并利用这些信息以较低的神经计算成本或认知负荷执行任务。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Enabling Patient- and Teleoperator-led Robotic Physiotherapy via Strain Map Segmentation and Shared-authority Self-Contained Calibration of an Elastic Humanoid Upper Body Using Only a Head-Mounted RGB Camera Self-collision avoidance in bimanual teleoperation using CollisionIK: algorithm revision and usability experiment Bimanual Manipulation Workspace Analysis of Humanoid Robots with Object Specific Coupling Constraints A Dexterous, Adaptive, Affordable, Humanlike Robot Hand: Towards Prostheses with Dexterous Manipulation Capabilities
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1