Heterogeneous Imitation Learning from Demonstrators of Varying Physiology and Skill

Jeff Allen, J. Anderson
{"title":"Heterogeneous Imitation Learning from Demonstrators of Varying Physiology and Skill","authors":"Jeff Allen, J. Anderson","doi":"10.1109/ICMLA.2010.23","DOIUrl":null,"url":null,"abstract":"Imitation learning enables a learner to improve its abilities by observing others. Most robotic imitation learning systems only learn from demonstrators that are homogeneous physiologically (i.e. the same size and mode of locomotion) and in terms of skill level. To successfully learn from physically heterogeneous robots that may also vary in ability, the imitator must be able to abstract behaviours it observes and approximate them with its own actions, which may be very different than those of the demonstrator. This paper describes an approach to imitation learning from heterogeneous demonstrators, using global vision for observations. It supports learning from physiologically different demonstrators (wheeled and legged, of various sizes), and self-adapts to demonstrators with varying levels of skill. The latter allows a bias toward demonstrators that are successful in the domain, but also allows different parts of a task to be learned from different individuals (that is, worthwhile parts of a task can still be learned from a poorly-performing demonstrator). We assume the imitator has no initial knowledge of the observable effects of its own actions, and train a set of Hidden Markov Models to map observations to actions and create an understanding of the imitator's own abilities. We then use a combination of tracking sequences of primitives and predicting future primitives from existing combinations using forward models to learn abstract behaviours from the demonstrations of others. This approach is evaluated using a group of heterogeneous robots that have been previously used in RoboCup soccer competitions.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Ninth International Conference on Machine Learning and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2010.23","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Imitation learning enables a learner to improve its abilities by observing others. Most robotic imitation learning systems only learn from demonstrators that are homogeneous physiologically (i.e. the same size and mode of locomotion) and in terms of skill level. To successfully learn from physically heterogeneous robots that may also vary in ability, the imitator must be able to abstract behaviours it observes and approximate them with its own actions, which may be very different than those of the demonstrator. This paper describes an approach to imitation learning from heterogeneous demonstrators, using global vision for observations. It supports learning from physiologically different demonstrators (wheeled and legged, of various sizes), and self-adapts to demonstrators with varying levels of skill. The latter allows a bias toward demonstrators that are successful in the domain, but also allows different parts of a task to be learned from different individuals (that is, worthwhile parts of a task can still be learned from a poorly-performing demonstrator). We assume the imitator has no initial knowledge of the observable effects of its own actions, and train a set of Hidden Markov Models to map observations to actions and create an understanding of the imitator's own abilities. We then use a combination of tracking sequences of primitives and predicting future primitives from existing combinations using forward models to learn abstract behaviours from the demonstrations of others. This approach is evaluated using a group of heterogeneous robots that have been previously used in RoboCup soccer competitions.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
来自不同生理和技能示范的异质模仿学习
模仿学习使学习者能够通过观察他人来提高自己的能力。大多数机器人模仿学习系统只从生理上(即相同的大小和运动模式)和技能水平上相同的演示者中学习。为了成功地从能力各异的异构机器人身上学习,模仿者必须能够抽象出它所观察到的行为,并用自己的行动来近似它们,这可能与演示者的行为大不相同。本文描述了一种利用全局视觉进行观察的方法,从异质演示体中进行模仿学习。它支持从生理上不同的示威者(轮式和腿,各种大小)学习,并自我适应不同水平的技能示威者。后者允许偏向于在领域中成功的演示者,但也允许从不同的个体学习任务的不同部分(也就是说,任务的有价值的部分仍然可以从表现不佳的演示者那里学习)。我们假设模仿者对其自身行为的可观察效果没有初始知识,并训练一组隐马尔可夫模型来将观察映射到动作并创建对模仿者自身能力的理解。然后,我们使用跟踪原语序列的组合,并使用正向模型从现有组合中预测未来的原语,从其他人的演示中学习抽象行为。该方法使用一组异构机器人进行评估,这些机器人之前曾在机器人世界杯足球比赛中使用过。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Comparative Analysis of DNA Microarray Data through the Use of Feature Selection Techniques Learning from Multiple Related Data Streams with Asynchronous Flowing Speeds Bayesian Inferences and Forecasting in Spatial Time Series Models A Framework for Comprehensive Electronic QA in Radiation Therapy Model-Based Co-clustering for Continuous Data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1