Learning through Imitation and Reinforcement Learning: Toward the Acquisition of Painting Motions

Tatsuya Sakato, Motoyuki Ozeki, N. Oka
{"title":"Learning through Imitation and Reinforcement Learning: Toward the Acquisition of Painting Motions","authors":"Tatsuya Sakato, Motoyuki Ozeki, N. Oka","doi":"10.1109/IIAI-AAI.2014.174","DOIUrl":null,"url":null,"abstract":"Learning is essential for an autonomous agent to adapt to an environment. One method of learning is through trial and error, however, this method is impractical in a complex environment because of the long learning time required by the agent. Therefore, guidelines are necessary in order to expedite the learning process in such environments, and imitation is one such guideline. Sakato, Ozeki, and Oka (2012-2013) recently proposed a computational model of imitation and autonomous behavior by which an agent can reduce its learning time through imitation. They evaluate the model in discrete and continuous spaces, and apply the model to a real robot in order to acquire painting skills. Their experimental results indicate that the model adapted to the experimental environment by imitation. In this paper, we introduce the model and discuss what are needed to improve the model.","PeriodicalId":432222,"journal":{"name":"2014 IIAI 3rd International Conference on Advanced Applied Informatics","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IIAI 3rd International Conference on Advanced Applied Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIAI-AAI.2014.174","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Learning is essential for an autonomous agent to adapt to an environment. One method of learning is through trial and error, however, this method is impractical in a complex environment because of the long learning time required by the agent. Therefore, guidelines are necessary in order to expedite the learning process in such environments, and imitation is one such guideline. Sakato, Ozeki, and Oka (2012-2013) recently proposed a computational model of imitation and autonomous behavior by which an agent can reduce its learning time through imitation. They evaluate the model in discrete and continuous spaces, and apply the model to a real robot in order to acquire painting skills. Their experimental results indicate that the model adapted to the experimental environment by imitation. In this paper, we introduce the model and discuss what are needed to improve the model.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
模仿学习与强化学习:关于绘画动作的习得
学习对于自主代理适应环境至关重要。一种学习方法是通过试错,然而,这种方法在复杂的环境中是不切实际的,因为智能体需要很长的学习时间。因此,为了在这样的环境中加快学习过程,指导方针是必要的,模仿就是这样一个指导方针。Sakato, Ozeki和Oka(2012-2013)最近提出了一个模仿和自主行为的计算模型,通过该模型,智能体可以通过模仿来减少学习时间。他们在离散和连续空间中评估模型,并将模型应用于真实的机器人,以获得绘画技能。实验结果表明,该模型对实验环境具有良好的模仿适应性。在本文中,我们介绍了该模型,并讨论了该模型需要改进的地方。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Impacts of Firm's Mimetic Isomorphic Behaviors on Customer Satisfaction from the Perspectives of Expectation Theory and Self-Determination Theory: An Approach of Hierarchical Linear Modeling Data Mining for Lifestyle Risk Factors Associated with Overweight and Obesity among Adolescents The Two-Stage Analog Neural Network Model and Hardware Implementation Experience Formalized as a Service for Geographical and Temporal Remote Collaboration A Supporting System for Finding Lost Objects for Dementia Patient and Caregiver by Image Recognition
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1