基于模式切换的最优机器人行为在线学习

T. Sekiguchi, Yuichi Kobayashi, A. Shimizu, T. Kaneko
{"title":"基于模式切换的最优机器人行为在线学习","authors":"T. Sekiguchi, Yuichi Kobayashi, A. Shimizu, T. Kaneko","doi":"10.1109/ROSE.2012.6402625","DOIUrl":null,"url":null,"abstract":"This paper presents an optimal robot motion learning method that involves object manipulation where dynamics of robots and environment are unknown. The dynamics of the environment is acquired by the robot's experience through online learning. A reinforcement learning framework which incorporates model identification is proposed. Based on the learning framework, an idea of effective motion acquisition is proposed through decomposing the task by detecting `switching of dynamics', which is called mode-switching. Object manipulation is divided into two modes, approaching to the object and pushing it toward the goal. This enables the robot to learn motions while reducing number of trials and to behave more dexterously by integrating modes, each of which was learned separately. The proposed learning method is evaluated in simulation of a wheeled robot. It was shown that appropriate motion for re-approaching and re-pushing to accurately move the object to the goal can be realized using the proposed idea of planning with mode switching.","PeriodicalId":306272,"journal":{"name":"2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Online learning of optimal robot behavior for object manipulation using mode switching\",\"authors\":\"T. Sekiguchi, Yuichi Kobayashi, A. Shimizu, T. Kaneko\",\"doi\":\"10.1109/ROSE.2012.6402625\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an optimal robot motion learning method that involves object manipulation where dynamics of robots and environment are unknown. The dynamics of the environment is acquired by the robot's experience through online learning. A reinforcement learning framework which incorporates model identification is proposed. Based on the learning framework, an idea of effective motion acquisition is proposed through decomposing the task by detecting `switching of dynamics', which is called mode-switching. Object manipulation is divided into two modes, approaching to the object and pushing it toward the goal. This enables the robot to learn motions while reducing number of trials and to behave more dexterously by integrating modes, each of which was learned separately. The proposed learning method is evaluated in simulation of a wheeled robot. It was shown that appropriate motion for re-approaching and re-pushing to accurately move the object to the goal can be realized using the proposed idea of planning with mode switching.\",\"PeriodicalId\":306272,\"journal\":{\"name\":\"2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ROSE.2012.6402625\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROSE.2012.6402625","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

本文提出了一种涉及未知机器人动力学和环境的物体操作的最优机器人运动学习方法。环境的动态是由机器人通过在线学习的经验获得的。提出了一种结合模型识别的强化学习框架。在学习框架的基础上,提出了一种通过检测“动态切换”来分解任务的有效运动获取思想,即模式切换。对象操作分为接近对象和推动对象向目标移动两种模式。这使得机器人能够在减少试验次数的同时学习动作,并通过整合模式(每个模式都是单独学习的)来更灵活地行动。在轮式机器人的仿真中对所提出的学习方法进行了验证。结果表明,采用模式切换规划思想可以实现适当的再接近和再推运动,使目标物体准确移动到目标位置。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Online learning of optimal robot behavior for object manipulation using mode switching
This paper presents an optimal robot motion learning method that involves object manipulation where dynamics of robots and environment are unknown. The dynamics of the environment is acquired by the robot's experience through online learning. A reinforcement learning framework which incorporates model identification is proposed. Based on the learning framework, an idea of effective motion acquisition is proposed through decomposing the task by detecting `switching of dynamics', which is called mode-switching. Object manipulation is divided into two modes, approaching to the object and pushing it toward the goal. This enables the robot to learn motions while reducing number of trials and to behave more dexterously by integrating modes, each of which was learned separately. The proposed learning method is evaluated in simulation of a wheeled robot. It was shown that appropriate motion for re-approaching and re-pushing to accurately move the object to the goal can be realized using the proposed idea of planning with mode switching.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Improving the efficiency of highly predictable wireless sensor platforms with hybrid scheduling Experimental characterization of two generations of Kinect's depth sensors Online learning of optimal robot behavior for object manipulation using mode switching Evolving sensor environments with visual attention: An experimental exploration Estimating optimal regions for improvement in range acquisition from a single point of view
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1