基于模式切换的最优机器人行为在线学习

2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings Pub Date : 2012-11-01 DOI:10.1109/ROSE.2012.6402625

T. Sekiguchi, Yuichi Kobayashi, A. Shimizu, T. Kaneko

{"title":"基于模式切换的最优机器人行为在线学习","authors":"T. Sekiguchi, Yuichi Kobayashi, A. Shimizu, T. Kaneko","doi":"10.1109/ROSE.2012.6402625","DOIUrl":null,"url":null,"abstract":"This paper presents an optimal robot motion learning method that involves object manipulation where dynamics of robots and environment are unknown. The dynamics of the environment is acquired by the robot's experience through online learning. A reinforcement learning framework which incorporates model identification is proposed. Based on the learning framework, an idea of effective motion acquisition is proposed through decomposing the task by detecting `switching of dynamics', which is called mode-switching. Object manipulation is divided into two modes, approaching to the object and pushing it toward the goal. This enables the robot to learn motions while reducing number of trials and to behave more dexterously by integrating modes, each of which was learned separately. The proposed learning method is evaluated in simulation of a wheeled robot. It was shown that appropriate motion for re-approaching and re-pushing to accurately move the object to the goal can be realized using the proposed idea of planning with mode switching.","PeriodicalId":306272,"journal":{"name":"2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Online learning of optimal robot behavior for object manipulation using mode switching\",\"authors\":\"T. Sekiguchi, Yuichi Kobayashi, A. Shimizu, T. Kaneko\",\"doi\":\"10.1109/ROSE.2012.6402625\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an optimal robot motion learning method that involves object manipulation where dynamics of robots and environment are unknown. The dynamics of the environment is acquired by the robot's experience through online learning. A reinforcement learning framework which incorporates model identification is proposed. Based on the learning framework, an idea of effective motion acquisition is proposed through decomposing the task by detecting `switching of dynamics', which is called mode-switching. Object manipulation is divided into two modes, approaching to the object and pushing it toward the goal. This enables the robot to learn motions while reducing number of trials and to behave more dexterously by integrating modes, each of which was learned separately. The proposed learning method is evaluated in simulation of a wheeled robot. It was shown that appropriate motion for re-approaching and re-pushing to accurately move the object to the goal can be realized using the proposed idea of planning with mode switching.\",\"PeriodicalId\":306272,\"journal\":{\"name\":\"2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ROSE.2012.6402625\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROSE.2012.6402625","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

本文提出了一种涉及未知机器人动力学和环境的物体操作的最优机器人运动学习方法。环境的动态是由机器人通过在线学习的经验获得的。提出了一种结合模型识别的强化学习框架。在学习框架的基础上，提出了一种通过检测“动态切换”来分解任务的有效运动获取思想，即模式切换。对象操作分为接近对象和推动对象向目标移动两种模式。这使得机器人能够在减少试验次数的同时学习动作，并通过整合模式(每个模式都是单独学习的)来更灵活地行动。在轮式机器人的仿真中对所提出的学习方法进行了验证。结果表明，采用模式切换规划思想可以实现适当的再接近和再推运动，使目标物体准确移动到目标位置。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Online learning of optimal robot behavior for object manipulation using mode switching

This paper presents an optimal robot motion learning method that involves object manipulation where dynamics of robots and environment are unknown. The dynamics of the environment is acquired by the robot's experience through online learning. A reinforcement learning framework which incorporates model identification is proposed. Based on the learning framework, an idea of effective motion acquisition is proposed through decomposing the task by detecting `switching of dynamics', which is called mode-switching. Object manipulation is divided into two modes, approaching to the object and pushing it toward the goal. This enables the robot to learn motions while reducing number of trials and to behave more dexterously by integrating modes, each of which was learned separately. The proposed learning method is evaluated in simulation of a wheeled robot. It was shown that appropriate motion for re-approaching and re-pushing to accurately move the object to the goal can be realized using the proposed idea of planning with mode switching.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings

自引率

0.00%

发文量