Robotic Arm Motion Planning Based on Curriculum Reinforcement Learning

Dongxu Zhou, Ruiqing Jia, Haifeng Yao
{"title":"Robotic Arm Motion Planning Based on Curriculum Reinforcement Learning","authors":"Dongxu Zhou, Ruiqing Jia, Haifeng Yao","doi":"10.1109/ICCRE51898.2021.9435700","DOIUrl":null,"url":null,"abstract":"With the rapid changes in application scenarios, the robotic arm’s motion planning function is playing an increasingly important role. The traditional demonstration motion planning method of the robotic arm cannot be carried out quickly. The use of reinforcement learning algorithms to solve motion planning problems is a new research trend that has emerged in recent years. However, reinforcement learning algorithms are difficult to converge quickly in some complex tasks. This leads to inefficient and difficult training problems in actual training. This paper proposes a robotic arm motion planning method based on curriculum reinforcement learning. This method adopts the concept of obstacle effective sphere to simplify obstacles in the environment. According to the reinforcement learning agent’s real-time motion planning ability, the size of the effective sphere radius of the obstacle is adaptively adjusted so that the agent can train in an environment that matches its ability. The agent can first be trained in a simple environment and then gradually transition to a complete obstacle environment. The experiment in a virtual environment shows that this method can successfully perform motion planning. Comparing this method with the training effect of using only the PPO algorithm shows that this algorithm can effectively improve the efficiency of reinforcement learning training and reduce algorithm convergence difficulty.","PeriodicalId":382619,"journal":{"name":"2021 6th International Conference on Control and Robotics Engineering (ICCRE)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 6th International Conference on Control and Robotics Engineering (ICCRE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCRE51898.2021.9435700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

With the rapid changes in application scenarios, the robotic arm’s motion planning function is playing an increasingly important role. The traditional demonstration motion planning method of the robotic arm cannot be carried out quickly. The use of reinforcement learning algorithms to solve motion planning problems is a new research trend that has emerged in recent years. However, reinforcement learning algorithms are difficult to converge quickly in some complex tasks. This leads to inefficient and difficult training problems in actual training. This paper proposes a robotic arm motion planning method based on curriculum reinforcement learning. This method adopts the concept of obstacle effective sphere to simplify obstacles in the environment. According to the reinforcement learning agent’s real-time motion planning ability, the size of the effective sphere radius of the obstacle is adaptively adjusted so that the agent can train in an environment that matches its ability. The agent can first be trained in a simple environment and then gradually transition to a complete obstacle environment. The experiment in a virtual environment shows that this method can successfully perform motion planning. Comparing this method with the training effect of using only the PPO algorithm shows that this algorithm can effectively improve the efficiency of reinforcement learning training and reduce algorithm convergence difficulty.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于课程强化学习的机械臂运动规划
随着应用场景的快速变化,机械臂的运动规划功能发挥着越来越重要的作用。传统的机械臂演示运动规划方法无法快速实现。利用强化学习算法解决运动规划问题是近年来出现的一个新的研究趋势。然而,在一些复杂的任务中,强化学习算法难以快速收敛。这就导致了在实际训练中出现训练效率低、训练难度大的问题。提出了一种基于课程强化学习的机械臂运动规划方法。该方法采用障碍物有效球的概念,简化了环境中的障碍物。根据强化学习智能体的实时运动规划能力,自适应调整障碍物有效球半径的大小,使智能体在与其能力相匹配的环境中进行训练。智能体可以先在简单的环境中进行训练,然后逐渐过渡到完整的障碍环境。在虚拟环境中的实验表明,该方法可以成功地进行运动规划。将该方法与仅使用PPO算法的训练效果进行比较,表明该算法能有效提高强化学习训练效率,降低算法收敛难度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Multiagent Motion Planning Based on Deep Reinforcement Learning in Complex Environments Design of an Unconventional Bionic Quadruped Robot with Low-degree-freedom of Movement Action Recognition Method for Multi-joint Industrial Robots Based on End-arm Vibration and BP Neural Network Multi-Mission Planning of Service Robot Based on L-HMM Obstacle Avoidable G2-continuous Trajectory Generated with Clothoid Spline Solution
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1