Guided deep reinforcement learning framework using automated curriculum scheme for accurate motion planning

IF 7.5 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Engineering Applications of Artificial Intelligence Pub Date : 2024-11-11 DOI:10.1016/j.engappai.2024.109541

Deun-Sol Cho , Jae-Min Cho , Won-Tae Kim

{"title":"Guided deep reinforcement learning framework using automated curriculum scheme for accurate motion planning","authors":"Deun-Sol Cho , Jae-Min Cho , Won-Tae Kim","doi":"10.1016/j.engappai.2024.109541","DOIUrl":null,"url":null,"abstract":"<div><div>Collaborative robotic arms in smart factories should ensure the safety and interactivity during their operation such as reaching and grasping objects. Especially, the advanced motion planner including the path planning and the motion control functions is essential for human-machine co-working. Since the traditional physics-based motion planning approaches require extreme computational resources to obtain near-optimal solutions, deep reinforcement learning algorithms have been actively adopted and have effectively solved the limitation. They, however, have the easy task preference problem, primarily taking the simpler ways for the more rewards, due to randomly training the agents how to reach the target points in the large-scale search spaces. Therefore, we propose a novel curriculum-based deep reinforcement learning framework that makes the agents learn the motion planning tasks in unbiased ways from the ones with the low complexities to the others with the high complexities. It uses the unsupervised learning algorithms to cluster the target points with the similar task complexities for generating the effective curriculum. In addition, the review and buffer flushing mechanisms are integrated into the framework to mitigate the catastrophic forgetting problem where the agent abruptly lose the previous learned knowledge upon learning new one in the curriculum. The evaluation results of the proposed framework show that the curriculum significantly enhances the success rate on the task with the highest complexity from 12% to 56% and the mechanisms improve the success rate on the tasks with the easier complexities from an average of 66% to 76.5%, despite requiring less training time.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109541"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624016993","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Collaborative robotic arms in smart factories should ensure the safety and interactivity during their operation such as reaching and grasping objects. Especially, the advanced motion planner including the path planning and the motion control functions is essential for human-machine co-working. Since the traditional physics-based motion planning approaches require extreme computational resources to obtain near-optimal solutions, deep reinforcement learning algorithms have been actively adopted and have effectively solved the limitation. They, however, have the easy task preference problem, primarily taking the simpler ways for the more rewards, due to randomly training the agents how to reach the target points in the large-scale search spaces. Therefore, we propose a novel curriculum-based deep reinforcement learning framework that makes the agents learn the motion planning tasks in unbiased ways from the ones with the low complexities to the others with the high complexities. It uses the unsupervised learning algorithms to cluster the target points with the similar task complexities for generating the effective curriculum. In addition, the review and buffer flushing mechanisms are integrated into the framework to mitigate the catastrophic forgetting problem where the agent abruptly lose the previous learned knowledge upon learning new one in the curriculum. The evaluation results of the proposed framework show that the curriculum significantly enhances the success rate on the task with the highest complexity from 12% to 56% and the mechanisms improve the success rate on the tasks with the easier complexities from an average of 66% to 76.5%, despite requiring less training time.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用自动课程计划指导深度强化学习框架，实现精确的运动规划

智能工厂中的协作机械臂在伸手抓取物体等操作过程中应确保安全性和互动性。特别是，包括路径规划和运动控制功能在内的高级运动规划器对于人机协同工作至关重要。由于传统的基于物理的运动规划方法需要极大的计算资源才能获得接近最优的解决方案，深度强化学习算法已被积极采用，并有效地解决了这一限制。然而，由于要随机训练代理如何在大规模搜索空间中到达目标点，它们存在任务偏好简单的问题，主要是为了获得更多奖励而采取更简单的方法。因此，我们提出了一种新颖的基于课程的深度强化学习框架，它能让代理以无偏见的方式学习运动规划任务，从复杂度低的任务到复杂度高的任务。它使用无监督学习算法对任务复杂度相似的目标点进行聚类，从而生成有效的课程。此外，该框架还集成了复习和缓冲区冲洗机制，以缓解灾难性遗忘问题，即代理在学习课程中的新知识时突然丢失之前学习的知识。对所提框架的评估结果表明，尽管所需的训练时间较少，但课程能显著提高复杂度最高任务的成功率，从 12% 提高到 56%，而机制能提高复杂度较低任务的成功率，平均从 66% 提高到 76.5%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.