基于离散-连续混合动作空间TD3的塔机实时平滑提升路径规划

Zhiyuan Yin, Kai Wang, Xin Ma
{"title":"基于离散-连续混合动作空间TD3的塔机实时平滑提升路径规划","authors":"Zhiyuan Yin, Kai Wang, Xin Ma","doi":"10.1145/3547578.3547592","DOIUrl":null,"url":null,"abstract":"Smoothness and rapidity are two important performance indexes for crane lifting path planning. Traditionally, cranes are modeled as multi-freedom robots and use robot path planning algorithms to plan path in continuous space. However, the paths planned by these methods are not smooth enough to operate. In addition, presently, most of proposed lifting path planning algorithms focus on static environments, requiring accurate environmental information to build maps, which is too computation expensive to meet the requirement of real-time path planning. In this paper, we propose a deep reinforcement learning-based lifting path planning algorithm for hybrid action spaces. The network structure is developed based on TD3. A new reward function is designed and hindsight experience replay is used to solve the reward sparsity problem in long distance path planning. The planning path is smooth and can achieve real-time path planning in unknown environments. The result of simulation experiments demonstrates the effectiveness of the proposed approach.","PeriodicalId":381600,"journal":{"name":"Proceedings of the 14th International Conference on Computer Modeling and Simulation","volume":"95 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Real-time Smooth Lifting Path Planning for Tower Crane Based on TD3 with Discrete-Continuous Hybrid Action Space\",\"authors\":\"Zhiyuan Yin, Kai Wang, Xin Ma\",\"doi\":\"10.1145/3547578.3547592\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Smoothness and rapidity are two important performance indexes for crane lifting path planning. Traditionally, cranes are modeled as multi-freedom robots and use robot path planning algorithms to plan path in continuous space. However, the paths planned by these methods are not smooth enough to operate. In addition, presently, most of proposed lifting path planning algorithms focus on static environments, requiring accurate environmental information to build maps, which is too computation expensive to meet the requirement of real-time path planning. In this paper, we propose a deep reinforcement learning-based lifting path planning algorithm for hybrid action spaces. The network structure is developed based on TD3. A new reward function is designed and hindsight experience replay is used to solve the reward sparsity problem in long distance path planning. The planning path is smooth and can achieve real-time path planning in unknown environments. The result of simulation experiments demonstrates the effectiveness of the proposed approach.\",\"PeriodicalId\":381600,\"journal\":{\"name\":\"Proceedings of the 14th International Conference on Computer Modeling and Simulation\",\"volume\":\"95 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 14th International Conference on Computer Modeling and Simulation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3547578.3547592\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 14th International Conference on Computer Modeling and Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3547578.3547592","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

平稳性和快速性是起重机起重路径规划的两个重要性能指标。传统上,起重机建模为多自由度机器人,使用机器人路径规划算法在连续空间中规划路径。然而,这些方法规划的路径不够平滑,无法运行。此外,目前提出的升降路径规划算法大多集中在静态环境,需要精确的环境信息来构建地图,计算量大,无法满足实时路径规划的要求。本文提出了一种基于深度强化学习的混合动作空间升降路径规划算法。基于TD3开发了网络结构。设计了一种新的奖励函数,并利用后见之明的经验重放来解决长距离路径规划中的奖励稀疏性问题。规划路径平滑,可实现未知环境下的实时路径规划。仿真实验结果证明了该方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Real-time Smooth Lifting Path Planning for Tower Crane Based on TD3 with Discrete-Continuous Hybrid Action Space
Smoothness and rapidity are two important performance indexes for crane lifting path planning. Traditionally, cranes are modeled as multi-freedom robots and use robot path planning algorithms to plan path in continuous space. However, the paths planned by these methods are not smooth enough to operate. In addition, presently, most of proposed lifting path planning algorithms focus on static environments, requiring accurate environmental information to build maps, which is too computation expensive to meet the requirement of real-time path planning. In this paper, we propose a deep reinforcement learning-based lifting path planning algorithm for hybrid action spaces. The network structure is developed based on TD3. A new reward function is designed and hindsight experience replay is used to solve the reward sparsity problem in long distance path planning. The planning path is smooth and can achieve real-time path planning in unknown environments. The result of simulation experiments demonstrates the effectiveness of the proposed approach.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Multi-Agent-based Simulation Model for Operations in the Automated Container Terminal with a U-Shaped Layout Blood Damage Analysis of ECMO Centrifugal Pump Based on CFD Using Artificial Intelligence Expert Service System to Solve Power Resource Scheduling Problem A Model Based on Survival-based Credit Risk Assessment System of SMEs Simulation and Test of Nozzle Optimization of Residual Film Barrier Device based on Fluent
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1