感知范围内的同步学习与规划:一种局部路径规划方法

Lokesh Kumar;Arup Kumar Sadhu;Ranjan Dasgupta
{"title":"感知范围内的同步学习与规划:一种局部路径规划方法","authors":"Lokesh Kumar;Arup Kumar Sadhu;Ranjan Dasgupta","doi":"10.1109/TAI.2024.3438094","DOIUrl":null,"url":null,"abstract":"This article proposes an approach for local path planning. Unlike traditional approaches, the proposed local path planner simultaneously learns and plans within the sensing range (SLPA-SR) during local path planning. SLPA-SR is the synergy between the local path planner, the dynamic window approach (DWA), the obstacle avoidance by velocity obstacle (VO) approach, and the proposed next-best reward learning (NBR) algorithms. In the proposed SLPA-SR, the DWA acts as an actuator and helps to balance exploration and exploitation in the proposed NBR. In the proposed NBR, dimensions of state and action do not need to be defined a \n<italic>priori</i>\n; rather, dimensions of state and action change dynamically. The proposed SLPA-SR is simulated and experimentally validated on the TurtleBot3 Waffle Pi. The performance of the proposed SLPA-SR is tested in several typical environments, both in simulation and hardware experiments. The proposed SLPA-SR outperforms the contender algorithms (i.e., DWA, DWA-RL, improved time elastic band, predictive artificial potential field, and artificial potential field) by a significant margin in terms of run-time, linear velocity, angular velocity, success rate, average trajectory length, and average velocity. The efficacy of the proposed NBR is established by analyzing the percentage of exploitation, average reward, and state-action pair count.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6399-6411"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Simultaneous Learning and Planning Within Sensing Range: An Approach for Local Path Planning\",\"authors\":\"Lokesh Kumar;Arup Kumar Sadhu;Ranjan Dasgupta\",\"doi\":\"10.1109/TAI.2024.3438094\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article proposes an approach for local path planning. Unlike traditional approaches, the proposed local path planner simultaneously learns and plans within the sensing range (SLPA-SR) during local path planning. SLPA-SR is the synergy between the local path planner, the dynamic window approach (DWA), the obstacle avoidance by velocity obstacle (VO) approach, and the proposed next-best reward learning (NBR) algorithms. In the proposed SLPA-SR, the DWA acts as an actuator and helps to balance exploration and exploitation in the proposed NBR. In the proposed NBR, dimensions of state and action do not need to be defined a \\n<italic>priori</i>\\n; rather, dimensions of state and action change dynamically. The proposed SLPA-SR is simulated and experimentally validated on the TurtleBot3 Waffle Pi. The performance of the proposed SLPA-SR is tested in several typical environments, both in simulation and hardware experiments. The proposed SLPA-SR outperforms the contender algorithms (i.e., DWA, DWA-RL, improved time elastic band, predictive artificial potential field, and artificial potential field) by a significant margin in terms of run-time, linear velocity, angular velocity, success rate, average trajectory length, and average velocity. The efficacy of the proposed NBR is established by analyzing the percentage of exploitation, average reward, and state-action pair count.\",\"PeriodicalId\":73305,\"journal\":{\"name\":\"IEEE transactions on artificial intelligence\",\"volume\":\"5 12\",\"pages\":\"6399-6411\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on artificial intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10623198/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10623198/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种局部路径规划方法。与传统方法不同,本文提出的局部路径规划器在局部路径规划过程中同时学习和规划感知范围内的路径(SLPA-SR)。SLPA-SR是局部路径规划、动态窗口法(DWA)、速度障碍避障法(VO)和次优奖励学习(NBR)算法之间的协同。在拟议的SLPA-SR中,DWA充当执行器,帮助平衡拟议NBR中的勘探和开发。在拟议的NBR中,不需要先验地定义状态和行为的维度;相反,状态和动作的维度是动态变化的。所提出的SLPA-SR在TurtleBot3华夫派上进行了仿真和实验验证。在仿真和硬件实验两种典型环境中对所提出的SLPA-SR的性能进行了测试。在运行时间、线速度、角速度、成功率、平均轨迹长度和平均速度方面,SLPA-SR算法明显优于现有的竞争算法(即DWA、DWA- rl、改进时间弹性带、预测人工势场和人工势场)。所提出的NBR的有效性是通过分析利用的百分比、平均奖励和状态-行动对计数来确定的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Simultaneous Learning and Planning Within Sensing Range: An Approach for Local Path Planning
This article proposes an approach for local path planning. Unlike traditional approaches, the proposed local path planner simultaneously learns and plans within the sensing range (SLPA-SR) during local path planning. SLPA-SR is the synergy between the local path planner, the dynamic window approach (DWA), the obstacle avoidance by velocity obstacle (VO) approach, and the proposed next-best reward learning (NBR) algorithms. In the proposed SLPA-SR, the DWA acts as an actuator and helps to balance exploration and exploitation in the proposed NBR. In the proposed NBR, dimensions of state and action do not need to be defined a priori ; rather, dimensions of state and action change dynamically. The proposed SLPA-SR is simulated and experimentally validated on the TurtleBot3 Waffle Pi. The performance of the proposed SLPA-SR is tested in several typical environments, both in simulation and hardware experiments. The proposed SLPA-SR outperforms the contender algorithms (i.e., DWA, DWA-RL, improved time elastic band, predictive artificial potential field, and artificial potential field) by a significant margin in terms of run-time, linear velocity, angular velocity, success rate, average trajectory length, and average velocity. The efficacy of the proposed NBR is established by analyzing the percentage of exploitation, average reward, and state-action pair count.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.70
自引率
0.00%
发文量
0
期刊最新文献
Front Cover Table of Contents IEEE Transactions on Artificial Intelligence Publication Information Table of Contents Front Cover
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1