基于drl的自主水下航行器路径规划与避障

2023 IEEE International Conference on Mechatronics and Automation (ICMA) Pub Date : 2023-08-06 DOI:10.1109/ICMA57826.2023.10215663

Di Wu, Zhaolong Feng, Dongdong Hou, Rui Liu, Yufei Yin

{"title":"基于drl的自主水下航行器路径规划与避障","authors":"Di Wu, Zhaolong Feng, Dongdong Hou, Rui Liu, Yufei Yin","doi":"10.1109/ICMA57826.2023.10215663","DOIUrl":null,"url":null,"abstract":"Both path planning and obstacle avoidance are important for the navigation safety of autonomous underwater vehicles (AUVs) in unknown environments. In this paper, in order to adjust to the complexity and flexibility of underwater environments, path planning and obstacle avoidance algorithms based on value iterative network (VIN) and deep deterministic policy gradient (DDPG) respectively are proposed to navigate the AUV through an unknown complex area. With a simulation multi-beam sonar equipped to detect obstacles of subsea surroundings, a grid map is constructed online as inputs of VIN and DDPG. Taking advantage of generalization of deep reinforcement learning, methods studied in this paper have demonstrated validity in simulation experiments implemented in Unity3D where dynamic and static obstacles are randomly placed and experiments are conducted.","PeriodicalId":151364,"journal":{"name":"2023 IEEE International Conference on Mechatronics and Automation (ICMA)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DRL-based path planning and obstacle avoidance of autonomous underwater vehicle\",\"authors\":\"Di Wu, Zhaolong Feng, Dongdong Hou, Rui Liu, Yufei Yin\",\"doi\":\"10.1109/ICMA57826.2023.10215663\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Both path planning and obstacle avoidance are important for the navigation safety of autonomous underwater vehicles (AUVs) in unknown environments. In this paper, in order to adjust to the complexity and flexibility of underwater environments, path planning and obstacle avoidance algorithms based on value iterative network (VIN) and deep deterministic policy gradient (DDPG) respectively are proposed to navigate the AUV through an unknown complex area. With a simulation multi-beam sonar equipped to detect obstacles of subsea surroundings, a grid map is constructed online as inputs of VIN and DDPG. Taking advantage of generalization of deep reinforcement learning, methods studied in this paper have demonstrated validity in simulation experiments implemented in Unity3D where dynamic and static obstacles are randomly placed and experiments are conducted.\",\"PeriodicalId\":151364,\"journal\":{\"name\":\"2023 IEEE International Conference on Mechatronics and Automation (ICMA)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Mechatronics and Automation (ICMA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMA57826.2023.10215663\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Mechatronics and Automation (ICMA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMA57826.2023.10215663","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

路径规划和避障对于自主水下航行器在未知环境中的航行安全至关重要。为了适应水下环境的复杂性和灵活性，本文分别提出了基于值迭代网络(VIN)的路径规划算法和基于深度确定性策略梯度(DDPG)的避障算法，实现了AUV在未知复杂区域的导航。利用模拟多波束声纳来探测海底环境中的障碍物，在线构建网格图作为VIN和DDPG的输入。利用深度强化学习的泛化性，本文所研究的方法在Unity3D中进行了仿真实验，随机放置动态和静态障碍物并进行了实验，验证了其有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

DRL-based path planning and obstacle avoidance of autonomous underwater vehicle

Both path planning and obstacle avoidance are important for the navigation safety of autonomous underwater vehicles (AUVs) in unknown environments. In this paper, in order to adjust to the complexity and flexibility of underwater environments, path planning and obstacle avoidance algorithms based on value iterative network (VIN) and deep deterministic policy gradient (DDPG) respectively are proposed to navigate the AUV through an unknown complex area. With a simulation multi-beam sonar equipped to detect obstacles of subsea surroundings, a grid map is constructed online as inputs of VIN and DDPG. Taking advantage of generalization of deep reinforcement learning, methods studied in this paper have demonstrated validity in simulation experiments implemented in Unity3D where dynamic and static obstacles are randomly placed and experiments are conducted.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 IEEE International Conference on Mechatronics and Automation (ICMA)

自引率

0.00%

发文量

期刊最新文献

ICMA 2023 Conference Info A Parameter Fluctuation Impact Analysis Algorithm for The Control of Sealing Performance Stability Research on Composite Control Strategy of Off-Grid PV Inverter under Nonlinear Asymmetric Load A Low-Cost Skiing Motion Capture System Based on Monocular RGB Camera and MINS Fusion INS/GNSS/UWB/OD Robust Navigation Algorithm Based on Factor Graph