Di Wu, Zhaolong Feng, Dongdong Hou, Rui Liu, Yufei Yin
{"title":"基于drl的自主水下航行器路径规划与避障","authors":"Di Wu, Zhaolong Feng, Dongdong Hou, Rui Liu, Yufei Yin","doi":"10.1109/ICMA57826.2023.10215663","DOIUrl":null,"url":null,"abstract":"Both path planning and obstacle avoidance are important for the navigation safety of autonomous underwater vehicles (AUVs) in unknown environments. In this paper, in order to adjust to the complexity and flexibility of underwater environments, path planning and obstacle avoidance algorithms based on value iterative network (VIN) and deep deterministic policy gradient (DDPG) respectively are proposed to navigate the AUV through an unknown complex area. With a simulation multi-beam sonar equipped to detect obstacles of subsea surroundings, a grid map is constructed online as inputs of VIN and DDPG. Taking advantage of generalization of deep reinforcement learning, methods studied in this paper have demonstrated validity in simulation experiments implemented in Unity3D where dynamic and static obstacles are randomly placed and experiments are conducted.","PeriodicalId":151364,"journal":{"name":"2023 IEEE International Conference on Mechatronics and Automation (ICMA)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DRL-based path planning and obstacle avoidance of autonomous underwater vehicle\",\"authors\":\"Di Wu, Zhaolong Feng, Dongdong Hou, Rui Liu, Yufei Yin\",\"doi\":\"10.1109/ICMA57826.2023.10215663\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Both path planning and obstacle avoidance are important for the navigation safety of autonomous underwater vehicles (AUVs) in unknown environments. In this paper, in order to adjust to the complexity and flexibility of underwater environments, path planning and obstacle avoidance algorithms based on value iterative network (VIN) and deep deterministic policy gradient (DDPG) respectively are proposed to navigate the AUV through an unknown complex area. With a simulation multi-beam sonar equipped to detect obstacles of subsea surroundings, a grid map is constructed online as inputs of VIN and DDPG. Taking advantage of generalization of deep reinforcement learning, methods studied in this paper have demonstrated validity in simulation experiments implemented in Unity3D where dynamic and static obstacles are randomly placed and experiments are conducted.\",\"PeriodicalId\":151364,\"journal\":{\"name\":\"2023 IEEE International Conference on Mechatronics and Automation (ICMA)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Mechatronics and Automation (ICMA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMA57826.2023.10215663\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Mechatronics and Automation (ICMA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMA57826.2023.10215663","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
DRL-based path planning and obstacle avoidance of autonomous underwater vehicle
Both path planning and obstacle avoidance are important for the navigation safety of autonomous underwater vehicles (AUVs) in unknown environments. In this paper, in order to adjust to the complexity and flexibility of underwater environments, path planning and obstacle avoidance algorithms based on value iterative network (VIN) and deep deterministic policy gradient (DDPG) respectively are proposed to navigate the AUV through an unknown complex area. With a simulation multi-beam sonar equipped to detect obstacles of subsea surroundings, a grid map is constructed online as inputs of VIN and DDPG. Taking advantage of generalization of deep reinforcement learning, methods studied in this paper have demonstrated validity in simulation experiments implemented in Unity3D where dynamic and static obstacles are randomly placed and experiments are conducted.