基于近端策略优化的复杂海洋环境中感知受限的多usv动态路径规划

IF 5.5 2区工程技术 Q1 ENGINEERING, CIVIL Ocean Engineering Pub Date : 2025-05-15 Epub Date: 2025-03-11 DOI:10.1016/j.oceaneng.2025.120907

Xizhe Chen, Shihong Yin, Yujing Li, Zhengrong Xiang

{"title":"基于近端策略优化的复杂海洋环境中感知受限的多usv动态路径规划","authors":"Xizhe Chen, Shihong Yin, Yujing Li, Zhengrong Xiang","doi":"10.1016/j.oceaneng.2025.120907","DOIUrl":null,"url":null,"abstract":"<div><div>This paper addresses the path planning problem for unmanned surface vehicles (USVs) under distributed control in dynamic maritime environments. A novel proximal policy optimization (PPO)-based algorithm is proposed to overcome the challenges posed by limited sensing capabilities and environmental variability. By integrating the reciprocal velocity obstacle method, the algorithm significantly improves obstacle avoidance efficiency while ensuring compliance with the International Regulations for Preventing Collisions at Sea (COLREGs). To address the sparse reward problem inherent in PPO algorithms, a customized reward mechanism is designed, and a bidirectional gated recurrent unit network is introduced to process variable-length observation data caused by dynamic obstacle scenarios. Extensive simulation results demonstrate that the proposed algorithm achieves notable advantages in convergence, robustness, and real-time decision-making. Furthermore, ablation and extended experiments validate the effectiveness and generalization capability of the algorithm, confirming that the multi-USV system can achieve safe, efficient, and COLREGs-compliant path planning in highly dynamic and complex environments.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"326 ","pages":"Article 120907"},"PeriodicalIF":5.5000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dynamic path planning for multi-USV in complex ocean environments with limited perception via proximal policy optimization\",\"authors\":\"Xizhe Chen, Shihong Yin, Yujing Li, Zhengrong Xiang\",\"doi\":\"10.1016/j.oceaneng.2025.120907\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper addresses the path planning problem for unmanned surface vehicles (USVs) under distributed control in dynamic maritime environments. A novel proximal policy optimization (PPO)-based algorithm is proposed to overcome the challenges posed by limited sensing capabilities and environmental variability. By integrating the reciprocal velocity obstacle method, the algorithm significantly improves obstacle avoidance efficiency while ensuring compliance with the International Regulations for Preventing Collisions at Sea (COLREGs). To address the sparse reward problem inherent in PPO algorithms, a customized reward mechanism is designed, and a bidirectional gated recurrent unit network is introduced to process variable-length observation data caused by dynamic obstacle scenarios. Extensive simulation results demonstrate that the proposed algorithm achieves notable advantages in convergence, robustness, and real-time decision-making. Furthermore, ablation and extended experiments validate the effectiveness and generalization capability of the algorithm, confirming that the multi-USV system can achieve safe, efficient, and COLREGs-compliant path planning in highly dynamic and complex environments.</div></div>\",\"PeriodicalId\":19403,\"journal\":{\"name\":\"Ocean Engineering\",\"volume\":\"326 \",\"pages\":\"Article 120907\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ocean Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029801825006201\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/11 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801825006201","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/11 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}

引用次数: 0

摘要

研究了动态海洋环境下分布式控制下的无人水面航行器路径规划问题。提出了一种新的基于近端策略优化（PPO）的算法，以克服有限的感知能力和环境可变性带来的挑战。该算法结合速度互反避障方法，在保证符合《国际海上避碰规则》（COLREGs）的前提下，显著提高了避障效率。针对PPO算法固有的奖励稀疏问题，设计了自定义奖励机制，并引入双向门控循环单元网络处理动态障碍场景下的变长观测数据。大量的仿真结果表明，该算法在收敛性、鲁棒性和实时性等方面具有显著的优势。此外，烧蚀和扩展实验验证了该算法的有效性和泛化能力，证实了多usv系统可以在高动态和复杂环境中实现安全、高效和符合colregs的路径规划。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Dynamic path planning for multi-USV in complex ocean environments with limited perception via proximal policy optimization

This paper addresses the path planning problem for unmanned surface vehicles (USVs) under distributed control in dynamic maritime environments. A novel proximal policy optimization (PPO)-based algorithm is proposed to overcome the challenges posed by limited sensing capabilities and environmental variability. By integrating the reciprocal velocity obstacle method, the algorithm significantly improves obstacle avoidance efficiency while ensuring compliance with the International Regulations for Preventing Collisions at Sea (COLREGs). To address the sparse reward problem inherent in PPO algorithms, a customized reward mechanism is designed, and a bidirectional gated recurrent unit network is introduced to process variable-length observation data caused by dynamic obstacle scenarios. Extensive simulation results demonstrate that the proposed algorithm achieves notable advantages in convergence, robustness, and real-time decision-making. Furthermore, ablation and extended experiments validate the effectiveness and generalization capability of the algorithm, confirming that the multi-USV system can achieve safe, efficient, and COLREGs-compliant path planning in highly dynamic and complex environments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Ocean Engineering 工程技术-工程：大洋

CiteScore

7.30

自引率

34.00%

发文量

2379

审稿时长

8.1 months

期刊介绍： Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.