基于近端策略优化的复杂海洋环境中感知受限的多usv动态路径规划

IF 5.5 2区 工程技术 Q1 ENGINEERING, CIVIL Ocean Engineering Pub Date : 2025-05-15 Epub Date: 2025-03-11 DOI:10.1016/j.oceaneng.2025.120907
Xizhe Chen, Shihong Yin, Yujing Li, Zhengrong Xiang
{"title":"基于近端策略优化的复杂海洋环境中感知受限的多usv动态路径规划","authors":"Xizhe Chen,&nbsp;Shihong Yin,&nbsp;Yujing Li,&nbsp;Zhengrong Xiang","doi":"10.1016/j.oceaneng.2025.120907","DOIUrl":null,"url":null,"abstract":"<div><div>This paper addresses the path planning problem for unmanned surface vehicles (USVs) under distributed control in dynamic maritime environments. A novel proximal policy optimization (PPO)-based algorithm is proposed to overcome the challenges posed by limited sensing capabilities and environmental variability. By integrating the reciprocal velocity obstacle method, the algorithm significantly improves obstacle avoidance efficiency while ensuring compliance with the International Regulations for Preventing Collisions at Sea (COLREGs). To address the sparse reward problem inherent in PPO algorithms, a customized reward mechanism is designed, and a bidirectional gated recurrent unit network is introduced to process variable-length observation data caused by dynamic obstacle scenarios. Extensive simulation results demonstrate that the proposed algorithm achieves notable advantages in convergence, robustness, and real-time decision-making. Furthermore, ablation and extended experiments validate the effectiveness and generalization capability of the algorithm, confirming that the multi-USV system can achieve safe, efficient, and COLREGs-compliant path planning in highly dynamic and complex environments.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"326 ","pages":"Article 120907"},"PeriodicalIF":5.5000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dynamic path planning for multi-USV in complex ocean environments with limited perception via proximal policy optimization\",\"authors\":\"Xizhe Chen,&nbsp;Shihong Yin,&nbsp;Yujing Li,&nbsp;Zhengrong Xiang\",\"doi\":\"10.1016/j.oceaneng.2025.120907\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper addresses the path planning problem for unmanned surface vehicles (USVs) under distributed control in dynamic maritime environments. A novel proximal policy optimization (PPO)-based algorithm is proposed to overcome the challenges posed by limited sensing capabilities and environmental variability. By integrating the reciprocal velocity obstacle method, the algorithm significantly improves obstacle avoidance efficiency while ensuring compliance with the International Regulations for Preventing Collisions at Sea (COLREGs). To address the sparse reward problem inherent in PPO algorithms, a customized reward mechanism is designed, and a bidirectional gated recurrent unit network is introduced to process variable-length observation data caused by dynamic obstacle scenarios. Extensive simulation results demonstrate that the proposed algorithm achieves notable advantages in convergence, robustness, and real-time decision-making. Furthermore, ablation and extended experiments validate the effectiveness and generalization capability of the algorithm, confirming that the multi-USV system can achieve safe, efficient, and COLREGs-compliant path planning in highly dynamic and complex environments.</div></div>\",\"PeriodicalId\":19403,\"journal\":{\"name\":\"Ocean Engineering\",\"volume\":\"326 \",\"pages\":\"Article 120907\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ocean Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029801825006201\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/11 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801825006201","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/11 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

摘要

研究了动态海洋环境下分布式控制下的无人水面航行器路径规划问题。提出了一种新的基于近端策略优化(PPO)的算法,以克服有限的感知能力和环境可变性带来的挑战。该算法结合速度互反避障方法,在保证符合《国际海上避碰规则》(COLREGs)的前提下,显著提高了避障效率。针对PPO算法固有的奖励稀疏问题,设计了自定义奖励机制,并引入双向门控循环单元网络处理动态障碍场景下的变长观测数据。大量的仿真结果表明,该算法在收敛性、鲁棒性和实时性等方面具有显著的优势。此外,烧蚀和扩展实验验证了该算法的有效性和泛化能力,证实了多usv系统可以在高动态和复杂环境中实现安全、高效和符合colregs的路径规划。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Dynamic path planning for multi-USV in complex ocean environments with limited perception via proximal policy optimization
This paper addresses the path planning problem for unmanned surface vehicles (USVs) under distributed control in dynamic maritime environments. A novel proximal policy optimization (PPO)-based algorithm is proposed to overcome the challenges posed by limited sensing capabilities and environmental variability. By integrating the reciprocal velocity obstacle method, the algorithm significantly improves obstacle avoidance efficiency while ensuring compliance with the International Regulations for Preventing Collisions at Sea (COLREGs). To address the sparse reward problem inherent in PPO algorithms, a customized reward mechanism is designed, and a bidirectional gated recurrent unit network is introduced to process variable-length observation data caused by dynamic obstacle scenarios. Extensive simulation results demonstrate that the proposed algorithm achieves notable advantages in convergence, robustness, and real-time decision-making. Furthermore, ablation and extended experiments validate the effectiveness and generalization capability of the algorithm, confirming that the multi-USV system can achieve safe, efficient, and COLREGs-compliant path planning in highly dynamic and complex environments.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Ocean Engineering
Ocean Engineering 工程技术-工程:大洋
CiteScore
7.30
自引率
34.00%
发文量
2379
审稿时长
8.1 months
期刊介绍: Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.
期刊最新文献
Incorporation of movement pattern analysis into route searching for ship ETA prediction Assessing wind-wave forces on smooth and rough horizontal cylinders by neural networks SPH-based numerical investigation into hydrodynamic performance and solitary-wave-based mooring screening of FPV array shielded by breakwaters Influence of earthquake duration on the non-linear slosh dynamics of chamfered bottom liquid tanks using Mixed-Eulerian Lagrangian approach Buckling of the constrained subsea FGP-GPLs liner under pressure loading and heating field based on multi-source data fusion scheme
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1