基于深度强化学习的自主水下航行器三维路径跟踪控制

IF 2 3区工程技术 Q2 ENGINEERING, MARINE Polish Maritime Research Pub Date : 2022-12-01 DOI:10.2478/pomr-2022-0042

Zhenyu Liang, Xingru Qu, Zhao Zhang, Cong Chen

{"title":"基于深度强化学习的自主水下航行器三维路径跟踪控制","authors":"Zhenyu Liang, Xingru Qu, Zhao Zhang, Cong Chen","doi":"10.2478/pomr-2022-0042","DOIUrl":null,"url":null,"abstract":"Abstract In this article, a deep reinforcement learning based three-dimensional path following control approach is proposed for an underactuated autonomous underwater vehicle (AUV). To be specific, kinematic control laws are employed by using the three-dimensional line-of-sight guidance and dynamic control laws are employed by using the twin delayed deep deterministic policy gradient algorithm (TD3), contributing to the surge velocity, pitch angle and heading angle control of an underactuated AUV. In order to solve the chattering of controllers, the action filter and the punishment function are built respectively, which can make control signals stable. Simulations are carried out to evaluate the performance of the proposed control approach. And results show that the AUV can complete the control mission successfully.","PeriodicalId":49681,"journal":{"name":"Polish Maritime Research","volume":"29 1","pages":"36 - 44"},"PeriodicalIF":2.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Three-Dimensional Path-Following Control of an Autonomous Underwater Vehicle Based on Deep Reinforcement Learning\",\"authors\":\"Zhenyu Liang, Xingru Qu, Zhao Zhang, Cong Chen\",\"doi\":\"10.2478/pomr-2022-0042\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract In this article, a deep reinforcement learning based three-dimensional path following control approach is proposed for an underactuated autonomous underwater vehicle (AUV). To be specific, kinematic control laws are employed by using the three-dimensional line-of-sight guidance and dynamic control laws are employed by using the twin delayed deep deterministic policy gradient algorithm (TD3), contributing to the surge velocity, pitch angle and heading angle control of an underactuated AUV. In order to solve the chattering of controllers, the action filter and the punishment function are built respectively, which can make control signals stable. Simulations are carried out to evaluate the performance of the proposed control approach. And results show that the AUV can complete the control mission successfully.\",\"PeriodicalId\":49681,\"journal\":{\"name\":\"Polish Maritime Research\",\"volume\":\"29 1\",\"pages\":\"36 - 44\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Polish Maritime Research\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.2478/pomr-2022-0042\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, MARINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Polish Maritime Research","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.2478/pomr-2022-0042","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MARINE","Score":null,"Total":0}

引用次数: 2

摘要

针对欠驱动自主水下航行器(AUV)，提出了一种基于深度强化学习的三维路径跟踪控制方法。其中，采用三维视线制导的运动学控制律和双延迟深度确定性策略梯度算法(TD3)的动态控制律，实现欠驱动水下航行器的浪涌速度、俯仰角和航向角控制。为了解决控制器的抖振问题，分别建立了动作滤波器和惩罚函数，使控制信号稳定。通过仿真来评估所提出的控制方法的性能。实验结果表明，该水下机器人能够成功完成控制任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Three-Dimensional Path-Following Control of an Autonomous Underwater Vehicle Based on Deep Reinforcement Learning

Abstract In this article, a deep reinforcement learning based three-dimensional path following control approach is proposed for an underactuated autonomous underwater vehicle (AUV). To be specific, kinematic control laws are employed by using the three-dimensional line-of-sight guidance and dynamic control laws are employed by using the twin delayed deep deterministic policy gradient algorithm (TD3), contributing to the surge velocity, pitch angle and heading angle control of an underactuated AUV. In order to solve the chattering of controllers, the action filter and the punishment function are built respectively, which can make control signals stable. Simulations are carried out to evaluate the performance of the proposed control approach. And results show that the AUV can complete the control mission successfully.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Polish Maritime Research 工程技术-工程：海洋

CiteScore

3.70

自引率

45.00%

发文量

审稿时长

>12 weeks

期刊介绍： The scope of the journal covers selected issues related to all phases of product lifecycle and corresponding technologies for offshore floating and fixed structures and their components. All researchers are invited to submit their original papers for peer review and publications related to methods of the design; production and manufacturing; maintenance and operational processes of such technical items as: all types of vessels and their equipment, fixed and floating offshore units and their components, autonomous underwater vehicle (AUV) and remotely operated vehicle (ROV). We welcome submissions from these fields in the following technical topics: ship hydrodynamics: buoyancy and stability; ship resistance and propulsion, etc., structural integrity of ship and offshore unit structures: materials; welding; fatigue and fracture, etc., marine equipment: ship and offshore unit power plants: overboarding equipment; etc.