Authors: Ting Wu, Hui Ye, Z. Xiang, Xiaofei Yang
Published in: 2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS), May 12, 2023
DOI: 10.1109/DDCLS58216.2023.10166143
Speed and heading control of an unmanned surface vehicle using deep reinforcement learning
In this paper, a deep reinforcement learning-based speed and heading control method is proposed for an unmanned surface vehicle (USV). A deep deterministic policy gradient (DDPG) algorithm, built on an actor-critic reinforcement learning mechanism, is adopted to generate continuous control actions through interaction with the environment. Moreover, two reward functions are designed, one for speed control and one for heading control of the USV. The control policy is trained by trial and error so that the USV is guided to reach the desired speed and heading angle steadily and rapidly. Simulation results verify the feasibility and effectiveness of the proposed approach through comparisons with classical PID control and S-plane control.
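The abstract does not give the exact form of the two reward functions, but a common choice for this kind of tracking task is a shaped reward that peaks when the tracking error is zero. The sketch below is a minimal, hypothetical illustration (the function names, the Gaussian shaping, and the gain `k` are assumptions, not taken from the paper); note the heading reward must wrap the angle error into [-pi, pi] so that, e.g., headings of 359° and 1° are treated as 2° apart.

```python
import math

def speed_reward(u, u_des, k=1.0):
    """Hypothetical speed reward: maximal (1.0) when surge speed u
    matches the desired speed u_des, decaying with squared error."""
    return math.exp(-k * (u - u_des) ** 2)

def heading_reward(psi, psi_des, k=1.0):
    """Hypothetical heading reward: wraps the heading error to
    [-pi, pi] before applying the same Gaussian shaping."""
    e = (psi - psi_des + math.pi) % (2.0 * math.pi) - math.pi
    return math.exp(-k * e ** 2)

# Example: perfect tracking yields the maximum reward of 1.0,
# and a small heading error yields slightly less.
print(speed_reward(1.5, 1.5))        # exactly on the desired speed
print(heading_reward(0.1, 0.0))      # 0.1 rad heading error
```

In a DDPG training loop, rewards of this shape would be summed (possibly with weights) at each simulation step, and the actor network would be updated to maximize the critic's estimate of their discounted return.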