Adaptive collision-free control for UAVs with discrete-time system based on reinforcement learning

Advances in Mechanical Engineering Pub Date : 2024-01-01 DOI:10.1177/16878132231225321

Xiaoyu Huo, Yanan Guo

引用次数: 0

Abstract

A flexible reinforcement learning (RL) optimal collision-avoidance control formulation for unmanned aerial vehicles (UAVs) with discrete-time frameworks is revealed in this work. By utilizing the neural network (NN) estimating capacity and the actor-critic control scheme of the RL technique, an adaptive RL optimal collision-free controller with a minimal learning parameter (MLP) is formulated, which is based on a novel strategic utility function. The optimal collision-avoidance control issue, which couldn’t be addressed in the prior literature, can be resolved by the suggested approaches. Furthermore, the proposed MPL adaptive optimal control formulation allows for a reduction in the quantity of adaptive laws, leading to reduced computational complexity. Additionally, a rigorous stability analysis is provided, demonstrating that the uniform ultimate boundedness (UUB) of all signals in the closed-loop system is ensured by the proposed adaptive RL. Finally, the simulation outcomes illustrate the effectiveness of the proposed optimal RL control approaches.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于强化学习的无人飞行器离散时间系统自适应无碰撞控制

本研究揭示了离散时间框架下无人驾驶飞行器（UAV）灵活的强化学习（RL）最优防碰撞控制方案。通过利用 RL 技术的神经网络（NN）估计能力和行为批判控制方案，制定了一种具有最小学习参数（MLP）的自适应 RL 最佳防碰撞控制器，该控制器基于一种新颖的战略效用函数。该方法解决了以往文献中无法解决的最优防碰撞控制问题。此外，所提出的 MPL 自适应最优控制公式可以减少自适应法则的数量，从而降低计算复杂度。此外，还提供了严格的稳定性分析，证明所提出的自适应 RL 可确保闭环系统中所有信号的统一终极约束性（UUB）。最后，仿真结果表明了所提出的最优 RL 控制方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Advances in Mechanical Engineering

自引率

0.00%

发文量