Adaptive collision-free control for UAVs with discrete-time system based on reinforcement learning

Xiaoyu Huo, Yanan Guo
{"title":"Adaptive collision-free control for UAVs with discrete-time system based on reinforcement learning","authors":"Xiaoyu Huo, Yanan Guo","doi":"10.1177/16878132231225321","DOIUrl":null,"url":null,"abstract":"A flexible reinforcement learning (RL) optimal collision-avoidance control formulation for unmanned aerial vehicles (UAVs) with discrete-time frameworks is revealed in this work. By utilizing the neural network (NN) estimating capacity and the actor-critic control scheme of the RL technique, an adaptive RL optimal collision-free controller with a minimal learning parameter (MLP) is formulated, which is based on a novel strategic utility function. The optimal collision-avoidance control issue, which couldn’t be addressed in the prior literature, can be resolved by the suggested approaches. Furthermore, the proposed MPL adaptive optimal control formulation allows for a reduction in the quantity of adaptive laws, leading to reduced computational complexity. Additionally, a rigorous stability analysis is provided, demonstrating that the uniform ultimate boundedness (UUB) of all signals in the closed-loop system is ensured by the proposed adaptive RL. Finally, the simulation outcomes illustrate the effectiveness of the proposed optimal RL control approaches.","PeriodicalId":502561,"journal":{"name":"Advances in Mechanical Engineering","volume":"17 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Mechanical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/16878132231225321","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

A flexible reinforcement learning (RL) optimal collision-avoidance control formulation for unmanned aerial vehicles (UAVs) with discrete-time frameworks is revealed in this work. By utilizing the neural network (NN) estimating capacity and the actor-critic control scheme of the RL technique, an adaptive RL optimal collision-free controller with a minimal learning parameter (MLP) is formulated, which is based on a novel strategic utility function. The optimal collision-avoidance control issue, which couldn’t be addressed in the prior literature, can be resolved by the suggested approaches. Furthermore, the proposed MPL adaptive optimal control formulation allows for a reduction in the quantity of adaptive laws, leading to reduced computational complexity. Additionally, a rigorous stability analysis is provided, demonstrating that the uniform ultimate boundedness (UUB) of all signals in the closed-loop system is ensured by the proposed adaptive RL. Finally, the simulation outcomes illustrate the effectiveness of the proposed optimal RL control approaches.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于强化学习的无人飞行器离散时间系统自适应无碰撞控制
本研究揭示了离散时间框架下无人驾驶飞行器(UAV)灵活的强化学习(RL)最优防碰撞控制方案。通过利用 RL 技术的神经网络(NN)估计能力和行为批判控制方案,制定了一种具有最小学习参数(MLP)的自适应 RL 最佳防碰撞控制器,该控制器基于一种新颖的战略效用函数。该方法解决了以往文献中无法解决的最优防碰撞控制问题。此外,所提出的 MPL 自适应最优控制公式可以减少自适应法则的数量,从而降低计算复杂度。此外,还提供了严格的稳定性分析,证明所提出的自适应 RL 可确保闭环系统中所有信号的统一终极约束性(UUB)。最后,仿真结果表明了所提出的最优 RL 控制方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Optimization of Al2O3 nanoparticle concentration and cutting parameters in hard milling under nanofluid MQL environment Analysis of the valve positioner pilot valve hole plugging Integrated design of insertions-extractions performance and contact reliability of spring-wire socket electrical connector Wear mechanisms of diamond segmenta in cutting of carbon fiber reinforced cement-based composite and optimizing in parameters Thermal transport exploration of ternary hybrid nanofluid flow in a non-Newtonian model with homogeneous-heterogeneous chemical reactions induced by vertical cylinder
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1