Data-Driven Tracking Control for Nonaffine Yaw Channel of Helicopter via Off-Policy Reinforcement Learning

IF 5.7 · Zone 2 (Computer Science) · JCR Q1 (Engineering, Aerospace) · IEEE Transactions on Aerospace and Electronic Systems · Pub Date: 2025-02-06 · DOI: 10.1109/TAES.2025.3539264

Kun Zhang; Shijie Luo; Huai-Ning Wu; Rong Su

Volume 61, issue 3, pages 7725-7737. Full text: https://ieeexplore.ieee.org/document/10876598/

Citations: 0

Abstract

This article presents an off-policy tracking control scheme for the continuous-time nonaffine yaw channel of an uncrewed aerial vehicle (UAV) helicopter. First, an affine augmented system (AAS) is constructed within a parallel control structure to convert the original nonaffine tracking error dynamics into affine dynamics. Second, a stability criterion linking the nonaffine system and the AAS is derived, showing that the zero-sum policy obtained from the AAS achieves the $H_\infty$ performance of the nonaffine system. Third, a data-driven off-policy tracking algorithm is designed to approximate the zero-sum solution of the Hamilton–Jacobi–Isaacs equations with unknown dynamics. Moreover, a recursive least squares process with a variable forgetting factor is employed to update the actor-critic neural network weights, and the convergence of the algorithm is proven. The tracking errors are then guaranteed to be uniformly ultimately bounded. Finally, two application examples are simulated to validate the effectiveness of the presented method.
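The abstract names a recursive least squares (RLS) update with a variable forgetting factor but does not give the update law. As a rough illustration only, the sketch below shows a textbook RLS step for a scalar linear model in which the forgetting factor shrinks when the prediction error is large. The error-driven rule for `lam`, the names `vff_rls_step` and `fit_demo`, the parameters `lam_min`, `lam_max`, `rho`, and the toy regression target are all assumptions made for this example, not the paper's algorithm.

```python
import math
import random


def vff_rls_step(w, P, phi, y, lam_min=0.90, lam_max=0.999, rho=5.0):
    """One RLS update for the scalar model y ~ w . phi.

    The forgetting-factor rule below is an assumed heuristic,
    not the rule used in the paper.
    """
    n = len(w)
    # A-priori prediction error with the current weights.
    err = y - sum(wi * pi for wi, pi in zip(w, phi))
    # Large errors push lam toward lam_min, so old data is forgotten faster.
    lam = lam_min + (lam_max - lam_min) * math.exp(-rho * err * err)
    # P @ phi and the gain vector K = P phi / (lam + phi^T P phi).
    P_phi = [sum(P[i][j] * phi[j] for j in range(n)) for i in range(n)]
    denom = lam + sum(phi[i] * P_phi[i] for i in range(n))
    K = [v / denom for v in P_phi]
    # Weight update, then covariance update. P is symmetric,
    # so phi^T P equals (P phi)^T and P_phi can be reused.
    w_new = [w[i] + K[i] * err for i in range(n)]
    P_new = [[(P[i][j] - K[i] * P_phi[j]) / lam for j in range(n)]
             for i in range(n)]
    return w_new, P_new, lam


def fit_demo(steps=400, seed=0):
    """Recover hypothetical weights [1.5, -0.7] from noiseless random regressors."""
    rng = random.Random(seed)
    true_w = [1.5, -0.7]
    w = [0.0, 0.0]
    P = [[100.0, 0.0], [0.0, 100.0]]
    for _ in range(steps):
        phi = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        y = sum(a * b for a, b in zip(true_w, phi))
        w, P, _ = vff_rls_step(w, P, phi, y)
    return w
```

Calling `fit_demo()` returns weight estimates that should approach the hypothetical targets. In an actor-critic setting the regressor `phi` would instead be built from neural network basis functions evaluated on measured data, which this toy example does not attempt to model.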

Source journal

IEEE Transactions on Aerospace and Electronic Systems (Engineering & Technology: Telecommunications)

CiteScore: 7.80

Self-citation rate: 13.60%

Articles published: 433

Review time: 8.7 months

Journal description: IEEE Transactions on Aerospace and Electronic Systems focuses on the organization, design, development, integration, and operation of complex systems for space, air, ocean, or ground environments. These systems include, but are not limited to, navigation, avionics, spacecraft, aerospace power, radar, sonar, telemetry, defense, transportation, automated testing, and command and control.