{"title":"Prescribed-Time Optimal Formation Synchronous Tracking Control of UAV-Ugvs Based on Reinforcement Learning","authors":"Shi-Xun Xiong, Guo-Ping Jiang, Yun-Xia Zhu, Xiao-Ming He, Shu-Han Chen","doi":"10.1002/rnc.7808","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This paper explores the issue of prescribed-time optimal formation synchronous tracking control of nonlinear unmanned aerial vehicle (UAV) and unmanned ground vehicle (UGV) systems. Based on a novel constructed second-order nonlinear UAV-UGV swarm case, the reinforcement learning (RL) method is introduced to obtain the optimal control scheme, and a gradient descent method with a simple positive function for the Hamilton-Jacobi-Bellman (HJB) equation is improved to establish the adaptive actor and critic networks and solve the iterate adaptive laws, which allows adaptive parameters to be trained more thoroughly. Integrating the prescribed-time constraints of formation tasks, the incorporation of traditional prescribed-time functions can result in structural modifications within the RL framework and state coupling, thereby increasing the complexity of control strategy design. Hence, the prescribed-time functions are designed in the critic and actor networks, which address the state coupling and optimize the acquisition of adaptive parameters under the gradient descent method. Then, by employing the aforementioned methods, an optimal synchronous control scheme is proposed to address nonlinear UAV-UGV time-varying formation tracking at a settling time, and a pivotal scaling technique is used for formation stability analysis. Finally, simulation and experiment results are carried out to demonstrate the efficacy of the proposed approach.</p>\n </div>","PeriodicalId":50291,"journal":{"name":"International Journal of Robust and Nonlinear Control","volume":"35 6","pages":"2437-2450"},"PeriodicalIF":3.2000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Robust and Nonlinear Control","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rnc.7808","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper explores the issue of prescribed-time optimal formation synchronous tracking control of nonlinear unmanned aerial vehicle (UAV) and unmanned ground vehicle (UGV) systems. Based on a novel constructed second-order nonlinear UAV-UGV swarm case, the reinforcement learning (RL) method is introduced to obtain the optimal control scheme, and a gradient descent method with a simple positive function for the Hamilton-Jacobi-Bellman (HJB) equation is improved to establish the adaptive actor and critic networks and solve the iterate adaptive laws, which allows adaptive parameters to be trained more thoroughly. Integrating the prescribed-time constraints of formation tasks, the incorporation of traditional prescribed-time functions can result in structural modifications within the RL framework and state coupling, thereby increasing the complexity of control strategy design. Hence, the prescribed-time functions are designed in the critic and actor networks, which address the state coupling and optimize the acquisition of adaptive parameters under the gradient descent method. Then, by employing the aforementioned methods, an optimal synchronous control scheme is proposed to address nonlinear UAV-UGV time-varying formation tracking at a settling time, and a pivotal scaling technique is used for formation stability analysis. Finally, simulation and experiment results are carried out to demonstrate the efficacy of the proposed approach.
期刊介绍:
Papers that do not include an element of robust or nonlinear control and estimation theory will not be considered by the journal, and all papers will be expected to include significant novel content. The focus of the journal is on model based control design approaches rather than heuristic or rule based methods. Papers on neural networks will have to be of exceptional novelty to be considered for the journal.