Kun Zhang;Zhi-Xuan Zhang;Xiang Peng Xie;Jose de Jesus Rubio
{"title":"An Unknown Multiplayer Nonzero-Sum Game: Prescribed-Time Dynamic Event-Triggered Control via Adaptive Dynamic Programming","authors":"Kun Zhang;Zhi-Xuan Zhang;Xiang Peng Xie;Jose de Jesus Rubio","doi":"10.1109/TASE.2024.3484412","DOIUrl":null,"url":null,"abstract":"In this paper, the novel prescribed-time dynamic event-triggered control method of an unknown multiplayer nonzero-sum game (MP-NZSG) is designed by using adaptive dynamic programming (ADP). Firstly, a neural network-based identifier is constructed to estimate the unknown system dynamics. Subsequently, a novel ADP-based dynamic event-triggered control approach is advanced to ensure optimality and prescribed-time stability. A critic neural network (NN) is established for each player to approximate the Nash equilibrium solution of the dynamic event-triggered Hamilton-Jacobi-Isaacs (HJI) equation. This network employs a novel weight updating law, based on the experience replay technique, to alleviate the persistence of excitation condition. Furthermore, using the Lyapunov method, the uniform limit boundedness analysis of the neural network approximation error and multiplayer system is validated. Additionally, minimum inter-event time (MIET) is conclusively established to mitigate the notorious Zeno behaviour. Ultimately, the efficacy of the proposed method is rigorously substantiated through comprehensive simulation results. Note to Practitioners—Our research addresses the challenges of multi-component coordinated control, particularly in spacecraft attitude control. To handle these complexities, we propose an innovative adaptive dynamic event-triggered control approach. By integrating adaptive dynamic programming and neural networks, we effectively model and manage unknown system dynamics, enhancing the controller’s adaptability and robustness. Dynamic event-triggered policies are introduced to optimize system performance and reduce computational costs. The ADP-based prescribed time optimal control scheme prioritizes steady-state performance of nonlinear nonaffine systems, ensuring precise task completion within specified timeframes. Additionally, experience replay technology further fortifies the controller’s learning and adaptability to dynamic environments.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"8317-8328"},"PeriodicalIF":6.4000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10736631/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, the novel prescribed-time dynamic event-triggered control method of an unknown multiplayer nonzero-sum game (MP-NZSG) is designed by using adaptive dynamic programming (ADP). Firstly, a neural network-based identifier is constructed to estimate the unknown system dynamics. Subsequently, a novel ADP-based dynamic event-triggered control approach is advanced to ensure optimality and prescribed-time stability. A critic neural network (NN) is established for each player to approximate the Nash equilibrium solution of the dynamic event-triggered Hamilton-Jacobi-Isaacs (HJI) equation. This network employs a novel weight updating law, based on the experience replay technique, to alleviate the persistence of excitation condition. Furthermore, using the Lyapunov method, the uniform limit boundedness analysis of the neural network approximation error and multiplayer system is validated. Additionally, minimum inter-event time (MIET) is conclusively established to mitigate the notorious Zeno behaviour. Ultimately, the efficacy of the proposed method is rigorously substantiated through comprehensive simulation results. Note to Practitioners—Our research addresses the challenges of multi-component coordinated control, particularly in spacecraft attitude control. To handle these complexities, we propose an innovative adaptive dynamic event-triggered control approach. By integrating adaptive dynamic programming and neural networks, we effectively model and manage unknown system dynamics, enhancing the controller’s adaptability and robustness. Dynamic event-triggered policies are introduced to optimize system performance and reduce computational costs. The ADP-based prescribed time optimal control scheme prioritizes steady-state performance of nonlinear nonaffine systems, ensuring precise task completion within specified timeframes. Additionally, experience replay technology further fortifies the controller’s learning and adaptability to dynamic environments.
期刊介绍:
The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.