An Unknown Multiplayer Nonzero-Sum Game: Prescribed-Time Dynamic Event-Triggered Control via Adaptive Dynamic Programming

IF 6.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automation Science and Engineering Pub Date : 2024-10-28 DOI:10.1109/TASE.2024.3484412
Kun Zhang;Zhi-Xuan Zhang;Xiang Peng Xie;Jose de Jesus Rubio
{"title":"An Unknown Multiplayer Nonzero-Sum Game: Prescribed-Time Dynamic Event-Triggered Control via Adaptive Dynamic Programming","authors":"Kun Zhang;Zhi-Xuan Zhang;Xiang Peng Xie;Jose de Jesus Rubio","doi":"10.1109/TASE.2024.3484412","DOIUrl":null,"url":null,"abstract":"In this paper, the novel prescribed-time dynamic event-triggered control method of an unknown multiplayer nonzero-sum game (MP-NZSG) is designed by using adaptive dynamic programming (ADP). Firstly, a neural network-based identifier is constructed to estimate the unknown system dynamics. Subsequently, a novel ADP-based dynamic event-triggered control approach is advanced to ensure optimality and prescribed-time stability. A critic neural network (NN) is established for each player to approximate the Nash equilibrium solution of the dynamic event-triggered Hamilton-Jacobi-Isaacs (HJI) equation. This network employs a novel weight updating law, based on the experience replay technique, to alleviate the persistence of excitation condition. Furthermore, using the Lyapunov method, the uniform limit boundedness analysis of the neural network approximation error and multiplayer system is validated. Additionally, minimum inter-event time (MIET) is conclusively established to mitigate the notorious Zeno behaviour. Ultimately, the efficacy of the proposed method is rigorously substantiated through comprehensive simulation results. Note to Practitioners—Our research addresses the challenges of multi-component coordinated control, particularly in spacecraft attitude control. To handle these complexities, we propose an innovative adaptive dynamic event-triggered control approach. By integrating adaptive dynamic programming and neural networks, we effectively model and manage unknown system dynamics, enhancing the controller’s adaptability and robustness. Dynamic event-triggered policies are introduced to optimize system performance and reduce computational costs. The ADP-based prescribed time optimal control scheme prioritizes steady-state performance of nonlinear nonaffine systems, ensuring precise task completion within specified timeframes. Additionally, experience replay technology further fortifies the controller’s learning and adaptability to dynamic environments.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"8317-8328"},"PeriodicalIF":6.4000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10736631/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, the novel prescribed-time dynamic event-triggered control method of an unknown multiplayer nonzero-sum game (MP-NZSG) is designed by using adaptive dynamic programming (ADP). Firstly, a neural network-based identifier is constructed to estimate the unknown system dynamics. Subsequently, a novel ADP-based dynamic event-triggered control approach is advanced to ensure optimality and prescribed-time stability. A critic neural network (NN) is established for each player to approximate the Nash equilibrium solution of the dynamic event-triggered Hamilton-Jacobi-Isaacs (HJI) equation. This network employs a novel weight updating law, based on the experience replay technique, to alleviate the persistence of excitation condition. Furthermore, using the Lyapunov method, the uniform limit boundedness analysis of the neural network approximation error and multiplayer system is validated. Additionally, minimum inter-event time (MIET) is conclusively established to mitigate the notorious Zeno behaviour. Ultimately, the efficacy of the proposed method is rigorously substantiated through comprehensive simulation results. Note to Practitioners—Our research addresses the challenges of multi-component coordinated control, particularly in spacecraft attitude control. To handle these complexities, we propose an innovative adaptive dynamic event-triggered control approach. By integrating adaptive dynamic programming and neural networks, we effectively model and manage unknown system dynamics, enhancing the controller’s adaptability and robustness. Dynamic event-triggered policies are introduced to optimize system performance and reduce computational costs. The ADP-based prescribed time optimal control scheme prioritizes steady-state performance of nonlinear nonaffine systems, ensuring precise task completion within specified timeframes. Additionally, experience replay technology further fortifies the controller’s learning and adaptability to dynamic environments.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
未知多人非零和博弈:通过自适应动态编程实现规定时间动态事件触发控制
本文采用自适应动态规划(ADP)方法,设计了未知多人非零和博弈(MP-NZSG)的新的规定时间动态事件触发控制方法。首先,构造基于神经网络的辨识器对未知系统动力学进行估计;随后,提出了一种新的基于adp的动态事件触发控制方法,以确保最优性和规定时间稳定性。为求解动态事件触发的Hamilton-Jacobi-Isaacs (HJI)方程的纳什均衡解,建立了每个参与者的评价神经网络(NN)。该网络采用了一种基于经验重放技术的权重更新规律,减轻了激励条件的持久性。此外,利用Lyapunov方法验证了神经网络近似误差和多系统的一致极限有界性分析。此外,最小事件间时间(MIET)最终确定,以减轻臭名昭著的芝诺行为。最后,通过综合仿真结果严格验证了所提方法的有效性。我们的研究解决了多组件协调控制的挑战,特别是在航天器姿态控制方面。为了处理这些复杂性,我们提出了一种创新的自适应动态事件触发控制方法。通过将自适应动态规划与神经网络相结合,有效地对未知系统动态进行建模和管理,增强了控制器的自适应性和鲁棒性。引入动态事件触发策略,优化系统性能,降低计算成本。基于adp的规定时间最优控制方案优先考虑非线性非仿射系统的稳态性能,确保在规定的时间内精确完成任务。此外,经验回放技术进一步加强了控制器的学习能力和对动态环境的适应性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Automation Science and Engineering
IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统
CiteScore
12.50
自引率
14.30%
发文量
404
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.
期刊最新文献
Robust Adaptive Control for Nonlinear Multi-Agent Systems: A Physics-Regularized Neural Backstepping Approach Fixed-Time Tracking Controller With Online Obstacle Avoiding Guidance for Unmanned Surface Vehicles Resilient Event Triggering-based Agent Uncertainty Compensation in Grid-Forming Autonomous Network Ride Comfort Improvement Through Preview Active Suspension Control With Speed Planning Self-Modified Dynamic Domain Adaptation for Industrial Soft Sensing
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1