Dynamic Event-Driven ADP for N-Player Nonzero-Sum Games of Constrained Nonlinear Systems

IF 6.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automation Science and Engineering Pub Date : 2024-10-08 DOI:10.1109/TASE.2024.3467382
Siyu Guo;Yingnan Pan;Hongyi Li;Liang Cao
{"title":"Dynamic Event-Driven ADP for N-Player Nonzero-Sum Games of Constrained Nonlinear Systems","authors":"Siyu Guo;Yingnan Pan;Hongyi Li;Liang Cao","doi":"10.1109/TASE.2024.3467382","DOIUrl":null,"url":null,"abstract":"In this paper, the dynamic event-driven optimal control problem is investigated for a class of continuous-time nonlinear systems subject to asymmetric input constraints in the framework of nonzero-sum (NZS) games. Initially, by constructing a modified value function, the respective asymmetric input constraint requirements of the controllers involved in the NZS games are successfully satisfied. Then, based on the Bellman’s optimality principle, the N-coupled Hamilton-Jacobi equations are derived for the N-player NZS games. After that, the adaptive dynamic programming (ADP) method is employed to seek for the optimal control policies, in which the simpler single critic neural network structure, instead of the dual network structure of actor-critic in the typical ADP algorithm, is applied. Furthermore, an improved critic network weight updating law is proposed to ensure the stability of the closed-loop system without a hard-to-find initial admissible control scheme. In addition, in order to reduce the update frequency of the controllers to a greater extent, a dynamic event-driven mechanism with adjustable threshold is developed. Finally, a simulation example is given to demonstrate the validity of the developed event-driven control scheme. Note to Practitioners—This paper aims to address the NZS games problem for a category of multi-player continuous-time nonlinear systems featuring multiple input constraints. The applicability of this approach can be widely extended to practical domains, including control applications for reconfigurable robot systems, networked communication systems, etc. The majority of researches on multi-player NZS games problem are focused on the impact of symmetric input constraints. Especially under the premise of ensuring controller optimality, the challenge lies in how to ensure effective control functionality while subjecting the controller to asymmetric constraints. Furthermore, the existing ADP algorithms often depend on an initial admissible control, significantly elevating the implementation difficulty of control solutions in practical applications. To address these challenges, an improved ADP algorithm is developed for input-constrained nonlinear systems within a NZS game framework. This method not only guarantees that the optimal controllers under asymmetric constraints can stabilize all signals, but also avoids the search for challenging-to-find initial admissible controls, thus streamlining the control implementation process.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"7657-7669"},"PeriodicalIF":6.4000,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10709347/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, the dynamic event-driven optimal control problem is investigated for a class of continuous-time nonlinear systems subject to asymmetric input constraints in the framework of nonzero-sum (NZS) games. Initially, by constructing a modified value function, the respective asymmetric input constraint requirements of the controllers involved in the NZS games are successfully satisfied. Then, based on the Bellman’s optimality principle, the N-coupled Hamilton-Jacobi equations are derived for the N-player NZS games. After that, the adaptive dynamic programming (ADP) method is employed to seek for the optimal control policies, in which the simpler single critic neural network structure, instead of the dual network structure of actor-critic in the typical ADP algorithm, is applied. Furthermore, an improved critic network weight updating law is proposed to ensure the stability of the closed-loop system without a hard-to-find initial admissible control scheme. In addition, in order to reduce the update frequency of the controllers to a greater extent, a dynamic event-driven mechanism with adjustable threshold is developed. Finally, a simulation example is given to demonstrate the validity of the developed event-driven control scheme. Note to Practitioners—This paper aims to address the NZS games problem for a category of multi-player continuous-time nonlinear systems featuring multiple input constraints. The applicability of this approach can be widely extended to practical domains, including control applications for reconfigurable robot systems, networked communication systems, etc. The majority of researches on multi-player NZS games problem are focused on the impact of symmetric input constraints. Especially under the premise of ensuring controller optimality, the challenge lies in how to ensure effective control functionality while subjecting the controller to asymmetric constraints. Furthermore, the existing ADP algorithms often depend on an initial admissible control, significantly elevating the implementation difficulty of control solutions in practical applications. To address these challenges, an improved ADP algorithm is developed for input-constrained nonlinear systems within a NZS game framework. This method not only guarantees that the optimal controllers under asymmetric constraints can stabilize all signals, but also avoids the search for challenging-to-find initial admissible controls, thus streamlining the control implementation process.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
受约束非线性系统 N 人非零和博弈的动态事件驱动 ADP
在非零和对策的框架下,研究了一类具有非对称输入约束的连续时间非线性系统的动态事件驱动最优控制问题。首先,通过构造一个修正的值函数,成功地满足了NZS博弈中控制器各自的非对称输入约束要求。然后,基于Bellman最优性原理,导出了n人NZS对策的n耦合Hamilton-Jacobi方程。然后,采用自适应动态规划(ADP)方法寻求最优控制策略,采用较简单的单批评家神经网络结构,取代了典型ADP算法中行动者-批评家的双网络结构。在此基础上,提出了一种改进的临界网络权值更新律,以保证闭环系统的稳定性,避免初始可接受控制方案难以找到。此外,为了更大程度地降低控制器的更新频率,开发了一种阈值可调的动态事件驱动机制。最后,通过仿真实例验证了所提出的事件驱动控制方案的有效性。从业人员注意:本文旨在解决一类具有多个输入约束的多人连续时间非线性系统的NZS游戏问题。这种方法的适用性可以广泛扩展到实际领域,包括可重构机器人系统的控制应用,网络通信系统等。对于多人博弈问题的研究大多集中在对称输入约束的影响上。特别是在保证控制器最优性的前提下,如何在使控制器受到非对称约束的情况下保证有效的控制功能是一个挑战。此外,现有的ADP算法往往依赖于初始允许控制,这大大提高了控制方案在实际应用中的实现难度。为了解决这些挑战,在NZS游戏框架中,为输入受限的非线性系统开发了改进的ADP算法。该方法不仅保证了非对称约束下的最优控制器能够稳定所有信号,而且避免了寻找难以找到的初始允许控制,从而简化了控制实现过程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Automation Science and Engineering
IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统
CiteScore
12.50
自引率
14.30%
发文量
404
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.
期刊最新文献
Hybrid Event-Triggered Fuzzy Secure Consensus for PDE-Based Multi-Agent Systems Subject to Time Delays and Multiple Attacks An Advanced Hierarchical Control Strategy for Modeling and Stability Evaluation of a Novel Series-Connected Energy Routing System Nesterov Accelerated Gradient-Based Fixed-Time Convergent Actor-Critic Control for Nonlinear Systems Zero-Sum Game-based Optimal Estimation-Compensation Control for Multi-Agent Systems under Hybrid Attacks Resilient Synchronization of Multi-Leader MASs under Random Link Failure Constraints
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1