游戏中的安全追踪：在未知动态和约束条件下实现最优控制

IF 2.7 4区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS Asian Journal of Control Pub Date : 2024-05-01 DOI:10.1002/asjc.3397

Xiaohong Cui, Wenjie Chen, Binrui Wang, Kun Zhou

{"title":"游戏中的安全追踪：在未知动态和约束条件下实现最优控制","authors":"Xiaohong Cui, Wenjie Chen, Binrui Wang, Kun Zhou","doi":"10.1002/asjc.3397","DOIUrl":null,"url":null,"abstract":"<p>This paper introduces mix-zero-sum differential (MZSD) game theory to address multi-player tracking systems, offering a better understanding of the coexistence of cooperation and competition among players. Within this framework, we present an optimal safety tracking control (OSTC) method, which incorporates a control barrier function (CBF) into the value function to ensure that the tracking error remains within a specified range, thus guaranteeing safety while achieving optimization. Simultaneously, to eliminate the need for system dynamics, we propose a novel approach leveraging off-policy integral reinforcement learning (IRL) technology to obtain the Nash equilibrium solution of the MZSD games. We establish a unique critics–actors neural network (NN) structure that updates concurrently. Furthermore, we analyze stability and convergence using the Lyapunov method. We conduct two simulations to demonstrate the effectiveness of the proposed algorithm.</p>","PeriodicalId":55453,"journal":{"name":"Asian Journal of Control","volume":"26 6","pages":"3190-3209"},"PeriodicalIF":2.7000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Safe tracking in games: Achieving optimal control with unknown dynamics and constraints\",\"authors\":\"Xiaohong Cui, Wenjie Chen, Binrui Wang, Kun Zhou\",\"doi\":\"10.1002/asjc.3397\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This paper introduces mix-zero-sum differential (MZSD) game theory to address multi-player tracking systems, offering a better understanding of the coexistence of cooperation and competition among players. Within this framework, we present an optimal safety tracking control (OSTC) method, which incorporates a control barrier function (CBF) into the value function to ensure that the tracking error remains within a specified range, thus guaranteeing safety while achieving optimization. Simultaneously, to eliminate the need for system dynamics, we propose a novel approach leveraging off-policy integral reinforcement learning (IRL) technology to obtain the Nash equilibrium solution of the MZSD games. We establish a unique critics–actors neural network (NN) structure that updates concurrently. Furthermore, we analyze stability and convergence using the Lyapunov method. We conduct two simulations to demonstrate the effectiveness of the proposed algorithm.</p>\",\"PeriodicalId\":55453,\"journal\":{\"name\":\"Asian Journal of Control\",\"volume\":\"26 6\",\"pages\":\"3190-3209\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2024-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asian Journal of Control\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/asjc.3397\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian Journal of Control","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/asjc.3397","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

本文将混合零和差分博弈论（MZSD）引入多玩家跟踪系统，从而更好地理解玩家之间合作与竞争并存的关系。在此框架内，我们提出了一种最优安全跟踪控制（OSTC）方法，该方法在值函数中加入了控制障碍函数（CBF），以确保跟踪误差保持在指定范围内，从而在实现优化的同时保证安全性。同时，为了消除对系统动力学的需求，我们提出了一种利用非政策积分强化学习（IRL）技术的新方法，以获得 MZSD 博弈的纳什均衡解。我们建立了一种独特的批评者-行动者神经网络（NN）结构，可同时更新。此外，我们还利用 Lyapunov 方法分析了稳定性和收敛性。我们进行了两次模拟，以证明所提算法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Safe tracking in games: Achieving optimal control with unknown dynamics and constraints

This paper introduces mix-zero-sum differential (MZSD) game theory to address multi-player tracking systems, offering a better understanding of the coexistence of cooperation and competition among players. Within this framework, we present an optimal safety tracking control (OSTC) method, which incorporates a control barrier function (CBF) into the value function to ensure that the tracking error remains within a specified range, thus guaranteeing safety while achieving optimization. Simultaneously, to eliminate the need for system dynamics, we propose a novel approach leveraging off-policy integral reinforcement learning (IRL) technology to obtain the Nash equilibrium solution of the MZSD games. We establish a unique critics–actors neural network (NN) structure that updates concurrently. Furthermore, we analyze stability and convergence using the Lyapunov method. We conduct two simulations to demonstrate the effectiveness of the proposed algorithm.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Asian Journal of Control 工程技术-自动化与控制系统

CiteScore

4.80

自引率

25.00%

发文量

253

审稿时长

7.2 months

期刊介绍： The Asian Journal of Control, an Asian Control Association (ACA) and Chinese Automatic Control Society (CACS) affiliated journal, is the first international journal originating from the Asia Pacific region. The Asian Journal of Control publishes papers on original theoretical and practical research and developments in the areas of control, involving all facets of control theory and its application. Published six times a year, the Journal aims to be a key platform for control communities throughout the world. The Journal provides a forum where control researchers and practitioners can exchange knowledge and experiences on the latest advances in the control areas, and plays an educational role for students and experienced researchers in other disciplines interested in this continually growing field. The scope of the journal is extensive. Topics include: The theory and design of control systems and components, encompassing: Robust and distributed control using geometric, optimal, stochastic and nonlinear methods Game theory and state estimation Adaptive control, including neural networks, learning, parameter estimation and system fault detection Artificial intelligence, fuzzy and expert systems Hierarchical and man-machine systems All parts of systems engineering which consider the reliability of components and systems Emerging application areas, such as: Robotics Mechatronics Computers for computer-aided design, manufacturing, and control of various industrial processes Space vehicles and aircraft, ships, and traffic Biomedical systems National economies Power systems Agriculture Natural resources.

期刊最新文献

Issue Information Issue Information Issue Information Issue Information Adaptive output feedback time-varying formation tracking of multi-agent system with a leader of unknown input