{"title":"游戏中的安全追踪:在未知动态和约束条件下实现最优控制","authors":"Xiaohong Cui, Wenjie Chen, Binrui Wang, Kun Zhou","doi":"10.1002/asjc.3397","DOIUrl":null,"url":null,"abstract":"<p>This paper introduces mix-zero-sum differential (MZSD) game theory to address multi-player tracking systems, offering a better understanding of the coexistence of cooperation and competition among players. Within this framework, we present an optimal safety tracking control (OSTC) method, which incorporates a control barrier function (CBF) into the value function to ensure that the tracking error remains within a specified range, thus guaranteeing safety while achieving optimization. Simultaneously, to eliminate the need for system dynamics, we propose a novel approach leveraging off-policy integral reinforcement learning (IRL) technology to obtain the Nash equilibrium solution of the MZSD games. We establish a unique critics–actors neural network (NN) structure that updates concurrently. Furthermore, we analyze stability and convergence using the Lyapunov method. We conduct two simulations to demonstrate the effectiveness of the proposed algorithm.</p>","PeriodicalId":55453,"journal":{"name":"Asian Journal of Control","volume":"26 6","pages":"3190-3209"},"PeriodicalIF":2.7000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Safe tracking in games: Achieving optimal control with unknown dynamics and constraints\",\"authors\":\"Xiaohong Cui, Wenjie Chen, Binrui Wang, Kun Zhou\",\"doi\":\"10.1002/asjc.3397\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This paper introduces mix-zero-sum differential (MZSD) game theory to address multi-player tracking systems, offering a better understanding of the coexistence of cooperation and competition among players. Within this framework, we present an optimal safety tracking control (OSTC) method, which incorporates a control barrier function (CBF) into the value function to ensure that the tracking error remains within a specified range, thus guaranteeing safety while achieving optimization. Simultaneously, to eliminate the need for system dynamics, we propose a novel approach leveraging off-policy integral reinforcement learning (IRL) technology to obtain the Nash equilibrium solution of the MZSD games. We establish a unique critics–actors neural network (NN) structure that updates concurrently. Furthermore, we analyze stability and convergence using the Lyapunov method. We conduct two simulations to demonstrate the effectiveness of the proposed algorithm.</p>\",\"PeriodicalId\":55453,\"journal\":{\"name\":\"Asian Journal of Control\",\"volume\":\"26 6\",\"pages\":\"3190-3209\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2024-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asian Journal of Control\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/asjc.3397\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian Journal of Control","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/asjc.3397","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Safe tracking in games: Achieving optimal control with unknown dynamics and constraints
This paper introduces mix-zero-sum differential (MZSD) game theory to address multi-player tracking systems, offering a better understanding of the coexistence of cooperation and competition among players. Within this framework, we present an optimal safety tracking control (OSTC) method, which incorporates a control barrier function (CBF) into the value function to ensure that the tracking error remains within a specified range, thus guaranteeing safety while achieving optimization. Simultaneously, to eliminate the need for system dynamics, we propose a novel approach leveraging off-policy integral reinforcement learning (IRL) technology to obtain the Nash equilibrium solution of the MZSD games. We establish a unique critics–actors neural network (NN) structure that updates concurrently. Furthermore, we analyze stability and convergence using the Lyapunov method. We conduct two simulations to demonstrate the effectiveness of the proposed algorithm.
期刊介绍:
The Asian Journal of Control, an Asian Control Association (ACA) and Chinese Automatic Control Society (CACS) affiliated journal, is the first international journal originating from the Asia Pacific region. The Asian Journal of Control publishes papers on original theoretical and practical research and developments in the areas of control, involving all facets of control theory and its application.
Published six times a year, the Journal aims to be a key platform for control communities throughout the world.
The Journal provides a forum where control researchers and practitioners can exchange knowledge and experiences on the latest advances in the control areas, and plays an educational role for students and experienced researchers in other disciplines interested in this continually growing field. The scope of the journal is extensive.
Topics include:
The theory and design of control systems and components, encompassing:
Robust and distributed control using geometric, optimal, stochastic and nonlinear methods
Game theory and state estimation
Adaptive control, including neural networks, learning, parameter estimation
and system fault detection
Artificial intelligence, fuzzy and expert systems
Hierarchical and man-machine systems
All parts of systems engineering which consider the reliability of components and systems
Emerging application areas, such as:
Robotics
Mechatronics
Computers for computer-aided design, manufacturing, and control of
various industrial processes
Space vehicles and aircraft, ships, and traffic
Biomedical systems
National economies
Power systems
Agriculture
Natural resources.