{"title":"具有长期平均标准的零和非稳态随机博弈","authors":"Zewu Zheng, Xin Guo","doi":"10.1007/s00245-024-10182-x","DOIUrl":null,"url":null,"abstract":"<div><p>This paper is concerned with the existence and computation of an equilibrium for a non-stationary average stochastic zero-sum game with Borel spaces, in which the payoff functions and transition probabilities are allowed to change over time. First, we present an extension of the span-fixed point theorem for an operator to a sequence of time-dependent operators. Second, we find a new set of conditions, which is the generalization of the ergodicity ones in the existing literature. Using the extension of the span-fixed point theorem and the novel conditions, we prove the existence of a solution to the average-reward game equations (ARGEs). Third, by the ARGEs we establish the existence of the value and the equilibrium for this game. Moreover,by constructing an approximation sequence of the solution to the ARGEs, we provide a rolling horizon algorithm for computing the value and <span>\\( \\varepsilon \\)</span>-equilibria, and also prove the convergence of the algorithm. Finally, we illustrate the conditions and results in this paper by several energy management models.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"90 2","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Zero-Sum Non-stationary Stochastic Games with the Long-Run Average Criterion\",\"authors\":\"Zewu Zheng, Xin Guo\",\"doi\":\"10.1007/s00245-024-10182-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This paper is concerned with the existence and computation of an equilibrium for a non-stationary average stochastic zero-sum game with Borel spaces, in which the payoff functions and transition probabilities are allowed to change over time. First, we present an extension of the span-fixed point theorem for an operator to a sequence of time-dependent operators. Second, we find a new set of conditions, which is the generalization of the ergodicity ones in the existing literature. Using the extension of the span-fixed point theorem and the novel conditions, we prove the existence of a solution to the average-reward game equations (ARGEs). Third, by the ARGEs we establish the existence of the value and the equilibrium for this game. Moreover,by constructing an approximation sequence of the solution to the ARGEs, we provide a rolling horizon algorithm for computing the value and <span>\\\\( \\\\varepsilon \\\\)</span>-equilibria, and also prove the convergence of the algorithm. Finally, we illustrate the conditions and results in this paper by several energy management models.</p></div>\",\"PeriodicalId\":55566,\"journal\":{\"name\":\"Applied Mathematics and Optimization\",\"volume\":\"90 2\",\"pages\":\"\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2024-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Mathematics and Optimization\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s00245-024-10182-x\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATHEMATICS, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Mathematics and Optimization","FirstCategoryId":"100","ListUrlMain":"https://link.springer.com/article/10.1007/s00245-024-10182-x","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
Zero-Sum Non-stationary Stochastic Games with the Long-Run Average Criterion
This paper is concerned with the existence and computation of an equilibrium for a non-stationary average stochastic zero-sum game with Borel spaces, in which the payoff functions and transition probabilities are allowed to change over time. First, we present an extension of the span-fixed point theorem for an operator to a sequence of time-dependent operators. Second, we find a new set of conditions, which is the generalization of the ergodicity ones in the existing literature. Using the extension of the span-fixed point theorem and the novel conditions, we prove the existence of a solution to the average-reward game equations (ARGEs). Third, by the ARGEs we establish the existence of the value and the equilibrium for this game. Moreover,by constructing an approximation sequence of the solution to the ARGEs, we provide a rolling horizon algorithm for computing the value and \( \varepsilon \)-equilibria, and also prove the convergence of the algorithm. Finally, we illustrate the conditions and results in this paper by several energy management models.
期刊介绍:
The Applied Mathematics and Optimization Journal covers a broad range of mathematical methods in particular those that bridge with optimization and have some connection with applications. Core topics include calculus of variations, partial differential equations, stochastic control, optimization of deterministic or stochastic systems in discrete or continuous time, homogenization, control theory, mean field games, dynamic games and optimal transport. Algorithmic, data analytic, machine learning and numerical methods which support the modeling and analysis of optimization problems are encouraged. Of great interest are papers which show some novel idea in either the theory or model which include some connection with potential applications in science and engineering.