{"title":"Continuous-time zero-sum games for Markov decision processes with risk-sensitive finite-horizon cost criterion on a general state space","authors":"Subrata Golui, Chandan Pal","doi":"10.17993/3cemp.2022.110250.76-92","DOIUrl":null,"url":null,"abstract":"In this manuscript, we study continuous-time risk-sensitive finite-horizon time-homogeneous zero-sum dynamic games for controlled Markov decision processes (MDP) on a Borel space. Here, the transition and payoff functions are extended real-valued functions. We prove the existence of the game’s value and the uniqueness of the solution of Shapley equation under some reasonable assumptions. Moreover, all possible saddle-point equilibria are completely characterized in the class of all admissible feedback multi-strategies. We also provide an example to support our assumptions.","PeriodicalId":365908,"journal":{"name":"3C Empresa. Investigación y pensamiento crítico","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"3C Empresa. Investigación y pensamiento crítico","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17993/3cemp.2022.110250.76-92","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citation count: 0
Abstract
In this manuscript, we study continuous-time, risk-sensitive, finite-horizon, time-homogeneous zero-sum dynamic games for controlled Markov decision processes (MDPs) on a Borel space. Here, the transition and payoff functions are extended real-valued functions. Under some reasonable assumptions, we prove the existence of the game’s value and the uniqueness of the solution of the Shapley equation. Moreover, all possible saddle-point equilibria are completely characterized in the class of all admissible feedback multi-strategies. We also provide an example to illustrate our assumptions.