Continuous-time zero-sum games for Markov decision processes with risk-sensitive finite-horizon cost criterion on a general state space

3C Empresa. Investigación y pensamiento crítico, Pub Date: 2022-12-29, DOI: 10.17993/3cemp.2022.110250.76-92

Subrata Golui, Chandan Pal

Citations: 0

Abstract

In this manuscript, we study continuous-time, risk-sensitive, finite-horizon, time-homogeneous zero-sum dynamic games for controlled Markov decision processes (MDPs) on a Borel space. Here, the transition and payoff functions are extended real-valued functions. Under some reasonable assumptions, we prove the existence of the value of the game and the uniqueness of the solution of the Shapley equation. Moreover, all possible saddle-point equilibria are completely characterized in the class of all admissible feedback multi-strategies. We also provide an example to support our assumptions.



