Optimality equations in undiscounted Markov decision processes

Proceedings of the 38th IEEE Conference on Decision and Control (Cat. No.99CH36304) Pub Date : 1999-12-07 DOI:10.1109/CDC.1999.832928

M. Puterman

引用次数: 0

Abstract

We explore properties of the average and bias optimality equations in unichain Markov decision processes. We show that in unichain models, these equations have the same form, so that theory for gain optimality carries over to bias optimality.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

未折现马尔可夫决策过程的最优性方程

研究了单链马尔可夫决策过程中平均最优性方程和偏优性方程的性质。我们表明，在单链模型中，这些方程具有相同的形式，因此增益最优性理论延续到偏差最优性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 38th IEEE Conference on Decision and Control (Cat. No.99CH36304)

自引率

0.00%

发文量

期刊最新文献

A systematic and numerically efficient procedure for stable dynamic model inversion of LTI systems Controller design for improving the degree of stability of periodic solutions in forced nonlinear systems A Bayesian approach to the missing features problem in classification Stability analysis and systematic design of fuzzy controllers with simplified linear control rules Best linear unbiased estimation filters with FIR structures for state space signal models