Likelihood Ratio Gradient Estimation for Steady-State Parameters

Q1 Mathematics Stochastic Systems Pub Date : 2017-07-09 DOI:10.1287/STSY.2018.0023

P. Glynn, Mariana Olvera-Cravioto

引用次数: 8

Abstract

We consider a discrete-time Markov chain $\boldsymbol{\Phi}$ on a general state-space ${\sf X}$, whose transition probabilities are parameterized by a real-valued vector $\boldsymbol{\theta}$. Under the assumption that $\boldsymbol{\Phi}$ is geometrically ergodic with corresponding stationary distribution $\pi(\boldsymbol{\theta})$, we are interested in estimating the gradient $\nabla \alpha(\boldsymbol{\theta})$ of the steady-state expectation $$\alpha(\boldsymbol{\theta}) = \pi( \boldsymbol{\theta}) f.$$ To this end, we first give sufficient conditions for the differentiability of $\alpha(\boldsymbol{\theta})$ and for the calculation of its gradient via a sequence of finite horizon expectations. We then propose two different likelihood ratio estimators and analyze their limiting behavior.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

稳态参数的似然比梯度估计

我们考虑一般状态空间$｛\sf X｝$上的离散时间马尔可夫链$\boldsymbol｛\Phi｝$，其转移概率由实值向量$\boldsymbol｛\theta｝$参数化。在$\boldsymbol｛\Phi｝$与相应的平稳分布$\pi（\boldsymbol{theta｝，我们首先给出了$\alpha（\boldsymbol｛theta｝）$的可微性以及通过有限的视界期望序列计算其梯度的充分条件。然后，我们提出了两种不同的似然比估计，并分析了它们的极限行为。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Stochastic Systems Decision Sciences-Statistics, Probability and Uncertainty

CiteScore

3.70

自引率

0.00%

发文量