Learning equilibrium mean-variance strategy

IF 2.4 3区经济学 Q3 BUSINESS, FINANCE Mathematical Finance Pub Date : 2023-06-04 DOI:10.1111/mafi.12402

Min Dai, Yuchao Dong, Yanwei Jia

引用次数: 7

Abstract

We study a dynamic mean-variance portfolio optimization problem under the reinforcement learning framework, where an entropy regularizer is introduced to induce exploration. Due to the time–inconsistency involved in a mean-variance criterion, we aim to learn an equilibrium policy. Under an incomplete market setting, we obtain a semi-analytical, exploratory, equilibrium mean-variance policy that turns out to follow a Gaussian distribution. We then focus on a Gaussian mean return model and propose a reinforcement learning algorithm to find the equilibrium policy. Thanks to a thoroughly designed policy iteration procedure in our algorithm, we prove the convergence of our algorithm under mild conditions, despite that dynamic programming principle and the usual policy improvement theorem failing to hold for an equilibrium policy. Numerical experiments are given to demonstrate our algorithm. The design and implementation of our reinforcement learning algorithm apply to a general market setup.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

学习均衡均值方差策略

我们研究了一个在强化学习框架下的动态中方差投资组合优化问题，其中引入了熵正则化子来进行探索。由于均值方差标准中涉及时间不一致性，我们的目标是学习均衡策略。在不完全市场环境下，我们得到了一个半分析的、探索性的、均衡的均方差策略，它遵循高斯分布。然后，我们关注高斯平均收益模型，并提出了一种强化学习算法来寻找均衡策略。由于我们的算法中有一个彻底设计的策略迭代过程，我们证明了我们的算法在温和条件下的收敛性，尽管动态规划原理和通常的策略改进定理不能适用于均衡策略。数值实验证明了我们的算法。我们的强化学习算法的设计和实现适用于一般的市场设置。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Mathematical Finance 数学-数学跨学科应用

CiteScore

4.10

自引率

6.20%

发文量

审稿时长

>12 weeks

期刊介绍： Mathematical Finance seeks to publish original research articles focused on the development and application of novel mathematical and statistical methods for the analysis of financial problems. The journal welcomes contributions on new statistical methods for the analysis of financial problems. Empirical results will be appropriate to the extent that they illustrate a statistical technique, validate a model or provide insight into a financial problem. Papers whose main contribution rests on empirical results derived with standard approaches will not be considered.

期刊最新文献

Issue Information Issue Information A Cross-Border Market Model with Limited Transmission Capacities Dynamically Consistent Analysis of Realized Covariations in Term Structure Models Pricing and Hedging of SOFR Derivatives