{"title":"Discrete-Time Mean-Variance Strategy Based on Reinforcement Learning","authors":"Xiangyu Cui, Xun Li, Yun Shi, Si Zhao","doi":"arxiv-2312.15385","DOIUrl":null,"url":null,"abstract":"This paper studies a discrete-time mean-variance model based on reinforcement\nlearning. Compared with its continuous-time counterpart in \\cite{zhou2020mv},\nthe discrete-time model makes more general assumptions about the asset's return\ndistribution. Using entropy to measure the cost of exploration, we derive the\noptimal investment strategy, whose density function is also Gaussian type.\nAdditionally, we design the corresponding reinforcement learning algorithm.\nBoth simulation experiments and empirical analysis indicate that our\ndiscrete-time model exhibits better applicability when analyzing real-world\ndata than the continuous-time model.","PeriodicalId":501045,"journal":{"name":"arXiv - QuantFin - Portfolio Management","volume":"106 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - Portfolio Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2312.15385","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper studies a discrete-time mean-variance model based on reinforcement
learning. Compared with its continuous-time counterpart in \cite{zhou2020mv},
the discrete-time model makes more general assumptions about the asset's return
distribution. Using entropy to measure the cost of exploration, we derive the
optimal investment strategy, whose density function is also Gaussian type.
Additionally, we design the corresponding reinforcement learning algorithm.
Both simulation experiments and empirical analysis indicate that our
discrete-time model exhibits better applicability when analyzing real-world
data than the continuous-time model.