Deep Deterministic Policy Gradient for Portfolio Management

Firdaous Khemlichi, Hiba Chougrad, Y. I. Khamlichi, Abdessamad Elboushaki, Safae Elhaj Ben Ali
{"title":"Deep Deterministic Policy Gradient for Portfolio Management","authors":"Firdaous Khemlichi, Hiba Chougrad, Y. I. Khamlichi, Abdessamad Elboushaki, Safae Elhaj Ben Ali","doi":"10.1109/CiSt49399.2021.9357266","DOIUrl":null,"url":null,"abstract":"Portfolio management is a financial problem that has been the subject of much research over the years. It is a planning task where an agent constantly redistributes resources across a set of assets in order to achieve investment objectives and thereby maximize return. However, it remains difficult to obtain an optimal strategy in an environment as complex and dynamic as the financial market. Our article focuses on solving this stochastic control problem in order to obtain an optimal strategy that would allow us to make profitable decisions by interacting directly with the environment. To do this, we explore the power of deep reinforcement learning which differs from traditional Machine Learning by combining the task of predicting stock behavior and analyzing the optimal course of action in a single unit, thus aligning the problem of Machine Learning with the investor's objectives. As a method, we propose to use the Deep Deterministic Policy Gradient which is an off-policy algorithm and is used for environments with continuous action spaces. The obtained results demonstrate that the model achieves a higher rate of return than the strategy of “Uniform Buy and Hold” stocks and the strategy of “Buy Best Stock in last month”.","PeriodicalId":253233,"journal":{"name":"2020 6th IEEE Congress on Information Science and Technology (CiSt)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 6th IEEE Congress on Information Science and Technology (CiSt)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CiSt49399.2021.9357266","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Portfolio management is a financial problem that has been the subject of much research over the years. It is a planning task where an agent constantly redistributes resources across a set of assets in order to achieve investment objectives and thereby maximize return. However, it remains difficult to obtain an optimal strategy in an environment as complex and dynamic as the financial market. Our article focuses on solving this stochastic control problem in order to obtain an optimal strategy that would allow us to make profitable decisions by interacting directly with the environment. To do this, we explore the power of deep reinforcement learning which differs from traditional Machine Learning by combining the task of predicting stock behavior and analyzing the optimal course of action in a single unit, thus aligning the problem of Machine Learning with the investor's objectives. As a method, we propose to use the Deep Deterministic Policy Gradient which is an off-policy algorithm and is used for environments with continuous action spaces. The obtained results demonstrate that the model achieves a higher rate of return than the strategy of “Uniform Buy and Hold” stocks and the strategy of “Buy Best Stock in last month”.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
投资组合管理的深度确定性策略梯度
投资组合管理是一个金融问题,多年来一直是许多研究的主题。它是一项计划任务,其中代理不断地在一组资产上重新分配资源,以实现投资目标,从而最大化回报。然而,在金融市场这样复杂多变的环境中,获得一个最优策略仍然是困难的。我们的文章重点是解决这个随机控制问题,以获得一个最优策略,使我们能够通过直接与环境互动做出有利可图的决策。为了做到这一点,我们探索了深度强化学习的力量,它不同于传统的机器学习,将预测股票行为的任务和分析单个单位的最佳行动过程结合起来,从而使机器学习的问题与投资者的目标保持一致。作为一种方法,我们建议使用深度确定性策略梯度,这是一种非策略算法,用于具有连续动作空间的环境。结果表明,该模型比“均匀买入并持有”策略和“买入上个月最好的股票”策略获得了更高的收益率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Hybrid CNN-CRF Inference Models for 3D Mesh Segmentation An Effective Packet Loss Recovery Scheme Using a Cache Server in IPTV Multicast Service TermInteract: An Online Tool for Terminologists Aimed at Providing Terminology Quality Metrics Proposing solutions with an application server implementing telephony services in the IMS network Corpus and Baseline Model for Domain-Specific Entity Recognition in German
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1