基于优化的频谱端到端深度强化学习,用于股票投资组合管理

IF 5.3 2区 经济学 Q1 BUSINESS, FINANCE Pacific-Basin Finance Journal Pub Date : 2025-06-01 Epub Date: 2025-03-17 DOI:10.1016/j.pacfin.2025.102746
Pengrui Yu , Siya Liu , Chengneng Jin , Runsheng Gu , Xiaomin Gong
{"title":"基于优化的频谱端到端深度强化学习,用于股票投资组合管理","authors":"Pengrui Yu ,&nbsp;Siya Liu ,&nbsp;Chengneng Jin ,&nbsp;Runsheng Gu ,&nbsp;Xiaomin Gong","doi":"10.1016/j.pacfin.2025.102746","DOIUrl":null,"url":null,"abstract":"<div><div>We propose a novel approach to equity portfolio optimization that combines spectral analysis and classical equity portfolio optimization theory with deep reinforcement learning in an end-to-end framework. We introduce the End-to-end Frequency Online Deep Deterministic Policy Gradient (EFO-DDPG) algorithm, which leverages discrete Fourier transform to decompose asset return sequences into frequency components. Unlike traditional methods that treat high-frequency components as noise, EFO-DDPG learns to adjust the influence of different frequency components dynamically. Moreover, the algorithm embeds a mean–variance portfolio optimization problem within a deep learning network, enhancing interpretability compared to black-box approaches. The framework models the investment problem as a Partially Observable Markov Decision Process (POMDP), using a state processing block with transformer encoders to capture complex relationships in the market data. By integrating spectral analysis, portfolio optimization theory, and online deep reinforcement learning, EFO-DDPG aims to adapt to non-stationary financial markets and generate superior investment strategies.</div></div>","PeriodicalId":48074,"journal":{"name":"Pacific-Basin Finance Journal","volume":"91 ","pages":"Article 102746"},"PeriodicalIF":5.3000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimization-based spectral end-to-end deep reinforcement learning for equity portfolio management\",\"authors\":\"Pengrui Yu ,&nbsp;Siya Liu ,&nbsp;Chengneng Jin ,&nbsp;Runsheng Gu ,&nbsp;Xiaomin Gong\",\"doi\":\"10.1016/j.pacfin.2025.102746\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>We propose a novel approach to equity portfolio optimization that combines spectral analysis and classical equity portfolio optimization theory with deep reinforcement learning in an end-to-end framework. We introduce the End-to-end Frequency Online Deep Deterministic Policy Gradient (EFO-DDPG) algorithm, which leverages discrete Fourier transform to decompose asset return sequences into frequency components. Unlike traditional methods that treat high-frequency components as noise, EFO-DDPG learns to adjust the influence of different frequency components dynamically. Moreover, the algorithm embeds a mean–variance portfolio optimization problem within a deep learning network, enhancing interpretability compared to black-box approaches. The framework models the investment problem as a Partially Observable Markov Decision Process (POMDP), using a state processing block with transformer encoders to capture complex relationships in the market data. By integrating spectral analysis, portfolio optimization theory, and online deep reinforcement learning, EFO-DDPG aims to adapt to non-stationary financial markets and generate superior investment strategies.</div></div>\",\"PeriodicalId\":48074,\"journal\":{\"name\":\"Pacific-Basin Finance Journal\",\"volume\":\"91 \",\"pages\":\"Article 102746\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pacific-Basin Finance Journal\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0927538X25000836\",\"RegionNum\":2,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/17 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"BUSINESS, FINANCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pacific-Basin Finance Journal","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0927538X25000836","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/17 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
引用次数: 0

摘要

我们提出了一种新的股票投资组合优化方法,该方法将频谱分析和经典股票投资组合优化理论与端到端框架中的深度强化学习相结合。我们引入了端到端频率在线深度确定性策略梯度(EFO-DDPG)算法,该算法利用离散傅里叶变换将资产返回序列分解为频率分量。与传统方法将高频分量视为噪声不同,EFO-DDPG可以动态地学习调整不同频率分量的影响。此外,该算法在深度学习网络中嵌入均值方差投资组合优化问题,与黑盒方法相比,增强了可解释性。该框架将投资问题建模为部分可观察马尔可夫决策过程(POMDP),使用带有变压器编码器的状态处理块来捕获市场数据中的复杂关系。通过整合谱分析、投资组合优化理论和在线深度强化学习,EFO-DDPG旨在适应非平稳金融市场并产生卓越的投资策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Optimization-based spectral end-to-end deep reinforcement learning for equity portfolio management
We propose a novel approach to equity portfolio optimization that combines spectral analysis and classical equity portfolio optimization theory with deep reinforcement learning in an end-to-end framework. We introduce the End-to-end Frequency Online Deep Deterministic Policy Gradient (EFO-DDPG) algorithm, which leverages discrete Fourier transform to decompose asset return sequences into frequency components. Unlike traditional methods that treat high-frequency components as noise, EFO-DDPG learns to adjust the influence of different frequency components dynamically. Moreover, the algorithm embeds a mean–variance portfolio optimization problem within a deep learning network, enhancing interpretability compared to black-box approaches. The framework models the investment problem as a Partially Observable Markov Decision Process (POMDP), using a state processing block with transformer encoders to capture complex relationships in the market data. By integrating spectral analysis, portfolio optimization theory, and online deep reinforcement learning, EFO-DDPG aims to adapt to non-stationary financial markets and generate superior investment strategies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Pacific-Basin Finance Journal
Pacific-Basin Finance Journal BUSINESS, FINANCE-
CiteScore
6.80
自引率
6.50%
发文量
157
期刊介绍: The Pacific-Basin Finance Journal is aimed at providing a specialized forum for the publication of academic research on capital markets of the Asia-Pacific countries. Primary emphasis will be placed on the highest quality empirical and theoretical research in the following areas: • Market Micro-structure; • Investment and Portfolio Management; • Theories of Market Equilibrium; • Valuation of Financial and Real Assets; • Behavior of Asset Prices in Financial Sectors; • Normative Theory of Financial Management; • Capital Markets of Development; • Market Mechanisms.
期刊最新文献
Economic uncertainty, supply chain transmission, and corporate effective investment Does national culture affect cash holdings of social enterprises? Evidence from microfinance institutions FOF solution selection and application using interpretable machine learning models Small trades, order splitting, and stock returns: Evidence from China's stock markets How volatility model specification affects volatility targeting performance: Evidence from Taiwan
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1