基于优化的频谱端到端深度强化学习，用于股票投资组合管理

IF 5.3 2区经济学 Q1 BUSINESS, FINANCE Pacific-Basin Finance Journal Pub Date : 2025-06-01 Epub Date: 2025-03-17 DOI:10.1016/j.pacfin.2025.102746

Pengrui Yu , Siya Liu , Chengneng Jin , Runsheng Gu , Xiaomin Gong

{"title":"基于优化的频谱端到端深度强化学习，用于股票投资组合管理","authors":"Pengrui Yu , Siya Liu , Chengneng Jin , Runsheng Gu , Xiaomin Gong","doi":"10.1016/j.pacfin.2025.102746","DOIUrl":null,"url":null,"abstract":"<div><div>We propose a novel approach to equity portfolio optimization that combines spectral analysis and classical equity portfolio optimization theory with deep reinforcement learning in an end-to-end framework. We introduce the End-to-end Frequency Online Deep Deterministic Policy Gradient (EFO-DDPG) algorithm, which leverages discrete Fourier transform to decompose asset return sequences into frequency components. Unlike traditional methods that treat high-frequency components as noise, EFO-DDPG learns to adjust the influence of different frequency components dynamically. Moreover, the algorithm embeds a mean–variance portfolio optimization problem within a deep learning network, enhancing interpretability compared to black-box approaches. The framework models the investment problem as a Partially Observable Markov Decision Process (POMDP), using a state processing block with transformer encoders to capture complex relationships in the market data. By integrating spectral analysis, portfolio optimization theory, and online deep reinforcement learning, EFO-DDPG aims to adapt to non-stationary financial markets and generate superior investment strategies.</div></div>","PeriodicalId":48074,"journal":{"name":"Pacific-Basin Finance Journal","volume":"91 ","pages":"Article 102746"},"PeriodicalIF":5.3000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimization-based spectral end-to-end deep reinforcement learning for equity portfolio management\",\"authors\":\"Pengrui Yu , Siya Liu , Chengneng Jin , Runsheng Gu , Xiaomin Gong\",\"doi\":\"10.1016/j.pacfin.2025.102746\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>We propose a novel approach to equity portfolio optimization that combines spectral analysis and classical equity portfolio optimization theory with deep reinforcement learning in an end-to-end framework. We introduce the End-to-end Frequency Online Deep Deterministic Policy Gradient (EFO-DDPG) algorithm, which leverages discrete Fourier transform to decompose asset return sequences into frequency components. Unlike traditional methods that treat high-frequency components as noise, EFO-DDPG learns to adjust the influence of different frequency components dynamically. Moreover, the algorithm embeds a mean–variance portfolio optimization problem within a deep learning network, enhancing interpretability compared to black-box approaches. The framework models the investment problem as a Partially Observable Markov Decision Process (POMDP), using a state processing block with transformer encoders to capture complex relationships in the market data. By integrating spectral analysis, portfolio optimization theory, and online deep reinforcement learning, EFO-DDPG aims to adapt to non-stationary financial markets and generate superior investment strategies.</div></div>\",\"PeriodicalId\":48074,\"journal\":{\"name\":\"Pacific-Basin Finance Journal\",\"volume\":\"91 \",\"pages\":\"Article 102746\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pacific-Basin Finance Journal\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0927538X25000836\",\"RegionNum\":2,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/17 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"BUSINESS, FINANCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pacific-Basin Finance Journal","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0927538X25000836","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/17 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}

引用次数: 0

摘要

我们提出了一种新的股票投资组合优化方法，该方法将频谱分析和经典股票投资组合优化理论与端到端框架中的深度强化学习相结合。我们引入了端到端频率在线深度确定性策略梯度（EFO-DDPG）算法，该算法利用离散傅里叶变换将资产返回序列分解为频率分量。与传统方法将高频分量视为噪声不同，EFO-DDPG可以动态地学习调整不同频率分量的影响。此外，该算法在深度学习网络中嵌入均值方差投资组合优化问题，与黑盒方法相比，增强了可解释性。该框架将投资问题建模为部分可观察马尔可夫决策过程（POMDP），使用带有变压器编码器的状态处理块来捕获市场数据中的复杂关系。通过整合谱分析、投资组合优化理论和在线深度强化学习，EFO-DDPG旨在适应非平稳金融市场并产生卓越的投资策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Optimization-based spectral end-to-end deep reinforcement learning for equity portfolio management

We propose a novel approach to equity portfolio optimization that combines spectral analysis and classical equity portfolio optimization theory with deep reinforcement learning in an end-to-end framework. We introduce the End-to-end Frequency Online Deep Deterministic Policy Gradient (EFO-DDPG) algorithm, which leverages discrete Fourier transform to decompose asset return sequences into frequency components. Unlike traditional methods that treat high-frequency components as noise, EFO-DDPG learns to adjust the influence of different frequency components dynamically. Moreover, the algorithm embeds a mean–variance portfolio optimization problem within a deep learning network, enhancing interpretability compared to black-box approaches. The framework models the investment problem as a Partially Observable Markov Decision Process (POMDP), using a state processing block with transformer encoders to capture complex relationships in the market data. By integrating spectral analysis, portfolio optimization theory, and online deep reinforcement learning, EFO-DDPG aims to adapt to non-stationary financial markets and generate superior investment strategies.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Pacific-Basin Finance Journal BUSINESS, FINANCE-

CiteScore

6.80

自引率

6.50%

发文量

157

期刊介绍： The Pacific-Basin Finance Journal is aimed at providing a specialized forum for the publication of academic research on capital markets of the Asia-Pacific countries. Primary emphasis will be placed on the highest quality empirical and theoretical research in the following areas: • Market Micro-structure; • Investment and Portfolio Management; • Theories of Market Equilibrium; • Valuation of Financial and Real Assets; • Behavior of Asset Prices in Financial Sectors; • Normative Theory of Financial Management; • Capital Markets of Development; • Market Mechanisms.