arXiv - QuantFin - Computational Finance最新文献_第6页

Financial Assets Dependency Prediction Utilizing Spatiotemporal Patterns 利用时空模式预测金融资产依赖性

arXiv - QuantFin - Computational Finance

Pub Date : 2024-06-13 DOI: arxiv-2406.11886

Haoren Zhu, Pengfei Zhao, Wilfred Siu Hung NG, Dik Lun Lee

Financial assets exhibit complex dependency structures, which are crucial forinvestors to create diversified portfolios to mitigate risk in volatilefinancial markets. To explore the financial asset dependencies dynamics, wepropose a novel approach that models the dependencies of assets as an AssetDependency Matrix (ADM) and treats the ADM sequences as image sequences. Thisallows us to leverage deep learning-based video prediction methods to capturethe spatiotemporal dependencies among assets. However, unlike images whereneighboring pixels exhibit explicit spatiotemporal dependencies due to thenatural continuity of object movements, assets in ADM do not have a naturalorder. This poses challenges to organizing the relational assets to revealbetter the spatiotemporal dependencies among neighboring assets for ADMforecasting. To tackle the challenges, we propose the Asset Dependency NeuralNetwork (ADNN), which employs the Convolutional Long Short-Term Memory(ConvLSTM) network, a highly successful method for video prediction. ADNN canemploy static and dynamic transformation functions to optimize therepresentations of the ADM. Through extensive experiments, we demonstrate thatour proposed framework consistently outperforms the baselines in the ADMprediction and downstream application tasks. This research contributes tounderstanding and predicting asset dependencies, offering valuable insights forfinancial market participants.

金融资产表现出复杂的依赖结构，这对于投资者在动荡的金融市场中建立多元化投资组合以降低风险至关重要。为了探索金融资产的依赖动态，我们提出了一种新方法，将资产依赖关系建模为资产依赖矩阵（ADM），并将 ADM 序列视为图像序列。这样，我们就可以利用基于深度学习的视频预测方法来捕捉资产之间的时空依赖关系。然而，与图像不同的是，由于物体运动的自然连续性，相邻像素表现出明确的时空依赖关系，而 ADM 中的资产没有自然顺序。这就给如何组织关联资产以更好地揭示相邻资产之间的时空依赖关系从而进行 ADM 预测带来了挑战。为了应对这些挑战，我们提出了资产依赖神经网络（Asset Dependency NeuralNetwork，ADNN），它采用了卷积长短期记忆（ConvLSTM）网络，这是一种非常成功的视频预测方法。ADNN 可以使用静态和动态变换函数来优化 ADM 的呈现。通过大量实验，我们证明了我们提出的框架在 ADM 预测和下游应用任务中始终优于基线。这项研究有助于理解和预测资产依赖关系，为金融市场参与者提供有价值的见解。

{"title":"Financial Assets Dependency Prediction Utilizing Spatiotemporal Patterns","authors":"Haoren Zhu, Pengfei Zhao, Wilfred Siu Hung NG, Dik Lun Lee","doi":"arxiv-2406.11886","DOIUrl":"https://doi.org/arxiv-2406.11886","url":null,"abstract":"Financial assets exhibit complex dependency structures, which are crucial for\u0000investors to create diversified portfolios to mitigate risk in volatile\u0000financial markets. To explore the financial asset dependencies dynamics, we\u0000propose a novel approach that models the dependencies of assets as an Asset\u0000Dependency Matrix (ADM) and treats the ADM sequences as image sequences. This\u0000allows us to leverage deep learning-based video prediction methods to capture\u0000the spatiotemporal dependencies among assets. However, unlike images where\u0000neighboring pixels exhibit explicit spatiotemporal dependencies due to the\u0000natural continuity of object movements, assets in ADM do not have a natural\u0000order. This poses challenges to organizing the relational assets to reveal\u0000better the spatiotemporal dependencies among neighboring assets for ADM\u0000forecasting. To tackle the challenges, we propose the Asset Dependency Neural\u0000Network (ADNN), which employs the Convolutional Long Short-Term Memory\u0000(ConvLSTM) network, a highly successful method for video prediction. ADNN can\u0000employ static and dynamic transformation functions to optimize the\u0000representations of the ADM. Through extensive experiments, we demonstrate that\u0000our proposed framework consistently outperforms the baselines in the ADM\u0000prediction and downstream application tasks. This research contributes to\u0000understanding and predicting asset dependencies, offering valuable insights for\u0000financial market participants.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"111 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141520791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Stock Movement Prediction with Multimodal Stable Fusion via Gated Cross-Attention Mechanism 通过门控交叉注意力机制进行多模态稳定融合的股票走势预测

arXiv - QuantFin - Computational Finance

Pub Date : 2024-06-06 DOI: arxiv-2406.06594

Chang Zong, Jian Shao, Weiming Lu, Yueting Zhuang

The accurate prediction of stock movements is crucial for investmentstrategies. Stock prices are subject to the influence of various forms ofinformation, including financial indicators, sentiment analysis, newsdocuments, and relational structures. Predominant analytical approaches,however, tend to address only unimodal or bimodal sources, neglecting thecomplexity of multimodal data. Further complicating the landscape are theissues of data sparsity and semantic conflicts between these modalities, whichare frequently overlooked by current models, leading to unstable performanceand limiting practical applicability. To address these shortcomings, this studyintroduces a novel architecture, named Multimodal Stable Fusion with GatedCross-Attention (MSGCA), designed to robustly integrate multimodal input forstock movement prediction. The MSGCA framework consists of three integralcomponents: (1) a trimodal encoding module, responsible for processingindicator sequences, dynamic documents, and a relational graph, andstandardizing their feature representations; (2) a cross-feature fusion module,where primary and consistent features guide the multimodal fusion of the threemodalities via a pair of gated cross-attention networks; and (3) a predictionmodule, which refines the fused features through temporal and dimensionalreduction to execute precise movement forecasting. Empirical evaluationsdemonstrate that the MSGCA framework exceeds current leading methods, achievingperformance gains of 8.1%, 6.1%, 21.7% and 31.6% on four multimodal datasets,respectively, attributed to its enhanced multimodal fusion stability.

准确预测股票走势对投资策略至关重要。股票价格受到各种形式信息的影响，包括金融指标、情绪分析、新闻文件和关系结构。然而，主流的分析方法往往只针对单模态或双模态来源，而忽视了多模态数据的复杂性。数据稀疏性和这些模式之间的语义冲突问题使情况更加复杂，而当前的模型经常忽略这些问题，导致性能不稳定，限制了实际应用性。为了解决这些问题，本研究引入了一种名为 "多模态稳定融合与门控交叉注意（MSGCA）"的新型架构，旨在稳健地整合多模态输入，以进行动物运动预测。MSGCA 框架由三个组成部分组成：(1) 三模态编码模块，负责处理指标序列、动态文档和关系图，并对其特征表示进行标准化处理；(2) 交叉特征融合模块，主要特征和一致特征通过一对门控交叉注意力网络引导三模态的多模态融合；(3) 预测模块，通过时间和维度还原完善融合特征，以执行精确的运动预测。实证评估表明，MSGCA 框架超越了当前的领先方法，在四个多模态数据集上分别实现了 8.1%、6.1%、21.7% 和 31.6% 的性能提升，这归功于其增强的多模态融合稳定性。

{"title":"Stock Movement Prediction with Multimodal Stable Fusion via Gated Cross-Attention Mechanism","authors":"Chang Zong, Jian Shao, Weiming Lu, Yueting Zhuang","doi":"arxiv-2406.06594","DOIUrl":"https://doi.org/arxiv-2406.06594","url":null,"abstract":"The accurate prediction of stock movements is crucial for investment\u0000strategies. Stock prices are subject to the influence of various forms of\u0000information, including financial indicators, sentiment analysis, news\u0000documents, and relational structures. Predominant analytical approaches,\u0000however, tend to address only unimodal or bimodal sources, neglecting the\u0000complexity of multimodal data. Further complicating the landscape are the\u0000issues of data sparsity and semantic conflicts between these modalities, which\u0000are frequently overlooked by current models, leading to unstable performance\u0000and limiting practical applicability. To address these shortcomings, this study\u0000introduces a novel architecture, named Multimodal Stable Fusion with Gated\u0000Cross-Attention (MSGCA), designed to robustly integrate multimodal input for\u0000stock movement prediction. The MSGCA framework consists of three integral\u0000components: (1) a trimodal encoding module, responsible for processing\u0000indicator sequences, dynamic documents, and a relational graph, and\u0000standardizing their feature representations; (2) a cross-feature fusion module,\u0000where primary and consistent features guide the multimodal fusion of the three\u0000modalities via a pair of gated cross-attention networks; and (3) a prediction\u0000module, which refines the fused features through temporal and dimensional\u0000reduction to execute precise movement forecasting. Empirical evaluations\u0000demonstrate that the MSGCA framework exceeds current leading methods, achieving\u0000performance gains of 8.1%, 6.1%, 21.7% and 31.6% on four multimodal datasets,\u0000respectively, attributed to its enhanced multimodal fusion stability.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141501642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Gated recurrent neural network with TPE Bayesian optimization for enhancing stock index prediction accuracy 采用 TPE 贝叶斯优化的门控递归神经网络提高股指预测准确性

arXiv - QuantFin - Computational Finance

Pub Date : 2024-06-02 DOI: arxiv-2406.02604

Bivas Dinda

The recent advancement of deep learning architectures, neural networks, andthe combination of abundant financial data and powerful computers aretransforming finance, leading us to develop an advanced method for predictingfuture stock prices. However, the accessibility of investment and trading ateveryone's fingertips made the stock markets increasingly intricate and proneto volatility. The increased complexity and volatility of the stock market havedriven demand for more models, which would effectively capture high volatilityand non-linear behavior of the different stock prices. This study exploredgated recurrent neural network (GRNN) algorithms such as LSTM (long short-termmemory), GRU (gated recurrent unit), and hybrid models like GRU-LSTM, LSTM-GRU,with Tree-structured Parzen Estimator (TPE) Bayesian optimization forhyperparameter optimization (TPE-GRNN). The aim is to improve the predictionaccuracy of the next day's closing price of the NIFTY 50 index, a prominentIndian stock market index, using TPE-GRNN. A combination of eight influentialfactors is carefully chosen from fundamental stock data, technical indicators,crude oil price, and macroeconomic data to train the models for capturing thechanges in the price of the index with the factors of the broader economy.Single-layer and multi-layer TPE-GRNN models have been developed. The models'performance is evaluated using standard matrices like R2, MAPE, and RMSE. Theanalysis of models' performance reveals the impact of feature selection andhyperparameter optimization (HPO) in enhancing stock index price predictionaccuracy. The results show that the MAPE of our proposed TPE-LSTM method is thelowest (best) with respect to all the previous models for stock index priceprediction.

近年来，深度学习架构、神经网络以及丰富的金融数据和强大的计算机的结合正在改变金融业，促使我们开发出一种预测未来股票价格的先进方法。然而，人人触手可及的投资和交易方式使股票市场变得越来越复杂和易变。股市的复杂性和波动性的增加促使人们需要更多的模型，以有效捕捉不同股票价格的高波动性和非线性行为。本研究探索了门控递归神经网络（GRNN）算法，如 LSTM（长短期记忆）、GRU（门控递归单元）和混合模型，如 GRU-LSTM、LSTM-GRU，以及用于超参数优化的树状结构帕岑估计器（TPE）贝叶斯优化（TPE-GRNN）。目的是利用 TPE-GRNN 提高对印度股市著名指数 NIFTY 50 指数次日收盘价的预测精度。我们从股票基本面数据、技术指标、原油价格和宏观经济数据中精心挑选了八个有影响力的因素组合来训练模型，以捕捉指数价格与更广泛经济因素的变化。模型的性能使用 R2、MAPE 和 RMSE 等标准矩阵进行评估。对模型性能的分析揭示了特征选择和超参数优化（HPO）对提高股指价格预测准确性的影响。结果表明，我们提出的 TPE-LSTM 方法的 MAPE 是之前所有股指价格预测模型中最低（最好）的。

{"title":"Gated recurrent neural network with TPE Bayesian optimization for enhancing stock index prediction accuracy","authors":"Bivas Dinda","doi":"arxiv-2406.02604","DOIUrl":"https://doi.org/arxiv-2406.02604","url":null,"abstract":"The recent advancement of deep learning architectures, neural networks, and\u0000the combination of abundant financial data and powerful computers are\u0000transforming finance, leading us to develop an advanced method for predicting\u0000future stock prices. However, the accessibility of investment and trading at\u0000everyone's fingertips made the stock markets increasingly intricate and prone\u0000to volatility. The increased complexity and volatility of the stock market have\u0000driven demand for more models, which would effectively capture high volatility\u0000and non-linear behavior of the different stock prices. This study explored\u0000gated recurrent neural network (GRNN) algorithms such as LSTM (long short-term\u0000memory), GRU (gated recurrent unit), and hybrid models like GRU-LSTM, LSTM-GRU,\u0000with Tree-structured Parzen Estimator (TPE) Bayesian optimization for\u0000hyperparameter optimization (TPE-GRNN). The aim is to improve the prediction\u0000accuracy of the next day's closing price of the NIFTY 50 index, a prominent\u0000Indian stock market index, using TPE-GRNN. A combination of eight influential\u0000factors is carefully chosen from fundamental stock data, technical indicators,\u0000crude oil price, and macroeconomic data to train the models for capturing the\u0000changes in the price of the index with the factors of the broader economy.\u0000Single-layer and multi-layer TPE-GRNN models have been developed. The models'\u0000performance is evaluated using standard matrices like R2, MAPE, and RMSE. The\u0000analysis of models' performance reveals the impact of feature selection and\u0000hyperparameter optimization (HPO) in enhancing stock index price prediction\u0000accuracy. The results show that the MAPE of our proposed TPE-LSTM method is the\u0000lowest (best) with respect to all the previous models for stock index price\u0000prediction.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141550542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Gaussian Recombining Split Tree 高斯重组分裂树

arXiv - QuantFin - Computational Finance

Pub Date : 2024-05-25 DOI: arxiv-2405.16333

Yury Lebedev, Arunava Banerjee

Binomial trees are widely used in the financial sector for valuing securitieswith early exercise characteristics, such as American stock options. However,while effective in many scenarios, pricing options with CRR binomial trees arelimited. Major limitations are volatility estimation, constant volatilityassumption, subjectivity in parameter choices, and impracticality ofinstantaneous delta hedging. This paper presents a novel tree: GaussianRecombining Split Tree (GRST), which is recombining and does not needlog-normality or normality market assumption. GRST generates a discreteprobability mass function of market data distribution, which approximates aGaussian distribution with known parameters at any chosen time interval. GRSTMixture builds upon the GRST concept while being flexible to fit a large classof market distributions and when given a 1-D time series data and moments ofdistributions at each time interval, fits a Gaussian mixture with the samemixture component probabilities applied at each time interval. GaussianRecombining Split Tre Mixture comprises several GRST tied using Gaussianmixture component probabilities at the first node. Our extensive empiricalanalysis shows that the option prices from the GRST align closely with themarket.

二叉树在金融领域被广泛用于评估具有提前行使特征的证券，如美式股票期权。然而，虽然在许多情况下都很有效，但用 CRR 二叉树为期权定价还是有局限性的。主要限制在于波动率估计、恒定波动率假设、参数选择的主观性以及瞬时三角对冲的不实用性。本文提出了一种新型树：高斯重组分裂树（GRST），它是重组树，不需要逻辑正态性或正态市场假设。GRST 可生成市场数据分布的离散概率质量函数，该函数近似于高斯分布，在任何选定的时间间隔内均具有已知参数。GRSTMixture 建立在 GRST 概念的基础上，可灵活拟合一大类市场分布，当给定一维时间序列数据和每个时间间隔的分布矩时，可拟合出一个高斯混合物，在每个时间间隔应用相同的混合物分量概率。高斯重组分裂混合物由多个 GRST 连接组成，在第一个节点使用高斯混合物成分概率。我们广泛的实证分析表明，GRST 得出的期权价格与市场密切相关。

{"title":"Gaussian Recombining Split Tree","authors":"Yury Lebedev, Arunava Banerjee","doi":"arxiv-2405.16333","DOIUrl":"https://doi.org/arxiv-2405.16333","url":null,"abstract":"Binomial trees are widely used in the financial sector for valuing securities\u0000with early exercise characteristics, such as American stock options. However,\u0000while effective in many scenarios, pricing options with CRR binomial trees are\u0000limited. Major limitations are volatility estimation, constant volatility\u0000assumption, subjectivity in parameter choices, and impracticality of\u0000instantaneous delta hedging. This paper presents a novel tree: Gaussian\u0000Recombining Split Tree (GRST), which is recombining and does not need\u0000log-normality or normality market assumption. GRST generates a discrete\u0000probability mass function of market data distribution, which approximates a\u0000Gaussian distribution with known parameters at any chosen time interval. GRST\u0000Mixture builds upon the GRST concept while being flexible to fit a large class\u0000of market distributions and when given a 1-D time series data and moments of\u0000distributions at each time interval, fits a Gaussian mixture with the same\u0000mixture component probabilities applied at each time interval. Gaussian\u0000Recombining Split Tre Mixture comprises several GRST tied using Gaussian\u0000mixture component probabilities at the first node. Our extensive empirical\u0000analysis shows that the option prices from the GRST align closely with the\u0000market.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"62 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141166277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Inference of Utilities and Time Preference in Sequential Decision-Making 顺序决策中的效用和时间偏好推断

arXiv - QuantFin - Computational Finance

Pub Date : 2024-05-24 DOI: arxiv-2405.15975

Haoyang Cao, Zhengqi Wu, Renyuan Xu

This paper introduces a novel stochastic control framework to enhance thecapabilities of automated investment managers, or robo-advisors, by accuratelyinferring clients' investment preferences from past activities. Our approachleverages a continuous-time model that incorporates utility functions and ageneric discounting scheme of a time-varying rate, tailored to each client'srisk tolerance, valuation of daily consumption, and significant life goals. Weaddress the resulting time inconsistency issue through state augmentation andthe establishment of the dynamic programming principle and the verificationtheorem. Additionally, we provide sufficient conditions for the identifiabilityof client investment preferences. To complement our theoretical developments,we propose a learning algorithm based on maximum likelihood estimation within adiscrete-time Markov Decision Process framework, augmented with entropyregularization. We prove that the log-likelihood function is locally concave,facilitating the fast convergence of our proposed algorithm. Practicaleffectiveness and efficiency are showcased through two numerical examples,including Merton's problem and an investment problem with unhedgeable risks. Our proposed framework not only advances financial technology by improvingpersonalized investment advice but also contributes broadly to other fieldssuch as healthcare, economics, and artificial intelligence, where understandingindividual preferences is crucial.

本文介绍了一种新颖的随机控制框架，通过从过去的活动中准确推断客户的投资偏好来增强自动投资经理或机器人顾问的能力。我们的方法利用了一个连续时间模型，该模型结合了效用函数和时间变化率的通用贴现方案，根据每位客户的风险承受能力、日常消费估值和重要人生目标量身定制。我们通过状态增强以及动态编程原理和验证定理的建立，解决了由此产生的时间不一致性问题。此外，我们还为客户投资偏好的可识别性提供了充分条件。为了补充我们的理论发展，我们在离散时间马尔可夫决策过程框架内提出了一种基于最大似然估计的学习算法，并对其进行了熵正则化处理。我们证明了对数似然函数是局部凹陷的，这有助于我们提出的算法快速收敛。通过两个数值示例，包括默顿问题和不可对冲风险的投资问题，展示了算法的实用性和效率。我们提出的框架不仅通过改进个性化投资建议推动了金融技术的发展，而且还为医疗保健、经济学和人工智能等其他领域做出了广泛贡献，在这些领域，理解个人偏好至关重要。

{"title":"Inference of Utilities and Time Preference in Sequential Decision-Making","authors":"Haoyang Cao, Zhengqi Wu, Renyuan Xu","doi":"arxiv-2405.15975","DOIUrl":"https://doi.org/arxiv-2405.15975","url":null,"abstract":"This paper introduces a novel stochastic control framework to enhance the\u0000capabilities of automated investment managers, or robo-advisors, by accurately\u0000inferring clients' investment preferences from past activities. Our approach\u0000leverages a continuous-time model that incorporates utility functions and a\u0000generic discounting scheme of a time-varying rate, tailored to each client's\u0000risk tolerance, valuation of daily consumption, and significant life goals. We\u0000address the resulting time inconsistency issue through state augmentation and\u0000the establishment of the dynamic programming principle and the verification\u0000theorem. Additionally, we provide sufficient conditions for the identifiability\u0000of client investment preferences. To complement our theoretical developments,\u0000we propose a learning algorithm based on maximum likelihood estimation within a\u0000discrete-time Markov Decision Process framework, augmented with entropy\u0000regularization. We prove that the log-likelihood function is locally concave,\u0000facilitating the fast convergence of our proposed algorithm. Practical\u0000effectiveness and efficiency are showcased through two numerical examples,\u0000including Merton's problem and an investment problem with unhedgeable risks. Our proposed framework not only advances financial technology by improving\u0000personalized investment advice but also contributes broadly to other fields\u0000such as healthcare, economics, and artificial intelligence, where understanding\u0000individual preferences is crucial.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141166351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning 利用强化学习处理非累积目标的决策过程

arXiv - QuantFin - Computational Finance

Pub Date : 2024-05-22 DOI: arxiv-2405.13609

Maximilian Nägele, Jan Olle, Thomas Fösel, Remmy Zen, Florian Marquardt

Markov decision processes (MDPs) are used to model a wide variety ofapplications ranging from game playing over robotics to finance. Their optimalpolicy typically maximizes the expected sum of rewards given at each step ofthe decision process. However, a large class of problems does not fitstraightforwardly into this framework: Non-cumulative Markov decision processes(NCMDPs), where instead of the expected sum of rewards, the expected value ofan arbitrary function of the rewards is maximized. Example functions includethe maximum of the rewards or their mean divided by their standard deviation.In this work, we introduce a general mapping of NCMDPs to standard MDPs. Thisallows all techniques developed to find optimal policies for MDPs, such asreinforcement learning or dynamic programming, to be directly applied to thelarger class of NCMDPs. Focusing on reinforcement learning, we showapplications in a diverse set of tasks, including classical control, portfoliooptimization in finance, and discrete optimization problems. Given ourapproach, we can improve both final performance and training time compared torelying on standard MDPs.

马尔可夫决策过程（Markov decision processes，MDPs）被用来模拟从机器人游戏到金融等各种应用。它们的最优策略通常是最大化决策过程中每一步所给奖励的预期总和。然而，有一大类问题无法直接纳入这一框架：非累积马尔可夫决策过程（NCMDPs），在这类问题中，最大化的不是奖励的预期总和，而是奖励的任意函数的预期值。在这项工作中，我们引入了 NCMDP 与标准 MDP 的一般映射。这使得所有为寻找 MDPs 最佳策略而开发的技术（如强化学习或动态编程）都能直接应用于更大类的 NCMDPs。我们以强化学习为重点，展示了在各种任务中的应用，包括经典控制、金融投资组合优化和离散优化问题。鉴于我们的方法，与依赖标准 MDPs 相比，我们可以提高最终性能并缩短训练时间。

{"title":"Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning","authors":"Maximilian Nägele, Jan Olle, Thomas Fösel, Remmy Zen, Florian Marquardt","doi":"arxiv-2405.13609","DOIUrl":"https://doi.org/arxiv-2405.13609","url":null,"abstract":"Markov decision processes (MDPs) are used to model a wide variety of\u0000applications ranging from game playing over robotics to finance. Their optimal\u0000policy typically maximizes the expected sum of rewards given at each step of\u0000the decision process. However, a large class of problems does not fit\u0000straightforwardly into this framework: Non-cumulative Markov decision processes\u0000(NCMDPs), where instead of the expected sum of rewards, the expected value of\u0000an arbitrary function of the rewards is maximized. Example functions include\u0000the maximum of the rewards or their mean divided by their standard deviation.\u0000In this work, we introduce a general mapping of NCMDPs to standard MDPs. This\u0000allows all techniques developed to find optimal policies for MDPs, such as\u0000reinforcement learning or dynamic programming, to be directly applied to the\u0000larger class of NCMDPs. Focusing on reinforcement learning, we show\u0000applications in a diverse set of tasks, including classical control, portfolio\u0000optimization in finance, and discrete optimization problems. Given our\u0000approach, we can improve both final performance and training time compared to\u0000relying on standard MDPs.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"52 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141151250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Trading Volume Maximization with Online Learning 通过在线学习实现交易量最大化

arXiv - QuantFin - Computational Finance

Pub Date : 2024-05-21 DOI: arxiv-2405.13102

Tommaso Cesari, Roberto Colomboni

We explore brokerage between traders in an online learning framework. At anyround $t$, two traders meet to exchange an asset, provided the exchange ismutually beneficial. The broker proposes a trading price, and each trader triesto sell their asset or buy the asset from the other party, depending on whetherthe price is higher or lower than their private valuations. A trade happens ifone trader is willing to sell and the other is willing to buy at the proposedprice. Previous work provided guidance to a broker aiming at enhancing traders'total earnings by maximizing the gain from trade, defined as the sum of thetraders' net utilities after each interaction. In contrast, we investigate howthe broker should behave to maximize the trading volume, i.e., the total numberof trades. We model the traders' valuations as an i.i.d. process with anunknown distribution. If the traders' valuations are revealed after eachinteraction (full-feedback), and the traders' valuations cumulativedistribution function (cdf) is continuous, we provide an algorithm achievinglogarithmic regret and show its optimality up to constant factors. If onlytheir willingness to sell or buy at the proposed price is revealed after eachinteraction ($2$-bit feedback), we provide an algorithm achievingpoly-logarithmic regret when the traders' valuations cdf is Lipschitz and showthat this rate is near-optimal. We complement our results by analyzing theimplications of dropping the regularity assumptions on the unknown traders'valuations cdf. If we drop the continuous cdf assumption, the regret ratedegrades to $Theta(sqrt{T})$ in the full-feedback case, where $T$ is the timehorizon. If we drop the Lipschitz cdf assumption, learning becomes impossiblein the $2$-bit feedback case.

我们在在线学习框架下探讨交易者之间的经纪活动。在任意一轮 $t$，两个交易者相遇交换资产，前提是交换对双方都有利。经纪人提出一个交易价格，每个交易者根据价格高于或低于他们的私人估值，尝试出售自己的资产或从另一方购买资产。如果一个交易者愿意卖出，而另一方愿意以提议的价格买入，交易就会发生。以前的工作为经纪人提供了指导，旨在通过最大化交易收益来提高交易者的总收益，交易收益被定义为每次互动后交易者的净效用总和。与此相反，我们研究的是经纪商应该如何做才能使交易量（即交易总数）最大化。我们将交易者的估值建模为分布未知的 i.i.d. 过程。如果交易者的估值在每次互动后都被揭示（完全反馈），且交易者的估值累积分布函数（ccd）是连续的，我们提供了一种实现对数遗憾的算法，并证明了其在常数因子以内的最优性。如果每次互动（2 美元位反馈）后只透露交易者按提议价格卖出或买入的意愿，当交易者的估值 cdf 为 Lipschitz 时，我们提供了一种实现对数遗憾的算法，并证明这一比率接近最优。我们通过分析放弃对未知交易者估值 cdf 的正则性假设的影响来补充我们的结果。如果我们放弃连续 cdf 假设，那么在全反馈情况下，后悔率就会降为 $θ(sqrt{T})$，其中 $T$ 是时间跨度。如果我们放弃 Lipschitz cdf 假设，在 2 $ 位反馈的情况下，学习将变得不可能。

{"title":"Trading Volume Maximization with Online Learning","authors":"Tommaso Cesari, Roberto Colomboni","doi":"arxiv-2405.13102","DOIUrl":"https://doi.org/arxiv-2405.13102","url":null,"abstract":"We explore brokerage between traders in an online learning framework. At any\u0000round $t$, two traders meet to exchange an asset, provided the exchange is\u0000mutually beneficial. The broker proposes a trading price, and each trader tries\u0000to sell their asset or buy the asset from the other party, depending on whether\u0000the price is higher or lower than their private valuations. A trade happens if\u0000one trader is willing to sell and the other is willing to buy at the proposed\u0000price. Previous work provided guidance to a broker aiming at enhancing traders'\u0000total earnings by maximizing the gain from trade, defined as the sum of the\u0000traders' net utilities after each interaction. In contrast, we investigate how\u0000the broker should behave to maximize the trading volume, i.e., the total number\u0000of trades. We model the traders' valuations as an i.i.d. process with an\u0000unknown distribution. If the traders' valuations are revealed after each\u0000interaction (full-feedback), and the traders' valuations cumulative\u0000distribution function (cdf) is continuous, we provide an algorithm achieving\u0000logarithmic regret and show its optimality up to constant factors. If only\u0000their willingness to sell or buy at the proposed price is revealed after each\u0000interaction ($2$-bit feedback), we provide an algorithm achieving\u0000poly-logarithmic regret when the traders' valuations cdf is Lipschitz and show\u0000that this rate is near-optimal. We complement our results by analyzing the\u0000implications of dropping the regularity assumptions on the unknown traders'\u0000valuations cdf. If we drop the continuous cdf assumption, the regret rate\u0000degrades to $Theta(sqrt{T})$ in the full-feedback case, where $T$ is the time\u0000horizon. If we drop the Lipschitz cdf assumption, learning becomes impossible\u0000in the $2$-bit feedback case.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"28 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141151260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

NIFTY Financial News Headlines Dataset NIFTY 金融新闻标题数据集

arXiv - QuantFin - Computational Finance

Pub Date : 2024-05-16 DOI: arxiv-2405.09747

Raeid Saqur, Ken Kato, Nicholas Vinden, Frank Rudzicz

We introduce and make publicly available the NIFTY Financial News Headlinesdataset, designed to facilitate and advance research in financial marketforecasting using large language models (LLMs). This dataset comprises twodistinct versions tailored for different modeling approaches: (i) NIFTY-LM,which targets supervised fine-tuning (SFT) of LLMs with an auto-regressive,causal language-modeling objective, and (ii) NIFTY-RL, formatted specificallyfor alignment methods (like reinforcement learning from human feedback (RLHF))to align LLMs via rejection sampling and reward modeling. Each dataset versionprovides curated, high-quality data incorporating comprehensive metadata,market indices, and deduplicated financial news headlines systematicallyfiltered and ranked to suit modern LLM frameworks. We also include experimentsdemonstrating some applications of the dataset in tasks like stock pricemovement and the role of LLM embeddings in information acquisition/richness.The NIFTY dataset along with utilities (like truncating prompt's context lengthsystematically) are available on Hugging Face athttps://huggingface.co/datasets/raeidsaqur/NIFTY.

我们介绍并公开了 NIFTY 金融新闻标题数据集，该数据集旨在促进和推动使用大型语言模型（LLM）进行金融市场预测的研究。该数据集包括两个为不同建模方法量身定制的不同版本：(i) NIFTY-LM，目标是以自动回归、因果语言建模为目标，对 LLM 进行有监督的微调（SFT）；(ii) NIFTY-RL，专门为对齐方法（如来自人类反馈的强化学习（RLHF））设计，通过拒绝采样和奖励建模对 LLM 进行对齐。每个数据集版本都提供了经过整理的高质量数据，其中包含全面的元数据、市场指数和经过系统过滤和排序的重复金融新闻标题，以适应现代 LLM 框架。我们还在实验中展示了该数据集在股票价格变动等任务中的一些应用，以及 LLM 嵌入在信息获取/丰富性中的作用。NIFTY 数据集和实用工具（如系统截断提示上下文长度）可在 Hugging Face 上获取，网址是：https://huggingface.co/datasets/raeidsaqur/NIFTY。

{"title":"NIFTY Financial News Headlines Dataset","authors":"Raeid Saqur, Ken Kato, Nicholas Vinden, Frank Rudzicz","doi":"arxiv-2405.09747","DOIUrl":"https://doi.org/arxiv-2405.09747","url":null,"abstract":"We introduce and make publicly available the NIFTY Financial News Headlines\u0000dataset, designed to facilitate and advance research in financial market\u0000forecasting using large language models (LLMs). This dataset comprises two\u0000distinct versions tailored for different modeling approaches: (i) NIFTY-LM,\u0000which targets supervised fine-tuning (SFT) of LLMs with an auto-regressive,\u0000causal language-modeling objective, and (ii) NIFTY-RL, formatted specifically\u0000for alignment methods (like reinforcement learning from human feedback (RLHF))\u0000to align LLMs via rejection sampling and reward modeling. Each dataset version\u0000provides curated, high-quality data incorporating comprehensive metadata,\u0000market indices, and deduplicated financial news headlines systematically\u0000filtered and ranked to suit modern LLM frameworks. We also include experiments\u0000demonstrating some applications of the dataset in tasks like stock price\u0000movement and the role of LLM embeddings in information acquisition/richness.\u0000The NIFTY dataset along with utilities (like truncating prompt's context length\u0000systematically) are available on Hugging Face at\u0000https://huggingface.co/datasets/raeidsaqur/NIFTY.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141059900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Cost-Benefit Analysis using Modular Dynamic Fault Tree Analysis and Monte Carlo Simulations for Condition-based Maintenance of Unmanned Systems 利用模块化动态故障树分析和蒙特卡罗模拟对无人系统进行基于状态的维护的成本效益分析

arXiv - QuantFin - Computational Finance

Pub Date : 2024-05-15 DOI: arxiv-2405.09519

Joseph M. Southgate, Katrina Groth, Peter Sandborn, Shapour Azarm

Recent developments in condition-based maintenance (CBM) have helped make ita promising approach to maintenance cost avoidance in engineering systems. Byperforming maintenance based on conditions of the component with regards tofailure or time, there is potential to avoid the large costs of system shutdownand maintenance delays. However, CBM requires a large investment cost comparedto other available maintenance strategies. The investment cost is required forresearch, development, and implementation. Despite the potential to avoidsignificant maintenance costs, the large investment cost of CBM makes decisionmakers hesitant to implement. This study is the first in the literature thatattempts to address the problem of conducting a cost-benefit analysis (CBA) forimplementing CBM concepts for unmanned systems. This paper proposes a methodfor conducting a CBA to determine the return on investment (ROI) of potentialCBM strategies. The CBA seeks to compare different CBM strategies based on thedifferences in the various maintenance requirements associated with maintaininga multi-component, unmanned system. The proposed method uses modular dynamicfault tree analysis (MDFTA) with Monte Carlo simulations (MCS) to assess thevarious maintenance requirements. The proposed method is demonstrated on anunmanned surface vessel (USV) example taken from the literature that consistsof 5 subsystems and 71 components. Following this USV example, it is found thatselecting different combinations of components for a CBM strategy can have asignificant impact on maintenance requirements and ROI by impacting costavoidances and investment costs.

基于状态的维护（CBM）的最新发展使其成为避免工程系统维护成本的一种可行方法。通过根据部件的故障或时间状况进行维护，有可能避免系统停机和维护延误所造成的巨大成本。然而，与其他可用的维护策略相比，CBM 需要很大的投资成本。投资成本需要用于研究、开发和实施。尽管 CBM 有可能避免大量的维护成本，但其高昂的投资成本还是让决策者对其实施犹豫不决。本研究是第一篇试图解决为无人系统实施 CBM 概念而进行成本效益分析（CBA）问题的文献。本文提出了一种进行成本效益分析的方法，以确定潜在 CBM 战略的投资回报率（ROI）。CBA 试图根据与维护多组件无人系统相关的各种维护要求的差异，对不同的 CBM 策略进行比较。建议的方法使用模块化动态故障树分析（MDFTA）和蒙特卡罗模拟（MCS）来评估各种维护要求。我们以文献中的一艘无人水面舰艇（USV）为例，对所提出的方法进行了演示，该艇由 5 个子系统和 71 个组件组成。根据这个 USV 例子，我们发现为 CBM 策略选择不同的组件组合会对维护要求和投资回报率产生重大影响，因为这会影响成本避免和投资成本。

{"title":"Cost-Benefit Analysis using Modular Dynamic Fault Tree Analysis and Monte Carlo Simulations for Condition-based Maintenance of Unmanned Systems","authors":"Joseph M. Southgate, Katrina Groth, Peter Sandborn, Shapour Azarm","doi":"arxiv-2405.09519","DOIUrl":"https://doi.org/arxiv-2405.09519","url":null,"abstract":"Recent developments in condition-based maintenance (CBM) have helped make it\u0000a promising approach to maintenance cost avoidance in engineering systems. By\u0000performing maintenance based on conditions of the component with regards to\u0000failure or time, there is potential to avoid the large costs of system shutdown\u0000and maintenance delays. However, CBM requires a large investment cost compared\u0000to other available maintenance strategies. The investment cost is required for\u0000research, development, and implementation. Despite the potential to avoid\u0000significant maintenance costs, the large investment cost of CBM makes decision\u0000makers hesitant to implement. This study is the first in the literature that\u0000attempts to address the problem of conducting a cost-benefit analysis (CBA) for\u0000implementing CBM concepts for unmanned systems. This paper proposes a method\u0000for conducting a CBA to determine the return on investment (ROI) of potential\u0000CBM strategies. The CBA seeks to compare different CBM strategies based on the\u0000differences in the various maintenance requirements associated with maintaining\u0000a multi-component, unmanned system. The proposed method uses modular dynamic\u0000fault tree analysis (MDFTA) with Monte Carlo simulations (MCS) to assess the\u0000various maintenance requirements. The proposed method is demonstrated on an\u0000unmanned surface vessel (USV) example taken from the literature that consists\u0000of 5 subsystems and 71 components. Following this USV example, it is found that\u0000selecting different combinations of components for a CBM strategy can have a\u0000significant impact on maintenance requirements and ROI by impacting cost\u0000avoidances and investment costs.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"32 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141059899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Can machine learning unlock new insights into high-frequency trading? 机器学习能否开启高频交易的新视角？

arXiv - QuantFin - Computational Finance

Pub Date : 2024-05-13 DOI: arxiv-2405.08101

G. Ibikunle, B. Moews, K. Rzayev

We design and train machine learning models to capture the nonlinearinteractions between financial market dynamics and high-frequency trading (HFT)activity. In doing so, we introduce new metrics to identify liquidity-demandingand -supplying HFT strategies. Both types of HFT strategies increase activityin response to information events and decrease it when trading speed isrestricted, with liquidity-supplying strategies demonstrating greaterresponsiveness. Liquidity-demanding HFT is positively linked with latencyarbitrage opportunities, whereas liquidity-supplying HFT is negatively related,aligning with theoretical expectations. Our metrics have implications forunderstanding the information production process in financial markets.

我们设计并训练机器学习模型，以捕捉金融市场动态与高频交易（HFT）活动之间的非线性互动。在此过程中，我们引入了新的指标来识别流动性需求型和供应型 HFT 策略。两种类型的 HFT 策略都会在信息事件发生时增加活动，而在交易速度受限时减少活动，其中流动性供应型策略表现出更强的反应能力。流动性需求型 HFT 与延迟套利机会呈正相关，而流动性供应型 HFT 则呈负相关，这与理论预期一致。我们的度量指标对理解金融市场的信息生产过程具有重要意义。

引用次数: 0