首页 > 最新文献

arXiv - QuantFin - Statistical Finance最新文献

英文 中文
Dimensionality reduction techniques to support insider trading detection 支持内幕交易检测的降维技术
Pub Date : 2024-03-01 DOI: arxiv-2403.00707
Adele Ravagnani, Fabrizio Lillo, Paola Deriu, Piero Mazzarisi, Francesca Medda, Antonio Russo
Identification of market abuse is an extremely complicated activity thatrequires the analysis of large and complex datasets. We propose an unsupervisedmachine learning method for contextual anomaly detection, which allows tosupport market surveillance aimed at identifying potential insider tradingactivities. This method lies in the reconstruction-based paradigm and employsprincipal component analysis and autoencoders as dimensionality reductiontechniques. The only input of this method is the trading position of eachinvestor active on the asset for which we have a price sensitive event (PSE).After determining reconstruction errors related to the trading profiles,several conditions are imposed in order to identify investors whose behaviorcould be suspicious of insider trading related to the PSE. As a case study, weapply our method to investor resolved data of Italian stocks around takeoverbids.
识别市场滥用是一项极其复杂的工作,需要对大量复杂的数据集进行分析。我们提出了一种用于上下文异常检测的无监督机器学习方法,可用于支持旨在识别潜在内部交易活动的市场监控。该方法属于基于重构的范例,采用了主成分分析和自动编码器作为降维技术。在确定了与交易概况相关的重构误差后,我们施加了几个条件,以识别其行为可能涉嫌与 PSE 相关的内幕交易的投资者。作为一项案例研究,我们将我们的方法应用于意大利股票收购要约前后的投资者解决数据。
{"title":"Dimensionality reduction techniques to support insider trading detection","authors":"Adele Ravagnani, Fabrizio Lillo, Paola Deriu, Piero Mazzarisi, Francesca Medda, Antonio Russo","doi":"arxiv-2403.00707","DOIUrl":"https://doi.org/arxiv-2403.00707","url":null,"abstract":"Identification of market abuse is an extremely complicated activity that\u0000requires the analysis of large and complex datasets. We propose an unsupervised\u0000machine learning method for contextual anomaly detection, which allows to\u0000support market surveillance aimed at identifying potential insider trading\u0000activities. This method lies in the reconstruction-based paradigm and employs\u0000principal component analysis and autoencoders as dimensionality reduction\u0000techniques. The only input of this method is the trading position of each\u0000investor active on the asset for which we have a price sensitive event (PSE).\u0000After determining reconstruction errors related to the trading profiles,\u0000several conditions are imposed in order to identify investors whose behavior\u0000could be suspicious of insider trading related to the PSE. As a case study, we\u0000apply our method to investor resolved data of Italian stocks around takeover\u0000bids.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"108 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140034841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MambaStock: Selective state space model for stock prediction MambaStock:用于股票预测的选择性状态空间模型
Pub Date : 2024-02-29 DOI: arxiv-2402.18959
Zhuangwei Shi
The stock market plays a pivotal role in economic development, yet itsintricate volatility poses challenges for investors. Consequently, research andaccurate predictions of stock price movements are crucial for mitigating risks.Traditional time series models fall short in capturing nonlinearity, leading tounsatisfactory stock predictions. This limitation has spurred the widespreadadoption of neural networks for stock prediction, owing to their robustnonlinear generalization capabilities. Recently, Mamba, a structured statespace sequence model with a selection mechanism and scan module (S6), hasemerged as a powerful tool in sequence modeling tasks. Leveraging thisframework, this paper proposes a novel Mamba-based model for stock priceprediction, named MambaStock. The proposed MambaStock model effectively mineshistorical stock market data to predict future stock prices without handcraftedfeatures or extensive preprocessing procedures. Empirical studies on severalstocks indicate that the MambaStock model outperforms previous methods,delivering highly accurate predictions. This enhanced accuracy can assistinvestors and institutions in making informed decisions, aiming to maximizereturns while minimizing risks. This work underscores the value of Mamba intime-series forecasting. Source code is available athttps://github.com/zshicode/MambaStock.
股票市场在经济发展中起着举足轻重的作用,但其错综复杂的波动性给投资者带来了挑战。因此,研究并准确预测股价走势对于降低风险至关重要。传统的时间序列模型在捕捉非线性方面存在不足,导致股票预测结果不尽人意。由于神经网络具有强大的非线性泛化能力,这一局限性促使神经网络被广泛应用于股票预测。最近,具有选择机制和扫描模块(S6)的结构化状态空间序列模型 Mamba 成为序列建模任务中的有力工具。利用这一框架,本文提出了一种基于 Mamba 的新型股票价格预测模型,命名为 MambaStock。所提出的 MambaStock 模型可以有效地挖掘历史股票市场数据来预测未来股票价格,而无需手工制作特征或大量预处理程序。对几种股票的实证研究表明,MambaStock 模型优于以前的方法,能提供高精度的预测。这种更高的准确性可以帮助投资者和机构做出明智的决策,从而实现收益最大化和风险最小化。这项工作强调了 Mamba intime 系列预测的价值。源代码可在https://github.com/zshicode/MambaStock。
{"title":"MambaStock: Selective state space model for stock prediction","authors":"Zhuangwei Shi","doi":"arxiv-2402.18959","DOIUrl":"https://doi.org/arxiv-2402.18959","url":null,"abstract":"The stock market plays a pivotal role in economic development, yet its\u0000intricate volatility poses challenges for investors. Consequently, research and\u0000accurate predictions of stock price movements are crucial for mitigating risks.\u0000Traditional time series models fall short in capturing nonlinearity, leading to\u0000unsatisfactory stock predictions. This limitation has spurred the widespread\u0000adoption of neural networks for stock prediction, owing to their robust\u0000nonlinear generalization capabilities. Recently, Mamba, a structured state\u0000space sequence model with a selection mechanism and scan module (S6), has\u0000emerged as a powerful tool in sequence modeling tasks. Leveraging this\u0000framework, this paper proposes a novel Mamba-based model for stock price\u0000prediction, named MambaStock. The proposed MambaStock model effectively mines\u0000historical stock market data to predict future stock prices without handcrafted\u0000features or extensive preprocessing procedures. Empirical studies on several\u0000stocks indicate that the MambaStock model outperforms previous methods,\u0000delivering highly accurate predictions. This enhanced accuracy can assist\u0000investors and institutions in making informed decisions, aiming to maximize\u0000returns while minimizing risks. This work underscores the value of Mamba in\u0000time-series forecasting. Source code is available at\u0000https://github.com/zshicode/MambaStock.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140010546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Navigating Complexity: Constrained Portfolio Analysis in High Dimensions with Tracking Error and Weight Constraints 驾驭复杂性:具有跟踪误差和权重约束的高维度受限投资组合分析
Pub Date : 2024-02-27 DOI: arxiv-2402.17523
Mehmet Caner, Qingliang Fan, Yingying Li
This paper analyzes the statistical properties of constrained portfolioformation in a high dimensional portfolio with a large number of assets.Namely, we consider portfolios with tracking error constraints, portfolios withtracking error jointly with weight (equality or inequality) restrictions, andportfolios with only weight restrictions. Tracking error is the portfolio'sperformance measured against a benchmark (an index usually), {color{black}{andweight constraints refers to specific allocation of assets within theportfolio, which often come in the form of regulatory requirement or fundprospectus.}} We show how these portfolios can be estimated consistently inlarge dimensions, even when the number of assets is larger than the time spanof the portfolio. We also provide rate of convergence results for weights ofthe constrained portfolio, risk of the constrained portfolio and the SharpeRatio of the constrained portfolio. To achieve those results we use a newmachine learning technique that merges factor models with nodewise regressionin statistics. Simulation results and empirics show very good performance ofour method.
本文分析了具有大量资产的高维投资组合中受约束投资组合形式的统计特性。也就是说,我们考虑了具有跟踪误差约束的投资组合、具有跟踪误差与权重(相等或不相等)联合约束的投资组合以及仅具有权重约束的投资组合。跟踪误差是指投资组合相对于基准(通常是指数)的表现,{color{black}{而权重限制是指投资组合内资产的具体分配,通常以监管要求或基金说明书的形式出现。}我们展示了这些投资组合如何在大维度上进行一致估计,即使资产数量大于投资组合的时间跨度。我们还提供了受约束投资组合权重、受约束投资组合风险和受约束投资组合夏普比率的收敛率结果。为了获得这些结果,我们使用了一种新的机器学习技术,该技术将因子模型与统计中的节点回归相结合。模拟结果和经验表明,我们的方法性能非常好。
{"title":"Navigating Complexity: Constrained Portfolio Analysis in High Dimensions with Tracking Error and Weight Constraints","authors":"Mehmet Caner, Qingliang Fan, Yingying Li","doi":"arxiv-2402.17523","DOIUrl":"https://doi.org/arxiv-2402.17523","url":null,"abstract":"This paper analyzes the statistical properties of constrained portfolio\u0000formation in a high dimensional portfolio with a large number of assets.\u0000Namely, we consider portfolios with tracking error constraints, portfolios with\u0000tracking error jointly with weight (equality or inequality) restrictions, and\u0000portfolios with only weight restrictions. Tracking error is the portfolio's\u0000performance measured against a benchmark (an index usually), {color{black}{and\u0000weight constraints refers to specific allocation of assets within the\u0000portfolio, which often come in the form of regulatory requirement or fund\u0000prospectus.}} We show how these portfolios can be estimated consistently in\u0000large dimensions, even when the number of assets is larger than the time span\u0000of the portfolio. We also provide rate of convergence results for weights of\u0000the constrained portfolio, risk of the constrained portfolio and the Sharpe\u0000Ratio of the constrained portfolio. To achieve those results we use a new\u0000machine learning technique that merges factor models with nodewise regression\u0000in statistics. Simulation results and empirics show very good performance of\u0000our method.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"53 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140011231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CaT-GNN: Enhancing Credit Card Fraud Detection via Causal Temporal Graph Neural Networks CaT-GNN:通过因果时序图神经网络加强信用卡欺诈检测
Pub Date : 2024-02-22 DOI: arxiv-2402.14708
Yifan Duan, Guibin Zhang, Shilong Wang, Xiaojiang Peng, Wang Ziqi, Junyuan Mao, Hao Wu, Xinke Jiang, Kun Wang
Credit card fraud poses a significant threat to the economy. While GraphNeural Network (GNN)-based fraud detection methods perform well, they oftenoverlook the causal effect of a node's local structure on predictions. Thispaper introduces a novel method for credit card fraud detection, thetextbf{underline{Ca}}usal textbf{underline{T}}emporaltextbf{underline{G}}raph textbf{underline{N}}eural textbf{N}etwork(CaT-GNN), which leverages causal invariant learning to reveal inherentcorrelations within transaction data. By decomposing the problem into discoveryand intervention phases, CaT-GNN identifies causal nodes within the transactiongraph and applies a causal mixup strategy to enhance the model's robustness andinterpretability. CaT-GNN consists of two key components: Causal-Inspector andCausal-Intervener. The Causal-Inspector utilizes attention weights in thetemporal attention mechanism to identify causal and environment nodes withoutintroducing additional parameters. Subsequently, the Causal-Intervener performsa causal mixup enhancement on environment nodes based on the set of nodes.Evaluated on three datasets, including a private financial dataset and twopublic datasets, CaT-GNN demonstrates superior performance over existingstate-of-the-art methods. Our findings highlight the potential of integratingcausal reasoning with graph neural networks to improve fraud detectioncapabilities in financial transactions.
信用卡欺诈对经济构成了重大威胁。虽然基于图形神经网络(GNN)的欺诈检测方法性能良好,但它们往往忽略了节点的局部结构对预测的因果影响。本文介绍了一种新颖的信用卡欺诈检测方法--因果不变学习(CaT-GNN),它利用因果不变学习来揭示交易数据中固有的相关性。通过将问题分解为发现阶段和干预阶段,CaT-GNN 可识别交易图中的因果节点,并应用因果混合策略来增强模型的鲁棒性和可解释性。CaT-GNN 由两个关键组件组成:因果检测器(Causal-Inspector)和因果干预器(Causal-Intervener)。因果检测器利用时态注意力机制中的注意力权重来识别因果节点和环境节点,而无需引入额外参数。在三个数据集(包括一个私人金融数据集和两个公共数据集)上进行的评估显示,CaT-GNN 的性能优于现有的最先进方法。我们的研究结果凸显了将因果推理与图神经网络相结合以提高金融交易欺诈检测能力的潜力。
{"title":"CaT-GNN: Enhancing Credit Card Fraud Detection via Causal Temporal Graph Neural Networks","authors":"Yifan Duan, Guibin Zhang, Shilong Wang, Xiaojiang Peng, Wang Ziqi, Junyuan Mao, Hao Wu, Xinke Jiang, Kun Wang","doi":"arxiv-2402.14708","DOIUrl":"https://doi.org/arxiv-2402.14708","url":null,"abstract":"Credit card fraud poses a significant threat to the economy. While Graph\u0000Neural Network (GNN)-based fraud detection methods perform well, they often\u0000overlook the causal effect of a node's local structure on predictions. This\u0000paper introduces a novel method for credit card fraud detection, the\u0000textbf{underline{Ca}}usal textbf{underline{T}}emporal\u0000textbf{underline{G}}raph textbf{underline{N}}eural textbf{N}etwork\u0000(CaT-GNN), which leverages causal invariant learning to reveal inherent\u0000correlations within transaction data. By decomposing the problem into discovery\u0000and intervention phases, CaT-GNN identifies causal nodes within the transaction\u0000graph and applies a causal mixup strategy to enhance the model's robustness and\u0000interpretability. CaT-GNN consists of two key components: Causal-Inspector and\u0000Causal-Intervener. The Causal-Inspector utilizes attention weights in the\u0000temporal attention mechanism to identify causal and environment nodes without\u0000introducing additional parameters. Subsequently, the Causal-Intervener performs\u0000a causal mixup enhancement on environment nodes based on the set of nodes.\u0000Evaluated on three datasets, including a private financial dataset and two\u0000public datasets, CaT-GNN demonstrates superior performance over existing\u0000state-of-the-art methods. Our findings highlight the potential of integrating\u0000causal reasoning with graph neural networks to improve fraud detection\u0000capabilities in financial transactions.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"41 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139949644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantifying neural network uncertainty under volatility clustering 量化波动聚类下的神经网络不确定性
Pub Date : 2024-02-22 DOI: arxiv-2402.14476
Steven Y. K. Wong, Jennifer S. K. Chan, Lamiae Azizi
Time-series with time-varying variance pose a unique challenge to uncertaintyquantification (UQ) methods. Time-varying variance, such as volatilityclustering as seen in financial time-series, can lead to large mismatch betweenpredicted uncertainty and forecast error. Building on recent advances in neuralnetwork UQ literature, we extend and simplify Deep Evidential Regression andDeep Ensembles into a unified framework to deal with UQ under the presence ofvolatility clustering. We show that a Scale Mixture Distribution is a simpleralternative to the Normal-Inverse-Gamma prior that provides favorablecomplexity-accuracy trade-off. To illustrate the performance of our proposedapproach, we apply it to two sets of financial time-series exhibitingvolatility clustering: cryptocurrencies and U.S. equities.
具有时变方差的时间序列对不确定性量化(UQ)方法提出了独特的挑战。时变方差,如金融时间序列中的波动性集群,会导致预测不确定性与预测误差之间的巨大不匹配。基于神经网络 UQ 文献的最新进展,我们将深度证据回归和深度集合扩展并简化为一个统一的框架,以处理存在波动率聚类情况下的 UQ。我们表明,规模混合分布是正负伽马先验的一个简单替代方案,它能提供有利的复杂性-准确性权衡。为了说明我们提出的方法的性能,我们将其应用于两组表现出波动性聚类的金融时间序列:加密货币和美国股票。
{"title":"Quantifying neural network uncertainty under volatility clustering","authors":"Steven Y. K. Wong, Jennifer S. K. Chan, Lamiae Azizi","doi":"arxiv-2402.14476","DOIUrl":"https://doi.org/arxiv-2402.14476","url":null,"abstract":"Time-series with time-varying variance pose a unique challenge to uncertainty\u0000quantification (UQ) methods. Time-varying variance, such as volatility\u0000clustering as seen in financial time-series, can lead to large mismatch between\u0000predicted uncertainty and forecast error. Building on recent advances in neural\u0000network UQ literature, we extend and simplify Deep Evidential Regression and\u0000Deep Ensembles into a unified framework to deal with UQ under the presence of\u0000volatility clustering. We show that a Scale Mixture Distribution is a simpler\u0000alternative to the Normal-Inverse-Gamma prior that provides favorable\u0000complexity-accuracy trade-off. To illustrate the performance of our proposed\u0000approach, we apply it to two sets of financial time-series exhibiting\u0000volatility clustering: cryptocurrencies and U.S. equities.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"67 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139949666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applying News and Media Sentiment Analysis for Generating Forex Trading Signals 应用新闻和媒体情绪分析生成外汇交易信号
Pub Date : 2024-02-19 DOI: arxiv-2403.00785
Oluwafemi F Olaiyapo
The objective of this research is to examine how sentiment analysis can beemployed to generate trading signals for the Foreign Exchange (Forex) market.The author assessed sentiment in social media posts and news articlespertaining to the United States Dollar (USD) using a combination of methods:lexicon-based analysis and the Naive Bayes machine learning algorithm. Thefindings indicate that sentiment analysis proves valuable in forecasting marketmovements and devising trading signals. Notably, its effectiveness isconsistent across different market conditions. The author concludes that byanalyzing sentiment expressed in news and social media, traders can gleaninsights into prevailing market sentiments towards the USD and other pertinentcountries, thereby aiding trading decision-making. This study underscores theimportance of weaving sentiment analysis into trading strategies as a pivotaltool for predicting market dynamics.
本研究的目的是探讨如何利用情感分析来生成外汇市场的交易信号。作者采用基于词典的分析和奈维贝叶斯(Naive Bayes)机器学习算法相结合的方法,评估了与美元(USD)有关的社交媒体帖子和新闻文章中的情感。研究结果表明,情感分析在预测市场动向和设计交易信号方面很有价值。值得注意的是,其有效性在不同的市场条件下是一致的。作者总结道,通过分析新闻和社交媒体中表达的情绪,交易者可以洞察市场对美元和其他相关国家的普遍情绪,从而帮助做出交易决策。这项研究强调了将情绪分析作为预测市场动态的重要工具纳入交易策略的重要性。
{"title":"Applying News and Media Sentiment Analysis for Generating Forex Trading Signals","authors":"Oluwafemi F Olaiyapo","doi":"arxiv-2403.00785","DOIUrl":"https://doi.org/arxiv-2403.00785","url":null,"abstract":"The objective of this research is to examine how sentiment analysis can be\u0000employed to generate trading signals for the Foreign Exchange (Forex) market.\u0000The author assessed sentiment in social media posts and news articles\u0000pertaining to the United States Dollar (USD) using a combination of methods:\u0000lexicon-based analysis and the Naive Bayes machine learning algorithm. The\u0000findings indicate that sentiment analysis proves valuable in forecasting market\u0000movements and devising trading signals. Notably, its effectiveness is\u0000consistent across different market conditions. The author concludes that by\u0000analyzing sentiment expressed in news and social media, traders can glean\u0000insights into prevailing market sentiments towards the USD and other pertinent\u0000countries, thereby aiding trading decision-making. This study underscores the\u0000importance of weaving sentiment analysis into trading strategies as a pivotal\u0000tool for predicting market dynamics.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"52 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140034646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ploutos: Towards interpretable stock movement prediction with financial large language model Ploutos:利用金融大型语言模型实现可解释的股票走势预测
Pub Date : 2024-02-18 DOI: arxiv-2403.00782
Hanshuang Tong, Jun Li, Ning Wu, Ming Gong, Dongmei Zhang, Qi Zhang
Recent advancements in large language models (LLMs) have opened new pathwaysfor many domains. However, the full potential of LLMs in financial investmentsremains largely untapped. There are two main challenges for typical deeplearning-based methods for quantitative finance. First, they struggle to fusetextual and numerical information flexibly for stock movement prediction.Second, traditional methods lack clarity and interpretability, which impedestheir application in scenarios where the justification for predictions isessential. To solve the above challenges, we propose Ploutos, a novel financialLLM framework that consists of PloutosGen and PloutosGPT. The PloutosGencontains multiple primary experts that can analyze different modal data, suchas text and numbers, and provide quantitative strategies from differentperspectives. Then PloutosGPT combines their insights and predictions andgenerates interpretable rationales. To generate accurate and faithfulrationales, the training strategy of PloutosGPT leverage rearview-mirrorprompting mechanism to guide GPT-4 to generate rationales, and a dynamic tokenweighting mechanism to finetune LLM by increasing key tokens weight. Extensiveexperiments show our framework outperforms the state-of-the-art methods on bothprediction accuracy and interpretability.
大型语言模型(LLM)的最新进展为许多领域开辟了新的途径。然而,LLMs 在金融投资领域的全部潜力在很大程度上仍未得到开发。基于深度学习的典型量化金融方法面临两大挑战。其次,传统方法缺乏清晰度和可解释性,这阻碍了它们在预测理由至关重要的场景中的应用。为了解决上述难题,我们提出了 Ploutos,一个由 PloutosGen 和 PloutosGPT 组成的新型金融LLM 框架。PloutosGen 包含多个初级专家,他们可以分析不同的模态数据,如文本和数字,并从不同角度提供量化策略。然后,PloutosGPT 结合他们的见解和预测,生成可解释的理由。为了生成准确、忠实的理由,PloutosGPT 的训练策略利用后视镜提示机制引导 GPT-4 生成理由,并利用动态标记加权机制通过增加关键标记的权重对 LLM 进行微调。广泛的实验表明,我们的框架在预测准确性和可解释性方面都优于最先进的方法。
{"title":"Ploutos: Towards interpretable stock movement prediction with financial large language model","authors":"Hanshuang Tong, Jun Li, Ning Wu, Ming Gong, Dongmei Zhang, Qi Zhang","doi":"arxiv-2403.00782","DOIUrl":"https://doi.org/arxiv-2403.00782","url":null,"abstract":"Recent advancements in large language models (LLMs) have opened new pathways\u0000for many domains. However, the full potential of LLMs in financial investments\u0000remains largely untapped. There are two main challenges for typical deep\u0000learning-based methods for quantitative finance. First, they struggle to fuse\u0000textual and numerical information flexibly for stock movement prediction.\u0000Second, traditional methods lack clarity and interpretability, which impedes\u0000their application in scenarios where the justification for predictions is\u0000essential. To solve the above challenges, we propose Ploutos, a novel financial\u0000LLM framework that consists of PloutosGen and PloutosGPT. The PloutosGen\u0000contains multiple primary experts that can analyze different modal data, such\u0000as text and numbers, and provide quantitative strategies from different\u0000perspectives. Then PloutosGPT combines their insights and predictions and\u0000generates interpretable rationales. To generate accurate and faithful\u0000rationales, the training strategy of PloutosGPT leverage rearview-mirror\u0000prompting mechanism to guide GPT-4 to generate rationales, and a dynamic token\u0000weighting mechanism to finetune LLM by increasing key tokens weight. Extensive\u0000experiments show our framework outperforms the state-of-the-art methods on both\u0000prediction accuracy and interpretability.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140034385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RAGIC: Risk-Aware Generative Adversarial Model for Stock Interval Construction RAGIC:股票区间构建的风险意识生成对抗模型
Pub Date : 2024-02-16 DOI: arxiv-2402.10760
Jingyi Gu, Wenlu Du, Guiling Wang
Efforts to predict stock market outcomes have yielded limited success due tothe inherently stochastic nature of the market, influenced by numerousunpredictable factors. Many existing prediction approaches focus onsingle-point predictions, lacking the depth needed for effectivedecision-making and often overlooking market risk. To bridge this gap, wepropose a novel model, RAGIC, which introduces sequence generation for stockinterval prediction to quantify uncertainty more effectively. Our approachleverages a Generative Adversarial Network (GAN) to produce future pricesequences infused with randomness inherent in financial markets. RAGIC'sgenerator includes a risk module, capturing the risk perception of informedinvestors, and a temporal module, accounting for historical price trends andseasonality. This multi-faceted generator informs the creation ofrisk-sensitive intervals through statistical inference, incorporatinghorizon-wise insights. The interval's width is carefully adjusted to reflectmarket volatility. Importantly, our approach relies solely on publiclyavailable data and incurs only low computational overhead. RAGIC's evaluationacross globally recognized broad-based indices demonstrates its balancedperformance, offering both accuracy and informativeness. Achieving a consistent95% coverage, RAGIC maintains a narrow interval width. This promising outcomesuggests that our approach effectively addresses the challenges of stock marketprediction while incorporating vital risk considerations.
由于市场本身具有随机性,受到众多不可预测因素的影响,预测股票市场结果的努力成果有限。许多现有的预测方法侧重于单点预测,缺乏有效决策所需的深度,而且往往忽略了市场风险。为了弥补这一缺陷,我们提出了一个新颖的模型 RAGIC,它为股票区间预测引入了序列生成,从而更有效地量化不确定性。我们的方法利用生成对抗网络(GAN)生成未来价格序列,并在其中注入金融市场固有的随机性。RAGIC 的生成器包括一个风险模块(捕捉知情投资者的风险意识)和一个时间模块(考虑历史价格趋势和季节性)。这个多方面的生成器通过统计推断,结合远景洞察力,为创建风险敏感区间提供信息。区间的宽度经过仔细调整,以反映市场波动性。重要的是,我们的方法完全依赖于公开数据,计算开销很低。RAGIC 在全球公认的宽基指数中的评估结果表明,它在准确性和信息量方面表现均衡。RAGIC 的覆盖率始终保持在 95%,并且保持了较窄的区间宽度。这一令人鼓舞的结果表明,我们的方法有效地解决了股票市场预测的难题,同时纳入了重要的风险考虑因素。
{"title":"RAGIC: Risk-Aware Generative Adversarial Model for Stock Interval Construction","authors":"Jingyi Gu, Wenlu Du, Guiling Wang","doi":"arxiv-2402.10760","DOIUrl":"https://doi.org/arxiv-2402.10760","url":null,"abstract":"Efforts to predict stock market outcomes have yielded limited success due to\u0000the inherently stochastic nature of the market, influenced by numerous\u0000unpredictable factors. Many existing prediction approaches focus on\u0000single-point predictions, lacking the depth needed for effective\u0000decision-making and often overlooking market risk. To bridge this gap, we\u0000propose a novel model, RAGIC, which introduces sequence generation for stock\u0000interval prediction to quantify uncertainty more effectively. Our approach\u0000leverages a Generative Adversarial Network (GAN) to produce future price\u0000sequences infused with randomness inherent in financial markets. RAGIC's\u0000generator includes a risk module, capturing the risk perception of informed\u0000investors, and a temporal module, accounting for historical price trends and\u0000seasonality. This multi-faceted generator informs the creation of\u0000risk-sensitive intervals through statistical inference, incorporating\u0000horizon-wise insights. The interval's width is carefully adjusted to reflect\u0000market volatility. Importantly, our approach relies solely on publicly\u0000available data and incurs only low computational overhead. RAGIC's evaluation\u0000across globally recognized broad-based indices demonstrates its balanced\u0000performance, offering both accuracy and informativeness. Achieving a consistent\u000095% coverage, RAGIC maintains a narrow interval width. This promising outcome\u0000suggests that our approach effectively addresses the challenges of stock market\u0000prediction while incorporating vital risk considerations.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"23 6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139902514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Financially Inclusive Credit Products Through Financial Time Series Clustering 通过金融时间序列聚类开发普惠金融信贷产品
Pub Date : 2024-02-16 DOI: arxiv-2402.11066
Tristan Bester, Benjamin Rosman
Financial inclusion ensures that individuals have access to financialproducts and services that meet their needs. As a key contributing factor toeconomic growth and investment opportunity, financial inclusion increasesconsumer spending and consequently business development. It has been shown thatinstitutions are more profitable when they provide marginalised social groupsaccess to financial services. Customer segmentation based on consumertransaction data is a well-known strategy used to promote financial inclusion.While the required data is available to modern institutions, the challengeremains that segment annotations are usually difficult and/or expensive toobtain. This prevents the usage of time series classification models forcustomer segmentation based on domain expert knowledge. As a result, clusteringis an attractive alternative to partition customers into homogeneous groupsbased on the spending behaviour encoded within their transaction data. In thispaper, we present a solution to one of the key challenges preventing modernfinancial institutions from providing financially inclusive credit, savings andinsurance products: the inability to understand consumer financial behaviour,and hence risk, without the introduction of restrictive conventional creditscoring techniques. We present a novel time series clustering algorithm thatallows institutions to understand the financial behaviour of their customers.This enables unique product offerings to be provided based on the needs of thecustomer, without reliance on restrictive credit practices.
金融包容性确保个人能够获得满足其需求的金融产品和服务。作为促进经济增长和投资机会的一个关键因素,普惠金融增加了消费者支出,从而促进了企业发展。事实证明,当机构为边缘化社会群体提供金融服务时,其盈利能力会更强。虽然现代机构可以获得所需的数据,但面临的挑战仍然是细分市场注释通常难以获得和/或价格昂贵。这阻碍了基于领域专家知识的客户细分时间序列分类模型的使用。因此,聚类是一种有吸引力的替代方法,可根据交易数据中编码的消费行为将客户划分为同质群体。在本文中,我们针对阻碍现代金融机构提供金融包容性信贷、储蓄和保险产品的主要挑战之一提出了一个解决方案:在不引入限制性传统信用评分技术的情况下,无法了解消费者的金融行为,从而无法了解风险。我们提出了一种新颖的时间序列聚类算法,允许金融机构了解客户的金融行为,从而能够根据客户的需求提供独特的产品,而不依赖于限制性的信贷做法。
{"title":"Towards Financially Inclusive Credit Products Through Financial Time Series Clustering","authors":"Tristan Bester, Benjamin Rosman","doi":"arxiv-2402.11066","DOIUrl":"https://doi.org/arxiv-2402.11066","url":null,"abstract":"Financial inclusion ensures that individuals have access to financial\u0000products and services that meet their needs. As a key contributing factor to\u0000economic growth and investment opportunity, financial inclusion increases\u0000consumer spending and consequently business development. It has been shown that\u0000institutions are more profitable when they provide marginalised social groups\u0000access to financial services. Customer segmentation based on consumer\u0000transaction data is a well-known strategy used to promote financial inclusion.\u0000While the required data is available to modern institutions, the challenge\u0000remains that segment annotations are usually difficult and/or expensive to\u0000obtain. This prevents the usage of time series classification models for\u0000customer segmentation based on domain expert knowledge. As a result, clustering\u0000is an attractive alternative to partition customers into homogeneous groups\u0000based on the spending behaviour encoded within their transaction data. In this\u0000paper, we present a solution to one of the key challenges preventing modern\u0000financial institutions from providing financially inclusive credit, savings and\u0000insurance products: the inability to understand consumer financial behaviour,\u0000and hence risk, without the introduction of restrictive conventional credit\u0000scoring techniques. We present a novel time series clustering algorithm that\u0000allows institutions to understand the financial behaviour of their customers.\u0000This enables unique product offerings to be provided based on the needs of the\u0000customer, without reliance on restrictive credit practices.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"140 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139925578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Emoji Driven Crypto Assets Market Reactions 表情符号驱动的加密资产市场反应
Pub Date : 2024-02-16 DOI: arxiv-2402.10481
Xiaorui Zuo, Yao-Tsung Chen, Wolfgang Karl Härdle
In the burgeoning realm of cryptocurrency, social media platforms likeTwitter have become pivotal in influencing market trends and investorsentiments. In our study, we leverage GPT-4 and a fine-tuned transformer-basedBERT model for a multimodal sentiment analysis, focusing on the impact of emojisentiment on cryptocurrency markets. By translating emojis into quantifiablesentiment data, we correlate these insights with key market indicators like BTCPrice and the VCRIX index. This approach may be fed into the development oftrading strategies aimed at utilizing social media elements to identify andforecast market trends. Crucially, our findings suggest that strategies basedon emoji sentiment can facilitate the avoidance of significant market downturnsand contribute to the stabilization of returns. This research underscores thepractical benefits of integrating advanced AI-driven analyses into financialstrategies, offering a nuanced perspective on the interplay between digitalcommunication and market dynamics in an academic context.
在蓬勃发展的加密货币领域,Twitter 等社交媒体平台已成为影响市场趋势和投资者情绪的关键。在我们的研究中,我们利用 GPT-4 和基于变换器的微调 BERT 模型进行多模态情感分析,重点研究表情符号对加密货币市场的影响。通过将表情符号转化为可量化的情绪数据,我们将这些见解与 BTCPrice 和 VCRIX 指数等关键市场指标相关联。这种方法可用于制定交易策略,旨在利用社交媒体元素识别和预测市场趋势。最重要的是,我们的研究结果表明,基于表情符号情绪的策略有助于避免市场大幅下滑,并有助于稳定回报。这项研究强调了将先进的人工智能驱动分析整合到金融策略中的实际好处,在学术背景下为数字通信与市场动态之间的相互作用提供了一个细致入微的视角。
{"title":"Emoji Driven Crypto Assets Market Reactions","authors":"Xiaorui Zuo, Yao-Tsung Chen, Wolfgang Karl Härdle","doi":"arxiv-2402.10481","DOIUrl":"https://doi.org/arxiv-2402.10481","url":null,"abstract":"In the burgeoning realm of cryptocurrency, social media platforms like\u0000Twitter have become pivotal in influencing market trends and investor\u0000sentiments. In our study, we leverage GPT-4 and a fine-tuned transformer-based\u0000BERT model for a multimodal sentiment analysis, focusing on the impact of emoji\u0000sentiment on cryptocurrency markets. By translating emojis into quantifiable\u0000sentiment data, we correlate these insights with key market indicators like BTC\u0000Price and the VCRIX index. This approach may be fed into the development of\u0000trading strategies aimed at utilizing social media elements to identify and\u0000forecast market trends. Crucially, our findings suggest that strategies based\u0000on emoji sentiment can facilitate the avoidance of significant market downturns\u0000and contribute to the stabilization of returns. This research underscores the\u0000practical benefits of integrating advanced AI-driven analyses into financial\u0000strategies, offering a nuanced perspective on the interplay between digital\u0000communication and market dynamics in an academic context.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"147 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139904116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - QuantFin - Statistical Finance
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1