首页 > 最新文献

arXiv - QuantFin - Statistical Finance最新文献

英文 中文
Leveraging RNNs and LSTMs for Synchronization Analysis in the Indian Stock Market: A Threshold-Based Classification Approach 利用 RNN 和 LSTM 对印度股市进行同步分析:基于阈值的分类方法
Pub Date : 2024-08-27 DOI: arxiv-2409.06728
Sanjay Sathish, Charu C Sharma
Our research presents a new approach for forecasting the synchronization ofstock prices using machine learning and non-linear time-series analysis. Tocapture the complex non-linear relationships between stock prices, we utilizerecurrence plots (RP) and cross-recurrence quantification analysis (CRQA). Bytransforming Cross Recurrence Plot (CRP) data into a time-series format, weenable the use of Recurrent Neural Networks (RNN) and Long Short-Term Memory(LSTM) networks for predicting stock price synchronization through bothregression and classification. We apply this methodology to a dataset of 20highly capitalized stocks from the Indian market over a 21-year period. Thefindings reveal that our approach can predict stock price synchronization, withan accuracy of 0.98 and F1 score of 0.83 offering valuable insights fordeveloping effective trading strategies and risk management tools.
我们的研究提出了一种利用机器学习和非线性时间序列分析预测股票价格同步性的新方法。为了捕捉股票价格之间复杂的非线性关系,我们利用了复现图(RP)和交叉复现量化分析(CRQA)。通过将交叉复现图(CRP)数据转换为时间序列格式,我们可以使用递归神经网络(RNN)和长短期记忆(LSTM)网络,通过回归和分类预测股价同步性。我们将这一方法应用于印度市场 21 年间 20 只高市值股票的数据集。研究结果表明,我们的方法可以预测股价同步性,准确率为 0.98,F1 得分为 0.83,为开发有效的交易策略和风险管理工具提供了宝贵的见解。
{"title":"Leveraging RNNs and LSTMs for Synchronization Analysis in the Indian Stock Market: A Threshold-Based Classification Approach","authors":"Sanjay Sathish, Charu C Sharma","doi":"arxiv-2409.06728","DOIUrl":"https://doi.org/arxiv-2409.06728","url":null,"abstract":"Our research presents a new approach for forecasting the synchronization of\u0000stock prices using machine learning and non-linear time-series analysis. To\u0000capture the complex non-linear relationships between stock prices, we utilize\u0000recurrence plots (RP) and cross-recurrence quantification analysis (CRQA). By\u0000transforming Cross Recurrence Plot (CRP) data into a time-series format, we\u0000enable the use of Recurrent Neural Networks (RNN) and Long Short-Term Memory\u0000(LSTM) networks for predicting stock price synchronization through both\u0000regression and classification. We apply this methodology to a dataset of 20\u0000highly capitalized stocks from the Indian market over a 21-year period. The\u0000findings reveal that our approach can predict stock price synchronization, with\u0000an accuracy of 0.98 and F1 score of 0.83 offering valuable insights for\u0000developing effective trading strategies and risk management tools.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LSR-IGRU: Stock Trend Prediction Based on Long Short-Term Relationships and Improved GRU LSR-IGRU:基于长期短期关系和改进 GRU 的股票走势预测
Pub Date : 2024-08-26 DOI: arxiv-2409.08282
Peng Zhu, Yuante Li, Yifan Hu, Qinyuan Liu, Dawei Cheng, Yuqi Liang
Stock price prediction is a challenging problem in the field of finance andreceives widespread attention. In recent years, with the rapid development oftechnologies such as deep learning and graph neural networks, more researchmethods have begun to focus on exploring the interrelationships between stocks.However, existing methods mostly focus on the short-term dynamic relationshipsof stocks and directly integrating relationship information with temporalinformation. They often overlook the complex nonlinear dynamic characteristicsand potential higher-order interaction relationships among stocks in the stockmarket. Therefore, we propose a stock price trend prediction model namedLSR-IGRU in this paper, which is based on long short-term stock relationshipsand an improved GRU input. Firstly, we construct a long short-term relationshipmatrix between stocks, where secondary industry information is employed for thefirst time to capture long-term relationships of stocks, and overnight priceinformation is utilized to establish short-term relationships. Next, we improvethe inputs of the GRU model at each step, enabling the model to moreeffectively integrate temporal information and long short-term relationshipinformation, thereby significantly improving the accuracy of predicting stocktrend changes. Finally, through extensive experiments on multiple datasets fromstock markets in China and the United States, we validate the superiority ofthe proposed LSR-IGRU model over the current state-of-the-art baseline models.We also apply the proposed model to the algorithmic trading system of afinancial company, achieving significantly higher cumulative portfolio returnscompared to other baseline methods. Our sources are released athttps://github.com/ZP1481616577/Baselines_LSR-IGRU.
股票价格预测是金融领域的一个挑战性问题,受到广泛关注。近年来,随着深度学习和图神经网络等技术的快速发展,越来越多的研究方法开始关注股票之间相互关系的探索。然而,现有方法大多关注股票的短期动态关系,并直接将关系信息与时间信息进行整合。然而,现有方法大多关注股票的短期动态关系,并直接将关系信息与时间信息整合,往往忽略了股票市场中股票之间复杂的非线性动态特征和潜在的高阶交互关系。因此,我们在本文中提出了基于股票长期短期关系和改进的 GRU 输入的股价趋势预测模型LSR-IGRU。首先,我们构建了股票之间的长期短期关系矩阵,首次利用二级行业信息捕捉股票的长期关系,并利用隔夜价格信息建立短期关系。接下来,我们改进了 GRU 模型的每一步输入,使模型能够更有效地整合时间信息和长期短期关系信息,从而显著提高了预测股票走势变化的准确性。最后,通过在中国和美国股票市场的多个数据集上进行广泛实验,我们验证了所提出的 LSR-IGRU 模型优于当前最先进的基线模型。我们还将所提出的模型应用于一家金融公司的算法交易系统,与其他基线方法相比,该模型获得了显著较高的累计投资组合回报。我们的资料来源发布在https://github.com/ZP1481616577/Baselines_LSR-IGRU.
{"title":"LSR-IGRU: Stock Trend Prediction Based on Long Short-Term Relationships and Improved GRU","authors":"Peng Zhu, Yuante Li, Yifan Hu, Qinyuan Liu, Dawei Cheng, Yuqi Liang","doi":"arxiv-2409.08282","DOIUrl":"https://doi.org/arxiv-2409.08282","url":null,"abstract":"Stock price prediction is a challenging problem in the field of finance and\u0000receives widespread attention. In recent years, with the rapid development of\u0000technologies such as deep learning and graph neural networks, more research\u0000methods have begun to focus on exploring the interrelationships between stocks.\u0000However, existing methods mostly focus on the short-term dynamic relationships\u0000of stocks and directly integrating relationship information with temporal\u0000information. They often overlook the complex nonlinear dynamic characteristics\u0000and potential higher-order interaction relationships among stocks in the stock\u0000market. Therefore, we propose a stock price trend prediction model named\u0000LSR-IGRU in this paper, which is based on long short-term stock relationships\u0000and an improved GRU input. Firstly, we construct a long short-term relationship\u0000matrix between stocks, where secondary industry information is employed for the\u0000first time to capture long-term relationships of stocks, and overnight price\u0000information is utilized to establish short-term relationships. Next, we improve\u0000the inputs of the GRU model at each step, enabling the model to more\u0000effectively integrate temporal information and long short-term relationship\u0000information, thereby significantly improving the accuracy of predicting stock\u0000trend changes. Finally, through extensive experiments on multiple datasets from\u0000stock markets in China and the United States, we validate the superiority of\u0000the proposed LSR-IGRU model over the current state-of-the-art baseline models.\u0000We also apply the proposed model to the algorithmic trading system of a\u0000financial company, achieving significantly higher cumulative portfolio returns\u0000compared to other baseline methods. Our sources are released at\u0000https://github.com/ZP1481616577/Baselines_LSR-IGRU.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142250091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
StockTime: A Time Series Specialized Large Language Model Architecture for Stock Price Prediction StockTime:用于股价预测的时间序列专用大型语言模型架构
Pub Date : 2024-08-25 DOI: arxiv-2409.08281
Shengkun Wang, Taoran Ji, Linhan Wang, Yanshen Sun, Shang-Ching Liu, Amit Kumar, Chang-Tien Lu
The stock price prediction task holds a significant role in the financialdomain and has been studied for a long time. Recently, large language models(LLMs) have brought new ways to improve these predictions. While recentfinancial large language models (FinLLMs) have shown considerable progress infinancial NLP tasks compared to smaller pre-trained language models (PLMs),challenges persist in stock price forecasting. Firstly, effectively integratingthe modalities of time series data and natural language to fully leverage thesecapabilities remains complex. Secondly, FinLLMs focus more on analysis andinterpretability, which can overlook the essential features of time seriesdata. Moreover, due to the abundance of false and redundant information infinancial markets, models often produce less accurate predictions when facedwith such input data. In this paper, we introduce StockTime, a novel LLM-basedarchitecture designed specifically for stock price data. Unlike recent FinLLMs,StockTime is specifically designed for stock price time series data. Itleverages the natural ability of LLMs to predict the next token by treatingstock prices as consecutive tokens, extracting textual information such asstock correlations, statistical trends and timestamps directly from these stockprices. StockTime then integrates both textual and time series data into theembedding space. By fusing this multimodal data, StockTime effectively predictsstock prices across arbitrary look-back periods. Our experiments demonstratethat StockTime outperforms recent LLMs, as it gives more accurate predictionswhile reducing memory usage and runtime costs.
股票价格预测任务在金融领域占有重要地位,对它的研究由来已久。最近,大型语言模型(LLMs)为改进这些预测带来了新方法。与预训练的小型语言模型(PLMs)相比,最近的金融大型语言模型(FinLLMs)在金融 NLP 任务方面取得了长足的进步,但在股票价格预测方面仍然存在挑战。首先,有效整合时间序列数据和自然语言模式以充分利用这些能力仍然很复杂。其次,金融语言模型更注重分析和可解释性,这可能会忽略时间序列数据的基本特征。此外,由于金融市场中存在大量虚假和冗余信息,模型在面对此类输入数据时往往无法做出准确的预测。在本文中,我们介绍了专门针对股价数据设计的基于 LLM 的新型架构 StockTime。与最近的金融 LLM 不同,StockTime 专为股票价格时间序列数据而设计。它利用 LLM 预测下一个代币的天然能力,将股票价格视为连续代币,直接从这些股票价格中提取文本信息,如股票相关性、统计趋势和时间戳。然后,StockTime 将文本和时间序列数据整合到嵌入空间中。通过融合这些多模态数据,StockTime 可以有效预测任意回溯期的股票价格。我们的实验证明,StockTime 的性能优于最近的 LLM,因为它能提供更准确的预测,同时降低内存使用率和运行成本。
{"title":"StockTime: A Time Series Specialized Large Language Model Architecture for Stock Price Prediction","authors":"Shengkun Wang, Taoran Ji, Linhan Wang, Yanshen Sun, Shang-Ching Liu, Amit Kumar, Chang-Tien Lu","doi":"arxiv-2409.08281","DOIUrl":"https://doi.org/arxiv-2409.08281","url":null,"abstract":"The stock price prediction task holds a significant role in the financial\u0000domain and has been studied for a long time. Recently, large language models\u0000(LLMs) have brought new ways to improve these predictions. While recent\u0000financial large language models (FinLLMs) have shown considerable progress in\u0000financial NLP tasks compared to smaller pre-trained language models (PLMs),\u0000challenges persist in stock price forecasting. Firstly, effectively integrating\u0000the modalities of time series data and natural language to fully leverage these\u0000capabilities remains complex. Secondly, FinLLMs focus more on analysis and\u0000interpretability, which can overlook the essential features of time series\u0000data. Moreover, due to the abundance of false and redundant information in\u0000financial markets, models often produce less accurate predictions when faced\u0000with such input data. In this paper, we introduce StockTime, a novel LLM-based\u0000architecture designed specifically for stock price data. Unlike recent FinLLMs,\u0000StockTime is specifically designed for stock price time series data. It\u0000leverages the natural ability of LLMs to predict the next token by treating\u0000stock prices as consecutive tokens, extracting textual information such as\u0000stock correlations, statistical trends and timestamps directly from these stock\u0000prices. StockTime then integrates both textual and time series data into the\u0000embedding space. By fusing this multimodal data, StockTime effectively predicts\u0000stock prices across arbitrary look-back periods. Our experiments demonstrate\u0000that StockTime outperforms recent LLMs, as it gives more accurate predictions\u0000while reducing memory usage and runtime costs.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"17 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142268344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing Performance: How Compact Models Match or Exceed GPT's Classification Capabilities through Fine-Tuning 优化性能:紧凑型模型如何通过微调匹配或超越 GPT 的分类能力
Pub Date : 2024-08-22 DOI: arxiv-2409.11408
Baptiste Lefort, Eric Benhamou, Jean-Jacques Ohana, David Saltiel, Beatrice Guez
In this paper, we demonstrate that non-generative, small-sized models such asFinBERT and FinDRoBERTa, when fine-tuned, can outperform GPT-3.5 and GPT-4models in zero-shot learning settings in sentiment analysis for financial news.These fine-tuned models show comparable results to GPT-3.5 when it isfine-tuned on the task of determining market sentiment from daily financialnews summaries sourced from Bloomberg. To fine-tune and compare these models,we created a novel database, which assigns a market score to each piece of newswithout human interpretation bias, systematically identifying the mentionedcompanies and analyzing whether their stocks have gone up, down, or remainedneutral. Furthermore, the paper shows that the assumptions of Condorcet's JuryTheorem do not hold suggesting that fine-tuned small models are not independentof the fine-tuned GPT models, indicating behavioural similarities. Lastly, theresulted fine-tuned models are made publicly available on HuggingFace,providing a resource for further research in financial sentiment analysis andtext classification.
在本文中,我们证明了非生成的小型模型,如FinBERT和FinDRoBERTa,经过微调后,可以在金融新闻情感分析的零点学习设置中优于GPT-3.5和GPT-4模型。这些经过微调的模型在对GPT-3.5进行微调后,在从彭博社的每日金融新闻摘要中判断市场情感的任务上显示出与GPT-3.5相当的结果。为了对这些模型进行微调和比较,我们创建了一个新颖的数据库,在没有人为解读偏差的情况下,为每条新闻分配一个市场得分,系统地识别被提及的公司,分析其股票是上涨、下跌还是保持中立。此外,本文还表明,孔多塞评判定理的假设并不成立,这表明微调小模型与微调 GPT 模型并不独立,这表明了行为上的相似性。最后,本文在 HuggingFace 上公开了微调模型的结果,为进一步研究金融情感分析和文本分类提供了资源。
{"title":"Optimizing Performance: How Compact Models Match or Exceed GPT's Classification Capabilities through Fine-Tuning","authors":"Baptiste Lefort, Eric Benhamou, Jean-Jacques Ohana, David Saltiel, Beatrice Guez","doi":"arxiv-2409.11408","DOIUrl":"https://doi.org/arxiv-2409.11408","url":null,"abstract":"In this paper, we demonstrate that non-generative, small-sized models such as\u0000FinBERT and FinDRoBERTa, when fine-tuned, can outperform GPT-3.5 and GPT-4\u0000models in zero-shot learning settings in sentiment analysis for financial news.\u0000These fine-tuned models show comparable results to GPT-3.5 when it is\u0000fine-tuned on the task of determining market sentiment from daily financial\u0000news summaries sourced from Bloomberg. To fine-tune and compare these models,\u0000we created a novel database, which assigns a market score to each piece of news\u0000without human interpretation bias, systematically identifying the mentioned\u0000companies and analyzing whether their stocks have gone up, down, or remained\u0000neutral. Furthermore, the paper shows that the assumptions of Condorcet's Jury\u0000Theorem do not hold suggesting that fine-tuned small models are not independent\u0000of the fine-tuned GPT models, indicating behavioural similarities. Lastly, the\u0000resulted fine-tuned models are made publicly available on HuggingFace,\u0000providing a resource for further research in financial sentiment analysis and\u0000text classification.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"17 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142250090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional Reinforcement Learning EX-DRL:利用极端分布强化学习抵御严重损失
Pub Date : 2024-08-22 DOI: arxiv-2408.12446
Parvin Malekzadeh, Zissis Poulos, Jacky Chen, Zeyu Wang, Konstantinos N. Plataniotis
Recent advancements in Distributional Reinforcement Learning (DRL) formodeling loss distributions have shown promise in developing hedging strategiesin derivatives markets. A common approach in DRL involves learning thequantiles of loss distributions at specified levels using Quantile Regression(QR). This method is particularly effective in option hedging due to its directquantile-based risk assessment, such as Value at Risk (VaR) and ConditionalValue at Risk (CVaR). However, these risk measures depend on the accurateestimation of extreme quantiles in the loss distribution's tail, which can beimprecise in QR-based DRL due to the rarity and extremity of tail data, ashighlighted in the literature. To address this issue, we propose EXtreme DRL(EX-DRL), which enhances extreme quantile prediction by modeling the tail ofthe loss distribution with a Generalized Pareto Distribution (GPD). This methodintroduces supplementary data to mitigate the scarcity of extreme quantileobservations, thereby improving estimation accuracy through QR. Comprehensiveexperiments on gamma hedging options demonstrate that EX-DRL improves existingQR-based models by providing more precise estimates of extreme quantiles,thereby improving the computation and reliability of risk metrics for complexfinancial risk management.
分布强化学习(DRL)在损失分布建模方面的最新进展,为衍生品市场对冲策略的开发带来了希望。DRL 中的一种常见方法是利用定量回归(QR)学习指定水平上损失分布的定量。这种方法在期权对冲中尤为有效,因为它可以直接进行基于量值的风险评估,如风险值(VaR)和条件风险值(CVaR)。然而,这些风险度量依赖于对损失分布尾部极端量值的精确估计,由于尾部数据的稀缺性和极端性,基于 QR 的 DRL 难以精确估计尾部数据。为了解决这个问题,我们提出了 EXtreme DRL(EX-DRL),它通过使用广义帕累托分布(GPD)对损失分布的尾部进行建模,从而增强了极端量值预测。该方法引入了补充数据,以缓解极端量级观测数据稀缺的问题,从而通过 QR 提高了估计精度。对伽马对冲期权的综合实验表明,EX-DRL 改进了现有的基于 QR 的模型,提供了更精确的极端量值估计,从而改进了用于综合金融风险管理的风险度量的计算和可靠性。
{"title":"EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional Reinforcement Learning","authors":"Parvin Malekzadeh, Zissis Poulos, Jacky Chen, Zeyu Wang, Konstantinos N. Plataniotis","doi":"arxiv-2408.12446","DOIUrl":"https://doi.org/arxiv-2408.12446","url":null,"abstract":"Recent advancements in Distributional Reinforcement Learning (DRL) for\u0000modeling loss distributions have shown promise in developing hedging strategies\u0000in derivatives markets. A common approach in DRL involves learning the\u0000quantiles of loss distributions at specified levels using Quantile Regression\u0000(QR). This method is particularly effective in option hedging due to its direct\u0000quantile-based risk assessment, such as Value at Risk (VaR) and Conditional\u0000Value at Risk (CVaR). However, these risk measures depend on the accurate\u0000estimation of extreme quantiles in the loss distribution's tail, which can be\u0000imprecise in QR-based DRL due to the rarity and extremity of tail data, as\u0000highlighted in the literature. To address this issue, we propose EXtreme DRL\u0000(EX-DRL), which enhances extreme quantile prediction by modeling the tail of\u0000the loss distribution with a Generalized Pareto Distribution (GPD). This method\u0000introduces supplementary data to mitigate the scarcity of extreme quantile\u0000observations, thereby improving estimation accuracy through QR. Comprehensive\u0000experiments on gamma hedging options demonstrate that EX-DRL improves existing\u0000QR-based models by providing more precise estimates of extreme quantiles,\u0000thereby improving the computation and reliability of risk metrics for complex\u0000financial risk management.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"121 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamical analysis of financial stocks network: improving forecasting using network properties 金融股网络动态分析:利用网络特性改进预测
Pub Date : 2024-08-21 DOI: arxiv-2408.11759
Ixandra Achitouv
Applying a network analysis to stock return correlations, we study thedynamical properties of the network and how they correlate with the marketreturn, finding meaningful variables that partially capture the complexdynamical processes of stock interactions and the market structure. We then usethe individual properties of stocks within the network along with the globalones, to find correlations with the future returns of individual S&P 500stocks. Applying these properties as input variables for forecasting, we find a50% improvement on the R2score in the prediction of stock returns on long timescales (per year), and 3% on short time scales (2 days), relative to baselinemodels without network variables.
通过对股票收益相关性进行网络分析,我们研究了网络的动态特性及其与市场收益的相关性,发现了一些有意义的变量,它们部分捕捉到了股票互动和市场结构的复杂动态过程。然后,我们利用网络中股票的个别属性和全球属性,找出与标准普尔 500 指数个股未来回报的相关性。通过将这些属性作为预测的输入变量,我们发现相对于没有网络变量的基线模型,在预测股票回报率方面,长时间尺度(每年)的 R2 分数提高了 50%,短时间尺度(2 天)的 R2 分数提高了 3%。
{"title":"Dynamical analysis of financial stocks network: improving forecasting using network properties","authors":"Ixandra Achitouv","doi":"arxiv-2408.11759","DOIUrl":"https://doi.org/arxiv-2408.11759","url":null,"abstract":"Applying a network analysis to stock return correlations, we study the\u0000dynamical properties of the network and how they correlate with the market\u0000return, finding meaningful variables that partially capture the complex\u0000dynamical processes of stock interactions and the market structure. We then use\u0000the individual properties of stocks within the network along with the global\u0000ones, to find correlations with the future returns of individual S&P 500\u0000stocks. Applying these properties as input variables for forecasting, we find a\u000050% improvement on the R2score in the prediction of stock returns on long time\u0000scales (per year), and 3% on short time scales (2 days), relative to baseline\u0000models without network variables.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"51 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combining supervised and unsupervised learning methods to predict financial market movements 结合监督和非监督学习方法预测金融市场动向
Pub Date : 2024-08-19 DOI: arxiv-2409.03762
Gabriel Rodrigues Palma, Mariusz Skoczeń, Phil Maguire
The decisions traders make to buy or sell an asset depend on variousanalyses, with expertise required to identify patterns that can be exploitedfor profit. In this paper we identify novel features extracted from emergentand well-established financial markets using linear models and Gaussian MixtureModels (GMM) with the aim of finding profitable opportunities. We usedapproximately six months of data consisting of minute candles from the Bitcoin,Pepecoin, and Nasdaq markets to derive and compare the proposed novel featureswith commonly used ones. These features were extracted based on the previous 59minutes for each market and used to identify predictions for the hour ahead. Weexplored the performance of various machine learning strategies, such as RandomForests (RF) and K-Nearest Neighbours (KNN) to classify market movements. Anaive random approach to selecting trading decisions was used as a benchmark,with outcomes assumed to be equally likely. We used a temporal cross-validationapproach using test sets of 40%, 30% and 20% of total hours to evaluate thelearning algorithms' performances. Our results showed that filtering the timeseries facilitates algorithms' generalisation. The GMM filtering approachrevealed that the KNN and RF algorithms produced higher average returns thanthe random algorithm.
交易者买入或卖出资产的决定取决于各种分析,需要专业知识来识别可利用的盈利模式。在本文中,我们使用线性模型和高斯混杂模型(GMM)从新兴和成熟的金融市场中识别出新的特征,目的是寻找盈利机会。我们使用了大约六个月的数据,包括比特币、佩佩币和纳斯达克市场的分钟蜡烛图,得出了所提出的新特征,并将其与常用特征进行了比较。这些特征是根据每个市场的前 59 分钟提取的,并用于识别对未来一小时的预测。我们探索了各种机器学习策略的性能,如随机森林(RF)和 K-Nearest Neighbours(KNN),以对市场走势进行分类。我们将选择交易决策的随机方法作为基准,假定结果的可能性相同。我们采用时间交叉验证方法,使用总小时数的 40%、30% 和 20% 的测试集来评估学习算法的性能。结果表明,过滤时间序列有助于算法的泛化。GMM 过滤方法表明,KNN 和 RF 算法比随机算法产生了更高的平均收益。
{"title":"Combining supervised and unsupervised learning methods to predict financial market movements","authors":"Gabriel Rodrigues Palma, Mariusz Skoczeń, Phil Maguire","doi":"arxiv-2409.03762","DOIUrl":"https://doi.org/arxiv-2409.03762","url":null,"abstract":"The decisions traders make to buy or sell an asset depend on various\u0000analyses, with expertise required to identify patterns that can be exploited\u0000for profit. In this paper we identify novel features extracted from emergent\u0000and well-established financial markets using linear models and Gaussian Mixture\u0000Models (GMM) with the aim of finding profitable opportunities. We used\u0000approximately six months of data consisting of minute candles from the Bitcoin,\u0000Pepecoin, and Nasdaq markets to derive and compare the proposed novel features\u0000with commonly used ones. These features were extracted based on the previous 59\u0000minutes for each market and used to identify predictions for the hour ahead. We\u0000explored the performance of various machine learning strategies, such as Random\u0000Forests (RF) and K-Nearest Neighbours (KNN) to classify market movements. A\u0000naive random approach to selecting trading decisions was used as a benchmark,\u0000with outcomes assumed to be equally likely. We used a temporal cross-validation\u0000approach using test sets of 40%, 30% and 20% of total hours to evaluate the\u0000learning algorithms' performances. Our results showed that filtering the time\u0000series facilitates algorithms' generalisation. The GMM filtering approach\u0000revealed that the KNN and RF algorithms produced higher average returns than\u0000the random algorithm.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"9 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A new measure of risk using Fourier analysis 利用傅立叶分析法衡量风险的新方法
Pub Date : 2024-08-18 DOI: arxiv-2408.10279
Michael Grabinski, Galiya Klinkova
We use Fourier analysis to access risk in financial products. With it weanalyze price changes of e.g. stocks. Via Fourier analysis we scrutinizequantitatively whether the frequency of change is higher than a change in(conserved) company value would allow. If it is the case, it would be a clearindicator of speculation and with it risk. The entire methods or better itsapplication is fairly new. However, there were severe flaws in previousattempts; making the results (not the method) doubtful. We corrected all thesemistakes by e.g. using Fourier transformation instead of discrete Fourieranalysis. Our analysis is reliable in the entire frequency band, even forfre-quency of 1/1d or higher if the prices are noted accordingly. For thestocks scrutinized we found that the price of stocks changes disproportionallywithin one week which clearly indicates spec-ulation. It would be aninteresting extension to apply the method to crypto currencies as thesecurrencies have no conserved value which makes normal considerations ofvolatility difficult.
我们利用傅立叶分析法来了解金融产品的风险。我们用它来分析股票等产品的价格变化。通过傅立叶分析法,我们可以定量分析价格变化的频率是否高于公司价值(保守)变化所允许的频率。如果是这样,这将是投机和风险的明确指标。整个方法或其更好的应用都是相当新的。然而,之前的尝试存在严重缺陷,导致结果(而非方法)存疑。我们通过使用傅里叶变换而不是离散傅里叶分析等方法纠正了所有这些错误。我们的分析在整个频段内都是可靠的,如果价格得到相应的记录,甚至 1/1d 或更高频率的分析也是可靠的。对于所研究的股票,我们发现股票价格在一周内的变化不成比例,这清楚地表明了投机行为。将该方法应用于加密货币将是一个有趣的扩展,因为这些货币没有保存价值,这使得对波动性的正常考虑变得困难。
{"title":"A new measure of risk using Fourier analysis","authors":"Michael Grabinski, Galiya Klinkova","doi":"arxiv-2408.10279","DOIUrl":"https://doi.org/arxiv-2408.10279","url":null,"abstract":"We use Fourier analysis to access risk in financial products. With it we\u0000analyze price changes of e.g. stocks. Via Fourier analysis we scrutinize\u0000quantitatively whether the frequency of change is higher than a change in\u0000(conserved) company value would allow. If it is the case, it would be a clear\u0000indicator of speculation and with it risk. The entire methods or better its\u0000application is fairly new. However, there were severe flaws in previous\u0000attempts; making the results (not the method) doubtful. We corrected all these\u0000mistakes by e.g. using Fourier transformation instead of discrete Fourier\u0000analysis. Our analysis is reliable in the entire frequency band, even for\u0000fre-quency of 1/1d or higher if the prices are noted accordingly. For the\u0000stocks scrutinized we found that the price of stocks changes disproportionally\u0000within one week which clearly indicates spec-ulation. It would be an\u0000interesting extension to apply the method to crypto currencies as these\u0000currencies have no conserved value which makes normal considerations of\u0000volatility difficult.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stylized facts in Web3 Web3 中的风格化事实
Pub Date : 2024-08-14 DOI: arxiv-2408.07653
A. Christian Silva, Shen-Ning Tung, Wwi-Ru Chen
This paper presents a comprehensive statistical analysis of the Web3ecosystem, comparing various Web3 tokens with traditional financial assetsacross multiple time scales. We examine probability distributions, tailbehaviors, and other key stylized facts of the returns for a diverse range oftokens, including decentralized exchanges, liquidity pools, and centralizedexchanges. Despite functional differences, most tokens exhibit well-establishedempirical facts, including unconditional probability density of returns withheavy tails gradually becoming Gaussian and volatility clustering. Furthermore,we compare assets traded on centralized (CEX) and decentralized (DEX)exchanges, finding that DEXs exhibit similar stylized facts despite differenttrading mechanisms and often divergent long-term performance. We propose thatthis similarity is attributable to arbitrageurs striving to maintain similarcentralized and decentralized prices. Our study contributes to a betterunderstanding of the dynamics of Web3 tokens and the relationship between CEXand DEX markets, with important implications for risk management, pricingmodels, and portfolio construction in the rapidly evolving DeFi landscape.These results add to the growing body of literature on cryptocurrency marketsand provide insights that can guide the development of more accurate models forDeFi markets.
本文对 Web3 生态系统进行了全面的统计分析,将各种 Web3 代币与传统金融资产在多个时间尺度上进行了比较。我们研究了各种代币(包括分散式交易所、流动性池和集中式交易所)收益的概率分布、尾部行为和其他关键的风格化事实。尽管在功能上存在差异,但大多数代币都表现出了既定的经验事实,包括收益率的无条件概率密度(重尾逐渐变为高斯分布)和波动率集群。此外,我们还比较了在集中式交易所(CEX)和分散式交易所(DEX)交易的资产,发现尽管交易机制不同,长期表现也往往不同,但分散式交易所表现出类似的风格化事实。我们提出,这种相似性可归因于套利者努力维持类似的集中式和分散式价格。我们的研究有助于更好地理解 Web3 代币的动态以及 CEX 和 DEX 市场之间的关系,对快速发展的 DeFi 市场中的风险管理、定价模型和投资组合构建具有重要意义。
{"title":"Stylized facts in Web3","authors":"A. Christian Silva, Shen-Ning Tung, Wwi-Ru Chen","doi":"arxiv-2408.07653","DOIUrl":"https://doi.org/arxiv-2408.07653","url":null,"abstract":"This paper presents a comprehensive statistical analysis of the Web3\u0000ecosystem, comparing various Web3 tokens with traditional financial assets\u0000across multiple time scales. We examine probability distributions, tail\u0000behaviors, and other key stylized facts of the returns for a diverse range of\u0000tokens, including decentralized exchanges, liquidity pools, and centralized\u0000exchanges. Despite functional differences, most tokens exhibit well-established\u0000empirical facts, including unconditional probability density of returns with\u0000heavy tails gradually becoming Gaussian and volatility clustering. Furthermore,\u0000we compare assets traded on centralized (CEX) and decentralized (DEX)\u0000exchanges, finding that DEXs exhibit similar stylized facts despite different\u0000trading mechanisms and often divergent long-term performance. We propose that\u0000this similarity is attributable to arbitrageurs striving to maintain similar\u0000centralized and decentralized prices. Our study contributes to a better\u0000understanding of the dynamics of Web3 tokens and the relationship between CEX\u0000and DEX markets, with important implications for risk management, pricing\u0000models, and portfolio construction in the rapidly evolving DeFi landscape.\u0000These results add to the growing body of literature on cryptocurrency markets\u0000and provide insights that can guide the development of more accurate models for\u0000DeFi markets.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"33 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model-based and empirical analyses of stochastic fluctuations in economy and finance 基于模型和经验的经济和金融随机波动分析
Pub Date : 2024-08-14 DOI: arxiv-2408.16010
Rubina Zadourian
The objective of this work is the investigation of complexity, asymmetry,stochasticity and non-linearity of the financial and economic systems by usingthe tools of statistical mechanics and information theory. More precisely, thisthesis concerns statistical-based modeling and empirical analyses withapplications in finance, forecasting, production processes and game theory. Inthese areas the time dependence of probability distributions is of primeinterest and can be measured or exactly calculated for model systems. Thecorrelation coefficients and moments are among the useful quantities todescribe the dynamics and the correlations between random variables. However,the full investigation can only be achieved if the probability distributionfunction of the variable is known; its derivation is one of the main focuses ofthe present work.
这项工作的目标是利用统计力学和信息论的工具,研究金融和经济系统的复杂性、不对称性、随机性和非线性。更确切地说,本论文涉及基于统计的建模和实证分析,并应用于金融、预测、生产过程和博弈论等领域。在这些领域中,概率分布的时间依赖性是人们最感兴趣的问题,可以对模型系统进行测量或精确计算。相关系数和矩是描述随机变量间动态和相关性的有用量。然而,只有在已知变量的概率分布函数的情况下,才能对其进行全面研究。
{"title":"Model-based and empirical analyses of stochastic fluctuations in economy and finance","authors":"Rubina Zadourian","doi":"arxiv-2408.16010","DOIUrl":"https://doi.org/arxiv-2408.16010","url":null,"abstract":"The objective of this work is the investigation of complexity, asymmetry,\u0000stochasticity and non-linearity of the financial and economic systems by using\u0000the tools of statistical mechanics and information theory. More precisely, this\u0000thesis concerns statistical-based modeling and empirical analyses with\u0000applications in finance, forecasting, production processes and game theory. In\u0000these areas the time dependence of probability distributions is of prime\u0000interest and can be measured or exactly calculated for model systems. The\u0000correlation coefficients and moments are among the useful quantities to\u0000describe the dynamics and the correlations between random variables. However,\u0000the full investigation can only be achieved if the probability distribution\u0000function of the variable is known; its derivation is one of the main focuses of\u0000the present work.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - QuantFin - Statistical Finance
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1