arXiv - QuantFin - Computational Finance最新文献

英文中文

MLP, XGBoost, KAN, TDNN, and LSTM-GRU Hybrid RNN with Attention for SPX and NDX European Call Option Pricing 针对 SPX 和 NDX 欧洲看涨期权定价的 MLP、XGBoost、KAN、TDNN 和 LSTM-GRU 混合 RNN 注意事项

arXiv - QuantFin - Computational Finance

Pub Date : 2024-08-26 DOI: arxiv-2409.06724

Boris Ter-Avanesov, Homayoon Beigi

We explore the performance of various artificial neural networkarchitectures, including a multilayer perceptron (MLP), Kolmogorov-Arnoldnetwork (KAN), LSTM-GRU hybrid recursive neural network (RNN) models, and atime-delay neural network (TDNN) for pricing European call options. In thisstudy, we attempt to leverage the ability of supervised learning methods, suchas ANNs, KANs, and gradient-boosted decision trees, to approximate complexmultivariate functions in order to calibrate option prices based on past marketdata. The motivation for using ANNs and KANs is the Universal ApproximationTheorem and Kolmogorov-Arnold Representation Theorem, respectively.Specifically, we use S&P 500 (SPX) and NASDAQ 100 (NDX) index options tradedduring 2015-2023 with times to maturity ranging from 15 days to over 4 years(OptionMetrics IvyDB US dataset). Black & Scholes's (BS) PDE cite{Black1973}model's performance in pricing the same options compared to real data is usedas a benchmark. This model relies on strong assumptions, and it has beenobserved and discussed in the literature that real data does not match itspredictions. Supervised learning methods are widely used as an alternative forcalibrating option prices due to some of the limitations of this model. In ourexperiments, the BS model underperforms compared to all of the others. Also,the best TDNN model outperforms the best MLP model on all error metrics. Weimplement a simple self-attention mechanism to enhance the RNN models,significantly improving their performance. The best-performing model overall isthe LSTM-GRU hybrid RNN model with attention. Also, the KAN model outperformsthe TDNN and MLP models. We analyze the performance of all models by ticker,moneyness category, and over/under/correctly-priced percentage.

我们探索了各种人工神经网络架构的性能，包括用于欧式看涨期权定价的多层感知器（MLP）、Kolmogorov-Arnold 网络（KAN）、LSTM-GRU 混合递归神经网络（RNN）模型和时间延迟神经网络（TDNN）。在这项研究中，我们试图利用监督学习方法的能力，如 ANNs、KANs 和梯度提升决策树，来逼近复杂的多变量函数，从而根据过去的市场数据来校准期权价格。使用ANNs和KANs的动机分别是普适逼近定理和科尔莫哥罗夫-阿诺德表征定理。具体来说，我们使用了2015-2023年间交易的标准普尔500（SPX）和纳斯达克100（NDX）指数期权，到期时间从15天到超过4年不等（OptionMetrics IvyDB美国数据集）。布莱克和斯科尔斯（Black & Scholes，BS）的 PDE 模型与真实数据相比在相同期权定价方面的表现被用作基准。该模型依赖于强有力的假设，文献中已经观察到并讨论过真实数据与其预测不符的情况。由于该模型的一些局限性，监督学习方法被广泛用作校准期权价格的替代方法。在我们的实验中，BS 模型的表现不如其他所有模型。此外，在所有误差指标上，最佳 TDNN 模型都优于最佳 MLP 模型。我们实施了一种简单的自我关注机制来增强 RNN 模型，从而显著提高了它们的性能。总体表现最好的模型是带有注意力的 LSTM-GRU 混合 RNN 模型。此外，KAN 模型的性能也优于 TDNN 和 MLP 模型。我们按股票、资金类别和定价过高/过低/过高定价百分比分析了所有模型的性能。

{"title":"MLP, XGBoost, KAN, TDNN, and LSTM-GRU Hybrid RNN with Attention for SPX and NDX European Call Option Pricing","authors":"Boris Ter-Avanesov, Homayoon Beigi","doi":"arxiv-2409.06724","DOIUrl":"https://doi.org/arxiv-2409.06724","url":null,"abstract":"We explore the performance of various artificial neural network\u0000architectures, including a multilayer perceptron (MLP), Kolmogorov-Arnold\u0000network (KAN), LSTM-GRU hybrid recursive neural network (RNN) models, and a\u0000time-delay neural network (TDNN) for pricing European call options. In this\u0000study, we attempt to leverage the ability of supervised learning methods, such\u0000as ANNs, KANs, and gradient-boosted decision trees, to approximate complex\u0000multivariate functions in order to calibrate option prices based on past market\u0000data. The motivation for using ANNs and KANs is the Universal Approximation\u0000Theorem and Kolmogorov-Arnold Representation Theorem, respectively.\u0000Specifically, we use S&P 500 (SPX) and NASDAQ 100 (NDX) index options traded\u0000during 2015-2023 with times to maturity ranging from 15 days to over 4 years\u0000(OptionMetrics IvyDB US dataset). Black & Scholes's (BS) PDE cite{Black1973}\u0000model's performance in pricing the same options compared to real data is used\u0000as a benchmark. This model relies on strong assumptions, and it has been\u0000observed and discussed in the literature that real data does not match its\u0000predictions. Supervised learning methods are widely used as an alternative for\u0000calibrating option prices due to some of the limitations of this model. In our\u0000experiments, the BS model underperforms compared to all of the others. Also,\u0000the best TDNN model outperforms the best MLP model on all error metrics. We\u0000implement a simple self-attention mechanism to enhance the RNN models,\u0000significantly improving their performance. The best-performing model overall is\u0000the LSTM-GRU hybrid RNN model with attention. Also, the KAN model outperforms\u0000the TDNN and MLP models. We analyze the performance of all models by ticker,\u0000moneyness category, and over/under/correctly-priced percentage.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142208130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods 基于大语言模型和深度学习方法信息融合的欧元兑美元汇率预测

arXiv - QuantFin - Computational Finance

Pub Date : 2024-08-23 DOI: arxiv-2408.13214

Hongcheng Ding, Xuanze Zhao, Zixiao Jiang, Shamsul Nahar Abdullah, Deshinta Arrova Dewi

Accurate forecasting of the EUR/USD exchange rate is crucial for investors,businesses, and policymakers. This paper proposes a novel framework, IUS, thatintegrates unstructured textual data from news and analysis with structureddata on exchange rates and financial indicators to enhance exchange rateprediction. The IUS framework employs large language models for sentimentpolarity scoring and exchange rate movement classification of texts. Thesetextual features are combined with quantitative features and input into aCausality-Driven Feature Generator. An Optuna-optimized Bi-LSTM model is thenused to forecast the EUR/USD exchange rate. Experiments demonstrate that theproposed method outperforms benchmark models, reducing MAE by 10.69% and RMSEby 9.56% compared to the best performing baseline. Results also show thebenefits of data fusion, with the combination of unstructured and structureddata yielding higher accuracy than structured data alone. Furthermore, featureselection using the top 12 important quantitative features combined with thetextual features proves most effective. The proposed IUS framework andOptuna-Bi-LSTM model provide a powerful new approach for exchange rateforecasting through multi-source data integration.

准确预测欧元兑美元汇率对投资者、企业和政策制定者至关重要。本文提出了一个新颖的框架 IUS，它将来自新闻和分析的非结构化文本数据与汇率和金融指标的结构化数据整合在一起，以增强汇率预测能力。IUS 框架采用大型语言模型对文本进行情感极性评分和汇率变动分类。所设定的文本特征与定量特征相结合，并输入因果关系驱动特征生成器。然后使用 Optuna 优化的 Bi-LSTM 模型预测欧元/美元汇率。实验表明，所提出的方法优于基准模型，与性能最好的基线相比，MAE 降低了 10.69%，RMSE 降低了 9.56%。实验结果还显示了数据融合的优势，非结构化数据和结构化数据的结合比单独使用结构化数据的准确率更高。此外，使用前 12 个重要的定量特征结合文本特征进行特征选择被证明是最有效的。所提出的 IUS 框架和 Optuna-Bi-LSTM 模型为通过多源数据融合进行汇率预测提供了一种强大的新方法。

{"title":"EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods","authors":"Hongcheng Ding, Xuanze Zhao, Zixiao Jiang, Shamsul Nahar Abdullah, Deshinta Arrova Dewi","doi":"arxiv-2408.13214","DOIUrl":"https://doi.org/arxiv-2408.13214","url":null,"abstract":"Accurate forecasting of the EUR/USD exchange rate is crucial for investors,\u0000businesses, and policymakers. This paper proposes a novel framework, IUS, that\u0000integrates unstructured textual data from news and analysis with structured\u0000data on exchange rates and financial indicators to enhance exchange rate\u0000prediction. The IUS framework employs large language models for sentiment\u0000polarity scoring and exchange rate movement classification of texts. These\u0000textual features are combined with quantitative features and input into a\u0000Causality-Driven Feature Generator. An Optuna-optimized Bi-LSTM model is then\u0000used to forecast the EUR/USD exchange rate. Experiments demonstrate that the\u0000proposed method outperforms benchmark models, reducing MAE by 10.69% and RMSE\u0000by 9.56% compared to the best performing baseline. Results also show the\u0000benefits of data fusion, with the combination of unstructured and structured\u0000data yielding higher accuracy than structured data alone. Furthermore, feature\u0000selection using the top 12 important quantitative features combined with the\u0000textual features proves most effective. The proposed IUS framework and\u0000Optuna-Bi-LSTM model provide a powerful new approach for exchange rate\u0000forecasting through multi-source data integration.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"46 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142208126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications Open-FinLLMs：用于金融应用的开放式多模态大语言模型

arXiv - QuantFin - Computational Finance

Pub Date : 2024-08-20 DOI: arxiv-2408.11878

Qianqian Xie, Dong Li, Mengxi Xiao, Zihao Jiang, Ruoyu Xiang, Xiao Zhang, Zhengyu Chen, Yueru He, Weiguang Han, Yuzhe Yang, Shunian Chen, Yifei Zhang, Lihang Shen, Daniel Kim, Zhiwei Liu, Zheheng Luo, Yangyang Yu, Yupeng Cao, Zhiyang Deng, Zhiyuan Yao, Haohang Li, Duanyu Feng, Yongfu Dai, VijayaSai Somasundaram, Peng Lu, Yilun Zhao, Yitao Long, Guojun Xiong, Kaleb Smith, Honghai Yu, Yanzhao Lai, Min Peng, Jianyun Nie, Jordan W. Suchow, Xiao-Yang Liu, Benyou Wang, Alejandro Lopez-Lira, Jimin Huang, Sophia Ananiadou

Large language models (LLMs) have advanced financial applications, yet theyoften lack sufficient financial knowledge and struggle with tasks involvingmulti-modal inputs like tables and time series data. To address theselimitations, we introduce textit{Open-FinLLMs}, a series of Financial LLMs. Webegin with FinLLaMA, pre-trained on a 52 billion token financial corpus,incorporating text, tables, and time-series data to embed comprehensivefinancial knowledge. FinLLaMA is then instruction fine-tuned with 573Kfinancial instructions, resulting in FinLLaMA-instruct, which enhances taskperformance. Finally, we present FinLLaVA, a multimodal LLM trained with 1.43Mimage-text instructions to handle complex financial data types. Extensiveevaluations demonstrate FinLLaMA's superior performance over LLaMA3-8B,LLaMA3.1-8B, and BloombergGPT in both zero-shot and few-shot settings across 19and 4 datasets, respectively. FinLLaMA-instruct outperforms GPT-4 and otherFinancial LLMs on 15 datasets. FinLLaVA excels in understanding tables andcharts across 4 multimodal tasks. Additionally, FinLLaMA achieves impressiveSharpe Ratios in trading simulations, highlighting its robust financialapplication capabilities. We will continually maintain and improve our modelsand benchmarks to support ongoing innovation in academia and industry.

大型语言模型（LLMs）在金融领域有着先进的应用，但它们往往缺乏足够的金融知识，在处理涉及表格和时间序列数据等多模态输入的任务时举步维艰。为了解决这些限制，我们引入了一系列金融 LLMs：textit{Open-FinLLMs}。我们从 FinLLaMA 开始，在 520 亿个代币的金融语料库上进行预训练，结合文本、表格和时间序列数据，嵌入全面的金融知识。然后，我们使用 573K 条金融指令对 FinLLaMA 进行了指令微调，最终形成了 FinLLaMA-instruct，从而提高了任务性能。最后，我们介绍了 FinLLaVA，这是一种使用 1.43Mimage-text 指令训练的多模态 LLM，用于处理复杂的金融数据类型。广泛的评估结果表明，在 19 个数据集和 4 个数据集上，FinLLaMA 在零点击和少点击设置下的性能分别优于 LLaMA3-8B、LLaMA3.1-8B 和 BloombergGPT。在 15 个数据集上，FinLLaMA-instruct 优于 GPT-4 和其他金融 LLM。在 4 项多模态任务中，FinLLaVA 在理解表格和图表方面表现出色。此外，FinLLaMA 还在模拟交易中取得了令人印象深刻的沙普比率（Sharpe Ratios），彰显了其强大的金融应用能力。我们将不断维护和改进我们的模型和基准，以支持学术界和业界的持续创新。

{"title":"Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications","authors":"Qianqian Xie, Dong Li, Mengxi Xiao, Zihao Jiang, Ruoyu Xiang, Xiao Zhang, Zhengyu Chen, Yueru He, Weiguang Han, Yuzhe Yang, Shunian Chen, Yifei Zhang, Lihang Shen, Daniel Kim, Zhiwei Liu, Zheheng Luo, Yangyang Yu, Yupeng Cao, Zhiyang Deng, Zhiyuan Yao, Haohang Li, Duanyu Feng, Yongfu Dai, VijayaSai Somasundaram, Peng Lu, Yilun Zhao, Yitao Long, Guojun Xiong, Kaleb Smith, Honghai Yu, Yanzhao Lai, Min Peng, Jianyun Nie, Jordan W. Suchow, Xiao-Yang Liu, Benyou Wang, Alejandro Lopez-Lira, Jimin Huang, Sophia Ananiadou","doi":"arxiv-2408.11878","DOIUrl":"https://doi.org/arxiv-2408.11878","url":null,"abstract":"Large language models (LLMs) have advanced financial applications, yet they\u0000often lack sufficient financial knowledge and struggle with tasks involving\u0000multi-modal inputs like tables and time series data. To address these\u0000limitations, we introduce textit{Open-FinLLMs}, a series of Financial LLMs. We\u0000begin with FinLLaMA, pre-trained on a 52 billion token financial corpus,\u0000incorporating text, tables, and time-series data to embed comprehensive\u0000financial knowledge. FinLLaMA is then instruction fine-tuned with 573K\u0000financial instructions, resulting in FinLLaMA-instruct, which enhances task\u0000performance. Finally, we present FinLLaVA, a multimodal LLM trained with 1.43M\u0000image-text instructions to handle complex financial data types. Extensive\u0000evaluations demonstrate FinLLaMA's superior performance over LLaMA3-8B,\u0000LLaMA3.1-8B, and BloombergGPT in both zero-shot and few-shot settings across 19\u0000and 4 datasets, respectively. FinLLaMA-instruct outperforms GPT-4 and other\u0000Financial LLMs on 15 datasets. FinLLaVA excels in understanding tables and\u0000charts across 4 multimodal tasks. Additionally, FinLLaMA achieves impressive\u0000Sharpe Ratios in trading simulations, highlighting its robust financial\u0000application capabilities. We will continually maintain and improve our models\u0000and benchmarks to support ongoing innovation in academia and industry.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"66 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142208127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Deep-MacroFin: Informed Equilibrium Neural Network for Continuous Time Economic Models Deep-MacroFin：用于连续时间经济模型的知情均衡神经网络

arXiv - QuantFin - Computational Finance

Pub Date : 2024-08-19 DOI: arxiv-2408.10368

Yuntao Wu, Jiayuan Guo, Goutham Gopalakrishna, Zisis Poulos

In this paper, we present Deep-MacroFin, a comprehensive framework designedto solve partial differential equations, with a particular focus on models incontinuous time economics. This framework leverages deep learningmethodologies, including conventional Multi-Layer Perceptrons and the newlydeveloped Kolmogorov-Arnold Networks. It is optimized using economicinformation encapsulated by Hamilton-Jacobi-Bellman equations and coupledalgebraic equations. The application of neural networks holds the promise ofaccurately resolving high-dimensional problems with fewer computational demandsand limitations compared to standard numerical methods. This versatileframework can be readily adapted for elementary differential equations, andsystems of differential equations, even in cases where the solutions mayexhibit discontinuities. Importantly, it offers a more straightforward anduser-friendly implementation than existing libraries.

在本文中，我们介绍了 Deep-MacroFin，这是一个旨在求解偏微分方程的综合框架，尤其侧重于连续时间经济学模型。该框架利用深度学习方法，包括传统的多层感知器和新开发的 Kolmogorov-Arnold 网络。它利用由汉密尔顿-雅各比-贝尔曼方程和耦合代数方程封装的经济信息进行优化。与标准数值方法相比，神经网络的应用有望减少计算需求和限制，准确解决高维问题。这种多用途框架可以很容易地适用于初等微分方程和微分方程系统，即使在解可能表现出不连续性的情况下也是如此。重要的是，与现有库相比，它提供了更直接和用户友好的实现方式。

引用次数: 0

Causality-Inspired Models for Financial Time Series Forecasting 金融时间序列预测的因果关系启发模型

arXiv - QuantFin - Computational Finance

Pub Date : 2024-08-19 DOI: arxiv-2408.09960

Daniel Cunha Oliveira, Yutong Lu, Xi Lin, Mihai Cucuringu, Andre Fujita

We introduce a novel framework to financial time series forecasting thatleverages causality-inspired models to balance the trade-off between invarianceto distributional changes and minimization of prediction errors. To the best ofour knowledge, this is the first study to conduct a comprehensive comparativeanalysis among state-of-the-art causal discovery algorithms, benchmarkedagainst non-causal feature selection techniques, in the application offorecasting asset returns. Empirical evaluations demonstrate the efficacy ofour approach in yielding stable and accurate predictions, outperformingbaseline models, particularly in tumultuous market conditions.

我们为金融时间序列预测引入了一个新框架，该框架利用因果启发模型来平衡对分布变化的不变性和预测误差最小化之间的权衡。据我们所知，这是第一项在资产回报率预测应用中，以非因果特征选择技术为基准，对最先进的因果发现算法进行全面比较分析的研究。实证评估证明了我们的方法在产生稳定而准确的预测方面的功效，其性能优于基准模型，尤其是在动荡的市场条件下。

引用次数: 0

Enhancing Startup Success Predictions in Venture Capital: A GraphRAG Augmented Multivariate Time Series Method 增强风险投资中的初创企业成功预测：GraphRAG 多变量时间序列增强方法

arXiv - QuantFin - Computational Finance

Pub Date : 2024-08-18 DOI: arxiv-2408.09420

Gao Zitian, Xiao Yihao

In the Venture Capital(VC) industry, predicting the success of startups ischallenging due to limited financial data and the need for subjective revenueforecasts. Previous methods based on time series analysis or deep learningoften fall short as they fail to incorporate crucial inter-companyrelationships such as competition and collaboration. Regarding the issues, wepropose a novel approach using GrahphRAG augmented time series model. WithGraphRAG, time series predictive methods are enhanced by integrating thesevital relationships into the analysis framework, allowing for a more dynamicunderstanding of the startup ecosystem in venture capital. Our experimentalresults demonstrate that our model significantly outperforms previous models instartup success predictions. To the best of our knowledge, our work is thefirst application work of GraphRAG.

在风险投资（VC）行业，由于财务数据有限且需要主观收入预测，因此预测初创企业的成功与否非常具有挑战性。以往基于时间序列分析或深度学习的方法往往无法将竞争和合作等关键的公司间关系纳入其中，因而存在不足。针对这些问题，我们提出了一种使用 GrahphRAG 增强时间序列模型的新方法。通过将这些重要关系纳入分析框架，GraphRAG 增强了时间序列预测方法，从而能够更加动态地了解风险投资中的初创企业生态系统。实验结果表明，在创业成功预测方面，我们的模型明显优于之前的模型。据我们所知，我们的工作是 GraphRAG 的首次应用工作。

引用次数: 0

Gradient Reduction Convolutional Neural Network Policy for Financial Deep Reinforcement Learning 用于金融深度强化学习的梯度降低卷积神经网络策略

arXiv - QuantFin - Computational Finance

Pub Date : 2024-08-16 DOI: arxiv-2408.11859

Sina Montazeri, Haseebullah Jumakhan, Sonia Abrasiabian, Amir Mirzaeinia

Building on our prior explorations of convolutional neural networks (CNNs)for financial data processing, this paper introduces two significantenhancements to refine our CNN model's predictive performance and robustnessfor financial tabular data. Firstly, we integrate a normalization layer at theinput stage to ensure consistent feature scaling, addressing the issue ofdisparate feature magnitudes that can skew the learning process. Thismodification is hypothesized to aid in stabilizing the training dynamics andimproving the model's generalization across diverse financial datasets.Secondly, we employ a Gradient Reduction Architecture, where earlier layers arewider and subsequent layers are progressively narrower. This enhancement isdesigned to enable the model to capture more complex and subtle patterns withinthe data, a crucial factor in accurately predicting financial outcomes. Theseadvancements directly respond to the limitations identified in previousstudies, where simpler models struggled with the complexity and variabilityinherent in financial applications. Initial tests confirm that these changesimprove accuracy and model stability, suggesting that deeper and more nuancednetwork architectures can significantly benefit financial predictive tasks.This paper details the implementation of these enhancements and evaluates theirimpact on the model's performance in a controlled experimental setting.

基于我们之前对用于金融数据处理的卷积神经网络（CNN）的探索，本文引入了两项重大改进，以完善 CNN 模型的预测性能和对金融表格数据的鲁棒性。首先，我们在输入阶段集成了一个归一化层，以确保一致的特征缩放，从而解决可能会影响学习过程的不同特征量级的问题。这一修改被认为有助于稳定训练动态，提高模型在不同金融数据集上的泛化能力。其次，我们采用了梯度缩减架构，即前面的层更宽，后面的层逐渐变窄。这一改进旨在使模型能够捕捉数据中更复杂、更微妙的模式，这是准确预测金融结果的关键因素。在以往的研究中，较简单的模型难以应对金融应用中固有的复杂性和可变性，而这些改进直接应对了这些局限性。初步测试证实，这些变化提高了准确性和模型的稳定性，表明更深入、更细致的网络架构能显著提高金融预测任务的效率。本文详细介绍了这些增强功能的实现过程，并在受控实验环境中评估了它们对模型性能的影响。

{"title":"Gradient Reduction Convolutional Neural Network Policy for Financial Deep Reinforcement Learning","authors":"Sina Montazeri, Haseebullah Jumakhan, Sonia Abrasiabian, Amir Mirzaeinia","doi":"arxiv-2408.11859","DOIUrl":"https://doi.org/arxiv-2408.11859","url":null,"abstract":"Building on our prior explorations of convolutional neural networks (CNNs)\u0000for financial data processing, this paper introduces two significant\u0000enhancements to refine our CNN model's predictive performance and robustness\u0000for financial tabular data. Firstly, we integrate a normalization layer at the\u0000input stage to ensure consistent feature scaling, addressing the issue of\u0000disparate feature magnitudes that can skew the learning process. This\u0000modification is hypothesized to aid in stabilizing the training dynamics and\u0000improving the model's generalization across diverse financial datasets.\u0000Secondly, we employ a Gradient Reduction Architecture, where earlier layers are\u0000wider and subsequent layers are progressively narrower. This enhancement is\u0000designed to enable the model to capture more complex and subtle patterns within\u0000the data, a crucial factor in accurately predicting financial outcomes. These\u0000advancements directly respond to the limitations identified in previous\u0000studies, where simpler models struggled with the complexity and variability\u0000inherent in financial applications. Initial tests confirm that these changes\u0000improve accuracy and model stability, suggesting that deeper and more nuanced\u0000network architectures can significantly benefit financial predictive tasks.\u0000This paper details the implementation of these enhancements and evaluates their\u0000impact on the model's performance in a controlled experimental setting.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142208131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Stochastic Calculus for Option Pricing with Convex Duality, Logistic Model, and Numerical Examination 期权定价的随机微积分与凸对偶、逻辑模型和数值检验

arXiv - QuantFin - Computational Finance

Pub Date : 2024-08-11 DOI: arxiv-2408.05672

Zheng Cao

This thesis explores the historical progression and theoretical constructs offinancial mathematics, with an in-depth exploration of Stochastic Calculus asshowcased in the Binomial Asset Pricing Model and the Continuous-Time Models. Acomprehensive survey of stochastic calculus principles applied to optionpricing is offered, highlighting insights from Peter Carr and LorenzoTorricelli's ``Convex Duality in Continuous Option Pricing Models". Thismanuscript adopts techniques such as Monte-Carlo Simulation and machinelearning algorithms to examine the propositions of Carr and Torricelli, drawingcomparisons between the Logistic and Bachelier models. Additionally, itsuggests directions for potential future research on option pricing methods.

本论文探讨了金融数学的历史发展和理论构建，深入探讨了二项式资产定价模型和连续时间模型中的随机微积分。该书全面介绍了应用于期权定价的随机微积分原理，重点介绍了彼得-卡尔和洛伦佐-托里切利的《连续期权定价模型中的凸对偶性》一书中的见解。这篇手稿采用蒙特卡罗模拟和机器学习算法等技术来研究卡尔和托里切利的命题，并对 Logistic 模型和 Bachelier 模型进行了比较。此外，它还为未来可能的期权定价方法研究指明了方向。

引用次数: 0

Why Groups Matter: Necessity of Group Structures in Attributions 群体为何重要？群体结构在归因中的必要性

arXiv - QuantFin - Computational Finance

Pub Date : 2024-08-11 DOI: arxiv-2408.05701

Dangxing Chen, Jingfeng Chen, Weicheng Ye

Explainable machine learning methods have been accompanied by substantialdevelopment. Despite their success, the existing approaches focus more on thegeneral framework with no prior domain expertise. High-stakes financial sectorshave extensive domain knowledge of the features. Hence, it is expected thatexplanations of models will be consistent with domain knowledge to ensureconceptual soundness. In this work, we study the group structures of features that are naturallyformed in the financial dataset. Our study shows the importance of consideringgroup structures that conform to the regulations. When group structures arepresent, direct applications of explainable machine learning methods, such asShapley values and Integrated Gradients, may not provide consistentexplanations; alternatively, group versions of the Shapley value can provideconsistent explanations. We contain detailed examples to concentrate on thepractical perspective of our framework.

可解释的机器学习方法得到了长足的发展。尽管这些方法取得了成功，但现有的方法更侧重于一般框架，事先并不具备领域专业知识。高风险的金融行业拥有广泛的特征领域知识。因此，我们希望模型的解释与领域知识保持一致，以确保概念的合理性。在这项工作中，我们研究了金融数据集中自然形成的特征组结构。我们的研究表明，考虑符合法规的群体结构非常重要。当存在群体结构时，直接应用可解释的机器学习方法（如夏普利值和综合梯度）可能无法提供一致的解释；或者，夏普利值的群体版本可以提供一致的解释。我们包含了详细的示例，以集中展示我们框架的实用性。

引用次数: 0

Consumer Transactions Simulation through Generative Adversarial Networks 通过生成式对抗网络模拟消费者交易

arXiv - QuantFin - Computational Finance

Pub Date : 2024-08-07 DOI: arxiv-2408.03655

Sergiy Tkachuk, Szymon Łukasik, Anna Wróblewska

In the rapidly evolving domain of large-scale retail data systems,envisioning and simulating future consumer transactions has become a crucialarea of interest. It offers significant potential to fortify demand forecastingand fine-tune inventory management. This paper presents an innovativeapplication of Generative Adversarial Networks (GANs) to generate syntheticretail transaction data, specifically focusing on a novel system architecturethat combines consumer behavior modeling with stock-keeping unit (SKU)availability constraints to address real-world assortment optimizationchallenges. We diverge from conventional methodologies by integrating SKU datainto our GAN architecture and using more sophisticated embedding methods (e.g.,hyper-graphs). This design choice enables our system to generate not onlysimulated consumer purchase behaviors but also reflects the dynamic interplaybetween consumer behavior and SKU availability -- an aspect often overlooked,among others, because of data scarcity in legacy retail simulation models. OurGAN model generates transactions under stock constraints, pioneering aresourceful experimental system with practical implications for real-worldretail operation and strategy. Preliminary results demonstrate enhanced realismin simulated transactions measured by comparing generated items with real onesusing methods employed earlier in related studies. This underscores thepotential for more accurate predictive modeling.

在快速发展的大规模零售数据系统领域，设想和模拟未来的消费者交易已成为一个重要的关注领域。它为加强需求预测和微调库存管理提供了巨大的潜力。本文介绍了生成对抗网络（GANs）在生成合成零售交易数据方面的创新应用，特别关注一种新颖的系统架构，该架构将消费者行为建模与库存单位（SKU）可用性约束相结合，以解决现实世界中的分类优化难题。与传统方法不同的是，我们将 SKU 数据整合到我们的 GAN 架构中，并使用更复杂的嵌入方法（如超图）。这种设计选择使我们的系统不仅能生成模拟的消费者购买行为，还能反映消费者行为与 SKU 可用性之间的动态相互作用，而由于传统零售模拟模型中数据稀缺等原因，这一点常常被忽视。我们的 GAN 模型在库存约束条件下生成交易，开创了一个资源丰富的实验系统，对现实世界的零售运营和战略具有实际意义。初步结果表明，通过比较生成的商品和真实商品，并使用相关研究中早期使用的方法，模拟交易的真实性得到了增强。这凸显了更精确预测建模的潜力。

{"title":"Consumer Transactions Simulation through Generative Adversarial Networks","authors":"Sergiy Tkachuk, Szymon Łukasik, Anna Wróblewska","doi":"arxiv-2408.03655","DOIUrl":"https://doi.org/arxiv-2408.03655","url":null,"abstract":"In the rapidly evolving domain of large-scale retail data systems,\u0000envisioning and simulating future consumer transactions has become a crucial\u0000area of interest. It offers significant potential to fortify demand forecasting\u0000and fine-tune inventory management. This paper presents an innovative\u0000application of Generative Adversarial Networks (GANs) to generate synthetic\u0000retail transaction data, specifically focusing on a novel system architecture\u0000that combines consumer behavior modeling with stock-keeping unit (SKU)\u0000availability constraints to address real-world assortment optimization\u0000challenges. We diverge from conventional methodologies by integrating SKU data\u0000into our GAN architecture and using more sophisticated embedding methods (e.g.,\u0000hyper-graphs). This design choice enables our system to generate not only\u0000simulated consumer purchase behaviors but also reflects the dynamic interplay\u0000between consumer behavior and SKU availability -- an aspect often overlooked,\u0000among others, because of data scarcity in legacy retail simulation models. Our\u0000GAN model generates transactions under stock constraints, pioneering a\u0000resourceful experimental system with practical implications for real-world\u0000retail operation and strategy. Preliminary results demonstrate enhanced realism\u0000in simulated transactions measured by comparing generated items with real ones\u0000using methods employed earlier in related studies. This underscores the\u0000potential for more accurate predictive modeling.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"183 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141968706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

arXiv - QuantFin - Computational Finance

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀