首页 > 最新文献

arXiv - QuantFin - Computational Finance最新文献

英文 中文
Evaluating Microscopic and Macroscopic Models for Derivative Contracts on Commodity Indices 评估商品指数衍生合约的微观和宏观模型
Pub Date : 2024-07-17 DOI: arxiv-2408.00784
Alberto Manzano, Emanuele Nastasi, Andrea Pallavicini, Carlos Vázquez
In this article, we analyze two modeling approaches for the pricing ofderivative contracts on a commodity index. The first one is a microscopicapproach, where the components of the index are modeled individually, and theindex price is derived from their combination. The second one is a macroscopicapproach, where the index is modeled directly. While the microscopic approachoffers greater flexibility, its calibration results to be more challenging,thus leading practitioners to favor the macroscopic approach. However, in themacroscopic model, the lack of explicit futures curve dynamics raises questionsabout its ability to accurately capture the behavior of the index and itssensitivities. In order to investigate this, we calibrate both models usingderivatives of the S&P GSCI Crude Oil excess-return index and compare theirpricing and sensitivities on path-dependent options, such as autocallablecontracts. This research provides insights into the suitability of macroscopicmodels for pricing and hedging purposes in real scenarios.
本文分析了商品指数衍生合约定价的两种建模方法。第一种是微观方法,即对指数的各个组成部分分别建模,然后根据它们的组合得出指数价格。第二种是宏观方法,即直接建立指数模型。虽然微观方法具有更大的灵活性,但其校准结果更具挑战性,因此从业人员更倾向于宏观方法。然而,在宏观模型中,由于缺乏明确的期货曲线动态,人们对其准确捕捉指数行为及其敏感性的能力产生了疑问。为了研究这个问题,我们使用 S&P GSCI 原油超额收益指数的衍生品校准了这两个模型,并比较了它们对路径依赖期权(如自动赎回合约)的定价和敏感性。这项研究为宏观模型在实际情况下的定价和对冲目的的适用性提供了见解。
{"title":"Evaluating Microscopic and Macroscopic Models for Derivative Contracts on Commodity Indices","authors":"Alberto Manzano, Emanuele Nastasi, Andrea Pallavicini, Carlos Vázquez","doi":"arxiv-2408.00784","DOIUrl":"https://doi.org/arxiv-2408.00784","url":null,"abstract":"In this article, we analyze two modeling approaches for the pricing of\u0000derivative contracts on a commodity index. The first one is a microscopic\u0000approach, where the components of the index are modeled individually, and the\u0000index price is derived from their combination. The second one is a macroscopic\u0000approach, where the index is modeled directly. While the microscopic approach\u0000offers greater flexibility, its calibration results to be more challenging,\u0000thus leading practitioners to favor the macroscopic approach. However, in the\u0000macroscopic model, the lack of explicit futures curve dynamics raises questions\u0000about its ability to accurately capture the behavior of the index and its\u0000sensitivities. In order to investigate this, we calibrate both models using\u0000derivatives of the S&P GSCI Crude Oil excess-return index and compare their\u0000pricing and sensitivities on path-dependent options, such as autocallable\u0000contracts. This research provides insights into the suitability of macroscopic\u0000models for pricing and hedging purposes in real scenarios.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"77 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141932409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FinDKG: Dynamic Knowledge Graphs with Large Language Models for Detecting Global Trends in Financial Markets FinDKG:利用大型语言模型检测金融市场全球趋势的动态知识图谱
Pub Date : 2024-07-15 DOI: arxiv-2407.10909
Xiaohui Victor Li, Francesco Sanna Passino
Dynamic knowledge graphs (DKGs) are popular structures to express differenttypes of connections between objects over time. They can also serve as anefficient mathematical tool to represent information extracted from complexunstructured data sources, such as text or images. Within financialapplications, DKGs could be used to detect trends for strategic thematicinvesting, based on information obtained from financial news articles. In thiswork, we explore the properties of large language models (LLMs) as dynamicknowledge graph generators, proposing a novel open-source fine-tuned LLM forthis purpose, called the Integrated Contextual Knowledge Graph Generator(ICKG). We use ICKG to produce a novel open-source DKG from a corpus offinancial news articles, called FinDKG, and we propose an attention-based GNNarchitecture for analysing it, called KGTransformer. We test the performance ofthe proposed model on benchmark datasets and FinDKG, demonstrating superiorperformance on link prediction tasks. Additionally, we evaluate the performanceof the KGTransformer on FinDKG for thematic investing, showing it canoutperform existing thematic ETFs.
动态知识图谱(DKGs)是一种流行的结构,用于表达对象之间随时间发生的不同类型的联系。它们还可以作为一种高效的数学工具,用于表示从复杂的非结构化数据源(如文本或图像)中提取的信息。在金融应用中,DKGs 可用于根据从金融新闻文章中获取的信息,检测战略主题投资的趋势。在这项工作中,我们探索了大型语言模型(LLM)作为动态知识图谱生成器的特性,并为此提出了一种新型开源微调 LLM,称为集成上下文知识图谱生成器(ICKG)。我们使用 ICKG 从金融新闻文章语料库中生成了一种新型开源 DKG,称为 FinDKG,并提出了一种基于注意力的 GNN 架构来分析它,称为 KGTransformer。我们在基准数据集和 FinDKG 上测试了所提模型的性能,结果表明该模型在链接预测任务中表现出色。此外,我们还在 FinDKG 上评估了 KGTransformer 在主题投资方面的性能,结果表明它可以超越现有的主题 ETF。
{"title":"FinDKG: Dynamic Knowledge Graphs with Large Language Models for Detecting Global Trends in Financial Markets","authors":"Xiaohui Victor Li, Francesco Sanna Passino","doi":"arxiv-2407.10909","DOIUrl":"https://doi.org/arxiv-2407.10909","url":null,"abstract":"Dynamic knowledge graphs (DKGs) are popular structures to express different\u0000types of connections between objects over time. They can also serve as an\u0000efficient mathematical tool to represent information extracted from complex\u0000unstructured data sources, such as text or images. Within financial\u0000applications, DKGs could be used to detect trends for strategic thematic\u0000investing, based on information obtained from financial news articles. In this\u0000work, we explore the properties of large language models (LLMs) as dynamic\u0000knowledge graph generators, proposing a novel open-source fine-tuned LLM for\u0000this purpose, called the Integrated Contextual Knowledge Graph Generator\u0000(ICKG). We use ICKG to produce a novel open-source DKG from a corpus of\u0000financial news articles, called FinDKG, and we propose an attention-based GNN\u0000architecture for analysing it, called KGTransformer. We test the performance of\u0000the proposed model on benchmark datasets and FinDKG, demonstrating superior\u0000performance on link prediction tasks. Additionally, we evaluate the performance\u0000of the KGTransformer on FinDKG for thematic investing, showing it can\u0000outperform existing thematic ETFs.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"2012 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141719350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning in weekly movement prediction 每周运动预测中的机器学习
Pub Date : 2024-07-13 DOI: arxiv-2407.09831
Han Gui
To predict the future movements of stock markets, numerous studiesconcentrate on daily data and employ various machine learning (ML) models asbenchmarks that often vary and lack standardization across different researchworks. This paper tries to solve the problem from a fresh standpoint by aimingto predict the weekly movements, and introducing a novel benchmark of randomtraders. This benchmark is independent of any ML model, thus making it moreobjective and potentially serving as a commonly recognized standard. Duringtraining process, apart from the basic features such as technical indicators,scaling laws and directional changes are introduced as additional features,furthermore, the training datasets are also adjusted by assigning varyingweights to different samples, the weighting approach allows the models toemphasize specific samples. On back-testing, several trained models show goodperformance, with the multi-layer perception (MLP) demonstrating stability androbustness across extensive and comprehensive data that include upward,downward and cyclic trends. The unique perspective of this work that focuses onweekly movements, incorporates new features and creates an objective benchmark,contributes to the existing literature on stock market prediction.
为了预测股票市场的未来走势,许多研究都集中在每日数据上,并采用各种机器学习(ML)模型作为基准,但不同的研究成果往往各不相同,缺乏标准化。本文试图从一个全新的角度来解决这个问题,即预测每周的走势,并引入一个随机交易者的新基准。该基准独立于任何 ML 模型,因此更具客观性,并有可能成为公认的标准。在训练过程中,除了技术指标等基本特征外,还引入了缩放规律和方向变化作为附加特征,此外,还通过为不同样本分配不同权重来调整训练数据集,权重方法允许模型强调特定样本。在回溯测试中,几个训练有素的模型表现出了良好的性能,其中多层感知(MLP)在广泛而全面的数据(包括上升、下降和周期趋势)中表现出了稳定性和稳健性。这项工作以独特的视角关注每周的走势,纳入了新的特征,并创建了一个客观的基准,为现有的股市预测文献做出了贡献。
{"title":"Machine learning in weekly movement prediction","authors":"Han Gui","doi":"arxiv-2407.09831","DOIUrl":"https://doi.org/arxiv-2407.09831","url":null,"abstract":"To predict the future movements of stock markets, numerous studies\u0000concentrate on daily data and employ various machine learning (ML) models as\u0000benchmarks that often vary and lack standardization across different research\u0000works. This paper tries to solve the problem from a fresh standpoint by aiming\u0000to predict the weekly movements, and introducing a novel benchmark of random\u0000traders. This benchmark is independent of any ML model, thus making it more\u0000objective and potentially serving as a commonly recognized standard. During\u0000training process, apart from the basic features such as technical indicators,\u0000scaling laws and directional changes are introduced as additional features,\u0000furthermore, the training datasets are also adjusted by assigning varying\u0000weights to different samples, the weighting approach allows the models to\u0000emphasize specific samples. On back-testing, several trained models show good\u0000performance, with the multi-layer perception (MLP) demonstrating stability and\u0000robustness across extensive and comprehensive data that include upward,\u0000downward and cyclic trends. The unique perspective of this work that focuses on\u0000weekly movements, incorporates new features and creates an objective benchmark,\u0000contributes to the existing literature on stock market prediction.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"45 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141719351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attribution Methods in Asset Pricing: Do They Account for Risk? 资产定价中的归因方法:它们考虑风险了吗?
Pub Date : 2024-07-12 DOI: arxiv-2407.08953
Dangxing Chen, Yuan Gao
Over the past few decades, machine learning models have been extremelysuccessful. As a result of axiomatic attribution methods, feature contributionshave been explained more clearly and rigorously. There are, however, fewstudies that have examined domain knowledge in conjunction with the axioms. Inthis study, we examine asset pricing in finance, a field closely related torisk management. Consequently, when applying machine learning models, we mustensure that the attribution methods reflect the underlying risks accurately. Inthis work, we present and study several axioms derived from asset pricingdomain knowledge. It is shown that while Shapley value and Integrated Gradientspreserve most axioms, neither can satisfy all axioms. Using extensiveanalytical and empirical examples, we demonstrate how attribution methods canreflect risks and when they should not be used.
过去几十年来,机器学习模型取得了巨大成功。由于采用了公理归因方法,特征贡献得到了更清晰、更严格的解释。然而,很少有研究将领域知识与公理结合起来进行研究。在本研究中,我们考察了金融领域的资产定价,这是一个与风险管理密切相关的领域。因此,在应用机器学习模型时,我们必须确保归因方法能准确反映潜在风险。在这项工作中,我们提出并研究了从资产定价领域知识中得出的几条公理。研究表明,虽然夏普利值和综合梯度保留了大部分公理,但两者都不能满足所有公理。通过大量的分析和经验实例,我们证明了归因方法如何反映风险,以及何时不应使用这些方法。
{"title":"Attribution Methods in Asset Pricing: Do They Account for Risk?","authors":"Dangxing Chen, Yuan Gao","doi":"arxiv-2407.08953","DOIUrl":"https://doi.org/arxiv-2407.08953","url":null,"abstract":"Over the past few decades, machine learning models have been extremely\u0000successful. As a result of axiomatic attribution methods, feature contributions\u0000have been explained more clearly and rigorously. There are, however, few\u0000studies that have examined domain knowledge in conjunction with the axioms. In\u0000this study, we examine asset pricing in finance, a field closely related to\u0000risk management. Consequently, when applying machine learning models, we must\u0000ensure that the attribution methods reflect the underlying risks accurately. In\u0000this work, we present and study several axioms derived from asset pricing\u0000domain knowledge. It is shown that while Shapley value and Integrated Gradients\u0000preserve most axioms, neither can satisfy all axioms. Using extensive\u0000analytical and empirical examples, we demonstrate how attribution methods can\u0000reflect risks and when they should not be used.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"54 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141719421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unified continuous-time q-learning for mean-field game and mean-field control problems 均场博弈和均场控制问题的统一连续时间 q-learning
Pub Date : 2024-07-05 DOI: arxiv-2407.04521
Xiaoli Wei, Xiang Yu, Fengyi Yuan
This paper studies the continuous-time q-learning in the mean-fieldjump-diffusion models from the representative agent's perspective. To overcomethe challenge when the population distribution may not be directly observable,we introduce the integrated q-function in decoupled form (decoupledIq-function) and establish its martingale characterization together with thevalue function, which provides a unified policy evaluation rule for bothmean-field game (MFG) and mean-field control (MFC) problems. Moreover,depending on the task to solve the MFG or MFC problem, we can employ thedecoupled Iq-function by different means to learn the mean-field equilibriumpolicy or the mean-field optimal policy respectively. As a result, we devise aunified q-learning algorithm for both MFG and MFC problems by utilizing alltest policies stemming from the mean-field interactions. For several examplesin the jump-diffusion setting, within and beyond the LQ framework, we canobtain the exact parameterization of the decoupled Iq-functions and the valuefunctions, and illustrate our algorithm from the representative agent'sperspective with satisfactory performance.
本文从代表代理的角度研究了均值-场跳跃-扩散模型中的连续时间q-学习。为了克服当种群分布可能无法直接观测时的挑战,我们引入了解耦形式的集成 q 函数(解耦 q 函数),并将其与值函数一起建立了马丁格尔特性,从而为均场博弈(MFG)和均场控制(MFC)问题提供了统一的策略评估规则。此外,根据求解 MFG 或 MFC 问题的任务不同,我们可以通过不同的方法利用解耦 Iq 函数来分别学习均值场均衡策略或均值场最优策略。因此,我们利用均值场相互作用产生的所有检验策略,为 MFG 和 MFC 问题设计了一种统一的 q-learning 算法。对于跳跃扩散设置中的几个例子,在 LQ 框架之内和之外,我们可以获得解耦 Iq 函数和价值函数的精确参数化,并从代表代理的角度说明了我们的算法,结果令人满意。
{"title":"Unified continuous-time q-learning for mean-field game and mean-field control problems","authors":"Xiaoli Wei, Xiang Yu, Fengyi Yuan","doi":"arxiv-2407.04521","DOIUrl":"https://doi.org/arxiv-2407.04521","url":null,"abstract":"This paper studies the continuous-time q-learning in the mean-field\u0000jump-diffusion models from the representative agent's perspective. To overcome\u0000the challenge when the population distribution may not be directly observable,\u0000we introduce the integrated q-function in decoupled form (decoupled\u0000Iq-function) and establish its martingale characterization together with the\u0000value function, which provides a unified policy evaluation rule for both\u0000mean-field game (MFG) and mean-field control (MFC) problems. Moreover,\u0000depending on the task to solve the MFG or MFC problem, we can employ the\u0000decoupled Iq-function by different means to learn the mean-field equilibrium\u0000policy or the mean-field optimal policy respectively. As a result, we devise a\u0000unified q-learning algorithm for both MFG and MFC problems by utilizing all\u0000test policies stemming from the mean-field interactions. For several examples\u0000in the jump-diffusion setting, within and beyond the LQ framework, we can\u0000obtain the exact parameterization of the decoupled Iq-functions and the value\u0000functions, and illustrate our algorithm from the representative agent's\u0000perspective with satisfactory performance.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141573213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GraphCNNpred: A stock market indices prediction using a Graph based deep learning system GraphCNNpred:使用基于图形的深度学习系统预测股市指数
Pub Date : 2024-07-04 DOI: arxiv-2407.03760
Yuhui Jin
Deep learning techniques for predicting stock market prices is an populartopic in the field of data science. Customized feature engineering arises aspre-processing tools of different stock market dataset. In this paper, we givea graph neural network based convolutional neural network (CNN) model, that canbe applied on diverse source of data, in the attempt to extract features topredict the trends of indices of text{S}&text{P} 500, NASDAQ, DJI, NYSE, andRUSSEL.
用于预测股市价格的深度学习技术是数据科学领域的一个热门话题。定制化特征工程是处理不同股市数据集的工具。在本文中,我们给出了一个基于图神经网络的卷积神经网络(CNN)模型,该模型可以应用于不同的数据源,试图提取特征来预测(text{S}&text{P} 500、NASDAQ、DJI、NYSE 和 RUSSEL)指数的走势。
{"title":"GraphCNNpred: A stock market indices prediction using a Graph based deep learning system","authors":"Yuhui Jin","doi":"arxiv-2407.03760","DOIUrl":"https://doi.org/arxiv-2407.03760","url":null,"abstract":"Deep learning techniques for predicting stock market prices is an popular\u0000topic in the field of data science. Customized feature engineering arises as\u0000pre-processing tools of different stock market dataset. In this paper, we give\u0000a graph neural network based convolutional neural network (CNN) model, that can\u0000be applied on diverse source of data, in the attempt to extract features to\u0000predict the trends of indices of text{S}&text{P} 500, NASDAQ, DJI, NYSE, and\u0000RUSSEL.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"12 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141573476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Structure of Financial Equity Research Reports -- Identification of the Most Frequently Asked Questions in Financial Analyst Reports to Automate Equity Research Using Llama 3 and GPT-4 金融股票研究报告的结构 -- 利用 Llama 3 和 GPT-4 识别金融分析师报告中最常见的问题,实现股票研究自动化
Pub Date : 2024-07-04 DOI: arxiv-2407.18327
Adria Pop, Jan Spörer, Siegfried Handschuh
This research dissects financial equity research reports (ERRs) by mappingtheir content into categories. There is insufficient empirical analysis of the questions answered in ERRs.In particular, it is not understood how frequently certain information appears,what information is considered essential, and what information requires humanjudgment to distill into an ERR. The study analyzes 72 ERRs sentence-by-sentence, classifying their 4940sentences into 169 unique question archetypes. We did not predefine thequestions but derived them solely from the statements in the ERRs. Thisapproach provides an unbiased view of the content of the observed ERRs.Subsequently, we used public corporate reports to classify the questions'potential for automation. Answers were labeled "text-extractable" if theanswers to the question were accessible in corporate reports. 78.7% of the questions in ERRs can be automated. Those automatable questionconsist of 48.2% text-extractable (suited to processing by large languagemodels, LLMs) and 30.5% database-extractable questions. Only 21.3% of questionsrequire human judgment to answer. We empirically validate using Llama-3-70B and GPT-4-turbo-2024-04-09 thatrecent advances in language generation and information extraction enable theautomation of approximately 80% of the statements in ERRs. Surprisingly, themodels complement each other's strengths and weaknesses well. The research confirms that the current writing process of ERRs can likelybenefit from additional automation, improving quality and efficiency. Theresearch thus allows us to quantify the potential impacts of introducing largelanguage models in the ERR writing process. The full question list, including the archetypes and their frequency, will bemade available online after peer review.
本研究将金融股票研究报告(ERRs)的内容分门别类,对其进行剖析。特别是,人们不了解某些信息出现的频率、哪些信息被认为是必要的、哪些信息需要人为判断才能提炼成 ERR。本研究对 72 篇 ERR 进行了逐句分析,将其中的 4940 句分为 169 种独特的问题原型。我们没有预先定义这些问题,而是完全根据 ERR 中的语句得出这些问题。随后,我们利用公开的公司报告对问题的自动化潜力进行了分类。如果问题的答案可以在公司报告中获取,则答案被标记为 "可提取文本"。企业资源报告中 78.7% 的问题可以自动化。这些可自动处理的问题包括 48.2% 的可文本提取问题(适合大型语言模型处理)和 30.5% 的可数据库提取问题。只有 21.3% 的问题需要人工判断才能回答。我们使用 Llama-3-70B 和 GPT-4-turbo-2024-04-09 进行了实证验证,语言生成和信息提取方面的最新进展使得ERR 中约 80% 的语句可以实现自动化。令人惊奇的是,这些模型很好地互补了彼此的优缺点。研究证实,目前的《紧急救济报告》撰写过程有可能从更多的自动化中获益,从而提高质量和效率。因此,这项研究使我们能够量化在 ERR 撰写过程中引入大语言模型的潜在影响。完整的问题清单,包括原型及其频率,将在同行评审后在网上公布。
{"title":"The Structure of Financial Equity Research Reports -- Identification of the Most Frequently Asked Questions in Financial Analyst Reports to Automate Equity Research Using Llama 3 and GPT-4","authors":"Adria Pop, Jan Spörer, Siegfried Handschuh","doi":"arxiv-2407.18327","DOIUrl":"https://doi.org/arxiv-2407.18327","url":null,"abstract":"This research dissects financial equity research reports (ERRs) by mapping\u0000their content into categories. There is insufficient empirical analysis of the questions answered in ERRs.\u0000In particular, it is not understood how frequently certain information appears,\u0000what information is considered essential, and what information requires human\u0000judgment to distill into an ERR. The study analyzes 72 ERRs sentence-by-sentence, classifying their 4940\u0000sentences into 169 unique question archetypes. We did not predefine the\u0000questions but derived them solely from the statements in the ERRs. This\u0000approach provides an unbiased view of the content of the observed ERRs.\u0000Subsequently, we used public corporate reports to classify the questions'\u0000potential for automation. Answers were labeled \"text-extractable\" if the\u0000answers to the question were accessible in corporate reports. 78.7% of the questions in ERRs can be automated. Those automatable question\u0000consist of 48.2% text-extractable (suited to processing by large language\u0000models, LLMs) and 30.5% database-extractable questions. Only 21.3% of questions\u0000require human judgment to answer. We empirically validate using Llama-3-70B and GPT-4-turbo-2024-04-09 that\u0000recent advances in language generation and information extraction enable the\u0000automation of approximately 80% of the statements in ERRs. Surprisingly, the\u0000models complement each other's strengths and weaknesses well. The research confirms that the current writing process of ERRs can likely\u0000benefit from additional automation, improving quality and efficiency. The\u0000research thus allows us to quantify the potential impacts of introducing large\u0000language models in the ERR writing process. The full question list, including the archetypes and their frequency, will be\u0000made available online after peer review.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"50 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141865953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications CatMemo 参加 FinLLM 挑战任务:利用金融应用中的数据融合微调大型语言模型
Pub Date : 2024-07-02 DOI: arxiv-2407.01953
Yupeng Cao, Zhiyuan Yao, Zhi Chen, Zhiyang Deng
The integration of Large Language Models (LLMs) into financial analysis hasgarnered significant attention in the NLP community. This paper presents oursolution to IJCAI-2024 FinLLM challenge, investigating the capabilities of LLMswithin three critical areas of financial tasks: financial classification,financial text summarization, and single stock trading. We adopted Llama3-8Band Mistral-7B as base models, fine-tuning them through Parameter EfficientFine-Tuning (PEFT) and Low-Rank Adaptation (LoRA) approaches. To enhance modelperformance, we combine datasets from task 1 and task 2 for data fusion. Ourapproach aims to tackle these diverse tasks in a comprehensive and integratedmanner, showcasing LLMs' capacity to address diverse and complex financialtasks with improved accuracy and decision-making capabilities.
将大型语言模型(LLMs)集成到金融分析中已引起 NLP 界的极大关注。本文针对 IJCAI-2024 FinLLM 挑战提出了我们的解决方案,研究了 LLM 在金融任务的三个关键领域中的能力:金融分类、金融文本摘要和单一股票交易。我们采用 Llama3-8B 和 Mistral-7B 作为基础模型,通过参数高效微调(PEFT)和低级别自适应(LoRA)方法对其进行微调。为了提高模型性能,我们将任务 1 和任务 2 的数据集结合起来进行数据融合。我们的方法旨在以全面、综合的方式解决这些不同的任务,展示 LLMs 解决多样化、复杂的金融任务的能力,并提高准确性和决策能力。
{"title":"CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications","authors":"Yupeng Cao, Zhiyuan Yao, Zhi Chen, Zhiyang Deng","doi":"arxiv-2407.01953","DOIUrl":"https://doi.org/arxiv-2407.01953","url":null,"abstract":"The integration of Large Language Models (LLMs) into financial analysis has\u0000garnered significant attention in the NLP community. This paper presents our\u0000solution to IJCAI-2024 FinLLM challenge, investigating the capabilities of LLMs\u0000within three critical areas of financial tasks: financial classification,\u0000financial text summarization, and single stock trading. We adopted Llama3-8B\u0000and Mistral-7B as base models, fine-tuning them through Parameter Efficient\u0000Fine-Tuning (PEFT) and Low-Rank Adaptation (LoRA) approaches. To enhance model\u0000performance, we combine datasets from task 1 and task 2 for data fusion. Our\u0000approach aims to tackle these diverse tasks in a comprehensive and integrated\u0000manner, showcasing LLMs' capacity to address diverse and complex financial\u0000tasks with improved accuracy and decision-making capabilities.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141520787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting public market behavior from private equity deals 从私募股权交易预测公共市场行为
Pub Date : 2024-07-01 DOI: arxiv-2407.01818
Paolo Barucca, Flaviano Morone
We process private equity transactions to predict public market behavior witha logit model. Specifically, we estimate our model to predict quarterly returnsfor both the broad market and for individual sectors. Our hypothesis is thatprivate equity investments (in aggregate) carry predictive signal aboutpublicly traded securities. The key source of such predictive signal is thefact that, during their diligence process, private equity fund managers areprivy to valuable company information that may not yet be reflected in thepublic markets at the time of their investment. Thus, we posit that we candiscover investors' collective near-term insight via detailed analysis of thetiming and nature of the deals they execute. We evaluate the accuracy of theestimated model by applying it to test data where we know the correct outputvalue. Remarkably, our model performs consistently better than a null modelsimply based on return statistics, while showing a predictive accuracy of up to70% in sectors such as Consumer Services, Communications, and Non EnergyMinerals.
我们采用 logit 模型处理私募股权交易,以预测公开市场行为。具体来说,我们通过估计模型来预测大盘和个别行业的季度回报率。我们的假设是,私募股权投资(总体而言)带有对公开交易证券的预测信号。这种预测信号的主要来源是,私募股权基金经理在其勤勉尽责的过程中,有机会获得有价值的公司信息,而这些信息在其投资时可能尚未反映在公开市场上。因此,我们认为,通过对投资者执行交易的时机和性质进行详细分析,我们可以发现投资者的近期集体洞察力。我们将估计模型应用于已知正确输出值的测试数据,以此评估模型的准确性。值得注意的是,我们的模型在消费服务、通信和非能源矿产等行业的预测准确率高达 70%,表现始终优于单纯基于回报统计的无效模型。
{"title":"Predicting public market behavior from private equity deals","authors":"Paolo Barucca, Flaviano Morone","doi":"arxiv-2407.01818","DOIUrl":"https://doi.org/arxiv-2407.01818","url":null,"abstract":"We process private equity transactions to predict public market behavior with\u0000a logit model. Specifically, we estimate our model to predict quarterly returns\u0000for both the broad market and for individual sectors. Our hypothesis is that\u0000private equity investments (in aggregate) carry predictive signal about\u0000publicly traded securities. The key source of such predictive signal is the\u0000fact that, during their diligence process, private equity fund managers are\u0000privy to valuable company information that may not yet be reflected in the\u0000public markets at the time of their investment. Thus, we posit that we can\u0000discover investors' collective near-term insight via detailed analysis of the\u0000timing and nature of the deals they execute. We evaluate the accuracy of the\u0000estimated model by applying it to test data where we know the correct output\u0000value. Remarkably, our model performs consistently better than a null model\u0000simply based on return statistics, while showing a predictive accuracy of up to\u000070% in sectors such as Consumer Services, Communications, and Non Energy\u0000Minerals.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"136 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141520786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha Factors AlphaForge:挖掘并动态组合公式化阿尔法因子的框架
Pub Date : 2024-06-26 DOI: arxiv-2406.18394
Hao Shi, Cuicui Luo, Weili Song, Xinting Zhang, Xiang Ao
The variability and low signal-to-noise ratio in financial data, combinedwith the necessity for interpretability, make the alpha factor mining workflowa crucial component of quantitative investment. Transitioning from early manualextraction to genetic programming, the most advanced approach in this domaincurrently employs reinforcement learning to mine a set of combination factorswith fixed weights. However, the performance of resultant alpha factorsexhibits inconsistency, and the inflexibility of fixed factor weights provesinsufficient in adapting to the dynamic nature of financial markets. To addressthis issue, this paper proposes a two-stage formulaic alpha generatingframework AlphaForge, for alpha factor mining and factor combination. Thisframework employs a generative-predictive neural network to generate factors,leveraging the robust spatial exploration capabilities inherent in deeplearning while concurrently preserving diversity. The combination model withinthe framework incorporates the temporal performance of factors for selectionand dynamically adjusts the weights assigned to each component alpha factor.Experiments conducted on real-world datasets demonstrate that our proposedmodel outperforms contemporary benchmarks in formulaic alpha factor mining.Furthermore, our model exhibits a notable enhancement in portfolio returnswithin the realm of quantitative investment.
金融数据的多变性和低信噪比,再加上可解释性的必要性,使得阿尔法因子挖掘工作流程成为量化投资的重要组成部分。从早期的人工提取到遗传编程,该领域目前最先进的方法是采用强化学习来挖掘一组具有固定权重的组合因子。然而,所得到的阿尔法因子表现出不一致性,而且固定因子权重缺乏灵活性,不足以适应金融市场的动态特性。为了解决这个问题,本文提出了一个两阶段公式化阿尔法生成框架 AlphaForge,用于阿尔法因子挖掘和因子组合。该框架采用生成-预测神经网络生成因子,利用深度学习固有的稳健空间探索能力,同时保持多样性。在现实世界数据集上进行的实验表明,我们提出的模型在公式化阿尔法因子挖掘方面优于当代基准。此外,我们的模型在量化投资领域的投资组合回报率方面也有显著提升。
{"title":"AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha Factors","authors":"Hao Shi, Cuicui Luo, Weili Song, Xinting Zhang, Xiang Ao","doi":"arxiv-2406.18394","DOIUrl":"https://doi.org/arxiv-2406.18394","url":null,"abstract":"The variability and low signal-to-noise ratio in financial data, combined\u0000with the necessity for interpretability, make the alpha factor mining workflow\u0000a crucial component of quantitative investment. Transitioning from early manual\u0000extraction to genetic programming, the most advanced approach in this domain\u0000currently employs reinforcement learning to mine a set of combination factors\u0000with fixed weights. However, the performance of resultant alpha factors\u0000exhibits inconsistency, and the inflexibility of fixed factor weights proves\u0000insufficient in adapting to the dynamic nature of financial markets. To address\u0000this issue, this paper proposes a two-stage formulaic alpha generating\u0000framework AlphaForge, for alpha factor mining and factor combination. This\u0000framework employs a generative-predictive neural network to generate factors,\u0000leveraging the robust spatial exploration capabilities inherent in deep\u0000learning while concurrently preserving diversity. The combination model within\u0000the framework incorporates the temporal performance of factors for selection\u0000and dynamically adjusts the weights assigned to each component alpha factor.\u0000Experiments conducted on real-world datasets demonstrate that our proposed\u0000model outperforms contemporary benchmarks in formulaic alpha factor mining.\u0000Furthermore, our model exhibits a notable enhancement in portfolio returns\u0000within the realm of quantitative investment.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"35 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141501637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - QuantFin - Computational Finance
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1