首页 > 最新文献

arXiv - CS - Computational Engineering, Finance, and Science最新文献

英文 中文
L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization L3iTC 参加 FinLLM 挑战任务:金融文本分类和摘要的量化
Pub Date : 2024-08-06 DOI: arxiv-2408.03033
Elvys Linhares Pontes, Carlos-Emiliano González-Gallardo, Mohamed Benjannet, Caryn Qu, Antoine Doucet
This article details our participation (L3iTC) in the FinLLM Challenge Task2024, focusing on two key areas: Task 1, financial text classification, andTask 2, financial text summarization. To address these challenges, wefine-tuned several large language models (LLMs) to optimize performance foreach task. Specifically, we used 4-bit quantization and LoRA to determine whichlayers of the LLMs should be trained at a lower precision. This approach notonly accelerated the fine-tuning process on the training data provided by theorganizers but also enabled us to run the models on low GPU memory. Ourfine-tuned models achieved third place for the financial classification taskwith an F1-score of 0.7543 and secured sixth place in the financialsummarization task on the official test datasets.
本文详细介绍了我们(L3iTC)参与 FinLLM Challenge Task2024 的情况,重点关注两个关键领域:任务 1:金融文本分类;任务 2:金融文本摘要。为了应对这些挑战,我们对多个大型语言模型(LLM)进行了微调,以优化每个任务的性能。具体来说,我们使用 4 位量化和 LoRA 来确定 LLM 的哪些层应该以较低的精度进行训练。这种方法不仅加快了对组织者提供的训练数据进行微调的过程,还使我们能够在较低的 GPU 内存上运行模型。我们微调后的模型在金融分类任务中取得了第三名的好成绩,F1 分数为 0.7543,并在官方测试数据集的金融摘要任务中取得了第六名的好成绩。
{"title":"L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization","authors":"Elvys Linhares Pontes, Carlos-Emiliano González-Gallardo, Mohamed Benjannet, Caryn Qu, Antoine Doucet","doi":"arxiv-2408.03033","DOIUrl":"https://doi.org/arxiv-2408.03033","url":null,"abstract":"This article details our participation (L3iTC) in the FinLLM Challenge Task\u00002024, focusing on two key areas: Task 1, financial text classification, and\u0000Task 2, financial text summarization. To address these challenges, we\u0000fine-tuned several large language models (LLMs) to optimize performance for\u0000each task. Specifically, we used 4-bit quantization and LoRA to determine which\u0000layers of the LLMs should be trained at a lower precision. This approach not\u0000only accelerated the fine-tuning process on the training data provided by the\u0000organizers but also enabled us to run the models on low GPU memory. Our\u0000fine-tuned models achieved third place for the financial classification task\u0000with an F1-score of 0.7543 and secured sixth place in the financial\u0000summarization task on the official test datasets.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SETN: Stock Embedding Enhanced with Textual and Network Information SETN:利用文本和网络信息增强股票嵌入功能
Pub Date : 2024-08-06 DOI: arxiv-2408.02899
Takehiro Takayanagi, Hiroki Sakaji, Kiyoshi Izumi
Stock embedding is a method for vector representation of stocks. There is agrowing demand for vector representations of stock, i.e., stock embedding, inwealth management sectors, and the method has been applied to various taskssuch as stock price prediction, portfolio optimization, and similar fundidentifications. Stock embeddings have the advantage of enabling thequantification of relative relationships between stocks, and they can extractuseful information from unstructured data such as text and network data. Inthis study, we propose stock embedding enhanced with textual and networkinformation (SETN) using a domain-adaptive pre-trained transformer-based modelto embed textual information and a graph neural network model to grasp networkinformation. We evaluate the performance of our proposed model on relatedcompany information extraction tasks. We also demonstrate that stock embeddingsobtained from the proposed model perform better in creating thematic funds thanthose obtained from baseline methods, providing a promising pathway for variousapplications in the wealth management industry.
股票嵌入是一种股票向量表示方法。财富管理领域对股票向量表示(即股票嵌入)的需求日益增长,该方法已被应用于各种任务,如股票价格预测、投资组合优化和类似的基金识别。股票嵌入的优点是可以量化股票之间的相对关系,并且可以从文本和网络数据等非结构化数据中提取有用的信息。在本研究中,我们提出了增强文本和网络信息的股票嵌入(SETN),使用基于变换器的领域自适应预训练模型来嵌入文本信息,并使用图神经网络模型来掌握网络信息。我们在相关公司信息提取任务中评估了所提模型的性能。我们还证明,与基线方法相比,通过所提模型获得的股票嵌入在创建主题基金方面表现更好,这为财富管理行业的各种应用提供了一条前景广阔的途径。
{"title":"SETN: Stock Embedding Enhanced with Textual and Network Information","authors":"Takehiro Takayanagi, Hiroki Sakaji, Kiyoshi Izumi","doi":"arxiv-2408.02899","DOIUrl":"https://doi.org/arxiv-2408.02899","url":null,"abstract":"Stock embedding is a method for vector representation of stocks. There is a\u0000growing demand for vector representations of stock, i.e., stock embedding, in\u0000wealth management sectors, and the method has been applied to various tasks\u0000such as stock price prediction, portfolio optimization, and similar fund\u0000identifications. Stock embeddings have the advantage of enabling the\u0000quantification of relative relationships between stocks, and they can extract\u0000useful information from unstructured data such as text and network data. In\u0000this study, we propose stock embedding enhanced with textual and network\u0000information (SETN) using a domain-adaptive pre-trained transformer-based model\u0000to embed textual information and a graph neural network model to grasp network\u0000information. We evaluate the performance of our proposed model on related\u0000company information extraction tasks. We also demonstrate that stock embeddings\u0000obtained from the proposed model perform better in creating thematic funds than\u0000those obtained from baseline methods, providing a promising pathway for various\u0000applications in the wealth management industry.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Constructing Mechanical Design Agent Based on Large Language Models 基于大型语言模型构建机械设计代理
Pub Date : 2024-08-04 DOI: arxiv-2408.02087
Jiaxing Lu, Heran Li, Fangwei Ning, Yixuan Wang, Xinze Li, Yan Shi
Since ancient times, mechanical design aids have been developed to assisthuman users, aimed at improving the efficiency and effectiveness of design.However, even with the widespread use of contemporary Computer-Aided Design(CAD) systems, there are still high learning costs, repetitive work, and otherchallenges. In recent years, the rise of Large Language Models (LLMs) hasintroduced new productivity opportunities to the field of mechanical design.Yet, it remains unrealistic to rely on LLMs alone to complete mechanical designtasks directly. Through a series of explorations, we propose a method forconstructing a comprehensive Mechanical Design Agent (MDA) by guiding LLMlearning. To verify the validity of our proposed method, we conducted a seriesof experiments and presented relevant cases.
自古以来,人们一直在开发机械设计辅助工具来帮助人类用户,旨在提高设计的效率和效果。然而,即使当代计算机辅助设计(CAD)系统得到了广泛应用,仍然存在学习成本高、重复性工作多等挑战。近年来,大型语言模型(LLM)的兴起为机械设计领域带来了新的生产力机遇,然而仅依靠 LLM 直接完成机械设计任务仍不现实。通过一系列探索,我们提出了一种通过引导 LLM 学习来构建综合机械设计代理(MDA)的方法。为了验证所提方法的有效性,我们进行了一系列实验,并展示了相关案例。
{"title":"Constructing Mechanical Design Agent Based on Large Language Models","authors":"Jiaxing Lu, Heran Li, Fangwei Ning, Yixuan Wang, Xinze Li, Yan Shi","doi":"arxiv-2408.02087","DOIUrl":"https://doi.org/arxiv-2408.02087","url":null,"abstract":"Since ancient times, mechanical design aids have been developed to assist\u0000human users, aimed at improving the efficiency and effectiveness of design.\u0000However, even with the widespread use of contemporary Computer-Aided Design\u0000(CAD) systems, there are still high learning costs, repetitive work, and other\u0000challenges. In recent years, the rise of Large Language Models (LLMs) has\u0000introduced new productivity opportunities to the field of mechanical design.\u0000Yet, it remains unrealistic to rely on LLMs alone to complete mechanical design\u0000tasks directly. Through a series of explorations, we propose a method for\u0000constructing a comprehensive Mechanical Design Agent (MDA) by guiding LLM\u0000learning. To verify the validity of our proposed method, we conducted a series\u0000of experiments and presented relevant cases.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bilateral Trade Flow Prediction by Gravity-informed Graph Auto-encoder 利用重力信息图自动编码器预测双边贸易往来
Pub Date : 2024-08-04 DOI: arxiv-2408.01938
Naoto Minakawa, Kiyoshi Izumi, Hiroki Sakaji
The gravity models has been studied to analyze interaction between twoobjects such as trade amount between a pair of countries, human migrationbetween a pair of countries and traffic flow between two cities. Particularlyin the international trade, predicting trade amount is instrumental to industryand government in business decision making and determining economic policies.Whereas the gravity models well captures such interaction between objects, themodel simplifies the interaction to extract essential relationships or needshandcrafted features to drive the models. Recent studies indicate theconnection between graph neural networks (GNNs) and the gravity models ininternational trade. However, to our best knowledge, hardly any previousstudies in the this domain directly predicts trade amount by GNNs. We proposeGGAE (Gravity-informed Graph Auto-encoder) and its surrogate model, which isinspired by the gravity model, showing trade amount prediction by the gravitymodel can be formulated as an edge weight prediction problem in GNNs and solvedby GGAE and its surrogate model. Furthermore, we conducted experiments toindicate GGAE with GNNs can improve trade amount prediction compared to thetraditional gravity model by considering complex relationships.
引力模型被用来分析两个对象之间的相互作用,如一对国家之间的贸易额、一对国家之间的人口迁移和两个城市之间的交通流量。虽然引力模型能很好地捕捉对象间的这种相互作用,但它们简化了相互作用以提取本质关系或需要人工特征来驱动模型。最近的研究表明,图神经网络(GNN)与引力模型在国际贸易中存在联系。然而,据我们所知,在这一领域几乎没有任何以往的研究能通过图神经网络直接预测贸易额。我们提出了受引力模型启发的 GGAE(Gravity-infformed Graph Auto-encoder,引力信息图自动编码器)及其代理模型,表明引力模型的贸易额预测可以表述为 GNN 中的边权重预测问题,并通过 GGAE 及其代理模型求解。此外,我们还通过实验证明,与传统的重力模型相比,GGAE 与 GNN 可以通过考虑复杂的关系改进贸易额预测。
{"title":"Bilateral Trade Flow Prediction by Gravity-informed Graph Auto-encoder","authors":"Naoto Minakawa, Kiyoshi Izumi, Hiroki Sakaji","doi":"arxiv-2408.01938","DOIUrl":"https://doi.org/arxiv-2408.01938","url":null,"abstract":"The gravity models has been studied to analyze interaction between two\u0000objects such as trade amount between a pair of countries, human migration\u0000between a pair of countries and traffic flow between two cities. Particularly\u0000in the international trade, predicting trade amount is instrumental to industry\u0000and government in business decision making and determining economic policies.\u0000Whereas the gravity models well captures such interaction between objects, the\u0000model simplifies the interaction to extract essential relationships or needs\u0000handcrafted features to drive the models. Recent studies indicate the\u0000connection between graph neural networks (GNNs) and the gravity models in\u0000international trade. However, to our best knowledge, hardly any previous\u0000studies in the this domain directly predicts trade amount by GNNs. We propose\u0000GGAE (Gravity-informed Graph Auto-encoder) and its surrogate model, which is\u0000inspired by the gravity model, showing trade amount prediction by the gravity\u0000model can be formulated as an edge weight prediction problem in GNNs and solved\u0000by GGAE and its surrogate model. Furthermore, we conducted experiments to\u0000indicate GGAE with GNNs can improve trade amount prediction compared to the\u0000traditional gravity model by considering complex relationships.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fed-RD: Privacy-Preserving Federated Learning for Financial Crime Detection Fed-RD:用于金融犯罪检测的隐私保护联合学习
Pub Date : 2024-08-03 DOI: arxiv-2408.01609
Md. Saikat Islam Khan, Aparna Gupta, Oshani Seneviratne, Stacy Patterson
We introduce Federated Learning for Relational Data (Fed-RD), a novelprivacy-preserving federated learning algorithm specifically developed forfinancial transaction datasets partitioned vertically and horizontally acrossparties. Fed-RD strategically employs differential privacy and securemultiparty computation to guarantee the privacy of training data. We providetheoretical analysis of the end-to-end privacy of the training algorithm andpresent experimental results on realistic synthetic datasets. Our resultsdemonstrate that Fed-RD achieves high model accuracy with minimal degradationas privacy increases, while consistently surpassing benchmark results.
我们介绍了关系数据联合学习(Fed-RD),这是一种新颖的隐私保护联合学习算法,专门针对跨方纵向和横向分割的金融交易数据集而开发。Fed-RD 战略性地采用了差分隐私和安全多方计算来保证训练数据的隐私性。我们对训练算法的端到端隐私进行了理论分析,并展示了在现实合成数据集上的实验结果。我们的结果表明,Fed-RD 能够实现较高的模型准确性,而且随着隐私程度的提高,模型准确性的下降幅度也很小,同时还能不断超越基准结果。
{"title":"Fed-RD: Privacy-Preserving Federated Learning for Financial Crime Detection","authors":"Md. Saikat Islam Khan, Aparna Gupta, Oshani Seneviratne, Stacy Patterson","doi":"arxiv-2408.01609","DOIUrl":"https://doi.org/arxiv-2408.01609","url":null,"abstract":"We introduce Federated Learning for Relational Data (Fed-RD), a novel\u0000privacy-preserving federated learning algorithm specifically developed for\u0000financial transaction datasets partitioned vertically and horizontally across\u0000parties. Fed-RD strategically employs differential privacy and secure\u0000multiparty computation to guarantee the privacy of training data. We provide\u0000theoretical analysis of the end-to-end privacy of the training algorithm and\u0000present experimental results on realistic synthetic datasets. Our results\u0000demonstrate that Fed-RD achieves high model accuracy with minimal degradation\u0000as privacy increases, while consistently surpassing benchmark results.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of Major Health Events on Pharmaceutical Stocks: A Comprehensive Analysis Using Macroeconomic and Market Indicators 重大健康事件对医药股的影响:利用宏观经济和市场指标进行综合分析
Pub Date : 2024-08-03 DOI: arxiv-2408.01883
Morteza Maleki, SeyedAli Ghahari
This study investigates the impact of significant health events onpharmaceutical stock performance, employing a comprehensive analysisincorporating macroeconomic and market indicators. Using Ordinary Least Squares(OLS) regression, we evaluate the effects of thirteen major health events since2000, including the Anthrax attacks, SARS outbreak, H1N1 pandemic, and COVID-19pandemic, on the pharmaceutical sector. The analysis covers different phases ofeach event beginning, peak, and ending to capture their temporal influence onstock prices. Our findings reveal distinct patterns in stock performance,driven by market reactions to the initial news, peak impact, and eventualresolution of these crises. We also examine scenarios with and without keymacroeconomic (MA) and market (MI) indicators to isolate their contributions.This detailed examination provides valuable insights for investors,policymakers, and stakeholders in understanding the interplay between majorhealth events and health market dynamics, guiding better decision-making duringfuture health related disruptions.
本研究通过对宏观经济和市场指标的综合分析,探讨了重大健康事件对医药股表现的影响。利用普通最小二乘法(OLS)回归,我们评估了 2000 年以来 13 次重大健康事件对医药行业的影响,包括炭疽袭击、SARS 爆发、H1N1 大流行和 COVID-19 大流行。分析涵盖了每个事件开始、高峰和结束的不同阶段,以捕捉其对股票价格的时间影响。我们的研究结果揭示了这些危机的初始消息、高峰影响和最终解决的市场反应所驱动的股票表现的独特模式。我们还研究了有关键宏观经济(MA)和市场(MI)指标和没有关键宏观经济(MA)和市场(MI)指标的情景,以区分它们的贡献。这种详细的研究为投资者、政策制定者和利益相关者了解重大健康事件和健康市场动态之间的相互作用提供了有价值的见解,从而指导他们在未来与健康相关的混乱中做出更好的决策。
{"title":"Impact of Major Health Events on Pharmaceutical Stocks: A Comprehensive Analysis Using Macroeconomic and Market Indicators","authors":"Morteza Maleki, SeyedAli Ghahari","doi":"arxiv-2408.01883","DOIUrl":"https://doi.org/arxiv-2408.01883","url":null,"abstract":"This study investigates the impact of significant health events on\u0000pharmaceutical stock performance, employing a comprehensive analysis\u0000incorporating macroeconomic and market indicators. Using Ordinary Least Squares\u0000(OLS) regression, we evaluate the effects of thirteen major health events since\u00002000, including the Anthrax attacks, SARS outbreak, H1N1 pandemic, and COVID-19\u0000pandemic, on the pharmaceutical sector. The analysis covers different phases of\u0000each event beginning, peak, and ending to capture their temporal influence on\u0000stock prices. Our findings reveal distinct patterns in stock performance,\u0000driven by market reactions to the initial news, peak impact, and eventual\u0000resolution of these crises. We also examine scenarios with and without key\u0000macroeconomic (MA) and market (MI) indicators to isolate their contributions.\u0000This detailed examination provides valuable insights for investors,\u0000policymakers, and stakeholders in understanding the interplay between major\u0000health events and health market dynamics, guiding better decision-making during\u0000future health related disruptions.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HRFT: Mining High-Frequency Risk Factor Collections End-to-End via Transformer HRFT:通过变压器端到端挖掘高频风险因素集合
Pub Date : 2024-08-02 DOI: arxiv-2408.01271
Wenyan Xu, Rundong Wang, Chen Li, Yonghong Hu, Zhonghua Lu
In quantitative trading, it is common to find patterns in short term volatiletrends of the market. These patterns are known as High Frequency (HF) riskfactors, serving as key indicators of future stock price volatility.Traditionally, these risk factors were generated by financial models relyingheavily on domain-specific knowledge manually added rather than extensivemarket data. Inspired by symbolic regression (SR), which infers mathematicallaws from data, we treat the extraction of formulaic risk factors fromhigh-frequency trading (HFT) market data as an SR task. In this paper, wechallenge the manual construction of risk factors and propose an end-to-endmethodology, Intraday Risk Factor Transformer (IRFT), to directly predictcomplete formulaic factors, including constants. We use a hybridsymbolic-numeric vocabulary where symbolic tokens represent operators/stockfeatures and numeric tokens represent constants. We train a Transformer modelon the HFT dataset to generate complete formulaic HF risk factors withoutrelying on a predefined skeleton of operators. It determines the general shapeof the stock volatility law up to a choice of constants. We refine thepredicted constants (a, b) using the Broyden Fletcher Goldfarb Shanno algorithm(BFGS) to mitigate non-linear issues. Compared to the 10 approaches in SRBench,a living benchmark for SR, IRFT gains a 30% excess investment return on theHS300 and SP500 datasets, with inference times orders of magnitude faster thantheirs in HF risk factor mining tasks.
在量化交易中,从市场短期波动趋势中发现规律是很常见的。这些模式被称为高频(HF)风险因子,是未来股价波动的关键指标。传统上,这些风险因子是由金融模型生成的,主要依赖于人工添加的特定领域知识,而不是广泛的市场数据。受到从数据中推断数学法则的符号回归(SR)的启发,我们将从高频交易(HFT)市场数据中提取公式化风险因子视为 SR 任务。在本文中,我们挑战了手动构建风险因子的方法,并提出了一种端到端的方法--日内风险因子转换器(IRFT),可直接预测包括常数在内的完整公式因子。我们使用符号-数字混合词汇,其中符号标记代表运算符/股票特征,数字标记代表常数。我们在 HFT 数据集上训练 Transformer 模型,以生成完整的公式化高频风险因子,而无需依赖预定义的运算符骨架。它确定了股票波动率规律的一般形状,直至常数的选择。我们使用 Broyden Fletcher Goldfarb Shanno 算法(BFGS)对预测常数(a、b)进行细化,以缓解非线性问题。与 SRBench(SR 的活基准)中的 10 种方法相比,IRFT 在 HS300 和 SP500 数据集上获得了 30% 的超额投资回报,其推理时间比它们在高频风险因素挖掘任务中的推理时间快了几个数量级。
{"title":"HRFT: Mining High-Frequency Risk Factor Collections End-to-End via Transformer","authors":"Wenyan Xu, Rundong Wang, Chen Li, Yonghong Hu, Zhonghua Lu","doi":"arxiv-2408.01271","DOIUrl":"https://doi.org/arxiv-2408.01271","url":null,"abstract":"In quantitative trading, it is common to find patterns in short term volatile\u0000trends of the market. These patterns are known as High Frequency (HF) risk\u0000factors, serving as key indicators of future stock price volatility.\u0000Traditionally, these risk factors were generated by financial models relying\u0000heavily on domain-specific knowledge manually added rather than extensive\u0000market data. Inspired by symbolic regression (SR), which infers mathematical\u0000laws from data, we treat the extraction of formulaic risk factors from\u0000high-frequency trading (HFT) market data as an SR task. In this paper, we\u0000challenge the manual construction of risk factors and propose an end-to-end\u0000methodology, Intraday Risk Factor Transformer (IRFT), to directly predict\u0000complete formulaic factors, including constants. We use a hybrid\u0000symbolic-numeric vocabulary where symbolic tokens represent operators/stock\u0000features and numeric tokens represent constants. We train a Transformer model\u0000on the HFT dataset to generate complete formulaic HF risk factors without\u0000relying on a predefined skeleton of operators. It determines the general shape\u0000of the stock volatility law up to a choice of constants. We refine the\u0000predicted constants (a, b) using the Broyden Fletcher Goldfarb Shanno algorithm\u0000(BFGS) to mitigate non-linear issues. Compared to the 10 approaches in SRBench,\u0000a living benchmark for SR, IRFT gains a 30% excess investment return on the\u0000HS300 and SP500 datasets, with inference times orders of magnitude faster than\u0000theirs in HF risk factor mining tasks.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A deep spatio-temporal attention model of dynamic functional network connectivity shows sensitivity to Alzheimer's in asymptomatic individuals 动态功能网络连接的深度时空注意力模型显示无症状个体对阿尔茨海默氏症的敏感性
Pub Date : 2024-08-01 DOI: arxiv-2408.00378
Yuxiang Wei, Anees Abrol, James Lah, Deqiang Qiu, Vince D. Calhoun
Alzheimer's disease (AD) progresses from asymptomatic changes to clinicalsymptoms, emphasizing the importance of early detection for proper treatment.Functional magnetic resonance imaging (fMRI), particularly dynamic functionalnetwork connectivity (dFNC), has emerged as an important biomarker for AD.Nevertheless, studies probing at-risk subjects in the pre-symptomatic stageusing dFNC are limited. To identify at-risk subjects and understand alterationsof dFNC in different stages, we leverage deep learning advancements andintroduce a transformer-convolution framework for predicting at-risk subjectsbased on dFNC, incorporating spatial-temporal self-attention to capture brainnetwork dependencies and temporal dynamics. Our model significantly outperformsother popular machine learning methods. By analyzing individuals with diagnosedAD and mild cognitive impairment (MCI), we studied the AD progression andobserved a higher similarity between MCI and asymptomatic AD. The interpretableanalysis highlights the cognitive-control network's diagnostic importance, withthe model focusing on intra-visual domain dFNC when predicting asymptomatic ADsubjects.
功能磁共振成像(fMRI),尤其是动态功能网络连接(dFNC),已成为阿尔茨海默病(AD)的重要生物标志物。然而,利用dFNC探测症状前阶段高危人群的研究非常有限。为了识别高危人群并了解不同阶段dFNC的变化,我们利用深度学习的进步,引入了一个基于dFNC的变压器-卷积框架来预测高危人群,并结合空间-时间自我关注来捕捉脑网络的依赖性和时间动态。我们的模型明显优于其他流行的机器学习方法。通过分析已确诊的AD和轻度认知障碍(MCI)患者,我们研究了AD的发展过程,发现MCI和无症状AD之间有更高的相似性。可解释的分析凸显了认知控制网络在诊断中的重要性,该模型在预测无症状AD受试者时侧重于视觉域内的dFNC。
{"title":"A deep spatio-temporal attention model of dynamic functional network connectivity shows sensitivity to Alzheimer's in asymptomatic individuals","authors":"Yuxiang Wei, Anees Abrol, James Lah, Deqiang Qiu, Vince D. Calhoun","doi":"arxiv-2408.00378","DOIUrl":"https://doi.org/arxiv-2408.00378","url":null,"abstract":"Alzheimer's disease (AD) progresses from asymptomatic changes to clinical\u0000symptoms, emphasizing the importance of early detection for proper treatment.\u0000Functional magnetic resonance imaging (fMRI), particularly dynamic functional\u0000network connectivity (dFNC), has emerged as an important biomarker for AD.\u0000Nevertheless, studies probing at-risk subjects in the pre-symptomatic stage\u0000using dFNC are limited. To identify at-risk subjects and understand alterations\u0000of dFNC in different stages, we leverage deep learning advancements and\u0000introduce a transformer-convolution framework for predicting at-risk subjects\u0000based on dFNC, incorporating spatial-temporal self-attention to capture brain\u0000network dependencies and temporal dynamics. Our model significantly outperforms\u0000other popular machine learning methods. By analyzing individuals with diagnosed\u0000AD and mild cognitive impairment (MCI), we studied the AD progression and\u0000observed a higher similarity between MCI and asymptomatic AD. The interpretable\u0000analysis highlights the cognitive-control network's diagnostic importance, with\u0000the model focusing on intra-visual domain dFNC when predicting asymptomatic AD\u0000subjects.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141884035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiscale topology optimization of functionally graded lattice structures based on physics-augmented neural network material models 基于物理增强神经网络材料模型的功能分级晶格结构的多尺度拓扑优化
Pub Date : 2024-08-01 DOI: arxiv-2408.00510
Jonathan Stollberg, Tarun Gangwar, Oliver Weeger, Dominik Schillinger
We present a new framework for the simultaneous optimiziation of both thetopology as well as the relative density grading of cellular structures andmaterials, also known as lattices. Due to manufacturing constraints, theoptimization problem falls into the class of NP-complete mixed-integernonlinear programming problems. To tackle this difficulty, we obtain a relaxedproblem from a multiplicative split of the relative density and a penalizationapproach. The sensitivities of the objective function are derived such that anygradient-based solver might be applied for the iterative update of the designvariables. In a next step, we introduce a material model that is parametric inthe design variables of interest and suitable to describe the isotropicdeformation behavior of quasi-stochastic lattices. For that, we derive andimplement further physical constraints and enhance a physics-augmented neuralnetwork from the literature that was formulated initially for rhombicmaterials. Finally, to illustrate the applicability of the method, weincorporate the material model into our computational framework and exemplaryoptimize two-and three-dimensional benchmark structures as well as a complexaircraft component.
我们提出了一种新的框架,用于同时优化蜂窝结构和材料(也称为晶格)的拓扑结构和相对密度分级。由于制造限制,优化问题属于 NP-complete(NP-complete)混合非线性编程问题。为了解决这一难题,我们从相对密度的乘法分割和惩罚方法中得到了一个宽松的问题。目标函数的敏感性被推导出来,因此任何基于梯度的求解器都可以用于设计变量的迭代更新。下一步,我们将引入一种材料模型,该模型与相关设计变量参数化,适用于描述准随机晶格的各向同性变形行为。为此,我们进一步推导并实施了物理约束,并增强了文献中最初针对菱形材料制定的物理增强神经网络。最后,为了说明该方法的适用性,我们将材料模型纳入计算框架,并对二维和三维基准结构以及一个复杂的飞机部件进行了示范性优化。
{"title":"Multiscale topology optimization of functionally graded lattice structures based on physics-augmented neural network material models","authors":"Jonathan Stollberg, Tarun Gangwar, Oliver Weeger, Dominik Schillinger","doi":"arxiv-2408.00510","DOIUrl":"https://doi.org/arxiv-2408.00510","url":null,"abstract":"We present a new framework for the simultaneous optimiziation of both the\u0000topology as well as the relative density grading of cellular structures and\u0000materials, also known as lattices. Due to manufacturing constraints, the\u0000optimization problem falls into the class of NP-complete mixed-integer\u0000nonlinear programming problems. To tackle this difficulty, we obtain a relaxed\u0000problem from a multiplicative split of the relative density and a penalization\u0000approach. The sensitivities of the objective function are derived such that any\u0000gradient-based solver might be applied for the iterative update of the design\u0000variables. In a next step, we introduce a material model that is parametric in\u0000the design variables of interest and suitable to describe the isotropic\u0000deformation behavior of quasi-stochastic lattices. For that, we derive and\u0000implement further physical constraints and enhance a physics-augmented neural\u0000network from the literature that was formulated initially for rhombic\u0000materials. Finally, to illustrate the applicability of the method, we\u0000incorporate the material model into our computational framework and exemplary\u0000optimize two-and three-dimensional benchmark structures as well as a complex\u0000aircraft component.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141884034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online Prediction-Assisted Safe Reinforcement Learning for Electric Vehicle Charging Station Recommendation in Dynamically Coupled Transportation-Power Systems 在线预测辅助安全强化学习用于动态耦合交通-电力系统中的电动汽车充电站推荐
Pub Date : 2024-07-30 DOI: arxiv-2407.20679
Qionghua Liao, Guilong Li, Jiajie Yu, Ziyuan Gu, Wei Ma
With the proliferation of electric vehicles (EVs), the transportation networkand power grid become increasingly interdependent and coupled via chargingstations. The concomitant growth in charging demand has posed challenges forboth networks, highlighting the importance of charging coordination. Existingliterature largely overlooks the interactions between power grid security andtraffic efficiency. In view of this, we study the en-route charging station(CS) recommendation problem for EVs in dynamically coupled transportation-powersystems. The system-level objective is to maximize the overall trafficefficiency while ensuring the safety of the power grid. This problem is for thefirst time formulated as a constrained Markov decision process (CMDP), and anonline prediction-assisted safe reinforcement learning (OP-SRL) method isproposed to learn the optimal and secure policy by extending the PPO method. Tobe specific, we mainly address two challenges. First, the constrainedoptimization problem is converted into an equivalent unconstrained optimizationproblem by applying the Lagrangian method. Second, to account for the uncertainlong-time delay between performing CS recommendation and commencing charging,we put forward an online sequence-to-sequence (Seq2Seq) predictor for stateaugmentation to guide the agent in making forward-thinking decisions. Finally,we conduct comprehensive experimental studies based on the Nguyen-Dupuisnetwork and a large-scale real-world road network, coupled with IEEE 33-bus andIEEE 69-bus distribution systems, respectively. Results demonstrate that theproposed method outperforms baselines in terms of road network efficiency,power grid safety, and EV user satisfaction. The case study on the real-worldnetwork also illustrates the applicability in the practical context.
随着电动汽车(EV)的普及,交通网络和电网变得越来越相互依赖,并通过充电站相互连接。随之而来的充电需求增长给两个网络都带来了挑战,凸显了充电协调的重要性。现有文献在很大程度上忽视了电网安全与交通效率之间的相互作用。有鉴于此,我们研究了动态耦合交通-电力系统中的电动汽车途中充电站(CS)推荐问题。系统级目标是在确保电网安全的前提下最大化整体交通效率。该问题首次被表述为受约束马尔可夫决策过程(CMDP),并提出了一种在线预测辅助安全强化学习(OP-SRL)方法,通过扩展 PPO 方法来学习最优安全策略。具体来说,我们主要解决了两个难题。首先,通过应用拉格朗日方法,将有约束优化问题转换为等效的无约束优化问题。其次,考虑到从执行 CS 推荐到开始充电之间不确定的长时间延迟,我们提出了一个在线序列到序列(Seq2Seq)预测器来进行状态增强,以指导代理做出前瞻性决策。最后,我们基于 Nguyen-Dupuis 网络和大规模真实路网,分别结合 IEEE 33 总线和 IEEE 69 总线配电系统,进行了全面的实验研究。结果表明,所提出的方法在路网效率、电网安全和电动汽车用户满意度方面都优于基准方法。对现实世界网络的案例研究也说明了该方法在实际环境中的适用性。
{"title":"Online Prediction-Assisted Safe Reinforcement Learning for Electric Vehicle Charging Station Recommendation in Dynamically Coupled Transportation-Power Systems","authors":"Qionghua Liao, Guilong Li, Jiajie Yu, Ziyuan Gu, Wei Ma","doi":"arxiv-2407.20679","DOIUrl":"https://doi.org/arxiv-2407.20679","url":null,"abstract":"With the proliferation of electric vehicles (EVs), the transportation network\u0000and power grid become increasingly interdependent and coupled via charging\u0000stations. The concomitant growth in charging demand has posed challenges for\u0000both networks, highlighting the importance of charging coordination. Existing\u0000literature largely overlooks the interactions between power grid security and\u0000traffic efficiency. In view of this, we study the en-route charging station\u0000(CS) recommendation problem for EVs in dynamically coupled transportation-power\u0000systems. The system-level objective is to maximize the overall traffic\u0000efficiency while ensuring the safety of the power grid. This problem is for the\u0000first time formulated as a constrained Markov decision process (CMDP), and an\u0000online prediction-assisted safe reinforcement learning (OP-SRL) method is\u0000proposed to learn the optimal and secure policy by extending the PPO method. To\u0000be specific, we mainly address two challenges. First, the constrained\u0000optimization problem is converted into an equivalent unconstrained optimization\u0000problem by applying the Lagrangian method. Second, to account for the uncertain\u0000long-time delay between performing CS recommendation and commencing charging,\u0000we put forward an online sequence-to-sequence (Seq2Seq) predictor for state\u0000augmentation to guide the agent in making forward-thinking decisions. Finally,\u0000we conduct comprehensive experimental studies based on the Nguyen-Dupuis\u0000network and a large-scale real-world road network, coupled with IEEE 33-bus and\u0000IEEE 69-bus distribution systems, respectively. Results demonstrate that the\u0000proposed method outperforms baselines in terms of road network efficiency,\u0000power grid safety, and EV user satisfaction. The case study on the real-world\u0000network also illustrates the applicability in the practical context.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141863329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Computational Engineering, Finance, and Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1