Proceedings of the Third ACM International Conference on AI in Finance最新文献

英文中文

Addressing Non-Stationarity in FX Trading with Online Model Selection of Offline RL Experts 利用离线RL专家的在线模型选择解决外汇交易中的非平稳性问题

Proceedings of the Third ACM International Conference on AI in Finance

Pub Date : 2022-10-26 DOI: 10.1145/3533271.3561780

Antonio Riva, L. Bisi, P. Liotet, Luca Sabbioni, Edoardo Vittori, Marco Pinciroli, Michele Trapletti, Marcello Restelli

Reinforcement learning has proven to be successful in obtaining profitable trading policies; however, the effectiveness of such strategies is strongly conditioned to market stationarity. This hypothesis is challenged by the regime switches frequently experienced by practitioners; thus, when many models are available, validation may become a difficult task. We propose to overcome the issue by explicitly modeling the trading task as a non-stationary reinforcement learning problem. Nevertheless, state-of-the-art RL algorithms for this setting usually require task distribution or dynamics to be predictable, an assumption that can hardly be true in the financial framework. In this work, we propose, instead, a method for the dynamic selection of the best RL agent which is only driven by profit performance. Our modular two-layer approach allows choosing the best strategy among a set of RL models through an online-learning algorithm. While we could select any combination of algorithms in principle, our solution employs two state-of-the-art algorithms: Fitted Q-Iteration (FQI) for the RL layer and Optimistic Adapt ML-Prod (OAMP) for the online learning one. The proposed approach is tested on two simulated FX trading tasks, using actual historical data for the AUS/USD and GBP/USD currency pairs.

强化学习已被证明在获得有利可图的交易策略方面是成功的;然而，这种策略的有效性在很大程度上取决于市场的平稳性。这一假设受到从业人员经常经历的制度转换的挑战;因此，当有许多模型可用时，验证可能成为一项困难的任务。我们建议通过明确地将交易任务建模为非平稳强化学习问题来克服这个问题。然而，用于这种设置的最先进的强化学习算法通常要求任务分配或动态是可预测的，这一假设在金融框架中几乎不可能成立。在这项工作中，我们提出了一种动态选择最佳RL代理的方法，该方法仅由利润表现驱动。我们的模块化两层方法允许通过在线学习算法在一组强化学习模型中选择最佳策略。虽然我们原则上可以选择任何算法组合，但我们的解决方案采用了两种最先进的算法:用于强化学习层的拟合q -迭代(FQI)和用于在线学习层的乐观适应ML-Prod (OAMP)。采用澳元/美元和英镑/美元货币对的实际历史数据，在两个模拟外汇交易任务中测试了所提出的方法。

{"title":"Addressing Non-Stationarity in FX Trading with Online Model Selection of Offline RL Experts","authors":"Antonio Riva, L. Bisi, P. Liotet, Luca Sabbioni, Edoardo Vittori, Marco Pinciroli, Michele Trapletti, Marcello Restelli","doi":"10.1145/3533271.3561780","DOIUrl":"https://doi.org/10.1145/3533271.3561780","url":null,"abstract":"Reinforcement learning has proven to be successful in obtaining profitable trading policies; however, the effectiveness of such strategies is strongly conditioned to market stationarity. This hypothesis is challenged by the regime switches frequently experienced by practitioners; thus, when many models are available, validation may become a difficult task. We propose to overcome the issue by explicitly modeling the trading task as a non-stationary reinforcement learning problem. Nevertheless, state-of-the-art RL algorithms for this setting usually require task distribution or dynamics to be predictable, an assumption that can hardly be true in the financial framework. In this work, we propose, instead, a method for the dynamic selection of the best RL agent which is only driven by profit performance. Our modular two-layer approach allows choosing the best strategy among a set of RL models through an online-learning algorithm. While we could select any combination of algorithms in principle, our solution employs two state-of-the-art algorithms: Fitted Q-Iteration (FQI) for the RL layer and Optimistic Adapt ML-Prod (OAMP) for the online learning one. The proposed approach is tested on two simulated FX trading tasks, using actual historical data for the AUS/USD and GBP/USD currency pairs.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116014066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Objective Driven Portfolio Construction Using Reinforcement Learning 基于强化学习的目标驱动组合构建

Proceedings of the Third ACM International Conference on AI in Finance

Pub Date : 2022-10-26 DOI: 10.1145/3533271.3561764

Tina Wang, Jithin Pradeep, Jerry Zikun Chen

Recent advancement in reinforcement learning has enabled robust data-driven direct optimization on the investor’s objectives without estimating the stock movements as in the traditional two-step approach [8]. Given diverse investment styles, a single trading strategy cannot serve different investor objectives. We propose an objective function formulation to augment the direct optimization approach in AlphaPortfolio (Cong et al. [6]). In addition to simple baseline Sharpe ratio used in AlphaPortfolio, we add three investor’s objectives for (i) achieving excess alpha by maximizing the information ratio; (ii) mitigating downside risks through optimizing maximum drawdown-adjusted return; and (iii) reducing transaction costs via restricting the turnover rate. We also introduce four new features: momentum, short-term reversal, drawdown, and maximum drawdown to the framework. Our objective function formulation allows for controlling the trade-off between both maximum drawdown and turnover with respect to realized return, creating flexible trading strategies for various risk appetites. The maximum drawdown efficient frontier curve, derived using a range of values of hyper-parameter α, reflects the similar concave relationship as observed in the theoretical study by Chekhlov et al. [5]. To improve the interpretability of the deep neural network and drive insights into traditional factor investment, we further explore the drivers that contribute to the top and bottom performing firms by running regression analysis using Random Forest, which achieves R2 of approximately 0.8 in producing the same winner scores as our model. Finally, to uncover the balance between profits and diversification, we investigate the impact of the trading size on strategy behaviors.

强化学习的最新进展已经实现了对投资者目标的稳健数据驱动的直接优化，而无需像传统的两步方法那样估计股票走势。考虑到多样化的投资风格，单一的交易策略无法满足不同投资者的目标。我们提出了一个目标函数公式来增强AlphaPortfolio中的直接优化方法(Cong et al.[6])。除了在AlphaPortfolio中使用的简单基准夏普比率之外，我们还增加了三个投资者的目标:(i)通过最大化信息比率来实现超额阿尔法;(ii)通过优化最大回调收益来降低下行风险;(三)通过限制换手率降低交易成本。我们还向框架引入了四个新特性:动量、短期反转、回调和最大回调。我们的目标函数公式允许在实现回报方面控制最大回撤和营业额之间的权衡，为各种风险偏好创建灵活的交易策略。利用超参数α值范围推导出的最大降压有效边界曲线，反映了Chekhlov等人在理论研究中观察到的类似凹关系。为了提高深度神经网络的可解释性并推动对传统要素投资的见解，我们通过使用随机森林(Random Forest)进行回归分析，进一步探索了对表现最好和最差的公司做出贡献的驱动因素，在产生与我们的模型相同的赢家得分时，其R2约为0.8。最后，为了揭示利润与多元化之间的平衡，我们研究了交易规模对策略行为的影响。

{"title":"Objective Driven Portfolio Construction Using Reinforcement Learning","authors":"Tina Wang, Jithin Pradeep, Jerry Zikun Chen","doi":"10.1145/3533271.3561764","DOIUrl":"https://doi.org/10.1145/3533271.3561764","url":null,"abstract":"Recent advancement in reinforcement learning has enabled robust data-driven direct optimization on the investor’s objectives without estimating the stock movements as in the traditional two-step approach [8]. Given diverse investment styles, a single trading strategy cannot serve different investor objectives. We propose an objective function formulation to augment the direct optimization approach in AlphaPortfolio (Cong et al. [6]). In addition to simple baseline Sharpe ratio used in AlphaPortfolio, we add three investor’s objectives for (i) achieving excess alpha by maximizing the information ratio; (ii) mitigating downside risks through optimizing maximum drawdown-adjusted return; and (iii) reducing transaction costs via restricting the turnover rate. We also introduce four new features: momentum, short-term reversal, drawdown, and maximum drawdown to the framework. Our objective function formulation allows for controlling the trade-off between both maximum drawdown and turnover with respect to realized return, creating flexible trading strategies for various risk appetites. The maximum drawdown efficient frontier curve, derived using a range of values of hyper-parameter α, reflects the similar concave relationship as observed in the theoretical study by Chekhlov et al. [5]. To improve the interpretability of the deep neural network and drive insights into traditional factor investment, we further explore the drivers that contribute to the top and bottom performing firms by running regression analysis using Random Forest, which achieves R2 of approximately 0.8 in producing the same winner scores as our model. Finally, to uncover the balance between profits and diversification, we investigate the impact of the trading size on strategy behaviors.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116043788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Understanding Counterfactual Generation using Maximum Mean Discrepancy 利用最大平均差异理解反事实生成

Proceedings of the Third ACM International Conference on AI in Finance

Pub Date : 2022-10-26 DOI: 10.1145/3533271.3561759

Wei Zhang, Brian Barr, J. Paisley

With the dramatic development of deep learning in the past decade, interpretability has been one of the most important challenges that often prevents neural networks from being applied to fields such as finance. Among many existing explainable analyses, counterfactual generation has become widely used for understanding neural networks and making tailored recommendations. However, few studies are devoted to providing quantitative measures for evaluating counterfactuals. In this paper, we propose a quantitative approach based on maximum mean discrepancy (MMD). We employ several existing counterfactual methods to demonstrate this proposed method on the MNIST image data set and two tabular financial data sets, Lending Club (LCD) and Give Me Some Credit (GMC). The results demonstrate the potential usefulness as well as the simplicity of the proposed method.

随着过去十年深度学习的迅猛发展，可解释性一直是阻碍神经网络应用于金融等领域的最重要挑战之一。在许多现有的可解释分析中，反事实生成已被广泛用于理解神经网络并提出量身定制的建议。然而，很少有研究致力于提供定量的方法来评估反事实。在本文中，我们提出了一种基于最大平均差异(MMD)的定量方法。我们采用了几种现有的反事实方法，在MNIST图像数据集和两个表格金融数据集，Lending Club (LCD)和Give Me Some Credit (GMC)上验证了该方法。结果表明，该方法具有潜在的实用性和简单性。

引用次数: 1

Incentivising Market Making in Financial Markets 激励金融市场的做市行为

Proceedings of the Third ACM International Conference on AI in Finance

Pub Date : 2022-10-26 DOI: 10.1145/3533271.3561706

Ji Qi, Carmine Ventre

In their pursue for profit, market makers contribute liquidity and thus play a fundamental role for the health of financial markets. The mechanism used to rank bids and asks in order-driven markets can influence trader behaviour and discourage market making, with obvious consequences on market fundamentals. This is the rationale behind market trading mechanisms, which assign weight to both the spread of two-sided orders and order prices. In this work, we assess the effectiveness of this proposal from a game-theoretic standpoint. We use strategic agents and explicitly define a utility function that treats the probability of a trader becoming a market maker as a pure strategy. We then employ empirical game-theoretic analysis to analyse the market at equilibrium; we illustrate the strategic responses to different setups of the matching mechanisms, how agents are incentivised to become market makers, agent behaviour and market states. Our analysis shows that this spread-based priority works well to reduce market volatility and maintain trading volume, provided that an appropriate setting is used, which weighs spread ranking and price ranking .

在追求利润的过程中，做市商提供流动性，因此对金融市场的健康发挥着根本作用。在订单驱动的市场中，用于对买入价和卖出价进行排序的机制可能会影响交易员的行为，抑制做市行为，对市场基本面产生明显影响。这是市场交易机制背后的基本原理，该机制为双边订单价差和订单价格赋予权重。在这项工作中，我们从博弈论的角度评估了这一建议的有效性。我们使用战略代理并明确定义了一个效用函数，该函数将交易者成为做市商的概率视为纯粹的策略。然后，我们运用经验博弈论分析来分析均衡状态下的市场;我们说明了对不同匹配机制设置的战略反应，如何激励代理人成为做市商，代理人行为和市场状态。我们的分析表明，这种基于价差的优先级可以很好地减少市场波动并保持交易量，只要使用适当的设置，即权衡价差排名和价格排名。

引用次数: 0

Adversarial Fraud Generation for Improved Detection 改进检测的对抗性欺诈生成

Proceedings of the Third ACM International Conference on AI in Finance

Pub Date : 2022-10-26 DOI: 10.1145/3533271.3561723

Anubha Pandey, Alekhya Bhatraju, Shiv Markam, Deepak L. Bhatt

Generative Adversarial Networks (GANs) are known for their ability to learn data distribution and hence exist as a suitable alternative to handle class imbalance through oversampling. However, it still fails to capture the diversity of the minority class owing to their limited representation, for example, frauds in our study. Particularly the fraudulent patterns closer to the class boundary get missed by the model. This paper proposes using GANs to simulate fraud transaction patterns conditioned on genuine transactions, thereby enabling the model to learn a translation function between both spaces. Further to synthesize fraudulent samples from the class boundary, we trained GANs using losses inspired by data poisoning attack literature and discussed their efficacy in improving fraud detection classifier performance. The efficacy of our proposed framework is demonstrated through experimental results on the publicly available European Credit-Card Dataset and CIS Fraud Dataset.

生成对抗网络(GANs)以其学习数据分布的能力而闻名，因此作为通过过采样处理类不平衡的合适替代方案而存在。然而，由于少数族裔的代表性有限，例如我们研究中的欺诈行为，它仍然未能捕捉到少数族裔的多样性。特别是靠近类边界的欺骗性模式会被模型忽略。本文提出使用gan模拟以真实交易为条件的欺诈交易模式，从而使模型能够学习两个空间之间的转换函数。为了进一步从类边界合成欺诈样本，我们使用数据中毒攻击文献启发的损失来训练gan，并讨论了它们在提高欺诈检测分类器性能方面的效果。通过在公开可用的欧洲信用卡数据集和CIS欺诈数据集上的实验结果证明了我们提出的框架的有效性。

引用次数: 0

Graph and tensor-train recurrent neural networks for high-dimensional models of limit order books 高维极限序书模型的图和张量训练递归神经网络

Proceedings of the Third ACM International Conference on AI in Finance

Pub Date : 2022-10-26 DOI: 10.1145/3533271.3561710

Jacobo Roa-Vicens, Y. Xu, Ricardo Silva, D. Mandic

Recurrent neural networks (RNNs) have proven to be particularly effective for the paradigms of learning and modelling time series. However, sequential data of high dimensions are considerably more difficult and computationally expensive to model, as the number of parameters required to train the RNN grows exponentially with data dimensionality. This is also the case with time series from limit order books, the electronic registries where prices of securities are formed in public markets. To this end, tensorization of neural networks provides an efficient method to reduce the number of model parameters, and has been applied successfully to high-dimensional series such as video sequences and financial time series, for example, using tensor-train RNNs (TTRNNs). However, such TTRNNs suffer from a number of shortcomings, including: (i) model sensitivity to the ordering of core tensor contractions; (ii) training sensitivity to weight initialization; and (iii) exploding or vanishing gradient problems due to the recurrent propagation through the tensor-train topology. Recent studies showed that embedding a multi-linear graph filter to model RNN states (Recurrent Graph Tensor Network, RGTN) provides enhanced flexibility and expressive power to tensor networks, while mitigating the shortcomings of TTRNNs. In this paper, we demonstrate the advantages arising from the use of graph filters to model limit order book sequences of high dimension as compared with the state-of-the-art benchmarks. It is shown that the combination of the graph module (to mitigate problematic gradients) with the radial structure (to make the tensor network architecture flexible) results in substantial improvements in output variance, training time and number of parameters required, without any sacrifice in accuracy.

递归神经网络(RNNs)已被证明对时间序列的学习和建模范式特别有效。然而，由于训练RNN所需的参数数量随着数据维数呈指数级增长，高维的序列数据的建模难度和计算成本要高得多。这也适用于限价订单的时间序列，限价订单是在公开市场上形成证券价格的电子登记。为此，神经网络的张张化提供了一种有效的方法来减少模型参数的数量，并且已经成功地应用于高维序列，例如视频序列和金融时间序列，例如使用张量训练rnn (TTRNNs)。然而，这种ttrnn存在许多缺点，包括:(i)模型对核心张量收缩排序的敏感性;(ii)训练对权重初始化的敏感性;(3)由于张量列拓扑结构的循环传播而引起的梯度爆炸或消失问题。最近的研究表明，嵌入一个多线性图滤波器来建模RNN状态(Recurrent graph Tensor Network, RGTN)为张量网络提供了增强的灵活性和表达能力，同时减轻了ttrnn的缺点。在本文中，我们展示了与最先进的基准相比，使用图滤波器对高维极限订单序列建模所产生的优势。结果表明，图模块(缓解有问题的梯度)与径向结构(使张量网络架构灵活)的结合在输出方差、训练时间和所需参数数量方面有了很大的改善，而精度没有任何牺牲。

{"title":"Graph and tensor-train recurrent neural networks for high-dimensional models of limit order books","authors":"Jacobo Roa-Vicens, Y. Xu, Ricardo Silva, D. Mandic","doi":"10.1145/3533271.3561710","DOIUrl":"https://doi.org/10.1145/3533271.3561710","url":null,"abstract":"Recurrent neural networks (RNNs) have proven to be particularly effective for the paradigms of learning and modelling time series. However, sequential data of high dimensions are considerably more difficult and computationally expensive to model, as the number of parameters required to train the RNN grows exponentially with data dimensionality. This is also the case with time series from limit order books, the electronic registries where prices of securities are formed in public markets. To this end, tensorization of neural networks provides an efficient method to reduce the number of model parameters, and has been applied successfully to high-dimensional series such as video sequences and financial time series, for example, using tensor-train RNNs (TTRNNs). However, such TTRNNs suffer from a number of shortcomings, including: (i) model sensitivity to the ordering of core tensor contractions; (ii) training sensitivity to weight initialization; and (iii) exploding or vanishing gradient problems due to the recurrent propagation through the tensor-train topology. Recent studies showed that embedding a multi-linear graph filter to model RNN states (Recurrent Graph Tensor Network, RGTN) provides enhanced flexibility and expressive power to tensor networks, while mitigating the shortcomings of TTRNNs. In this paper, we demonstrate the advantages arising from the use of graph filters to model limit order book sequences of high dimension as compared with the state-of-the-art benchmarks. It is shown that the combination of the graph module (to mitigate problematic gradients) with the radial structure (to make the tensor network architecture flexible) results in substantial improvements in output variance, training time and number of parameters required, without any sacrifice in accuracy.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122342126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Sequential Banking Products Recommendation and User Profiling in One Go 顺序银行产品推荐和用户分析在一个去

Proceedings of the Third ACM International Conference on AI in Finance

Pub Date : 2022-10-26 DOI: 10.1145/3533271.3561697

Alexandre Boulenger, David C. Liu, George Philippe Farajalla

How can banks recommend relevant banking products such as debit, credit cards or term deposits, as well as learn a rich user representation for segmentation and user profiling, all via a single model? We present a sequence-to-item recommendation framework that uses a novel input data representation, accounting for the sequential and temporal context of both item ownership and user metadata, fed to a multi-head self-attentive encoder. We assess the performance of our model on the largest publicly available banking product recommendation dataset. Our model achieves 98.9% Precision@1 and 40.2% Precision@5, outperforming a state-of-the-art model as well as a common XGBoost-based baseline model tailored for this dataset and a system reportedly employed in industry for this task. Next, using the encoder embedding we obtain a continuous representation of users and their past product behavior. We demonstrate, in a case study, that this representation can be used for user segmentation and profiling, both critical to decision-making in organizations; for example, in designing and differentiating value propositions. The proposed approach is more inclusive and objective than the traditional ones employed by banks. With this work, we expose the benefits of employing a recommendation model based on self-attention in a real-world setting. The continuous user representation learned can yield far more impact than individual user-level recommendations. Both the proposed model and approach to segmentation and profiling are also applicable in other industries, beyond banking.

银行如何通过单一模型推荐相关的银行产品，如借记卡、信用卡或定期存款，以及学习丰富的用户表示进行细分和用户分析?我们提出了一个序列到项目的推荐框架，该框架使用了一种新的输入数据表示，考虑了项目所有权和用户元数据的顺序和时间上下文，并将其提供给多头自关注编码器。我们在最大的公开可用的银行产品推荐数据集上评估我们的模型的性能。我们的模型达到98.9% Precision@1和40.2% Precision@5，优于最先进的模型，以及为该数据集量身定制的基于xgboost的通用基线模型和据报道在工业中用于此任务的系统。接下来，使用编码器嵌入，我们获得了用户及其过去产品行为的连续表示。我们在一个案例研究中证明，这种表示可以用于用户细分和分析，这对组织中的决策都至关重要;例如，在设计和区分价值主张时。拟议中的方法比银行采用的传统方法更具包容性和客观性。通过这项工作，我们揭示了在现实世界中使用基于自我关注的推荐模型的好处。不断学习的用户表示比单个用户级别的推荐产生更大的影响。所提出的分割和分析的模型和方法也适用于银行业以外的其他行业。

{"title":"Sequential Banking Products Recommendation and User Profiling in One Go","authors":"Alexandre Boulenger, David C. Liu, George Philippe Farajalla","doi":"10.1145/3533271.3561697","DOIUrl":"https://doi.org/10.1145/3533271.3561697","url":null,"abstract":"How can banks recommend relevant banking products such as debit, credit cards or term deposits, as well as learn a rich user representation for segmentation and user profiling, all via a single model? We present a sequence-to-item recommendation framework that uses a novel input data representation, accounting for the sequential and temporal context of both item ownership and user metadata, fed to a multi-head self-attentive encoder. We assess the performance of our model on the largest publicly available banking product recommendation dataset. Our model achieves 98.9% Precision@1 and 40.2% Precision@5, outperforming a state-of-the-art model as well as a common XGBoost-based baseline model tailored for this dataset and a system reportedly employed in industry for this task. Next, using the encoder embedding we obtain a continuous representation of users and their past product behavior. We demonstrate, in a case study, that this representation can be used for user segmentation and profiling, both critical to decision-making in organizations; for example, in designing and differentiating value propositions. The proposed approach is more inclusive and objective than the traditional ones employed by banks. With this work, we expose the benefits of employing a recommendation model based on self-attention in a real-world setting. The continuous user representation learned can yield far more impact than individual user-level recommendations. Both the proposed model and approach to segmentation and profiling are also applicable in other industries, beyond banking.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"518 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123103728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Cost-Efficient Reinforcement Learning for Optimal Trade Execution on Dynamic Market Environment 动态市场环境下最优交易执行的成本-效率强化学习

Proceedings of the Third ACM International Conference on AI in Finance

Pub Date : 2022-10-26 DOI: 10.1145/3533271.3561761

Di Chen, Yada Zhu, Miao Liu, Jianbo Li

Learning a high-performance trade execution model via reinforcement learning (RL) requires interaction with the real dynamic market. However, the massive interactions required by direct RL would result in a significant training overhead. In this paper, we propose a cost-efficient reinforcement learning (RL) approach called Deep Dyna-Double Q-learning (D3Q), which integrates deep reinforcement learning and planning to reduce the training overhead while improving the trading performance. Specifically, D3Q includes a learnable market environment model, which approximates the market impact using real market experience, to enhance policy learning via the learned environment. Meanwhile, we propose a novel state-balanced exploration scheme to solve the exploration bias caused by the non-increasing residual inventory during the trade execution to accelerate model learning. As demonstrated by our extensive experiments, the proposed D3Q framework significantly increases sample efficiency and outperforms state-of-the-art methods on average trading cost as well.

通过强化学习(RL)学习高性能的交易执行模型需要与真实的动态市场进行交互。然而，直接强化学习所需的大量交互将导致大量的训练开销。在本文中，我们提出了一种成本高效的强化学习(RL)方法，称为深度动力-双q学习(D3Q)，它将深度强化学习和计划相结合，以减少训练开销，同时提高交易性能。具体而言，D3Q包括一个可学习的市场环境模型，该模型使用真实的市场经验来近似市场影响，以增强通过学习环境的政策学习。同时，我们提出了一种新的状态平衡探索方案来解决交易执行过程中由于剩余库存不增加而导致的探索偏差，以加速模型的学习。正如我们广泛的实验所证明的那样，所提出的D3Q框架显着提高了样本效率，并且在平均交易成本方面优于最先进的方法。

引用次数: 0

Asymmetric Autoencoders for Factor-Based Covariance Matrix Estimation 基于因子协方差矩阵估计的非对称自编码器

Proceedings of the Third ACM International Conference on AI in Finance

Pub Date : 2022-10-26 DOI: 10.1145/3533271.3561715

Kevin Huynh, Gregor Lenhard

Estimating high dimensional covariance matrices for portfolio optimization is challenging because the number of parameters to be estimated grows quadratically in the number of assets. When the matrix dimension exceeds the sample size, the sample covariance matrix becomes singular. A possible solution is to impose a (latent) factor structure for the cross-section of asset returns as in the popular capital asset pricing model. Recent research suggests dimension reduction techniques to estimate the factors in a data-driven fashion. We present an asymmetric autoencoder neural network-based estimator that incorporates the factor structure in its architecture and jointly estimates the factors and their loadings. We test our method against well established dimension reduction techniques from the literature and compare them to observable factors as benchmark in an empirical experiment using stock returns of the past five decades. Results show that the proposed estimator is very competitive, as it significantly outperforms the benchmark across most scenarios. Analyzing the loadings, we find that the constructed factors are related to the stocks’ sector classification.

估计用于投资组合优化的高维协方差矩阵具有挑战性，因为要估计的参数数量随资产数量呈二次增长。当矩阵维数超过样本量时，样本协方差矩阵变为奇异。一种可能的解决方案是像流行的资本资产定价模型一样，对资产回报的横截面施加(潜在)因素结构。最近的研究建议采用降维技术，以数据驱动的方式估计因素。我们提出了一种基于非对称自编码器神经网络的估计器，该估计器将因子结构纳入其体系结构，并联合估计因子及其负载。我们将我们的方法与文献中完善的降维技术进行比较，并将其与可观察因素作为基准进行比较，并使用过去五十年的股票回报进行实证实验。结果表明，提议的估计器非常有竞争力，因为它在大多数场景中都明显优于基准。通过对其载荷的分析，我们发现所构建的因子与股票的行业分类有关。

{"title":"Asymmetric Autoencoders for Factor-Based Covariance Matrix Estimation","authors":"Kevin Huynh, Gregor Lenhard","doi":"10.1145/3533271.3561715","DOIUrl":"https://doi.org/10.1145/3533271.3561715","url":null,"abstract":"Estimating high dimensional covariance matrices for portfolio optimization is challenging because the number of parameters to be estimated grows quadratically in the number of assets. When the matrix dimension exceeds the sample size, the sample covariance matrix becomes singular. A possible solution is to impose a (latent) factor structure for the cross-section of asset returns as in the popular capital asset pricing model. Recent research suggests dimension reduction techniques to estimate the factors in a data-driven fashion. We present an asymmetric autoencoder neural network-based estimator that incorporates the factor structure in its architecture and jointly estimates the factors and their loadings. We test our method against well established dimension reduction techniques from the literature and compare them to observable factors as benchmark in an empirical experiment using stock returns of the past five decades. Results show that the proposed estimator is very competitive, as it significantly outperforms the benchmark across most scenarios. Analyzing the loadings, we find that the constructed factors are related to the stocks’ sector classification.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133907677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Intelligent Inventory Management for Cryptocurrency Brokers 加密货币经纪人的智能库存管理

Proceedings of the Third ACM International Conference on AI in Finance

Pub Date : 2022-10-26 DOI: 10.1145/3533271.3561661

Christopher Felder, J. Seemüller

In equity trading, internalization is the predominant execution method for uninformed order flow, allowing retail brokers to realize cost savings and thereby offer price improvements to customers. In cryptocurrency trading, there are doubts as to whether informed and uninformed traders can be distinguished in the same way, leading brokers to seek cost savings through internal order matching instead. Using the historical order flow of the German cryptocurrency broker BISON, we present a prediction-based approach to internal order matching: Upon receiving a customer order, our model forecasts whether future order flow will be sufficient to neutralize the order before the settlement date. With a prediction accuracy of 85%, it enables brokers to match three-quarters of order volume internally, which is three times as much as a traditional static approach, and realize meaningful cost savings, even after accounting for common minimum price improvements.

在股票交易中，内部化是无信息订单流的主要执行方法，允许零售经纪人实现成本节约，从而为客户提供价格改进。在加密货币交易中，人们怀疑是否可以以同样的方式区分知情和不知情的交易者，这导致经纪商转而通过内部订单匹配来寻求成本节约。使用德国加密货币经纪商BISON的历史订单流，我们提出了一种基于预测的内部订单匹配方法:在收到客户订单后，我们的模型预测未来的订单流是否足以在结算日期之前抵消订单。凭借85%的预测准确率，它使经纪商能够在内部匹配四分之三的订单量，这是传统静态方法的三倍，并且即使在考虑了共同的最低价格改进之后，也能实现有意义的成本节约。

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the Third ACM International Conference on AI in Finance

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀