首页 > 最新文献

Proceedings of the Third ACM International Conference on AI in Finance最新文献

英文 中文
Monotonic Neural Additive Models: Pursuing Regulated Machine Learning Models for Credit Scoring 单调神经加性模型:追求信用评分的调节机器学习模型
Pub Date : 2022-09-21 DOI: 10.1145/3533271.3561691
Dangxing Chen, Weicheng Ye
The forecasting of credit default risk has been an active research field for several decades. Historically, logistic regression has been used as a major tool due to its compliance with regulatory requirements: transparency, explainability, and fairness. In recent years, researchers have increasingly used complex and advanced machine learning methods to improve prediction accuracy. Even though a machine learning method could potentially improve the model accuracy, it complicates simple logistic regression, deteriorates explainability, and often violates fairness. In the absence of compliance with regulatory requirements, even highly accurate machine learning methods are unlikely to be accepted by companies for credit scoring. In this paper, we introduce a novel class of monotonic neural additive models, which meet regulatory requirements by simplifying neural network architecture and enforcing monotonicity. By utilizing the special architectural features of the neural additive model, the monotonic neural additive model penalizes monotonicity violations effectively. Consequently, the computational cost of training a monotonic neural additive model is similar to that of training a neural additive model, as a free lunch. We demonstrate through empirical results that our new model is as accurate as black-box fully-connected neural networks, providing a highly accurate and regulated machine learning method.
几十年来,信用违约风险预测一直是一个活跃的研究领域。从历史上看,逻辑回归一直被用作主要工具,因为它符合监管要求:透明度、可解释性和公平性。近年来,研究人员越来越多地使用复杂和先进的机器学习方法来提高预测精度。尽管机器学习方法可以潜在地提高模型的准确性,但它使简单的逻辑回归变得复杂,降低了可解释性,并且经常违反公平性。在不遵守监管要求的情况下,即使是高度精确的机器学习方法也不太可能被公司接受用于信用评分。本文引入了一类新的单调神经加性模型,该模型通过简化神经网络结构和增强单调性来满足调节要求。单调神经加性模型利用神经加性模型特有的结构特征,对单调性违规行为进行了有效的惩罚。因此,训练单调神经加性模型的计算成本与训练神经加性模型的计算成本类似,就像一顿免费的午餐。我们通过实证结果证明,我们的新模型与黑盒全连接神经网络一样准确,提供了一种高度精确和规范的机器学习方法。
{"title":"Monotonic Neural Additive Models: Pursuing Regulated Machine Learning Models for Credit Scoring","authors":"Dangxing Chen, Weicheng Ye","doi":"10.1145/3533271.3561691","DOIUrl":"https://doi.org/10.1145/3533271.3561691","url":null,"abstract":"The forecasting of credit default risk has been an active research field for several decades. Historically, logistic regression has been used as a major tool due to its compliance with regulatory requirements: transparency, explainability, and fairness. In recent years, researchers have increasingly used complex and advanced machine learning methods to improve prediction accuracy. Even though a machine learning method could potentially improve the model accuracy, it complicates simple logistic regression, deteriorates explainability, and often violates fairness. In the absence of compliance with regulatory requirements, even highly accurate machine learning methods are unlikely to be accepted by companies for credit scoring. In this paper, we introduce a novel class of monotonic neural additive models, which meet regulatory requirements by simplifying neural network architecture and enforcing monotonicity. By utilizing the special architectural features of the neural additive model, the monotonic neural additive model penalizes monotonicity violations effectively. Consequently, the computational cost of training a monotonic neural additive model is similar to that of training a neural additive model, as a free lunch. We demonstrate through empirical results that our new model is as accurate as black-box fully-connected neural networks, providing a highly accurate and regulated machine learning method.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"142 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116431321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
RESHAPE: Explaining Accounting Anomalies in Financial Statement Audits by enhancing SHapley Additive exPlanations 重塑:通过加强SHapley加性解释解释财务报表审计中的会计异常
Pub Date : 2022-09-19 DOI: 10.1145/3533271.3561667
Ricardo Müller, Marco Schreyer, Timur Sattarov, Damian Borth
Detecting accounting anomalies is a recurrent challenge in financial statement audits. Recently, novel methods derived from Deep-Learning (DL) such as Autoencoder Neural Networks (AENNs) have been proposed to audit the large volumes of a statement’s underlying accounting records. However, due to their vast number of parameters, such models exhibit the drawback of being inherently opaque. At the same time, the concealing of a model’s inner workings often hinders its real-world application in financial audits, since auditors must reasonably explain and justify their audit decisions. Nowadays, various Explainable AI (XAI) techniques have been proposed to address this challenge, e.g., SHapley Additive exPlanations (SHAP). However, in unsupervised DL as often applied in financial audits, these methods explain the model output at the level of encoded variables. As a result, the explanations of AENNs are often hard to comprehend by human auditors. To mitigate this drawback, we propose Reconstruction Error SHapley Additive exPlanations Extension (RESHAPE), which explains the model output on an aggregated attribute level. In addition, we introduce an evaluation framework to compare the versatility of XAI methods in auditing. Our experimental results show empirical evidence that RESHAPE results in versatile explanations compared to state-of-the-art baselines. We envision such attribute-level explanations as a necessary next step in the adoption of unsupervised DL techniques in financial auditing.
在财务报表审计中,发现会计异常是一个反复出现的挑战。最近,人们提出了基于深度学习(DL)的新方法,如自动编码器神经网络(AENNs),用于审计大量报表的基础会计记录。然而,由于它们有大量的参数,这些模型表现出固有的不透明的缺点。同时,由于审计师必须合理地解释和证明其审计决策的合理性,模型内部工作原理的隐藏往往会阻碍其在财务审计中的实际应用。如今,各种可解释的人工智能(XAI)技术已经被提出来解决这一挑战,例如,SHapley加性解释(SHAP)。然而,在财务审计中经常应用的无监督深度学习中,这些方法在编码变量的水平上解释模型输出。因此,人工审计师通常很难理解aenn的解释。为了减轻这个缺点,我们提出了重建错误SHapley加性解释扩展(重塑),它在聚合属性级别上解释模型输出。此外,我们还介绍了一个评估框架,以比较XAI方法在审计中的多功能性。我们的实验结果显示,与最先进的基线相比,在多种解释中重塑结果的经验证据。我们设想这种属性级解释是在财务审计中采用无监督深度学习技术的必要下一步。
{"title":"RESHAPE: Explaining Accounting Anomalies in Financial Statement Audits by enhancing SHapley Additive exPlanations","authors":"Ricardo Müller, Marco Schreyer, Timur Sattarov, Damian Borth","doi":"10.1145/3533271.3561667","DOIUrl":"https://doi.org/10.1145/3533271.3561667","url":null,"abstract":"Detecting accounting anomalies is a recurrent challenge in financial statement audits. Recently, novel methods derived from Deep-Learning (DL) such as Autoencoder Neural Networks (AENNs) have been proposed to audit the large volumes of a statement’s underlying accounting records. However, due to their vast number of parameters, such models exhibit the drawback of being inherently opaque. At the same time, the concealing of a model’s inner workings often hinders its real-world application in financial audits, since auditors must reasonably explain and justify their audit decisions. Nowadays, various Explainable AI (XAI) techniques have been proposed to address this challenge, e.g., SHapley Additive exPlanations (SHAP). However, in unsupervised DL as often applied in financial audits, these methods explain the model output at the level of encoded variables. As a result, the explanations of AENNs are often hard to comprehend by human auditors. To mitigate this drawback, we propose Reconstruction Error SHapley Additive exPlanations Extension (RESHAPE), which explains the model output on an aggregated attribute level. In addition, we introduce an evaluation framework to compare the versatility of XAI methods in auditing. Our experimental results show empirical evidence that RESHAPE results in versatile explanations compared to state-of-the-art baselines. We envision such attribute-level explanations as a necessary next step in the adoption of unsupervised DL techniques in financial auditing.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"130 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115421730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Federated and Privacy-Preserving Learning of Accounting Data in Financial Statement Audits 财务报表审计中会计数据的联合和隐私保护学习
Pub Date : 2022-08-26 DOI: 10.1145/3533271.3561674
Marco Schreyer, Timur Sattarov, Damian Borth
The ongoing ‘digital transformation’ fundamentally changes audit evidence’s nature, recording, and volume. Nowadays, the International Standards on Auditing (ISA) requires auditors to examine vast volumes of a financial statement’s underlying digital accounting records. As a result, audit firms also ‘digitize’ their analytical capabilities and invest in Deep Learning (DL), a successful sub-discipline of Machine Learning. The application of DL offers the ability to learn specialized audit models from data of multiple clients, e.g., organizations operating in the same industry or jurisdiction. In general, regulations require auditors to adhere to strict data confidentiality measures. At the same time, recent intriguing discoveries showed that large-scale DL models are vulnerable to leaking sensitive training data information. Today, it often remains unclear how audit firms can apply DL models while complying with data protection regulations. In this work, we propose a Federated Learning framework to train DL models on auditing relevant accounting data of multiple clients. The framework encompasses Differential Privacy and Split Learning capabilities to mitigate data confidentiality risks at model inference. Our results provide empirical evidence that auditors can benefit from DL models that accumulate knowledge from multiple sources of proprietary client data.
正在进行的“数字化转型”从根本上改变了审计证据的性质、记录和数量。如今,国际审计准则(ISA)要求审计人员检查大量财务报表的基础数字会计记录。因此,审计公司也将其分析能力“数字化”,并投资于深度学习(DL),这是机器学习的一个成功的分支学科。DL的应用提供了从多个客户的数据中学习专业审计模型的能力,例如,在同一行业或管辖区内运营的组织。一般来说,法规要求审计师遵守严格的数据保密措施。与此同时,最近有趣的发现表明,大规模深度学习模型容易泄露敏感的训练数据信息。如今,审计公司如何在遵守数据保护法规的同时应用深度学习模型往往仍不清楚。在这项工作中,我们提出了一个联邦学习框架来训练DL模型审计多个客户的相关会计数据。该框架包含差分隐私和分割学习功能,以减轻模型推理时的数据机密性风险。我们的研究结果提供了经验证据,表明审计师可以从从多个专有客户数据来源积累知识的深度学习模型中受益。
{"title":"Federated and Privacy-Preserving Learning of Accounting Data in Financial Statement Audits","authors":"Marco Schreyer, Timur Sattarov, Damian Borth","doi":"10.1145/3533271.3561674","DOIUrl":"https://doi.org/10.1145/3533271.3561674","url":null,"abstract":"The ongoing ‘digital transformation’ fundamentally changes audit evidence’s nature, recording, and volume. Nowadays, the International Standards on Auditing (ISA) requires auditors to examine vast volumes of a financial statement’s underlying digital accounting records. As a result, audit firms also ‘digitize’ their analytical capabilities and invest in Deep Learning (DL), a successful sub-discipline of Machine Learning. The application of DL offers the ability to learn specialized audit models from data of multiple clients, e.g., organizations operating in the same industry or jurisdiction. In general, regulations require auditors to adhere to strict data confidentiality measures. At the same time, recent intriguing discoveries showed that large-scale DL models are vulnerable to leaking sensitive training data information. Today, it often remains unclear how audit firms can apply DL models while complying with data protection regulations. In this work, we propose a Federated Learning framework to train DL models on auditing relevant accounting data of multiple clients. The framework encompasses Differential Privacy and Split Learning capabilities to mitigate data confidentiality risks at model inference. Our results provide empirical evidence that auditors can benefit from DL models that accumulate knowledge from multiple sources of proprietary client data.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127613105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Online Learning for Mixture of Multivariate Hawkes Processes 多元Hawkes过程混合的在线学习
Pub Date : 2022-08-16 DOI: 10.1145/3533271.3561771
Mohsen Ghassemi, Niccolò Dalmasso, Simran Lamba, V. Potluru, Sameena Shah, T. Balch, M. Veloso
Online learning of Hawkes processes has received increasing attention in the last couple of years especially for modeling a network of actors. However, these works typically either model the rich interaction between the events or the latent cluster of the actors or the network structure between the actors. We propose to model the latent structure of the network of actors as well as their rich interaction across events for real-world settings of medical and financial applications. Experimental results on both synthetic and real-world data showcase the efficacy of our approach.
Hawkes过程的在线学习在过去几年中受到了越来越多的关注,特别是对参与者网络的建模。然而,这些作品通常要么模拟事件之间的丰富互动,要么模拟参与者的潜在集群,要么模拟参与者之间的网络结构。我们建议对参与者网络的潜在结构以及他们在医疗和金融应用的现实世界设置中的事件之间的丰富交互进行建模。在合成数据和实际数据上的实验结果显示了我们的方法的有效性。
{"title":"Online Learning for Mixture of Multivariate Hawkes Processes","authors":"Mohsen Ghassemi, Niccolò Dalmasso, Simran Lamba, V. Potluru, Sameena Shah, T. Balch, M. Veloso","doi":"10.1145/3533271.3561771","DOIUrl":"https://doi.org/10.1145/3533271.3561771","url":null,"abstract":"Online learning of Hawkes processes has received increasing attention in the last couple of years especially for modeling a network of actors. However, these works typically either model the rich interaction between the events or the latent cluster of the actors or the network structure between the actors. We propose to model the latent structure of the network of actors as well as their rich interaction across events for real-world settings of medical and financial applications. Experimental results on both synthetic and real-world data showcase the efficacy of our approach.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132268943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Strategic Asset Allocation with Illiquid Alternatives 非流动性选择的战略资产配置
Pub Date : 2022-07-15 DOI: 10.1145/3533271.3561769
Eric Luxenberg, Stephen P. Boyd, Mykel J. Kochenderfer, M. Beek, Wen Cao, Steven Diamond, A. Ulitsky, Kunal Menda, V. Vairavamurthy
We address the problem of strategic asset allocation (SAA) with portfolios that include illiquid alternative asset classes. The main challenge in portfolio construction with illiquid asset classes is that we do not have direct control over our positions, as we do in liquid asset classes. Instead we can only make commitments; the position builds up over time as capital calls come in, and reduces over time as distributions occur, neither of which the investor has direct control over. The effect on positions of our commitments is subject to a delay, typically of a few years, and is also unknown or stochastic. A further challenge is the requirement that we can meet the capital calls, with very high probability, with our liquid assets. We formulate the illiquid dynamics as a random linear system, and propose a convex optimization based model predictive control (MPC) policy for allocating liquid assets and making new illiquid commitments in each period. Despite the challenges of time delay and uncertainty, we show that this policy attains performance surprisingly close to a fictional setting where we pretend the illiquid asset classes are completely liquid, and we can arbitrarily and immediately adjust our positions. In this paper we focus on the growth problem, with no external liabilities or income, but the method is readily extended to handle this case.
我们通过包括非流动性另类资产类别的投资组合解决战略资产配置问题。构建非流动性资产类别的投资组合面临的主要挑战是,我们不能像在流动性资产类别中那样直接控制自己的头寸。相反,我们只能做出承诺;随着时间的推移,随着资金的进入,头寸会逐渐增加,随着分配的发生,头寸会逐渐减少,这两者投资者都无法直接控制。我们的承诺对立场的影响是有延迟的,通常是几年,而且是未知的或随机的。另一个挑战是,要求我们有能力(以非常高的概率)用我们的流动资产满足资本要求。我们将非流动性动力学描述为一个随机线性系统,并提出了一种基于凸优化的模型预测控制(MPC)策略,用于分配流动资产并在每个时期做出新的非流动性承诺。尽管存在时间延迟和不确定性的挑战,但我们表明,该政策的表现惊人地接近于一个虚构的设置,在这个设置中,我们假设非流动性资产类别是完全流动的,我们可以任意地立即调整我们的头寸。在本文中,我们关注的是增长问题,没有外部负债或收入,但该方法很容易推广到处理这种情况。
{"title":"Strategic Asset Allocation with Illiquid Alternatives","authors":"Eric Luxenberg, Stephen P. Boyd, Mykel J. Kochenderfer, M. Beek, Wen Cao, Steven Diamond, A. Ulitsky, Kunal Menda, V. Vairavamurthy","doi":"10.1145/3533271.3561769","DOIUrl":"https://doi.org/10.1145/3533271.3561769","url":null,"abstract":"We address the problem of strategic asset allocation (SAA) with portfolios that include illiquid alternative asset classes. The main challenge in portfolio construction with illiquid asset classes is that we do not have direct control over our positions, as we do in liquid asset classes. Instead we can only make commitments; the position builds up over time as capital calls come in, and reduces over time as distributions occur, neither of which the investor has direct control over. The effect on positions of our commitments is subject to a delay, typically of a few years, and is also unknown or stochastic. A further challenge is the requirement that we can meet the capital calls, with very high probability, with our liquid assets. We formulate the illiquid dynamics as a random linear system, and propose a convex optimization based model predictive control (MPC) policy for allocating liquid assets and making new illiquid commitments in each period. Despite the challenges of time delay and uncertainty, we show that this policy attains performance surprisingly close to a fictional setting where we pretend the illiquid asset classes are completely liquid, and we can arbitrarily and immediately adjust our positions. In this paper we focus on the growth problem, with no external liabilities or income, but the method is readily extended to handle this case.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114769265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Deep Hedging: Continuous Reinforcement Learning for Hedging of General Portfolios across Multiple Risk Aversions 深度套期保值:跨多重风险厌恶的通用投资组合套期保值的持续强化学习
Pub Date : 2022-07-15 DOI: 10.1145/3533271.3561731
Phillip Murray, Ben Wood, Hans Buehler, Magnus Wiese, Mikko S. Pakkanen
We present a method for finding optimal hedging policies for arbitrary initial portfolios and market states. We develop a novel actor-critic algorithm for solving general risk-averse stochastic control problems and use it to learn hedging strategies across multiple risk aversion levels simultaneously. We demonstrate the effectiveness of the approach with a numerical example in a stochastic volatility environment.
提出了一种针对任意初始投资组合和市场状态寻找最优对冲策略的方法。我们开发了一种新的行为者-批评家算法来解决一般的风险厌恶随机控制问题,并使用它来同时学习多个风险厌恶水平的对冲策略。通过一个随机波动环境下的数值算例验证了该方法的有效性。
{"title":"Deep Hedging: Continuous Reinforcement Learning for Hedging of General Portfolios across Multiple Risk Aversions","authors":"Phillip Murray, Ben Wood, Hans Buehler, Magnus Wiese, Mikko S. Pakkanen","doi":"10.1145/3533271.3561731","DOIUrl":"https://doi.org/10.1145/3533271.3561731","url":null,"abstract":"We present a method for finding optimal hedging policies for arbitrary initial portfolios and market states. We develop a novel actor-critic algorithm for solving general risk-averse stochastic control problems and use it to learn hedging strategies across multiple risk aversion levels simultaneously. We demonstrate the effectiveness of the approach with a numerical example in a stochastic volatility environment.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126753699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Learning Mutual Fund Categorization using Natural Language Processing 使用自然语言处理学习共同基金分类
Pub Date : 2022-07-11 DOI: 10.1145/3533271.3561748
Dimitrios Vamvourellis, M. Tóth, Dhruv Desai, D. Mehta, S. Pasquali
Categorization of mutual funds or Exchange-Traded-funds (ETFs) have long served the financial analysts to perform peer analysis for various purposes starting from competitor analysis, to quantifying portfolio diversification. The categorization methodology usually relies on fund composition data in the structured format extracted from the Form N-1A. Here, we initiate a study to learn the categorization system directly from the unstructured data as depicted in the forms using natural language processing (NLP). Positing as a multi-class classification problem with the input data being only the investment strategy description as reported in the form and the target variable being the Lipper Global categories, and using various NLP models, we show that the categorization system can indeed be learned with high accuracy. We discuss implications and applications of our findings as well as limitations of existing pre-trained architectures in applying them to learn fund categorization.
长期以来,共同基金或交易所交易基金(etf)的分类为金融分析师提供了从竞争对手分析到量化投资组合多样化等各种目的的同行分析。分类方法通常依赖于从N-1A表格中提取的结构化基金组成数据。在这里,我们启动了一项研究,使用自然语言处理(NLP)直接从表格中描述的非结构化数据中学习分类系统。假设一个多类分类问题,输入数据仅为表格中报告的投资策略描述,目标变量为Lipper Global类别,并使用各种NLP模型,我们表明分类系统确实可以以高精度学习。我们讨论了我们的发现的意义和应用,以及现有的预训练架构在应用它们来学习基金分类方面的局限性。
{"title":"Learning Mutual Fund Categorization using Natural Language Processing","authors":"Dimitrios Vamvourellis, M. Tóth, Dhruv Desai, D. Mehta, S. Pasquali","doi":"10.1145/3533271.3561748","DOIUrl":"https://doi.org/10.1145/3533271.3561748","url":null,"abstract":"Categorization of mutual funds or Exchange-Traded-funds (ETFs) have long served the financial analysts to perform peer analysis for various purposes starting from competitor analysis, to quantifying portfolio diversification. The categorization methodology usually relies on fund composition data in the structured format extracted from the Form N-1A. Here, we initiate a study to learn the categorization system directly from the unstructured data as depicted in the forms using natural language processing (NLP). Positing as a multi-class classification problem with the input data being only the investment strategy description as reported in the form and the target variable being the Lipper Global categories, and using various NLP models, we show that the categorization system can indeed be learned with high accuracy. We discuss implications and applications of our findings as well as limitations of existing pre-trained architectures in applying them to learn fund categorization.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127389827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Supervised similarity learning for corporate bonds using Random Forest proximities 基于随机森林近似的公司债券监督相似学习
Pub Date : 2022-07-10 DOI: 10.1145/3533271.3561736
Jerinsh Jeyapaulraj, Dhruv Desai, D. Mehta, Peter Chu, S. Pasquali, P. Sommer
Financial literature consists of ample research on similarity and comparison of financial assets and securities such as stocks, bonds, mutual funds, etc. However, going beyond correlations or aggregate statistics has been arduous since financial datasets are noisy, lack useful features, have missing data and often lack ground truth or annotated labels. However, though similarity extrapolated from these traditional models heuristically may work well on an aggregate level, such as risk management when looking at large portfolios, they often fail when used for portfolio construction and trading which require a local and dynamic measure of similarity on top of global measure. In this paper we propose a supervised similarity framework for corporate bonds which allows for inference based on both local and global measures. From a machine learning perspective, this paper emphasis that random forest (RF), which is usually viewed as a supervised learning algorithm, can also be used as a similarity learning (more specifically, a distance metric learning) algorithm. In addition, this framework proposes a novel metric to evaluate similarities, and analyses other metrics which further demonstrate that RF outperforms all other methods experimented with, in this work.
金融文献包括对金融资产与证券(如股票、债券、共同基金等)的相似性和比较的大量研究。然而,超越相关性或汇总统计一直是艰巨的,因为金融数据集是嘈杂的,缺乏有用的特征,有缺失的数据,往往缺乏基础真理或注释标签。然而,尽管从这些传统模型中启发式地推断出的相似性可能在总体水平上工作得很好,例如在查看大型投资组合时的风险管理,但当用于投资组合构建和交易时,它们通常会失败,因为这些投资组合需要在全局度量之上进行局部和动态的相似性度量。在本文中,我们提出了一个公司债券的监督相似性框架,该框架允许基于局部和全局措施的推断。从机器学习的角度来看,本文强调随机森林(RF),通常被视为一种监督学习算法,也可以用作相似学习(更具体地说,是距离度量学习)算法。此外,该框架提出了一种新的度量来评估相似性,并分析了其他度量,这些度量进一步证明RF优于本工作中实验过的所有其他方法。
{"title":"Supervised similarity learning for corporate bonds using Random Forest proximities","authors":"Jerinsh Jeyapaulraj, Dhruv Desai, D. Mehta, Peter Chu, S. Pasquali, P. Sommer","doi":"10.1145/3533271.3561736","DOIUrl":"https://doi.org/10.1145/3533271.3561736","url":null,"abstract":"Financial literature consists of ample research on similarity and comparison of financial assets and securities such as stocks, bonds, mutual funds, etc. However, going beyond correlations or aggregate statistics has been arduous since financial datasets are noisy, lack useful features, have missing data and often lack ground truth or annotated labels. However, though similarity extrapolated from these traditional models heuristically may work well on an aggregate level, such as risk management when looking at large portfolios, they often fail when used for portfolio construction and trading which require a local and dynamic measure of similarity on top of global measure. In this paper we propose a supervised similarity framework for corporate bonds which allows for inference based on both local and global measures. From a machine learning perspective, this paper emphasis that random forest (RF), which is usually viewed as a supervised learning algorithm, can also be used as a similarity learning (more specifically, a distance metric learning) algorithm. In addition, this framework proposes a novel metric to evaluate similarities, and analyses other metrics which further demonstrate that RF outperforms all other methods experimented with, in this work.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128411779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Reinforcement Learning for Intra-and-Inter-Bank Borrowing and Lending Mean Field Control Game 银行同业借贷平均场控制博弈的强化学习
Pub Date : 2022-07-07 DOI: 10.1145/3533271.3561743
Jimin Lin, Andrea Angiuli, Nils Detering, J. Fouque, M. Laurière
We propose a mean field control game (MFCG) model for the intra-and-inter-bank borrowing and lending problem. This framework allows to study the competitive game arising between groups of collaborative banks. The solution is provided in terms of an asymptotic Nash equilibrium between the groups in the infinite horizon. A three-timescale reinforcement learning algorithm is applied to learn the optimal borrowing and lending strategy in a data driven way when the model is unknown. An empirical numerical analysis shows the importance of the three-timescale, the impact of the exploration strategy when the model is unknown, and the convergence of the algorithm.
针对银行内部和银行间的借贷问题,我们提出了一个平均场控制博弈(MFCG)模型。该框架允许研究合作银行集团之间产生的竞争博弈。用无穷视界上群间的渐近纳什均衡给出了解。在模型未知的情况下,采用三时间尺度强化学习算法,以数据驱动的方式学习最优借贷策略。实证数值分析表明了三个时间尺度的重要性、模型未知时探索策略的影响以及算法的收敛性。
{"title":"Reinforcement Learning for Intra-and-Inter-Bank Borrowing and Lending Mean Field Control Game","authors":"Jimin Lin, Andrea Angiuli, Nils Detering, J. Fouque, M. Laurière","doi":"10.1145/3533271.3561743","DOIUrl":"https://doi.org/10.1145/3533271.3561743","url":null,"abstract":"We propose a mean field control game (MFCG) model for the intra-and-inter-bank borrowing and lending problem. This framework allows to study the competitive game arising between groups of collaborative banks. The solution is provided in terms of an asymptotic Nash equilibrium between the groups in the infinite horizon. A three-timescale reinforcement learning algorithm is applied to learn the optimal borrowing and lending strategy in a data driven way when the model is unknown. An empirical numerical analysis shows the importance of the three-timescale, the impact of the exploration strategy when the model is unknown, and the convergence of the algorithm.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"28 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133003512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Market Making with Scaled Beta Policies 按比例贝塔政策做市
Pub Date : 2022-07-07 DOI: 10.1145/3533271.3561745
Joseph Jerome, Gregory Palmer, Rahul Savani
This paper introduces a new representation for the actions of a market maker in an order-driven market. This representation uses scaled beta distributions, and generalises three approaches taken in the artificial intelligence for market making literature: single price-level selection, ladder strategies, and “market making at the touch”. Ladder strategies place uniform volume across an interval of contiguous prices. Scaled beta distribution based policies generalise these, allowing volume to be skewed across the price interval. We demonstrate that this flexibility is useful for inventory management, one of the key challenges faced by a market maker. We conduct three main experiments: first, we compare our more flexible beta-based actions with the special case of ladder strategies; then, we investigate the performance of simple fixed distributions; and finally, we devise and evaluate a simple and intuitive dynamic control policy that adjusts actions in a continuous manner depending on the signed inventory that the market maker has acquired. All empirical evaluations use a high-fidelity limit order book simulator based on historical data with 50 levels on each side.
本文介绍了指令驱动市场中做市商行为的一种新的表示形式。这种表示使用缩放beta分布,并概括了人工智能做市文献中采用的三种方法:单一价格水平选择、阶梯策略和“触碰做市”。阶梯策略在连续价格区间内放置均匀的成交量。基于比例贝塔分布的策略概括了这些,允许交易量在价格区间内倾斜。我们证明了这种灵活性对于库存管理是有用的,这是做市商面临的主要挑战之一。我们进行了三个主要实验:首先,我们将更灵活的基于beta的行为与阶梯策略的特殊情况进行比较;然后,我们研究了简单固定分布的性能;最后,我们设计并评估了一个简单直观的动态控制策略,该策略根据做市商获得的已签署库存以连续的方式调整行动。所有的经验评估使用高保真的限价订单模拟器,基于历史数据,每侧有50个级别。
{"title":"Market Making with Scaled Beta Policies","authors":"Joseph Jerome, Gregory Palmer, Rahul Savani","doi":"10.1145/3533271.3561745","DOIUrl":"https://doi.org/10.1145/3533271.3561745","url":null,"abstract":"This paper introduces a new representation for the actions of a market maker in an order-driven market. This representation uses scaled beta distributions, and generalises three approaches taken in the artificial intelligence for market making literature: single price-level selection, ladder strategies, and “market making at the touch”. Ladder strategies place uniform volume across an interval of contiguous prices. Scaled beta distribution based policies generalise these, allowing volume to be skewed across the price interval. We demonstrate that this flexibility is useful for inventory management, one of the key challenges faced by a market maker. We conduct three main experiments: first, we compare our more flexible beta-based actions with the special case of ladder strategies; then, we investigate the performance of simple fixed distributions; and finally, we devise and evaluate a simple and intuitive dynamic control policy that adjusts actions in a continuous manner depending on the signed inventory that the market maker has acquired. All empirical evaluations use a high-fidelity limit order book simulator based on historical data with 50 levels on each side.","PeriodicalId":134888,"journal":{"name":"Proceedings of the Third ACM International Conference on AI in Finance","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131575892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
Proceedings of the Third ACM International Conference on AI in Finance
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1