首页 > 最新文献

Journal of Finance and Data Science最新文献

英文 中文
The use of predictive analytics in finance 预测分析在金融中的应用
Q1 Mathematics Pub Date : 2022-11-01 DOI: 10.1016/j.jfds.2022.05.003
Daniel Broby

Statistical and computational methods are being increasingly integrated into Decision Support Systems to aid management and help with strategic decisions. Researchers need to fully understand the use of such techniques in order to make predictions when using financial data. This paper therefore presents a method based literature review focused on the predictive analytics domain. The study comprehensively covers classification, regression, clustering, association and time series models. It expands existing explanatory statistical modelling into the realm of computational modelling. The methods explored enable the prediction of the future through the analysis of financial time series and cross-sectional data that is collected, stored and processed in Information Systems. The output of such models allow financial managers and risk oversight professionals to achieve better outcomes. This review brings the various predictive analytic methods in finance together under one domain.

统计和计算方法正越来越多地集成到决策支持系统中,以帮助管理和帮助战略决策。研究人员需要充分了解这些技术的使用,以便在使用财务数据时做出预测。因此,本文提出了一种基于预测分析领域的文献综述方法。该研究全面涵盖了分类、回归、聚类、关联和时间序列模型。它将现有的解释性统计模型扩展到计算模型的领域。所探索的方法可以通过分析在信息系统中收集、存储和处理的金融时间序列和横截面数据来预测未来。这些模型的输出使财务经理和风险监督专业人员能够取得更好的结果。本文综述了金融领域的各种预测分析方法。
{"title":"The use of predictive analytics in finance","authors":"Daniel Broby","doi":"10.1016/j.jfds.2022.05.003","DOIUrl":"https://doi.org/10.1016/j.jfds.2022.05.003","url":null,"abstract":"<div><p>Statistical and computational methods are being increasingly integrated into Decision Support Systems to aid management and help with strategic decisions. Researchers need to fully understand the use of such techniques in order to make predictions when using financial data. This paper therefore presents a method based literature review focused on the predictive analytics domain. The study comprehensively covers classification, regression, clustering, association and time series models. It expands existing explanatory statistical modelling into the realm of computational modelling. The methods explored enable the prediction of the future through the analysis of financial time series and cross-sectional data that is collected, stored and processed in Information Systems. The output of such models allow financial managers and risk oversight professionals to achieve better outcomes. This review brings the various predictive analytic methods in finance together under one domain.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"8 ","pages":"Pages 145-161"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918822000071/pdfft?md5=9b4df085f510c9fc0fa1fbac52010d37&pid=1-s2.0-S2405918822000071-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92057698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning portfolio allocation 机器学习投资组合分配
Q1 Mathematics Pub Date : 2022-11-01 DOI: 10.1016/j.jfds.2021.12.001
Michael Pinelis , David Ruppert

We find economically and statistically significant gains when using machine learning for portfolio allocation between the market index and risk-free asset. Optimal portfolio rules for time-varying expected returns and volatility are implemented with two Random Forest models. One model is employed in forecasting monthly excess returns with macroeconomic factors including payout yields. The second is used to estimate the prevailing volatility. Reward-risk timing with machine learning provides substantial improvements over the buy-and-hold in utility, risk-adjusted returns, and maximum drawdowns. This paper presents a unifying framework for machine learning applied to both return- and volatility-timing.

当使用机器学习在市场指数和无风险资产之间进行投资组合分配时,我们发现在经济上和统计上都有显著的收益。用两个随机森林模型实现了时变预期收益和波动率的最优投资组合规则。其中一个模型用于预测包括派息率在内的宏观经济因素的月度超额回报。第二种是用来估计当前的波动率。与效用、风险调整回报和最大回收量相比,机器学习的风险奖励时机提供了实质性的改进。本文提出了一个统一的机器学习框架,应用于回报和波动时序。
{"title":"Machine learning portfolio allocation","authors":"Michael Pinelis ,&nbsp;David Ruppert","doi":"10.1016/j.jfds.2021.12.001","DOIUrl":"https://doi.org/10.1016/j.jfds.2021.12.001","url":null,"abstract":"<div><p>We find economically and statistically significant gains when using machine learning for portfolio allocation between the market index and risk-free asset. Optimal portfolio rules for time-varying expected returns and volatility are implemented with two Random Forest models. One model is employed in forecasting monthly excess returns with macroeconomic factors including payout yields. The second is used to estimate the prevailing volatility. Reward-risk timing with machine learning provides substantial improvements over the buy-and-hold in utility, risk-adjusted returns, and maximum drawdowns. This paper presents a unifying framework for machine learning applied to both return- and volatility-timing.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"8 ","pages":"Pages 35-54"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918821000155/pdfft?md5=68c09e5e42a490b0df888e1badb3c66a&pid=1-s2.0-S2405918821000155-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92057696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Are there trade-offs with mandating timely disclosure of cybersecurity incidents? Evidence from state-level data breach disclosure laws 强制要求及时披露网络安全事件是否存在权衡?来自州级数据泄露披露法的证据
Q1 Mathematics Pub Date : 2022-11-01 DOI: 10.1016/j.jfds.2022.08.001
Musaib Ashraf, John (Xuefeng) Jiang, Isabel Yanyan Wang

On March 23, 2022, the SEC proposed that firms publicly disclose their cybersecurity incidents within four days of discovery. In the U.S., state-level data breach disclosure laws require firms to disclose the occurrence of a data breach, with some mandating disclosure within a deadline while others do not. Exploiting this state-level variation in disclosure deadlines, we find that, when facing a deadline, firms disclose a data breach 90 percent faster but are 58 percent less likely to disclose breach details. Investors respond negatively to delayed breach disclosures but are forgiving of a delay when it is used to gather more breach details. Our study highlights the trade-offs of mandating a disclosure deadline for cybersecurity incidents.

2022年3月23日,美国证券交易委员会提议,公司在发现网络安全事件后的四天内公开披露其网络安全事件。在美国,州级数据泄露披露法要求公司披露数据泄露的发生情况,有些州要求在最后期限内披露,而有些州则没有。利用各州在披露截止日期上的差异,我们发现,当面临截止日期时,公司披露数据泄露的速度要快90%,但披露泄露细节的可能性要低58%。投资者对延迟披露违规行为的反应是负面的,但如果是为了收集更多的违规细节,他们会原谅延迟。我们的研究强调了强制网络安全事件披露截止日期的权衡。
{"title":"Are there trade-offs with mandating timely disclosure of cybersecurity incidents? Evidence from state-level data breach disclosure laws","authors":"Musaib Ashraf,&nbsp;John (Xuefeng) Jiang,&nbsp;Isabel Yanyan Wang","doi":"10.1016/j.jfds.2022.08.001","DOIUrl":"https://doi.org/10.1016/j.jfds.2022.08.001","url":null,"abstract":"<div><p>On March 23, 2022, the SEC proposed that firms publicly disclose their cybersecurity incidents within four days of discovery. In the U.S., state-level data breach disclosure laws require firms to disclose the occurrence of a data breach, with some mandating disclosure within a deadline while others do not. Exploiting this state-level variation in disclosure deadlines, we find that, when facing a deadline, firms disclose a data breach 90 percent faster but are 58 percent less likely to disclose breach details. Investors respond negatively to delayed breach disclosures but are forgiving of a delay when it is used to gather more breach details. Our study highlights the trade-offs of mandating a disclosure deadline for cybersecurity incidents.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"8 ","pages":"Pages 202-213"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918822000101/pdfft?md5=12292f55581a3ddd898da95c706a8ab9&pid=1-s2.0-S2405918822000101-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92105793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance attribution of machine learning methods for stock returns prediction 股票收益预测的机器学习方法的性能归因
Q1 Mathematics Pub Date : 2022-11-01 DOI: 10.1016/j.jfds.2022.04.002
Stéphane Daul, Thibault Jaisson, Alexandra Nagy

We analyze the performance of investable portfolios built using predicted stock returns from machine learning methods and attribute their performance to linear, marginal non-linear and interaction effects. We use a large set of features including price-based, fundamental-based, and sentiment-based descriptors and use model averaging in the validation procedure to get robust out-of-sample predictions. We find that the superiority of regression trees and neural networks comes from two points: their strong regularization mechanism and their capacity to capture interaction effects. The non-linear component of the marginal predictions on the other hand has no predictive power. Thanks to our methodology, we manage to isolate and study in detail the interaction component. We find that it has significative long term performance independent from the linear modeling and is stable through time.

我们分析了使用机器学习方法预测股票收益构建的可投资组合的绩效,并将其绩效归因于线性、边际非线性和相互作用效应。我们使用了大量的特征,包括基于价格的、基于基本面的和基于情绪的描述符,并在验证过程中使用模型平均来获得稳健的样本外预测。我们发现回归树和神经网络的优势来自两点:它们强大的正则化机制和捕捉交互效应的能力。另一方面,边际预测的非线性成分没有预测能力。由于我们的方法,我们成功地隔离并详细研究了交互组件。我们发现它具有显著的独立于线性建模的长期性能,并且随时间稳定。
{"title":"Performance attribution of machine learning methods for stock returns prediction","authors":"Stéphane Daul,&nbsp;Thibault Jaisson,&nbsp;Alexandra Nagy","doi":"10.1016/j.jfds.2022.04.002","DOIUrl":"https://doi.org/10.1016/j.jfds.2022.04.002","url":null,"abstract":"<div><p>We analyze the performance of investable portfolios built using predicted stock returns from machine learning methods and attribute their performance to linear, marginal non-linear and interaction effects. We use a large set of features including price-based, fundamental-based, and sentiment-based descriptors and use model averaging in the validation procedure to get robust out-of-sample predictions. We find that the superiority of regression trees and neural networks comes from two points: their strong regularization mechanism and their capacity to capture interaction effects. The non-linear component of the marginal predictions on the other hand has no predictive power. Thanks to our methodology, we manage to isolate and study in detail the interaction component. We find that it has significative long term performance independent from the linear modeling and is stable through time.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"8 ","pages":"Pages 86-104"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918822000022/pdfft?md5=a333af8ca27553f8a065459247c83328&pid=1-s2.0-S2405918822000022-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92143374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving insurers’ loss reserve error prediction: Adopting combined unsupervised-supervised machine learning techniques in risk management 改进保险公司损失准备金误差预测:在风险管理中采用联合无监督监督机器学习技术
Q1 Mathematics Pub Date : 2022-11-01 DOI: 10.1016/j.jfds.2022.09.003
In Jung Song , Wookjae Heo

Emerging literature focuses on insurers' earnings management using estimated liability for unpaid claims, known as loss reserve. An insurance company generally uses the traditional estimation methods with linear estimation to measure loss reserve error, but those methods are often criticized for several statistical shortcomings, such as estimation technique, correlated contributing variables, ignorance of the interactions, and higher-order terms. To overcome such shortcomings, this paper proposes an unsupervised-supervised machine learning approach, hierarchical clustering, and artificial neural network (ANN) by adopting a combined unsupervised-supervised method, cluster analysis (i.e., unsupervised), and various supervised machine learning algorithms such as Boostings, Support Vector Machine (SVM) and RReliefF. We show evidence that each cluster has its own foundation variables to predict and Boosting and ANN estimation provide a more efficient framework to improve insurers' reserve error. Also, the different value and order of RReliefF between Boosting and OLS show the under-or over-estimated predictor, and each year's influential variables are found to be consistent over time, which indicates that the firm's previous year's loss reserve model can predict the future loss reserve error. This paper contributes to the existing literature by suggesting a more robust, consistent, and efficient prediction method (i.e., unsupervised-supervised combination method) to improve insurers' loss reserve error prediction.

新兴文献着重于保险公司的盈余管理使用估计负债未付索赔,被称为损失准备金。保险公司通常采用线性估计的传统估计方法来测量损失准备金误差,但这些方法经常因估计技术、相关贡献变量、忽略相互作用和高阶项等统计缺陷而受到批评。为了克服这些缺点,本文提出了一种无监督-监督机器学习方法,即分层聚类和人工神经网络(ANN),采用无监督-监督方法、聚类分析(即无监督)和各种监督机器学习算法(如boosting、支持向量机(SVM)和RReliefF)相结合的方法。我们证明了每个聚类都有自己的基础变量来预测,而Boosting和ANN估计提供了一个更有效的框架来改善保险公司的准备金误差。此外,Boosting和OLS之间的RReliefF值和阶数的不同显示了预测因子的低估或高估,并且发现每年的影响变量随时间的变化是一致的,这表明公司上一年的损失准备模型可以预测未来的损失准备误差。本文在现有文献的基础上,提出了一种更稳健、更一致、更高效的预测方法(即无监督-监督组合法),以改进保险公司的损失准备金误差预测。
{"title":"Improving insurers’ loss reserve error prediction: Adopting combined unsupervised-supervised machine learning techniques in risk management","authors":"In Jung Song ,&nbsp;Wookjae Heo","doi":"10.1016/j.jfds.2022.09.003","DOIUrl":"10.1016/j.jfds.2022.09.003","url":null,"abstract":"<div><p>Emerging literature focuses on insurers' earnings management using estimated liability for unpaid claims, known as loss reserve. An insurance company generally uses the traditional estimation methods with linear estimation to measure loss reserve error, but those methods are often criticized for several statistical shortcomings, such as estimation technique, correlated contributing variables, ignorance of the interactions, and higher-order terms. To overcome such shortcomings, this paper proposes an unsupervised-supervised machine learning approach, hierarchical clustering, and artificial neural network (ANN) by adopting a combined unsupervised-supervised method, cluster analysis (i.e., unsupervised), and various supervised machine learning algorithms such as Boostings, Support Vector Machine (SVM) and RReliefF. We show evidence that each cluster has its own foundation variables to predict and Boosting and ANN estimation provide a more efficient framework to improve insurers' reserve error. Also, the different value and order of RReliefF between Boosting and OLS show the under-or over-estimated predictor, and each year's influential variables are found to be consistent over time, which indicates that the firm's previous year's loss reserve model can predict the future loss reserve error. This paper contributes to the existing literature by suggesting a more robust, consistent, and efficient prediction method (i.e., unsupervised-supervised combination method) to improve insurers' loss reserve error prediction.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"8 ","pages":"Pages 233-254"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918822000137/pdfft?md5=8e507494fba02f659065c034fd48e212&pid=1-s2.0-S2405918822000137-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121438424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
FinLex: An effective use of word embeddings for financial lexicon generation FinLex:有效地使用词嵌入来生成金融词汇
Q1 Mathematics Pub Date : 2022-11-01 DOI: 10.1016/j.jfds.2021.10.001
Sanjiv R. Das , Michele Donini , Muhammad Bilal Zafar , John He , Krishnaram Kenthapadi

We present a simple and effective methodology for the generation of lexicons (word lists) that may be used in natural language scoring applications. In particular, in the finance industry, word lists have become ubiquitous for sentiment scoring. These have been derived from dictionaries such as the Harvard Inquirer and require manual curation. Here, we present an automated approach to the curation of lexicons, which makes automatic preparation of any word list immediate. We show that our automated word lists deliver comparable performance to traditional lexicons on machine learning classification tasks. This new approach will enable finance academics and practitioners to create and deploy new word lists in addition to the few traditional ones in a facile manner.

我们提出了一种简单而有效的方法来生成词典(单词列表),可以用于自然语言评分应用程序。特别是在金融行业,单词列表已经成为情绪评分的普遍工具。这些词汇来自《哈佛问询报》(Harvard Inquirer)等词典,需要人工管理。在这里,我们提出了一种自动的方法来管理词汇,这使得自动准备任何单词列表立即。我们表明,在机器学习分类任务上,我们的自动单词列表提供了与传统词汇相当的性能。这种新方法将使金融学者和从业人员能够以一种简便的方式创建和部署除了少数传统单词之外的新单词列表。
{"title":"FinLex: An effective use of word embeddings for financial lexicon generation","authors":"Sanjiv R. Das ,&nbsp;Michele Donini ,&nbsp;Muhammad Bilal Zafar ,&nbsp;John He ,&nbsp;Krishnaram Kenthapadi","doi":"10.1016/j.jfds.2021.10.001","DOIUrl":"10.1016/j.jfds.2021.10.001","url":null,"abstract":"<div><p>We present a simple and effective methodology for the generation of lexicons (word lists) that may be used in natural language scoring applications. In particular, in the finance industry, word lists have become ubiquitous for sentiment scoring. These have been derived from dictionaries such as the Harvard Inquirer and require manual curation. Here, we present an automated approach to the curation of lexicons, which makes automatic preparation of any word list immediate. We show that our automated word lists deliver comparable performance to traditional lexicons on machine learning classification tasks. This new approach will enable finance academics and practitioners to create and deploy new word lists in addition to the few traditional ones in a facile manner.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"8 ","pages":"Pages 1-11"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918821000131/pdfft?md5=f870155829ce1c2b61a45c753663ba75&pid=1-s2.0-S2405918821000131-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122227008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
CapitalVX: A machine learning model for startup selection and exit prediction CapitalVX:一个用于创业公司选择和退出预测的机器学习模型
Q1 Mathematics Pub Date : 2021-11-01 DOI: 10.1016/j.jfds.2021.04.001
Greg Ross , Sanjiv Das , Daniel Sciro , Hussain Raza

Using a big data set of venture capital financing and related startup firms from Crunchbase, this paper develops a machine-learning model called CapitalVX (for “Capital Venture eXchange”) to predict the outcomes for startups, i.e., whether they will exit successfully through an IPO or acquisition, fail, or remain private. Using a large feature set, the out-of-sample accuracy of predictions on startup outcomes and follow-on funding is 80–89%. This research suggests that VC/PE firms may be able to benefit from using machine learning to screen potential investments using publicly available information, diverting this time instead into mentoring and monitoring the investments they make.

本文使用来自Crunchbase的风险投资融资和相关创业公司的大数据集,开发了一个名为CapitalVX(“资本风险交易”)的机器学习模型,以预测创业公司的结果,即他们是否会通过IPO或收购成功退出,失败或保持私有。使用大型特征集,对创业结果和后续资金的预测的样本外准确度为80-89%。这项研究表明,VC/PE公司可能会受益于使用机器学习来利用公开信息筛选潜在的投资,而不是将时间转移到指导和监控他们所做的投资上。
{"title":"CapitalVX: A machine learning model for startup selection and exit prediction","authors":"Greg Ross ,&nbsp;Sanjiv Das ,&nbsp;Daniel Sciro ,&nbsp;Hussain Raza","doi":"10.1016/j.jfds.2021.04.001","DOIUrl":"https://doi.org/10.1016/j.jfds.2021.04.001","url":null,"abstract":"<div><p>Using a big data set of venture capital financing and related startup firms from Crunchbase, this paper develops a machine-learning model called CapitalVX (for “Capital Venture eXchange”) to predict the outcomes for startups, i.e., whether they will exit successfully through an IPO or acquisition, fail, or remain private. Using a large feature set, the out-of-sample accuracy of predictions on startup outcomes and follow-on funding is 80–89%. This research suggests that VC/PE firms may be able to benefit from using machine learning to screen potential investments using publicly available information, diverting this time instead into mentoring and monitoring the investments they make.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"7 ","pages":"Pages 94-114"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.jfds.2021.04.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91709763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Short-term bitcoin market prediction via machine learning 通过机器学习进行短期比特币市场预测
Q1 Mathematics Pub Date : 2021-11-01 DOI: 10.1016/j.jfds.2021.03.001
Patrick Jaquart, David Dann, Christof Weinhardt

We analyze the predictability of the bitcoin market across prediction horizons ranging from 1 to 60 min. In doing so, we test various machine learning models and find that, while all models outperform a random classifier, recurrent neural networks and gradient boosting classifiers are especially well-suited for the examined prediction tasks. We use a comprehensive feature set, including technical, blockchain-based, sentiment-/interest-based, and asset-based features. Our results show that technical features remain most relevant for most methods, followed by selected blockchain-based and sentiment-/interest-based features. Additionally, we find that predictability increases for longer prediction horizons. Although a quantile-based long-short trading strategy generates monthly returns of up to 39% before transaction costs, it leads to negative returns after taking transaction costs into account due to the particularly short holding periods.

我们分析了比特币市场在1到60分钟的预测范围内的可预测性。在此过程中,我们测试了各种机器学习模型,发现虽然所有模型都优于随机分类器,但循环神经网络和梯度增强分类器特别适合于所检查的预测任务。我们使用全面的功能集,包括技术,基于区块链,基于情感/兴趣和基于资产的功能。我们的研究结果表明,技术特征仍然与大多数方法最相关,其次是选定的基于区块链和基于情感/兴趣的特征。此外,我们发现预测范围越长,可预测性越高。尽管基于分位数的多空交易策略在扣除交易成本前的月回报率高达39%,但由于持有时间特别短,考虑交易成本后的月回报率为负。
{"title":"Short-term bitcoin market prediction via machine learning","authors":"Patrick Jaquart,&nbsp;David Dann,&nbsp;Christof Weinhardt","doi":"10.1016/j.jfds.2021.03.001","DOIUrl":"https://doi.org/10.1016/j.jfds.2021.03.001","url":null,"abstract":"<div><p>We analyze the predictability of the bitcoin market across prediction horizons ranging from 1 to 60 min. In doing so, we test various machine learning models and find that, while all models outperform a random classifier, recurrent neural networks and gradient boosting classifiers are especially well-suited for the examined prediction tasks. We use a comprehensive feature set, including technical, blockchain-based, sentiment-/interest-based, and asset-based features. Our results show that technical features remain most relevant for most methods, followed by selected blockchain-based and sentiment-/interest-based features. Additionally, we find that predictability increases for longer prediction horizons. Although a quantile-based long-short trading strategy generates monthly returns of up to 39% before transaction costs, it leads to negative returns after taking transaction costs into account due to the particularly short holding periods.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"7 ","pages":"Pages 45-66"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.jfds.2021.03.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91747221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 64
Inventory effects on the price dynamics of VSTOXX futures quantified via machine learning 通过机器学习量化库存对VSTOXX期货价格动态的影响
Q1 Mathematics Pub Date : 2021-11-01 DOI: 10.1016/j.jfds.2021.06.001
Daniel Guterding

The VSTOXX index tracks the expected 30-day volatility of the EURO STOXX 50 equity index. Futures on the VSTOXX index can, therefore, be used to hedge against economic uncertainty. We investigate the effect of trader inventory on the price of VSTOXX futures through a combination of stochastic processes and machine learning methods. We formulate a simple and efficient pricing methodology for VSTOXX futures, which assumes a Heston-type stochastic process for the underlying EURO STOXX 50 market. Under these dynamics, approximate analytical formulas for the implied volatility smile and the VSTOXX index have recently been derived. We use the EURO STOXX 50 option implied volatilities and the VSTOXX index value to estimate the parameters of this Heston model. Following the calibration, we calculate theoretical VSTOXX futures prices and compare them to the actual market prices. While theoretical and market prices are usually in line, we also observe time periods, during which the market price does not agree with our Heston model. We collect a variety of market features that could potentially explain the price deviations and calibrate two machine learning models to the price difference: a regularized linear model and a random forest. We find that both models indicate a strong influence of accumulated trader positions on the VSTOXX futures price.

VSTOXX指数跟踪欧洲STOXX 50指数的预期30天波动率。因此,VSTOXX指数的期货可以用来对冲经济的不确定性。我们通过随机过程和机器学习相结合的方法来研究交易者库存对VSTOXX期货价格的影响。我们为欧洲斯托克50指数期货制定了一个简单而有效的定价方法,该方法假设基础欧洲斯托克50指数市场具有赫斯顿型随机过程。在这些动态下,最近导出了隐含波动率微笑和VSTOXX指数的近似解析公式。我们使用欧元斯托克50期权隐含波动率和VSTOXX指数值来估计赫斯顿模型的参数。在校准之后,我们计算理论VSTOXX期货价格,并将其与实际市场价格进行比较。虽然理论价格和市场价格通常是一致的,但我们也观察到市场价格与我们的赫斯顿模型不一致的时间段。我们收集了各种可能解释价格偏差的市场特征,并将两个机器学习模型校准为价格差异:正则化线性模型和随机森林。我们发现这两个模型都表明累积交易者头寸对VSTOXX期货价格有很强的影响。
{"title":"Inventory effects on the price dynamics of VSTOXX futures quantified via machine learning","authors":"Daniel Guterding","doi":"10.1016/j.jfds.2021.06.001","DOIUrl":"10.1016/j.jfds.2021.06.001","url":null,"abstract":"<div><p>The VSTOXX index tracks the expected 30-day volatility of the EURO STOXX 50 equity index. Futures on the VSTOXX index can, therefore, be used to hedge against economic uncertainty. We investigate the effect of trader inventory on the price of VSTOXX futures through a combination of stochastic processes and machine learning methods. We formulate a simple and efficient pricing methodology for VSTOXX futures, which assumes a Heston-type stochastic process for the underlying EURO STOXX 50 market. Under these dynamics, approximate analytical formulas for the implied volatility smile and the VSTOXX index have recently been derived. We use the EURO STOXX 50 option implied volatilities and the VSTOXX index value to estimate the parameters of this Heston model. Following the calibration, we calculate theoretical VSTOXX futures prices and compare them to the actual market prices. While theoretical and market prices are usually in line, we also observe time periods, during which the market price does not agree with our Heston model. We collect a variety of market features that could potentially explain the price deviations and calibrate two machine learning models to the price difference: a regularized linear model and a random forest. We find that both models indicate a strong influence of accumulated trader positions on the VSTOXX futures price.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"7 ","pages":"Pages 126-142"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.jfds.2021.06.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91485732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Enhanced PD-implied ratings by targeting the credit rating migration matrix 通过针对信用评级迁移矩阵增强pd隐含评级
Q1 Mathematics Pub Date : 2021-11-01 DOI: 10.1016/j.jfds.2021.05.001
Jin-Chuan Duan , Shuping Li

A high-quality and granular probability of default (PD) model is on many practical dimensions far superior to any categorical credit rating system. Business adoption of a PD model, however, needs to factor in the long-established business/regulatory conventions built around letter-based credit ratings. A mapping methodology that converts granular PDs into letter ratings via referencing the historical default experience of some credit rating agency exists in the literature. This paper improves the PD implied rating (PDiR) methodology by targeting the historical credit migration matrix instead of simply default rates. This enhanced PDiR methodology makes it possible to bypass the reliance on arbitrarily extrapolated target default rates for the AAA and AA+ categories, a necessity due to the fact that the historical realized default rates on these two top rating grades are typically zero.

一个高质量和粒度的违约概率(PD)模型在许多实际维度上远远优于任何分类信用评级系统。然而,采用PD模型的业务需要考虑长期建立的业务/监管惯例,这些惯例是围绕基于信件的信用评级建立的。文献中存在一种映射方法,通过参考某些信用评级机构的历史违约经验,将颗粒级pd转换为字母评级。本文通过针对历史信用迁移矩阵而不是简单的违约率来改进PD隐含评级(PDiR)方法。这种增强的PDiR方法可以绕过对任意外推AAA和AA+类别的目标违约率的依赖,这是必要的,因为这两个最高评级等级的历史已实现违约率通常为零。
{"title":"Enhanced PD-implied ratings by targeting the credit rating migration matrix","authors":"Jin-Chuan Duan ,&nbsp;Shuping Li","doi":"10.1016/j.jfds.2021.05.001","DOIUrl":"https://doi.org/10.1016/j.jfds.2021.05.001","url":null,"abstract":"<div><p>A high-quality and granular probability of default (PD) model is on many practical dimensions far superior to any categorical credit rating system. Business adoption of a PD model, however, needs to factor in the long-established business/regulatory conventions built around letter-based credit ratings. A mapping methodology that converts granular PDs into letter ratings via referencing the historical default experience of some credit rating agency exists in the literature. This paper improves the PD implied rating (PDiR) methodology by targeting the historical credit migration matrix instead of simply default rates. This enhanced PDiR methodology makes it possible to bypass the reliance on arbitrarily extrapolated target default rates for the AAA and AA<sup>+</sup> categories, a necessity due to the fact that the historical realized default rates on these two top rating grades are typically zero.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"7 ","pages":"Pages 115-125"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.jfds.2021.05.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91709762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Journal of Finance and Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1