Forests for Fama

Joseph Simonian
{"title":"Forests for Fama","authors":"Joseph Simonian","doi":"10.3905/jfds.2021.1.086","DOIUrl":null,"url":null,"abstract":"In this article, the author addresses Eugene Fama’s skepticism regarding the predictability of stock market bubbles. To do so, he applies two ensemble learning methods, the random cut forest and random forest algorithms, to build a model that predicts large near-term drawdowns based on patterns in stock price behavior. The model includes three predictive variables. The first factor is an anomaly score produced by random cut forest, an algorithm specifically designed to detect outliers in streaming data. The second and third factors are the standard deviation of price returns and the return convexity over specified time horizons, with return convexity defined as the difference between one-year price returns and six-month price returns. The author’s predictions are based on random forest regressions. He applies the model to a large universe of equity sectors and factors. Blocked time-series cross-validation is used to evaluate the predictive efficacy of the model. The author shows that across the sectors and factors considered, the model presented produces predictive scores that are strongly positive. Although bubble prediction is surely a multidimensional endeavor that requires input from a variety of tools and sources, the author demonstrates that a framework built upon ensemble methods can be informationally additive to the detection of bubblelike behavior across a wide array of stocks.","PeriodicalId":199045,"journal":{"name":"The Journal of Financial Data Science","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Financial Data Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3905/jfds.2021.1.086","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In this article, the author addresses Eugene Fama’s skepticism regarding the predictability of stock market bubbles. To do so, he applies two ensemble learning methods, the random cut forest and random forest algorithms, to build a model that predicts large near-term drawdowns based on patterns in stock price behavior. The model includes three predictive variables. The first factor is an anomaly score produced by random cut forest, an algorithm specifically designed to detect outliers in streaming data. The second and third factors are the standard deviation of price returns and the return convexity over specified time horizons, with return convexity defined as the difference between one-year price returns and six-month price returns. The author’s predictions are based on random forest regressions. He applies the model to a large universe of equity sectors and factors. Blocked time-series cross-validation is used to evaluate the predictive efficacy of the model. The author shows that across the sectors and factors considered, the model presented produces predictive scores that are strongly positive. Although bubble prediction is surely a multidimensional endeavor that requires input from a variety of tools and sources, the author demonstrates that a framework built upon ensemble methods can be informationally additive to the detection of bubblelike behavior across a wide array of stocks.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Fama的森林
在这篇文章中,作者阐述了尤金·法玛对股市泡沫可预测性的怀疑。为此,他应用了两种集成学习方法,随机砍伐森林和随机森林算法,建立了一个模型,该模型可以根据股票价格行为模式预测近期的大幅下跌。该模型包含三个预测变量。第一个因素是随机砍伐森林产生的异常分数,这是一种专门设计用于检测流数据中的异常值的算法。第二个和第三个因素是价格回报的标准差和特定时间范围内的回报凸性,其中回报凸性定义为1年价格回报与6个月价格回报之间的差异。作者的预测是基于随机森林回归的。他将该模型应用于大量股票行业和因素。采用阻塞时间序列交叉验证来评估模型的预测效果。作者表明,在考虑的各个部门和因素中,所提出的模型产生的预测分数是非常积极的。虽然泡沫预测肯定是一个多维的努力,需要从各种工具和来源的输入,但作者证明了建立在集成方法上的框架可以在信息上添加到对大量股票的泡沫行为的检测中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Managing Editor’s Letter Explainable Machine Learning Models of Consumer Credit Risk Predicting Returns with Machine Learning across Horizons, Firm Size, and Time Deep Calibration with Artificial Neural Network: A Performance Comparison on Option-Pricing Models RIFT: Pretraining and Applications for Representations of Interrelated Financial Time Series
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1