{"title":"用机器学习组合探索因子动物园","authors":"Halis Sak , Tao Huang , Michael T. Chng","doi":"10.1016/j.irfa.2024.103599","DOIUrl":null,"url":null,"abstract":"<div><div>With the growing reliance on machine-learning (ML) methods in finance, an understanding of their long-term efficacy and underlying mechanism is needed. We document the time-varying importance of different stock characteristics over an 18-year (1998–2016) out-of-sample period to determine whether ML models, when trained on a large set of firm and trading characteristics, can consistently outperform factor models. Utilizing a combination of linear and nonlinear models, we form a ML portfolio that consistently generates a significant alpha against factor models, ranging from 2.14 to 2.74% per month. We uncover patterns in characteristic dominance that alternates between arbitrage and financial constraint features. The variation correlates with the US credit cycle, and highlights a fundamental economic mechanism underlying the ML portfolio’s performance. The study’s impact extends to both academics and practitioners, providing insights into the economic drivers of stock returns and the practical implementation of ML methods in portfolio construction.</div></div>","PeriodicalId":48226,"journal":{"name":"International Review of Financial Analysis","volume":"96 ","pages":"Article 103599"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring the factor zoo with a machine-learning portfolio\",\"authors\":\"Halis Sak , Tao Huang , Michael T. Chng\",\"doi\":\"10.1016/j.irfa.2024.103599\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With the growing reliance on machine-learning (ML) methods in finance, an understanding of their long-term efficacy and underlying mechanism is needed. We document the time-varying importance of different stock characteristics over an 18-year (1998–2016) out-of-sample period to determine whether ML models, when trained on a large set of firm and trading characteristics, can consistently outperform factor models. Utilizing a combination of linear and nonlinear models, we form a ML portfolio that consistently generates a significant alpha against factor models, ranging from 2.14 to 2.74% per month. We uncover patterns in characteristic dominance that alternates between arbitrage and financial constraint features. The variation correlates with the US credit cycle, and highlights a fundamental economic mechanism underlying the ML portfolio’s performance. The study’s impact extends to both academics and practitioners, providing insights into the economic drivers of stock returns and the practical implementation of ML methods in portfolio construction.</div></div>\",\"PeriodicalId\":48226,\"journal\":{\"name\":\"International Review of Financial Analysis\",\"volume\":\"96 \",\"pages\":\"Article 103599\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2024-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Review of Financial Analysis\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1057521924005313\",\"RegionNum\":1,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BUSINESS, FINANCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Review of Financial Analysis","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1057521924005313","RegionNum":1,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
引用次数: 0
摘要
随着金融领域对机器学习(ML)方法的依赖日益增加,我们需要了解这些方法的长期功效和内在机制。我们记录了不同股票特征在 18 年(1998-2016 年)样本外期间随时间变化的重要性,以确定在对大量公司和交易特征进行训练后,ML 模型是否能持续优于因子模型。利用线性和非线性模型的组合,我们形成了一个 ML 投资组合,该投资组合能够持续产生显著的阿尔法(alpha)收益,与因子模型相比,每月收益率从 2.14% 到 2.74% 不等。我们发现了在套利和财务约束特征之间交替出现的特征支配模式。这种变化与美国信贷周期相关,并凸显了 ML 投资组合业绩背后的基本经济机制。这项研究对学术界和从业人员都有影响,为股票回报的经济驱动因素以及在投资组合构建中实际应用 ML 方法提供了见解。
Exploring the factor zoo with a machine-learning portfolio
With the growing reliance on machine-learning (ML) methods in finance, an understanding of their long-term efficacy and underlying mechanism is needed. We document the time-varying importance of different stock characteristics over an 18-year (1998–2016) out-of-sample period to determine whether ML models, when trained on a large set of firm and trading characteristics, can consistently outperform factor models. Utilizing a combination of linear and nonlinear models, we form a ML portfolio that consistently generates a significant alpha against factor models, ranging from 2.14 to 2.74% per month. We uncover patterns in characteristic dominance that alternates between arbitrage and financial constraint features. The variation correlates with the US credit cycle, and highlights a fundamental economic mechanism underlying the ML portfolio’s performance. The study’s impact extends to both academics and practitioners, providing insights into the economic drivers of stock returns and the practical implementation of ML methods in portfolio construction.
期刊介绍:
The International Review of Financial Analysis (IRFA) is an impartial refereed journal designed to serve as a platform for high-quality financial research. It welcomes a diverse range of financial research topics and maintains an unbiased selection process. While not limited to U.S.-centric subjects, IRFA, as its title suggests, is open to valuable research contributions from around the world.