Textual Information and IPO Underpricing: A Machine Learning Approach

The Journal of Financial Data Science Pub Date : 2023-03-14 DOI:10.3905/jfds.2023.1.121

Apostolos G. Katsafados, Ion Androutsopoulos, Ilias Chalkidis, Manos Fergadiotis, George N. Leledakis, Emmanouil G. Pyrgiotakis

{"title":"Textual Information and IPO Underpricing: A Machine Learning Approach","authors":"Apostolos G. Katsafados, Ion Androutsopoulos, Ilias Chalkidis, Manos Fergadiotis, George N. Leledakis, Emmanouil G. Pyrgiotakis","doi":"10.3905/jfds.2023.1.121","DOIUrl":null,"url":null,"abstract":"This study examines the predictive power of textual information from S-1 filings in explaining initial public offering (IPO) underpricing. The authors’ approach differs from previous research because they utilize several machine learning algorithms to predict whether an IPO will be underpriced or not, as well as the magnitude of the underpricing. Using a sample of 2,481 US IPOs, they find that textual information can effectively complement financial variables in terms of prediction accuracy because models that use both sources of data produce more accurate estimates. In particular, the model with the best performance using only financial variables achieves 67.5% accuracy whereas the best model with both textual and financial data appears a substantial improvement (6.1%). Also, the use of sophisticated machine learning models drives an increase in the predictive accuracy compared to the traditional logistic regression model (2.5%). The authors attribute the findings to the fact that textual information can reduce the ex ante valuation uncertainty of IPO firms. Finally, they create a portfolio of IPOs based on the out-of-sample machine learning predictions, which remarkably achieves 27.90% average returns. Their portfolio achieves extraordinary abnormal returns in various time dimensions (both in the short and long run), achieving up to 30% better yield than the benchmark.","PeriodicalId":199045,"journal":{"name":"The Journal of Financial Data Science","volume":"250 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Financial Data Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3905/jfds.2023.1.121","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

This study examines the predictive power of textual information from S-1 filings in explaining initial public offering (IPO) underpricing. The authors’ approach differs from previous research because they utilize several machine learning algorithms to predict whether an IPO will be underpriced or not, as well as the magnitude of the underpricing. Using a sample of 2,481 US IPOs, they find that textual information can effectively complement financial variables in terms of prediction accuracy because models that use both sources of data produce more accurate estimates. In particular, the model with the best performance using only financial variables achieves 67.5% accuracy whereas the best model with both textual and financial data appears a substantial improvement (6.1%). Also, the use of sophisticated machine learning models drives an increase in the predictive accuracy compared to the traditional logistic regression model (2.5%). The authors attribute the findings to the fact that textual information can reduce the ex ante valuation uncertainty of IPO firms. Finally, they create a portfolio of IPOs based on the out-of-sample machine learning predictions, which remarkably achieves 27.90% average returns. Their portfolio achieves extraordinary abnormal returns in various time dimensions (both in the short and long run), achieving up to 30% better yield than the benchmark.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

文本信息与IPO定价:一种机器学习方法

本研究考察了S-1文件文本信息在解释首次公开发行(IPO)定价过低中的预测能力。作者的方法与之前的研究不同，因为他们使用了几种机器学习算法来预测IPO是否会被低估，以及低估的程度。他们以2481宗美国ipo为样本，发现在预测准确性方面，文本信息可以有效地补充财务变量，因为同时使用这两种数据来源的模型会产生更准确的估计。特别是，仅使用金融变量的最佳模型达到了67.5%的准确率，而同时使用文本和财务数据的最佳模型则有了实质性的改进(6.1%)。此外，与传统的逻辑回归模型(2.5%)相比，使用复杂的机器学习模型可以提高预测精度。作者将这一发现归因于文本信息可以降低IPO公司事前估值的不确定性。最后，他们根据样本外机器学习预测创建了一个ipo投资组合，平均回报率达到了27.90%。他们的投资组合在不同的时间维度(包括短期和长期)都实现了非凡的异常回报，比基准收益率高出30%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

The Journal of Financial Data Science

自引率

0.00%

发文量

期刊最新文献

Managing Editor’s Letter Explainable Machine Learning Models of Consumer Credit Risk Predicting Returns with Machine Learning across Horizons, Firm Size, and Time Deep Calibration with Artificial Neural Network: A Performance Comparison on Option-Pricing Models RIFT: Pretraining and Applications for Representations of Interrelated Financial Time Series