Textual Information and IPO Underpricing: A Machine Learning Approach

Apostolos G. Katsafados, Ion Androutsopoulos, Ilias Chalkidis, Manos Fergadiotis, George N. Leledakis, Emmanouil G. Pyrgiotakis
{"title":"Textual Information and IPO Underpricing: A Machine Learning Approach","authors":"Apostolos G. Katsafados, Ion Androutsopoulos, Ilias Chalkidis, Manos Fergadiotis, George N. Leledakis, Emmanouil G. Pyrgiotakis","doi":"10.3905/jfds.2023.1.121","DOIUrl":null,"url":null,"abstract":"This study examines the predictive power of textual information from S-1 filings in explaining initial public offering (IPO) underpricing. The authors’ approach differs from previous research because they utilize several machine learning algorithms to predict whether an IPO will be underpriced or not, as well as the magnitude of the underpricing. Using a sample of 2,481 US IPOs, they find that textual information can effectively complement financial variables in terms of prediction accuracy because models that use both sources of data produce more accurate estimates. In particular, the model with the best performance using only financial variables achieves 67.5% accuracy whereas the best model with both textual and financial data appears a substantial improvement (6.1%). Also, the use of sophisticated machine learning models drives an increase in the predictive accuracy compared to the traditional logistic regression model (2.5%). The authors attribute the findings to the fact that textual information can reduce the ex ante valuation uncertainty of IPO firms. Finally, they create a portfolio of IPOs based on the out-of-sample machine learning predictions, which remarkably achieves 27.90% average returns. Their portfolio achieves extraordinary abnormal returns in various time dimensions (both in the short and long run), achieving up to 30% better yield than the benchmark.","PeriodicalId":199045,"journal":{"name":"The Journal of Financial Data Science","volume":"250 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Financial Data Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3905/jfds.2023.1.121","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

This study examines the predictive power of textual information from S-1 filings in explaining initial public offering (IPO) underpricing. The authors’ approach differs from previous research because they utilize several machine learning algorithms to predict whether an IPO will be underpriced or not, as well as the magnitude of the underpricing. Using a sample of 2,481 US IPOs, they find that textual information can effectively complement financial variables in terms of prediction accuracy because models that use both sources of data produce more accurate estimates. In particular, the model with the best performance using only financial variables achieves 67.5% accuracy whereas the best model with both textual and financial data appears a substantial improvement (6.1%). Also, the use of sophisticated machine learning models drives an increase in the predictive accuracy compared to the traditional logistic regression model (2.5%). The authors attribute the findings to the fact that textual information can reduce the ex ante valuation uncertainty of IPO firms. Finally, they create a portfolio of IPOs based on the out-of-sample machine learning predictions, which remarkably achieves 27.90% average returns. Their portfolio achieves extraordinary abnormal returns in various time dimensions (both in the short and long run), achieving up to 30% better yield than the benchmark.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
文本信息与IPO定价:一种机器学习方法
本研究考察了S-1文件文本信息在解释首次公开发行(IPO)定价过低中的预测能力。作者的方法与之前的研究不同,因为他们使用了几种机器学习算法来预测IPO是否会被低估,以及低估的程度。他们以2481宗美国ipo为样本,发现在预测准确性方面,文本信息可以有效地补充财务变量,因为同时使用这两种数据来源的模型会产生更准确的估计。特别是,仅使用金融变量的最佳模型达到了67.5%的准确率,而同时使用文本和财务数据的最佳模型则有了实质性的改进(6.1%)。此外,与传统的逻辑回归模型(2.5%)相比,使用复杂的机器学习模型可以提高预测精度。作者将这一发现归因于文本信息可以降低IPO公司事前估值的不确定性。最后,他们根据样本外机器学习预测创建了一个ipo投资组合,平均回报率达到了27.90%。他们的投资组合在不同的时间维度(包括短期和长期)都实现了非凡的异常回报,比基准收益率高出30%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Managing Editor’s Letter Explainable Machine Learning Models of Consumer Credit Risk Predicting Returns with Machine Learning across Horizons, Firm Size, and Time Deep Calibration with Artificial Neural Network: A Performance Comparison on Option-Pricing Models RIFT: Pretraining and Applications for Representations of Interrelated Financial Time Series
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1