Predicting the Entrepreneurial Success of Crowdfunding Campaigns Using Model-Based Machine Learning Methods

Q2 Decision Sciences International Journal of Crowd Science Pub Date : 2022-04-15 DOI:10.26599/IJCS.2022.9100003
Michael Safo Oduro;Han Yu;Hong Huang
{"title":"Predicting the Entrepreneurial Success of Crowdfunding Campaigns Using Model-Based Machine Learning Methods","authors":"Michael Safo Oduro;Han Yu;Hong Huang","doi":"10.26599/IJCS.2022.9100003","DOIUrl":null,"url":null,"abstract":"A common phenomenon that increasingly stimulates the interest of investors, companies, and entrepreneurs involved in crowd funding activities particularly on the Kickstarter website is identifying metrics that make such campaigns markedly successful. This study seeks to gauge the importance of key predictive variables or features based on statistical analysis, identify model-based machine learning methods based on performance assessment that predict success of a campaigns, and compare the selected different machine learning algorithms. To achieve our research objectives and maximize insight into the dataset used, feature engineering was performed. Then, machine learning models, inclusive of Logistic Regression (LR), Support Vector Machines (SVMs) in the form of Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis (QDA), and random forest analysis (bagging and boosting), were performed and compared via cross validation approaches in terms of their resulting test error rates, F1 score, Accuracy, Precision, and Recall rates. Of the machine learning models employed for predictive analysis, the test error rates and the other classification metric scores obtained across the three cross-validation approaches identified bagging and gradient boosting (the SVMs) as more robust methods for predicting success of Kickstarter projects. The major research objectives in this paper have been achieved by accessing the performance of key statistical learning methods that guides the choice of learning methods or models and giving us a measure of the quality of the ultimately chosen model. However, Bayesian semi-parametric approaches are of future research consideration. These methods facilitate the usage of an infinite number of parameters to capture information regarding the underlying distributions of even more complex data.","PeriodicalId":32381,"journal":{"name":"International Journal of Crowd Science","volume":"6 1","pages":"7-16"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9736195/9745472/09758663.pdf","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Crowd Science","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/9758663/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Decision Sciences","Score":null,"Total":0}
引用次数: 8

Abstract

A common phenomenon that increasingly stimulates the interest of investors, companies, and entrepreneurs involved in crowd funding activities particularly on the Kickstarter website is identifying metrics that make such campaigns markedly successful. This study seeks to gauge the importance of key predictive variables or features based on statistical analysis, identify model-based machine learning methods based on performance assessment that predict success of a campaigns, and compare the selected different machine learning algorithms. To achieve our research objectives and maximize insight into the dataset used, feature engineering was performed. Then, machine learning models, inclusive of Logistic Regression (LR), Support Vector Machines (SVMs) in the form of Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis (QDA), and random forest analysis (bagging and boosting), were performed and compared via cross validation approaches in terms of their resulting test error rates, F1 score, Accuracy, Precision, and Recall rates. Of the machine learning models employed for predictive analysis, the test error rates and the other classification metric scores obtained across the three cross-validation approaches identified bagging and gradient boosting (the SVMs) as more robust methods for predicting success of Kickstarter projects. The major research objectives in this paper have been achieved by accessing the performance of key statistical learning methods that guides the choice of learning methods or models and giving us a measure of the quality of the ultimately chosen model. However, Bayesian semi-parametric approaches are of future research consideration. These methods facilitate the usage of an infinite number of parameters to capture information regarding the underlying distributions of even more complex data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于模型的机器学习方法预测众筹活动的创业成功
一个越来越激发参与众筹活动的投资者、公司和企业家兴趣的常见现象,尤其是在Kickstarter网站上,就是确定使此类活动显著成功的指标。本研究试图基于统计分析来衡量关键预测变量或特征的重要性,基于预测活动成功的绩效评估来确定基于模型的机器学习方法,并比较所选的不同机器学习算法。为了实现我们的研究目标并最大限度地深入了解所使用的数据集,进行了特征工程。然后,执行机器学习模型,包括逻辑回归(LR)、线性判别分析(LDA)形式的支持向量机(SVM)、二次判别分析(QDA)和随机森林分析(装袋和提升),并通过交叉验证方法对其测试错误率、F1分、准确度、精度和召回率进行比较。在用于预测分析的机器学习模型中,通过三种交叉验证方法获得的测试错误率和其他分类指标得分将装袋和梯度增强(SVM)确定为预测Kickstarter项目成功的更稳健的方法。本文的主要研究目标是通过评估关键统计学习方法的性能来实现的,这些方法指导了学习方法或模型的选择,并为我们提供了最终选择的模型质量的衡量标准。然而,贝叶斯半参数方法是未来研究的考虑因素。这些方法便于使用无限数量的参数来捕获关于更复杂数据的潜在分布的信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International Journal of Crowd Science
International Journal of Crowd Science Decision Sciences-Decision Sciences (miscellaneous)
CiteScore
2.70
自引率
0.00%
发文量
20
审稿时长
24 weeks
期刊最新文献
Contents Front Cover Improving Energy Harvesting System from Ambient RF Sources in Social Systems with Overcrowding Editorial of Cyber-Physical Social Systems and Smart Environments CGLS Method for Efficient Equalization of OFDM Systems Under Doubly Dispersive Fading Channels with an Application Into 6G Communications in Smart Overcrowded
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1