Initial Coin Offering Prediction Comparison Using Ridge Regression, Artificial Neural Network, Random Forest Regression, and Hybrid ANN-Ridge

Mendel Pub Date : 2023-12-20 DOI:10.13164/mendel.2023.2.283
Toai Kim Tran, R. Šenkeřík, Hahn Thi Xuan Vo, Huan Minh Vo, Adam Ulrich, Marek Musil, I. Zelinka
{"title":"Initial Coin Offering Prediction Comparison Using Ridge Regression, Artificial Neural Network, Random Forest Regression, and Hybrid ANN-Ridge","authors":"Toai Kim Tran, R. Šenkeřík, Hahn Thi Xuan Vo, Huan Minh Vo, Adam Ulrich, Marek Musil, I. Zelinka","doi":"10.13164/mendel.2023.2.283","DOIUrl":null,"url":null,"abstract":"\n \n \nCan machine learning take a prediction to win an investment in ICO (Initial Coin Offering)? In this research work, our objective is to answer this question. Four popular and lower computational demanding approaches including Ridge regression (RR), Artificial neural network (ANN), Random forest regression (RFR), and a hybrid ANN-Ridge regression are compared in terms of accuracy metrics to predict ICO value after six months. We use a dataset collected from 109 ICOs that were obtained from the cryptocurrency websites after data preprocessing. The dataset consists of 12 fields covering the main factors that affect the value of an ICO. One-hot encoding technique is applied to convert the alphanumeric form into a binary format to perform better predictions; thus, the dataset has been expanded to 128 columns and 109 rows. Input data (variables) and ICO value are non-linear dependent. The Artificial neural network algorithm offers a bio-inspired mathematical model to solve the complex non-linear relationship between input variables and ICO value. The linear regression model has problems with overfitting and multicollinearity that make the ICO prediction inaccurate. On the contrary, the Ridge regression algorithm overcomes the correlation problem that independent variables are highly correlated to the output value when dealing with ICO data. Random forest regression does avoid overfitting by growing a large decision tree to minimize the prediction error. Hybrid ANN-Ridge regression leverages the strengths of both algorithms to improve prediction accuracy. By combining ANN’s ability to capture complex non-linear relationships with the regularization capabilities of Ridge regression, the hybrid can potentially provide better predictive performance compared to using either algorithm individually. After the training process with the cross-validation technique and the parameter fitting process, we obtained several models but selected three of the best in each algorithm based on metrics of RMSE (Root Mean Square Error), R2 (R-squared), and MAE (Mean Absolute Error). The validation results show that the presented Ridge regression approach has an accuracy of at most 99% of the actual value. The Artificial neural network predicts the ICO value with an accuracy of up to 98% of the actual value after six months. Additionally, the Random forest regression and the hybrid ANN-Ridge regression improve the predictive accuracy to 98% actual value. \n \n \n","PeriodicalId":38293,"journal":{"name":"Mendel","volume":"54 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mendel","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.13164/mendel.2023.2.283","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Can machine learning take a prediction to win an investment in ICO (Initial Coin Offering)? In this research work, our objective is to answer this question. Four popular and lower computational demanding approaches including Ridge regression (RR), Artificial neural network (ANN), Random forest regression (RFR), and a hybrid ANN-Ridge regression are compared in terms of accuracy metrics to predict ICO value after six months. We use a dataset collected from 109 ICOs that were obtained from the cryptocurrency websites after data preprocessing. The dataset consists of 12 fields covering the main factors that affect the value of an ICO. One-hot encoding technique is applied to convert the alphanumeric form into a binary format to perform better predictions; thus, the dataset has been expanded to 128 columns and 109 rows. Input data (variables) and ICO value are non-linear dependent. The Artificial neural network algorithm offers a bio-inspired mathematical model to solve the complex non-linear relationship between input variables and ICO value. The linear regression model has problems with overfitting and multicollinearity that make the ICO prediction inaccurate. On the contrary, the Ridge regression algorithm overcomes the correlation problem that independent variables are highly correlated to the output value when dealing with ICO data. Random forest regression does avoid overfitting by growing a large decision tree to minimize the prediction error. Hybrid ANN-Ridge regression leverages the strengths of both algorithms to improve prediction accuracy. By combining ANN’s ability to capture complex non-linear relationships with the regularization capabilities of Ridge regression, the hybrid can potentially provide better predictive performance compared to using either algorithm individually. After the training process with the cross-validation technique and the parameter fitting process, we obtained several models but selected three of the best in each algorithm based on metrics of RMSE (Root Mean Square Error), R2 (R-squared), and MAE (Mean Absolute Error). The validation results show that the presented Ridge regression approach has an accuracy of at most 99% of the actual value. The Artificial neural network predicts the ICO value with an accuracy of up to 98% of the actual value after six months. Additionally, the Random forest regression and the hybrid ANN-Ridge regression improve the predictive accuracy to 98% actual value.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用岭回归、人工神经网络、随机森林回归和混合人工神经网络-岭进行首次代币发行预测比较
机器学习能否通过预测赢得 ICO(首次代币发行)投资?在这项研究工作中,我们的目标就是回答这个问题。我们比较了四种流行且计算要求较低的方法(包括岭回归(RR)、人工神经网络(ANN)、随机森林回归(RFR)和混合 ANN-Ridge 回归)在预测六个月后 ICO 价值方面的准确性指标。我们使用从 109 个 ICO 收集的数据集,这些数据集是在数据预处理后从加密货币网站上获得的。数据集由 12 个字段组成,涵盖了影响 ICO 价值的主要因素。为了更好地进行预测,采用了单热编码技术将字母数字形式转换为二进制格式;因此,数据集扩展为 128 列和 109 行。输入数据(变量)和 ICO 值是非线性依赖关系。人工神经网络算法提供了一种生物启发数学模型,以解决输入变量和 ICO 值之间复杂的非线性关系。线性回归模型存在过度拟合和多重共线性问题,导致 ICO 预测不准确。相反,在处理 ICO 数据时,岭回归算法克服了自变量与输出值高度相关的相关性问题。随机森林回归通过生长一棵大决策树来最小化预测误差,从而避免了过度拟合。混合 ANN-Ridge 回归利用了这两种算法的优势来提高预测准确性。通过将 ANN 捕捉复杂非线性关系的能力与 Ridge 回归的正则化能力相结合,混合算法有可能提供比单独使用其中一种算法更好的预测性能。在使用交叉验证技术和参数拟合过程进行训练后,我们得到了多个模型,但根据 RMSE(均方根误差)、R2(R 平方)和 MAE(平均绝对误差)等指标,在每种算法中选出了三个最佳模型。验证结果表明,所提出的岭回归方法的准确率最高可达实际值的 99%。人工神经网络预测六个月后 ICO 值的准确率高达实际值的 98%。此外,随机森林回归和混合人工神经网络-岭回归将预测准确率提高到实际值的 98%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Mendel
Mendel Decision Sciences-Decision Sciences (miscellaneous)
CiteScore
2.20
自引率
0.00%
发文量
7
期刊最新文献
Detecting Outliers Using Modified Recursive PCA Algorithm For Dynamic Streaming Data Stock and Structured Warrant Portfolio Optimization Using Black-Litterman Model and Binomial Method Optimized Fixed-Time Synergetic Controller via a modified Salp Swarm Algorithm for Acute and Chronic HBV Transmission System Initial Coin Offering Prediction Comparison Using Ridge Regression, Artificial Neural Network, Random Forest Regression, and Hybrid ANN-Ridge Predicting Football Match Outcomes with Machine Learning Approaches
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1