Utilizing Statistical Tests for Comparing Machine Learning Algorithms

H. Hamarashid
{"title":"Utilizing Statistical Tests for Comparing Machine Learning Algorithms","authors":"H. Hamarashid","doi":"10.24017/SCIENCE.2021.1.8","DOIUrl":null,"url":null,"abstract":"The mean result of machine learning models is determined by utilizing k-fold cross-validation. The algorithm with the best average performance should surpass those with the poorest. But what if the difference in average outcomes is the consequence of a statistical anomaly? To conduct whether or not the mean result differences between two algorithms is genuine then statistical hypothesis test is utilized. Using statistical hypothesis testing, this study will demonstrate how to compare machine learning algorithms. The output of several machine learning algorithms or simulation pipelines is compared during model selection. The model that performs the best based on your performance measure becomes the last model, which can be utilized to make predictions on new data. With classification and regression prediction models it can be conducted by utilizing traditional machine learning and deep learning methods. The difficulty is to identify whether or not the difference between two models is accurate.","PeriodicalId":17866,"journal":{"name":"Kurdistan Journal of Applied Research","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Kurdistan Journal of Applied Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24017/SCIENCE.2021.1.8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

The mean result of machine learning models is determined by utilizing k-fold cross-validation. The algorithm with the best average performance should surpass those with the poorest. But what if the difference in average outcomes is the consequence of a statistical anomaly? To conduct whether or not the mean result differences between two algorithms is genuine then statistical hypothesis test is utilized. Using statistical hypothesis testing, this study will demonstrate how to compare machine learning algorithms. The output of several machine learning algorithms or simulation pipelines is compared during model selection. The model that performs the best based on your performance measure becomes the last model, which can be utilized to make predictions on new data. With classification and regression prediction models it can be conducted by utilizing traditional machine learning and deep learning methods. The difficulty is to identify whether or not the difference between two models is accurate.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用统计测试比较机器学习算法
机器学习模型的平均结果是利用k-fold交叉验证确定的。平均性能最好的算法应该超过平均性能最差的算法。但是,如果平均结果的差异是统计异常的结果呢?为了判断两种算法之间的平均结果差异是否真实,使用统计假设检验。使用统计假设检验,本研究将演示如何比较机器学习算法。在模型选择过程中比较几种机器学习算法或仿真管道的输出。根据您的性能度量,表现最好的模型将成为最后一个模型,该模型可用于对新数据进行预测。有了分类和回归预测模型,可以利用传统的机器学习和深度学习方法进行预测。困难在于确定两个模型之间的差异是否准确。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
16
审稿时长
12 weeks
期刊最新文献
A Wavelet Shrinkage Mixed with a Single-level 2D Discrete Wavelet Transform for Image Denoising Assessing the Impact of Modified Initial Abstraction Ratios and Slope Adjusted Curve Number on Runoff Prediction in the Watersheds of Sulaimani Province. Assessment of the Antifungal Activity of PMMA-MgO and PMMA-Ag Nanocomposite Multi-Label Feature Selection with Graph-based Ant Colony Optimization and Generalized Jaccard Similarity Evaluate the Implementation of WHO Infection Prevention and Control Core Components Among Health Care Facilities
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1