关于机器学习、ROC分析和显著性统计检验

Object recognition supported by user interaction for service robots Pub Date : 2002-12-10 DOI:10.1109/ICPR.2002.1048273

M. Maloof

{"title":"关于机器学习、ROC分析和显著性统计检验","authors":"M. Maloof","doi":"10.1109/ICPR.2002.1048273","DOIUrl":null,"url":null,"abstract":"Receiver operating characteristic (ROC) analysis is being used with greater frequency as an evaluation methodology in machine learning and pattern recognition. Researchers have used ANOVA to determine if the results from such analysis are statistically significant. Yet, in the medical decision making community, the prevailing method is LABMRMC. Although this latter method uses ANOVA, before doing so, it applies the Jackknife method to account for case-sample variance. To determine whether these two tests make the same decisions regarding statistical significance, we conducted a Monte Carlo simulation using several problems derived from Gaussian distributions, three machine-learning algorithms, ROC analysis, ANOVA, and LABMRMC. Results suggest that the decisions these tests make are not the same, even for simple problems. Furthermore, the larger issue is that since ANOVA does not account for case-sample variance, one cannot generalize experimental results to the population from which the data were drawn.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":"{\"title\":\"On machine learning, ROC analysis, and statistical tests of significance\",\"authors\":\"M. Maloof\",\"doi\":\"10.1109/ICPR.2002.1048273\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Receiver operating characteristic (ROC) analysis is being used with greater frequency as an evaluation methodology in machine learning and pattern recognition. Researchers have used ANOVA to determine if the results from such analysis are statistically significant. Yet, in the medical decision making community, the prevailing method is LABMRMC. Although this latter method uses ANOVA, before doing so, it applies the Jackknife method to account for case-sample variance. To determine whether these two tests make the same decisions regarding statistical significance, we conducted a Monte Carlo simulation using several problems derived from Gaussian distributions, three machine-learning algorithms, ROC analysis, ANOVA, and LABMRMC. Results suggest that the decisions these tests make are not the same, even for simple problems. Furthermore, the larger issue is that since ANOVA does not account for case-sample variance, one cannot generalize experimental results to the population from which the data were drawn.\",\"PeriodicalId\":159502,\"journal\":{\"name\":\"Object recognition supported by user interaction for service robots\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-12-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"23\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Object recognition supported by user interaction for service robots\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICPR.2002.1048273\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Object recognition supported by user interaction for service robots","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPR.2002.1048273","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 23

摘要

接受者工作特征(ROC)分析在机器学习和模式识别中被越来越多地用作评估方法。研究人员使用方差分析来确定这种分析的结果是否具有统计学意义。然而，在医疗决策界，流行的方法是LABMRMC。虽然后一种方法使用方差分析，但在这样做之前，它应用Jackknife方法来解释病例-样本方差。为了确定这两个测试是否在统计显著性方面做出相同的决定，我们使用高斯分布、三种机器学习算法、ROC分析、ANOVA和LABMRMC衍生的几个问题进行了蒙特卡罗模拟。结果表明，这些测试做出的决定是不一样的，即使是简单的问题。此外，更大的问题是，由于方差分析不考虑病例-样本方差，因此不能将实验结果推广到抽取数据的总体。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

On machine learning, ROC analysis, and statistical tests of significance

Receiver operating characteristic (ROC) analysis is being used with greater frequency as an evaluation methodology in machine learning and pattern recognition. Researchers have used ANOVA to determine if the results from such analysis are statistically significant. Yet, in the medical decision making community, the prevailing method is LABMRMC. Although this latter method uses ANOVA, before doing so, it applies the Jackknife method to account for case-sample variance. To determine whether these two tests make the same decisions regarding statistical significance, we conducted a Monte Carlo simulation using several problems derived from Gaussian distributions, three machine-learning algorithms, ROC analysis, ANOVA, and LABMRMC. Results suggest that the decisions these tests make are not the same, even for simple problems. Furthermore, the larger issue is that since ANOVA does not account for case-sample variance, one cannot generalize experimental results to the population from which the data were drawn.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Object recognition supported by user interaction for service robots

自引率

0.00%

发文量

期刊最新文献

Pattern recognition for humanitarian de-mining Data clustering using evidence accumulation Facial expression recognition using pseudo 3-D hidden Markov models Speeding up SVM decision based on mirror points Real-time tracking and estimation of plane pose