Can expected error costs justify testing a hypothesis at multiple alpha levels rather than searching for an elusive optimal alpha?

Janet Aisbett
arXiv - STAT - Applications, 2024-07-31. DOI: arXiv:2407.21303.
Simultaneous testing of one hypothesis at multiple alpha levels can be
performed within a conventional Neyman-Pearson framework. This is achieved by
treating the hypothesis as a family of hypotheses, each member of which
explicitly concerns test level as well as effect size. Such testing encourages
researchers to think about error rates and strength of evidence in both the
statistical design and reporting stages of a study. Here, we show that these
multi-alpha level tests can deliver acceptable expected total error costs. We
first present formulas for expected error costs from tests at single and
multiple alpha levels, given prior probabilities of effect sizes with either
dichotomous or continuous distributions. Error costs are tied to decisions,
with different decisions assumed for each of the potential outcomes in the
multi-alpha level case. Expected total costs for tests at single and multiple
alpha levels are then compared with optimal costs. This comparison highlights
how sensitive optimization is to estimated error costs and to assumptions about
prevalence. Testing at multiple default thresholds removes the need to formally
identify decisions, or to model costs and prevalence as required in
optimization approaches. Although total expected error costs with this approach
will not be optimal, our results suggest they may, on average, be lower than
when so-called optimal test levels are derived from misspecified models.
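The core cost calculation behind the comparison can be illustrated for the simplest case the abstract mentions: a dichotomous prior on effect size. The sketch below is not the paper's model, just a standard Neyman-Pearson expected-cost calculation for a one-sided z-test with known unit variance; the function name, default costs, and the grid search for a cost-minimizing alpha are all illustrative assumptions. Re-running the grid search with a different prior or cost ratio shows the sensitivity of the "optimal" alpha that the abstract highlights.

```python
from statistics import NormalDist  # stdlib normal distribution (Python 3.8+)

def expected_error_cost(alpha, effect, n, prior, cost_I=1.0, cost_II=1.0):
    """Expected total error cost of a one-sided z-test of H0: mu <= 0.

    Dichotomous prior: the true standardized effect is 0 with probability
    1 - prior, or `effect` with probability `prior`. Costs and defaults
    are illustrative, not taken from the paper.
    """
    z = NormalDist()
    z_crit = z.inv_cdf(1 - alpha)                 # rejection threshold
    beta = z.cdf(z_crit - effect * n ** 0.5)      # Type II error rate
    return (1 - prior) * alpha * cost_I + prior * beta * cost_II

# Tabulate costs at conventional default levels (the multi-alpha view
# attaches a different decision to each threshold crossed).
for a in (0.05, 0.01, 0.005):
    print(a, round(expected_error_cost(a, effect=0.3, n=50, prior=0.5), 4))

# Grid search for the cost-minimizing alpha under one assumed model.
alphas = [i / 1000 for i in range(1, 500)]
best = min(alphas, key=lambda a: expected_error_cost(a, 0.3, 50, 0.5))
print("cost-minimizing alpha under these assumptions:", best)
```

Changing `prior` from 0.5 to, say, 0.1 shifts `best` substantially, which is the misspecification risk the abstract contrasts with simply reporting outcomes at several default thresholds.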