Optimising the Trade-Off Between Type I and Type II Errors: A Review and Extensions

arXiv - STAT - Methodology Pub Date : 2024-09-18 DOI:arxiv-2409.12081

Andrew P Grieve

{"title":"Optimising the Trade-Off Between Type I and Type II Errors: A Review and Extensions","authors":"Andrew P Grieve","doi":"arxiv-2409.12081","DOIUrl":null,"url":null,"abstract":"In clinical studies upon which decisions are based there are two types of\nerrors that can be made: a type I error arises when the decision is taken to\ndeclare a positive outcome when the truth is in fact negative, and a type II\nerror arises when the decision is taken to declare a negative outcome when the\ntruth is in fact positive. Commonly the primary analysis of such a study\nentails a two-sided hypothesis test with a type I error rate of 5% and the\nstudy is designed to have a sufficiently low type II error rate, for example\n10% or 20%. These values are arbitrary and often do not reflect the clinical,\nor regulatory, context of the study and ignore both the relative costs of\nmaking either type of error and the sponsor's prior belief that the drug is\nsuperior to either placebo, or a standard of care if relevant. This simplistic\napproach has recently been challenged by numerous authors both from a\nfrequentist and Bayesian perspective since when resources are constrained there\nwill be a need to consider a trade-off between type I and type II errors. In\nthis paper we review proposals to utilise the trade-off by formally\nacknowledging the costs to optimise the choice of error rates for simple, point\nnull and alternative hypotheses and extend the results to composite, or\ninterval hypotheses, showing links to the Probability of Success of a clinical\nstudy.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"123 14 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Methodology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.12081","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In clinical studies upon which decisions are based there are two types of errors that can be made: a type I error arises when the decision is taken to declare a positive outcome when the truth is in fact negative, and a type II error arises when the decision is taken to declare a negative outcome when the truth is in fact positive. Commonly the primary analysis of such a study entails a two-sided hypothesis test with a type I error rate of 5% and the study is designed to have a sufficiently low type II error rate, for example 10% or 20%. These values are arbitrary and often do not reflect the clinical, or regulatory, context of the study and ignore both the relative costs of making either type of error and the sponsor's prior belief that the drug is superior to either placebo, or a standard of care if relevant. This simplistic approach has recently been challenged by numerous authors both from a frequentist and Bayesian perspective since when resources are constrained there will be a need to consider a trade-off between type I and type II errors. In this paper we review proposals to utilise the trade-off by formally acknowledging the costs to optimise the choice of error rates for simple, point null and alternative hypotheses and extend the results to composite, or interval hypotheses, showing links to the Probability of Success of a clinical study.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

优化 I 型和 II 型误差之间的权衡：回顾与扩展

在作为决策依据的临床研究中，可能会出现两种类型的错误：当决定宣布阳性结果时，如果真相实际上是阴性，就会出现 I 型错误；当决定宣布阴性结果时，如果真相实际上是阳性，就会出现 II 型错误。通常情况下，此类研究的初步分析需要进行双侧假设检验，I 型错误率为 5%，研究的 II 型错误率要足够低，例如 10%或 20%。这些数值都是任意设定的，通常不能反映研究的临床或监管背景，而且忽略了出现任一类型错误的相对成本，以及申办者事先认为药物优于安慰剂或相关护理标准的信念。这种简单化的方法最近受到了许多学者的质疑，无论是从频繁论者还是贝叶斯论者的角度来看，因为当资源有限时，就需要考虑 I 类和 II 类错误之间的权衡。在本文中，我们回顾了利用这种权衡的建议，即通过正式确认成本来优化简单假说、点空假说和替代假说的错误率选择，并将结果扩展到复合假说或区间假说，显示与临床研究成功概率的联系。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

arXiv - STAT - Methodology

自引率

0.00%

发文量