{"title":"优化 I 型和 II 型误差之间的权衡:回顾与扩展","authors":"Andrew P Grieve","doi":"arxiv-2409.12081","DOIUrl":null,"url":null,"abstract":"In clinical studies upon which decisions are based there are two types of\nerrors that can be made: a type I error arises when the decision is taken to\ndeclare a positive outcome when the truth is in fact negative, and a type II\nerror arises when the decision is taken to declare a negative outcome when the\ntruth is in fact positive. Commonly the primary analysis of such a study\nentails a two-sided hypothesis test with a type I error rate of 5% and the\nstudy is designed to have a sufficiently low type II error rate, for example\n10% or 20%. These values are arbitrary and often do not reflect the clinical,\nor regulatory, context of the study and ignore both the relative costs of\nmaking either type of error and the sponsor's prior belief that the drug is\nsuperior to either placebo, or a standard of care if relevant. This simplistic\napproach has recently been challenged by numerous authors both from a\nfrequentist and Bayesian perspective since when resources are constrained there\nwill be a need to consider a trade-off between type I and type II errors. In\nthis paper we review proposals to utilise the trade-off by formally\nacknowledging the costs to optimise the choice of error rates for simple, point\nnull and alternative hypotheses and extend the results to composite, or\ninterval hypotheses, showing links to the Probability of Success of a clinical\nstudy.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"123 14 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimising the Trade-Off Between Type I and Type II Errors: A Review and Extensions\",\"authors\":\"Andrew P Grieve\",\"doi\":\"arxiv-2409.12081\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In clinical studies upon which decisions are based there are two types of\\nerrors that can be made: a type I error arises when the decision is taken to\\ndeclare a positive outcome when the truth is in fact negative, and a type II\\nerror arises when the decision is taken to declare a negative outcome when the\\ntruth is in fact positive. Commonly the primary analysis of such a study\\nentails a two-sided hypothesis test with a type I error rate of 5% and the\\nstudy is designed to have a sufficiently low type II error rate, for example\\n10% or 20%. These values are arbitrary and often do not reflect the clinical,\\nor regulatory, context of the study and ignore both the relative costs of\\nmaking either type of error and the sponsor's prior belief that the drug is\\nsuperior to either placebo, or a standard of care if relevant. This simplistic\\napproach has recently been challenged by numerous authors both from a\\nfrequentist and Bayesian perspective since when resources are constrained there\\nwill be a need to consider a trade-off between type I and type II errors. In\\nthis paper we review proposals to utilise the trade-off by formally\\nacknowledging the costs to optimise the choice of error rates for simple, point\\nnull and alternative hypotheses and extend the results to composite, or\\ninterval hypotheses, showing links to the Probability of Success of a clinical\\nstudy.\",\"PeriodicalId\":501425,\"journal\":{\"name\":\"arXiv - STAT - Methodology\",\"volume\":\"123 14 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Methodology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.12081\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Methodology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.12081","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在作为决策依据的临床研究中,可能会出现两种类型的错误:当决定宣布阳性结果时,如果真相实际上是阴性,就会出现 I 型错误;当决定宣布阴性结果时,如果真相实际上是阳性,就会出现 II 型错误。通常情况下,此类研究的初步分析需要进行双侧假设检验,I 型错误率为 5%,研究的 II 型错误率要足够低,例如 10%或 20%。这些数值都是任意设定的,通常不能反映研究的临床或监管背景,而且忽略了出现任一类型错误的相对成本,以及申办者事先认为药物优于安慰剂或相关护理标准的信念。这种简单化的方法最近受到了许多学者的质疑,无论是从频繁论者还是贝叶斯论者的角度来看,因为当资源有限时,就需要考虑 I 类和 II 类错误之间的权衡。在本文中,我们回顾了利用这种权衡的建议,即通过正式确认成本来优化简单假说、点空假说和替代假说的错误率选择,并将结果扩展到复合假说或区间假说,显示与临床研究成功概率的联系。
Optimising the Trade-Off Between Type I and Type II Errors: A Review and Extensions
In clinical studies upon which decisions are based there are two types of
errors that can be made: a type I error arises when the decision is taken to
declare a positive outcome when the truth is in fact negative, and a type II
error arises when the decision is taken to declare a negative outcome when the
truth is in fact positive. Commonly the primary analysis of such a study
entails a two-sided hypothesis test with a type I error rate of 5% and the
study is designed to have a sufficiently low type II error rate, for example
10% or 20%. These values are arbitrary and often do not reflect the clinical,
or regulatory, context of the study and ignore both the relative costs of
making either type of error and the sponsor's prior belief that the drug is
superior to either placebo, or a standard of care if relevant. This simplistic
approach has recently been challenged by numerous authors both from a
frequentist and Bayesian perspective since when resources are constrained there
will be a need to consider a trade-off between type I and type II errors. In
this paper we review proposals to utilise the trade-off by formally
acknowledging the costs to optimise the choice of error rates for simple, point
null and alternative hypotheses and extend the results to composite, or
interval hypotheses, showing links to the Probability of Success of a clinical
study.