VAMP1RE: a single criterion for rating and ranking confidence-interval procedures

IIE Transactions. Pub Date: 2015-05-15. DOI: 10.1080/0740817X.2015.1047068

Yingchieh Yeh, B. Schmeiser

Volume 47, pages 1203-1216. Journal article.

Citations: 3

Abstract

We propose VAMP1RE, a single criterion for rating and ranking confidence-interval procedures (CIPs) that use a fixed sample size. The quality of a CIP is traditionally thought to be multidimensional, typically composed of the probability of covering the unknown performance measure and the mean (and sometimes the standard deviation) of interval width, each evaluated over some set of nominal coverage probabilities. These many criteria reflect symptoms, rather than causes, of CIP quality. The VAMP1RE criterion focuses on two causes: departure from validity (violation of assumptions) and inability to mimic (the dissimilarity, for every data set, of a CIP’s interval to that of an ideal CIP). The ideal CIP is both valid (that is, it adheres to all assumptions) and an agreed-upon standard; the ideal CIP may be allowed knowledge not available to the real-world CIPs of interest. A high inability to mimic the ideal CIP implies that a CIP uses data inefficiently. For a given CIP, the VAMP1RE criterion is the expected squared difference between Schruben’s coverage values (analogous to p values) arising from the given CIP and from the ideal CIP. The implication is that an interval arising from a particular data set is good not because it is large or small but rather to the extent that it is similar to the interval provided by the ideal CIP. We discuss the relationship to Schruben’s coverage function, provide a graphical interpretation, decompose the VAMP1RE criterion into the two cause components, and provide examples to illustrate that the VAMP1RE criterion provides numerical values that are useful for rating and ranking CIPs.
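
The criterion described in the abstract, an expected squared difference between two coverage values, can be approximated by Monte Carlo. The following is a minimal sketch under assumptions that are not taken from the paper: the data are i.i.d. normal, the ideal CIP is the known-variance z interval for the mean, and the CIP being rated is a plug-in normal interval that substitutes the sample standard deviation. The coverage value is taken here to be the smallest nominal coverage at which the interval contains the true mean, which is our reading of Schruben's definition. The names `coverage_value` and `estimate_criterion` are hypothetical.

```python
import random
from statistics import NormalDist, mean, stdev

STD_NORMAL = NormalDist()  # standard normal distribution


def coverage_value(center, scale, theta):
    """Smallest nominal coverage at which center +/- z * scale contains theta.

    Assumes a symmetric interval built from a standard normal quantile z.
    """
    z = abs(center - theta) / scale
    return 2.0 * STD_NORMAL.cdf(z) - 1.0


def estimate_criterion(n=10, mu=0.0, sigma=1.0, reps=20000, seed=1):
    """Monte Carlo estimate of E[(c_given - c_ideal)^2] for this toy setting."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(reps):
        x = [rng.gauss(mu, sigma) for _ in range(n)]
        xbar = mean(x)
        # Assumed ideal CIP: z interval that is allowed to know sigma.
        c_ideal = coverage_value(xbar, sigma / n ** 0.5, mu)
        # CIP being rated: same form, but sigma replaced by the sample std.
        c_given = coverage_value(xbar, stdev(x) / n ** 0.5, mu)
        total += (c_given - c_ideal) ** 2
    return total / reps


if __name__ == "__main__":
    for n in (5, 10, 50):
        print(n, estimate_criterion(n=n))
```

In this sketch both procedures are applied to the same data set in every replication, so the estimate reflects how closely the rated procedure tracks the ideal one data set by data set, not only how the two behave on average. The estimate should shrink as `n` grows, since the sample standard deviation then approaches the known `sigma`.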
