Risk minimization using robust experimental or sampling designs and mixture of designs

Journal of Statistical Planning and Inference · IF 0.8 · Zone 4 (Mathematics) · JCR Q3 (Statistics & Probability) · Pub Date: 2024-09-29 · DOI: 10.1016/j.jspi.2024.106241

Ejub Talovic, Yves Tillé

Journal of Statistical Planning and Inference, volume 236, Article 106241. https://www.sciencedirect.com/science/article/pii/S0378375824000983


Abstract

For both experimental and sampling designs, the efficiency or balance of designs has been extensively studied, and there are many ways to incorporate auxiliary information into a design. However, when a balanced design is used to decrease the variance due to an auxiliary variable, the variance may increase through an effect that we define as a lack of robustness. This robustness can be expressed as the largest eigenvalue of the variance operator of a sampling or experimental design: if this eigenvalue is large, it may induce a large variance in the Horvitz–Thompson estimator of the total. We calculate or estimate the largest eigenvalue for the most common designs, and we derive lower bounds, upper bounds, and approximations of this eigenvalue for different designs. We then compare these results with simulations that show the trade-off between efficiency and robustness. The results can guide the choice of design for experiments such as clinical trials, or for surveys. We also propose a new and simple method for mixing two sampling designs, in which a tuning parameter controls the balance between the two designs. This method is compared with the Gram–Schmidt walk design, which also governs the trade-off between robustness and efficiency. A set of simulation studies shows that our mixture method gives results similar to those of the Gram–Schmidt walk design while having an interpretable variance matrix.
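The quantities named in the abstract can be illustrated on a toy population. The sketch below is not taken from the paper; it assumes the variance operator is the matrix Delta with entries pi_kl - pi_k * pi_l (the covariance matrix of the sample inclusion indicators), so that the Horvitz–Thompson variance is the quadratic form z' Delta z with z_k = y_k / pi_k. It also assumes that "mixing" means selecting design 1 with probability alpha and design 2 otherwise, which may differ from the authors' construction. The population size, the two designs, and all function and variable names are hypothetical choices made for illustration.

```python
import itertools
import numpy as np

def design_moments(samples, probs, N):
    """Inclusion probabilities pi and variance matrix Delta of a design
    given as an explicit list of samples with selection probabilities."""
    pi = np.zeros(N)
    second = np.zeros((N, N))
    for s, p in zip(samples, probs):
        a = np.zeros(N)          # inclusion indicator vector of sample s
        a[list(s)] = 1.0
        pi += p * a
        second += p * np.outer(a, a)
    return pi, second - np.outer(pi, pi)

def ht_variance(y, pi, Delta):
    """Variance of the Horvitz-Thompson total, as the quadratic form z' Delta z."""
    z = y / pi
    return float(z @ Delta @ z)

N, n = 6, 3

# Design 1 (robust): simple random sampling without replacement.
srs = list(itertools.combinations(range(N), n))
p_srs = [1.0 / len(srs)] * len(srs)

# Design 2 (balanced on the unit order): systematic sampling, two samples.
sys_samples = [(0, 2, 4), (1, 3, 5)]
p_sys = [0.5, 0.5]

pi_srs, Delta_srs = design_moments(srs, p_srs, N)
pi_sys, Delta_sys = design_moments(sys_samples, p_sys, N)

# Mixture (assumed scheme): use design 1 with probability alpha, else design 2.
alpha = 0.5
mix_samples = srs + sys_samples
mix_probs = [alpha * p for p in p_srs] + [(1 - alpha) * p for p in p_sys]
pi_mix, Delta_mix = design_moments(mix_samples, mix_probs, N)

# Largest eigenvalue of each (symmetric) variance matrix.
lam_srs = np.linalg.eigvalsh(Delta_srs)[-1]
lam_sys = np.linalg.eigvalsh(Delta_sys)[-1]
lam_mix = np.linalg.eigvalsh(Delta_mix)[-1]

# Two test variables: a trend that systematic sampling balances well,
# and an alternating pattern aligned with its worst-case direction.
y_trend = np.arange(1.0, N + 1)
y_alt = np.array([1.0, 0.0, 1.0, 0.0, 1.0, 0.0])

for name, pi, Delta, lam in [("srs", pi_srs, Delta_srs, lam_srs),
                             ("systematic", pi_sys, Delta_sys, lam_sys),
                             ("mixture", pi_mix, Delta_mix, lam_mix)]:
    print(name, "lambda_max =", round(lam, 4),
          "var(trend) =", round(ht_variance(y_trend, pi, Delta), 4),
          "var(alt) =", round(ht_variance(y_alt, pi, Delta), 4))
```

In this toy case both designs have the same first-order inclusion probabilities (1/2 for every unit), so the variance matrix of the mixture is the convex combination alpha * Delta_1 + (1 - alpha) * Delta_2, which is one sense in which a mixture's variance matrix stays easy to interpret. The systematic design gives a smaller variance than simple random sampling for the trend variable but a much larger one for the alternating variable, which matches its larger top eigenvalue.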


Source journal: Journal of Statistical Planning and Inference (Mathematics: Statistics & Probability)

CiteScore: 2.10

Self-citation rate: 11.10%

Review time: 3-6 weeks

About the journal: The Journal of Statistical Planning and Inference offers itself as a multifaceted and all-inclusive bridge between classical aspects of statistics and probability, and the emerging interdisciplinary aspects that have the potential to revolutionize the subject. While we maintain our traditional strength in statistical inference, design, classical probability, and large sample methods, we also have a far more inclusive and broadened scope to keep up with the new problems that confront us as statisticians, mathematicians, and scientists. We publish high-quality articles in all branches of statistics, probability, discrete mathematics, machine learning, and bioinformatics. We also especially welcome well-written and up-to-date review articles on fundamental themes of statistics, probability, machine learning, and general biostatistics. Thoughtful letters to the editors, interesting problems in need of a solution, and short notes carrying an element of elegance or beauty are equally welcome.

Latest articles from this journal

On misspecification in cusp-type change-point models
Estimation and testing for varying-coefficient single-index quantile regression models
Fixed-budget optimal designs for multi-fidelity computer experiments
Nonparametric regression with predictors missing at random and the scale depending on auxiliary covariates
Editorial Board