Rerandomization with diminishing covariate imbalance and diverging number of covariates

Yuhao Wang, Xinran Li
{"title":"Rerandomization with diminishing covariate imbalance and diverging number of covariates","authors":"Yuhao Wang, Xinran Li","doi":"10.1214/22-aos2235","DOIUrl":null,"url":null,"abstract":"Completely randomized experiments have been the gold standard for drawing causal inference because they can balance all potential confounding on average. However, they may suffer from unbalanced covariates for realized treatment assignments. Rerandomization, a design that rerandomizes the treatment assignment until a prespecified covariate balance criterion is met, has recently got attention due to its easy implementation, improved covariate balance and more efficient inference. Researchers have then suggested to use the treatment assignments that minimize the covariate imbalance, namely the optimally balanced design. This has caused again the long-time controversy between two philosophies for designing experiments: randomization versus optimal and thus almost deterministic designs. Existing literature argued that rerandomization with overly balanced observed covariates can lead to highly imbalanced unobserved covariates, making it vulnerable to model misspecification. On the contrary, rerandomization with properly balanced covariates can provide robust inference for treatment effects while sacrific-ing some efficiency compared to the ideally optimal design. In this paper, we show it is possible that, by making the covariate imbalance diminishing at a proper rate as the sample size increases, rerandomization can achieve its ideally optimal precision that one can expect with perfectly balanced covariates, while still maintaining its robustness. We further investigate conditions on the number of covariates for achieving the desired optimality. Our results rely on a more delicate asymptotic analysis for rerandomization, allowing both diminishing covariate imbalance threshold (or equivalently the acceptance probability) and diverging number of covariates. The derived theory for rerandomization provides a deeper understanding of its large-sample property and can better guide its practical implementation. Furthermore, it also helps reconcile the controversy between randomized and optimal designs in an asymptotic sense.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"170 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Annals of Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1214/22-aos2235","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Completely randomized experiments have been the gold standard for drawing causal inference because they can balance all potential confounding on average. However, they may suffer from unbalanced covariates for realized treatment assignments. Rerandomization, a design that rerandomizes the treatment assignment until a prespecified covariate balance criterion is met, has recently got attention due to its easy implementation, improved covariate balance and more efficient inference. Researchers have then suggested to use the treatment assignments that minimize the covariate imbalance, namely the optimally balanced design. This has caused again the long-time controversy between two philosophies for designing experiments: randomization versus optimal and thus almost deterministic designs. Existing literature argued that rerandomization with overly balanced observed covariates can lead to highly imbalanced unobserved covariates, making it vulnerable to model misspecification. On the contrary, rerandomization with properly balanced covariates can provide robust inference for treatment effects while sacrific-ing some efficiency compared to the ideally optimal design. In this paper, we show it is possible that, by making the covariate imbalance diminishing at a proper rate as the sample size increases, rerandomization can achieve its ideally optimal precision that one can expect with perfectly balanced covariates, while still maintaining its robustness. We further investigate conditions on the number of covariates for achieving the desired optimality. Our results rely on a more delicate asymptotic analysis for rerandomization, allowing both diminishing covariate imbalance threshold (or equivalently the acceptance probability) and diverging number of covariates. The derived theory for rerandomization provides a deeper understanding of its large-sample property and can better guide its practical implementation. Furthermore, it also helps reconcile the controversy between randomized and optimal designs in an asymptotic sense.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
协变量不平衡减少和协变量数目分散的再随机化
完全随机实验一直是得出因果推理的黄金标准,因为它们平均可以平衡所有潜在的混杂因素。然而,对于已实现的治疗分配,它们可能会受到协变量不平衡的影响。再随机化是一种将处理分配重新随机化,直到满足预先指定的协变量平衡标准的设计,由于其易于实现,改善了协变量平衡和更有效的推理,最近受到了关注。研究人员随后建议使用使协变量不平衡最小化的处理分配,即最优平衡设计。这再次引起了两种设计实验的哲学之间的长期争论:随机化与最优设计,因此几乎是确定性设计。现有文献认为,观察到的协变量过于平衡的再随机化会导致未观察到的协变量高度不平衡,使其容易出现模型错配。相反,与理想的最优设计相比,适当平衡协变量的再随机化可以为治疗效果提供稳健的推断,同时牺牲一些效率。在本文中,我们表明,通过使协变量不平衡随着样本量的增加而以适当的速率递减,再随机化可以达到理想的最佳精度,即人们可以期望具有完全平衡的协变量,同时仍然保持其稳健性。我们进一步研究了实现期望最优性的协变量数的条件。我们的结果依赖于更精细的再随机化渐近分析,允许减少协变量不平衡阈值(或等效的接受概率)和分散的协变量数量。导出的再随机化理论对其大样本特性有了更深入的理解,可以更好地指导其实际实施。此外,它还有助于在渐近意义上调和随机和最优设计之间的争议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Maximum likelihood for high-noise group orbit estimation and single-particle cryo-EM Local Whittle estimation of high-dimensional long-run variance and precision matrices Efficient estimation of the maximal association between multiple predictors and a survival outcome The impacts of unobserved covariates on covariate-adaptive randomized experiments Estimation of expected Euler characteristic curves of nonstationary smooth random fields
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1