Distributionally Robust and Generalizable Inference

IF 3.4 1区数学 Q1 STATISTICS & PROBABILITY Statistical Science Pub Date : 2023-11-01 DOI:10.1214/23-sts902

Dominik Rothenhäusler, Peter Bühlmann

引用次数: 3

Abstract

We discuss recently developed methods that quantify the stability and generalizability of statistical findings under distributional changes. In many practical problems, the data is not drawn i.i.d. from the target population. For example, unobserved sampling bias, batch effects, or unknown associations might inflate the variance compared to i.i.d. sampling. For reliable statistical inference, it is thus necessary to account for these types of variation. We discuss and review two methods that allow to quantify distribution stability based on a single dataset. The first method computes the sensitivity of a parameter under worst-case distributional perturbations to understand which types of shift pose a threat to external validity. The second method treats distributional shifts as random which allows to assess average robustness (instead of worst-case). Based on a stability analysis of multiple estimators on a single dataset, it integrates both sampling and distributional uncertainty into a single confidence interval.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

分布鲁棒和可推广推理

我们讨论了最近发展的方法来量化分布变化下统计结果的稳定性和概括性。在许多实际问题中，数据不是直接从目标人群中提取的。例如，与i.i.d抽样相比，未观察到的抽样偏差、批处理效应或未知关联可能会扩大方差。因此，为了可靠的统计推断，有必要考虑这些类型的变化。我们讨论并回顾了基于单个数据集量化分布稳定性的两种方法。第一种方法计算参数在最坏情况下分布扰动的敏感性，以了解哪种类型的移位对外部有效性构成威胁。第二种方法将分布移位视为随机，从而可以评估平均鲁棒性(而不是最坏情况)。它基于对单个数据集上多个估计量的稳定性分析，将抽样不确定性和分布不确定性集成到单个置信区间中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Statistical Science 数学-统计学与概率论

CiteScore

6.50

自引率

1.80%

发文量

审稿时长

>12 weeks

期刊介绍： The central purpose of Statistical Science is to convey the richness, breadth and unity of the field by presenting the full range of contemporary statistical thought at a moderate technical level, accessible to the wide community of practitioners, researchers and students of statistics and probability.

期刊最新文献

On the mixed-model analysis of covariance in cluster-randomized trials. Replicable Bandits for Digital Health Interventions. Bayesian Transfer Learning. On the Use of Auxiliary Variables in Multilevel Regression and Poststratification. Scalable Empirical Bayes Inference and Bayesian Sensitivity Analysis.