Propensity Scores in the Design of Observational Studies for Causal Effects

IF 2.8 2区数学 Q2 BIOLOGY Biometrika Pub Date : 2022-09-28 DOI:10.1093/biomet/asac054

P. Rosenbaum, D. Rubin

{"title":"Propensity Scores in the Design of Observational Studies for Causal Effects","authors":"P. Rosenbaum, D. Rubin","doi":"10.1093/biomet/asac054","DOIUrl":null,"url":null,"abstract":"\n The design of any study, whether experimental or observational, that is intended to estimate the causal effects of a treatment condition relative to a control condition, refers to those activities that precede any examination of outcome variables. As defined in our 1983 article (Rosenbaum & Rubin, 1983), the propensity score is the unit-level conditional probability of assignment to treatment versus control given the observed covariates; so, the propensity score explicitly does not involve any outcome variables, in contrast to other summaries of variables sometimes used in observational studies. Balancing the distributions of covariates in the treatment and control groups by matching or balancing on the propensity score is therefore an aspect of the design of the observational study. In this invited comment on our 1983 article, we review the situation in the early 1980’s, and we recall some apparent paradoxes that propensity scores helped to resolve. We demonstrate that it is possible to balance an enormous number of low-dimensional summaries of a high-dimensional covariate, even though it is generally impossible to match individuals closely for all of the components of a high-dimensional covariate. In a sense, there is only one crucial observed covariate, the propensity score, and there is one crucial unobserved covariate, the ‘principal unobserved covariate’. The propensity score and the principal unobserved covariate are equal when treatment assignment is strongly ignorable, that is, unconfounded. Controlling for observed covariates is a prelude to the crucial step from association to causation, the step that addresses potential biases from unmeasured covariates. The design of an observational study also prepares for the step to causation: by selecting comparisons to increase the design sensitivity, by seeking opportunities to detect bias, by seeking mutually supportive evidence affected by different biases, by incorporating quasi-experimental devices such as multiple control groups, and by including the economist’s instruments. All of these considerations reflect the formal development of sensitivity analyses that were largely informal prior to the 1980s.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2022-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biometrika","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1093/biomet/asac054","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}

引用次数: 6

Abstract

The design of any study, whether experimental or observational, that is intended to estimate the causal effects of a treatment condition relative to a control condition, refers to those activities that precede any examination of outcome variables. As defined in our 1983 article (Rosenbaum & Rubin, 1983), the propensity score is the unit-level conditional probability of assignment to treatment versus control given the observed covariates; so, the propensity score explicitly does not involve any outcome variables, in contrast to other summaries of variables sometimes used in observational studies. Balancing the distributions of covariates in the treatment and control groups by matching or balancing on the propensity score is therefore an aspect of the design of the observational study. In this invited comment on our 1983 article, we review the situation in the early 1980’s, and we recall some apparent paradoxes that propensity scores helped to resolve. We demonstrate that it is possible to balance an enormous number of low-dimensional summaries of a high-dimensional covariate, even though it is generally impossible to match individuals closely for all of the components of a high-dimensional covariate. In a sense, there is only one crucial observed covariate, the propensity score, and there is one crucial unobserved covariate, the ‘principal unobserved covariate’. The propensity score and the principal unobserved covariate are equal when treatment assignment is strongly ignorable, that is, unconfounded. Controlling for observed covariates is a prelude to the crucial step from association to causation, the step that addresses potential biases from unmeasured covariates. The design of an observational study also prepares for the step to causation: by selecting comparisons to increase the design sensitivity, by seeking opportunities to detect bias, by seeking mutually supportive evidence affected by different biases, by incorporating quasi-experimental devices such as multiple control groups, and by including the economist’s instruments. All of these considerations reflect the formal development of sensitivity analyses that were largely informal prior to the 1980s.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

因果效应观察研究设计中的倾向性得分

任何研究的设计，无论是实验性的还是观察性的，旨在估计治疗条件相对于对照条件的因果影响，都是指在检查结果变量之前的那些活动。正如我们1983年的文章（Rosenbaum&Rubin，1983）中所定义的，倾向得分是在观察到的协变量的情况下，分配给治疗与控制的单位水平条件概率；因此，与观察性研究中有时使用的其他变量汇总相比，倾向评分明确不涉及任何结果变量。因此，通过匹配或平衡倾向得分来平衡治疗组和对照组中协变量的分布是观察性研究设计的一个方面。在这篇受邀对我们1983年的文章发表的评论中，我们回顾了20世纪80年代初的情况，并回顾了倾向得分帮助解决的一些明显的悖论。我们证明了平衡高维协变量的大量低维摘要是可能的，尽管通常不可能为高维协变的所有成分密切匹配个体。从某种意义上说，只有一个关键的观察到的协变量，即倾向得分，还有一个重要的未观察到的协变量，即“主要未观察到协变量”。当治疗分配是强可忽略的，即不成立时，倾向得分和主要未观察协变量是相等的。控制观察到的协变量是从关联到因果关系的关键步骤的前奏，这一步骤解决了未测量协变量的潜在偏差。观察性研究的设计也为因果关系的步骤做了准备：通过选择比较来提高设计灵敏度，通过寻找发现偏见的机会，通过寻求受不同偏见影响的相互支持的证据，通过结合准实验装置，如多个对照组，以及通过纳入经济学家的工具。所有这些考虑都反映了敏感性分析的正式发展，在20世纪80年代之前，敏感性分析基本上是非正式的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Biometrika 生物-生物学

CiteScore

5.50

自引率

3.70%

发文量

审稿时长

6-12 weeks

期刊介绍： Biometrika is primarily a journal of statistics in which emphasis is placed on papers containing original theoretical contributions of direct or potential value in applications. From time to time, papers in bordering fields are also published.

期刊最新文献

Post-selection inference for causal effects after causal discovery. On the asymptotic validity of confidence sets for linear functionals of solutions to integral equations. Resampling methods with multiply imputed data. A More Robust Approach to Multivariable Mendelian Randomization. Aggregating dependent signals with heavy-tailed combination tests.