"Regression Discontinuity Designs in the Econometrics Literature" by G. Imbens. Observational Studies 3(1): 147-155. https://doi.org/10.1353/obs.2017.0003

Abstract: Many decades after being introduced by Thistlethwaite and Campbell (1960), regression discontinuity designs have become an important tool for causal inference in the social sciences. Researchers have found the methods to be widely applicable in settings where eligibility or incentives for participation in programs are at least partially regulated. Alongside, and motivated by, the many studies applying regression discontinuity methods, there have been a number of methodological studies improving our understanding, and implementation, of these methods. Here I report on some of the recent advances in the econometrics literature.

{"title":"Observational Studies and Study Designs: An Epidemiologic Perspective","authors":"T. J. Vander Weele","doi":"10.1353/obs.2015.0025","DOIUrl":"https://doi.org/10.1353/obs.2015.0025","url":null,"abstract":"","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":"1 1","pages":"223 - 230"},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/obs.2015.0025","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48496744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
"Potential for Bias Inflation with Grouped Data: A Comparison of Estimators and a Sensitivity Analysis Strategy" by M. Scott, Ronli Diakow, J. Hill, and J. Middleton. Observational Studies 4(1): 111-149. https://doi.org/10.1353/obs.2018.0016

Abstract: We are concerned with the unbiased estimation of a treatment effect in the context of non-experimental studies with grouped or multilevel data. When analyzing such data with this goal, practitioners typically include as many predictors (controls) as possible, in an attempt to satisfy ignorability of the treatment assignment. In the multilevel setting with two levels, there are two classes of potential confounders that one must consider, and attempts to satisfy ignorability conditional on just one set would lead to a different treatment effect estimator than attempts to satisfy the other (or both). The three estimators considered in this paper are the so-called “within,” “between,” and OLS estimators. We generate bounds on the potential differences in bias for these competing estimators to inform model selection. Our approach relies on a parametric model for grouped data and omitted confounders and establishes a framework for sensitivity analysis in the two-level modeling context. The method relies on information obtained from parameters estimated under a variety of multilevel model specifications. We characterize the strength of the confounding and corresponding bias using easily interpretable parameters and graphical displays. We apply this approach to data from a multinational educational evaluation study. We demonstrate the extent to which different treatment effect estimators may be robust to potential unobserved individual- and group-level confounding.

"Book review of “Causality in a Social World” by Guanglei Hong" by K. Frank, G. Saw, and Ran Xu. Observational Studies 2(1): 86-89. https://doi.org/10.1353/obs.2016.0001

As the introduction of Guanglei Hong’s Causality in a Social World makes clear, this book would not be necessary if all treatments we wished to study had constant effects through simple mechanisms on independent individuals who were randomly assigned to treatments. While such conditions may hold in some idealized agricultural settings, this is not the phenomenon we encounter in a social-policy-oriented world with human agency. In response, Hong presents a coherent theoretical and empirical framework for estimating causality when people choose their own treatments, when they encounter mediating and moderating effects of treatments, and when they influence others’ choices and outcomes. The book is presented in four large sections: overview, moderation, mediation, and spillover, with a chapter introducing the core ideas in each section (chapters 4, 7, 11, and 14, respectively). Beyond merely consolidating her own foundational work, the book is steeped in deep and historical statistical principles of sampling, propensity score analysis, mediation and moderation, and spillover mechanisms. Ultimately, the book will mark a passageway from underlying statistical principles to a framework that may endure and expand beyond even what Hong anticipates.

"Understanding Regression Discontinuity Designs As Observational Studies" by J. Sekhon and R. Titiunik. Observational Studies 3(1): 174-182. https://doi.org/10.1353/obs.2017.0005

Thistlethwaite and Campbell (1960) proposed to use a “regression-discontinuity analysis” in settings where exposure to a treatment or intervention is determined by an observable score and a fixed cutoff. The type of setting they described, now widely known as the regression discontinuity (RD) design, is one where units receive a score, and a binary treatment is assigned according to a very specific rule. In the simplest case, all units whose score is above a known cutoff are assigned to the treatment condition, and all units whose score is below the cutoff are assigned to the control (i.e., absence of treatment) condition. Thistlethwaite and Campbell insightfully noted that, under appropriate assumptions, the discontinuity in the probability of treatment status induced by such an assignment rule could be leveraged to learn about the effect of the treatment at the cutoff. Their seminal contribution led to what is now one of the most rigorous non-experimental research designs across the social and biomedical sciences. See Cook (2008), Imbens and Lemieux (2008), and Lee and Lemieux (2010) for reviews, and the recent volume edited by Cattaneo and Escanciano (2017) for recent specific applications and methodological developments.

"The non-zero mean SIMEX: Improving estimation in the face of measurement error" by Nabila Parveen, E. Moodie, and B. Brenner. Observational Studies 1(1): 90-123. https://doi.org/10.1353/obs.2015.0005

Abstract: The simulation extrapolation method developed by Cook and Stefanski (1995) is a simulation-based technique for estimating and reducing bias due to additive measurement error, armed only with knowledge of the variance of the measurement error distribution. However, there are many instances in which validation data are not available and measurement error is known not to have mean zero. For example, in assessing phylogenetic cluster size of HIV viruses, cluster size is systematically underestimated, since clustering can only be performed on the viruses of those individuals who have presented for testing. In this setting, it is not possible to obtain validation data; however, using knowledge gleaned from the literature, the distribution of the errors may be estimated. In this work, we extend the simulation extrapolation procedure to accommodate errors with non-zero means, motivated by an interest in determining behavioural correlates of HIV phylogenetic cluster size. We provide theoretical justification for the generalization to the non-zero mean measurement error case, proving its consistency and demonstrating its performance via simulation. We then apply the result to data from the province of Quebec, Canada, to show that findings from a naïve analysis are robust to a substantial range of possible measurement error distributions.

"Application of Propensity Scores to a Continuous Exposure: Effect of Lead Exposure in Early Childhood on Reading and Mathematics Scores" by M. Elliott, Nanhua Zhang, and Dylan S. Small. Observational Studies 1(1): 30-55. https://doi.org/10.1353/obs.2015.0002

Abstract: The estimation of causal effects in observational studies is usually limited by the lack of randomization, which can result in different treatment or exposure groups differing systematically with respect to characteristics that influence outcomes. To remove such systematic differences, methods to “balance” subjects on observed covariates across treatment or exposure levels have been developed over the past three decades. These methods have been developed primarily in settings with binary treatments or exposures. However, in many observational studies the exposures are continuous rather than binary or discrete, and are usually considered as doses of treatment. In this manuscript we consider estimating the causal effect of early childhood lead exposure on youth academic achievement, where the exposure variable, blood lead concentration, can take any value greater than or equal to zero, using three balancing methods: propensity score analysis, non-bipartite matching, and Bayesian regression trees. We find some evidence that the standard logistic regression analysis controlling for age and socioeconomic confounders used in previous analyses (Zhang et al. (2013)) overstates the effect of lead exposure on performance on standardized mathematics and reading examinations; however, significant declines remain, including at doses currently below the recommended exposure levels.

"The Choice of Neighborhood in Regression Discontinuity Designs" by M. D. Cattaneo. Observational Studies 3(1): 134-146. https://doi.org/10.1353/obs.2017.0002

The seminal paper of Thistlethwaite and Campbell (1960) is one of the greatest breakthroughs in program evaluation and causal inference for observational studies. The approach, originally coined “Regression-Discontinuity Analysis” and nowadays widely known as the Regression Discontinuity (RD) design, is likely the most credible and internally valid quantitative approach for the analysis and interpretation of non-experimental data. Early reviews and perspectives on RD designs include Cook (2008), Imbens and Lemieux (2008), and Lee and Lemieux (2010); see also Cattaneo and Escanciano (2017) for a contemporaneous edited volume with more recent overviews, discussions, and references. The key design feature in RD is that units have an observable running variable, score, or index, and are assigned to treatment whenever this variable exceeds a known cutoff. Empirical work in RD designs seeks to compare the response of units just below the cutoff (control group) to the response of units just above (treatment group) to learn about the treatment effects of interest. It is by now generally recognized that the most important task in practice is to select the appropriate neighborhood near the cutoff, that is, to correctly determine which observations near the cutoff will be used. Localizing near the cutoff is crucial because empirical findings can be quite sensitive to which observations are included in the analysis. Several neighborhood selection methods have been developed in the literature depending on the goal (e.g., estimation, inference, falsification, graphical presentation), the underlying assumptions invoked (e.g., parametric specification, continuity/nonparametric specification, local randomization), the parameter of interest (e.g., sharp, fuzzy, kink), and even the specific design (e.g., single-cutoff, multi-cutoff, geographic). We offer a comprehensive discussion of both deprecated and modern neighborhood selection approaches available in the literature, following their historical as well as methodological evolution over the last decades. We focus on the prototypical case of a continuously distributed running variable for the most part, though we also discuss the discrete-valued case towards the end of the discussion. The bulk of the presentation focuses on neighborhood selection for estimation and inference, outlining different methods and approaches according to, roughly speaking, the size of a typical selected neighborhood in each case, going from the largest to the smallest neighborhood. Figure 1 in the paper provides a heuristic summary.

"Book review of “Observation and Experiment: An Introduction to Causal Inference” by Paul R. Rosenbaum" by Dylan S. Small. Observational Studies 3(1): 28-38. https://doi.org/10.1353/obs.2017.0008

The economist Paul Samuelson said, “My belief is that nothing that can be expressed by mathematics cannot be expressed by careful use of literary words.” Paul Rosenbaum brings this perspective to causal inference in his new book Observation and Experiment: An Introduction to Causal Inference (Harvard University Press, 2017). The book is a luminous presentation of concepts and strategies for causal inference with a minimum of technical material. An example of how Rosenbaum explains causal inference in a literary way is his use of a passage from Robert Frost’s poem “The Road Not Taken” to illuminate how causal questions involve comparing potential outcomes under two or more treatments where we can only see one potential outcome.