Journal of Causal Inference最新文献_第3页

Quantitative probing: Validating causal models with quantitative domain knowledge 定量探索:用定量领域知识验证因果模型

IF 1.4 4区医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Journal of Causal Inference

Pub Date : 2022-09-07 DOI: 10.1515/jci-2022-0060

Daniel Grünbaum, M. L. Stern, E. Lang

Abstract We propose quantitative probing as a model-agnostic framework for validating causal models in the presence of quantitative domain knowledge. The method is constructed in analogy to the train/test split in correlation-based machine learning. It is consistent with the logic of scientific discovery and enhances current causal validation strategies. The effectiveness of the method is illustrated using Pearl’s sprinkler example, before a thorough simulation-based investigation is conducted. Limits of the technique are identified by studying exemplary failing scenarios, which are furthermore used to propose a list of topics for future research and improvements of the presented version of quantitative probing. A guide for practitioners is included to facilitate the incorporation of quantitative probing in causal modelling applications. The code for integrating quantitative probing into causal analysis, as well as the code for the presented simulation-based studies of the effectiveness of quantitative probing are provided in two separate open-source Python packages.

摘要我们提出定量探究作为一个模型不可知的框架，用于在定量领域知识存在的情况下验证因果模型。该方法的构造类似于基于相关性的机器学习中的训练/测试分割。它与科学发现的逻辑是一致的，并增强了当前的因果验证策略。在进行彻底的基于模拟的调查之前，用Pearl的洒水车实例说明了该方法的有效性。通过研究典型的失败场景来确定该技术的局限性，并进一步用于提出未来研究和改进定量探测的主题列表。为从业人员的指南包括，以促进在因果建模应用定量探测的结合。在两个独立的开源Python包中提供了将定量探测集成到因果分析中的代码，以及对定量探测有效性进行基于模拟的研究的代码。

引用次数: 3

Personalized decision making – A conceptual introduction 个性化决策-概念介绍

IF 1.4 4区医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Journal of Causal Inference

Pub Date : 2022-08-19 DOI: 10.48550/arXiv.2208.09558

Scott Mueller

Abstract Personalized decision making targets the behavior of a specific individual, while population-based decision making concerns a subpopulation resembling that individual. This article clarifies the distinction between the two and explains why the former leads to more informed decisions. We further show that by combining experimental and observational studies, we can obtain valuable information about individual behavior and, consequently, improve decisions over those obtained from experimental studies alone. In particular, we show examples where such a combination discriminates between individuals who can benefit from a treatment and those who cannot – information that would not be revealed by experimental studies alone. We outline areas where this method could be of benefit to both policy makers and individuals involved.

个性化决策针对的是特定个体的行为，而基于群体的决策关注的是与该个体相似的亚群体。本文阐明了两者之间的区别，并解释了为什么前者会导致更明智的决策。我们进一步表明，通过结合实验和观察研究，我们可以获得有关个体行为的有价值的信息，因此，比单独从实验研究中获得的信息更能改进决策。特别是，我们展示了一些例子，说明这种组合区分了能够从治疗中受益的个体和不能从治疗中受益的个体——这些信息仅通过实验研究是无法揭示的。我们概述了这种方法可能对政策制定者和相关个人都有益的领域。

引用次数: 24

Exploiting neighborhood interference with low-order interactions under unit randomized design 单位随机设计下低阶相互作用的邻域干扰研究

IF 1.4 4区医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Journal of Causal Inference

Pub Date : 2022-08-10 DOI: 10.48550/arXiv.2208.05553

Mayleen Cortez, Matthew Eichhorn, C. Yu

Abstract Network interference, where the outcome of an individual is affected by the treatment assignment of those in their social network, is pervasive in real-world settings. However, it poses a challenge to estimating causal effects. We consider the task of estimating the total treatment effect (TTE), or the difference between the average outcomes of the population when everyone is treated versus when no one is, under network interference. Under a Bernoulli randomized design, we provide an unbiased estimator for the TTE when network interference effects are constrained to low-order interactions among neighbors of an individual. We make no assumptions on the graph other than bounded degree, allowing for well-connected networks that may not be easily clustered. We derive a bound on the variance of our estimator and show in simulated experiments that it performs well compared with standard estimators for the TTE. We also derive a minimax lower bound on the mean squared error of our estimator, which suggests that the difficulty of estimation can be characterized by the degree of interactions in the potential outcomes model. We also prove that our estimator is asymptotically normal under boundedness conditions on the network degree and potential outcomes model. Central to our contribution is a new framework for balancing model flexibility and statistical complexity as captured by this low-order interactions structure.

网络干扰是指个体的行为结果受到其社会网络中其他人的待遇分配的影响，这种现象在现实世界中普遍存在。然而，它对估计因果关系提出了挑战。我们考虑在网络干扰下估计总治疗效果(TTE)的任务，或每个人都接受治疗与没有人接受治疗时人群平均结果之间的差异。在伯努利随机化设计下，我们给出了当网络干扰效应被限制为个体之间的低阶相互作用时，TTE的无偏估计量。除了有界度，我们没有对图做任何假设，考虑到连接良好的网络可能不容易聚类。我们推导了估计器方差的一个界，并在模拟实验中表明，与TTE的标准估计器相比，它的性能很好。我们还推导了估计器均方误差的最小极大下界，这表明估计的难度可以通过潜在结果模型中的相互作用程度来表征。我们还证明了在网络度和潜在结果模型的有界条件下我们的估计量是渐近正态的。我们贡献的核心是平衡模型灵活性和统计复杂性的新框架，正如这个低阶交互结构所捕获的那样。

{"title":"Exploiting neighborhood interference with low-order interactions under unit randomized design","authors":"Mayleen Cortez, Matthew Eichhorn, C. Yu","doi":"10.48550/arXiv.2208.05553","DOIUrl":"https://doi.org/10.48550/arXiv.2208.05553","url":null,"abstract":"Abstract Network interference, where the outcome of an individual is affected by the treatment assignment of those in their social network, is pervasive in real-world settings. However, it poses a challenge to estimating causal effects. We consider the task of estimating the total treatment effect (TTE), or the difference between the average outcomes of the population when everyone is treated versus when no one is, under network interference. Under a Bernoulli randomized design, we provide an unbiased estimator for the TTE when network interference effects are constrained to low-order interactions among neighbors of an individual. We make no assumptions on the graph other than bounded degree, allowing for well-connected networks that may not be easily clustered. We derive a bound on the variance of our estimator and show in simulated experiments that it performs well compared with standard estimators for the TTE. We also derive a minimax lower bound on the mean squared error of our estimator, which suggests that the difficulty of estimation can be characterized by the degree of interactions in the potential outcomes model. We also prove that our estimator is asymptotically normal under boundedness conditions on the network degree and potential outcomes model. Central to our contribution is a new framework for balancing model flexibility and statistical complexity as captured by this low-order interactions structure.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"52 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73433770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Sensitivity analysis for causal decomposition analysis: Assessing robustness toward omitted variable bias 因果分解分析的敏感性分析:对遗漏变量偏差的稳健性评估

IF 1.4 4区医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Journal of Causal Inference

Pub Date : 2022-05-26 DOI: 10.1515/jci-2022-0031

S. Park, Suyeon Kang, Chioun Lee, Shujie Ma

Abstract A key objective of decomposition analysis is to identify a factor (the “mediator”) contributing to disparities in an outcome between social groups. In decomposition analysis, a scholarly interest often centers on estimating how much the disparity (e.g., health disparities between Black women and White men) would be reduced/remain if we set the mediator (e.g., education) distribution of one social group equal to another. However, causally identifying disparity reduction and remaining depends on the no omitted mediator–outcome confounding assumption, which is not empirically testable. Therefore, we propose a set of sensitivity analyses to assess the robustness of disparity reduction to possible unobserved confounding. We derived general bias formulas for disparity reduction, which can be used beyond a particular statistical model and do not require any functional assumptions. Moreover, the same bias formulas apply with unobserved confounding measured before and after the group status. On the basis of the formulas, we provide sensitivity analysis techniques based on regression coefficients and R 2 {R}^{2} values by extending the existing approaches. The R 2 {R}^{2} -based sensitivity analysis offers a straightforward interpretation of sensitivity parameters and a standard way to report the robustness of research findings. Although we introduce sensitivity analysis techniques in the context of decomposition analysis, they can be utilized in any mediation setting based on interventional indirect effects when the exposure is randomized (or conditionally ignorable given covariates).

分解分析的一个关键目标是确定导致社会群体之间结果差异的因素(“中介”)。在分解分析中，学术兴趣通常集中在如果我们设置一个社会群体与另一个社会群体相等的中介(例如，教育)分布，估计差距(例如，黑人女性和白人男性之间的健康差距)将减少或保持多少。然而，因果关系识别差异减少和剩余取决于没有遗漏的中介结果混淆假设，这是没有经验可检验的。因此，我们提出了一套敏感性分析来评估差异减少对可能的未观察到的混淆的稳健性。我们推导了减少差异的一般偏差公式，它可以在特定的统计模型之外使用，并且不需要任何功能假设。此外，同样的偏差公式适用于未观察到的混杂在之前和之后的组状态测量。在此基础上，对现有方法进行了扩展，提出了基于回归系数和r2 {R}^{2}值的敏感性分析技术。基于r2 {R}^{2}的敏感性分析提供了对敏感性参数的直接解释和报告研究结果稳健性的标准方法。虽然我们在分解分析的背景下引入了敏感性分析技术，但当暴露是随机的(或给定协变量的条件下可忽略)时，它们可以用于任何基于介入间接效应的中介设置。

{"title":"Sensitivity analysis for causal decomposition analysis: Assessing robustness toward omitted variable bias","authors":"S. Park, Suyeon Kang, Chioun Lee, Shujie Ma","doi":"10.1515/jci-2022-0031","DOIUrl":"https://doi.org/10.1515/jci-2022-0031","url":null,"abstract":"Abstract A key objective of decomposition analysis is to identify a factor (the “mediator”) contributing to disparities in an outcome between social groups. In decomposition analysis, a scholarly interest often centers on estimating how much the disparity (e.g., health disparities between Black women and White men) would be reduced/remain if we set the mediator (e.g., education) distribution of one social group equal to another. However, causally identifying disparity reduction and remaining depends on the no omitted mediator–outcome confounding assumption, which is not empirically testable. Therefore, we propose a set of sensitivity analyses to assess the robustness of disparity reduction to possible unobserved confounding. We derived general bias formulas for disparity reduction, which can be used beyond a particular statistical model and do not require any functional assumptions. Moreover, the same bias formulas apply with unobserved confounding measured before and after the group status. On the basis of the formulas, we provide sensitivity analysis techniques based on regression coefficients and R 2 {R}^{2} values by extending the existing approaches. The R 2 {R}^{2} -based sensitivity analysis offers a straightforward interpretation of sensitivity parameters and a standard way to report the robustness of research findings. Although we introduce sensitivity analysis techniques in the context of decomposition analysis, they can be utilized in any mediation setting based on interventional indirect effects when the exposure is randomized (or conditionally ignorable given covariates).","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"15 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72716650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Matched design for marginal causal effect on restricted mean survival time in observational studies 观察性研究中限定平均生存时间边际因果效应的匹配设计

IF 1.4 4区医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Journal of Causal Inference

Pub Date : 2022-05-04 DOI: 10.1515/jci-2022-0035

Zihan Lin, A. Ni, Bo Lu

Abstract Investigating the causal relationship between exposure and time-to-event outcome is an important topic in biomedical research. Previous literature has discussed the potential issues of using hazard ratio (HR) as the marginal causal effect measure due to noncollapsibility. In this article, we advocate using restricted mean survival time (RMST) difference as a marginal causal effect measure, which is collapsible and has a simple interpretation as the difference of area under survival curves over a certain time horizon. To address both measured and unmeasured confounding, a matched design with sensitivity analysis is proposed. Matching is used to pair similar treated and untreated subjects together, which is generally more robust than outcome modeling due to potential misspecifications. Our propensity score matched RMST difference estimator is shown to be asymptotically unbiased, and the corresponding variance estimator is calculated by accounting for the correlation due to matching. Simulation studies also demonstrate that our method has adequate empirical performance and outperforms several competing methods used in practice. To assess the impact of unmeasured confounding, we develop a sensitivity analysis strategy by adapting the E-value approach to matched data. We apply the proposed method to the Atherosclerosis Risk in Communities Study (ARIC) to examine the causal effect of smoking on stroke-free survival.

研究暴露与事件发生时间之间的因果关系是生物医学研究的一个重要课题。先前的文献讨论了使用风险比(HR)作为非溃散性的边际因果效应度量的潜在问题。在本文中，我们主张使用限制平均生存时间(RMST)差作为边际因果效应度量，它是可折叠的，并且可以简单地解释为一定时间范围内生存曲线下面积的差。为了解决测量和未测量的混淆，提出了一种匹配设计和灵敏度分析。匹配用于将相似的治疗和未治疗的受试者配对在一起，由于潜在的错误规范，这通常比结果建模更稳健。我们的倾向分数匹配RMST差估计是渐近无偏的，相应的方差估计是通过考虑匹配的相关性来计算的。仿真研究也表明我们的方法具有足够的经验性能，并且优于实践中使用的几种竞争方法。为了评估未测量混杂的影响，我们通过对匹配数据采用e值方法开发了一种敏感性分析策略。我们将提出的方法应用于社区动脉粥样硬化风险研究(ARIC)，以检验吸烟对无卒中生存的因果影响。

{"title":"Matched design for marginal causal effect on restricted mean survival time in observational studies","authors":"Zihan Lin, A. Ni, Bo Lu","doi":"10.1515/jci-2022-0035","DOIUrl":"https://doi.org/10.1515/jci-2022-0035","url":null,"abstract":"Abstract Investigating the causal relationship between exposure and time-to-event outcome is an important topic in biomedical research. Previous literature has discussed the potential issues of using hazard ratio (HR) as the marginal causal effect measure due to noncollapsibility. In this article, we advocate using restricted mean survival time (RMST) difference as a marginal causal effect measure, which is collapsible and has a simple interpretation as the difference of area under survival curves over a certain time horizon. To address both measured and unmeasured confounding, a matched design with sensitivity analysis is proposed. Matching is used to pair similar treated and untreated subjects together, which is generally more robust than outcome modeling due to potential misspecifications. Our propensity score matched RMST difference estimator is shown to be asymptotically unbiased, and the corresponding variance estimator is calculated by accounting for the correlation due to matching. Simulation studies also demonstrate that our method has adequate empirical performance and outperforms several competing methods used in practice. To assess the impact of unmeasured confounding, we develop a sensitivity analysis strategy by adapting the E-value approach to matched data. We apply the proposed method to the Atherosclerosis Risk in Communities Study (ARIC) to examine the causal effect of smoking on stroke-free survival.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"29 12 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82731058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Robust inference for matching under rolling enrollment 滚动招生下匹配的鲁棒推理

IF 1.4 4区医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Journal of Causal Inference

Pub Date : 2022-05-02 DOI: 10.1515/jci-2022-0055

Amanda K. Glazer, Samuel D. Pimentel

Abstract Matching in observational studies faces complications when units enroll in treatment on a rolling basis. While each treated unit has a specific time of entry into the study, control units each have many possible comparison, or “pseudo-treatment,” times. Valid inference must account for correlations between repeated measures for a single unit, and researchers must decide how flexibly to match across time and units. We provide three important innovations. First, we introduce a new matched design, GroupMatch with instance replacement, allowing maximum flexibility in control selection. This new design searches over all possible comparison times for each treated-control pairing and is more amenable to analysis than past methods. Second, we propose a block bootstrap approach for inference in matched designs with rolling enrollment and demonstrate that it accounts properly for complex correlations across matched sets in our new design and several other contexts. Third, we develop a falsification test to detect violations of the timepoint agnosticism assumption, which is needed to permit flexible matching across time. We demonstrate the practical value of these tools via simulations and a case study of the impact of short-term injuries on batting performance in major league baseball.

摘要:观察性研究中的匹配在单位滚动入组治疗时面临并发症。虽然每个治疗组都有一个特定的进入研究的时间，但每个控制组都有许多可能的比较时间，或“伪治疗”时间。有效的推断必须考虑到单个单位的重复测量之间的相关性，研究人员必须决定如何灵活地在时间和单位之间进行匹配。我们提供了三个重要的创新。首先，我们引入了一种新的匹配设计，具有实例替换的GroupMatch，允许最大限度地灵活选择控件。这种新设计搜索了每个处理-对照配对的所有可能的比较时间，并且比过去的方法更易于分析。其次，我们提出了一种块引导方法用于滚动入学匹配设计中的推理，并证明它可以正确地解释我们的新设计和其他几个上下文中匹配集之间的复杂相关性。第三，我们开发了一个证伪检验来检测违反时间点不可知论假设的情况，这是允许灵活的跨时间匹配所必需的。我们通过模拟和短期受伤对棒球大联盟打击表现影响的案例研究来证明这些工具的实用价值。

{"title":"Robust inference for matching under rolling enrollment","authors":"Amanda K. Glazer, Samuel D. Pimentel","doi":"10.1515/jci-2022-0055","DOIUrl":"https://doi.org/10.1515/jci-2022-0055","url":null,"abstract":"Abstract Matching in observational studies faces complications when units enroll in treatment on a rolling basis. While each treated unit has a specific time of entry into the study, control units each have many possible comparison, or “pseudo-treatment,” times. Valid inference must account for correlations between repeated measures for a single unit, and researchers must decide how flexibly to match across time and units. We provide three important innovations. First, we introduce a new matched design, GroupMatch with instance replacement, allowing maximum flexibility in control selection. This new design searches over all possible comparison times for each treated-control pairing and is more amenable to analysis than past methods. Second, we propose a block bootstrap approach for inference in matched designs with rolling enrollment and demonstrate that it accounts properly for complex correlations across matched sets in our new design and several other contexts. Third, we develop a falsification test to detect violations of the timepoint agnosticism assumption, which is needed to permit flexible matching across time. We demonstrate the practical value of these tools via simulations and a case study of the impact of short-term injuries on batting performance in major league baseball.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"4 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79025660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Conditional average treatment effect estimation with marginally constrained models 基于边际约束模型的条件平均治疗效果估计

IF 1.4 4区医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Journal of Causal Inference

Pub Date : 2022-04-29 DOI: 10.1515/jci-2022-0027

W. A. van Amsterdam, R. Ranganath

Abstract Treatment effect estimates are often available from randomized controlled trials as a single average treatment effect for a certain patient population. Estimates of the conditional average treatment effect (CATE) are more useful for individualized treatment decision-making, but randomized trials are often too small to estimate the CATE. Examples in medical literature make use of the relative treatment effect (e.g. an odds ratio) reported by randomized trials to estimate the CATE using large observational datasets. One approach to estimating these CATE models is by using the relative treatment effect as an offset, while estimating the covariate-specific untreated risk. We observe that the odds ratios reported in randomized controlled trials are not the odds ratios that are needed in offset models because trials often report the marginal odds ratio. We introduce a constraint or a regularizer to better use marginal odds ratios from randomized controlled trials and find that under the standard observational causal inference assumptions, this approach provides a consistent estimate of the CATE. Next, we show that the offset approach is not valid for CATE estimation in the presence of unobserved confounding. We study if the offset assumption and the marginal constraint lead to better approximations of the CATE relative to the alternative of using the average treatment effect estimate from the randomized trial. We empirically show that when the underlying CATE has sufficient variation, the constraint and offset approaches lead to closer approximations to the CATE.

治疗效果估计通常来自随机对照试验，作为特定患者群体的单一平均治疗效果。条件平均治疗效果(CATE)的估计对个性化治疗决策更有用，但随机试验往往太小而无法估计CATE。医学文献中的例子利用随机试验报告的相对治疗效果(如优势比)，利用大型观察数据集估计CATE。估计这些CATE模型的一种方法是使用相对治疗效果作为抵消，同时估计协变量特异性未经治疗的风险。我们观察到随机对照试验中报告的比值比并不是偏移模型中需要的比值比，因为试验通常报告的是边际比值比。我们引入了一个约束或正则化器来更好地利用随机对照试验的边际优势比，并发现在标准观察性因果推理假设下，该方法提供了一致的CATE估计。接下来，我们证明了在存在未观察到的混淆的情况下，偏移方法对CATE估计无效。我们研究了相对于使用随机试验的平均治疗效果估计的替代方案，偏移假设和边际约束是否能更好地近似CATE。我们的经验表明，当潜在的CATE有足够的变化时，约束和偏移方法导致更接近CATE的近似。

{"title":"Conditional average treatment effect estimation with marginally constrained models","authors":"W. A. van Amsterdam, R. Ranganath","doi":"10.1515/jci-2022-0027","DOIUrl":"https://doi.org/10.1515/jci-2022-0027","url":null,"abstract":"Abstract Treatment effect estimates are often available from randomized controlled trials as a single average treatment effect for a certain patient population. Estimates of the conditional average treatment effect (CATE) are more useful for individualized treatment decision-making, but randomized trials are often too small to estimate the CATE. Examples in medical literature make use of the relative treatment effect (e.g. an odds ratio) reported by randomized trials to estimate the CATE using large observational datasets. One approach to estimating these CATE models is by using the relative treatment effect as an offset, while estimating the covariate-specific untreated risk. We observe that the odds ratios reported in randomized controlled trials are not the odds ratios that are needed in offset models because trials often report the marginal odds ratio. We introduce a constraint or a regularizer to better use marginal odds ratios from randomized controlled trials and find that under the standard observational causal inference assumptions, this approach provides a consistent estimate of the CATE. Next, we show that the offset approach is not valid for CATE estimation in the presence of unobserved confounding. We study if the offset assumption and the marginal constraint lead to better approximations of the CATE relative to the alternative of using the average treatment effect estimate from the randomized trial. We empirically show that when the underlying CATE has sufficient variation, the constraint and offset approaches lead to closer approximations to the CATE.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"19 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90261544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Lasso approach to covariate selection and average treatment effect estimation for clustered RCTs using design-based methods 基于设计方法的聚类随机对照试验协变量选择和平均治疗效果估计的Lasso方法

IF 1.4 4区医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Journal of Causal Inference

Pub Date : 2022-01-01 DOI: 10.1515/jci-2021-0036

Peter Z. Schochet

Abstract Statistical power is often a concern for clustered randomized control trials (RCTs) due to variance inflation from design effects and the high cost of adding study clusters (such as hospitals, schools, or communities). While covariate pre-specification can improve power for estimating regression-adjusted average treatment effects (ATEs), further precision gains can be achieved through covariate selection once primary outcomes have been collected. This article uses design-based methods underlying clustered RCTs to develop Lasso methods for the post-hoc selection of covariates for ATE estimation that avoids a lack of transparency and model overfitting. Our focus is on two-stage estimators: in the first stage, Lasso estimation is conducted using data on cluster-level averages or sums, and in the second stage, standard ATE estimators are adjusted for covariates using the first-stage Lasso results. We discuss l 1 {l}_{1} consistency of the estimated Lasso coefficients, asymptotic normality of the ATE estimators, and design-based variance estimation. The nonparametric approach applies to continuous, binary, and discrete outcomes. We present simulation results and demonstrate the method using data from a federally funded clustered RCT testing the effects of school-based programs promoting behavioral health.

由于设计效应带来的方差膨胀和增加研究集群(如医院、学校或社区)的高成本，统计能力通常是聚类随机对照试验(rct)关注的问题。虽然协变量预规范可以提高估计回归调整平均治疗效果(ATEs)的能力，但一旦收集了主要结果，就可以通过协变量选择进一步提高精度。本文使用基于设计的聚类随机对照试验方法来开发Lasso方法，用于ATE估计的协变量事后选择，以避免缺乏透明度和模型过拟合。我们的重点是两阶段估计器:在第一阶段，Lasso估计是使用簇水平平均值或总和的数据进行的，在第二阶段，使用第一阶段Lasso结果调整标准ATE估计器的协变量。我们讨论了估计Lasso系数的1 {l}_{1}一致性、ATE估计量的渐近正态性和基于设计的方差估计。非参数方法适用于连续、二值和离散结果。我们展示了模拟结果，并使用联邦资助的聚类随机对照试验的数据展示了该方法，该试验测试了以学校为基础的项目促进行为健康的效果。

{"title":"A Lasso approach to covariate selection and average treatment effect estimation for clustered RCTs using design-based methods","authors":"Peter Z. Schochet","doi":"10.1515/jci-2021-0036","DOIUrl":"https://doi.org/10.1515/jci-2021-0036","url":null,"abstract":"Abstract Statistical power is often a concern for clustered randomized control trials (RCTs) due to variance inflation from design effects and the high cost of adding study clusters (such as hospitals, schools, or communities). While covariate pre-specification can improve power for estimating regression-adjusted average treatment effects (ATEs), further precision gains can be achieved through covariate selection once primary outcomes have been collected. This article uses design-based methods underlying clustered RCTs to develop Lasso methods for the post-hoc selection of covariates for ATE estimation that avoids a lack of transparency and model overfitting. Our focus is on two-stage estimators: in the first stage, Lasso estimation is conducted using data on cluster-level averages or sums, and in the second stage, standard ATE estimators are adjusted for covariates using the first-stage Lasso results. We discuss l 1 {l}_{1} consistency of the estimated Lasso coefficients, asymptotic normality of the ATE estimators, and design-based variance estimation. The nonparametric approach applies to continuous, binary, and discrete outcomes. We present simulation results and demonstrate the method using data from a federally funded clustered RCT testing the effects of school-based programs promoting behavioral health.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"9 1","pages":"494 - 514"},"PeriodicalIF":1.4,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84301133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Decomposition of the total effect for two mediators: A natural mediated interaction effect framework. 两个中介的总效应分解:一个自然中介的相互作用效应框架。

IF 1.4 4区医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Journal of Causal Inference

Pub Date : 2022-01-01 DOI: 10.1515/jci-2020-0017

Xin Gao, Li Li, Li Luo

Mediation analysis has been used in many disciplines to explain the mechanism or process that underlies an observed relationship between an exposure variable and an outcome variable via the inclusion of mediators. Decompositions of the total effect (TE) of an exposure variable into effects characterizing mediation pathways and interactions have gained an increasing amount of interest in the last decade. In this work, we develop decompositions for scenarios where two mediators are causally sequential or non-sequential. Current developments in this area have primarily focused on either decompositions without interaction components or with interactions but assuming no causally sequential order between the mediators. We propose a new concept called natural mediated interaction (MI) effect that captures the two-way and three-way interactions for both scenarios and extends the two-way MIs in the literature. We develop a unified approach for decomposing the TE into the effects that are due to mediation only, interaction only, both mediation and interaction, neither mediation nor interaction within the counterfactual framework. Finally, we compare our proposed decomposition to an existing method in a non-sequential two-mediator scenario using simulated data, and illustrate the proposed decomposition for a sequential two-mediator scenario using a real data analysis.

中介分析已在许多学科中使用，通过包含中介来解释暴露变量和结果变量之间观察到的关系的机制或过程。在过去十年中，暴露变量的总效应(TE)分解为表征中介途径和相互作用的效应已获得越来越多的兴趣。在这项工作中，我们为两个中介是因果顺序或非顺序的场景开发了分解。目前这一领域的发展主要集中在没有相互作用成分的分解或有相互作用但假设介质之间没有因果顺序的分解。我们提出了一个新的概念，称为自然介导的相互作用(MI)效应，它捕获了两种情况下的双向和三向相互作用，并扩展了文献中的双向MI。我们开发了一种统一的方法，将TE分解为仅由于中介，仅由于互动，中介和互动，在反事实框架内既不中介也不互动的影响。最后，我们使用模拟数据将我们提出的分解方法与非顺序双介质场景中的现有方法进行了比较，并使用真实数据分析说明了顺序双介质场景的拟议分解方法。

{"title":"Decomposition of the total effect for two mediators: A natural mediated interaction effect framework.","authors":"Xin Gao, Li Li, Li Luo","doi":"10.1515/jci-2020-0017","DOIUrl":"https://doi.org/10.1515/jci-2020-0017","url":null,"abstract":"<p><p>Mediation analysis has been used in many disciplines to explain the mechanism or process that underlies an observed relationship between an exposure variable and an outcome variable via the inclusion of mediators. Decompositions of the total effect (TE) of an exposure variable into effects characterizing mediation pathways and interactions have gained an increasing amount of interest in the last decade. In this work, we develop decompositions for scenarios where two mediators are causally sequential or non-sequential. Current developments in this area have primarily focused on either decompositions without interaction components or with interactions but assuming no causally sequential order between the mediators. We propose a new concept called natural mediated interaction (MI) effect that captures the two-way and three-way interactions for both scenarios and extends the two-way MIs in the literature. We develop a unified approach for decomposing the TE into the effects that are due to mediation only, interaction only, both mediation and interaction, neither mediation nor interaction within the counterfactual framework. Finally, we compare our proposed decomposition to an existing method in a non-sequential two-mediator scenario using simulated data, and illustrate the proposed decomposition for a sequential two-mediator scenario using a real data analysis.</p>","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"10 1","pages":"18-44"},"PeriodicalIF":1.4,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139468/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10600650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Sensitivity analysis for causal effects with generalized linear models 广义线性模型因果效应的敏感性分析

IF 1.4 4区医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Journal of Causal Inference

Pub Date : 2022-01-01 DOI: 10.1515/jci-2022-0040

A. Sjölander, E. Gabriel, I. Ciocănea-Teodorescu

Abstract Residual confounding is a common source of bias in observational studies. In this article, we build upon a series of sensitivity analyses methods for residual confounding developed by Brumback et al. and Chiba whose sensitivity parameters are constructed to quantify deviation from conditional exchangeability, given measured confounders. These sensitivity parameters are combined with the observed data to produce a “bias-corrected” estimate of the causal effect of interest. We provide important generalizations of these sensitivity analyses, by allowing for arbitrary exposures and a wide range of different causal effect measures, through the specification of the target causal effect as a parameter in a generalized linear model with the arbitrary link function. We show how our generalized sensitivity analysis can be easily implemented with standard software, and how its sensitivity parameters can be calibrated against measured confounders. We demonstrate our sensitivity analysis with an application to publicly available data from a cohort study of behavior patterns and coronary heart disease.

残留混杂是观察性研究中常见的偏倚来源。在本文中，我们建立在Brumback等人和Chiba开发的一系列残余混杂的敏感性分析方法的基础上，这些方法的敏感性参数被构建为量化给定测量混杂因素的条件可交换性偏差。这些敏感性参数与观察到的数据相结合，产生对感兴趣的因果效应的“偏差校正”估计。通过在具有任意链接函数的广义线性模型中指定目标因果效应作为参数，通过允许任意暴露和广泛的不同因果效应测量，我们提供了这些敏感性分析的重要概括。我们展示了如何使用标准软件轻松实现广义灵敏度分析，以及如何根据测量的混杂因素校准其灵敏度参数。我们通过对行为模式和冠心病的队列研究的公开可用数据的应用来证明我们的敏感性分析。

引用次数: 0