首页 > 最新文献

Journal of Causal Inference最新文献

英文 中文
Novel bounds for causal effects based on sensitivity parameters on the risk difference scale 基于风险差异量表上敏感性参数的因果效应的新界限
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2021-01-01 DOI: 10.1515/jci-2021-0024
A. Sjölander, O. Hössjer
Abstract Unmeasured confounding is an important threat to the validity of observational studies. A common way to deal with unmeasured confounding is to compute bounds for the causal effect of interest, that is, a range of values that is guaranteed to include the true effect, given the observed data. Recently, bounds have been proposed that are based on sensitivity parameters, which quantify the degree of unmeasured confounding on the risk ratio scale. These bounds can be used to compute an E-value, that is, the degree of confounding required to explain away an observed association, on the risk ratio scale. We complement and extend this previous work by deriving analogous bounds, based on sensitivity parameters on the risk difference scale. We show that our bounds can also be used to compute an E-value, on the risk difference scale. We compare our novel bounds with previous bounds through a real data example and a simulation study.
未测量的混杂是对观察性研究有效性的一个重要威胁。处理不可测量的混杂的一种常用方法是计算感兴趣的因果效应的界限,即在给定观测数据的情况下,保证包含真实效应的值范围。最近,人们提出了基于敏感性参数的界限,它量化了风险比尺度上未测量的混杂程度。这些界限可以用来计算e值,即在风险比尺度上解释观察到的关联所需的混淆程度。我们通过推导基于风险差异尺度上的敏感性参数的类似边界来补充和扩展先前的工作。我们表明,我们的界限也可以用于计算风险差尺度上的e值。通过一个实际的数据例子和仿真研究,比较了我们的新边界和以前的边界。
{"title":"Novel bounds for causal effects based on sensitivity parameters on the risk difference scale","authors":"A. Sjölander, O. Hössjer","doi":"10.1515/jci-2021-0024","DOIUrl":"https://doi.org/10.1515/jci-2021-0024","url":null,"abstract":"Abstract Unmeasured confounding is an important threat to the validity of observational studies. A common way to deal with unmeasured confounding is to compute bounds for the causal effect of interest, that is, a range of values that is guaranteed to include the true effect, given the observed data. Recently, bounds have been proposed that are based on sensitivity parameters, which quantify the degree of unmeasured confounding on the risk ratio scale. These bounds can be used to compute an E-value, that is, the degree of confounding required to explain away an observed association, on the risk ratio scale. We complement and extend this previous work by deriving analogous bounds, based on sensitivity parameters on the risk difference scale. We show that our bounds can also be used to compute an E-value, on the risk difference scale. We compare our novel bounds with previous bounds through a real data example and a simulation study.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"191 1","pages":"190 - 210"},"PeriodicalIF":1.4,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83067085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Instrumental variable regression via kernel maximum moment loss 通过核最大矩损失的工具变量回归
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2020-10-15 DOI: 10.1515/jci-2022-0073
Rui Zhang, M. Imaizumi, B. Scholkopf, Krikamol Muandet
Abstract We investigate a simple objective for nonlinear instrumental variable (IV) regression based on a kernelized conditional moment restriction known as a maximum moment restriction (MMR). The MMR objective is formulated by maximizing the interaction between the residual and the instruments belonging to a unit ball in a reproducing kernel Hilbert space. First, it allows us to simplify the IV regression as an empirical risk minimization problem, where the risk function depends on the reproducing kernel on the instrument and can be estimated by a U-statistic or V-statistic. Second, on the basis this simplification, we are able to provide consistency and asymptotic normality results in both parametric and nonparametric settings. Finally, we provide easy-to-use IV regression algorithms with an efficient hyperparameter selection procedure. We demonstrate the effectiveness of our algorithms using experiments on both synthetic and real-world data.
我们研究了一个基于核化条件矩约束的非线性工具变量(IV)回归的简单目标,即最大矩约束(MMR)。MMR目标是通过在再现核希尔伯特空间中最大化残差和属于单位球的仪器之间的相互作用来制定的。首先,它允许我们将IV回归简化为经验风险最小化问题,其中风险函数依赖于仪器上的再现核,并且可以通过u统计量或v统计量进行估计。其次,在此简化的基础上,我们能够在参数和非参数设置中提供一致性和渐近正态性结果。最后,我们提供了易于使用的IV回归算法与一个有效的超参数选择程序。我们通过合成数据和真实数据的实验证明了算法的有效性。
{"title":"Instrumental variable regression via kernel maximum moment loss","authors":"Rui Zhang, M. Imaizumi, B. Scholkopf, Krikamol Muandet","doi":"10.1515/jci-2022-0073","DOIUrl":"https://doi.org/10.1515/jci-2022-0073","url":null,"abstract":"Abstract We investigate a simple objective for nonlinear instrumental variable (IV) regression based on a kernelized conditional moment restriction known as a maximum moment restriction (MMR). The MMR objective is formulated by maximizing the interaction between the residual and the instruments belonging to a unit ball in a reproducing kernel Hilbert space. First, it allows us to simplify the IV regression as an empirical risk minimization problem, where the risk function depends on the reproducing kernel on the instrument and can be estimated by a U-statistic or V-statistic. Second, on the basis this simplification, we are able to provide consistency and asymptotic normality results in both parametric and nonparametric settings. Finally, we provide easy-to-use IV regression algorithms with an efficient hyperparameter selection procedure. We demonstrate the effectiveness of our algorithms using experiments on both synthetic and real-world data.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2020-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83678311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Learning linear non-Gaussian graphical models with multidirected edges 学习具有多向边的线性非高斯图形模型
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2020-10-11 DOI: 10.1515/jci-2020-0027
Yiheng Liu, Elina Robeva, Huanqing Wang
Abstract In this article, we propose a new method to learn the underlying acyclic mixed graph of a linear non-Gaussian structural equation model with given observational data. We build on an algorithm proposed by Wang and Drton, and we show that one can augment the hidden variable structure of the recovered model by learning multidirected edges rather than only directed and bidirected ones. Multidirected edges appear when more than two of the observed variables have a hidden common cause. We detect the presence of such hidden causes by looking at higher order cumulants and exploiting the multi-trek rule. Our method recovers the correct structure when the underlying graph is a bow-free acyclic mixed graph with potential multidirected edges.
本文提出了一种新的方法来学习具有给定观测数据的线性非高斯结构方程模型的底层无环混合图。我们在Wang和Drton提出的算法的基础上,证明了可以通过学习多向边来增强恢复模型的隐变量结构,而不仅仅是有向边和双向边。当两个以上的观测变量有一个隐藏的共同原因时,就会出现多向边。我们通过观察高阶累积量和利用多重跋涉规则来检测这些隐藏原因的存在。当底层图是具有潜在多向边的无弓无环混合图时,我们的方法恢复了正确的结构。
{"title":"Learning linear non-Gaussian graphical models with multidirected edges","authors":"Yiheng Liu, Elina Robeva, Huanqing Wang","doi":"10.1515/jci-2020-0027","DOIUrl":"https://doi.org/10.1515/jci-2020-0027","url":null,"abstract":"Abstract In this article, we propose a new method to learn the underlying acyclic mixed graph of a linear non-Gaussian structural equation model with given observational data. We build on an algorithm proposed by Wang and Drton, and we show that one can augment the hidden variable structure of the recovered model by learning multidirected edges rather than only directed and bidirected ones. Multidirected edges appear when more than two of the observed variables have a hidden common cause. We detect the presence of such hidden causes by looking at higher order cumulants and exploiting the multi-trek rule. Our method recovers the correct structure when the underlying graph is a bow-free acyclic mixed graph with potential multidirected edges.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"12 1","pages":"250 - 263"},"PeriodicalIF":1.4,"publicationDate":"2020-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84219125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Randomized graph cluster randomization 随机图聚类随机化
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2020-09-04 DOI: 10.1515/jci-2022-0014
J. Ugander, Hao Yin
Abstract The global average treatment effect (GATE) is a primary quantity of interest in the study of causal inference under network interference. With a correctly specified exposure model of the interference, the Horvitz–Thompson (HT) and Hájek estimators of the GATE are unbiased and consistent, respectively, yet known to exhibit extreme variance under many designs and in many settings of interest. With a fixed clustering of the interference graph, graph cluster randomization (GCR) designs have been shown to greatly reduce variance compared to node-level random assignment, but even so the variance is still often prohibitively large. In this work, we propose a randomized version of the GCR design, descriptively named randomized graph cluster randomization (RGCR), which uses a random clustering rather than a single fixed clustering. By considering an ensemble of many different clustering assignments, this design avoids a key problem with GCR where the network exposure probability of a given node can be exponentially small in a single clustering. We propose two inherently randomized graph decomposition algorithms for use with RGCR designs, randomized 3-net and 1-hop-max, adapted from the prior work on multiway graph cut problems and the probabilistic approximation of (graph) metrics. We also propose weighted extensions of these two algorithms with slight additional advantages. All these algorithms result in network exposure probabilities that can be estimated efficiently. We derive structure-dependent upper bounds on the variance of the HT estimator of the GATE, depending on the metric structure of the graph driving the interference. Where the best-known such upper bound for the HT estimator under a GCR design is exponential in the parameters of the metric structure, we give a comparable upper bound under RGCR that is instead polynomial in the same parameters. We provide extensive simulations comparing RGCR and GCR designs, observing substantial improvements in GATE estimation in a variety of settings.
摘要全局平均处理效应(global average treatment effect, GATE)是研究网络干扰下因果推理的一个重要参数。有了正确指定的干扰暴露模型,GATE的Horvitz-Thompson (HT)和Hájek估计器分别是无偏和一致的,但已知在许多设计和许多感兴趣的设置下表现出极端的差异。对于干扰图的固定聚类,与节点级随机分配相比,图簇随机化(GCR)设计已被证明可以大大减少方差,但即使如此,方差仍然经常大得令人难以置信。在这项工作中,我们提出了一个随机版本的GCR设计,描述性地命名为随机图聚类随机化(RGCR),它使用随机聚类而不是单个固定聚类。通过考虑许多不同聚类分配的集成,该设计避免了GCR的一个关键问题,即给定节点的网络暴露概率在单个聚类中可能呈指数级小。我们提出了两种用于RGCR设计的固有随机图分解算法,随机3-net和1-hop-max,它们改编自先前关于多路图切问题和(图)度量的概率逼近的工作。我们还提出了这两种算法的加权扩展,并增加了一些额外的优点。所有这些算法都可以有效地估计网络暴露概率。我们根据驱动干涉的图的度量结构,推导出GATE的HT估计量方差的结构相关的上界。其中最著名的在GCR设计下的HT估计量的上界在度量结构的参数中是指数的,我们给出了RGCR下的一个类似的上界,它在相同的参数中是多项式的。我们提供了比较RGCR和GCR设计的广泛模拟,观察到在各种设置下GATE估计的实质性改进。
{"title":"Randomized graph cluster randomization","authors":"J. Ugander, Hao Yin","doi":"10.1515/jci-2022-0014","DOIUrl":"https://doi.org/10.1515/jci-2022-0014","url":null,"abstract":"Abstract The global average treatment effect (GATE) is a primary quantity of interest in the study of causal inference under network interference. With a correctly specified exposure model of the interference, the Horvitz–Thompson (HT) and Hájek estimators of the GATE are unbiased and consistent, respectively, yet known to exhibit extreme variance under many designs and in many settings of interest. With a fixed clustering of the interference graph, graph cluster randomization (GCR) designs have been shown to greatly reduce variance compared to node-level random assignment, but even so the variance is still often prohibitively large. In this work, we propose a randomized version of the GCR design, descriptively named randomized graph cluster randomization (RGCR), which uses a random clustering rather than a single fixed clustering. By considering an ensemble of many different clustering assignments, this design avoids a key problem with GCR where the network exposure probability of a given node can be exponentially small in a single clustering. We propose two inherently randomized graph decomposition algorithms for use with RGCR designs, randomized 3-net and 1-hop-max, adapted from the prior work on multiway graph cut problems and the probabilistic approximation of (graph) metrics. We also propose weighted extensions of these two algorithms with slight additional advantages. All these algorithms result in network exposure probabilities that can be estimated efficiently. We derive structure-dependent upper bounds on the variance of the HT estimator of the GATE, depending on the metric structure of the graph driving the interference. Where the best-known such upper bound for the HT estimator under a GCR design is exponential in the parameters of the metric structure, we give a comparable upper bound under RGCR that is instead polynomial in the same parameters. We provide extensive simulations comparing RGCR and GCR designs, observing substantial improvements in GATE estimation in a variety of settings.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"77 2-3 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2020-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78140869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Estimating causal effects with the neural autoregressive density estimator 用神经自回归密度估计器估计因果效应
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2020-08-17 DOI: 10.1515/jci-2020-0007
Sergio Garrido, S. Borysov, Jeppe Rich, F. Pereira
Abstract The estimation of causal effects is fundamental in situations where the underlying system will be subject to active interventions. Part of building a causal inference engine is defining how variables relate to each other, that is, defining the functional relationship between variables entailed by the graph conditional dependencies. In this article, we deviate from the common assumption of linear relationships in causal models by making use of neural autoregressive density estimators and use them to estimate causal effects within Pearl’s do-calculus framework. Using synthetic data, we show that the approach can retrieve causal effects from non-linear systems without explicitly modeling the interactions between the variables and include confidence bands using the non-parametric bootstrap. We also explore scenarios that deviate from the ideal causal effect estimation setting such as poor data support or unobserved confounders.
在基础系统将受到积极干预的情况下,因果效应的估计是基本的。构建因果推理引擎的一部分工作是定义变量如何相互关联,也就是说,定义由图条件依赖关系所包含的变量之间的函数关系。在本文中,我们通过使用神经自回归密度估计器来偏离因果模型中线性关系的常见假设,并使用它们来估计Pearl的do-calculus框架内的因果效应。使用合成数据,我们表明该方法可以从非线性系统中检索因果效应,而无需显式建模变量之间的相互作用,并使用非参数自举包括置信带。我们还探讨了偏离理想因果效应估计设置的情况,如数据支持不足或未观察到的混杂因素。
{"title":"Estimating causal effects with the neural autoregressive density estimator","authors":"Sergio Garrido, S. Borysov, Jeppe Rich, F. Pereira","doi":"10.1515/jci-2020-0007","DOIUrl":"https://doi.org/10.1515/jci-2020-0007","url":null,"abstract":"Abstract The estimation of causal effects is fundamental in situations where the underlying system will be subject to active interventions. Part of building a causal inference engine is defining how variables relate to each other, that is, defining the functional relationship between variables entailed by the graph conditional dependencies. In this article, we deviate from the common assumption of linear relationships in causal models by making use of neural autoregressive density estimators and use them to estimate causal effects within Pearl’s do-calculus framework. Using synthetic data, we show that the approach can retrieve causal effects from non-linear systems without explicitly modeling the interactions between the variables and include confidence bands using the non-parametric bootstrap. We also explore scenarios that deviate from the ideal causal effect estimation setting such as poor data support or unobserved confounders.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"2 1","pages":"211 - 228"},"PeriodicalIF":1.4,"publicationDate":"2020-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86219822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Conditional as-if analyses in randomized experiments 随机实验中的条件假设分析
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2020-08-03 DOI: 10.1515/jci-2021-0012
Nicole E. Pashley, Guillaume W. Basse, Luke W. Miratrix
Abstract The injunction to “analyze the way you randomize” is well known to statisticians since Fisher advocated for randomization as the basis of inference. Yet even those convinced by the merits of randomization-based inference seldom follow this injunction to the letter. Bernoulli randomized experiments are often analyzed as completely randomized experiments, and completely randomized experiments are analyzed as if they had been stratified; more generally, it is not uncommon to analyze an experiment as if it had been randomized differently. This article examines the theoretical foundation behind this practice within a randomization-based framework. Specifically, we ask when is it legitimate to analyze an experiment randomized according to one design as if it had been randomized according to some other design. We show that a sufficient condition for this type of analysis to be valid is that the design used for analysis should be derived from the original design by an appropriate form of conditioning. We use our theory to justify certain existing methods, question others, and finally suggest new methodological insights such as conditioning on approximate covariate balance.
自从费雪主张将随机化作为推理的基础以来,统计学家就熟知“以随机化的方式进行分析”这条戒律。然而,即使是那些相信基于随机的推理的优点的人也很少严格遵守这一禁令。伯努利随机实验通常被分析为完全随机实验,完全随机实验被分析为分层;更一般地说,分析一个实验,就好像它是随机的一样,这并不罕见。本文在基于随机化的框架中研究了这一实践背后的理论基础。具体来说,我们要问的是,在什么情况下,根据一种随机设计来分析一个实验,就像它是根据另一种随机设计来分析一样,是合理的。我们证明,这种分析有效的充分条件是,用于分析的设计应通过适当形式的条件作用从原始设计中推导出来。我们用我们的理论来证明某些现有方法,质疑其他方法,并最终提出新的方法见解,如近似协变量平衡的条件。
{"title":"Conditional as-if analyses in randomized experiments","authors":"Nicole E. Pashley, Guillaume W. Basse, Luke W. Miratrix","doi":"10.1515/jci-2021-0012","DOIUrl":"https://doi.org/10.1515/jci-2021-0012","url":null,"abstract":"Abstract The injunction to “analyze the way you randomize” is well known to statisticians since Fisher advocated for randomization as the basis of inference. Yet even those convinced by the merits of randomization-based inference seldom follow this injunction to the letter. Bernoulli randomized experiments are often analyzed as completely randomized experiments, and completely randomized experiments are analyzed as if they had been stratified; more generally, it is not uncommon to analyze an experiment as if it had been randomized differently. This article examines the theoretical foundation behind this practice within a randomization-based framework. Specifically, we ask when is it legitimate to analyze an experiment randomized according to one design as if it had been randomized according to some other design. We show that a sufficient condition for this type of analysis to be valid is that the design used for analysis should be derived from the original design by an appropriate form of conditioning. We use our theory to justify certain existing methods, question others, and finally suggest new methodological insights such as conditioning on approximate covariate balance.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"92 1","pages":"264 - 284"},"PeriodicalIF":1.4,"publicationDate":"2020-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83798256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Properties of restricted randomization with implications for experimental design 限制随机化的性质及其对实验设计的影响
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2020-06-26 DOI: 10.1515/jci-2021-0057
Mattias Nordin, M. Schultzberg
Abstract Recently, there has been increasing interest in the use of heavily restricted randomization designs which enforce balance on observed covariates in randomized controlled trials. However, when restrictions are strict, there is a risk that the treatment effect estimator will have a very high mean squared error (MSE). In this article, we formalize this risk and propose a novel combinatoric-based approach to describe and address this issue. First, we validate our new approach by re-proving some known properties of complete randomization and restricted randomization. Second, we propose a novel diagnostic measure for restricted designs that only use the information embedded in the combinatorics of the design. Third, we show that the variance of the MSE of the difference-in-means estimator in a randomized experiment is a linear function of this diagnostic measure. Finally, we identify situations in which restricted designs can lead to an increased risk of getting a high MSE and discuss how our diagnostic measure can be used to detect such designs. Our results have implications for any restricted randomization design and can be used to evaluate the trade-off between enforcing balance on observed covariates and avoiding too restrictive designs.
摘要近年来,人们对使用严格限制的随机化设计越来越感兴趣,这种设计在随机对照试验中强制平衡观察到的协变量。然而,当限制很严格时,存在治疗效果估计量具有非常高的均方误差(MSE)的风险。在本文中,我们将这种风险形式化,并提出一种新的基于组合的方法来描述和解决这个问题。首先,我们通过重新证明完全随机化和受限随机化的一些已知性质来验证我们的新方法。其次,我们提出了一种新的诊断方法,用于仅使用嵌入在设计组合中的信息的限制性设计。第三,我们证明了随机实验中均值差估计量的MSE方差是该诊断度量的线性函数。最后,我们确定了限制性设计可能导致获得高MSE风险增加的情况,并讨论了如何使用我们的诊断措施来检测此类设计。我们的研究结果对任何限制性随机化设计都有启示,可以用来评估强制观察协变量平衡和避免过于严格的设计之间的权衡。
{"title":"Properties of restricted randomization with implications for experimental design","authors":"Mattias Nordin, M. Schultzberg","doi":"10.1515/jci-2021-0057","DOIUrl":"https://doi.org/10.1515/jci-2021-0057","url":null,"abstract":"Abstract Recently, there has been increasing interest in the use of heavily restricted randomization designs which enforce balance on observed covariates in randomized controlled trials. However, when restrictions are strict, there is a risk that the treatment effect estimator will have a very high mean squared error (MSE). In this article, we formalize this risk and propose a novel combinatoric-based approach to describe and address this issue. First, we validate our new approach by re-proving some known properties of complete randomization and restricted randomization. Second, we propose a novel diagnostic measure for restricted designs that only use the information embedded in the combinatorics of the design. Third, we show that the variance of the MSE of the difference-in-means estimator in a randomized experiment is a linear function of this diagnostic measure. Finally, we identify situations in which restricted designs can lead to an increased risk of getting a high MSE and discuss how our diagnostic measure can be used to detect such designs. Our results have implications for any restricted randomization design and can be used to evaluate the trade-off between enforcing balance on observed covariates and avoiding too restrictive designs.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"39 1","pages":"227 - 245"},"PeriodicalIF":1.4,"publicationDate":"2020-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89531588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Decision-theoretic foundations for statistical causality 统计因果关系的决策理论基础
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2020-04-26 DOI: 10.1515/jci-2020-0008
P. Dawid
Abstract We develop a mathematical and interpretative foundation for the enterprise of decision-theoretic (DT) statistical causality, which is a straightforward way of representing and addressing causal questions. DT reframes causal inference as “assisted decision-making” and aims to understand when, and how, I can make use of external data, typically observational, to help me solve a decision problem by taking advantage of assumed relationships between the data and my problem. The relationships embodied in any representation of a causal problem require deeper justification, which is necessarily context-dependent. Here we clarify the considerations needed to support applications of the DT methodology. Exchangeability considerations are used to structure the required relationships, and a distinction drawn between intention to treat and intervention to treat forms the basis for the enabling condition of “ignorability.” We also show how the DT perspective unifies and sheds light on other popular formalisations of statistical causality, including potential responses and directed acyclic graphs.
摘要:我们为决策理论(DT)统计因果关系的企业开发了一个数学和解释基础,这是一个简单的方式来表示和解决因果问题。DT将因果推理重新定义为“辅助决策”,旨在了解我何时以及如何利用外部数据,通常是观察数据,通过利用数据与问题之间的假设关系来帮助我解决决策问题。因果问题的任何表征所体现的关系都需要更深层次的论证,这必然是依赖于上下文的。在这里,我们澄清了支持DT方法应用所需的考虑因素。互换性考虑用于构建所需的关系,并且在治疗意图和干预治疗之间划出的区别形成了“可忽略性”启用条件的基础。我们还展示了DT视角如何统一和阐明其他流行的统计因果关系形式化,包括潜在响应和有向无环图。
{"title":"Decision-theoretic foundations for statistical causality","authors":"P. Dawid","doi":"10.1515/jci-2020-0008","DOIUrl":"https://doi.org/10.1515/jci-2020-0008","url":null,"abstract":"Abstract We develop a mathematical and interpretative foundation for the enterprise of decision-theoretic (DT) statistical causality, which is a straightforward way of representing and addressing causal questions. DT reframes causal inference as “assisted decision-making” and aims to understand when, and how, I can make use of external data, typically observational, to help me solve a decision problem by taking advantage of assumed relationships between the data and my problem. The relationships embodied in any representation of a causal problem require deeper justification, which is necessarily context-dependent. Here we clarify the considerations needed to support applications of the DT methodology. Exchangeability considerations are used to structure the required relationships, and a distinction drawn between intention to treat and intervention to treat forms the basis for the enabling condition of “ignorability.” We also show how the DT perspective unifies and sheds light on other popular formalisations of statistical causality, including potential responses and directed acyclic graphs.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"40 1","pages":"39 - 77"},"PeriodicalIF":1.4,"publicationDate":"2020-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79851043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
The variance of causal effect estimators for binary v-structures 二元v型结构的因果效应估计量的方差
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2020-04-20 DOI: 10.1515/jci-2021-0025
Jack Kuipers, G. Moffa
Abstract Adjusting for covariates is a well-established method to estimate the total causal effect of an exposure variable on an outcome of interest. Depending on the causal structure of the mechanism under study, there may be different adjustment sets, equally valid from a theoretical perspective, leading to identical causal effects. However, in practice, with finite data, estimators built on different sets may display different precisions. To investigate the extent of this variability, we consider the simplest non-trivial non-linear model of a v-structure on three nodes for binary data. We explicitly compute and compare the variance of the two possible different causal estimators. Further, by going beyond leading-order asymptotics, we show that there are parameter regimes where the set with the asymptotically optimal variance does depend on the edge coefficients, a result that is not captured by the recent leading-order developments for general causal models. As a practical consequence, the adjustment set selection needs to account for the relative magnitude of the relationships between variables with respect to the sample size and cannot rely on purely graphical criteria.
摘要协变量调整是一种公认的估计暴露变量对感兴趣的结果的总因果效应的方法。根据所研究机制的因果结构,可能存在不同的调整集,从理论角度来看,这些调整集同样有效,从而导致相同的因果效应。然而,在实践中,对于有限的数据,建立在不同集合上的估计器可能会显示不同的精度。为了研究这种可变性的程度,我们考虑了二进制数据的三个节点上的v结构的最简单的非平凡非线性模型。我们显式地计算和比较两种可能不同的因果估计量的方差。此外,通过超越首阶渐近,我们表明存在参数区域,其中具有渐近最优方差的集合确实依赖于边缘系数,这一结果没有被最近的一般因果模型的首阶发展所捕获。作为一个实际的结果,调整集的选择需要考虑变量之间相对于样本量的关系的相对大小,不能纯粹依赖于图形标准。
{"title":"The variance of causal effect estimators for binary v-structures","authors":"Jack Kuipers, G. Moffa","doi":"10.1515/jci-2021-0025","DOIUrl":"https://doi.org/10.1515/jci-2021-0025","url":null,"abstract":"Abstract Adjusting for covariates is a well-established method to estimate the total causal effect of an exposure variable on an outcome of interest. Depending on the causal structure of the mechanism under study, there may be different adjustment sets, equally valid from a theoretical perspective, leading to identical causal effects. However, in practice, with finite data, estimators built on different sets may display different precisions. To investigate the extent of this variability, we consider the simplest non-trivial non-linear model of a v-structure on three nodes for binary data. We explicitly compute and compare the variance of the two possible different causal estimators. Further, by going beyond leading-order asymptotics, we show that there are parameter regimes where the set with the asymptotically optimal variance does depend on the edge coefficients, a result that is not captured by the recent leading-order developments for general causal models. As a practical consequence, the adjustment set selection needs to account for the relative magnitude of the relationships between variables with respect to the sample size and cannot rely on purely graphical criteria.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"41 1","pages":"90 - 105"},"PeriodicalIF":1.4,"publicationDate":"2020-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81036119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A generalized double robust Bayesian model averaging approach to causal effect estimation with application to the study of osteoporotic fractures 基于广义双鲁棒贝叶斯模型平均法的因果效应估计及其在骨质疏松性骨折研究中的应用
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2020-03-25 DOI: 10.1515/jci-2021-0023
D. Talbot, C. Beaudoin
Abstract Analysts often use data-driven approaches to supplement their knowledge when selecting covariates for effect estimation. Multiple variable selection procedures for causal effect estimation have been devised in recent years, but additional developments are still required to adequately address the needs of analysts. We propose a generalized Bayesian causal effect estimation (GBCEE) algorithm to perform variable selection and produce double robust (DR) estimates of causal effects for binary or continuous exposures and outcomes. GBCEE employs a prior distribution that targets the selection of true confounders and predictors of the outcome for the unbiased estimation of causal effects with reduced standard errors. The Bayesian machinery allows GBCEE to directly produce inferences for its estimate. In simulations, GBCEE was observed to perform similarly or to outperform DR alternatives. Its ability to directly produce inferences is also an important advantage from a computational perspective. The method is finally illustrated for the estimation of the effect of meeting physical activity recommendations on the risk of hip or upper-leg fractures among older women in the study of osteoporotic fractures. The 95% confidence interval produced by GBCEE is 61% narrower than that of a DR estimator adjusting for all potential confounders in this illustration.
分析人员在选择协变量进行效果估计时,经常使用数据驱动的方法来补充他们的知识。近年来,因果效应估计的多变量选择程序已经被设计出来,但仍然需要进一步的发展来充分满足分析人员的需求。我们提出了一种广义贝叶斯因果效应估计(GBCEE)算法来执行变量选择,并对二元或连续暴露和结果的因果效应产生双鲁棒(DR)估计。GBCEE采用先验分布,目标是选择真正的混杂因素和结果的预测因子,以减少标准误差,对因果效应进行无偏估计。贝叶斯机制允许GBCEE直接为其估计产生推论。在模拟中,GBCEE被观察到表现类似或优于DR替代方案。从计算的角度来看,它直接产生推理的能力也是一个重要的优势。最后,在骨质疏松性骨折的研究中,该方法被用于评估满足体力活动建议对老年妇女髋部或上肢骨折风险的影响。GBCEE产生的95%置信区间比本例中调整所有潜在混杂因素的DR估计器的置信区间窄61%。
{"title":"A generalized double robust Bayesian model averaging approach to causal effect estimation with application to the study of osteoporotic fractures","authors":"D. Talbot, C. Beaudoin","doi":"10.1515/jci-2021-0023","DOIUrl":"https://doi.org/10.1515/jci-2021-0023","url":null,"abstract":"Abstract Analysts often use data-driven approaches to supplement their knowledge when selecting covariates for effect estimation. Multiple variable selection procedures for causal effect estimation have been devised in recent years, but additional developments are still required to adequately address the needs of analysts. We propose a generalized Bayesian causal effect estimation (GBCEE) algorithm to perform variable selection and produce double robust (DR) estimates of causal effects for binary or continuous exposures and outcomes. GBCEE employs a prior distribution that targets the selection of true confounders and predictors of the outcome for the unbiased estimation of causal effects with reduced standard errors. The Bayesian machinery allows GBCEE to directly produce inferences for its estimate. In simulations, GBCEE was observed to perform similarly or to outperform DR alternatives. Its ability to directly produce inferences is also an important advantage from a computational perspective. The method is finally illustrated for the estimation of the effect of meeting physical activity recommendations on the risk of hip or upper-leg fractures among older women in the study of osteoporotic fractures. The 95% confidence interval produced by GBCEE is 61% narrower than that of a DR estimator adjusting for all potential confounders in this illustration.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"3 1","pages":"335 - 371"},"PeriodicalIF":1.4,"publicationDate":"2020-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78500321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Journal of Causal Inference
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1