首页 > 最新文献

Journal of Causal Inference最新文献

英文 中文
Doubly robust estimators for generalizing treatment effects on survival outcomes from randomized controlled trials to a target population. 将随机对照试验对生存结果的治疗效果推广到目标人群的双稳健估计器。
IF 1.7 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-01-01 Epub Date: 2022-12-09 DOI: 10.1515/jci-2022-0004
Dasom Lee, Shu Yang, Xiaofei Wang

In the presence of heterogeneity between the randomized controlled trial (RCT) participants and the target population, evaluating the treatment effect solely based on the RCT often leads to biased quantification of the real-world treatment effect. To address the problem of lack of generalizability for the treatment effect estimated by the RCT sample, we leverage observational studies with large samples that are representative of the target population. This article concerns evaluating treatment effects on survival outcomes for a target population and considers a broad class of estimands that are functionals of treatment-specific survival functions, including differences in survival probability and restricted mean survival times. Motivated by two intuitive but distinct approaches, i.e., imputation based on survival outcome regression and weighting based on inverse probability of sampling, censoring, and treatment assignment, we propose a semiparametric estimator through the guidance of the efficient influence function. The proposed estimator is doubly robust in the sense that it is consistent for the target population estimands if either the survival model or the weighting model is correctly specified and is locally efficient when both are correct. In addition, as an alternative to parametric estimation, we employ the nonparametric method of sieves for flexible and robust estimation of the nuisance functions and show that the resulting estimator retains the root-n consistency and efficiency, the so-called rate-double robustness. Simulation studies confirm the theoretical properties of the proposed estimator and show that it outperforms competitors. We apply the proposed method to estimate the effect of adjuvant chemotherapy on survival in patients with early-stage resected non-small cell lung cancer.

在随机对照试验(RCT)参与者与目标人群之间存在异质性的情况下,仅根据 RCT 评估治疗效果往往会导致对真实世界治疗效果的量化存在偏差。为了解决随机对照试验样本估计的治疗效果缺乏普遍性的问题,我们利用了具有目标人群代表性的大样本观察研究。本文涉及评估治疗对目标人群生存结果的影响,并考虑了作为治疗特异性生存函数的一大类估计值,包括生存概率差异和受限平均生存时间。受两种直观但截然不同的方法(即基于生存结果回归的估算和基于抽样、普查和治疗分配的逆概率的加权)的启发,我们通过有效影响函数的指导提出了一种半参数估计器。所提出的估计器具有双重稳健性,即如果生存模型或加权模型中的任何一个指定正确,它对于目标人群估计值都是一致的;如果两个模型都正确,它就是局部有效的。此外,作为参数估计的替代方法,我们还采用了非参数筛分法对骚扰函数进行灵活稳健的估计,并证明所得到的估计值保持了根n一致性和效率,即所谓的率双稳健性。模拟研究证实了所提估计方法的理论特性,并表明它优于竞争对手。我们将提出的方法用于估计辅助化疗对早期切除的非小细胞肺癌患者生存期的影响。
{"title":"Doubly robust estimators for generalizing treatment effects on survival outcomes from randomized controlled trials to a target population.","authors":"Dasom Lee, Shu Yang, Xiaofei Wang","doi":"10.1515/jci-2022-0004","DOIUrl":"10.1515/jci-2022-0004","url":null,"abstract":"<p><p>In the presence of heterogeneity between the randomized controlled trial (RCT) participants and the target population, evaluating the treatment effect solely based on the RCT often leads to biased quantification of the real-world treatment effect. To address the problem of lack of generalizability for the treatment effect estimated by the RCT sample, we leverage observational studies with large samples that are representative of the target population. This article concerns evaluating treatment effects on survival outcomes for a target population and considers a broad class of estimands that are functionals of treatment-specific survival functions, including differences in survival probability and restricted mean survival times. Motivated by two intuitive but distinct approaches, i.e., imputation based on survival outcome regression and weighting based on inverse probability of sampling, censoring, and treatment assignment, we propose a semiparametric estimator through the guidance of the efficient influence function. The proposed estimator is doubly robust in the sense that it is consistent for the target population estimands if either the survival model or the weighting model is correctly specified and is locally efficient when both are correct. In addition, as an alternative to parametric estimation, we employ the nonparametric method of sieves for flexible and robust estimation of the nuisance functions and show that the resulting estimator retains the root-<i>n</i> consistency and efficiency, the so-called rate-double robustness. Simulation studies confirm the theoretical properties of the proposed estimator and show that it outperforms competitors. We apply the proposed method to estimate the effect of adjuvant chemotherapy on survival in patients with early-stage resected non-small cell lung cancer.</p>","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"10 1","pages":"415-440"},"PeriodicalIF":1.7,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10457100/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10119183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying HIV sequences that escape antibody neutralization using random forests and collaborative targeted learning 利用随机森林和协作目标学习识别逃避抗体中和的HIV序列
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-01-01 DOI: 10.1515/jci-2021-0053
Yutong Jin, D. Benkeser
Abstract Recent studies have indicated that it is possible to protect individuals from HIV infection using passive infusion of monoclonal antibodies. However, in order for monoclonal antibodies to confer robust protection, the antibodies must be capable of neutralizing many possible strains of the virus. This is particularly challenging in the context of a highly diverse pathogen like HIV. It is therefore of great interest to leverage existing observational data sources to discover antibodies that are able to neutralize HIV viruses via residues where existing antibodies show modest protection. Such information feeds directly into the clinical trial pipeline for monoclonal antibody therapies by providing information on (i) whether and to what extent combinations of antibodies can generate superior protection and (ii) strategies for analyzing past clinical trials to identify in vivo evidence of antibody resistance. These observational data include genetic features of many diverse HIV genetic sequences, as well as in vitro measures of antibody resistance. The statistical learning problem we are interested in is developing statistical methodology that can be used to analyze these data to identify important genetic features that are significantly associated with antibody resistance. This is a challenging problem owing to the high-dimensional and strongly correlated nature of the genetic sequence data. To overcome these challenges, we propose an outcome-adaptive, collaborative targeted minimum loss-based estimation approach using random forests. We demonstrate via simulation that the approach enjoys important statistical benefits over existing approaches in terms of bias, mean squared error, and type I error. We apply the approach to the Compile, Analyze, and Tally Nab Panels database to identify AA positions that are potentially causally related to resistance to neutralization by several different antibodies.
最近的研究表明,被动输注单克隆抗体可以保护个体免受HIV感染。然而,为了使单克隆抗体具有强大的保护作用,抗体必须能够中和许多可能的病毒株。在艾滋病毒等高度多样化的病原体的背景下,这尤其具有挑战性。因此,利用现有的观察数据源来发现能够通过现有抗体显示适度保护的残基来中和艾滋病毒的抗体是非常有兴趣的。这些信息通过提供以下信息,直接输入到单克隆抗体治疗的临床试验管道中:(i)抗体组合是否以及在多大程度上可以产生更好的保护;(ii)分析过去临床试验的策略,以确定体内抗体耐药性的证据。这些观察数据包括许多不同HIV基因序列的遗传特征,以及抗体耐药性的体外测量。我们感兴趣的统计学习问题是开发统计方法,可以用来分析这些数据,以确定与抗体耐药性显著相关的重要遗传特征。由于基因序列数据的高维性和强相关性,这是一个具有挑战性的问题。为了克服这些挑战,我们提出了一种基于随机森林的结果自适应、协作目标最小损失估计方法。我们通过模拟证明,该方法在偏差、均方误差和I型误差方面比现有方法具有重要的统计优势。我们将该方法应用于编译、分析和计数Nab面板数据库,以确定可能与几种不同抗体的中和抗性有因果关系的AA位置。
{"title":"Identifying HIV sequences that escape antibody neutralization using random forests and collaborative targeted learning","authors":"Yutong Jin, D. Benkeser","doi":"10.1515/jci-2021-0053","DOIUrl":"https://doi.org/10.1515/jci-2021-0053","url":null,"abstract":"Abstract Recent studies have indicated that it is possible to protect individuals from HIV infection using passive infusion of monoclonal antibodies. However, in order for monoclonal antibodies to confer robust protection, the antibodies must be capable of neutralizing many possible strains of the virus. This is particularly challenging in the context of a highly diverse pathogen like HIV. It is therefore of great interest to leverage existing observational data sources to discover antibodies that are able to neutralize HIV viruses via residues where existing antibodies show modest protection. Such information feeds directly into the clinical trial pipeline for monoclonal antibody therapies by providing information on (i) whether and to what extent combinations of antibodies can generate superior protection and (ii) strategies for analyzing past clinical trials to identify in vivo evidence of antibody resistance. These observational data include genetic features of many diverse HIV genetic sequences, as well as in vitro measures of antibody resistance. The statistical learning problem we are interested in is developing statistical methodology that can be used to analyze these data to identify important genetic features that are significantly associated with antibody resistance. This is a challenging problem owing to the high-dimensional and strongly correlated nature of the genetic sequence data. To overcome these challenges, we propose an outcome-adaptive, collaborative targeted minimum loss-based estimation approach using random forests. We demonstrate via simulation that the approach enjoys important statistical benefits over existing approaches in terms of bias, mean squared error, and type I error. We apply the approach to the Compile, Analyze, and Tally Nab Panels database to identify AA positions that are potentially causally related to resistance to neutralization by several different antibodies.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"43 1","pages":"280 - 295"},"PeriodicalIF":1.4,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73879080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bias attenuation results for dichotomization of a continuous confounder 偏置衰减的结果为二分类的连续混杂
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-01-01 DOI: 10.1515/jci-2022-0047
E. Gabriel, J. M. Pena, A. Sjölander
Abstract It is well-known that dichotomization can cause bias and loss of efficiency in estimation. One can easily construct examples where adjusting for a dichotomized confounder causes bias in causal estimation. There are additional examples in the literature where adjusting for a dichotomized confounder can be more biased than not adjusting at all. The message is clear, do not dichotomize. What is unclear is if there are scenarios where adjusting for the dichotomized confounder always leads to lower bias than not adjusting. We propose several sets of conditions that characterize scenarios where one should always adjust for the dichotomized confounder to reduce bias. We then highlight scenarios where the decision to adjust should be made more cautiously. To our knowledge, this is the first formal presentation of conditions that give information about when one should and potentially should not adjust for a dichotomized confounder.
摘要二分类方法在估计中会引起偏差和效率损失,这是众所周知的。我们可以很容易地构造一些例子,其中调整二分类混杂因素会导致因果估计中的偏差。文献中还有其他例子,其中调整二分类混杂因素可能比根本不调整更有偏见。信息很清楚,不要一分为二。目前尚不清楚的是,是否存在对二分类混杂因素进行调整总是比不进行调整导致更低偏差的情况。我们提出了几组条件,这些条件表征了人们应该始终调整二分类混杂因素以减少偏差的情况。然后,我们重点介绍了应该更加谨慎地做出调整决定的场景。据我们所知,这是第一次正式提出条件,提供了关于何时应该和可能不应该调整二分类混杂因素的信息。
{"title":"Bias attenuation results for dichotomization of a continuous confounder","authors":"E. Gabriel, J. M. Pena, A. Sjölander","doi":"10.1515/jci-2022-0047","DOIUrl":"https://doi.org/10.1515/jci-2022-0047","url":null,"abstract":"Abstract It is well-known that dichotomization can cause bias and loss of efficiency in estimation. One can easily construct examples where adjusting for a dichotomized confounder causes bias in causal estimation. There are additional examples in the literature where adjusting for a dichotomized confounder can be more biased than not adjusting at all. The message is clear, do not dichotomize. What is unclear is if there are scenarios where adjusting for the dichotomized confounder always leads to lower bias than not adjusting. We propose several sets of conditions that characterize scenarios where one should always adjust for the dichotomized confounder to reduce bias. We then highlight scenarios where the decision to adjust should be made more cautiously. To our knowledge, this is the first formal presentation of conditions that give information about when one should and potentially should not adjust for a dichotomized confounder.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"15 1","pages":"515 - 526"},"PeriodicalIF":1.4,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88518511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Optimal weighting for estimating generalized average treatment effects 估计广义平均处理效果的最优加权
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-01-01 DOI: 10.1515/jci-2021-0018
Nathan Kallus, Michele Santacatterina
Abstract In causal inference, a variety of causal effect estimands have been studied, including the sample, uncensored, target, conditional, optimal subpopulation, and optimal weighted average treatment effects. Ad hoc methods have been developed for each estimand based on inverse probability weighting (IPW) and on outcome regression modeling, but these may be sensitive to model misspecification, practical violations of positivity, or both. The contribution of this article is twofold. First, we formulate the generalized average treatment effect (GATE) to unify these causal estimands as well as their IPW estimates. Second, we develop a method based on Kernel optimal matching (KOM) to optimally estimate GATE and to find the GATE most easily estimable by KOM, which we term the Kernel optimal weighted average treatment effect. KOM provides uniform control on the conditional mean squared error of a weighted estimator over a class of models while simultaneously controlling for precision. We study its theoretical properties and evaluate its comparative performance in a simulation study. We illustrate the use of KOM for GATE estimation in two case studies: comparing spine surgical interventions and studying the effect of peer support on people living with HIV.
在因果推理中,研究了各种因果效应估计,包括样本、未删减、目标、条件、最优亚群和最优加权平均处理效应。基于逆概率加权(IPW)和结果回归建模,已经为每个估计开发了特别的方法,但这些方法可能对模型错误规范、实际违反正性或两者都很敏感。这篇文章的贡献是双重的。首先,我们制定了广义平均处理效应(GATE)来统一这些因果估计及其IPW估计。其次,我们开发了一种基于核最优匹配(KOM)的方法来最优估计GATE,并找到最容易被KOM估计的GATE,我们称之为核最优加权平均处理效果。KOM在控制精度的同时,对一类模型上加权估计器的条件均方误差提供统一的控制。我们研究了它的理论性质,并在仿真研究中评价了它的比较性能。我们在两个案例研究中说明了KOM对GATE估计的使用:比较脊柱外科干预和研究同伴支持对艾滋病毒感染者的影响。
{"title":"Optimal weighting for estimating generalized average treatment effects","authors":"Nathan Kallus, Michele Santacatterina","doi":"10.1515/jci-2021-0018","DOIUrl":"https://doi.org/10.1515/jci-2021-0018","url":null,"abstract":"Abstract In causal inference, a variety of causal effect estimands have been studied, including the sample, uncensored, target, conditional, optimal subpopulation, and optimal weighted average treatment effects. Ad hoc methods have been developed for each estimand based on inverse probability weighting (IPW) and on outcome regression modeling, but these may be sensitive to model misspecification, practical violations of positivity, or both. The contribution of this article is twofold. First, we formulate the generalized average treatment effect (GATE) to unify these causal estimands as well as their IPW estimates. Second, we develop a method based on Kernel optimal matching (KOM) to optimally estimate GATE and to find the GATE most easily estimable by KOM, which we term the Kernel optimal weighted average treatment effect. KOM provides uniform control on the conditional mean squared error of a weighted estimator over a class of models while simultaneously controlling for precision. We study its theoretical properties and evaluate its comparative performance in a simulation study. We illustrate the use of KOM for GATE estimation in two case studies: comparing spine surgical interventions and studying the effect of peer support on people living with HIV.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"126 1","pages":"123 - 140"},"PeriodicalIF":1.4,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87728464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Decision-theoretic foundations for statistical causality: Response to Shpitser 统计因果关系的决策理论基础:对Shpitser的回应
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-01-01 DOI: 10.1515/jci-2022-0013
P. Dawid
Abstract I thank Ilya Shpitser for his comments on my article, and discuss the use of models with restricted interventions.
我感谢Ilya Shpitser对我的文章的评论,并讨论了限制干预模型的使用。
{"title":"Decision-theoretic foundations for statistical causality: Response to Shpitser","authors":"P. Dawid","doi":"10.1515/jci-2022-0013","DOIUrl":"https://doi.org/10.1515/jci-2022-0013","url":null,"abstract":"Abstract I thank Ilya Shpitser for his comments on my article, and discuss the use of models with restricted interventions.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"7 1","pages":"217 - 220"},"PeriodicalIF":1.4,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76057063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A note on efficient minimum cost adjustment sets in causal graphical models 因果图模型中有效最小成本调整集的注释
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-01-01 DOI: 10.1515/jci-2022-0015
Ezequiel Smucler, A. Rotnitzky
Abstract We study the selection of adjustment sets for estimating the interventional mean under an individualized treatment rule. We assume a non-parametric causal graphical model with, possibly, hidden variables and at least one adjustment set composed of observable variables. Moreover, we assume that observable variables have positive costs associated with them. We define the cost of an observable adjustment set as the sum of the costs of the variables that comprise it. We show that in this setting there exist adjustment sets that are minimum cost optimal, in the sense that they yield non-parametric estimators of the interventional mean with the smallest asymptotic variance among those that control for observable adjustment sets that have minimum cost. Our results are based on the construction of a special flow network associated with the original causal graph. We show that a minimum cost optimal adjustment set can be found by computing a maximum flow on the network, and then finding the set of vertices that are reachable from the source by augmenting paths. The optimaladj Python package implements the algorithms introduced in this article.
摘要研究了个体化治疗规则下介入均值估计的调整集选择。我们假设一个非参数因果图模型,可能有隐藏变量和至少一个由可观察变量组成的调整集。此外,我们假设可观察变量具有与之相关的正成本。我们将可观察调整集的成本定义为组成该调整集的变量成本的总和。我们表明,在这种情况下,存在最小成本最优的调整集,因为它们产生的干预均值的非参数估计量在那些控制具有最小成本的可观察调整集的调整集中具有最小渐近方差。我们的结果是基于与原始因果图相关联的特殊流网络的构建。我们表明,通过计算网络上的最大流量,然后通过增加路径找到从源可到达的顶点集,可以找到最小成本最优调整集。optimaladj Python包实现了本文中介绍的算法。
{"title":"A note on efficient minimum cost adjustment sets in causal graphical models","authors":"Ezequiel Smucler, A. Rotnitzky","doi":"10.1515/jci-2022-0015","DOIUrl":"https://doi.org/10.1515/jci-2022-0015","url":null,"abstract":"Abstract We study the selection of adjustment sets for estimating the interventional mean under an individualized treatment rule. We assume a non-parametric causal graphical model with, possibly, hidden variables and at least one adjustment set composed of observable variables. Moreover, we assume that observable variables have positive costs associated with them. We define the cost of an observable adjustment set as the sum of the costs of the variables that comprise it. We show that in this setting there exist adjustment sets that are minimum cost optimal, in the sense that they yield non-parametric estimators of the interventional mean with the smallest asymptotic variance among those that control for observable adjustment sets that have minimum cost. Our results are based on the construction of a special flow network associated with the original causal graph. We show that a minimum cost optimal adjustment set can be found by computing a maximum flow on the network, and then finding the set of vertices that are reachable from the source by augmenting paths. The optimaladj Python package implements the algorithms introduced in this article.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"7 1","pages":"174 - 189"},"PeriodicalIF":1.4,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78734030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Treatment effect optimisation in dynamic environments 动态环境下的治疗效果优化
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-01-01 DOI: 10.1515/jci-2020-0009
Jeroen Berrevoets, Sam Verboven, W. Verbeke
Abstract Applying causal methods to fields such as healthcare, marketing, and economics receives increasing interest. In particular, optimising the individual-treatment-effect – often referred to as uplift modelling – has peaked in areas such as precision medicine and targeted advertising. While existing techniques have proven useful in many settings, they suffer vividly in a dynamic environment. To address this issue, we propose a novel optimisation target that is easily incorporated in bandit algorithms. Incorporating this target creates a causal model which we name an uplifted contextual multi-armed bandit. Experiments on real and simulated data show the proposed method to effectively improve upon the state-of-the-art. All our code is made available online at https://github.com/vub-dl/u-cmab.
将因果方法应用于医疗保健、市场营销和经济学等领域受到越来越多的关注。特别是,优化个人治疗效果——通常被称为提升模型——在精准医疗和定向广告等领域已经达到顶峰。虽然现有技术已被证明在许多情况下都是有用的,但它们在动态环境中受到的影响却十分明显。为了解决这个问题,我们提出了一个新的优化目标,很容易纳入强盗算法。把这个目标结合起来,就产生了一个因果模型,我们称之为一个被提升的语境多手强盗。实际数据和仿真数据的实验表明,该方法在现有的基础上得到了有效的改进。我们所有的代码都可以在https://github.com/vub-dl/u-cmab上获得。
{"title":"Treatment effect optimisation in dynamic environments","authors":"Jeroen Berrevoets, Sam Verboven, W. Verbeke","doi":"10.1515/jci-2020-0009","DOIUrl":"https://doi.org/10.1515/jci-2020-0009","url":null,"abstract":"Abstract Applying causal methods to fields such as healthcare, marketing, and economics receives increasing interest. In particular, optimising the individual-treatment-effect – often referred to as uplift modelling – has peaked in areas such as precision medicine and targeted advertising. While existing techniques have proven useful in many settings, they suffer vividly in a dynamic environment. To address this issue, we propose a novel optimisation target that is easily incorporated in bandit algorithms. Incorporating this target creates a causal model which we name an uplifted contextual multi-armed bandit. Experiments on real and simulated data show the proposed method to effectively improve upon the state-of-the-art. All our code is made available online at https://github.com/vub-dl/u-cmab.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"1 1","pages":"106 - 122"},"PeriodicalIF":1.4,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77372681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Causal inference with imperfect instrumental variables 不完全工具变量下的因果推理
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2021-11-04 DOI: 10.1515/jci-2021-0065
N. Miklin, M. Gachechiladze, George Moreno, R. Chaves
Abstract Instrumental variables allow for quantification of cause and effect relationships even in the absence of interventions. To achieve this, a number of causal assumptions must be met, the most important of which is the independence assumption, which states that the instrument and any confounding factor must be independent. However, if this independence condition is not met, can we still work with imperfect instrumental variables? Imperfect instruments can manifest themselves by violations of the instrumental inequalities that constrain the set of correlations in the scenario. In this article, we establish a quantitative relationship between such violations of instrumental inequalities and the minimal amount of measurement dependence required to explain them for the case of discrete observed variables. As a result, we provide adapted inequalities that are valid in the presence of a relaxed measurement dependence assumption in the instrumental scenario. This allows for the adaptation of existing and new lower bounds on the average causal effect for instrumental scenarios with binary outcomes. Finally, we discuss our findings in the context of quantum mechanics.
工具变量允许在没有干预的情况下对因果关系进行量化。要做到这一点,必须满足一些因果假设,其中最重要的是独立性假设,即工具和任何混淆因素必须是独立的。然而,如果不满足这个独立性条件,我们还能处理不完美的工具变量吗?不完美的工具可以通过违反工具不平等来表现出来,而工具不平等限制了场景中的相关集。在本文中,我们建立了这种违反仪器不等式和最小量的测量依赖之间的定量关系,以解释离散观察变量的情况。因此,我们提供了在仪器场景中存在宽松的测量依赖假设的情况下有效的适应不等式。这允许对具有二元结果的工具情景的平均因果效应的现有和新的下限进行调整。最后,我们在量子力学的背景下讨论我们的发现。
{"title":"Causal inference with imperfect instrumental variables","authors":"N. Miklin, M. Gachechiladze, George Moreno, R. Chaves","doi":"10.1515/jci-2021-0065","DOIUrl":"https://doi.org/10.1515/jci-2021-0065","url":null,"abstract":"Abstract Instrumental variables allow for quantification of cause and effect relationships even in the absence of interventions. To achieve this, a number of causal assumptions must be met, the most important of which is the independence assumption, which states that the instrument and any confounding factor must be independent. However, if this independence condition is not met, can we still work with imperfect instrumental variables? Imperfect instruments can manifest themselves by violations of the instrumental inequalities that constrain the set of correlations in the scenario. In this article, we establish a quantitative relationship between such violations of instrumental inequalities and the minimal amount of measurement dependence required to explain them for the case of discrete observed variables. As a result, we provide adapted inequalities that are valid in the presence of a relaxed measurement dependence assumption in the instrumental scenario. This allows for the adaptation of existing and new lower bounds on the average causal effect for instrumental scenarios with binary outcomes. Finally, we discuss our findings in the context of quantum mechanics.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"232 1","pages":"45 - 63"},"PeriodicalIF":1.4,"publicationDate":"2021-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76562151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Double machine learning and automated confounder selection: A cautionary tale 双重机器学习和自动混淆选择:一个警世故事
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2021-08-25 DOI: 10.1515/jci-2022-0078
Paul Hünermund, Beyers Louw, Itamar Caspi
Abstract Double machine learning (DML) has become an increasingly popular tool for automated variable selection in high-dimensional settings. Even though the ability to deal with a large number of potential covariates can render selection-on-observables assumptions more plausible, there is at the same time a growing risk that endogenous variables are included, which would lead to the violation of conditional independence. This article demonstrates that DML is very sensitive to the inclusion of only a few “bad controls” in the covariate space. The resulting bias varies with the nature of the theoretical causal model, which raises concerns about the feasibility of selecting control variables in a data-driven way.
双机器学习(DML)已经成为一种越来越流行的高维环境中自动变量选择的工具。尽管处理大量潜在协变量的能力可以使可观测选择假设更加合理,但同时也存在内生变量被包括在内的风险,这将导致违反条件独立性。本文证明了DML对协变量空间中仅包含少数“坏控制”非常敏感。由此产生的偏差随理论因果模型的性质而变化,这引起了人们对以数据驱动的方式选择控制变量的可行性的关注。
{"title":"Double machine learning and automated confounder selection: A cautionary tale","authors":"Paul Hünermund, Beyers Louw, Itamar Caspi","doi":"10.1515/jci-2022-0078","DOIUrl":"https://doi.org/10.1515/jci-2022-0078","url":null,"abstract":"Abstract Double machine learning (DML) has become an increasingly popular tool for automated variable selection in high-dimensional settings. Even though the ability to deal with a large number of potential covariates can render selection-on-observables assumptions more plausible, there is at the same time a growing risk that endogenous variables are included, which would lead to the violation of conditional independence. This article demonstrates that DML is very sensitive to the inclusion of only a few “bad controls” in the covariate space. The resulting bias varies with the nature of the theoretical causal model, which raises concerns about the feasibility of selecting control variables in a data-driven way.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"2010 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2021-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82542755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Adaptive normalization for IPW estimation IPW估计的自适应归一化
IF 1.4 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2021-06-14 DOI: 10.1515/jci-2022-0019
Samir Khan, J. Ugander
Abstract Inverse probability weighting (IPW) is a general tool in survey sampling and causal inference, used in both Horvitz–Thompson estimators, which normalize by the sample size, and Hájek/self-normalized estimators, which normalize by the sum of the inverse probability weights. In this work, we study a family of IPW estimators, first proposed by Trotter and Tukey in the context of Monte Carlo problems, that are normalized by an affine combination of the sample size and a sum of inverse weights. We show how selecting an estimator from this family in a data-dependent way to minimize asymptotic variance leads to an iterative procedure that converges to an estimator with connections to regression control methods. We refer to such estimators as adaptively normalized estimators. For mean estimation in survey sampling, the adaptively normalized estimator has asymptotic variance that is never worse than the Horvitz–Thompson and Hájek estimators. Going further, we show that adaptive normalization can be used to propose improvements of the augmented IPW (AIPW) estimator, average treatment effect (ATE) estimators, and policy learning objectives. Appealingly, these proposals preserve both the asymptotic efficiency of AIPW and the regret bounds for policy learning with IPW objectives, and deliver consistent finite sample improvements in simulations for all three of mean estimation, ATE estimation, and policy learning.
逆概率加权(IPW)是一种用于调查抽样和因果推理的通用工具,用于通过样本量进行归一化的Horvitz-Thompson估计量和通过逆概率权和进行归一化的Hájek/自归一化估计量。在这项工作中,我们研究了一类IPW估计量,它们首先由Trotter和Tukey在蒙特卡罗问题的背景下提出,通过样本大小和逆权和的仿射组合进行归一化。我们展示了如何以数据相关的方式从这个族中选择一个估计量来最小化渐近方差,从而导致一个迭代过程收敛到一个与回归控制方法有联系的估计量。我们把这样的估计量称为自适应归一化估计量。对于调查抽样的均值估计,自适应归一化估计量的渐近方差不会比Horvitz-Thompson和Hájek估计量差。进一步,我们表明自适应归一化可以用来提出增强IPW (AIPW)估计器、平均处理效果(ATE)估计器和策略学习目标的改进。值得注意的是,这些建议既保留了AIPW的渐近效率,又保留了具有IPW目标的策略学习的遗憾界限,并在模拟中为所有三种方法(均值估计、ATE估计和策略学习)提供了一致的有限样本改进。
{"title":"Adaptive normalization for IPW estimation","authors":"Samir Khan, J. Ugander","doi":"10.1515/jci-2022-0019","DOIUrl":"https://doi.org/10.1515/jci-2022-0019","url":null,"abstract":"Abstract Inverse probability weighting (IPW) is a general tool in survey sampling and causal inference, used in both Horvitz–Thompson estimators, which normalize by the sample size, and Hájek/self-normalized estimators, which normalize by the sum of the inverse probability weights. In this work, we study a family of IPW estimators, first proposed by Trotter and Tukey in the context of Monte Carlo problems, that are normalized by an affine combination of the sample size and a sum of inverse weights. We show how selecting an estimator from this family in a data-dependent way to minimize asymptotic variance leads to an iterative procedure that converges to an estimator with connections to regression control methods. We refer to such estimators as adaptively normalized estimators. For mean estimation in survey sampling, the adaptively normalized estimator has asymptotic variance that is never worse than the Horvitz–Thompson and Hájek estimators. Going further, we show that adaptive normalization can be used to propose improvements of the augmented IPW (AIPW) estimator, average treatment effect (ATE) estimators, and policy learning objectives. Appealingly, these proposals preserve both the asymptotic efficiency of AIPW and the regret bounds for policy learning with IPW objectives, and deliver consistent finite sample improvements in simulations for all three of mean estimation, ATE estimation, and policy learning.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"49 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2021-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83878902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
Journal of Causal Inference
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1