Causal effect on a target population: A sensitivity analysis to handle missing covariates

IF 1.7 4区 医学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Journal of Causal Inference Pub Date : 2021-05-13 DOI:10.1515/jci-2021-0059
B. Colnet, J. Josse, G. Varoquaux, Erwan Scornet
{"title":"Causal effect on a target population: A sensitivity analysis to handle missing covariates","authors":"B. Colnet, J. Josse, G. Varoquaux, Erwan Scornet","doi":"10.1515/jci-2021-0059","DOIUrl":null,"url":null,"abstract":"Abstract Randomized controlled trials (RCTs) are often considered the gold standard for estimating causal effect, but they may lack external validity when the population eligible to the RCT is substantially different from the target population. Having at hand a sample of the target population of interest allows us to generalize the causal effect. Identifying the treatment effect in the target population requires covariates to capture all treatment effect modifiers that are shifted between the two sets. Standard estimators then use either weighting (IPSW), outcome modeling (G-formula), or combine the two in doubly robust approaches (AIPSW). However, such covariates are often not available in both sets. In this article, after proving L 1 {L}^{1} -consistency of these three estimators, we compute the expected bias induced by a missing covariate, assuming a Gaussian distribution, a continuous outcome, and a semi-parametric model. Under this setting, we perform a sensitivity analysis for each missing covariate pattern and compute the sign of the expected bias. We also show that there is no gain in linearly imputing a partially unobserved covariate. Finally, we study the substitution of a missing covariate by a proxy. We illustrate all these results on simulations, as well as semi-synthetic benchmarks using data from the Tennessee student/teacher achievement ratio (STAR), and a real-world example from critical care medicine.","PeriodicalId":48576,"journal":{"name":"Journal of Causal Inference","volume":"19 1","pages":"372 - 414"},"PeriodicalIF":1.7000,"publicationDate":"2021-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Causal Inference","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1515/jci-2021-0059","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 8

Abstract

Abstract Randomized controlled trials (RCTs) are often considered the gold standard for estimating causal effect, but they may lack external validity when the population eligible to the RCT is substantially different from the target population. Having at hand a sample of the target population of interest allows us to generalize the causal effect. Identifying the treatment effect in the target population requires covariates to capture all treatment effect modifiers that are shifted between the two sets. Standard estimators then use either weighting (IPSW), outcome modeling (G-formula), or combine the two in doubly robust approaches (AIPSW). However, such covariates are often not available in both sets. In this article, after proving L 1 {L}^{1} -consistency of these three estimators, we compute the expected bias induced by a missing covariate, assuming a Gaussian distribution, a continuous outcome, and a semi-parametric model. Under this setting, we perform a sensitivity analysis for each missing covariate pattern and compute the sign of the expected bias. We also show that there is no gain in linearly imputing a partially unobserved covariate. Finally, we study the substitution of a missing covariate by a proxy. We illustrate all these results on simulations, as well as semi-synthetic benchmarks using data from the Tennessee student/teacher achievement ratio (STAR), and a real-world example from critical care medicine.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
对目标人群的因果效应:处理缺失协变量的敏感性分析
随机对照试验(RCT)通常被认为是估计因果效应的金标准,但当符合RCT的人群与目标人群有很大差异时,它们可能缺乏外部效度。手头有感兴趣的目标人群的样本使我们能够概括因果关系。确定目标人群中的治疗效果需要协变量来捕获在两组之间转移的所有治疗效果修饰符。然后,标准估计器要么使用加权(IPSW),要么使用结果建模(g公式),要么使用双稳健方法(AIPSW)将两者结合起来。然而,这类协变量在两个集合中往往不可用。在本文中,在证明了这三个估计量的L 1 {L}^{1} -相合性之后,我们计算了由缺失协变量引起的期望偏差,假设高斯分布,结果连续,半参数模型。在此设置下,我们对每个缺失的协变量模式进行敏感性分析,并计算期望偏差的符号。我们还表明,在线性输入部分未观察到的协变量时没有增益。最后,我们研究了用代理替换缺失的协变量。我们在模拟中说明了所有这些结果,以及使用田纳西州学生/教师成就比(STAR)数据的半合成基准,以及来自重症监护医学的现实世界示例。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Causal Inference
Journal of Causal Inference Decision Sciences-Statistics, Probability and Uncertainty
CiteScore
1.90
自引率
14.30%
发文量
15
审稿时长
86 weeks
期刊介绍: Journal of Causal Inference (JCI) publishes papers on theoretical and applied causal research across the range of academic disciplines that use quantitative tools to study causality.
期刊最新文献
Evaluating Boolean relationships in Configurational Comparative Methods Comparison of open-source software for producing directed acyclic graphs. LINGUISTIC FEATURES AND PRESENTATION OF MATERIALS ON ENGLISH TEXTBOOK “WHEN ENGLISH RINGS A BELL” BASED ON BSNP Heterogeneous interventional effects with multiple mediators: Semiparametric and nonparametric approaches Attributable fraction and related measures: Conceptual relations in the counterfactual framework
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1