Meeting the Assumptions of Inverse-Intensity Weighting for Longitudinal Data Subject to Irregular Follow-Up: Suggestions for the Design and Analysis of Clinic-Based Cohort Studies
E. Pullenayegum. doi: 10.1515/em-2018-0016

Abstract Clinic-based cohort studies enroll patients when they are first admitted to the clinic and follow them as part of usual care, with interest centering on the marginal mean of the outcome process. Because the required frequency of follow-up varies among patients, these studies often feature irregular visit times, with no two patients sharing a visit time. Inverse-intensity weighting has been developed to handle this; however, it requires that the visit process be conditionally independent of the outcome given the observed history. When patients schedule visits in response to changes in their health (for example, a disease flare), the conditional independence assumption is no longer plausible, leading to biased results. We suggest additional information that can be collected to ensure that conditional independence holds, and examine how this information can be used in the analysis. This allows clinic-based cohort studies to be used to determine longitudinal outcomes without incurring bias due to irregular follow-up.
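The bias from outcome-dependent visiting, and the inverse-intensity correction, can be illustrated with a toy simulation. This is a minimal sketch, not the paper's method: the visit intensity is taken as known, whereas in practice it would be estimated from a proportional-intensity (e.g. Andersen-Gill) model of the observed history, and all parameter values below are invented.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 2000
# Covariate Z drives both visit frequency and the outcome level,
# inducing the visit-outcome dependence that biases a naive mean.
Z = rng.binomial(1, 0.5, n)
rate = np.where(Z == 1, 3.0, 1.0)           # visits per unit follow-up time
n_visits = rng.poisson(rate)

outcomes, weights = [], []
for i in range(n):
    for _ in range(n_visits[i]):
        outcomes.append(2.0 + 1.0 * Z[i] + rng.normal())
        weights.append(1.0 / rate[i])        # inverse-intensity weight
outcomes, weights = np.array(outcomes), np.array(weights)

naive = outcomes.mean()                      # over-represents frequent visitors
iiw = np.average(outcomes, weights=weights)  # recovers the marginal mean (about 2.5)
```

Here the naive mean is pulled toward the high-Z group (which contributes three times as many visits), while the weighted mean down-weights those visits back to a per-patient footing.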
A comparison of approaches for estimating combined population attributable risks (PARs) for multiple risk factors
Y. Ruan, S. Walter, C. Friedenreich, D. Brenner. doi: 10.1515/em-2019-0021

Abstract Objectives Methods to estimate the population attributable risk (PAR) of a single risk factor, or the combined PAR of multiple risk factors, have been extensively studied and are well developed. Ideally, estimation of the combined PAR of multiple risk factors should be based on large cohort studies, which account both for the joint distributions of risk exposures and for their interactions. However, because such individual-level data are often lacking, many studies estimate the combined PAR using a comparative risk assessment framework: the PAR of each risk factor is estimated from its prevalence and relative risk, and the individual PARs are then combined using an approach that relies on two key assumptions, namely that the distributions of exposures to the risk factors are independent and that the relative risks are multiplicative. While these assumptions rarely hold in practice, no studies have investigated the magnitude of bias incurred when they are violated. Methods Using simulation-based models, we compared the combined PARs obtained with this approach to the more accurate estimates of PARs that are available when the joint distributions of exposures and risks can be established. Results We show that the assumptions of exposure independence and risk multiplicativity are sufficient but not necessary for the combined PAR to be unbiased. In the simplest situation of two risk factors, the bias of this approach is a function of the strength of association and the magnitude of risk interaction, for any values of exposure prevalence and their associated risks. In some cases, the combined PAR can be strongly under- or over-estimated, even if the two assumptions are only slightly violated. Conclusions We encourage researchers to quantify likely biases in their use of the M–S method, and we provide level plots and R code to assist.
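The combination rule being critiqued, and its bias under risk interaction, can be reproduced in a few lines. In this sketch two binary exposures are independent with prevalences p1, p2 and relative risks r1, r2, and psi is a multiplicative interaction factor on the doubly exposed (psi = 1 recovers multiplicativity); the individual PARs feeding the combination are computed from the main-effect relative risks via Levin's formula, one simple way the comparative risk assessment inputs might be derived. All numbers are hypothetical.

```python
def mean_rr(p1, p2, r1, r2, psi):
    # mean relative risk over the joint (independent) exposure distribution
    return (p1 * p2 * r1 * r2 * psi + p1 * (1 - p2) * r1
            + (1 - p1) * p2 * r2 + (1 - p1) * (1 - p2))

def true_par(p1, p2, r1, r2, psi):
    # exact combined PAR from the joint distribution
    return 1 - 1 / mean_rr(p1, p2, r1, r2, psi)

def levin(p, r):
    # single-factor PAR (Levin's formula)
    return p * (r - 1) / (1 + p * (r - 1))

def combined_par(p1, p2, r1, r2):
    # multiplicative combination of the individual PARs
    return 1 - (1 - levin(p1, r1)) * (1 - levin(p2, r2))
```

With psi = 1 the two agree exactly (the sufficiency noted in the Results); with, say, p1 = 0.3, p2 = 0.2, r1 = 2, r2 = 3 and psi = 2, the combined formula understates the true PAR.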
Extending balance assessment for the generalized propensity score under multiple imputation
Anna S. Frank, D. Matteson, H. Solvang, A. Lupattelli, H. Nordeng. doi: 10.1515/em-2019-0003

Abstract This manuscript extends the definition of the Absolute Standardized Mean Difference (ASMD) from binary exposures (M = 2) to M > 2 exposure groups on multiply imputed data sets. Two metrics are proposed: the Maximal Maximized Standardized Difference (MMSD) and the Maximal Averaged Standardized Difference (MASD). Missing data were introduced into the covariates of the simulated data at varying percentages under the missing at random (MAR) assumption. We then investigated the performance of the two metric definitions on the full and imputed data sets, validating the MASD and the MMSD by relating the balance metrics to estimation bias. The results show an association between the balance metrics and bias. The proposed balance diagnostics therefore appear appropriate for assessing balance for the generalized propensity score (GPS) under multiple imputation.
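The abstract does not spell out the exact MASD/MMSD definitions, so the sketch below uses one plausible reading: compute the pairwise ASMD for every pair of the M exposure groups in each imputed data set, then take the maximum over pairs of the per-pair average across imputations (MASD) or the overall maximum (MMSD). The function names and this reading are assumptions, not the authors' code.

```python
import numpy as np

def pairwise_asmd(x, g):
    """Absolute standardized mean difference of covariate x for every
    pair of exposure groups in g (generalizes the binary M = 2 case)."""
    levels = np.unique(g)
    out = []
    for i in range(len(levels)):
        for j in range(i + 1, len(levels)):
            a, b = x[g == levels[i]], x[g == levels[j]]
            pooled = np.sqrt((a.var(ddof=1) + b.var(ddof=1)) / 2)
            out.append(abs(a.mean() - b.mean()) / pooled)
    return np.array(out)

def masd_mmsd(imputed_xs, g):
    # one pairwise-ASMD vector per imputed data set
    per_imp = np.array([pairwise_asmd(x, g) for x in imputed_xs])
    masd = per_imp.mean(axis=0).max()   # max over pairs of the average
    mmsd = per_imp.max()                # max over pairs and imputations
    return masd, mmsd
```

By construction MMSD >= MASD, so the "maximized" variant is the more conservative balance diagnostic under this reading.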
A real-time search strategy for finding urban disease vector infestations
E. B. Rose, J. Roy, R. Castillo-Neyra, M. Ross, C. Condori-Pino, J. Peterson, César Náquira-Velarde, M. Levy. doi: 10.1515/em-2020-0001

Abstract Objectives Containing domestic vector infestation requires the ability to swiftly locate and treat infested homes. In urban settings, where vectors are heterogeneously distributed throughout a dense housing matrix, locating infestations can be challenging. Here, we present a novel stochastic compartmental model developed to help locate infested homes in urban areas. We designed the model using infestation data for the Chagas disease vector species Triatoma infestans in Arequipa, Peru. Methods Our approach incorporates disease vector counts at each observed house and the vector's complex spatial dispersal dynamics. We used a Bayesian method to augment the observed data, estimate the insect population growth and dispersal parameters, and determine posterior infestation probabilities of households. We investigated the properties of the model through simulation studies, followed by field testing in Arequipa. Results Simulation studies showed the model to be accurate in its estimates of the two parameters of interest: the growth rate of a domestic triatomine bug colony and the probability of a triatomine bug successfully invading a new home after dispersing from an infested home. When testing the model in the field, data collection guided by model estimates was hindered by low household participation rates, which severely limited the algorithm and, in turn, the model's predictive power. Conclusions While future optimization efforts must improve the model's capabilities when household participation is low, our approach is nonetheless an important step toward integrating data with predictive modeling to carry out evidence-based vector surveillance in cities.
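The two parameters the simulation studies target, a colony growth rate and a per-step invasion probability, can be seen at work in a deliberately minimal forward simulation (houses on a line, neighbour-only hops). This is a toy illustration of the data-generating idea, not the paper's spatial model or its Bayesian augmentation; every number is invented.

```python
import numpy as np

rng = np.random.default_rng(6)
H, T = 50, 30                  # houses on a line, monthly time steps (toy scale)
r, p_invade = 1.15, 0.05       # colony growth rate, invasion probability (assumed)
bugs = np.zeros(H)
bugs[H // 2] = 2.0             # a single initially infested house
infested_frac = []
for _ in range(T):
    bugs = np.where(bugs > 0, bugs * r, 0.0)   # within-house colony growth
    for h in np.where(bugs > 0)[0]:            # dispersal to adjacent houses
        for nb in (h - 1, h + 1):
            if 0 <= nb < H and bugs[nb] == 0 and rng.random() < p_invade:
                bugs[nb] = 1.0
    infested_frac.append(float((bugs > 0).mean()))
```

In the inference problem the trajectory of house-level counts is only partially observed, and r and p_invade are the unknowns to be recovered from it.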
The mean prevalence
F. Habibzadeh, P. Habibzadeh. doi: 10.1515/em-2019-0033 (no abstract available)
Population attributable fractions for continuously distributed exposures
J. Ferguson, Fabrizio Maturo, S. Yusuf, M. O’Donnell. doi: 10.1515/em-2019-0037

Abstract When estimating population attributable fractions (PAF), it is common to partition a naturally continuous exposure into a categorical risk factor. While prior risk factor categorization can aid estimation and interpretation, it can result in underestimation of the disease burden attributable to the exposure, as well as biased comparisons across different exposures and risk factors. Here, we propose sensible PAF estimands for continuous exposures under a potential outcomes framework. First, in contrast to previous approaches, we incorporate estimation of the minimum risk exposure value (MREV) into our procedures: for exposures such as tobacco usage a sensible MREV is known, but often it is unknown and must be estimated. Second, when the MREV is an extreme value of the exposure lying in the distributional tail, we argue that the natural estimator of the PAF may be both statistically biased and highly volatile; instead, we consider a family of modified PAFs that includes the natural estimate of the PAF as a limit. A graphical comparison of this set of modified PAFs for differing risk factors may be a better way to rank risk factors as intervention targets than the standard PAF calculation. Finally, we analyse the bias that may ensue from prior risk factor categorization, examine whether categorization is ever a good idea, and suggest interpretations of categorized estimands within a causal inference setting.
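The contrast between a continuous-exposure PAF evaluated at the MREV and a categorized version can be sketched numerically. The risk function, the exposure distribution, and the median-split categorization below are all invented for illustration; the estimand mirrors the potential-outcomes form PAF = 1 - E[r(MREV)] / E[r(X)], with the MREV taken as known here rather than estimated.

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.gamma(2.0, 2.0, 50_000)              # hypothetical continuous exposure
risk = lambda v: 0.01 * (1.0 + 0.3 * v)      # hypothetical risk model, minimized at v = 0

mrev = 0.0                                    # minimum-risk exposure value (known here)
paf_cont = 1 - risk(mrev) / risk(x).mean()    # PAF: shift everyone to the MREV

# categorized alternative: below-median exposure as the reference category
ref = x < np.median(x)
paf_cat = 1 - risk(x[ref]).mean() / risk(x).mean()
```

paf_cat falls below paf_cont because the reference category's average risk exceeds the minimum risk, which is the underestimation mechanism the abstract describes.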
Extrapolating sparse gold standard cause of death designations to characterize broader catchment areas
R. Lyles, S. Cunningham, Suprateek Kundu, Q. Bassat, I. Mandomando, C. Sacoor, Victor Akelo, D. Onyango, Emily Zielinski-Gutierrez, Allan W. Taylor. doi: 10.1515/em-2019-0031

Abstract Objectives The Child Health and Mortality Prevention Surveillance (CHAMPS) Network is designed to elucidate and track causes of under-5 child mortality and stillbirth at multiple sites in sub-Saharan Africa and South Asia using advanced surveillance, laboratory, and pathology methods. Expert panels provide an arguably gold-standard determination of the underlying cause of death (CoD) for a subset of child deaths, in part by examining tissue obtained via minimally invasive tissue sampling (MITS) procedures. We consider estimating a population-level distribution of CoDs from these sparse but precise data, in conjunction with data on subgrouping characteristics that are measured in the broader population of cases and are potentially associated both with selection for MITS and with cause-specific mortality. Methods We illustrate how estimation of each underlying CoD proportion using all available data can be framed equivalently as a Horvitz-Thompson adjustment or as a direct standardization, uncovering insights relevant to the designation of appropriate subgroups for adjusting for non-representative sampling. Taking advantage of the functional form of the result when expressed as a multinomial distribution-based maximum likelihood estimator, we propose small-sample adjustments to Bayesian credible intervals based on Jeffreys or related weakly informative Dirichlet prior distributions. Results Our analyses of early data from CHAMPS sites in Kenya and Mozambique, and accompanying simulation studies, demonstrate the validity of the adjustment approach under the attendant assumptions, together with marked performance improvements from the proposed adjusted Bayesian credible intervals. Conclusions Adjusting for non-representative sampling of cases validated via gold standard diagnostic methods is critical for epidemiologic studies such as CHAMPS that seek to extrapolate CoD proportion estimates.
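The standardization-plus-Jeffreys idea can be sketched directly. The subgroup shares, CoD counts, and the Dirichlet(1/2) prior choice below are hypothetical stand-ins, not CHAMPS data: the point estimate weights subgroup-specific CoD proportions by population shares (the direct-standardization view), and credible intervals come from standardizing Dirichlet posterior draws the same way.

```python
import numpy as np

rng = np.random.default_rng(2)
# subgroup population shares from the full surveillance catchment (hypothetical)
w = np.array([0.6, 0.4])
# MITS-confirmed CoD counts within each subgroup, 3 causes (hypothetical)
counts = np.array([[30, 15, 5],
                   [10, 20, 20]])

# direct standardization: weight subgroup-specific CoD proportions
p_hat = (w[:, None] * counts / counts.sum(axis=1, keepdims=True)).sum(axis=0)

# Jeffreys-type Dirichlet(1/2) posterior draws per subgroup, standardized identically
draws = np.stack([rng.dirichlet(c + 0.5, 4000) for c in counts])  # (2, 4000, 3)
post = (w[:, None, None] * draws).sum(axis=0)                     # (4000, 3)
ci = np.percentile(post, [2.5, 97.5], axis=0)                     # 95% credible intervals
```

With these inputs the standardized estimate is (0.44, 0.34, 0.22), and each cause's interval straddles its point estimate.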
Heterogeneous indirect effects for multiple mediators using interventional effect models
W. W. Loh, B. Moerkerke, T. Loeys, S. Vansteelandt. doi: 10.1515/em-2020-0023

Abstract Decomposing an exposure effect on an outcome into separate natural indirect effects through multiple mediators requires strict assumptions, such as correctly postulating the causal structure of the mediators and assuming no unmeasured confounding among them. In contrast, interventional indirect effects for multiple mediators can be identified even when, as is often the case, the mediators have an unknown causal structure, share unmeasured common causes, or both. Existing estimation methods for interventional indirect effects require calculating each distinct indirect effect in turn. This can quickly become unwieldy or infeasible, especially when investigating indirect effect measures that may be modified by observed baseline characteristics. In this article, we introduce simplified estimation procedures for such heterogeneous interventional indirect effects using interventional effect models, a class of marginal structural models that encode the interventional indirect effects as causal model parameters and thus readily permit effect modification by baseline covariates via (statistical) interaction terms. The mediators and outcome can be continuous or noncontinuous. We propose two estimation procedures: one using inverse weighting by the counterfactual mediator density or mass functions, and another using Monte Carlo integration. The former has the advantage of not requiring an outcome model but is susceptible to finite-sample bias from highly variable weights. The latter has the advantage of consistent estimation under a correctly specified (parametric) outcome model but is susceptible to bias from extrapolation. The estimators are illustrated using publicly available data assessing whether the indirect effects of self-efficacy on fatigue via self-reported post-traumatic stress disorder symptoms vary across levels of negative coping among health care workers during the COVID-19 outbreak.
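The Monte Carlo integration route can be illustrated in a toy linear setting with two mediators that share no assumed causal ordering. With linear models and no interactions, the interventional indirect effect through M1 reduces to a product of coefficients (0.8 x 0.5 = 0.4 here), which the Monte Carlo estimate should approximately recover. All models and values are invented; the real procedure would parameterize the effects via the authors' interventional effect models.

```python
import numpy as np

rng = np.random.default_rng(3)
n, B = 20_000, 20_000
A = rng.binomial(1, 0.5, n)
M1 = 0.5 * A + rng.normal(size=n)            # mediators with shared exposure,
M2 = 0.3 * A + rng.normal(size=n)            # no ordering between M1 and M2 assumed
Y = 1.0 * A + 0.8 * M1 + 0.6 * M2 + rng.normal(size=n)

# fit the (correctly specified) linear outcome model
X = np.column_stack([np.ones(n), A, M1, M2])
b0, bA, b1, b2 = np.linalg.lstsq(X, Y, rcond=None)[0]

# empirical counterfactual mediator distributions by exposure arm
m1_1, m1_0, m2_1 = M1[A == 1], M1[A == 0], M2[A == 1]

# Monte Carlo: contrast M1 drawn under a=1 vs a=0, holding A=1 and M2 at a=1 draws
d1, d0, d2 = rng.choice(m1_1, B), rng.choice(m1_0, B), rng.choice(m2_1, B)
iie_m1 = (np.mean(b0 + bA + b1 * d1 + b2 * d2)
          - np.mean(b0 + bA + b1 * d0 + b2 * d2))
```

Because the same M2 draws appear in both arms of the contrast, only the shift in the M1 distribution drives the estimate, which is what makes the effect "through M1" interventional rather than natural.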
A comparison of cause-specific and competing risk models to assess risk factors for dementia
M. Waller, G. Mishra, A. Dobson. doi: 10.1515/em-2019-0036

Abstract The study of dementia risk factors is complicated by the competing risk of dying. The standard approaches are the cause-specific Cox proportional hazards model, in which deaths are treated as censoring events (and removed from the risk set), and the Fine and Gray sub-distribution hazard model, in which those who die remain in the risk set. An alternative is to modify the risk set so that it lies between these two extremes. We propose a novel method for doing this: using a parametric survival model, we estimate the time at which a person who died might have been diagnosed had they survived, and then apply the cause-specific and Fine and Gray models to the modified dataset. We compare these methods using data on dementia from the Australian Longitudinal Study on Women's Health and discuss the assumptions and limitations of each model. The estimated risk factor effects varied considerably between the cause-specific model and the models designed to account for competing risks. Therefore, when assessing risk factors in the presence of competing risks, it is important to examine results from the cause-specific model, from different models that account for competing risks, and from the model that assesses risk factors associated with the competing risk itself.
Pub Date: 2019-12-01; Epub Date: 2019-08-02; DOI: 10.1515/em-2018-0011
Emily J Huang, Ravi Varadhan, Michelle C Carlson
In studies of clinical phenotypes, such as dementia, disability, and frailty, participants are typically assessed at in-person clinic visits. Thus, the precise time of onset for the phenotype is unknown. The discreteness of the clinic visits yields grouped event time data. We investigate how to perform a risk factor analysis in the case of grouped data. Since visits can be months to years apart, numbers of ties can be large, causing the exact tie-handling method of the Cox model to be computationally infeasible. We propose two new, computationally efficient approximations to the exact method: a Laplace approximation and an analytic approximation. Through extensive simulation studies, we compare these new methods to the Prentice-Gloeckler model and the Cox model using Efron's and Breslow's tie-handling methods. In addition, we compare the methods in an application to a large cohort study (N = 3,605) on the development of clinical frailty in older adults. In our simulations, the Laplace approximation has low bias in all settings, and the analytic approximation has low bias in settings where the regression coefficient is not large in magnitude. Their corresponding confidence intervals also have approximately the nominal coverage probability. In the data application, the results from the approximations are nearly identical to those of the Prentice-Gloeckler model.
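The abstract's comparison baselines, Breslow's and Efron's tie-handling methods, differ only in the denominator of the partial-likelihood contribution at a tied event time. A hedged pure-Python sketch of that denominator term (the toy data and function name are illustrative; this is not the paper's new Laplace or analytic approximation):

```python
import math

def log_denominator_ties(risk_scores, tied_event_scores):
    """Log-denominator of the Cox partial-likelihood contribution at one
    tied event time, under Breslow's and Efron's approximations.
    risk_scores: exp(beta * x) for every subject at risk at this time;
    tied_event_scores: the subset belonging to subjects who had the event."""
    R = sum(risk_scores)        # total risk mass in the risk set
    D = sum(tied_event_scores)  # risk mass of the tied events
    d = len(tied_event_scores)  # number of tied events
    # Breslow: reuse the full risk set for all d tied events.
    breslow = d * math.log(R)
    # Efron: progressively discount the tied events' own contribution.
    efron = sum(math.log(R - (j / d) * D) for j in range(d))
    return breslow, efron

# Four subjects at risk with covariate values 0, 1, 1, 2 and beta = 0.5;
# the two middle subjects have tied events at this time.
risk = [math.exp(0.5 * x) for x in (0, 1, 1, 2)]
b, e = log_denominator_ties(risk, risk[1:3])
# With ties present, Efron's log-denominator is strictly smaller than
# Breslow's, so the two methods give different partial likelihoods.
```

When visits are months to years apart, d can be a large fraction of the risk set, which is exactly the regime where these approximations diverge and where the grouped-data methods the paper proposes become attractive.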
{"title":"Modeling of Clinical Phenotypes Assessed at Discrete Study Visits.","authors":"Emily J Huang, Ravi Varadhan, Michelle C Carlson","doi":"10.1515/em-2018-0011","DOIUrl":"10.1515/em-2018-0011","url":null,"abstract":"<p><p>In studies of clinical phenotypes, such as dementia, disability, and frailty, participants are typically assessed at in-person clinic visits. Thus, the precise time of onset for the phenotype is unknown. The discreteness of the clinic visits yields grouped event time data. We investigate how to perform a risk factor analysis in the case of grouped data. Since visits can be months to years apart, numbers of ties can be large, causing the exact tie-handling method of the Cox model to be computationally infeasible. We propose two, new, computationally efficient approximations to the exact method: Laplace approximation and an analytic approximation. Through extensive simulation studies, we compare these new methods to the Prentice-Gloeckler model and the Cox model using Efron's and Breslow's tie-handling methods. In addition, we compare the methods in an application to a large cohort study (<i>N</i> = 3,605) on the development of clinical frailty in older adults. In our simulations, the Laplace approximation has low bias in all settings, and the analytic approximation has low bias in settings where the regression coefficient is not large in magnitude. Their corresponding confidence intervals also have approximately the nominal coverage probability. In the data application, the results from the approximations are nearly identical to that of the Prentice-Gloeckler model.</p>","PeriodicalId":37999,"journal":{"name":"Epidemiologic Methods","volume":"8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7806193/pdf/nihms-1618534.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38821278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}