首页 > 最新文献

Epidemiology最新文献

英文 中文
A structural description of biases that generate immortal time. 从结构上描述产生不朽时间的偏差。
IF 4.7 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2024-11-04 DOI: 10.1097/EDE.0000000000001808
Miguel A Hernán, Jonathan A C Sterne, Julian P T Higgins, Ian Shrier, Sonia Hernández-Díaz

Immortal time arises when individuals in the analysis are either selected based on post-assignment eligibility criteria or assigned to treatment strategies based on post-eligibility information. Explicit target trial emulation prevents the introduction of immortal time in survival analyses of observational data because it synchronizes eligibility and treatment assignment at the start of follow-up. Describing the structure of the biases that generate immortal time is facilitated by specifying the target trial so that the procedures to determine eligibility and assignment can be appropriately evaluated. Selection based on eligibility criteria applied after treatment assignment at the start of follow-up results in immortal time when the analysis starts the follow-up at assignment. Misclassification of assignment to treatment strategies based on treatment received after the start of follow-up results in immortal time when the treatment strategies are not distinguishable at the start of follow-up. The above selection and misclassification can be represented using causal diagrams. We summarize analytic approaches that prevent immortal time when longitudinal data are available from the time of treatment assignment. The term "immortal time bias" suggests that the source of the bias is the immortal time, but it is selection or misclassification that generates the immortal time, leading to bias.

当分析中的个体根据分配后的资格标准进行选择或根据资格后的信息分配治疗策略时,就会出现不死时间。明确的目标试验模拟可以防止在观察数据的生存分析中引入不朽时间,因为它在随访开始时同步了资格和治疗分配。明确目标试验有助于描述产生不死时间的偏差结构,从而对确定资格和分配的程序进行适当的评估。在随访开始时,根据治疗分配后应用的资格标准进行选择,会导致在分配时开始随访分析时产生不死时间。根据随访开始后接受的治疗,对治疗策略的分配进行错误分类,导致在随访开始时无法区分治疗策略,从而造成永恒的时间。上述选择和错误分类可以用因果图来表示。我们总结了一些分析方法,这些方法可以在有从治疗分配开始的纵向数据时防止不朽时间。不朽时间偏差 "一词表明,偏差的来源是不朽时间,但产生不朽时间并导致偏差的是选择或错误分类。
{"title":"A structural description of biases that generate immortal time.","authors":"Miguel A Hernán, Jonathan A C Sterne, Julian P T Higgins, Ian Shrier, Sonia Hernández-Díaz","doi":"10.1097/EDE.0000000000001808","DOIUrl":"https://doi.org/10.1097/EDE.0000000000001808","url":null,"abstract":"<p><p>Immortal time arises when individuals in the analysis are either selected based on post-assignment eligibility criteria or assigned to treatment strategies based on post-eligibility information. Explicit target trial emulation prevents the introduction of immortal time in survival analyses of observational data because it synchronizes eligibility and treatment assignment at the start of follow-up. Describing the structure of the biases that generate immortal time is facilitated by specifying the target trial so that the procedures to determine eligibility and assignment can be appropriately evaluated. Selection based on eligibility criteria applied after treatment assignment at the start of follow-up results in immortal time when the analysis starts the follow-up at assignment. Misclassification of assignment to treatment strategies based on treatment received after the start of follow-up results in immortal time when the treatment strategies are not distinguishable at the start of follow-up. The above selection and misclassification can be represented using causal diagrams. We summarize analytic approaches that prevent immortal time when longitudinal data are available from the time of treatment assignment. The term \"immortal time bias\" suggests that the source of the bias is the immortal time, but it is selection or misclassification that generates the immortal time, leading to bias.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142567283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Preventable Fraction in the Context of Disease Progression. 在疾病进展过程中可预防的分数。
IF 4.7 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2024-11-01 Epub Date: 2024-07-23 DOI: 10.1097/EDE.0000000000001770
Bronner P Gonçalves, Etsuji Suzuki

The relevance of the epidemiologic concept of preventable fraction to the study of the population-level impact of preventive exposures is unequivocal. Here, we discuss how the preventable fraction can be usefully understood for the class of outcomes that relate to disease progression (e.g., clinical severity given diagnosis), and, under the principal stratification framework, derive an expression for this quantity for this type of outcome. In particular, we show that, in the context of disease progression, the preventable fraction is a function of the effect on the postdiagnosis outcome in the principal stratum in the unexposed group who would have disease regardless of exposure status. This work will facilitate an understanding of the contribution of principal effects to the impact of preventive exposures at the population level.

可预防分数这一流行病学概念对于研究预防性暴露对人群的影响具有明确的意义。在此,我们将讨论如何有效地理解与疾病进展相关的一类结果(如确诊时的临床严重程度)的可预防分数,并在主要分层框架下推导出这类结果的可预防分数表达式。特别是,我们表明,在疾病进展的背景下,可预防部分是对未暴露组主要分层中诊断后结果的影响的函数,而未暴露组无论暴露状况如何都会患病。这项工作将有助于理解主要效应对人群预防性暴露影响的贡献。
{"title":"Preventable Fraction in the Context of Disease Progression.","authors":"Bronner P Gonçalves, Etsuji Suzuki","doi":"10.1097/EDE.0000000000001770","DOIUrl":"10.1097/EDE.0000000000001770","url":null,"abstract":"<p><p>The relevance of the epidemiologic concept of preventable fraction to the study of the population-level impact of preventive exposures is unequivocal. Here, we discuss how the preventable fraction can be usefully understood for the class of outcomes that relate to disease progression (e.g., clinical severity given diagnosis), and, under the principal stratification framework, derive an expression for this quantity for this type of outcome. In particular, we show that, in the context of disease progression, the preventable fraction is a function of the effect on the postdiagnosis outcome in the principal stratum in the unexposed group who would have disease regardless of exposure status. This work will facilitate an understanding of the contribution of principal effects to the impact of preventive exposures at the population level.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"801-804"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141751375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Methods for Extending Inferences From Observational Studies: Considering Causal Structures, Identification Assumptions, and Estimators. 扩展观察研究推论的方法:考虑因果结构、识别假设和估算器。
IF 4.7 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2024-11-01 Epub Date: 2024-08-09 DOI: 10.1097/EDE.0000000000001780
Eleanor Hayes-Larson, Yixuan Zhou, L Paloma Rojas-Saunero, Crystal Shaw, Marissa J Seamans, M Maria Glymour, Audrey R Murchland, Daniel Westreich, Elizabeth Rose Mayeda

Most prior work in quantitative approaches to generalizability and transportability emphasizes extending causal effect estimates from randomized trials to target populations. Extending findings from observational studies is also of scientific interest, and identifiability assumptions and estimation methods differ from randomized settings when there is selection on both the exposure and exposure-outcome mediators in combination with exposure-outcome confounders (and both confounders and mediators can modify exposure-outcome effects). We argue that this causal structure is common in observational studies, particularly in the field of life course epidemiology, for example, when extending estimates of the effect of an early-life exposure on a later-life outcome from a cohort enrolled in midlife or late life. We describe identifiability assumptions and identification using observed data in such settings, highlighting differences from work extending findings from randomized trials. We describe statistical methods, including weighting, outcome modeling, and doubly robust approaches, to estimate potential outcome means and average treatment effects in the target population and illustrate performance of the methods in a simulation study. We show that in the presence of selection into the study sample on both exposure and confounders, estimators must be able to address confounding in the target population. When there is also selection on mediators of the exposure-outcome relationship, estimators need to be able to use different sets of variables to account for selection (including the mediator), and confounding. We discuss conceptual implications of our results as well as highlight unresolved practical questions for applied work to extend findings from observational studies to target populations.

之前关于可推广性和可迁移性定量方法的大部分工作都强调将因果效应估计从随机试验扩展到目标人群。当暴露和暴露-结果中介因素与暴露-结果混杂因素(混杂因素和中介因素都能改变暴露-结果效应)都存在选择时,可识别性假设和估算方法就与随机环境不同。我们认为,这种因果结构在观察性研究中很常见,尤其是在生命过程流行病学领域,例如,当从中晚期入组的队列中扩展早期暴露对晚期结果影响的估计时。我们介绍了在这种情况下使用观察数据进行可识别性假设和识别的方法,强调了与随机试验结果扩展工作的不同之处。我们介绍了统计方法,包括加权、结果建模和双重稳健方法,以估计目标人群中潜在的结果平均值和平均治疗效果,并在模拟研究中说明了这些方法的性能。我们表明,如果研究样本中存在对暴露和混杂因素的选择,估计方法必须能够解决目标人群中的混杂问题。当暴露-结果关系的中介因素也存在选择时,估计方法必须能够使用不同的变量集来解释选择(包括中介因素)和混杂。我们讨论了我们的结果在概念上的影响,并强调了应用工作中尚未解决的实际问题,以便将观察性研究的结果推广到目标人群。
{"title":"Methods for Extending Inferences From Observational Studies: Considering Causal Structures, Identification Assumptions, and Estimators.","authors":"Eleanor Hayes-Larson, Yixuan Zhou, L Paloma Rojas-Saunero, Crystal Shaw, Marissa J Seamans, M Maria Glymour, Audrey R Murchland, Daniel Westreich, Elizabeth Rose Mayeda","doi":"10.1097/EDE.0000000000001780","DOIUrl":"10.1097/EDE.0000000000001780","url":null,"abstract":"<p><p>Most prior work in quantitative approaches to generalizability and transportability emphasizes extending causal effect estimates from randomized trials to target populations. Extending findings from observational studies is also of scientific interest, and identifiability assumptions and estimation methods differ from randomized settings when there is selection on both the exposure and exposure-outcome mediators in combination with exposure-outcome confounders (and both confounders and mediators can modify exposure-outcome effects). We argue that this causal structure is common in observational studies, particularly in the field of life course epidemiology, for example, when extending estimates of the effect of an early-life exposure on a later-life outcome from a cohort enrolled in midlife or late life. We describe identifiability assumptions and identification using observed data in such settings, highlighting differences from work extending findings from randomized trials. We describe statistical methods, including weighting, outcome modeling, and doubly robust approaches, to estimate potential outcome means and average treatment effects in the target population and illustrate performance of the methods in a simulation study. We show that in the presence of selection into the study sample on both exposure and confounders, estimators must be able to address confounding in the target population. When there is also selection on mediators of the exposure-outcome relationship, estimators need to be able to use different sets of variables to account for selection (including the mediator), and confounding. We discuss conceptual implications of our results as well as highlight unresolved practical questions for applied work to extend findings from observational studies to target populations.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"753-763"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11560714/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141909830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mixture Models for Social Epidemiology: Opportunities and Cautions. 社会流行病学的混合模型:机遇与注意事项。
IF 4.7 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2024-11-01 Epub Date: 2024-08-01 DOI: 10.1097/EDE.0000000000001778
Alina Schnake-Mahl, Ghassan Badri Hamra
{"title":"Mixture Models for Social Epidemiology: Opportunities and Cautions.","authors":"Alina Schnake-Mahl, Ghassan Badri Hamra","doi":"10.1097/EDE.0000000000001778","DOIUrl":"10.1097/EDE.0000000000001778","url":null,"abstract":"","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"748-752"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141859350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Counterfactual Analysis of Impact of Cesarean Birth in a First Birth on Severe Maternal Morbidity in the Subsequent Birth. 首次分娩剖腹产对再次分娩严重产妇发病率影响的反事实分析。
IF 4.7 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2024-11-01 Epub Date: 2024-07-26 DOI: 10.1097/EDE.0000000000001775
Shalmali Bane, Jonathan M Snowden, Julia F Simard, Michelle Odden, Peiyi Kan, Elliott K Main, Suzan L Carmichael

Background: It is known that cesarean birth affects maternal outcomes in subsequent pregnancies, but specific effect estimates are lacking. We sought to quantify the effect of cesarean birth reduction among nulliparous, term, singleton, vertex (NTSV) births (i.e., preventable cesarean births) on severe maternal morbidity (SMM) in the second birth.

Methods: We examined birth certificates linked with maternal hospitalization data (2007-2019) from California for NTSV births with a second birth (N = 779,382). The exposure was cesarean delivery in the first birth and the outcome was SMM in the second birth. We used adjusted Poisson regression models to calculate risk ratios and population attributable fraction for SMM in the second birth and conducted a counterfactual impact analysis to estimate how lowering NTSV cesarean births could reduce SMM in the second birth.

Results: The adjusted risk ratio for SMM in the second birth given a prior cesarean birth was 1.7 (95% confidence interval: 1.5, 1.9); 15.5% (95% confidence interval: 15.3%, 15.7%) of this SMM may be attributable to prior cesarean birth. In a counterfactual analysis where 12% of the California population was least likely to get a cesarean birth instead delivered vaginally, we observed 174 fewer SMM events in a population of individuals with a low-risk first birth and subsequent birth.

Conclusion: In our counterfactual analysis, lowering primary cesarean birth among an NTSV population was associated with fewer downstream SMM events in subsequent births and overall. Additionally, our findings reflect the importance of considering the cumulative accrual of risks across the reproductive life course.

背景:众所周知,剖宫产会影响产妇以后的妊娠结局,但缺乏具体的效果估计。我们试图量化减少无子宫、足月、单胎、顶点(NTSV)分娩(即可预防的剖宫产)中的剖宫产对第二胎严重孕产妇发病率(SMM)的影响:我们研究了加利福尼亚州与产妇住院数据相关联的出生证明(2007-19 年),其中包括有第二次分娩的 NTSV 新生儿(N=779,382)。第一胎为剖宫产,第二胎为SMM。我们使用调整后的泊松回归模型计算第二胎SMM的风险比和人口可归因分数,并进行了反事实影响分析,以估计降低NTSV剖宫产率可如何减少第二胎SMM:结果:如果产妇之前曾进行过剖宫产,则第二次分娩的SMM调整风险比为1.7(95% CI 1.5-1.9);其中15.5%(95% CI 15.3%-15.7%)的SMM可能归因于之前的剖宫产。在一项反事实分析中,加利福尼亚州最不可能进行剖宫产的人群中有12%经阴道分娩,我们观察到在低风险首次分娩和随后分娩的人群中,SMM事件减少了174例:在我们的反事实分析中,在 NTSV 人群中降低初次剖宫产率与后续分娩和总体分娩中减少下游 SMM 事件有关。此外,我们的研究结果还反映了考虑整个生育期风险累积的重要性。
{"title":"A Counterfactual Analysis of Impact of Cesarean Birth in a First Birth on Severe Maternal Morbidity in the Subsequent Birth.","authors":"Shalmali Bane, Jonathan M Snowden, Julia F Simard, Michelle Odden, Peiyi Kan, Elliott K Main, Suzan L Carmichael","doi":"10.1097/EDE.0000000000001775","DOIUrl":"10.1097/EDE.0000000000001775","url":null,"abstract":"<p><strong>Background: </strong>It is known that cesarean birth affects maternal outcomes in subsequent pregnancies, but specific effect estimates are lacking. We sought to quantify the effect of cesarean birth reduction among nulliparous, term, singleton, vertex (NTSV) births (i.e., preventable cesarean births) on severe maternal morbidity (SMM) in the second birth.</p><p><strong>Methods: </strong>We examined birth certificates linked with maternal hospitalization data (2007-2019) from California for NTSV births with a second birth (N = 779,382). The exposure was cesarean delivery in the first birth and the outcome was SMM in the second birth. We used adjusted Poisson regression models to calculate risk ratios and population attributable fraction for SMM in the second birth and conducted a counterfactual impact analysis to estimate how lowering NTSV cesarean births could reduce SMM in the second birth.</p><p><strong>Results: </strong>The adjusted risk ratio for SMM in the second birth given a prior cesarean birth was 1.7 (95% confidence interval: 1.5, 1.9); 15.5% (95% confidence interval: 15.3%, 15.7%) of this SMM may be attributable to prior cesarean birth. In a counterfactual analysis where 12% of the California population was least likely to get a cesarean birth instead delivered vaginally, we observed 174 fewer SMM events in a population of individuals with a low-risk first birth and subsequent birth.</p><p><strong>Conclusion: </strong>In our counterfactual analysis, lowering primary cesarean birth among an NTSV population was associated with fewer downstream SMM events in subsequent births and overall. Additionally, our findings reflect the importance of considering the cumulative accrual of risks across the reproductive life course.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"853-863"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11560597/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141765749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Causal Roadmap and Simulations to Improve the Rigor and Reproducibility of Real-data Applications. 改善真实数据应用的严谨性和可重复性的因果路线图和模拟。
IF 4.7 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2024-11-01 Epub Date: 2024-08-01 DOI: 10.1097/EDE.0000000000001773
Nerissa Nance, Maya L Petersen, Mark van der Laan, Laura B Balzer

The Causal Roadmap outlines a systematic approach to asking and answering questions of cause and effect: define the quantity of interest, evaluate needed assumptions, conduct statistical estimation, and carefully interpret results. To protect research integrity, it is essential that the algorithm for statistical estimation and inference be prespecified prior to conducting any effectiveness analyses. However, it is often unclear which algorithm will perform optimally for the real-data application. Instead, there is a temptation to simply implement one's favorite algorithm, recycling prior code or relying on the default settings of a computing package. Here, we call for the use of simulations that realistically reflect the application, including key characteristics such as strong confounding and dependent or missing outcomes, to objectively compare candidate estimators and facilitate full specification of the statistical analysis plan. Such simulations are informed by the Causal Roadmap and conducted after data collection but prior to effect estimation. We illustrate with two worked examples. First, in an observational longitudinal study, we use outcome-blind simulations to inform nuisance parameter estimation and variance estimation for longitudinal targeted minimum loss-based estimation. Second, in a cluster randomized trial with missing outcomes, we use treatment-blind simulations to examine type-I error control in two-stage targeted minimum loss-based estimation. In both examples, realistic simulations empower us to prespecify an estimation approach with strong expected finite sample performance, and also produce quality-controlled computing code for the actual analysis. Together, this process helps to improve the rigor and reproducibility of our research.

因果关系路线图概述了提出和回答因果关系问题的系统方法:定义感兴趣的数量、评估所需假设、进行统计估算并仔细解释结果。为了保护研究的完整性,在进行任何有效性分析之前,必须预先确定统计估算和推断的算法。然而,人们往往不清楚哪种算法在实际数据应用中表现最佳。相反,人们往往会简单地执行自己喜欢的算法,重复使用先前的代码或依赖于计算软件包的默认设置。在此,我们呼吁使用能真实反映应用的模拟,包括强混杂、依赖或缺失结果等关键特征,以客观地比较候选估计器,并促进统计分析计划的全面规范化。此类模拟以因果关系路线图为依据,在数据收集之后、效应估计之前进行。我们用两个实例来说明。首先,在一项观察性纵向研究中,我们使用结果盲模拟为基于最小损失的纵向目标估算的滋扰参数估计和方差估计提供信息。其次,在一项结果缺失的群组随机试验中,我们使用治疗盲模拟来检验基于最小损失的两阶段目标估计中的I型误差控制。在这两个例子中,现实模拟使我们有能力预先指定一种估计方法,这种方法预计具有很强的有限样本性能,同时还能为实际分析提供质量可控的计算代码。这一过程有助于提高我们研究的严谨性和可重复性。
{"title":"The Causal Roadmap and Simulations to Improve the Rigor and Reproducibility of Real-data Applications.","authors":"Nerissa Nance, Maya L Petersen, Mark van der Laan, Laura B Balzer","doi":"10.1097/EDE.0000000000001773","DOIUrl":"10.1097/EDE.0000000000001773","url":null,"abstract":"<p><p>The Causal Roadmap outlines a systematic approach to asking and answering questions of cause and effect: define the quantity of interest, evaluate needed assumptions, conduct statistical estimation, and carefully interpret results. To protect research integrity, it is essential that the algorithm for statistical estimation and inference be prespecified prior to conducting any effectiveness analyses. However, it is often unclear which algorithm will perform optimally for the real-data application. Instead, there is a temptation to simply implement one's favorite algorithm, recycling prior code or relying on the default settings of a computing package. Here, we call for the use of simulations that realistically reflect the application, including key characteristics such as strong confounding and dependent or missing outcomes, to objectively compare candidate estimators and facilitate full specification of the statistical analysis plan. Such simulations are informed by the Causal Roadmap and conducted after data collection but prior to effect estimation. We illustrate with two worked examples. First, in an observational longitudinal study, we use outcome-blind simulations to inform nuisance parameter estimation and variance estimation for longitudinal targeted minimum loss-based estimation. Second, in a cluster randomized trial with missing outcomes, we use treatment-blind simulations to examine type-I error control in two-stage targeted minimum loss-based estimation. In both examples, realistic simulations empower us to prespecify an estimation approach with strong expected finite sample performance, and also produce quality-controlled computing code for the actual analysis. Together, this process helps to improve the rigor and reproducibility of our research.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"791-800"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11444352/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141859351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Associations Between Gestational Residential Radon Exposure and Term Low Birthweight in Connecticut, USA. 美国康涅狄格州妊娠期住宅氡暴露与足月低出生体重之间的关系。
IF 4.7 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2024-11-01 Epub Date: 2024-07-23 DOI: 10.1097/EDE.0000000000001771
Seulkee Heo, Longxiang Li, Ji-Young Son, Petros Koutrakis, Michelle L Bell

Background: Studies suggest biologic mechanisms for gestational exposure to radiation and impaired fetal development. We explored associations between gestational radon exposure and term low birthweight, for which evidence is limited.

Methods: We examined data for 68,159 singleton full-term births in Connecticut, United States, 2016-2018. Using a radon spatiotemporal model, we estimated ZIP code-level basement and ground-level exposures during pregnancy and trimesters for each participant's address at birth or delivery. We used logistic regression models, including confounders, to estimate odds ratios (ORs) for term low birth weight in four exposure quartiles (Q1-Q4) with the lowest exposure group (Q1) as the reference.

Results: Exposure levels to basement radon throughout pregnancy (0.27-3.02 pCi/L) were below the guideline level set by the US Environmental Protection Agency (4 pCi/L). The ORs for term low birth weight in the second-highest (Q3; 1.01-1.33 pCi/L) exposure group compared with the reference (<0.79 pCi/L) group for basement radon during the first trimester was 1.22 (95% confidence interval [CI] = 1.02, 1.45). The OR in the highest (Q4; 1.34-4.43 pCi/L) quartile group compared with the reference group during the first trimester was 1.26 (95% CI = 1.05, 1.50). Risks from basement radon were higher for participants with lower income, lower maternal education levels, or living in urban regions.

Conclusion: This study found increased term low birth weight risks for increases in basement radon. Results have implications for infants' health for exposure to radon at levels below the current national guideline for indoor radon concentrations and building remediations.

背景:研究表明,妊娠期暴露于辐射与胎儿发育受损之间存在生物机制。我们探讨了妊娠期氡暴露与足月低出生体重之间的关联,这方面的证据有限:我们研究了 2016-2018 年美国康涅狄格州 68,159 例单胎足月新生儿的数据。利用氡时空模型,我们估算了每位参与者出生或分娩时地址的 ZIP 代码级地下室和地面氡暴露量。我们使用逻辑回归模型(包括混杂因素)估算了四个暴露四分位数(Q1 至 Q4)中足月低出生体重的几率比(ORs),并以最低暴露组(Q1)作为参照:整个孕期的地下室氡暴露水平(0.27-3.02 pCi/L)均低于美国环境保护局设定的指导水平(4 pCi/L)。与参考值相比,第二高(Q3;1.01-1.33 pCi/L)辐照组的足月低出生体重的 ORs(结论:本研究发现,随着地下室氡含量的增加,足月新生儿体重不足的风险也会增加。如果氡暴露水平低于现行的国家室内氡浓度和建筑补救指南,研究结果将对婴儿健康产生影响。
{"title":"Associations Between Gestational Residential Radon Exposure and Term Low Birthweight in Connecticut, USA.","authors":"Seulkee Heo, Longxiang Li, Ji-Young Son, Petros Koutrakis, Michelle L Bell","doi":"10.1097/EDE.0000000000001771","DOIUrl":"10.1097/EDE.0000000000001771","url":null,"abstract":"<p><strong>Background: </strong>Studies suggest biologic mechanisms for gestational exposure to radiation and impaired fetal development. We explored associations between gestational radon exposure and term low birthweight, for which evidence is limited.</p><p><strong>Methods: </strong>We examined data for 68,159 singleton full-term births in Connecticut, United States, 2016-2018. Using a radon spatiotemporal model, we estimated ZIP code-level basement and ground-level exposures during pregnancy and trimesters for each participant's address at birth or delivery. We used logistic regression models, including confounders, to estimate odds ratios (ORs) for term low birth weight in four exposure quartiles (Q1-Q4) with the lowest exposure group (Q1) as the reference.</p><p><strong>Results: </strong>Exposure levels to basement radon throughout pregnancy (0.27-3.02 pCi/L) were below the guideline level set by the US Environmental Protection Agency (4 pCi/L). The ORs for term low birth weight in the second-highest (Q3; 1.01-1.33 pCi/L) exposure group compared with the reference (<0.79 pCi/L) group for basement radon during the first trimester was 1.22 (95% confidence interval [CI] = 1.02, 1.45). The OR in the highest (Q4; 1.34-4.43 pCi/L) quartile group compared with the reference group during the first trimester was 1.26 (95% CI = 1.05, 1.50). Risks from basement radon were higher for participants with lower income, lower maternal education levels, or living in urban regions.</p><p><strong>Conclusion: </strong>This study found increased term low birth weight risks for increases in basement radon. Results have implications for infants' health for exposure to radon at levels below the current national guideline for indoor radon concentrations and building remediations.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"834-843"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11560713/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141751373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Don't Let Your Analysis Go to Seed: On the Impact of Random Seed on Machine Learning-based Causal Inference. 不要让你的分析变成种子:随机种子对基于机器学习的因果推理的影响。
IF 4.7 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2024-11-01 Epub Date: 2024-08-16 DOI: 10.1097/EDE.0000000000001782
Lindsey Schader, Weishan Song, Russell Kempker, David Benkeser

Machine learning techniques for causal effect estimation can enhance the reliability of epidemiologic analyses, reducing their dependence on correct model specifications. However, the stochastic nature of many machine learning algorithms implies that the results derived from such approaches may be influenced by the random seed that is set before model fitting. In this work, we highlight the substantial influence of random seeds on a popular approach for machine learning-based causal effect estimation, namely doubly robust estimators. We illustrate that varying seeds can yield divergent scientific interpretations of doubly robust estimates produced from the same dataset. We propose techniques for stabilizing results across random seeds and, through an extensive simulation study, demonstrate that these techniques effectively neutralize seed-related variability without compromising the statistical efficiency of the estimators. Based on these findings, we offer practical guidelines to minimize the influence of random seeds in real-world applications, and we encourage researchers to explore the variability due to random seeds when implementing any method that involves random steps.

用于因果效应估计的机器学习技术可以提高流行病学分析的可靠性,减少对正确模型规格的依赖。然而,许多机器学习算法的随机性意味着,这些方法得出的结果可能会受到模型拟合前设置的随机种子的影响。在这项工作中,我们强调了随机种子对基于机器学习的因果效应估计的一种流行方法(即双重稳健估计器)的重大影响。我们说明,不同的种子会对同一数据集产生的双重稳健估计产生不同的科学解释。我们提出了稳定随机种子结果的技术,并通过广泛的模拟研究证明,这些技术能有效中和与种子相关的变异性,而不会影响估计器的统计效率。基于这些发现,我们提出了在实际应用中尽量减少随机种子影响的实用指南,并鼓励研究人员在实施任何涉及随机步骤的方法时,探索随机种子导致的变异性。
{"title":"Don't Let Your Analysis Go to Seed: On the Impact of Random Seed on Machine Learning-based Causal Inference.","authors":"Lindsey Schader, Weishan Song, Russell Kempker, David Benkeser","doi":"10.1097/EDE.0000000000001782","DOIUrl":"10.1097/EDE.0000000000001782","url":null,"abstract":"<p><p>Machine learning techniques for causal effect estimation can enhance the reliability of epidemiologic analyses, reducing their dependence on correct model specifications. However, the stochastic nature of many machine learning algorithms implies that the results derived from such approaches may be influenced by the random seed that is set before model fitting. In this work, we highlight the substantial influence of random seeds on a popular approach for machine learning-based causal effect estimation, namely doubly robust estimators. We illustrate that varying seeds can yield divergent scientific interpretations of doubly robust estimates produced from the same dataset. We propose techniques for stabilizing results across random seeds and, through an extensive simulation study, demonstrate that these techniques effectively neutralize seed-related variability without compromising the statistical efficiency of the estimators. Based on these findings, we offer practical guidelines to minimize the influence of random seeds in real-world applications, and we encourage researchers to explore the variability due to random seeds when implementing any method that involves random steps.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"764-778"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141992225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pseudo-random Number Generator Influences on Average Treatment Effect Estimates Obtained with Machine Learning. 伪随机数生成器对机器学习获得的平均治疗效果估计值的影响》(Pseudo-Random Number Generator Influences on Average Treatment Effect Estimates obtained with Machine Learning)。
IF 4.7 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2024-11-01 Epub Date: 2024-08-16 DOI: 10.1097/EDE.0000000000001785
Ashley I Naimi, Ya-Hui Yu, Lisa M Bodnar

Background: The use of machine learning to estimate exposure effects introduces a dependence between the results of an empirical study and the value of the seed used to fix the pseudo-random number generator.

Methods: We used data from 10,038 pregnant women and a 10% subsample (N = 1004) to examine the extent to which the risk difference for the relation between fruit and vegetable consumption and preeclampsia risk changes under different seed values. We fit an augmented inverse probability weighted estimator with two Super Learner algorithms: a simple algorithm including random forests and single-layer neural networks and a more complex algorithm with a mix of tree-based, regression-based, penalized, and simple algorithms. We evaluated the distributions of risk differences, standard errors, and P values that result from 5000 different seed value selections.

Results: Our findings suggest important variability in the risk difference estimates, as well as an important effect of the stacking algorithm used. The interquartile range width of the risk differences in the full sample with the simple algorithm was 13 per 1000. However, all other interquartile ranges were roughly an order of magnitude lower. The medians of the distributions of risk differences differed according to the sample size and the algorithm used.

Conclusions: Our findings add another dimension of concern regarding the potential for "p-hacking," and further warrant the need to move away from simplistic evidentiary thresholds in empirical research. When empirical results depend on pseudo-random number generator seed values, caution is warranted in interpreting these results.

背景:使用机器学习估算暴露效应会在实证研究结果与用于固定伪随机数生成器的种子值之间引入一种依赖关系:我们使用了来自 10,038 名孕妇和 10% 的子样本(N = 1,004)的数据,研究了在不同的种子值下,水果和蔬菜摄入量与子痫前期风险之间的风险差异变化程度。我们用两种超级学习器算法拟合了一个增强的反概率加权估计器:一种是包括随机森林和单层神经网络的简单算法,另一种是混合了基于树、基于回归、惩罚算法和简单算法的更复杂算法。我们评估了 5000 个不同种子值的风险差异、标准误差和 p 值的分布情况:结果:我们的研究结果表明,风险差异估计值存在很大差异,所使用的堆叠算法也有重要影响。在使用简单算法的全样本中,风险差异的四分位数范围宽度(IQRw)为 13‰。然而,所有其他的 IQR 都低了大约一个数量级。风险差异分布的中位数因样本量和所用算法而异:我们的发现为 "p-黑客 "的可能性增添了新的担忧,并进一步证明了在实证研究中摒弃简单的证据阈值的必要性。当实证结果依赖于伪随机数生成器种子值时,在解释这些结果时必须谨慎。
{"title":"Pseudo-random Number Generator Influences on Average Treatment Effect Estimates Obtained with Machine Learning.","authors":"Ashley I Naimi, Ya-Hui Yu, Lisa M Bodnar","doi":"10.1097/EDE.0000000000001785","DOIUrl":"10.1097/EDE.0000000000001785","url":null,"abstract":"<p><strong>Background: </strong>The use of machine learning to estimate exposure effects introduces a dependence between the results of an empirical study and the value of the seed used to fix the pseudo-random number generator.</p><p><strong>Methods: </strong>We used data from 10,038 pregnant women and a 10% subsample (N = 1004) to examine the extent to which the risk difference for the relation between fruit and vegetable consumption and preeclampsia risk changes under different seed values. We fit an augmented inverse probability weighted estimator with two Super Learner algorithms: a simple algorithm including random forests and single-layer neural networks and a more complex algorithm with a mix of tree-based, regression-based, penalized, and simple algorithms. We evaluated the distributions of risk differences, standard errors, and P values that result from 5000 different seed value selections.</p><p><strong>Results: </strong>Our findings suggest important variability in the risk difference estimates, as well as an important effect of the stacking algorithm used. The interquartile range width of the risk differences in the full sample with the simple algorithm was 13 per 1000. However, all other interquartile ranges were roughly an order of magnitude lower. The medians of the distributions of risk differences differed according to the sample size and the algorithm used.</p><p><strong>Conclusions: </strong>Our findings add another dimension of concern regarding the potential for \"p-hacking,\" and further warrant the need to move away from simplistic evidentiary thresholds in empirical research. When empirical results depend on pseudo-random number generator seed values, caution is warranted in interpreting these results.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"779-786"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11560583/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141992227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Re: Bias in Calculation of Attributable Fractions Using Relative Risks from Nonsmokers Only. 关于仅使用不吸烟者的相对风险计算可归因分数的偏差。
IF 4.7 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2024-11-01 Epub Date: 2024-09-30 DOI: 10.1097/EDE.0000000000001786
Etsuji Suzuki, Eiji Yamamoto
{"title":"Re: Bias in Calculation of Attributable Fractions Using Relative Risks from Nonsmokers Only.","authors":"Etsuji Suzuki, Eiji Yamamoto","doi":"10.1097/EDE.0000000000001786","DOIUrl":"10.1097/EDE.0000000000001786","url":null,"abstract":"","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"e21-e22"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142003946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Epidemiology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1