Pub Date : 2026-01-12DOI: 10.1007/s10654-025-01341-7
Kaicheng Wang, Lindsey Rosman, Haidong Lu
Machine learning (ML) algorithms are increasingly used to estimate propensity score with expectation of improving causal inference. However, the validity of data-driven ML-based approaches for confounder selection and adjustment remains unclear. In this study, we emulated the device-stratified secondary analysis of the PARADIGM-HF trial among U.S. veterans with heart failure and implanted cardiac devices from 2016 to 2020. We benchmarked observational estimates from three propensity score approaches against the trial results. (1) logistic regression with pre-specified confounders (2), generalized boosted models (GBM) using the same pre-specified confounders, and (3) GBM with expanded covariates and automated feature selection. Logistic regression-based propensity score approach yielded estimates closest to the trial (HR = 0.93, 95% CI 0.61-1.42; 23-month RR = 0.86, 95% CI 0.57-1.24 vs. trial HR = 0.81, 95% CI 0.61-1.06). Despite better predictive performance, GBM with pre-specified confounders showed no improvement over the logistic regression approach (HR = 0.97, 95% CI 0.68-1.37; RR = 0.96, 95% CI 0.89-1.98). Moreover, GBM with expanded covariates and data-driven automated feature selection substantially increased bias (HR = 0.61, 95% CI 0.30-1.23; RR = 0.69, 95% CI 0.36-1.04). Our findings suggest that ML-based propensity score methods do not inherently improve causal estimation possibly due to residual confounding from omitted or partially adjusted variables and may introduce overadjustment bias when combined with automated feature selection. These results underscore the importance of careful confounder specification and causal reasoning over algorithmic complexity in causal inference.
机器学习(ML)算法越来越多地用于估计倾向得分,期望改善因果推理。然而,数据驱动的基于ml的混杂选择和调整方法的有效性仍然不清楚。在这项研究中,我们模拟了2016年至2020年美国退伍军人心力衰竭和植入心脏装置的PARADIGM-HF试验的器械分层二次分析。我们将三种倾向评分方法的观察性估计与试验结果进行基准比较。(1)预先指定混杂因素的逻辑回归(2),使用相同预先指定混杂因素的广义增强模型(GBM),以及(3)扩展协变量和自动特征选择的GBM。基于Logistic回归的倾向评分方法得出的估计值与试验最接近(HR = 0.93, 95% CI 0.61-1.42; 23个月的RR = 0.86, 95% CI 0.57-1.24,而试验HR = 0.81, 95% CI 0.61-1.06)。尽管具有更好的预测性能,但与逻辑回归方法相比,预先指定混杂因素的GBM没有改善(HR = 0.97, 95% CI 0.68-1.37; RR = 0.96, 95% CI 0.89-1.98)。此外,扩展协变量的GBM和数据驱动的自动特征选择大大增加了偏差(HR = 0.61, 95% CI 0.30-1.23; RR = 0.69, 95% CI 0.36-1.04)。我们的研究结果表明,基于机器学习的倾向评分方法并不能从本质上改善因果估计,这可能是由于遗漏或部分调整变量的残留混淆,并且当与自动特征选择相结合时可能会引入过度调整偏差。这些结果强调了在因果推理中,谨慎的混杂规范和因果推理在算法复杂性上的重要性。
{"title":"Machine learning versus logistic regression for propensity score estimation: a trial emulation benchmarked against the PARADIGM-HF randomized trial.","authors":"Kaicheng Wang, Lindsey Rosman, Haidong Lu","doi":"10.1007/s10654-025-01341-7","DOIUrl":"10.1007/s10654-025-01341-7","url":null,"abstract":"<p><p>Machine learning (ML) algorithms are increasingly used to estimate propensity score with expectation of improving causal inference. However, the validity of data-driven ML-based approaches for confounder selection and adjustment remains unclear. In this study, we emulated the device-stratified secondary analysis of the PARADIGM-HF trial among U.S. veterans with heart failure and implanted cardiac devices from 2016 to 2020. We benchmarked observational estimates from three propensity score approaches against the trial results. (1) logistic regression with pre-specified confounders (2), generalized boosted models (GBM) using the same pre-specified confounders, and (3) GBM with expanded covariates and automated feature selection. Logistic regression-based propensity score approach yielded estimates closest to the trial (HR = 0.93, 95% CI 0.61-1.42; 23-month RR = 0.86, 95% CI 0.57-1.24 vs. trial HR = 0.81, 95% CI 0.61-1.06). Despite better predictive performance, GBM with pre-specified confounders showed no improvement over the logistic regression approach (HR = 0.97, 95% CI 0.68-1.37; RR = 0.96, 95% CI 0.89-1.98). Moreover, GBM with expanded covariates and data-driven automated feature selection substantially increased bias (HR = 0.61, 95% CI 0.30-1.23; RR = 0.69, 95% CI 0.36-1.04). Our findings suggest that ML-based propensity score methods do not inherently improve causal estimation possibly due to residual confounding from omitted or partially adjusted variables and may introduce overadjustment bias when combined with automated feature selection. These results underscore the importance of careful confounder specification and causal reasoning over algorithmic complexity in causal inference.</p>","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":" ","pages":""},"PeriodicalIF":5.9,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145951526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-12DOI: 10.1007/s10654-025-01324-8
Yuankai Zhang,Roby Joehanes,Tianxiao Huan,Lukas M Weber,Qiong Yang,Kathryn L Lunetta,Daniel Levy,Chunyu Liu
Mendelian randomization has emerged as a powerful tool for exploring causal relationships in observational studies by using genetic variants as instrumental variables. While multivariable Mendelian randomization extends this approach to simultaneously address multiple exposures, it faces significant challenges with highly correlated exposures, particularly in high-dimensional settings such as multi-omics data. Conventional MVMR methods, which are primarily based on linear regression models, may suffer from multicollinearity and reduced statistical power when analyzing correlated exposures. The increasing availability of high-dimensional multi-omics data has highlighted the limitations of conventional MVMR approaches in analyzing correlated exposures while maintaining biological interpretability. To address these challenges, we propose integrating latent factor analysis into the MVMR framework, enabling dimension reduction without compromising interpretability. Through extensive simulation studies, we demonstrate that our method maintains a well-controlled false positive rate and offers superior sensitivity compared to conventional MVMR approaches. We apply our method to investigate the causal relationship between DNA methylation and mitochondrial DNA copy number. Our method offers a significant advantage in scenarios with highly correlated exposures driven by common latent factors or shared pathways, especially when individual effects are sparse. By applying our method to correlated multi-omics data, we can uncover new insights into the molecular mechanisms underlying complex phenotypes.
{"title":"Integration of latent factor analysis into multivariable Mendelian randomization.","authors":"Yuankai Zhang,Roby Joehanes,Tianxiao Huan,Lukas M Weber,Qiong Yang,Kathryn L Lunetta,Daniel Levy,Chunyu Liu","doi":"10.1007/s10654-025-01324-8","DOIUrl":"https://doi.org/10.1007/s10654-025-01324-8","url":null,"abstract":"Mendelian randomization has emerged as a powerful tool for exploring causal relationships in observational studies by using genetic variants as instrumental variables. While multivariable Mendelian randomization extends this approach to simultaneously address multiple exposures, it faces significant challenges with highly correlated exposures, particularly in high-dimensional settings such as multi-omics data. Conventional MVMR methods, which are primarily based on linear regression models, may suffer from multicollinearity and reduced statistical power when analyzing correlated exposures. The increasing availability of high-dimensional multi-omics data has highlighted the limitations of conventional MVMR approaches in analyzing correlated exposures while maintaining biological interpretability. To address these challenges, we propose integrating latent factor analysis into the MVMR framework, enabling dimension reduction without compromising interpretability. Through extensive simulation studies, we demonstrate that our method maintains a well-controlled false positive rate and offers superior sensitivity compared to conventional MVMR approaches. We apply our method to investigate the causal relationship between DNA methylation and mitochondrial DNA copy number. Our method offers a significant advantage in scenarios with highly correlated exposures driven by common latent factors or shared pathways, especially when individual effects are sparse. By applying our method to correlated multi-omics data, we can uncover new insights into the molecular mechanisms underlying complex phenotypes.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"39 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-12DOI: 10.1007/s10654-025-01335-5
Sif E Carlsen,Emily Jarden,Caroline H Hemmingsen,Lone Schmidt,Sarah Hjorth,Maarit K Leinonen,Ulrika Nörby,Lina S Mørch,Susanne K Kjaer,Hedvig Nordeng,Marie Hargreave
Observational studies have linked maternal hormonal contraception use to childhood cancer risk, but findings are inconsistent. A systematic review was conducted of this potential relationship. A systematic search was performed in PubMed, Embase, Scopus, Cochrane, and Web of Science databases until April 9, 2025. Studies reporting maternal hormonal contraception use before or during pregnancy and childhood cancer risk (0-19 years) were eligible. We included studies providing risk estimates in English or Scandinavian languages. Newcastle-Ottawa Scale was used to assess study quality. Meta-analysis using fixed and random effects was used to pool relative risks (RRs) with 95% confidence intervals (CIs) for childhood cancer according to maternal hormonal contraception use (1) up to or during pregnancy, and (2) exclusively during pregnancy. We included 27 studies (24 case-control and 3 cohort), totaling 11,067 childhood cancer cases. Maternal hormonal contraception use up to and during pregnancy increased risk of any childhood cancer (RR = 1.18; 95% CI = 1.10-1.26), leukemia (RR = 1.24; 95% CI = 1.06-1.45), and lymphoid leukemia (RR = 1.17; 95% CI = 1.06-1.28). Exposures during pregnancy showed higher risk estimate for any cancer (RR = 1.32; 95% CI = 1.12-1.56) and leukemia (RR = 1.63; 95% CI = 1.07-2.49). Most studies were moderate (70%) or high (26%) quality. Maternal hormonal contraception use may increase childhood cancer risk, particularly for leukemia, and during pregnancy. Further prospective studies are needed, focusing on specific hormonal contraception substances and exposure timing.
观察性研究将母亲使用激素避孕与儿童癌症风险联系起来,但研究结果并不一致。对这种潜在的关系进行了系统的回顾。系统检索PubMed, Embase, Scopus, Cochrane和Web of Science数据库,直到2025年4月9日。报告孕妇在怀孕前或怀孕期间使用激素避孕和儿童癌症风险(0-19岁)的研究符合条件。我们纳入了用英语或斯堪的纳维亚语言提供风险评估的研究。采用纽卡斯尔-渥太华量表评估研究质量。采用固定效应和随机效应的荟萃分析,根据(1)怀孕前或怀孕期间以及(2)仅在怀孕期间使用激素避孕,汇总儿童癌症的相对危险度(rr), 95%置信区间(CIs)。我们纳入了27项研究(24项病例对照和3项队列研究),共计11067例儿童癌症病例。孕妇在怀孕前后和怀孕期间使用激素避孕药会增加任何儿童癌症(RR = 1.18; 95% CI = 1.10-1.26)、白血病(RR = 1.24; 95% CI = 1.06-1.45)和淋巴细胞白血病(RR = 1.17; 95% CI = 1.06-1.28)的风险。怀孕期间暴露在暴露环境中,患任何癌症(RR = 1.32; 95% CI = 1.12-1.56)和白血病(RR = 1.63; 95% CI = 1.07-2.49)的风险都较高。大多数研究为中等(70%)或高(26%)质量。孕妇使用激素避孕可能会增加儿童患癌症的风险,尤其是白血病和怀孕期间。需要进一步的前瞻性研究,重点是具体的激素避孕物质和暴露时间。
{"title":"Maternal hormonal contraception use and childhood cancer risk: a systematic review and meta-analysis.","authors":"Sif E Carlsen,Emily Jarden,Caroline H Hemmingsen,Lone Schmidt,Sarah Hjorth,Maarit K Leinonen,Ulrika Nörby,Lina S Mørch,Susanne K Kjaer,Hedvig Nordeng,Marie Hargreave","doi":"10.1007/s10654-025-01335-5","DOIUrl":"https://doi.org/10.1007/s10654-025-01335-5","url":null,"abstract":"Observational studies have linked maternal hormonal contraception use to childhood cancer risk, but findings are inconsistent. A systematic review was conducted of this potential relationship. A systematic search was performed in PubMed, Embase, Scopus, Cochrane, and Web of Science databases until April 9, 2025. Studies reporting maternal hormonal contraception use before or during pregnancy and childhood cancer risk (0-19 years) were eligible. We included studies providing risk estimates in English or Scandinavian languages. Newcastle-Ottawa Scale was used to assess study quality. Meta-analysis using fixed and random effects was used to pool relative risks (RRs) with 95% confidence intervals (CIs) for childhood cancer according to maternal hormonal contraception use (1) up to or during pregnancy, and (2) exclusively during pregnancy. We included 27 studies (24 case-control and 3 cohort), totaling 11,067 childhood cancer cases. Maternal hormonal contraception use up to and during pregnancy increased risk of any childhood cancer (RR = 1.18; 95% CI = 1.10-1.26), leukemia (RR = 1.24; 95% CI = 1.06-1.45), and lymphoid leukemia (RR = 1.17; 95% CI = 1.06-1.28). Exposures during pregnancy showed higher risk estimate for any cancer (RR = 1.32; 95% CI = 1.12-1.56) and leukemia (RR = 1.63; 95% CI = 1.07-2.49). Most studies were moderate (70%) or high (26%) quality. Maternal hormonal contraception use may increase childhood cancer risk, particularly for leukemia, and during pregnancy. Further prospective studies are needed, focusing on specific hormonal contraception substances and exposure timing.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"255 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-01Epub Date: 2025-10-27DOI: 10.1007/s10654-025-01302-0
Minqing Yan, Jie Shen, Mengjia Zhao, Leqi Fei, Yuhui Huang, Minyu Wu, Ting Shen, Gulisiya Hailili, Dan Liu, Geng Zong, Yan Zheng, Dong Hang, Changzheng Yuan
The associations of coffee and tea intake with long-term risk of dementia have not been thoroughly established. Additionally, the potential mediating roles of circulating inflammatory biomarkers in these associations remain less explored. We included 6,001 participants from the Health and Retirement Study (HRS, 2013-2020) and 2,650 participants from the Framingham Heart Study Offspring cohort (FOS, 1998-2018), all free of dementia at baseline. Coffee and tea intake was assessed using a semi-quantitative food frequency questionnaire in both cohorts. Dementia diagnosis was ascertained using a validated algorithm and clinical review panel. Cox proportional hazard models were utilized to evaluate the associations of coffee and tea intake with dementia. Mediation analysis was conducted to examine whether circulating inflammatory biomarkers mediated these associations. During a median follow-up of 7.0 years in HRS and 11.1 years in FOS, 231 individuals in HRS and 204 in FOS developed all-cause dementia. Compared with intake of less than 1 cup of coffee per day, consuming ≥ 2 cups daily had a 28-37% lower risk of dementia (Hazards ratio [HR] = 0.72, 95% confidence interval [CI]: 0.52, 0.99, P-trend = 0.045 in HRS; HR = 0.63, 95% CI: 0.45, 0.90, P-trend = 0.015 in FOS). Compared to non-consumers, moderate tea consumption was associated with a lower dementia risk in HRS (HR = 0.65, 95% CI: 0.48, 0.89 for > 0 to < 1 cup/day; HR = 0.53, 95% CI: 0.30, 0.94 for ≥ 1 to < 2 cups/day), but no significant association was observed in FOS. In the mediation analysis, the association between coffee intake and dementia was partially mediated by interleukin-10 (IL-10, 29.30%), Cystatin C (24.45%), C-reactive protein (CRP, 16.54%), interleukin-1 receptor antagonist (IL-1RA, 11.06%), and soluble tumor necrosis factor receptor-1 (sTNFR-1, 10.78%). In conclusion, higher coffee consumption (≥ 2 cups per day) is associated with a lower risk of dementia, partially mediated by a set of inflammatory biomarkers. Moderate intake of tea (0-2 cups per day) may relate to a lower risk of dementia. Further large-scale observational and interventional studies are warranted to confirm these findings.
{"title":"Coffee and tea intake, circulating inflammatory biomarkers, and long-term risk of dementia: findings from two longitudinal studies.","authors":"Minqing Yan, Jie Shen, Mengjia Zhao, Leqi Fei, Yuhui Huang, Minyu Wu, Ting Shen, Gulisiya Hailili, Dan Liu, Geng Zong, Yan Zheng, Dong Hang, Changzheng Yuan","doi":"10.1007/s10654-025-01302-0","DOIUrl":"10.1007/s10654-025-01302-0","url":null,"abstract":"<p><p>The associations of coffee and tea intake with long-term risk of dementia have not been thoroughly established. Additionally, the potential mediating roles of circulating inflammatory biomarkers in these associations remain less explored. We included 6,001 participants from the Health and Retirement Study (HRS, 2013-2020) and 2,650 participants from the Framingham Heart Study Offspring cohort (FOS, 1998-2018), all free of dementia at baseline. Coffee and tea intake was assessed using a semi-quantitative food frequency questionnaire in both cohorts. Dementia diagnosis was ascertained using a validated algorithm and clinical review panel. Cox proportional hazard models were utilized to evaluate the associations of coffee and tea intake with dementia. Mediation analysis was conducted to examine whether circulating inflammatory biomarkers mediated these associations. During a median follow-up of 7.0 years in HRS and 11.1 years in FOS, 231 individuals in HRS and 204 in FOS developed all-cause dementia. Compared with intake of less than 1 cup of coffee per day, consuming ≥ 2 cups daily had a 28-37% lower risk of dementia (Hazards ratio [HR] = 0.72, 95% confidence interval [CI]: 0.52, 0.99, P-trend = 0.045 in HRS; HR = 0.63, 95% CI: 0.45, 0.90, P-trend = 0.015 in FOS). Compared to non-consumers, moderate tea consumption was associated with a lower dementia risk in HRS (HR = 0.65, 95% CI: 0.48, 0.89 for > 0 to < 1 cup/day; HR = 0.53, 95% CI: 0.30, 0.94 for ≥ 1 to < 2 cups/day), but no significant association was observed in FOS. In the mediation analysis, the association between coffee intake and dementia was partially mediated by interleukin-10 (IL-10, 29.30%), Cystatin C (24.45%), C-reactive protein (CRP, 16.54%), interleukin-1 receptor antagonist (IL-1RA, 11.06%), and soluble tumor necrosis factor receptor-1 (sTNFR-1, 10.78%). In conclusion, higher coffee consumption (≥ 2 cups per day) is associated with a lower risk of dementia, partially mediated by a set of inflammatory biomarkers. Moderate intake of tea (0-2 cups per day) may relate to a lower risk of dementia. Further large-scale observational and interventional studies are warranted to confirm these findings.</p>","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":" ","pages":"27-38"},"PeriodicalIF":5.9,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145376375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-01Epub Date: 2026-01-24DOI: 10.1007/s10654-025-01360-4
Can Hou, Haowen Liu, Viktor H Ahlqvist, Elisabet Unnur Gisladottir, Yao Yang, Huazhen Yang, Fang Fang, Unnur A Valdimarsdóttir, Huan Song
The rapid expansion of large-scale electronic health record (EHR) data has underscored the necessity for advanced analytical methods, such as disease network analyses, to comprehensively identify and interpret multimorbidity patterns and disease progression pathways. To overcome existing obstacles associated with performing sophisticated disease network analyses on EHR data, we developed DiNetxify, an open-source Python package implementing our recently introduced three-dimensional (3D) disease network analysis method ( https://hzcohort.github.io/DiNetxify/ ). DiNetxify provides a dedicated data class for handling various EHR data, comprehensive modular functions for executing complete 3D disease network analyses, and visualization functions for interactive exploration of results. The package is efficient, user-friendly, and optimized for large-scale EHR datasets. It supports diverse study designs, customizable analysis parameters, and parallel computing for enhanced performance. Through a case study utilizing UK Biobank data to investigate disease networks associated with short leukocyte telomere length, we demonstrated the capability of DiNetxify to identify meaningful disease clusters and progression patterns consistent with established knowledge while uncovering novel insights. Computationally, the software successfully completed analyses involving cohorts exceeding half a million exposed individuals within 17 h, using moderate computational resources. We thus anticipate that DiNetxify can significantly reduce technical barriers to facilitate broader adoption of advanced disease network analysis techniques by different researchers, thereby enhancing the exploration of EHR data to improve the understanding of holistic health dynamics.
{"title":"DiNetxify-a python package for three‑dimensional disease network analysis based on electronic health record data.","authors":"Can Hou, Haowen Liu, Viktor H Ahlqvist, Elisabet Unnur Gisladottir, Yao Yang, Huazhen Yang, Fang Fang, Unnur A Valdimarsdóttir, Huan Song","doi":"10.1007/s10654-025-01360-4","DOIUrl":"10.1007/s10654-025-01360-4","url":null,"abstract":"<p><p>The rapid expansion of large-scale electronic health record (EHR) data has underscored the necessity for advanced analytical methods, such as disease network analyses, to comprehensively identify and interpret multimorbidity patterns and disease progression pathways. To overcome existing obstacles associated with performing sophisticated disease network analyses on EHR data, we developed DiNetxify, an open-source Python package implementing our recently introduced three-dimensional (3D) disease network analysis method ( https://hzcohort.github.io/DiNetxify/ ). DiNetxify provides a dedicated data class for handling various EHR data, comprehensive modular functions for executing complete 3D disease network analyses, and visualization functions for interactive exploration of results. The package is efficient, user-friendly, and optimized for large-scale EHR datasets. It supports diverse study designs, customizable analysis parameters, and parallel computing for enhanced performance. Through a case study utilizing UK Biobank data to investigate disease networks associated with short leukocyte telomere length, we demonstrated the capability of DiNetxify to identify meaningful disease clusters and progression patterns consistent with established knowledge while uncovering novel insights. Computationally, the software successfully completed analyses involving cohorts exceeding half a million exposed individuals within 17 h, using moderate computational resources. We thus anticipate that DiNetxify can significantly reduce technical barriers to facilitate broader adoption of advanced disease network analysis techniques by different researchers, thereby enhancing the exploration of EHR data to improve the understanding of holistic health dynamics.</p>","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":" ","pages":"119-126"},"PeriodicalIF":5.9,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146040215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-01Epub Date: 2025-10-27DOI: 10.1007/s10654-025-01282-1
Fangyu Liu, Jennifer A Schrack, Keenan A Walker, Jeremy Walston, Rasika A Mathias, Michael E Griswold, Priya Palta, B Gwen Windham, John W Jackson
Clinical trials have shown favorable effects of exercise on frailty, supporting physical activity (PA) as a treatment and prevention strategy. Proteomics studies suggest that PA alters levels of many proteins, some of which may function as molecules in the biological processes underlying frailty. However, these studies have focused on structured exercise programs or cross-sectional PA-protein associations. Therefore, the effects of long-term PA on frailty-associated proteins remain unknown. Among 14,898 middle-aged adults, we emulated a target trial that assigned individuals to either (i) achieve and maintain the recommended PA level (≥ 150 min/week of moderate-to-vigorous physical activity [MVPA]) through 6 (± 0.3) years of follow-up or (ii) follow a "natural course" strategy, where all individuals engage in various amounts of habitual MVPA. We estimated the effects of long-term adherence to recommended MVPA versus the natural course strategy on 45 previously identified frailty-associated proteins at the end of the follow-up using inverse probability of weighting (IPW) and iterative conditional expectations (ICE). We found that long-term adherence to recommended MVPA improved the population levels of many frailty-associated proteins (ranged from 0.04 to 0.11 standard deviation); the greatest benefits were seen in proteins involved in the nervous system (e.g., voltage-dependent calcium channel subunit alpha-2/delta-3 [CACNA2D3], contactin-1 [CNTN1], neural cell adhesion molecule 1 [NCAM1], and transmembrane protein 132D [TMEM132D]) and inflammation (e.g., high-temperature requirement serine protease A1 [HTRA1] and C-reactive protein [CRP]). Our findings suggest improved nervous system and reduced inflammation as the biological basis of long-term engagement in adequate PA as an intervention strategy for frailty.
{"title":"The effect of long-term adherence to physical activity recommendations in midlife on plasma proteins associated with frailty in the Atherosclerosis Risk in Communities (ARIC) study.","authors":"Fangyu Liu, Jennifer A Schrack, Keenan A Walker, Jeremy Walston, Rasika A Mathias, Michael E Griswold, Priya Palta, B Gwen Windham, John W Jackson","doi":"10.1007/s10654-025-01282-1","DOIUrl":"10.1007/s10654-025-01282-1","url":null,"abstract":"<p><p>Clinical trials have shown favorable effects of exercise on frailty, supporting physical activity (PA) as a treatment and prevention strategy. Proteomics studies suggest that PA alters levels of many proteins, some of which may function as molecules in the biological processes underlying frailty. However, these studies have focused on structured exercise programs or cross-sectional PA-protein associations. Therefore, the effects of long-term PA on frailty-associated proteins remain unknown. Among 14,898 middle-aged adults, we emulated a target trial that assigned individuals to either (i) achieve and maintain the recommended PA level (≥ 150 min/week of moderate-to-vigorous physical activity [MVPA]) through 6 (± 0.3) years of follow-up or (ii) follow a \"natural course\" strategy, where all individuals engage in various amounts of habitual MVPA. We estimated the effects of long-term adherence to recommended MVPA versus the natural course strategy on 45 previously identified frailty-associated proteins at the end of the follow-up using inverse probability of weighting (IPW) and iterative conditional expectations (ICE). We found that long-term adherence to recommended MVPA improved the population levels of many frailty-associated proteins (ranged from 0.04 to 0.11 standard deviation); the greatest benefits were seen in proteins involved in the nervous system (e.g., voltage-dependent calcium channel subunit alpha-2/delta-3 [CACNA2D3], contactin-1 [CNTN1], neural cell adhesion molecule 1 [NCAM1], and transmembrane protein 132D [TMEM132D]) and inflammation (e.g., high-temperature requirement serine protease A1 [HTRA1] and C-reactive protein [CRP]). Our findings suggest improved nervous system and reduced inflammation as the biological basis of long-term engagement in adequate PA as an intervention strategy for frailty.</p>","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":" ","pages":"51-69"},"PeriodicalIF":5.9,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12881102/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145376351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-26DOI: 10.1007/s10654-025-01351-5
Albert Hofman
{"title":"A note of thanks","authors":"Albert Hofman","doi":"10.1007/s10654-025-01351-5","DOIUrl":"https://doi.org/10.1007/s10654-025-01351-5","url":null,"abstract":"","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"32 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145830269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-13DOI: 10.1007/s10654-025-01312-y
Ellen E Walters,Susan E Luczak,Christopher R Beam,Malin Ericsson,William S Kremen,Robert F Krueger,Kristian E Markon,Matt McGue,Marianne Nygaard,Matthew S Panizzon,Brenda L Plassman,Chandra A Reynolds,Perminder S Sachdev,Anbu Thalamuthu,Keith E Whitfield,Nancy L Pedersen,Margaret Gatz,
We reply to the letter to the editor by Soohyeon Ko (Eur J Epidemiol, https://doi.org/10.1007/s10654-025-01305-x ) concerning our article by Walters et al. (Eur J Epidemiol, https://doi.org/10.1007/s10654-025-01286-x ). We reiterate that genetic explanations contribute to understanding why education is protective against dementia, alongside influences reflecting the whole of one's family and societal context. We also caution that genetic explanations should not be misinterpreted as deterministic.
我们回复了Soohyeon Ko (Eur J epidemiology, https://doi.org/10.1007/s10654-025-01305-x)就Walters et al. (Eur J epidemiology, https://doi.org/10.1007/s10654-025-01286-x)的文章给编辑的信。我们重申,基因解释有助于理解为什么教育可以预防痴呆症,以及反映整个家庭和社会背景的影响。我们还警告说,基因解释不应被误解为决定性的。
{"title":"Reply to Ko, Null within-twin estimates on education and dementia: cautions for within-family contrasts.","authors":"Ellen E Walters,Susan E Luczak,Christopher R Beam,Malin Ericsson,William S Kremen,Robert F Krueger,Kristian E Markon,Matt McGue,Marianne Nygaard,Matthew S Panizzon,Brenda L Plassman,Chandra A Reynolds,Perminder S Sachdev,Anbu Thalamuthu,Keith E Whitfield,Nancy L Pedersen,Margaret Gatz, ","doi":"10.1007/s10654-025-01312-y","DOIUrl":"https://doi.org/10.1007/s10654-025-01312-y","url":null,"abstract":"We reply to the letter to the editor by Soohyeon Ko (Eur J Epidemiol, https://doi.org/10.1007/s10654-025-01305-x ) concerning our article by Walters et al. (Eur J Epidemiol, https://doi.org/10.1007/s10654-025-01286-x ). We reiterate that genetic explanations contribute to understanding why education is protective against dementia, alongside influences reflecting the whole of one's family and societal context. We also caution that genetic explanations should not be misinterpreted as deterministic.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"1 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2025-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145732661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-03DOI: 10.1007/s10654-025-01301-1
Marco Piccininni, Vanessa Didelez, Mats J. Stensrud
Screening colonoscopy has been shown to reduce colorectal cancer incidence. However, the magnitude of this effect is debated. There is concern that some trial participants already had colorectal cancer at baseline. The screening procedure could not prevent disease occurrence in these participants, leading to “prevalence bias”. Some authors have argued that the effect of interest is confined to participants without disease at baseline, and failing to exclude prevalent cases supposedly leads to effect underestimation. Yet, the issue is debated, with other authors arguing that conventional randomized trials provide the effects that are most relevant to public health. Here we present new, formal arguments that clarify misconceptions in this debate. We show that, under mild assumptions, the so-called “prevalence bias” is not a concern when researchers are interested in estimating risk differences, rather than risk ratios. This is because of a statistical property of the causal risk difference when outcomes are rare, called “doomed-selection stability”.
{"title":"Choosing a sensible contrast makes “prevalence bias” irrelevant in screening colonoscopy trials","authors":"Marco Piccininni, Vanessa Didelez, Mats J. Stensrud","doi":"10.1007/s10654-025-01301-1","DOIUrl":"https://doi.org/10.1007/s10654-025-01301-1","url":null,"abstract":"Screening colonoscopy has been shown to reduce colorectal cancer incidence. However, the magnitude of this effect is debated. There is concern that some trial participants already had colorectal cancer at baseline. The screening procedure could not prevent disease occurrence in these participants, leading to “prevalence bias”. Some authors have argued that the effect of interest is confined to participants without disease at baseline, and failing to exclude prevalent cases supposedly leads to effect underestimation. Yet, the issue is debated, with other authors arguing that conventional randomized trials provide the effects that are most relevant to public health. Here we present new, formal arguments that clarify misconceptions in this debate. We show that, under mild assumptions, the so-called “prevalence bias” is not a concern when researchers are interested in estimating risk differences, rather than risk ratios. This is because of a statistical property of the causal risk difference when outcomes are rare, called “doomed-selection stability”.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"58 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145657358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-01Epub Date: 2025-09-01DOI: 10.1007/s10654-025-01295-w
Yunyun Liu, Chi Pang Wen, Junlong Pan, Jiameng Cui, Wanzhu Lu, Tong Sun, Xian Ning, June Han Lee, Wenyuan Li, Huakang Tu, Xifeng Wu
Previous studies on serum iron levels and mortality risk have yielded inconsistent findings based on single-point measurements. How serum iron levels and their longitudinal changes influence all-cause and cause-specific mortality remains unknown. This study investigated associations between baseline serum iron levels, their longitudinal changes, and all-cause and cause-specific mortality in a prospective cohort. Participants were recruited from the Taiwan MJ cohort (1997-2007) and followed until December 31, 2022. Baseline serum iron was categorized as low, normal, or high. Based on changes at a second visit, participants were further classified as persistent normal, progression to abnormal, reversion to normal, or persistent abnormal. Cox proportional hazard models were used for analysis. Over a median follow-up of 19.0 years, 33,005 deaths occurred. Fully adjusted models demonstrated J-shaped associations between serum iron and all-cause and cause-specific mortality (all P < 0.001), with higher all-cause mortality risks in low (HR 1.27, 95% CI [1.23, 1.31]) and high iron groups (HR 1.37, 95% CI [1.30, 1.44]). Compared to persistent normal levels, those with progression to abnormal, reversion to normal, or persistent abnormal serum iron exhibited elevated mortality risks (HRs: 1.22 [1.15, 1.30], 1.16 [1.09, 1.24], 1.49 [1.36, 1.63], respectively). Moreover, maintaining normal serum iron status alongside a healthy lifestyle exhibited the lowest mortality risks. Long term abnormal serum iron status was linked to increased mortality, which could be mitigated through lifestyle modifications, suggesting significance of serum iron monitoring and potential intervention.
{"title":"Associations of serum iron and its status change with mortality risk: prospective findings from the MJ cohort.","authors":"Yunyun Liu, Chi Pang Wen, Junlong Pan, Jiameng Cui, Wanzhu Lu, Tong Sun, Xian Ning, June Han Lee, Wenyuan Li, Huakang Tu, Xifeng Wu","doi":"10.1007/s10654-025-01295-w","DOIUrl":"10.1007/s10654-025-01295-w","url":null,"abstract":"<p><p>Previous studies on serum iron levels and mortality risk have yielded inconsistent findings based on single-point measurements. How serum iron levels and their longitudinal changes influence all-cause and cause-specific mortality remains unknown. This study investigated associations between baseline serum iron levels, their longitudinal changes, and all-cause and cause-specific mortality in a prospective cohort. Participants were recruited from the Taiwan MJ cohort (1997-2007) and followed until December 31, 2022. Baseline serum iron was categorized as low, normal, or high. Based on changes at a second visit, participants were further classified as persistent normal, progression to abnormal, reversion to normal, or persistent abnormal. Cox proportional hazard models were used for analysis. Over a median follow-up of 19.0 years, 33,005 deaths occurred. Fully adjusted models demonstrated J-shaped associations between serum iron and all-cause and cause-specific mortality (all P < 0.001), with higher all-cause mortality risks in low (HR 1.27, 95% CI [1.23, 1.31]) and high iron groups (HR 1.37, 95% CI [1.30, 1.44]). Compared to persistent normal levels, those with progression to abnormal, reversion to normal, or persistent abnormal serum iron exhibited elevated mortality risks (HRs: 1.22 [1.15, 1.30], 1.16 [1.09, 1.24], 1.49 [1.36, 1.63], respectively). Moreover, maintaining normal serum iron status alongside a healthy lifestyle exhibited the lowest mortality risks. Long term abnormal serum iron status was linked to increased mortality, which could be mitigated through lifestyle modifications, suggesting significance of serum iron monitoring and potential intervention.</p>","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":" ","pages":"1419-1429"},"PeriodicalIF":5.9,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144947616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}