首页 > 最新文献

European Journal of Epidemiology最新文献

英文 中文
An extension of the validation cohort of the Dutch Early-Stage Melanoma (D-ESMEL) study for stage-specific analyses. 荷兰早期黑色素瘤(D-ESMEL)研究阶段特异性分析验证队列的扩展。
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01337-3
Catherine Zhou,Antien L Mooyaart,Nikita Hulscher,Thamila Kerkour,Jasper Ouwerkerk,Marieke W J Louwman,Marlies Wakkee,Yunlei Li,Quirinus J M Voorham,Annette Bruggink,Tamar E C Nijsten,Loes M Hollestein
There is a high need for accurate prognostic models among stage II melanoma to determine who may benefit from (neo)adjuvant systemic therapy. The Dutch Early- Stage Melanoma (D-ESMEL) study was designed to identify new prognostic features in a population-based sample of stage I/II melanoma patients in addition to American Joint Committee of Cancer (AJCC) staging. The validation cohort of the D-ESMEL study employs a nested case-control design. Initially, controls were randomly sampled to develop prognostic that included both known and new prognostic factors to assess the additive value of new prognostic factors. As a consequence, most controls had a very thin melanoma (<1.0 mm) while most cases had a thicker melanoma (>2.0 mm). This resulted in insufficient variability and high weights for stage II controls when applying weighted analyses in absolute risk prediction models. Therefore, randomly sampled controls were re-matched on AJCC stage (stage IA, IB, IIA, IIB, IIC), and new stage-matched controls were collected for cases who could not be rematched. The original D-ESMEL validation cohort included 5,815 stage I/II melanoma patients, of whom 154 developed distant metastasis (cases). 98/154 Cases were stage II and only 24 stage II controls were included, while the stage-matched design now includes 153 stage-matched case-control sets of which 97 stage II cases and 97 stage II controls derived from a population-based cohort of 5,785 stage I/II patients. The updated design increased the biological variability among stage II controls, balanced weights in weighted analyses and thereby facilitating reliable subgroup analyses in this clinically important subgroup.
在II期黑色素瘤中,需要精确的预后模型来确定谁可能从(新)辅助全身治疗中受益。荷兰早期黑色素瘤(D-ESMEL)研究旨在确定除美国癌症联合委员会(AJCC)分期外,基于人群的I/II期黑色素瘤患者样本中的新预后特征。D-ESMEL研究的验证队列采用嵌套病例对照设计。最初,随机抽取对照组,制定包括已知和新的预后因素的预后评估,以评估新预后因素的附加价值。结果,大多数对照组的黑色素瘤很薄(2.0毫米)。当在绝对风险预测模型中应用加权分析时,这导致II期控制的变异性不足和权重过高。因此,在AJCC分期(IA期、IB期、IIA期、IIB期、IIC期)随机抽样对照进行重新匹配,无法重新匹配的病例收集新的分期匹配对照。最初的D-ESMEL验证队列包括5815例I/II期黑色素瘤患者,其中154例发生远处转移。98/154例为II期,仅包括24例II期对照,而阶段匹配设计现在包括153个阶段匹配的病例对照组,其中97例II期病例和97例II期对照来自5,785例I/II期患者的基于人群的队列。更新的设计增加了II期对照的生物学变异性,平衡了加权分析中的权重,从而促进了这一临床重要亚组的可靠亚组分析。
{"title":"An extension of the validation cohort of the Dutch Early-Stage Melanoma (D-ESMEL) study for stage-specific analyses.","authors":"Catherine Zhou,Antien L Mooyaart,Nikita Hulscher,Thamila Kerkour,Jasper Ouwerkerk,Marieke W J Louwman,Marlies Wakkee,Yunlei Li,Quirinus J M Voorham,Annette Bruggink,Tamar E C Nijsten,Loes M Hollestein","doi":"10.1007/s10654-025-01337-3","DOIUrl":"https://doi.org/10.1007/s10654-025-01337-3","url":null,"abstract":"There is a high need for accurate prognostic models among stage II melanoma to determine who may benefit from (neo)adjuvant systemic therapy. The Dutch Early- Stage Melanoma (D-ESMEL) study was designed to identify new prognostic features in a population-based sample of stage I/II melanoma patients in addition to American Joint Committee of Cancer (AJCC) staging. The validation cohort of the D-ESMEL study employs a nested case-control design. Initially, controls were randomly sampled to develop prognostic that included both known and new prognostic factors to assess the additive value of new prognostic factors. As a consequence, most controls had a very thin melanoma (<1.0 mm) while most cases had a thicker melanoma (>2.0 mm). This resulted in insufficient variability and high weights for stage II controls when applying weighted analyses in absolute risk prediction models. Therefore, randomly sampled controls were re-matched on AJCC stage (stage IA, IB, IIA, IIB, IIC), and new stage-matched controls were collected for cases who could not be rematched. The original D-ESMEL validation cohort included 5,815 stage I/II melanoma patients, of whom 154 developed distant metastasis (cases). 98/154 Cases were stage II and only 24 stage II controls were included, while the stage-matched design now includes 153 stage-matched case-control sets of which 97 stage II cases and 97 stage II controls derived from a population-based cohort of 5,785 stage I/II patients. The updated design increased the biological variability among stage II controls, balanced weights in weighted analyses and thereby facilitating reliable subgroup analyses in this clinically important subgroup.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"94 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quality of cancer-related data from the Danish National patient registry (1994-2025) and the Danish cancer registry (2004-2025): a systematic review. 丹麦国家患者登记处(1994-2025)和丹麦癌症登记处(2004-2025)癌症相关数据的质量:一项系统综述。
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01336-4
Amalie Helme Simoni,Kathrine Hald,Thure Filskov Overvad,Mette Søgaard,Anne Gulbech Ording
The Danish National Patient Registry (DNPR) and the Danish Cancer Registry (DCR) are central to registry-based cancer research. This systematic review evaluates studies assessing the quality of cancer-related data in these registries under their current data structures. PubMed and Embase were systematically searched on January 24, 2025 (PROSPERO: CRD420251005952). Studies validating cancer-related data in the DNPR or DCR against a gold standard were included. Findings were synthesized narratively and categorized by DNPR data, DCR data, or multi-source algorithms. The literature search generated 915 records, of which 50 were included: 23 validated DNPR data, 9 DCR data, and 18 algorithm performance. The quality of DNPR cancer diagnoses and treatment showed positive predictive values (PPVs) of 57-100%, highest for common malignancies and treatments. The quality of DNPR comorbidities and complications varied substantially (PPVs 0-98%). The PPV of a melanoma diagnosis in the DCR was 97%. The DCR staging completeness varied considerably (34-95%). Algorithms presented PPVs of 60-96% for recurrence, active cancer, and recognized metastases, and 28% for unrecognized metastases. The DNPR and DCR provide high-quality data for many cancer diagnoses, treatments, and outcomes, supporting their use in register-based research. While some data elements, including data on complications, exhibit lower quality, algorithmic approaches can enhance utility for less robust data. However, several aspects of cancer-related data remain unvalidated.
丹麦国家患者登记处(DNPR)和丹麦癌症登记处(DCR)是基于登记处的癌症研究的中心。本系统综述评估了在当前数据结构下评估这些登记处癌症相关数据质量的研究。PubMed和Embase于2025年1月24日被系统检索(PROSPERO: CRD420251005952)。研究证实了DNPR或DCR中与癌症相关的数据是否符合金标准。通过DNPR数据、DCR数据或多源算法对结果进行叙述性综合和分类。文献检索共产生915条记录,其中50条被纳入:23条验证DNPR数据,9条DCR数据,18条算法性能。DNPR肿瘤诊断和治疗质量的阳性预测值(ppv)为57-100%,常见恶性肿瘤和治疗的阳性预测值最高。DNPR合并症和并发症的质量差异很大(PPVs 0-98%)。DCR诊断黑色素瘤的PPV为97%。DCR分期完成度差异较大(34-95%)。算法显示,复发、活动性癌症和已识别转移的ppv为60-96%,未识别转移的ppv为28%。DNPR和DCR为许多癌症诊断、治疗和结果提供了高质量的数据,支持它们在基于登记的研究中的使用。虽然一些数据元素(包括有关复杂性的数据)表现出较低的质量,但算法方法可以增强对不那么健壮的数据的效用。然而,癌症相关数据的几个方面仍未得到证实。
{"title":"Quality of cancer-related data from the Danish National patient registry (1994-2025) and the Danish cancer registry (2004-2025): a systematic review.","authors":"Amalie Helme Simoni,Kathrine Hald,Thure Filskov Overvad,Mette Søgaard,Anne Gulbech Ording","doi":"10.1007/s10654-025-01336-4","DOIUrl":"https://doi.org/10.1007/s10654-025-01336-4","url":null,"abstract":"The Danish National Patient Registry (DNPR) and the Danish Cancer Registry (DCR) are central to registry-based cancer research. This systematic review evaluates studies assessing the quality of cancer-related data in these registries under their current data structures. PubMed and Embase were systematically searched on January 24, 2025 (PROSPERO: CRD420251005952). Studies validating cancer-related data in the DNPR or DCR against a gold standard were included. Findings were synthesized narratively and categorized by DNPR data, DCR data, or multi-source algorithms. The literature search generated 915 records, of which 50 were included: 23 validated DNPR data, 9 DCR data, and 18 algorithm performance. The quality of DNPR cancer diagnoses and treatment showed positive predictive values (PPVs) of 57-100%, highest for common malignancies and treatments. The quality of DNPR comorbidities and complications varied substantially (PPVs 0-98%). The PPV of a melanoma diagnosis in the DCR was 97%. The DCR staging completeness varied considerably (34-95%). Algorithms presented PPVs of 60-96% for recurrence, active cancer, and recognized metastases, and 28% for unrecognized metastases. The DNPR and DCR provide high-quality data for many cancer diagnoses, treatments, and outcomes, supporting their use in register-based research. While some data elements, including data on complications, exhibit lower quality, algorithmic approaches can enhance utility for less robust data. However, several aspects of cancer-related data remain unvalidated.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"82 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Why do women live longer than men, but spend more time in poor health? A decomposition analysis of the gender gap in unhealthy life years across Europe. 为什么女性寿命比男性长,但健康状况不佳的时间却更多?欧洲不健康寿命年数性别差异的分解分析。
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01346-2
Magdalena Muszynska-Spielauer,Paola Di Giulio,Yuka Minagawa,Vanessa Di Lego,Marc Luy
This study tests the "longevity hypothesis," which posits that women's greater number of years spent in poor health is primarily a direct consequence of their longer survival. We analyse gender differences in unhealthy life years (ULY) at age 50 across 22 European countries in 2015-2017. ULY was estimated using three approaches-the Sullivan method, the cross-sectional average length of healthy life, and multistate life tables-applied to four health indicators of varying severity: chronic diseases, functional limitations, self-rated health, and disability. Data were drawn from the Human Mortality Database and the Survey of Health, Ageing and Retirement in Europe. We decomposed the gender gap in ULY into a "mortality effect" (ME), reflecting differences in life years lived, and a "health effect" (HE), reflecting differences in morbidity prevalence. Women at age 50 lived more unhealthy years than men across almost all health indicators and countries. In most cases, more than half of the gender gap in ULY was attributable to the ME, indicating that women's longer survival primarily explains their greater number of years spent in poor health. The HE showed greater variation across indicators and countries. Results were most consistent for chronic diseases and self-rated health, while functional limitations and disability yielded smaller and less consistent differences. Findings support the longevity hypothesis: women's higher life expectancy is the main driver of their longer lifetime spent in poor health. The variation across health dimensions highlights the importance of distinguishing between them when studying gender inequalities in health.
这项研究检验了“长寿假说”,该假说认为,女性健康状况不佳的时间更长,主要是她们寿命更长的直接后果。我们分析了2015-2017年22个欧洲国家50岁人群不健康寿命年(ULY)的性别差异。ULY是用三种方法来估计的——沙利文方法、健康寿命的横截面平均长度和多状态生命表——应用于四种不同严重程度的健康指标:慢性病、功能限制、自评健康和残疾。数据来自人类死亡率数据库和欧洲健康、老龄化和退休调查。我们将ULY中的性别差异分解为反映寿命年数差异的“死亡率效应”(ME)和反映发病率差异的“健康效应”(HE)。在几乎所有健康指标和国家中,50岁妇女的不健康寿命都比男子长。在大多数情况下,ULY中超过一半的性别差距可归因于ME,这表明妇女的生存时间较长主要解释了她们健康状况不佳的时间较长。高等教育在各个指标和国家之间显示出更大的差异。慢性疾病和自评健康的结果最为一致,而功能限制和残疾产生的差异较小且不太一致。研究结果支持长寿假说:女性的预期寿命较长是她们健康状况不佳的主要原因。健康各方面的差异突出了在研究健康方面的性别不平等时区分这些方面的重要性。
{"title":"Why do women live longer than men, but spend more time in poor health? A decomposition analysis of the gender gap in unhealthy life years across Europe.","authors":"Magdalena Muszynska-Spielauer,Paola Di Giulio,Yuka Minagawa,Vanessa Di Lego,Marc Luy","doi":"10.1007/s10654-025-01346-2","DOIUrl":"https://doi.org/10.1007/s10654-025-01346-2","url":null,"abstract":"This study tests the \"longevity hypothesis,\" which posits that women's greater number of years spent in poor health is primarily a direct consequence of their longer survival. We analyse gender differences in unhealthy life years (ULY) at age 50 across 22 European countries in 2015-2017. ULY was estimated using three approaches-the Sullivan method, the cross-sectional average length of healthy life, and multistate life tables-applied to four health indicators of varying severity: chronic diseases, functional limitations, self-rated health, and disability. Data were drawn from the Human Mortality Database and the Survey of Health, Ageing and Retirement in Europe. We decomposed the gender gap in ULY into a \"mortality effect\" (ME), reflecting differences in life years lived, and a \"health effect\" (HE), reflecting differences in morbidity prevalence. Women at age 50 lived more unhealthy years than men across almost all health indicators and countries. In most cases, more than half of the gender gap in ULY was attributable to the ME, indicating that women's longer survival primarily explains their greater number of years spent in poor health. The HE showed greater variation across indicators and countries. Results were most consistent for chronic diseases and self-rated health, while functional limitations and disability yielded smaller and less consistent differences. Findings support the longevity hypothesis: women's higher life expectancy is the main driver of their longer lifetime spent in poor health. The variation across health dimensions highlights the importance of distinguishing between them when studying gender inequalities in health.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"30 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Risk of postpartum depression among women with endometriosis: the Norwegian mother, father and child cohort study (MoBa). 子宫内膜异位症妇女产后抑郁的风险:挪威母亲、父亲和儿童队列研究(MoBa)
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01338-2
Marius Johansen,Tone Kristin Omsland,Katariina Laine,Siri Eldevik Håberg,Maria Christine Magnus
Women with endometriosis have a higher burden of anxiety and depression. Whether they are at increased risk of postpartum depression (PPD) remains unclear. We aimed to compare the risk of PPD between women with and without endometriosis and to explore mediation by previous history of major depression and infertility. In a population-based cohort study, we compared 1,159 singleton pregnancies to women with self-reported endometriosis and 74,590 pregnancies to women without endometriosis. We calculated a djusted risk ratios (aRR) with 95% confidence intervals (CI) using multivariable log-binomial regression, adjusting for age, body mass index, education and income. Mediation analyses assessed the indirect effect of any history of major depression or infertility. Women with endometriosis had a higher risk of PPD (aRR: 1.34, 95% CI: 1.15-1.55). Mediation analyses indicated that a large part of this association was explained by a higher lifetime prevalence of major depression among women with endometriosis (natural direct effect of endometriosis: aRR: 1.17, 95% CI: 1.00-1.36; natural indirect effect of any history of major depression: aRR: 1.14, 95% CI: 1.08-1.20), with 49.3% proportion mediated. Infertility demonstrated a negative natural indirect effect on the association between endometriosis and PPD (aRR: 0.87, 95% CI: 0.81-0.94). Women with endometriosis had an elevated risk of PPD which was largely explained by a higher lifetime prevalence of major depression. Our findings suggest that they constitute a high-risk group and could benefit from closer follow-up to facilitate early identification and intervention.
患有子宫内膜异位症的女性有更高的焦虑和抑郁负担。她们是否会增加产后抑郁症(PPD)的风险尚不清楚。我们的目的是比较有子宫内膜异位症和无子宫内膜异位症的女性发生PPD的风险,并探讨重度抑郁症和不孕症的既往史。在一项基于人群的队列研究中,我们比较了1159名单胎妊娠与自我报告子宫内膜异位症的妇女,以及74590名妊娠与无子宫内膜异位症的妇女。我们使用多变量对数二项回归计算了校正风险比(aRR), 95%置信区间(CI),调整了年龄、体重指数、教育程度和收入。中介分析评估了任何重度抑郁或不孕史的间接影响。患有子宫内膜异位症的女性患PPD的风险更高(aRR: 1.34, 95% CI: 1.15-1.55)。中介分析表明,这种关联在很大程度上可以解释为子宫内膜异位症女性一生中重度抑郁症的患病率较高(子宫内膜异位症的自然直接影响:aRR: 1.17, 95% CI: 1.00-1.36;任何重度抑郁症史的自然间接影响:aRR: 1.14, 95% CI: 1.08-1.20),其中有49.3%的中介比例。不孕症对子宫内膜异位症与PPD之间的关系表现出负的自然间接影响(aRR: 0.87, 95% CI: 0.81-0.94)。患有子宫内膜异位症的女性患产后抑郁症的风险较高,这在很大程度上可以解释为重度抑郁症的终生患病率较高。我们的研究结果表明,他们构成了一个高风险群体,可以从更密切的随访中受益,以促进早期识别和干预。
{"title":"Risk of postpartum depression among women with endometriosis: the Norwegian mother, father and child cohort study (MoBa).","authors":"Marius Johansen,Tone Kristin Omsland,Katariina Laine,Siri Eldevik Håberg,Maria Christine Magnus","doi":"10.1007/s10654-025-01338-2","DOIUrl":"https://doi.org/10.1007/s10654-025-01338-2","url":null,"abstract":"Women with endometriosis have a higher burden of anxiety and depression. Whether they are at increased risk of postpartum depression (PPD) remains unclear. We aimed to compare the risk of PPD between women with and without endometriosis and to explore mediation by previous history of major depression and infertility. In a population-based cohort study, we compared 1,159 singleton pregnancies to women with self-reported endometriosis and 74,590 pregnancies to women without endometriosis. We calculated a djusted risk ratios (aRR) with 95% confidence intervals (CI) using multivariable log-binomial regression, adjusting for age, body mass index, education and income. Mediation analyses assessed the indirect effect of any history of major depression or infertility. Women with endometriosis had a higher risk of PPD (aRR: 1.34, 95% CI: 1.15-1.55). Mediation analyses indicated that a large part of this association was explained by a higher lifetime prevalence of major depression among women with endometriosis (natural direct effect of endometriosis: aRR: 1.17, 95% CI: 1.00-1.36; natural indirect effect of any history of major depression: aRR: 1.14, 95% CI: 1.08-1.20), with 49.3% proportion mediated. Infertility demonstrated a negative natural indirect effect on the association between endometriosis and PPD (aRR: 0.87, 95% CI: 0.81-0.94). Women with endometriosis had an elevated risk of PPD which was largely explained by a higher lifetime prevalence of major depression. Our findings suggest that they constitute a high-risk group and could benefit from closer follow-up to facilitate early identification and intervention.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"24 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning versus logistic regression for propensity score estimation: a trial emulation benchmarked against the PARADIGM-HF randomized trial. 倾向评分估计的机器学习与逻辑回归:以PARADIGM-HF随机试验为基准的试验模拟。
IF 5.9 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01341-7
Kaicheng Wang, Lindsey Rosman, Haidong Lu

Machine learning (ML) algorithms are increasingly used to estimate propensity score with expectation of improving causal inference. However, the validity of data-driven ML-based approaches for confounder selection and adjustment remains unclear. In this study, we emulated the device-stratified secondary analysis of the PARADIGM-HF trial among U.S. veterans with heart failure and implanted cardiac devices from 2016 to 2020. We benchmarked observational estimates from three propensity score approaches against the trial results. (1) logistic regression with pre-specified confounders (2), generalized boosted models (GBM) using the same pre-specified confounders, and (3) GBM with expanded covariates and automated feature selection. Logistic regression-based propensity score approach yielded estimates closest to the trial (HR = 0.93, 95% CI 0.61-1.42; 23-month RR = 0.86, 95% CI 0.57-1.24 vs. trial HR = 0.81, 95% CI 0.61-1.06). Despite better predictive performance, GBM with pre-specified confounders showed no improvement over the logistic regression approach (HR = 0.97, 95% CI 0.68-1.37; RR = 0.96, 95% CI 0.89-1.98). Moreover, GBM with expanded covariates and data-driven automated feature selection substantially increased bias (HR = 0.61, 95% CI 0.30-1.23; RR = 0.69, 95% CI 0.36-1.04). Our findings suggest that ML-based propensity score methods do not inherently improve causal estimation possibly due to residual confounding from omitted or partially adjusted variables and may introduce overadjustment bias when combined with automated feature selection. These results underscore the importance of careful confounder specification and causal reasoning over algorithmic complexity in causal inference.

机器学习(ML)算法越来越多地用于估计倾向得分,期望改善因果推理。然而,数据驱动的基于ml的混杂选择和调整方法的有效性仍然不清楚。在这项研究中,我们模拟了2016年至2020年美国退伍军人心力衰竭和植入心脏装置的PARADIGM-HF试验的器械分层二次分析。我们将三种倾向评分方法的观察性估计与试验结果进行基准比较。(1)预先指定混杂因素的逻辑回归(2),使用相同预先指定混杂因素的广义增强模型(GBM),以及(3)扩展协变量和自动特征选择的GBM。基于Logistic回归的倾向评分方法得出的估计值与试验最接近(HR = 0.93, 95% CI 0.61-1.42; 23个月的RR = 0.86, 95% CI 0.57-1.24,而试验HR = 0.81, 95% CI 0.61-1.06)。尽管具有更好的预测性能,但与逻辑回归方法相比,预先指定混杂因素的GBM没有改善(HR = 0.97, 95% CI 0.68-1.37; RR = 0.96, 95% CI 0.89-1.98)。此外,扩展协变量的GBM和数据驱动的自动特征选择大大增加了偏差(HR = 0.61, 95% CI 0.30-1.23; RR = 0.69, 95% CI 0.36-1.04)。我们的研究结果表明,基于机器学习的倾向评分方法并不能从本质上改善因果估计,这可能是由于遗漏或部分调整变量的残留混淆,并且当与自动特征选择相结合时可能会引入过度调整偏差。这些结果强调了在因果推理中,谨慎的混杂规范和因果推理在算法复杂性上的重要性。
{"title":"Machine learning versus logistic regression for propensity score estimation: a trial emulation benchmarked against the PARADIGM-HF randomized trial.","authors":"Kaicheng Wang, Lindsey Rosman, Haidong Lu","doi":"10.1007/s10654-025-01341-7","DOIUrl":"10.1007/s10654-025-01341-7","url":null,"abstract":"<p><p>Machine learning (ML) algorithms are increasingly used to estimate propensity score with expectation of improving causal inference. However, the validity of data-driven ML-based approaches for confounder selection and adjustment remains unclear. In this study, we emulated the device-stratified secondary analysis of the PARADIGM-HF trial among U.S. veterans with heart failure and implanted cardiac devices from 2016 to 2020. We benchmarked observational estimates from three propensity score approaches against the trial results. (1) logistic regression with pre-specified confounders (2), generalized boosted models (GBM) using the same pre-specified confounders, and (3) GBM with expanded covariates and automated feature selection. Logistic regression-based propensity score approach yielded estimates closest to the trial (HR = 0.93, 95% CI 0.61-1.42; 23-month RR = 0.86, 95% CI 0.57-1.24 vs. trial HR = 0.81, 95% CI 0.61-1.06). Despite better predictive performance, GBM with pre-specified confounders showed no improvement over the logistic regression approach (HR = 0.97, 95% CI 0.68-1.37; RR = 0.96, 95% CI 0.89-1.98). Moreover, GBM with expanded covariates and data-driven automated feature selection substantially increased bias (HR = 0.61, 95% CI 0.30-1.23; RR = 0.69, 95% CI 0.36-1.04). Our findings suggest that ML-based propensity score methods do not inherently improve causal estimation possibly due to residual confounding from omitted or partially adjusted variables and may introduce overadjustment bias when combined with automated feature selection. These results underscore the importance of careful confounder specification and causal reasoning over algorithmic complexity in causal inference.</p>","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":" ","pages":""},"PeriodicalIF":5.9,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145951526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integration of latent factor analysis into multivariable Mendelian randomization. 将潜在因素分析整合到多变量孟德尔随机化中。
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01324-8
Yuankai Zhang,Roby Joehanes,Tianxiao Huan,Lukas M Weber,Qiong Yang,Kathryn L Lunetta,Daniel Levy,Chunyu Liu
Mendelian randomization has emerged as a powerful tool for exploring causal relationships in observational studies by using genetic variants as instrumental variables. While multivariable Mendelian randomization extends this approach to simultaneously address multiple exposures, it faces significant challenges with highly correlated exposures, particularly in high-dimensional settings such as multi-omics data. Conventional MVMR methods, which are primarily based on linear regression models, may suffer from multicollinearity and reduced statistical power when analyzing correlated exposures. The increasing availability of high-dimensional multi-omics data has highlighted the limitations of conventional MVMR approaches in analyzing correlated exposures while maintaining biological interpretability. To address these challenges, we propose integrating latent factor analysis into the MVMR framework, enabling dimension reduction without compromising interpretability. Through extensive simulation studies, we demonstrate that our method maintains a well-controlled false positive rate and offers superior sensitivity compared to conventional MVMR approaches. We apply our method to investigate the causal relationship between DNA methylation and mitochondrial DNA copy number. Our method offers a significant advantage in scenarios with highly correlated exposures driven by common latent factors or shared pathways, especially when individual effects are sparse. By applying our method to correlated multi-omics data, we can uncover new insights into the molecular mechanisms underlying complex phenotypes.
孟德尔随机化已经成为一种强大的工具,通过使用遗传变异作为工具变量来探索观察性研究中的因果关系。虽然多变量孟德尔随机化将这种方法扩展到同时处理多个暴露,但它面临着高度相关暴露的重大挑战,特别是在高维环境中,如多组学数据。传统的MVMR方法主要基于线性回归模型,在分析相关暴露时可能存在多重共线性和统计能力降低的问题。高维多组学数据的不断增加凸显了传统MVMR方法在分析相关暴露同时保持生物学可解释性方面的局限性。为了应对这些挑战,我们建议将潜在因素分析整合到MVMR框架中,在不影响可解释性的情况下实现降维。通过广泛的仿真研究,我们证明,与传统的MVMR方法相比,我们的方法保持了良好的假阳性率控制,并提供了更高的灵敏度。我们应用我们的方法来研究DNA甲基化和线粒体DNA拷贝数之间的因果关系。我们的方法在由共同潜在因素或共享途径驱动的高度相关暴露的情况下提供了显著的优势,特别是当个体影响稀疏时。通过将我们的方法应用于相关的多组学数据,我们可以揭示复杂表型背后的分子机制的新见解。
{"title":"Integration of latent factor analysis into multivariable Mendelian randomization.","authors":"Yuankai Zhang,Roby Joehanes,Tianxiao Huan,Lukas M Weber,Qiong Yang,Kathryn L Lunetta,Daniel Levy,Chunyu Liu","doi":"10.1007/s10654-025-01324-8","DOIUrl":"https://doi.org/10.1007/s10654-025-01324-8","url":null,"abstract":"Mendelian randomization has emerged as a powerful tool for exploring causal relationships in observational studies by using genetic variants as instrumental variables. While multivariable Mendelian randomization extends this approach to simultaneously address multiple exposures, it faces significant challenges with highly correlated exposures, particularly in high-dimensional settings such as multi-omics data. Conventional MVMR methods, which are primarily based on linear regression models, may suffer from multicollinearity and reduced statistical power when analyzing correlated exposures. The increasing availability of high-dimensional multi-omics data has highlighted the limitations of conventional MVMR approaches in analyzing correlated exposures while maintaining biological interpretability. To address these challenges, we propose integrating latent factor analysis into the MVMR framework, enabling dimension reduction without compromising interpretability. Through extensive simulation studies, we demonstrate that our method maintains a well-controlled false positive rate and offers superior sensitivity compared to conventional MVMR approaches. We apply our method to investigate the causal relationship between DNA methylation and mitochondrial DNA copy number. Our method offers a significant advantage in scenarios with highly correlated exposures driven by common latent factors or shared pathways, especially when individual effects are sparse. By applying our method to correlated multi-omics data, we can uncover new insights into the molecular mechanisms underlying complex phenotypes.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"39 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Maternal hormonal contraception use and childhood cancer risk: a systematic review and meta-analysis. 母亲使用激素避孕与儿童癌症风险:系统回顾和荟萃分析。
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01335-5
Sif E Carlsen,Emily Jarden,Caroline H Hemmingsen,Lone Schmidt,Sarah Hjorth,Maarit K Leinonen,Ulrika Nörby,Lina S Mørch,Susanne K Kjaer,Hedvig Nordeng,Marie Hargreave
Observational studies have linked maternal hormonal contraception use to childhood cancer risk, but findings are inconsistent. A systematic review was conducted of this potential relationship. A systematic search was performed in PubMed, Embase, Scopus, Cochrane, and Web of Science databases until April 9, 2025. Studies reporting maternal hormonal contraception use before or during pregnancy and childhood cancer risk (0-19 years) were eligible. We included studies providing risk estimates in English or Scandinavian languages. Newcastle-Ottawa Scale was used to assess study quality. Meta-analysis using fixed and random effects was used to pool relative risks (RRs) with 95% confidence intervals (CIs) for childhood cancer according to maternal hormonal contraception use (1) up to or during pregnancy, and (2) exclusively during pregnancy. We included 27 studies (24 case-control and 3 cohort), totaling 11,067 childhood cancer cases. Maternal hormonal contraception use up to and during pregnancy increased risk of any childhood cancer (RR = 1.18; 95% CI = 1.10-1.26), leukemia (RR = 1.24; 95% CI = 1.06-1.45), and lymphoid leukemia (RR = 1.17; 95% CI = 1.06-1.28). Exposures during pregnancy showed higher risk estimate for any cancer (RR = 1.32; 95% CI = 1.12-1.56) and leukemia (RR = 1.63; 95% CI = 1.07-2.49). Most studies were moderate (70%) or high (26%) quality. Maternal hormonal contraception use may increase childhood cancer risk, particularly for leukemia, and during pregnancy. Further prospective studies are needed, focusing on specific hormonal contraception substances and exposure timing.
观察性研究将母亲使用激素避孕与儿童癌症风险联系起来,但研究结果并不一致。对这种潜在的关系进行了系统的回顾。系统检索PubMed, Embase, Scopus, Cochrane和Web of Science数据库,直到2025年4月9日。报告孕妇在怀孕前或怀孕期间使用激素避孕和儿童癌症风险(0-19岁)的研究符合条件。我们纳入了用英语或斯堪的纳维亚语言提供风险评估的研究。采用纽卡斯尔-渥太华量表评估研究质量。采用固定效应和随机效应的荟萃分析,根据(1)怀孕前或怀孕期间以及(2)仅在怀孕期间使用激素避孕,汇总儿童癌症的相对危险度(rr), 95%置信区间(CIs)。我们纳入了27项研究(24项病例对照和3项队列研究),共计11067例儿童癌症病例。孕妇在怀孕前后和怀孕期间使用激素避孕药会增加任何儿童癌症(RR = 1.18; 95% CI = 1.10-1.26)、白血病(RR = 1.24; 95% CI = 1.06-1.45)和淋巴细胞白血病(RR = 1.17; 95% CI = 1.06-1.28)的风险。怀孕期间暴露在暴露环境中,患任何癌症(RR = 1.32; 95% CI = 1.12-1.56)和白血病(RR = 1.63; 95% CI = 1.07-2.49)的风险都较高。大多数研究为中等(70%)或高(26%)质量。孕妇使用激素避孕可能会增加儿童患癌症的风险,尤其是白血病和怀孕期间。需要进一步的前瞻性研究,重点是具体的激素避孕物质和暴露时间。
{"title":"Maternal hormonal contraception use and childhood cancer risk: a systematic review and meta-analysis.","authors":"Sif E Carlsen,Emily Jarden,Caroline H Hemmingsen,Lone Schmidt,Sarah Hjorth,Maarit K Leinonen,Ulrika Nörby,Lina S Mørch,Susanne K Kjaer,Hedvig Nordeng,Marie Hargreave","doi":"10.1007/s10654-025-01335-5","DOIUrl":"https://doi.org/10.1007/s10654-025-01335-5","url":null,"abstract":"Observational studies have linked maternal hormonal contraception use to childhood cancer risk, but findings are inconsistent. A systematic review was conducted of this potential relationship. A systematic search was performed in PubMed, Embase, Scopus, Cochrane, and Web of Science databases until April 9, 2025. Studies reporting maternal hormonal contraception use before or during pregnancy and childhood cancer risk (0-19 years) were eligible. We included studies providing risk estimates in English or Scandinavian languages. Newcastle-Ottawa Scale was used to assess study quality. Meta-analysis using fixed and random effects was used to pool relative risks (RRs) with 95% confidence intervals (CIs) for childhood cancer according to maternal hormonal contraception use (1) up to or during pregnancy, and (2) exclusively during pregnancy. We included 27 studies (24 case-control and 3 cohort), totaling 11,067 childhood cancer cases. Maternal hormonal contraception use up to and during pregnancy increased risk of any childhood cancer (RR = 1.18; 95% CI = 1.10-1.26), leukemia (RR = 1.24; 95% CI = 1.06-1.45), and lymphoid leukemia (RR = 1.17; 95% CI = 1.06-1.28). Exposures during pregnancy showed higher risk estimate for any cancer (RR = 1.32; 95% CI = 1.12-1.56) and leukemia (RR = 1.63; 95% CI = 1.07-2.49). Most studies were moderate (70%) or high (26%) quality. Maternal hormonal contraception use may increase childhood cancer risk, particularly for leukemia, and during pregnancy. Further prospective studies are needed, focusing on specific hormonal contraception substances and exposure timing.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"255 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coffee and tea intake, circulating inflammatory biomarkers, and long-term risk of dementia: findings from two longitudinal studies. 咖啡和茶的摄入量、循环炎症生物标志物和痴呆的长期风险:两项纵向研究的结果。
IF 5.9 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-01 Epub Date: 2025-10-27 DOI: 10.1007/s10654-025-01302-0
Minqing Yan, Jie Shen, Mengjia Zhao, Leqi Fei, Yuhui Huang, Minyu Wu, Ting Shen, Gulisiya Hailili, Dan Liu, Geng Zong, Yan Zheng, Dong Hang, Changzheng Yuan

The associations of coffee and tea intake with long-term risk of dementia have not been thoroughly established. Additionally, the potential mediating roles of circulating inflammatory biomarkers in these associations remain less explored. We included 6,001 participants from the Health and Retirement Study (HRS, 2013-2020) and 2,650 participants from the Framingham Heart Study Offspring cohort (FOS, 1998-2018), all free of dementia at baseline. Coffee and tea intake was assessed using a semi-quantitative food frequency questionnaire in both cohorts. Dementia diagnosis was ascertained using a validated algorithm and clinical review panel. Cox proportional hazard models were utilized to evaluate the associations of coffee and tea intake with dementia. Mediation analysis was conducted to examine whether circulating inflammatory biomarkers mediated these associations. During a median follow-up of 7.0 years in HRS and 11.1 years in FOS, 231 individuals in HRS and 204 in FOS developed all-cause dementia. Compared with intake of less than 1 cup of coffee per day, consuming ≥ 2 cups daily had a 28-37% lower risk of dementia (Hazards ratio [HR] = 0.72, 95% confidence interval [CI]: 0.52, 0.99, P-trend = 0.045 in HRS; HR = 0.63, 95% CI: 0.45, 0.90, P-trend = 0.015 in FOS). Compared to non-consumers, moderate tea consumption was associated with a lower dementia risk in HRS (HR = 0.65, 95% CI: 0.48, 0.89 for > 0 to < 1 cup/day; HR = 0.53, 95% CI: 0.30, 0.94 for ≥ 1 to < 2 cups/day), but no significant association was observed in FOS. In the mediation analysis, the association between coffee intake and dementia was partially mediated by interleukin-10 (IL-10, 29.30%), Cystatin C (24.45%), C-reactive protein (CRP, 16.54%), interleukin-1 receptor antagonist (IL-1RA, 11.06%), and soluble tumor necrosis factor receptor-1 (sTNFR-1, 10.78%). In conclusion, higher coffee consumption (≥ 2 cups per day) is associated with a lower risk of dementia, partially mediated by a set of inflammatory biomarkers. Moderate intake of tea (0-2 cups per day) may relate to a lower risk of dementia. Further large-scale observational and interventional studies are warranted to confirm these findings.

咖啡和茶的摄入量与患痴呆症的长期风险之间的联系还没有完全确定。此外,循环炎症生物标志物在这些关联中的潜在介导作用仍然较少探索。我们纳入了来自健康与退休研究(HRS, 2013-2020)的6,001名参与者和来自弗雷明汉心脏研究后代队列(FOS, 1998-2018)的2,650名参与者,他们在基线时均无痴呆。在两个队列中,使用半定量食物频率问卷来评估咖啡和茶的摄入量。痴呆诊断是通过一个有效的算法和临床评审小组确定的。使用Cox比例风险模型来评估咖啡和茶摄入与痴呆的关系。进行中介分析以检查循环炎症生物标志物是否介导了这些关联。在HRS组中位随访时间为7.0年,FOS组中位随访时间为11.1年,HRS组中位随访时间为231年,FOS组中位随访时间为204年。与每天摄入少于1杯咖啡的人相比,每天饮用≥2杯咖啡的人患痴呆的风险降低28-37% (HRS的风险比[HR] = 0.72, 95%可信区间[CI]: 0.52, 0.99, p趋势= 0.045;FOS的风险比[HR] = 0.63, 95% CI: 0.45, 0.90, p趋势= 0.015)。与不喝茶的人相比,适度喝茶的人在HRS中患痴呆症的风险较低(HR = 0.65, 95% CI: 0.48, 0.89)
{"title":"Coffee and tea intake, circulating inflammatory biomarkers, and long-term risk of dementia: findings from two longitudinal studies.","authors":"Minqing Yan, Jie Shen, Mengjia Zhao, Leqi Fei, Yuhui Huang, Minyu Wu, Ting Shen, Gulisiya Hailili, Dan Liu, Geng Zong, Yan Zheng, Dong Hang, Changzheng Yuan","doi":"10.1007/s10654-025-01302-0","DOIUrl":"10.1007/s10654-025-01302-0","url":null,"abstract":"<p><p>The associations of coffee and tea intake with long-term risk of dementia have not been thoroughly established. Additionally, the potential mediating roles of circulating inflammatory biomarkers in these associations remain less explored. We included 6,001 participants from the Health and Retirement Study (HRS, 2013-2020) and 2,650 participants from the Framingham Heart Study Offspring cohort (FOS, 1998-2018), all free of dementia at baseline. Coffee and tea intake was assessed using a semi-quantitative food frequency questionnaire in both cohorts. Dementia diagnosis was ascertained using a validated algorithm and clinical review panel. Cox proportional hazard models were utilized to evaluate the associations of coffee and tea intake with dementia. Mediation analysis was conducted to examine whether circulating inflammatory biomarkers mediated these associations. During a median follow-up of 7.0 years in HRS and 11.1 years in FOS, 231 individuals in HRS and 204 in FOS developed all-cause dementia. Compared with intake of less than 1 cup of coffee per day, consuming ≥ 2 cups daily had a 28-37% lower risk of dementia (Hazards ratio [HR] = 0.72, 95% confidence interval [CI]: 0.52, 0.99, P-trend = 0.045 in HRS; HR = 0.63, 95% CI: 0.45, 0.90, P-trend = 0.015 in FOS). Compared to non-consumers, moderate tea consumption was associated with a lower dementia risk in HRS (HR = 0.65, 95% CI: 0.48, 0.89 for > 0 to < 1 cup/day; HR = 0.53, 95% CI: 0.30, 0.94 for ≥ 1 to < 2 cups/day), but no significant association was observed in FOS. In the mediation analysis, the association between coffee intake and dementia was partially mediated by interleukin-10 (IL-10, 29.30%), Cystatin C (24.45%), C-reactive protein (CRP, 16.54%), interleukin-1 receptor antagonist (IL-1RA, 11.06%), and soluble tumor necrosis factor receptor-1 (sTNFR-1, 10.78%). In conclusion, higher coffee consumption (≥ 2 cups per day) is associated with a lower risk of dementia, partially mediated by a set of inflammatory biomarkers. Moderate intake of tea (0-2 cups per day) may relate to a lower risk of dementia. Further large-scale observational and interventional studies are warranted to confirm these findings.</p>","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":" ","pages":"27-38"},"PeriodicalIF":5.9,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145376375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DiNetxify-a python package for three‑dimensional disease network analysis based on electronic health record data. 基于电子健康记录数据的三维疾病网络分析python包。
IF 5.9 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-01 Epub Date: 2026-01-24 DOI: 10.1007/s10654-025-01360-4
Can Hou, Haowen Liu, Viktor H Ahlqvist, Elisabet Unnur Gisladottir, Yao Yang, Huazhen Yang, Fang Fang, Unnur A Valdimarsdóttir, Huan Song

The rapid expansion of large-scale electronic health record (EHR) data has underscored the necessity for advanced analytical methods, such as disease network analyses, to comprehensively identify and interpret multimorbidity patterns and disease progression pathways. To overcome existing obstacles associated with performing sophisticated disease network analyses on EHR data, we developed DiNetxify, an open-source Python package implementing our recently introduced three-dimensional (3D) disease network analysis method ( https://hzcohort.github.io/DiNetxify/ ). DiNetxify provides a dedicated data class for handling various EHR data, comprehensive modular functions for executing complete 3D disease network analyses, and visualization functions for interactive exploration of results. The package is efficient, user-friendly, and optimized for large-scale EHR datasets. It supports diverse study designs, customizable analysis parameters, and parallel computing for enhanced performance. Through a case study utilizing UK Biobank data to investigate disease networks associated with short leukocyte telomere length, we demonstrated the capability of DiNetxify to identify meaningful disease clusters and progression patterns consistent with established knowledge while uncovering novel insights. Computationally, the software successfully completed analyses involving cohorts exceeding half a million exposed individuals within 17 h, using moderate computational resources. We thus anticipate that DiNetxify can significantly reduce technical barriers to facilitate broader adoption of advanced disease network analysis techniques by different researchers, thereby enhancing the exploration of EHR data to improve the understanding of holistic health dynamics.

大规模电子健康记录(EHR)数据的快速扩展强调了先进分析方法的必要性,例如疾病网络分析,以全面识别和解释多发病模式和疾病进展途径。为了克服对电子病历数据进行复杂疾病网络分析相关的现有障碍,我们开发了DiNetxify,这是一个开源Python包,实现了我们最近引入的三维(3D)疾病网络分析方法(https://hzcohort.github.io/DiNetxify/)。DiNetxify提供了专门的数据类来处理各种EHR数据,提供了全面的模块化功能来执行完整的3D疾病网络分析,提供了可视化功能来交互式探索结果。该软件包高效、用户友好,并针对大规模电子病历数据集进行了优化。它支持不同的研究设计,可定制的分析参数和并行计算,以提高性能。通过一个案例研究,利用英国生物银行的数据来调查与白细胞端粒长度短相关的疾病网络,我们证明了DiNetxify能够识别有意义的疾病集群和与已有知识一致的进展模式,同时发现新的见解。计算上,该软件成功地在17小时内完成了涉及超过50万暴露个体的队列分析,使用了适度的计算资源。因此,我们预计DiNetxify可以显著减少技术障碍,促进不同研究人员更广泛地采用先进的疾病网络分析技术,从而加强对电子病历数据的探索,以提高对整体健康动态的理解。
{"title":"DiNetxify-a python package for three‑dimensional disease network analysis based on electronic health record data.","authors":"Can Hou, Haowen Liu, Viktor H Ahlqvist, Elisabet Unnur Gisladottir, Yao Yang, Huazhen Yang, Fang Fang, Unnur A Valdimarsdóttir, Huan Song","doi":"10.1007/s10654-025-01360-4","DOIUrl":"10.1007/s10654-025-01360-4","url":null,"abstract":"<p><p>The rapid expansion of large-scale electronic health record (EHR) data has underscored the necessity for advanced analytical methods, such as disease network analyses, to comprehensively identify and interpret multimorbidity patterns and disease progression pathways. To overcome existing obstacles associated with performing sophisticated disease network analyses on EHR data, we developed DiNetxify, an open-source Python package implementing our recently introduced three-dimensional (3D) disease network analysis method ( https://hzcohort.github.io/DiNetxify/ ). DiNetxify provides a dedicated data class for handling various EHR data, comprehensive modular functions for executing complete 3D disease network analyses, and visualization functions for interactive exploration of results. The package is efficient, user-friendly, and optimized for large-scale EHR datasets. It supports diverse study designs, customizable analysis parameters, and parallel computing for enhanced performance. Through a case study utilizing UK Biobank data to investigate disease networks associated with short leukocyte telomere length, we demonstrated the capability of DiNetxify to identify meaningful disease clusters and progression patterns consistent with established knowledge while uncovering novel insights. Computationally, the software successfully completed analyses involving cohorts exceeding half a million exposed individuals within 17 h, using moderate computational resources. We thus anticipate that DiNetxify can significantly reduce technical barriers to facilitate broader adoption of advanced disease network analysis techniques by different researchers, thereby enhancing the exploration of EHR data to improve the understanding of holistic health dynamics.</p>","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":" ","pages":"119-126"},"PeriodicalIF":5.9,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146040215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The effect of long-term adherence to physical activity recommendations in midlife on plasma proteins associated with frailty in the Atherosclerosis Risk in Communities (ARIC) study. 社区动脉粥样硬化风险(ARIC)研究中,中年长期坚持体力活动建议对与虚弱相关的血浆蛋白的影响
IF 5.9 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-01 Epub Date: 2025-10-27 DOI: 10.1007/s10654-025-01282-1
Fangyu Liu, Jennifer A Schrack, Keenan A Walker, Jeremy Walston, Rasika A Mathias, Michael E Griswold, Priya Palta, B Gwen Windham, John W Jackson

Clinical trials have shown favorable effects of exercise on frailty, supporting physical activity (PA) as a treatment and prevention strategy. Proteomics studies suggest that PA alters levels of many proteins, some of which may function as molecules in the biological processes underlying frailty. However, these studies have focused on structured exercise programs or cross-sectional PA-protein associations. Therefore, the effects of long-term PA on frailty-associated proteins remain unknown. Among 14,898 middle-aged adults, we emulated a target trial that assigned individuals to either (i) achieve and maintain the recommended PA level (≥ 150 min/week of moderate-to-vigorous physical activity [MVPA]) through 6 (± 0.3) years of follow-up or (ii) follow a "natural course" strategy, where all individuals engage in various amounts of habitual MVPA. We estimated the effects of long-term adherence to recommended MVPA versus the natural course strategy on 45 previously identified frailty-associated proteins at the end of the follow-up using inverse probability of weighting (IPW) and iterative conditional expectations (ICE). We found that long-term adherence to recommended MVPA improved the population levels of many frailty-associated proteins (ranged from 0.04 to 0.11 standard deviation); the greatest benefits were seen in proteins involved in the nervous system (e.g., voltage-dependent calcium channel subunit alpha-2/delta-3 [CACNA2D3], contactin-1 [CNTN1], neural cell adhesion molecule 1 [NCAM1], and transmembrane protein 132D [TMEM132D]) and inflammation (e.g., high-temperature requirement serine protease A1 [HTRA1] and C-reactive protein [CRP]). Our findings suggest improved nervous system and reduced inflammation as the biological basis of long-term engagement in adequate PA as an intervention strategy for frailty.

临床试验表明运动对虚弱有良好的影响,支持体育活动(PA)作为一种治疗和预防策略。蛋白质组学研究表明,PA改变了许多蛋白质的水平,其中一些蛋白质可能在潜在脆弱的生物过程中起分子作用。然而,这些研究都集中在有组织的锻炼计划或横断面pa蛋白关联上。因此,长期PA对脆弱相关蛋白的影响尚不清楚。在14898名中年人中,我们模拟了一项目标试验,将个体分配为(i)在6(±0.3)年的随访中达到并保持推荐的PA水平(≥150分钟/周的中高强度体育活动[MVPA]),或(ii)遵循“自然过程”策略,所有个体都参与不同量的习惯性MVPA。在随访结束时,我们使用加权逆概率(IPW)和迭代条件期望(ICE)估计了长期坚持推荐的MVPA与自然疗程策略对45种先前确定的脆弱相关蛋白的影响。我们发现,长期坚持推荐的MVPA可改善许多与虚弱相关的蛋白质的人群水平(范围为0.04至0.11标准差);最大的益处是涉及神经系统的蛋白质(如电压依赖性钙通道亚基α -2/ δ -3 [CACNA2D3]、接触蛋白-1 [CNTN1]、神经细胞粘附分子1 [NCAM1]和跨膜蛋白132D [TMEM132D])和炎症(如高温要求丝氨酸蛋白酶A1 [HTRA1]和c反应蛋白[CRP])。我们的研究结果表明,神经系统的改善和炎症的减少是长期参与足够的PA作为虚弱干预策略的生物学基础。
{"title":"The effect of long-term adherence to physical activity recommendations in midlife on plasma proteins associated with frailty in the Atherosclerosis Risk in Communities (ARIC) study.","authors":"Fangyu Liu, Jennifer A Schrack, Keenan A Walker, Jeremy Walston, Rasika A Mathias, Michael E Griswold, Priya Palta, B Gwen Windham, John W Jackson","doi":"10.1007/s10654-025-01282-1","DOIUrl":"10.1007/s10654-025-01282-1","url":null,"abstract":"<p><p>Clinical trials have shown favorable effects of exercise on frailty, supporting physical activity (PA) as a treatment and prevention strategy. Proteomics studies suggest that PA alters levels of many proteins, some of which may function as molecules in the biological processes underlying frailty. However, these studies have focused on structured exercise programs or cross-sectional PA-protein associations. Therefore, the effects of long-term PA on frailty-associated proteins remain unknown. Among 14,898 middle-aged adults, we emulated a target trial that assigned individuals to either (i) achieve and maintain the recommended PA level (≥ 150 min/week of moderate-to-vigorous physical activity [MVPA]) through 6 (± 0.3) years of follow-up or (ii) follow a \"natural course\" strategy, where all individuals engage in various amounts of habitual MVPA. We estimated the effects of long-term adherence to recommended MVPA versus the natural course strategy on 45 previously identified frailty-associated proteins at the end of the follow-up using inverse probability of weighting (IPW) and iterative conditional expectations (ICE). We found that long-term adherence to recommended MVPA improved the population levels of many frailty-associated proteins (ranged from 0.04 to 0.11 standard deviation); the greatest benefits were seen in proteins involved in the nervous system (e.g., voltage-dependent calcium channel subunit alpha-2/delta-3 [CACNA2D3], contactin-1 [CNTN1], neural cell adhesion molecule 1 [NCAM1], and transmembrane protein 132D [TMEM132D]) and inflammation (e.g., high-temperature requirement serine protease A1 [HTRA1] and C-reactive protein [CRP]). Our findings suggest improved nervous system and reduced inflammation as the biological basis of long-term engagement in adequate PA as an intervention strategy for frailty.</p>","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":" ","pages":"51-69"},"PeriodicalIF":5.9,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12881102/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145376351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
European Journal of Epidemiology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1