首页 > 最新文献

European Journal of Epidemiology最新文献

英文 中文
Guangdong Biobank Cohort (GDBC) study 广东生物库队列(GDBC)研究
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-13 DOI: 10.1007/s10654-025-01320-y
Yong-Qiao He, Wen-Qiong Xue, Hua Diao, Ji-Yun Zhan, Ming-Fang Ji, Da-Wei Yang, Yi Zhao, Chang-Mi Deng, Zi-Yi Wu, Ting Zhou, Ying Liao, Mei-Qi Zheng, Wen-Li Zhang, Yi-Jing Jia, Lei-Lei Yuan, Lu-Ting Luo, Dan-Hua Li, Tong-Min Wang, Xia-Ting Tong, Yan Du, Ling-Ling Tang, Jing-Wen Huang, Chang-ling Huang, Zhi-Yang Zhao, Yan-Xia Wu, Lian-Jing Cao, Si-Qi Dong, Fang Wang, Cheng-Tao Jiang, Ruo-Wen Xiao, Wen-Bin Zhang, Xue-Yin Chen, Qiao-Ling Wang, Qiao-Yun Liu, Yue-Ze Zhao, Cao-Li Tang, Lin Ma, Xiao-Hui Zheng, Pei-Fen Zhang, Xi-Zhao Li, Shao-Dan Zhang, Ye-Zhu Hu, Xia Yu, Biao-Hua Wu, Fu-Gui Li, Jian-Hua Wu, Bi-Sen Deng, Xue-Jun Liang, Wei-Hua Jia
The global rise of non-communicable diseases (NCDs) presents an urgent public health challenge, particularly in regions undergoing rapid economic and demographic transitions. Guangdong Province, China’s most populous and economically advanced region, is experiencing a substantial and accelerating burden of NCDs. However, large-scale, population-based cohorts from this region remain scarce, limiting insights into region-specific disease determinants and prevention strategies. The Guangdong Biobank Cohort (GDBC) was established in 2017, enrolling 35,081 participants aged 40–84 years from urban and rural areas of Zhongshan City in the Pearl River Delta. At baseline, comprehensive data on 346 variables—including lifestyle, environmental exposures, medical histories, physical examinations, and laboratory profiles—were collected via a cloud-based member management information system (MMIS), alongside blood and saliva samples for biobanking. A sub-cohort underwent genome-wide genotyping ( N = 2,530) and oral microbiome profiling via 16 S rRNA sequencing ( N = 2,049). During dynamic follow-up, 44.2% ( N = 15,499) completed Phase I resurvey with repeated measurements and updated biospecimens. Disease outcomes, including hypertension, diabetes, and cancer, were ascertained through active surveillance and regional registry linkage until December 2023. Baseline prevalence of hypertension, diabetes, and cancer was 25.3%, 8.0%, and 3.6%, respectively. Over follow-up, 1,767 hypertension cases, 814 diabetes cases, and 558 cancers were recorded, yielding crude incidence rates of 1,804.6, 649.7, and 423.1 per 100,000 person-years, respectively. The GDBC provides a comprehensive, dynamically updated resource to dissect gene–microbiome–environment interactions and develop precision prevention strategies to inform public health policies.
全球非传染性疾病的增加对公共卫生提出了紧迫的挑战,特别是在经济和人口快速转型的区域。广东省是中国人口最多、经济最发达的地区,其非传染性疾病负担正日益加重。然而,来自该地区的大规模、基于人群的队列研究仍然很少,这限制了对区域特异性疾病决定因素和预防策略的了解。广东生物库队列(GDBC)于2017年成立,从珠江三角洲中山市的城乡地区招募了35,081名年龄在40-84岁之间的参与者。在基线时,通过基于云的会员管理信息系统(MMIS)收集了346个变量的综合数据,包括生活方式、环境暴露、病史、体检和实验室概况,以及用于生物银行的血液和唾液样本。一个亚队列进行了全基因组基因分型(N = 2530)和通过16s rRNA测序进行口腔微生物组分析(N = 2049)。在动态随访期间,44.2% (N = 15,499)通过重复测量和更新生物标本完成了I期重新调查。疾病结局,包括高血压、糖尿病和癌症,通过主动监测和区域登记联系确定,直到2023年12月。高血压、糖尿病和癌症的基线患病率分别为25.3%、8.0%和3.6%。在随访期间,记录了1767例高血压病例、814例糖尿病病例和558例癌症病例,粗发病率分别为每10万人年1804.6例、649.7例和423.1例。GDBC提供了一个全面的、动态更新的资源,以分析基因-微生物组-环境的相互作用,并制定精确的预防战略,为公共卫生政策提供信息。
{"title":"Guangdong Biobank Cohort (GDBC) study","authors":"Yong-Qiao He, Wen-Qiong Xue, Hua Diao, Ji-Yun Zhan, Ming-Fang Ji, Da-Wei Yang, Yi Zhao, Chang-Mi Deng, Zi-Yi Wu, Ting Zhou, Ying Liao, Mei-Qi Zheng, Wen-Li Zhang, Yi-Jing Jia, Lei-Lei Yuan, Lu-Ting Luo, Dan-Hua Li, Tong-Min Wang, Xia-Ting Tong, Yan Du, Ling-Ling Tang, Jing-Wen Huang, Chang-ling Huang, Zhi-Yang Zhao, Yan-Xia Wu, Lian-Jing Cao, Si-Qi Dong, Fang Wang, Cheng-Tao Jiang, Ruo-Wen Xiao, Wen-Bin Zhang, Xue-Yin Chen, Qiao-Ling Wang, Qiao-Yun Liu, Yue-Ze Zhao, Cao-Li Tang, Lin Ma, Xiao-Hui Zheng, Pei-Fen Zhang, Xi-Zhao Li, Shao-Dan Zhang, Ye-Zhu Hu, Xia Yu, Biao-Hua Wu, Fu-Gui Li, Jian-Hua Wu, Bi-Sen Deng, Xue-Jun Liang, Wei-Hua Jia","doi":"10.1007/s10654-025-01320-y","DOIUrl":"https://doi.org/10.1007/s10654-025-01320-y","url":null,"abstract":"The global rise of non-communicable diseases (NCDs) presents an urgent public health challenge, particularly in regions undergoing rapid economic and demographic transitions. Guangdong Province, China’s most populous and economically advanced region, is experiencing a substantial and accelerating burden of NCDs. However, large-scale, population-based cohorts from this region remain scarce, limiting insights into region-specific disease determinants and prevention strategies. The Guangdong Biobank Cohort (GDBC) was established in 2017, enrolling 35,081 participants aged 40–84 years from urban and rural areas of Zhongshan City in the Pearl River Delta. At baseline, comprehensive data on 346 variables—including lifestyle, environmental exposures, medical histories, physical examinations, and laboratory profiles—were collected via a cloud-based member management information system (MMIS), alongside blood and saliva samples for biobanking. A sub-cohort underwent genome-wide genotyping ( <jats:italic>N</jats:italic> = 2,530) and oral microbiome profiling via 16 S rRNA sequencing ( <jats:italic>N</jats:italic> = 2,049). During dynamic follow-up, 44.2% ( <jats:italic>N</jats:italic> = 15,499) completed Phase I resurvey with repeated measurements and updated biospecimens. Disease outcomes, including hypertension, diabetes, and cancer, were ascertained through active surveillance and regional registry linkage until December 2023. Baseline prevalence of hypertension, diabetes, and cancer was 25.3%, 8.0%, and 3.6%, respectively. Over follow-up, 1,767 hypertension cases, 814 diabetes cases, and 558 cancers were recorded, yielding crude incidence rates of 1,804.6, 649.7, and 423.1 per 100,000 person-years, respectively. The GDBC provides a comprehensive, dynamically updated resource to dissect gene–microbiome–environment interactions and develop precision prevention strategies to inform public health policies.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"1 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145955848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Space-time clustering of childhood high hyperdiploid B-cell precursor acute lymphoblastic leukemia: a nationwide Swedish study. 儿童高二倍体b细胞前体急性淋巴母细胞白血病的时空聚类:一项瑞典全国性研究。
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01323-9
Gleb Bychkov,Niklas Engsner,Benedicte Bang,Mats Marshall Heyman,Gisela Barbany,Anna Skarin Nordenvall,Giorgio Tettamanti,Claes Strannegård,Ann Nordgren
Acute lymphoblastic leukemia (ALL) is the most common childhood malignancy. While space-time clustering of ALL cases has been suggested, only one prior study has examined clustering by genetic subtype. We investigated space-time clustering of childhood ALL in Sweden, both overall and by genetic subtype. The cohort included 1,629 children age 0-18 years diagnosed with ALL between 1992 and 2017, comprising 1,446 B-cell precursor ALL (BCP-ALL) and 183 T-cell ALL (T-ALL) cases. Two BCP-ALL subgroups were analyzed: high hyperdiploidy (HeH, n = 466) and ETV6::RUNX1 (n = 225). The Unbiased Knox Test and Unbiased Combined Knox Test were used to assess space-time clustering at the municipality level, accounting for multiple testing and population shifts. The Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm was applied to identify significant clusters. Logistic regression was used to evaluate demographic differences between clusters, including age, sex, and birth order. Significant space-time clustering was observed in the HeH subgroup for both place and date of birth (p = 0.005) and place and date of diagnosis (p = 0.011), at space-time thresholds of 40 km/18 months and 30 km/24 months, respectively. No clustering was detected in the overall BCP-ALL group, T-ALL group, or the ETV6::RUNX1 subgroup. Space-time clustering at birth and diagnosis was observed in the HeH subgroup, suggesting potential etiologic heterogeneity in BCP-ALL. These findings support further investigation of environmental and infectious exposures across immunophenotypes and genetic subtypes in larger cohorts.
急性淋巴细胞白血病(ALL)是最常见的儿童恶性肿瘤。虽然所有病例的时空聚类已经提出,但只有一项先前的研究检查了遗传亚型的聚类。我们调查了瑞典儿童ALL的时空聚类,包括总体和遗传亚型。该队列包括1629名在1992年至2017年期间被诊断为ALL的0-18岁儿童,包括1446例b细胞前体ALL (BCP-ALL)和183例t细胞ALL (T-ALL)病例。分析两个BCP-ALL亚组:高高二倍体(HeH, n = 466)和ETV6::RUNX1 (n = 225)。采用无偏Knox检验和无偏联合Knox检验在考虑多重检验和人口迁移的情况下,对城市的时空聚类进行了评价。采用基于密度的带噪声应用空间聚类(DBSCAN)算法识别重要聚类。使用逻辑回归来评估聚类之间的人口统计学差异,包括年龄、性别和出生顺序。在时空阈值为40 km/18个月和30 km/24个月时,HeH亚组在出生地点和日期(p = 0.005)和诊断地点和日期(p = 0.011)上均存在显著的时空聚类。在整个BCP-ALL组、T-ALL组或ETV6::RUNX1亚组中未检测到聚类。HeH亚组在出生和诊断时观察到时空聚类,提示BCP-ALL的潜在病因异质性。这些发现支持在更大的队列中进一步调查不同免疫表型和遗传亚型的环境和感染暴露。
{"title":"Space-time clustering of childhood high hyperdiploid B-cell precursor acute lymphoblastic leukemia: a nationwide Swedish study.","authors":"Gleb Bychkov,Niklas Engsner,Benedicte Bang,Mats Marshall Heyman,Gisela Barbany,Anna Skarin Nordenvall,Giorgio Tettamanti,Claes Strannegård,Ann Nordgren","doi":"10.1007/s10654-025-01323-9","DOIUrl":"https://doi.org/10.1007/s10654-025-01323-9","url":null,"abstract":"Acute lymphoblastic leukemia (ALL) is the most common childhood malignancy. While space-time clustering of ALL cases has been suggested, only one prior study has examined clustering by genetic subtype. We investigated space-time clustering of childhood ALL in Sweden, both overall and by genetic subtype. The cohort included 1,629 children age 0-18 years diagnosed with ALL between 1992 and 2017, comprising 1,446 B-cell precursor ALL (BCP-ALL) and 183 T-cell ALL (T-ALL) cases. Two BCP-ALL subgroups were analyzed: high hyperdiploidy (HeH, n = 466) and ETV6::RUNX1 (n = 225). The Unbiased Knox Test and Unbiased Combined Knox Test were used to assess space-time clustering at the municipality level, accounting for multiple testing and population shifts. The Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm was applied to identify significant clusters. Logistic regression was used to evaluate demographic differences between clusters, including age, sex, and birth order. Significant space-time clustering was observed in the HeH subgroup for both place and date of birth (p = 0.005) and place and date of diagnosis (p = 0.011), at space-time thresholds of 40 km/18 months and 30 km/24 months, respectively. No clustering was detected in the overall BCP-ALL group, T-ALL group, or the ETV6::RUNX1 subgroup. Space-time clustering at birth and diagnosis was observed in the HeH subgroup, suggesting potential etiologic heterogeneity in BCP-ALL. These findings support further investigation of environmental and infectious exposures across immunophenotypes and genetic subtypes in larger cohorts.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"48 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Air pollutants, genetic susceptibility, and the risk of age-related macular degeneration: a large prospective cohort study. 空气污染物、遗传易感性和年龄相关性黄斑变性的风险:一项大型前瞻性队列研究。
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01340-8
Shengli Chen,Gongyue Wang,Xin Guan,Chenming Wang,Yang Xiao,Xingdi Li,Shiru Hong,Yuhan Zhou,Yingqian You,Ye Fu,Yuxi Wang,Yichi Zhang,Hui Zhao,Yingchen Zhang,Yang Cheng,Huan Guo,Huatao Xie
Age-related macular degeneration (AMD) is a leading cause of irreversible vision loss worldwide. However, evidence regarding the relationship between air pollution and AMD is limited, and the modifying effect of genetic susceptibility on this association remains unknown. A total of 445,237 participants without AMD at baseline were included from the UK Biobank. The concentrations of nitrogen dioxide (NO2), nitrogen oxides (NOx), particulate matter (PM2.5, PM10, PM2.5-10) were collected by using land-use regression models. Air pollution score (APS) was constructed through summing each pollutant weighted by the regression coefficients with AMD from single-pollutant model. Cox proportional hazard models were used to evaluate hazard rations (HRs) and 95% confidence intervals (95%CIs) of associations between air pollutants and polygenic risk score (PRS) with incident AMD. During a median follow-up of 13.83 years, we observed 9,635 incident AMD events. The HR (95%CI) of incident AMD for each standard deviation increase in NO2, NOx, PM2.5, PM10, and APS were 1.04(1.02, 1.06), 1.03(1.01, 1.05). 1.04(1.02, 1.07), 1.02(1.00, 1.04), and 1.04(1.02, 1.06), respectively. Significant additive interaction effects of NO2, NOx, PM2.5-10, APS and PRS with incident risk of AMD were observed, with the relative excess risk due to the interaction (RERI), attributable proportion (AP), and their 95% CIs of 0.10(0.01, 0.18) and 0.05(0.01, 0.11) for NO2, 0.11(0.01, 0.19) and 0.05(0.02, 0.10) for NOx, 0.15(0.06, 0.23) and 0.08(0.03, 0.13) for PM2.5-10, and 0.12(0.03, 0.20) and 0.06(0.01, 0.11) for APS, respectively. Compared with participants exposed to low level of above air pollutants and low PRS, those exposed to high air pollution and high PRS had almost double incident risk of AMD [HR(95%CI) ranged from 1.83(1.68, 1.99) to 2.03(1.86, 2.21)]. Long-term exposure to air pollutants NO2, NOx, PM2.5, and PM10 showed positive associations with increased risk of AMD, which could be further enhanced by genetic susceptibility.
年龄相关性黄斑变性(AMD)是世界范围内不可逆视力丧失的主要原因。然而,关于空气污染与AMD之间关系的证据有限,遗传易感性对这种关联的修饰作用仍然未知。来自UK Biobank的445,237名基线时无AMD的参与者被纳入研究。利用土地利用回归模型收集了大气中二氧化氮(NO2)、氮氧化物(NOx)、颗粒物(PM2.5、PM10、PM2.5-10)的浓度。通过将回归系数加权的各污染物与单污染物模型的AMD相加,构建空气污染评分(APS)。采用Cox比例风险模型评估空气污染物与多基因风险评分(PRS)与AMD事件之间的危害比(hr)和95%置信区间(95% ci)。在中位随访13.83年期间,我们观察到9635例AMD事件。NO2、NOx、PM2.5、PM10和APS每增加一个标准差,发生AMD的HR (95%CI)分别为1.04(1.02,1.06)、1.03(1.01,1.05)。1.04(1.02, 1.07), 1.02(1.00, 1.04),和1.04(1.02,1.06),分别。NO2、NOx、PM2.5-10、APS和PRS对AMD发生风险具有显著的加性交互作用,其中NO2的相对过量风险(rei)、可归因比例(AP)及其95% ci分别为0.10(0.01、0.18)和0.05(0.01、0.11),NOx的0.11(0.01、0.19)和0.05(0.02、0.10),PM2.5-10的0.15(0.06、0.23)和0.08(0.03、0.13),APS的0.12(0.03、0.20)和0.06(0.01、0.11)。与暴露于低水平以上空气污染物和低PRS环境的参与者相比,暴露于高污染和高PRS环境的参与者AMD的事件风险几乎是两倍[HR(95%CI)范围为1.83(1.68,1.99)至2.03(1.86,2.21)]。长期暴露于空气污染物NO2、NOx、PM2.5和PM10与AMD风险增加呈正相关,遗传易感性可能进一步增强AMD风险。
{"title":"Air pollutants, genetic susceptibility, and the risk of age-related macular degeneration: a large prospective cohort study.","authors":"Shengli Chen,Gongyue Wang,Xin Guan,Chenming Wang,Yang Xiao,Xingdi Li,Shiru Hong,Yuhan Zhou,Yingqian You,Ye Fu,Yuxi Wang,Yichi Zhang,Hui Zhao,Yingchen Zhang,Yang Cheng,Huan Guo,Huatao Xie","doi":"10.1007/s10654-025-01340-8","DOIUrl":"https://doi.org/10.1007/s10654-025-01340-8","url":null,"abstract":"Age-related macular degeneration (AMD) is a leading cause of irreversible vision loss worldwide. However, evidence regarding the relationship between air pollution and AMD is limited, and the modifying effect of genetic susceptibility on this association remains unknown. A total of 445,237 participants without AMD at baseline were included from the UK Biobank. The concentrations of nitrogen dioxide (NO2), nitrogen oxides (NOx), particulate matter (PM2.5, PM10, PM2.5-10) were collected by using land-use regression models. Air pollution score (APS) was constructed through summing each pollutant weighted by the regression coefficients with AMD from single-pollutant model. Cox proportional hazard models were used to evaluate hazard rations (HRs) and 95% confidence intervals (95%CIs) of associations between air pollutants and polygenic risk score (PRS) with incident AMD. During a median follow-up of 13.83 years, we observed 9,635 incident AMD events. The HR (95%CI) of incident AMD for each standard deviation increase in NO2, NOx, PM2.5, PM10, and APS were 1.04(1.02, 1.06), 1.03(1.01, 1.05). 1.04(1.02, 1.07), 1.02(1.00, 1.04), and 1.04(1.02, 1.06), respectively. Significant additive interaction effects of NO2, NOx, PM2.5-10, APS and PRS with incident risk of AMD were observed, with the relative excess risk due to the interaction (RERI), attributable proportion (AP), and their 95% CIs of 0.10(0.01, 0.18) and 0.05(0.01, 0.11) for NO2, 0.11(0.01, 0.19) and 0.05(0.02, 0.10) for NOx, 0.15(0.06, 0.23) and 0.08(0.03, 0.13) for PM2.5-10, and 0.12(0.03, 0.20) and 0.06(0.01, 0.11) for APS, respectively. Compared with participants exposed to low level of above air pollutants and low PRS, those exposed to high air pollution and high PRS had almost double incident risk of AMD [HR(95%CI) ranged from 1.83(1.68, 1.99) to 2.03(1.86, 2.21)]. Long-term exposure to air pollutants NO2, NOx, PM2.5, and PM10 showed positive associations with increased risk of AMD, which could be further enhanced by genetic susceptibility.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"48 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An extension of the validation cohort of the Dutch Early-Stage Melanoma (D-ESMEL) study for stage-specific analyses. 荷兰早期黑色素瘤(D-ESMEL)研究阶段特异性分析验证队列的扩展。
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01337-3
Catherine Zhou,Antien L Mooyaart,Nikita Hulscher,Thamila Kerkour,Jasper Ouwerkerk,Marieke W J Louwman,Marlies Wakkee,Yunlei Li,Quirinus J M Voorham,Annette Bruggink,Tamar E C Nijsten,Loes M Hollestein
There is a high need for accurate prognostic models among stage II melanoma to determine who may benefit from (neo)adjuvant systemic therapy. The Dutch Early- Stage Melanoma (D-ESMEL) study was designed to identify new prognostic features in a population-based sample of stage I/II melanoma patients in addition to American Joint Committee of Cancer (AJCC) staging. The validation cohort of the D-ESMEL study employs a nested case-control design. Initially, controls were randomly sampled to develop prognostic that included both known and new prognostic factors to assess the additive value of new prognostic factors. As a consequence, most controls had a very thin melanoma (<1.0 mm) while most cases had a thicker melanoma (>2.0 mm). This resulted in insufficient variability and high weights for stage II controls when applying weighted analyses in absolute risk prediction models. Therefore, randomly sampled controls were re-matched on AJCC stage (stage IA, IB, IIA, IIB, IIC), and new stage-matched controls were collected for cases who could not be rematched. The original D-ESMEL validation cohort included 5,815 stage I/II melanoma patients, of whom 154 developed distant metastasis (cases). 98/154 Cases were stage II and only 24 stage II controls were included, while the stage-matched design now includes 153 stage-matched case-control sets of which 97 stage II cases and 97 stage II controls derived from a population-based cohort of 5,785 stage I/II patients. The updated design increased the biological variability among stage II controls, balanced weights in weighted analyses and thereby facilitating reliable subgroup analyses in this clinically important subgroup.
在II期黑色素瘤中,需要精确的预后模型来确定谁可能从(新)辅助全身治疗中受益。荷兰早期黑色素瘤(D-ESMEL)研究旨在确定除美国癌症联合委员会(AJCC)分期外,基于人群的I/II期黑色素瘤患者样本中的新预后特征。D-ESMEL研究的验证队列采用嵌套病例对照设计。最初,随机抽取对照组,制定包括已知和新的预后因素的预后评估,以评估新预后因素的附加价值。结果,大多数对照组的黑色素瘤很薄(2.0毫米)。当在绝对风险预测模型中应用加权分析时,这导致II期控制的变异性不足和权重过高。因此,在AJCC分期(IA期、IB期、IIA期、IIB期、IIC期)随机抽样对照进行重新匹配,无法重新匹配的病例收集新的分期匹配对照。最初的D-ESMEL验证队列包括5815例I/II期黑色素瘤患者,其中154例发生远处转移。98/154例为II期,仅包括24例II期对照,而阶段匹配设计现在包括153个阶段匹配的病例对照组,其中97例II期病例和97例II期对照来自5,785例I/II期患者的基于人群的队列。更新的设计增加了II期对照的生物学变异性,平衡了加权分析中的权重,从而促进了这一临床重要亚组的可靠亚组分析。
{"title":"An extension of the validation cohort of the Dutch Early-Stage Melanoma (D-ESMEL) study for stage-specific analyses.","authors":"Catherine Zhou,Antien L Mooyaart,Nikita Hulscher,Thamila Kerkour,Jasper Ouwerkerk,Marieke W J Louwman,Marlies Wakkee,Yunlei Li,Quirinus J M Voorham,Annette Bruggink,Tamar E C Nijsten,Loes M Hollestein","doi":"10.1007/s10654-025-01337-3","DOIUrl":"https://doi.org/10.1007/s10654-025-01337-3","url":null,"abstract":"There is a high need for accurate prognostic models among stage II melanoma to determine who may benefit from (neo)adjuvant systemic therapy. The Dutch Early- Stage Melanoma (D-ESMEL) study was designed to identify new prognostic features in a population-based sample of stage I/II melanoma patients in addition to American Joint Committee of Cancer (AJCC) staging. The validation cohort of the D-ESMEL study employs a nested case-control design. Initially, controls were randomly sampled to develop prognostic that included both known and new prognostic factors to assess the additive value of new prognostic factors. As a consequence, most controls had a very thin melanoma (<1.0 mm) while most cases had a thicker melanoma (>2.0 mm). This resulted in insufficient variability and high weights for stage II controls when applying weighted analyses in absolute risk prediction models. Therefore, randomly sampled controls were re-matched on AJCC stage (stage IA, IB, IIA, IIB, IIC), and new stage-matched controls were collected for cases who could not be rematched. The original D-ESMEL validation cohort included 5,815 stage I/II melanoma patients, of whom 154 developed distant metastasis (cases). 98/154 Cases were stage II and only 24 stage II controls were included, while the stage-matched design now includes 153 stage-matched case-control sets of which 97 stage II cases and 97 stage II controls derived from a population-based cohort of 5,785 stage I/II patients. The updated design increased the biological variability among stage II controls, balanced weights in weighted analyses and thereby facilitating reliable subgroup analyses in this clinically important subgroup.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"94 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quality of cancer-related data from the Danish National patient registry (1994-2025) and the Danish cancer registry (2004-2025): a systematic review. 丹麦国家患者登记处(1994-2025)和丹麦癌症登记处(2004-2025)癌症相关数据的质量:一项系统综述。
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01336-4
Amalie Helme Simoni,Kathrine Hald,Thure Filskov Overvad,Mette Søgaard,Anne Gulbech Ording
The Danish National Patient Registry (DNPR) and the Danish Cancer Registry (DCR) are central to registry-based cancer research. This systematic review evaluates studies assessing the quality of cancer-related data in these registries under their current data structures. PubMed and Embase were systematically searched on January 24, 2025 (PROSPERO: CRD420251005952). Studies validating cancer-related data in the DNPR or DCR against a gold standard were included. Findings were synthesized narratively and categorized by DNPR data, DCR data, or multi-source algorithms. The literature search generated 915 records, of which 50 were included: 23 validated DNPR data, 9 DCR data, and 18 algorithm performance. The quality of DNPR cancer diagnoses and treatment showed positive predictive values (PPVs) of 57-100%, highest for common malignancies and treatments. The quality of DNPR comorbidities and complications varied substantially (PPVs 0-98%). The PPV of a melanoma diagnosis in the DCR was 97%. The DCR staging completeness varied considerably (34-95%). Algorithms presented PPVs of 60-96% for recurrence, active cancer, and recognized metastases, and 28% for unrecognized metastases. The DNPR and DCR provide high-quality data for many cancer diagnoses, treatments, and outcomes, supporting their use in register-based research. While some data elements, including data on complications, exhibit lower quality, algorithmic approaches can enhance utility for less robust data. However, several aspects of cancer-related data remain unvalidated.
丹麦国家患者登记处(DNPR)和丹麦癌症登记处(DCR)是基于登记处的癌症研究的中心。本系统综述评估了在当前数据结构下评估这些登记处癌症相关数据质量的研究。PubMed和Embase于2025年1月24日被系统检索(PROSPERO: CRD420251005952)。研究证实了DNPR或DCR中与癌症相关的数据是否符合金标准。通过DNPR数据、DCR数据或多源算法对结果进行叙述性综合和分类。文献检索共产生915条记录,其中50条被纳入:23条验证DNPR数据,9条DCR数据,18条算法性能。DNPR肿瘤诊断和治疗质量的阳性预测值(ppv)为57-100%,常见恶性肿瘤和治疗的阳性预测值最高。DNPR合并症和并发症的质量差异很大(PPVs 0-98%)。DCR诊断黑色素瘤的PPV为97%。DCR分期完成度差异较大(34-95%)。算法显示,复发、活动性癌症和已识别转移的ppv为60-96%,未识别转移的ppv为28%。DNPR和DCR为许多癌症诊断、治疗和结果提供了高质量的数据,支持它们在基于登记的研究中的使用。虽然一些数据元素(包括有关复杂性的数据)表现出较低的质量,但算法方法可以增强对不那么健壮的数据的效用。然而,癌症相关数据的几个方面仍未得到证实。
{"title":"Quality of cancer-related data from the Danish National patient registry (1994-2025) and the Danish cancer registry (2004-2025): a systematic review.","authors":"Amalie Helme Simoni,Kathrine Hald,Thure Filskov Overvad,Mette Søgaard,Anne Gulbech Ording","doi":"10.1007/s10654-025-01336-4","DOIUrl":"https://doi.org/10.1007/s10654-025-01336-4","url":null,"abstract":"The Danish National Patient Registry (DNPR) and the Danish Cancer Registry (DCR) are central to registry-based cancer research. This systematic review evaluates studies assessing the quality of cancer-related data in these registries under their current data structures. PubMed and Embase were systematically searched on January 24, 2025 (PROSPERO: CRD420251005952). Studies validating cancer-related data in the DNPR or DCR against a gold standard were included. Findings were synthesized narratively and categorized by DNPR data, DCR data, or multi-source algorithms. The literature search generated 915 records, of which 50 were included: 23 validated DNPR data, 9 DCR data, and 18 algorithm performance. The quality of DNPR cancer diagnoses and treatment showed positive predictive values (PPVs) of 57-100%, highest for common malignancies and treatments. The quality of DNPR comorbidities and complications varied substantially (PPVs 0-98%). The PPV of a melanoma diagnosis in the DCR was 97%. The DCR staging completeness varied considerably (34-95%). Algorithms presented PPVs of 60-96% for recurrence, active cancer, and recognized metastases, and 28% for unrecognized metastases. The DNPR and DCR provide high-quality data for many cancer diagnoses, treatments, and outcomes, supporting their use in register-based research. While some data elements, including data on complications, exhibit lower quality, algorithmic approaches can enhance utility for less robust data. However, several aspects of cancer-related data remain unvalidated.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"82 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Why do women live longer than men, but spend more time in poor health? A decomposition analysis of the gender gap in unhealthy life years across Europe. 为什么女性寿命比男性长,但健康状况不佳的时间却更多?欧洲不健康寿命年数性别差异的分解分析。
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01346-2
Magdalena Muszynska-Spielauer,Paola Di Giulio,Yuka Minagawa,Vanessa Di Lego,Marc Luy
This study tests the "longevity hypothesis," which posits that women's greater number of years spent in poor health is primarily a direct consequence of their longer survival. We analyse gender differences in unhealthy life years (ULY) at age 50 across 22 European countries in 2015-2017. ULY was estimated using three approaches-the Sullivan method, the cross-sectional average length of healthy life, and multistate life tables-applied to four health indicators of varying severity: chronic diseases, functional limitations, self-rated health, and disability. Data were drawn from the Human Mortality Database and the Survey of Health, Ageing and Retirement in Europe. We decomposed the gender gap in ULY into a "mortality effect" (ME), reflecting differences in life years lived, and a "health effect" (HE), reflecting differences in morbidity prevalence. Women at age 50 lived more unhealthy years than men across almost all health indicators and countries. In most cases, more than half of the gender gap in ULY was attributable to the ME, indicating that women's longer survival primarily explains their greater number of years spent in poor health. The HE showed greater variation across indicators and countries. Results were most consistent for chronic diseases and self-rated health, while functional limitations and disability yielded smaller and less consistent differences. Findings support the longevity hypothesis: women's higher life expectancy is the main driver of their longer lifetime spent in poor health. The variation across health dimensions highlights the importance of distinguishing between them when studying gender inequalities in health.
这项研究检验了“长寿假说”,该假说认为,女性健康状况不佳的时间更长,主要是她们寿命更长的直接后果。我们分析了2015-2017年22个欧洲国家50岁人群不健康寿命年(ULY)的性别差异。ULY是用三种方法来估计的——沙利文方法、健康寿命的横截面平均长度和多状态生命表——应用于四种不同严重程度的健康指标:慢性病、功能限制、自评健康和残疾。数据来自人类死亡率数据库和欧洲健康、老龄化和退休调查。我们将ULY中的性别差异分解为反映寿命年数差异的“死亡率效应”(ME)和反映发病率差异的“健康效应”(HE)。在几乎所有健康指标和国家中,50岁妇女的不健康寿命都比男子长。在大多数情况下,ULY中超过一半的性别差距可归因于ME,这表明妇女的生存时间较长主要解释了她们健康状况不佳的时间较长。高等教育在各个指标和国家之间显示出更大的差异。慢性疾病和自评健康的结果最为一致,而功能限制和残疾产生的差异较小且不太一致。研究结果支持长寿假说:女性的预期寿命较长是她们健康状况不佳的主要原因。健康各方面的差异突出了在研究健康方面的性别不平等时区分这些方面的重要性。
{"title":"Why do women live longer than men, but spend more time in poor health? A decomposition analysis of the gender gap in unhealthy life years across Europe.","authors":"Magdalena Muszynska-Spielauer,Paola Di Giulio,Yuka Minagawa,Vanessa Di Lego,Marc Luy","doi":"10.1007/s10654-025-01346-2","DOIUrl":"https://doi.org/10.1007/s10654-025-01346-2","url":null,"abstract":"This study tests the \"longevity hypothesis,\" which posits that women's greater number of years spent in poor health is primarily a direct consequence of their longer survival. We analyse gender differences in unhealthy life years (ULY) at age 50 across 22 European countries in 2015-2017. ULY was estimated using three approaches-the Sullivan method, the cross-sectional average length of healthy life, and multistate life tables-applied to four health indicators of varying severity: chronic diseases, functional limitations, self-rated health, and disability. Data were drawn from the Human Mortality Database and the Survey of Health, Ageing and Retirement in Europe. We decomposed the gender gap in ULY into a \"mortality effect\" (ME), reflecting differences in life years lived, and a \"health effect\" (HE), reflecting differences in morbidity prevalence. Women at age 50 lived more unhealthy years than men across almost all health indicators and countries. In most cases, more than half of the gender gap in ULY was attributable to the ME, indicating that women's longer survival primarily explains their greater number of years spent in poor health. The HE showed greater variation across indicators and countries. Results were most consistent for chronic diseases and self-rated health, while functional limitations and disability yielded smaller and less consistent differences. Findings support the longevity hypothesis: women's higher life expectancy is the main driver of their longer lifetime spent in poor health. The variation across health dimensions highlights the importance of distinguishing between them when studying gender inequalities in health.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"30 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Risk of postpartum depression among women with endometriosis: the Norwegian mother, father and child cohort study (MoBa). 子宫内膜异位症妇女产后抑郁的风险:挪威母亲、父亲和儿童队列研究(MoBa)
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01338-2
Marius Johansen,Tone Kristin Omsland,Katariina Laine,Siri Eldevik Håberg,Maria Christine Magnus
Women with endometriosis have a higher burden of anxiety and depression. Whether they are at increased risk of postpartum depression (PPD) remains unclear. We aimed to compare the risk of PPD between women with and without endometriosis and to explore mediation by previous history of major depression and infertility. In a population-based cohort study, we compared 1,159 singleton pregnancies to women with self-reported endometriosis and 74,590 pregnancies to women without endometriosis. We calculated a djusted risk ratios (aRR) with 95% confidence intervals (CI) using multivariable log-binomial regression, adjusting for age, body mass index, education and income. Mediation analyses assessed the indirect effect of any history of major depression or infertility. Women with endometriosis had a higher risk of PPD (aRR: 1.34, 95% CI: 1.15-1.55). Mediation analyses indicated that a large part of this association was explained by a higher lifetime prevalence of major depression among women with endometriosis (natural direct effect of endometriosis: aRR: 1.17, 95% CI: 1.00-1.36; natural indirect effect of any history of major depression: aRR: 1.14, 95% CI: 1.08-1.20), with 49.3% proportion mediated. Infertility demonstrated a negative natural indirect effect on the association between endometriosis and PPD (aRR: 0.87, 95% CI: 0.81-0.94). Women with endometriosis had an elevated risk of PPD which was largely explained by a higher lifetime prevalence of major depression. Our findings suggest that they constitute a high-risk group and could benefit from closer follow-up to facilitate early identification and intervention.
患有子宫内膜异位症的女性有更高的焦虑和抑郁负担。她们是否会增加产后抑郁症(PPD)的风险尚不清楚。我们的目的是比较有子宫内膜异位症和无子宫内膜异位症的女性发生PPD的风险,并探讨重度抑郁症和不孕症的既往史。在一项基于人群的队列研究中,我们比较了1159名单胎妊娠与自我报告子宫内膜异位症的妇女,以及74590名妊娠与无子宫内膜异位症的妇女。我们使用多变量对数二项回归计算了校正风险比(aRR), 95%置信区间(CI),调整了年龄、体重指数、教育程度和收入。中介分析评估了任何重度抑郁或不孕史的间接影响。患有子宫内膜异位症的女性患PPD的风险更高(aRR: 1.34, 95% CI: 1.15-1.55)。中介分析表明,这种关联在很大程度上可以解释为子宫内膜异位症女性一生中重度抑郁症的患病率较高(子宫内膜异位症的自然直接影响:aRR: 1.17, 95% CI: 1.00-1.36;任何重度抑郁症史的自然间接影响:aRR: 1.14, 95% CI: 1.08-1.20),其中有49.3%的中介比例。不孕症对子宫内膜异位症与PPD之间的关系表现出负的自然间接影响(aRR: 0.87, 95% CI: 0.81-0.94)。患有子宫内膜异位症的女性患产后抑郁症的风险较高,这在很大程度上可以解释为重度抑郁症的终生患病率较高。我们的研究结果表明,他们构成了一个高风险群体,可以从更密切的随访中受益,以促进早期识别和干预。
{"title":"Risk of postpartum depression among women with endometriosis: the Norwegian mother, father and child cohort study (MoBa).","authors":"Marius Johansen,Tone Kristin Omsland,Katariina Laine,Siri Eldevik Håberg,Maria Christine Magnus","doi":"10.1007/s10654-025-01338-2","DOIUrl":"https://doi.org/10.1007/s10654-025-01338-2","url":null,"abstract":"Women with endometriosis have a higher burden of anxiety and depression. Whether they are at increased risk of postpartum depression (PPD) remains unclear. We aimed to compare the risk of PPD between women with and without endometriosis and to explore mediation by previous history of major depression and infertility. In a population-based cohort study, we compared 1,159 singleton pregnancies to women with self-reported endometriosis and 74,590 pregnancies to women without endometriosis. We calculated a djusted risk ratios (aRR) with 95% confidence intervals (CI) using multivariable log-binomial regression, adjusting for age, body mass index, education and income. Mediation analyses assessed the indirect effect of any history of major depression or infertility. Women with endometriosis had a higher risk of PPD (aRR: 1.34, 95% CI: 1.15-1.55). Mediation analyses indicated that a large part of this association was explained by a higher lifetime prevalence of major depression among women with endometriosis (natural direct effect of endometriosis: aRR: 1.17, 95% CI: 1.00-1.36; natural indirect effect of any history of major depression: aRR: 1.14, 95% CI: 1.08-1.20), with 49.3% proportion mediated. Infertility demonstrated a negative natural indirect effect on the association between endometriosis and PPD (aRR: 0.87, 95% CI: 0.81-0.94). Women with endometriosis had an elevated risk of PPD which was largely explained by a higher lifetime prevalence of major depression. Our findings suggest that they constitute a high-risk group and could benefit from closer follow-up to facilitate early identification and intervention.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"24 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning versus logistic regression for propensity score estimation: a trial emulation benchmarked against the PARADIGM-HF randomized trial. 倾向评分估计的机器学习与逻辑回归:以PARADIGM-HF随机试验为基准的试验模拟。
IF 5.9 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01341-7
Kaicheng Wang, Lindsey Rosman, Haidong Lu

Machine learning (ML) algorithms are increasingly used to estimate propensity score with expectation of improving causal inference. However, the validity of data-driven ML-based approaches for confounder selection and adjustment remains unclear. In this study, we emulated the device-stratified secondary analysis of the PARADIGM-HF trial among U.S. veterans with heart failure and implanted cardiac devices from 2016 to 2020. We benchmarked observational estimates from three propensity score approaches against the trial results. (1) logistic regression with pre-specified confounders (2), generalized boosted models (GBM) using the same pre-specified confounders, and (3) GBM with expanded covariates and automated feature selection. Logistic regression-based propensity score approach yielded estimates closest to the trial (HR = 0.93, 95% CI 0.61-1.42; 23-month RR = 0.86, 95% CI 0.57-1.24 vs. trial HR = 0.81, 95% CI 0.61-1.06). Despite better predictive performance, GBM with pre-specified confounders showed no improvement over the logistic regression approach (HR = 0.97, 95% CI 0.68-1.37; RR = 0.96, 95% CI 0.89-1.98). Moreover, GBM with expanded covariates and data-driven automated feature selection substantially increased bias (HR = 0.61, 95% CI 0.30-1.23; RR = 0.69, 95% CI 0.36-1.04). Our findings suggest that ML-based propensity score methods do not inherently improve causal estimation possibly due to residual confounding from omitted or partially adjusted variables and may introduce overadjustment bias when combined with automated feature selection. These results underscore the importance of careful confounder specification and causal reasoning over algorithmic complexity in causal inference.

机器学习(ML)算法越来越多地用于估计倾向得分,期望改善因果推理。然而,数据驱动的基于ml的混杂选择和调整方法的有效性仍然不清楚。在这项研究中,我们模拟了2016年至2020年美国退伍军人心力衰竭和植入心脏装置的PARADIGM-HF试验的器械分层二次分析。我们将三种倾向评分方法的观察性估计与试验结果进行基准比较。(1)预先指定混杂因素的逻辑回归(2),使用相同预先指定混杂因素的广义增强模型(GBM),以及(3)扩展协变量和自动特征选择的GBM。基于Logistic回归的倾向评分方法得出的估计值与试验最接近(HR = 0.93, 95% CI 0.61-1.42; 23个月的RR = 0.86, 95% CI 0.57-1.24,而试验HR = 0.81, 95% CI 0.61-1.06)。尽管具有更好的预测性能,但与逻辑回归方法相比,预先指定混杂因素的GBM没有改善(HR = 0.97, 95% CI 0.68-1.37; RR = 0.96, 95% CI 0.89-1.98)。此外,扩展协变量的GBM和数据驱动的自动特征选择大大增加了偏差(HR = 0.61, 95% CI 0.30-1.23; RR = 0.69, 95% CI 0.36-1.04)。我们的研究结果表明,基于机器学习的倾向评分方法并不能从本质上改善因果估计,这可能是由于遗漏或部分调整变量的残留混淆,并且当与自动特征选择相结合时可能会引入过度调整偏差。这些结果强调了在因果推理中,谨慎的混杂规范和因果推理在算法复杂性上的重要性。
{"title":"Machine learning versus logistic regression for propensity score estimation: a trial emulation benchmarked against the PARADIGM-HF randomized trial.","authors":"Kaicheng Wang, Lindsey Rosman, Haidong Lu","doi":"10.1007/s10654-025-01341-7","DOIUrl":"https://doi.org/10.1007/s10654-025-01341-7","url":null,"abstract":"<p><p>Machine learning (ML) algorithms are increasingly used to estimate propensity score with expectation of improving causal inference. However, the validity of data-driven ML-based approaches for confounder selection and adjustment remains unclear. In this study, we emulated the device-stratified secondary analysis of the PARADIGM-HF trial among U.S. veterans with heart failure and implanted cardiac devices from 2016 to 2020. We benchmarked observational estimates from three propensity score approaches against the trial results. (1) logistic regression with pre-specified confounders (2), generalized boosted models (GBM) using the same pre-specified confounders, and (3) GBM with expanded covariates and automated feature selection. Logistic regression-based propensity score approach yielded estimates closest to the trial (HR = 0.93, 95% CI 0.61-1.42; 23-month RR = 0.86, 95% CI 0.57-1.24 vs. trial HR = 0.81, 95% CI 0.61-1.06). Despite better predictive performance, GBM with pre-specified confounders showed no improvement over the logistic regression approach (HR = 0.97, 95% CI 0.68-1.37; RR = 0.96, 95% CI 0.89-1.98). Moreover, GBM with expanded covariates and data-driven automated feature selection substantially increased bias (HR = 0.61, 95% CI 0.30-1.23; RR = 0.69, 95% CI 0.36-1.04). Our findings suggest that ML-based propensity score methods do not inherently improve causal estimation possibly due to residual confounding from omitted or partially adjusted variables and may introduce overadjustment bias when combined with automated feature selection. These results underscore the importance of careful confounder specification and causal reasoning over algorithmic complexity in causal inference.</p>","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":" ","pages":""},"PeriodicalIF":5.9,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145951526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integration of latent factor analysis into multivariable Mendelian randomization. 将潜在因素分析整合到多变量孟德尔随机化中。
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01324-8
Yuankai Zhang,Roby Joehanes,Tianxiao Huan,Lukas M Weber,Qiong Yang,Kathryn L Lunetta,Daniel Levy,Chunyu Liu
Mendelian randomization has emerged as a powerful tool for exploring causal relationships in observational studies by using genetic variants as instrumental variables. While multivariable Mendelian randomization extends this approach to simultaneously address multiple exposures, it faces significant challenges with highly correlated exposures, particularly in high-dimensional settings such as multi-omics data. Conventional MVMR methods, which are primarily based on linear regression models, may suffer from multicollinearity and reduced statistical power when analyzing correlated exposures. The increasing availability of high-dimensional multi-omics data has highlighted the limitations of conventional MVMR approaches in analyzing correlated exposures while maintaining biological interpretability. To address these challenges, we propose integrating latent factor analysis into the MVMR framework, enabling dimension reduction without compromising interpretability. Through extensive simulation studies, we demonstrate that our method maintains a well-controlled false positive rate and offers superior sensitivity compared to conventional MVMR approaches. We apply our method to investigate the causal relationship between DNA methylation and mitochondrial DNA copy number. Our method offers a significant advantage in scenarios with highly correlated exposures driven by common latent factors or shared pathways, especially when individual effects are sparse. By applying our method to correlated multi-omics data, we can uncover new insights into the molecular mechanisms underlying complex phenotypes.
孟德尔随机化已经成为一种强大的工具,通过使用遗传变异作为工具变量来探索观察性研究中的因果关系。虽然多变量孟德尔随机化将这种方法扩展到同时处理多个暴露,但它面临着高度相关暴露的重大挑战,特别是在高维环境中,如多组学数据。传统的MVMR方法主要基于线性回归模型,在分析相关暴露时可能存在多重共线性和统计能力降低的问题。高维多组学数据的不断增加凸显了传统MVMR方法在分析相关暴露同时保持生物学可解释性方面的局限性。为了应对这些挑战,我们建议将潜在因素分析整合到MVMR框架中,在不影响可解释性的情况下实现降维。通过广泛的仿真研究,我们证明,与传统的MVMR方法相比,我们的方法保持了良好的假阳性率控制,并提供了更高的灵敏度。我们应用我们的方法来研究DNA甲基化和线粒体DNA拷贝数之间的因果关系。我们的方法在由共同潜在因素或共享途径驱动的高度相关暴露的情况下提供了显著的优势,特别是当个体影响稀疏时。通过将我们的方法应用于相关的多组学数据,我们可以揭示复杂表型背后的分子机制的新见解。
{"title":"Integration of latent factor analysis into multivariable Mendelian randomization.","authors":"Yuankai Zhang,Roby Joehanes,Tianxiao Huan,Lukas M Weber,Qiong Yang,Kathryn L Lunetta,Daniel Levy,Chunyu Liu","doi":"10.1007/s10654-025-01324-8","DOIUrl":"https://doi.org/10.1007/s10654-025-01324-8","url":null,"abstract":"Mendelian randomization has emerged as a powerful tool for exploring causal relationships in observational studies by using genetic variants as instrumental variables. While multivariable Mendelian randomization extends this approach to simultaneously address multiple exposures, it faces significant challenges with highly correlated exposures, particularly in high-dimensional settings such as multi-omics data. Conventional MVMR methods, which are primarily based on linear regression models, may suffer from multicollinearity and reduced statistical power when analyzing correlated exposures. The increasing availability of high-dimensional multi-omics data has highlighted the limitations of conventional MVMR approaches in analyzing correlated exposures while maintaining biological interpretability. To address these challenges, we propose integrating latent factor analysis into the MVMR framework, enabling dimension reduction without compromising interpretability. Through extensive simulation studies, we demonstrate that our method maintains a well-controlled false positive rate and offers superior sensitivity compared to conventional MVMR approaches. We apply our method to investigate the causal relationship between DNA methylation and mitochondrial DNA copy number. Our method offers a significant advantage in scenarios with highly correlated exposures driven by common latent factors or shared pathways, especially when individual effects are sparse. By applying our method to correlated multi-omics data, we can uncover new insights into the molecular mechanisms underlying complex phenotypes.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"39 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Maternal hormonal contraception use and childhood cancer risk: a systematic review and meta-analysis. 母亲使用激素避孕与儿童癌症风险:系统回顾和荟萃分析。
IF 13.6 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Pub Date : 2026-01-12 DOI: 10.1007/s10654-025-01335-5
Sif E Carlsen,Emily Jarden,Caroline H Hemmingsen,Lone Schmidt,Sarah Hjorth,Maarit K Leinonen,Ulrika Nörby,Lina S Mørch,Susanne K Kjaer,Hedvig Nordeng,Marie Hargreave
Observational studies have linked maternal hormonal contraception use to childhood cancer risk, but findings are inconsistent. A systematic review was conducted of this potential relationship. A systematic search was performed in PubMed, Embase, Scopus, Cochrane, and Web of Science databases until April 9, 2025. Studies reporting maternal hormonal contraception use before or during pregnancy and childhood cancer risk (0-19 years) were eligible. We included studies providing risk estimates in English or Scandinavian languages. Newcastle-Ottawa Scale was used to assess study quality. Meta-analysis using fixed and random effects was used to pool relative risks (RRs) with 95% confidence intervals (CIs) for childhood cancer according to maternal hormonal contraception use (1) up to or during pregnancy, and (2) exclusively during pregnancy. We included 27 studies (24 case-control and 3 cohort), totaling 11,067 childhood cancer cases. Maternal hormonal contraception use up to and during pregnancy increased risk of any childhood cancer (RR = 1.18; 95% CI = 1.10-1.26), leukemia (RR = 1.24; 95% CI = 1.06-1.45), and lymphoid leukemia (RR = 1.17; 95% CI = 1.06-1.28). Exposures during pregnancy showed higher risk estimate for any cancer (RR = 1.32; 95% CI = 1.12-1.56) and leukemia (RR = 1.63; 95% CI = 1.07-2.49). Most studies were moderate (70%) or high (26%) quality. Maternal hormonal contraception use may increase childhood cancer risk, particularly for leukemia, and during pregnancy. Further prospective studies are needed, focusing on specific hormonal contraception substances and exposure timing.
观察性研究将母亲使用激素避孕与儿童癌症风险联系起来,但研究结果并不一致。对这种潜在的关系进行了系统的回顾。系统检索PubMed, Embase, Scopus, Cochrane和Web of Science数据库,直到2025年4月9日。报告孕妇在怀孕前或怀孕期间使用激素避孕和儿童癌症风险(0-19岁)的研究符合条件。我们纳入了用英语或斯堪的纳维亚语言提供风险评估的研究。采用纽卡斯尔-渥太华量表评估研究质量。采用固定效应和随机效应的荟萃分析,根据(1)怀孕前或怀孕期间以及(2)仅在怀孕期间使用激素避孕,汇总儿童癌症的相对危险度(rr), 95%置信区间(CIs)。我们纳入了27项研究(24项病例对照和3项队列研究),共计11067例儿童癌症病例。孕妇在怀孕前后和怀孕期间使用激素避孕药会增加任何儿童癌症(RR = 1.18; 95% CI = 1.10-1.26)、白血病(RR = 1.24; 95% CI = 1.06-1.45)和淋巴细胞白血病(RR = 1.17; 95% CI = 1.06-1.28)的风险。怀孕期间暴露在暴露环境中,患任何癌症(RR = 1.32; 95% CI = 1.12-1.56)和白血病(RR = 1.63; 95% CI = 1.07-2.49)的风险都较高。大多数研究为中等(70%)或高(26%)质量。孕妇使用激素避孕可能会增加儿童患癌症的风险,尤其是白血病和怀孕期间。需要进一步的前瞻性研究,重点是具体的激素避孕物质和暴露时间。
{"title":"Maternal hormonal contraception use and childhood cancer risk: a systematic review and meta-analysis.","authors":"Sif E Carlsen,Emily Jarden,Caroline H Hemmingsen,Lone Schmidt,Sarah Hjorth,Maarit K Leinonen,Ulrika Nörby,Lina S Mørch,Susanne K Kjaer,Hedvig Nordeng,Marie Hargreave","doi":"10.1007/s10654-025-01335-5","DOIUrl":"https://doi.org/10.1007/s10654-025-01335-5","url":null,"abstract":"Observational studies have linked maternal hormonal contraception use to childhood cancer risk, but findings are inconsistent. A systematic review was conducted of this potential relationship. A systematic search was performed in PubMed, Embase, Scopus, Cochrane, and Web of Science databases until April 9, 2025. Studies reporting maternal hormonal contraception use before or during pregnancy and childhood cancer risk (0-19 years) were eligible. We included studies providing risk estimates in English or Scandinavian languages. Newcastle-Ottawa Scale was used to assess study quality. Meta-analysis using fixed and random effects was used to pool relative risks (RRs) with 95% confidence intervals (CIs) for childhood cancer according to maternal hormonal contraception use (1) up to or during pregnancy, and (2) exclusively during pregnancy. We included 27 studies (24 case-control and 3 cohort), totaling 11,067 childhood cancer cases. Maternal hormonal contraception use up to and during pregnancy increased risk of any childhood cancer (RR = 1.18; 95% CI = 1.10-1.26), leukemia (RR = 1.24; 95% CI = 1.06-1.45), and lymphoid leukemia (RR = 1.17; 95% CI = 1.06-1.28). Exposures during pregnancy showed higher risk estimate for any cancer (RR = 1.32; 95% CI = 1.12-1.56) and leukemia (RR = 1.63; 95% CI = 1.07-2.49). Most studies were moderate (70%) or high (26%) quality. Maternal hormonal contraception use may increase childhood cancer risk, particularly for leukemia, and during pregnancy. Further prospective studies are needed, focusing on specific hormonal contraception substances and exposure timing.","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":"255 1","pages":""},"PeriodicalIF":13.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
European Journal of Epidemiology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1