Genetic Epidemiology最新文献_第3页

Genome-Wide Association Analyses in Family Triads and Dyads Following Assisted Reproductive Technology 辅助生殖技术后家庭三联体和双联体的全基因组关联分析

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2025-06-02 DOI: 10.1002/gepi.70011

Siri N. Skodvin, Håkon K. Gjessing, Astanand Jugessur, Julia Romanowska, Alexandra Havdahl, Siri E. Håberg, Hans Ivar Hanevik, Robert Lyle, Rolv Terje Lie, Miriam Gjerdevik

Genetic selection occurs at different stages before a successful birth. The genetic makeup of a couple may influence the likelihood of needing assisted reproductive technology (ART) to achieve conception. However, frequent early fetal losses may also be perceived as reduced couple fertility and may thus be a contributing factor to the need for ART treatment. As ART procedures may enhance early fetal survival, genes that impact fetal viability may have a different allele distribution in ART offspring than expected under Mendelian transmission, as well as compared with the general population. With genetic data available from the Norwegian Mother, Father, and Child Cohort Study, we defined fetal survival as the study outcome and analyzed 1336 case-parent triads and dyads where the offspring were conceived by ART. Using log-linear models implemented in the R package Haplin, we conducted genome-wide scans to estimate fetal, maternal, and parent-of-origin effects and provided a detailed discussion on how these effects are estimated and interpreted. We detected fetal effects for single-nucleotide polymorphisms (SNPs) located in CXXC4-AS1, OPCML, and DYNLRB2-AS1. Since these effects were not observed in a limited follow-up analysis of non-ART triads, the identified effects are unlikely caused by genetic selection before fertilization.

基因选择发生在成功分娩前的不同阶段。一对夫妇的基因组成可能会影响需要辅助生殖技术（ART）来实现受孕的可能性。然而，频繁的早期胎儿丢失也可能被视为夫妇生育能力下降，因此可能是需要抗逆转录病毒治疗的一个因素。由于抗逆转录病毒治疗可以提高早期胎儿存活率，影响胎儿生存能力的基因在抗逆转录病毒治疗后代中的等位基因分布可能与孟德尔传播下的预期不同，也可能与一般人群的等位基因分布不同。利用来自挪威母亲、父亲和儿童队列研究的遗传数据，我们将胎儿存活率定义为研究结果，并分析了1336例通过ART受孕的父母三胞胎和双胞胎。利用R软件包Haplin中实现的对数线性模型，我们进行了全基因组扫描，以估计胎儿、母亲和父母的起源效应，并就如何估计和解释这些效应提供了详细的讨论。我们检测了CXXC4-AS1、OPCML和DYNLRB2-AS1的单核苷酸多态性（snp）对胎儿的影响。由于在有限的非art三联体随访分析中没有观察到这些影响，因此所确定的影响不太可能是受精前的遗传选择引起的。

{"title":"Genome-Wide Association Analyses in Family Triads and Dyads Following Assisted Reproductive Technology","authors":"Siri N. Skodvin, Håkon K. Gjessing, Astanand Jugessur, Julia Romanowska, Alexandra Havdahl, Siri E. Håberg, Hans Ivar Hanevik, Robert Lyle, Rolv Terje Lie, Miriam Gjerdevik","doi":"10.1002/gepi.70011","DOIUrl":"https://doi.org/10.1002/gepi.70011","url":null,"abstract":"Genetic selection occurs at different stages before a successful birth. The genetic makeup of a couple may influence the likelihood of needing assisted reproductive technology (ART) to achieve conception. However, frequent early fetal losses may also be perceived as reduced couple fertility and may thus be a contributing factor to the need for ART treatment. As ART procedures may enhance early fetal survival, genes that impact fetal viability may have a different allele distribution in ART offspring than expected under Mendelian transmission, as well as compared with the general population. With genetic data available from the Norwegian Mother, Father, and Child Cohort Study, we defined fetal survival as the study outcome and analyzed 1336 case-parent triads and dyads where the offspring were conceived by ART. Using log-linear models implemented in the R package Haplin, we conducted genome-wide scans to estimate fetal, maternal, and parent-of-origin effects and provided a detailed discussion on how these effects are estimated and interpreted. We detected fetal effects for single-nucleotide polymorphisms (SNPs) located in CXXC4-AS1, OPCML, and DYNLRB2-AS1. Since these effects were not observed in a limited follow-up analysis of non-ART triads, the identified effects are unlikely caused by genetic selection before fertilization.","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 5","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.70011","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144197458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Genome-Wide Association Studies of Down Syndrome Associated Congenital Heart Defects Suggests a Genetically Heterogeneous Risk for CHD in DS 唐氏综合征相关先天性心脏缺陷的全基因组关联研究表明，唐氏综合征患者冠心病的遗传异质性风险

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2025-05-23 DOI: 10.1002/gepi.70010

Elizabeth R. Feldman, Yunqi Li, David J. Cutler, Tracie C. Rosser, Stephanie B. Wechsler, Lauren Sanclemente, Angela L. Rachubinski, Natalina Elliott, Paresh Vyas, Irene Roberts, Karen R. Rabin, Michael Wagner, Bruce D. Gelb, Joaquin M. Espinosa, Philip J. Lupo, Adam J. de Smith, Stephanie L. Sherman, Elizabeth J. Leslie-Clarkson

Congenital heart defects (CHDs) are the most common structural birth defect and are present in 40%–50% of children born with Down syndrome (DS). To characterize the genetic architecture of DS-associated CHD, we sequenced genomes of a multiethnic group of children with DS and a CHD (n = 886: atrioventricular septal defects (AVSD), n = 438; atrial septal defects (ASD), n = 122; ventricular septal defects (VSD), n = 170; other types of CHD, n = 156) and DS with a structurally normal heart (DS + NH, n = 572). We performed four genome-wide association study (GWAS) for common variants (MAF > 0.05) comparing DS with CHD, stratified by CHD-subtype, to DS + NH controls. Although no SNP achieved genome-wide significance, multiple loci in each analysis achieved suggestive significance (p < 2 × 10⁻⁶). Of these, the 1p35.1 locus (near RBBP4) was specifically associated with ASD risk, and the 5q35.2 locus (near MSX2) was associated with any type of CHD. Each of the suggestive loci contained one or more plausible candidate genes expressed in the developing heart. While no SNP replicated (p < 2 × 10⁻⁶) in an independent cohort of DS + CHD (DS + CHD: n = 229; DS + NH: n = 197), most SNPs that were suggestive in our GWASs remained suggestive when meta-analyzed with the GWASs from the replication cohort. These results build on previous work to identify genetic modifiers of DS-associated CHD.

先天性心脏缺陷（CHDs）是最常见的结构性出生缺陷，在40%-50%的唐氏综合症（DS）患儿中存在。为了表征DS相关冠心病的遗传结构，我们对一组多民族DS合并冠心病儿童进行了基因组测序(n = 886；房室间隔缺损（AVSD）， n = 438；房间隔缺损（ASD）， 122例；室间隔缺损（VSD）， n = 170；其他类型冠心病（n = 156）和心脏结构正常的DS （DS + nhn = 572）。我们进行了四项全基因组关联研究（GWAS），比较DS与冠心病的常见变异（MAF > 0.05），按冠心病亚型分层，与DS + NH对照。虽然没有SNP具有全基因组意义，但每次分析中的多个位点都具有提示意义（p < 2 × 10−6）。其中，1p35.1位点（靠近RBBP4）与ASD风险特异性相关，5q35.2位点（靠近MSX2）与任何类型的冠心病相关。每个暗示性基因座都包含一个或多个在发育中的心脏中表达的可能的候选基因。而在DS +冠心病的独立队列中没有SNP复制（p < 2 × 10−6）(DS +冠心病：n = 229；DS + NH: n = 197)，在我们的GWASs中提示的大多数snp在与复制队列的GWASs进行meta分析时仍然提示。这些结果建立在先前鉴定ds相关冠心病基因修饰因子的工作基础上。

{"title":"Genome-Wide Association Studies of Down Syndrome Associated Congenital Heart Defects Suggests a Genetically Heterogeneous Risk for CHD in DS","authors":"Elizabeth R. Feldman, Yunqi Li, David J. Cutler, Tracie C. Rosser, Stephanie B. Wechsler, Lauren Sanclemente, Angela L. Rachubinski, Natalina Elliott, Paresh Vyas, Irene Roberts, Karen R. Rabin, Michael Wagner, Bruce D. Gelb, Joaquin M. Espinosa, Philip J. Lupo, Adam J. de Smith, Stephanie L. Sherman, Elizabeth J. Leslie-Clarkson","doi":"10.1002/gepi.70010","DOIUrl":"https://doi.org/10.1002/gepi.70010","url":null,"abstract":"<div>\u0000 \u0000 Congenital heart defects (CHDs) are the most common structural birth defect and are present in 40%–50% of children born with Down syndrome (DS). To characterize the genetic architecture of DS-associated CHD, we sequenced genomes of a multiethnic group of children with DS and a CHD (n = 886: atrioventricular septal defects (AVSD), n = 438; atrial septal defects (ASD), n = 122; ventricular septal defects (VSD), n = 170; other types of CHD, n = 156) and DS with a structurally normal heart (DS + NH, n = 572). We performed four genome-wide association study (GWAS) for common variants (MAF > 0.05) comparing DS with CHD, stratified by CHD-subtype, to DS + NH controls. Although no SNP achieved genome-wide significance, multiple loci in each analysis achieved suggestive significance (p < 2 × 10−6). Of these, the 1p35.1 locus (near RBBP4) was specifically associated with ASD risk, and the 5q35.2 locus (near MSX2) was associated with any type of CHD. Each of the suggestive loci contained one or more plausible candidate genes expressed in the developing heart. While no SNP replicated (p < 2 × 10−6) in an independent cohort of DS + CHD (DS + CHD: n = 229; DS + NH: n = 197), most SNPs that were suggestive in our GWASs remained suggestive when meta-analyzed with the GWASs from the replication cohort. These results build on previous work to identify genetic modifiers of DS-associated CHD.\u0000 </div>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144118194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Three Loci Affecting Variance of Body Mass Index in African Americans and Sub-Saharan Africans 影响非洲裔美国人和撒哈拉以南非洲人体重指数差异的三个基因座

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2025-05-05 DOI: 10.1002/gepi.70009

Daniel Shriner, Amy R. Bentley, Ayo P. Doumatey, Jie Zhou, Guanjie Chen, Charles N. Rotimi, Adebowale A. Adeyemo

Conventional genome-wide association studies (GWAS) are designed to assess the effect of a genetic locus on phenotypic mean by genotype. Such loci explain a proportion of phenotypic variance known as narrow-sense heritability. In contrast, variance quantitative trait loci (vQTL) are associated with the phenotypic variance by genotype. These loci explain an additional proportion of phenotypic variance and contribute to broad-sense heritability but not to narrow-sense heritability. Here, a genome-wide vQTL analysis in 22,805 African Americans yielded eight loci for body mass index (BMI). Of these loci, three were replicated in 6002 sub-Saharan Africans. No locus reached genome-wide significance using the standard additive model. Furthermore, no locus showed evidence for natural selection, haplotype effects, or gene × sex or gene × study interactions. Two loci showed evidence for an effect of locus-specific ancestry resulting from admixture and for a gene × gene interaction. One locus showed evidence for interaction with diastolic blood pressure, consistent with this vQTL capturing an unmodeled gene × covariate interaction. These analyses demonstrate that relevant BMI loci can be detected by evaluating vQTL and that these loci contribute to the underexplored broad-sense heritability for this trait.

传统的全基因组关联研究（GWAS）旨在通过基因型评估遗传位点对表型平均值的影响。这样的基因座解释了被称为狭义遗传力的表型变异的一部分。相反，变异数量性状位点（vQTL）通过基因型与表型变异相关。这些基因座解释了表型变异的额外比例，并有助于广义遗传力，但不是狭义遗传力。在这里，对22,805名非裔美国人的全基因组vQTL分析得出了8个体重指数（BMI）位点。在这些基因座中，有三个在6002名撒哈拉以南非洲人身上得到了复制。使用标准加性模型，没有位点达到全基因组显著性。此外，没有证据表明基因座存在自然选择、单倍型效应、基因与性别或基因与研究的相互作用。两个基因座显示了由混合和基因×基因相互作用产生的位点特异性祖先效应的证据。一个位点显示出与舒张压相互作用的证据，这与该vQTL捕获未建模的基因×协变量相互作用一致。这些分析表明，通过评估vQTL可以检测到相关的BMI位点，并且这些位点有助于该性状未被充分开发的广义遗传力。

{"title":"Three Loci Affecting Variance of Body Mass Index in African Americans and Sub-Saharan Africans","authors":"Daniel Shriner, Amy R. Bentley, Ayo P. Doumatey, Jie Zhou, Guanjie Chen, Charles N. Rotimi, Adebowale A. Adeyemo","doi":"10.1002/gepi.70009","DOIUrl":"https://doi.org/10.1002/gepi.70009","url":null,"abstract":"Conventional genome-wide association studies (GWAS) are designed to assess the effect of a genetic locus on phenotypic mean by genotype. Such loci explain a proportion of phenotypic variance known as narrow-sense heritability. In contrast, variance quantitative trait loci (vQTL) are associated with the phenotypic variance by genotype. These loci explain an additional proportion of phenotypic variance and contribute to broad-sense heritability but not to narrow-sense heritability. Here, a genome-wide vQTL analysis in 22,805 African Americans yielded eight loci for body mass index (BMI). Of these loci, three were replicated in 6002 sub-Saharan Africans. No locus reached genome-wide significance using the standard additive model. Furthermore, no locus showed evidence for natural selection, haplotype effects, or gene × sex or gene × study interactions. Two loci showed evidence for an effect of locus-specific ancestry resulting from admixture and for a gene × gene interaction. One locus showed evidence for interaction with diastolic blood pressure, consistent with this vQTL capturing an unmodeled gene × covariate interaction. These analyses demonstrate that relevant BMI loci can be detected by evaluating vQTL and that these loci contribute to the underexplored broad-sense heritability for this trait.","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.70009","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143905286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Uncertainty Quantification in Epigenetic Clocks via Conformalized Quantile Regression 通过保形分位数回归分析表观遗传时钟的不确定性

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2025-03-27 DOI: 10.1002/gepi.70008

Yanping Li, Jaclyn M. Goodrich, Karen E. Peterson, Peter X.-K. Song, Lan Luo

DNA methylation (DNAm) is a chemical modification of DNA that can be influenced by various factors, including age, the environment, and lifestyle. An epigenetic clock is a predictive tool that measures biological age based on DNAm levels. It can provide insights into an individual's biological age, which may differ from their chronological age. This difference, known as the epigenetic age acceleration, may reflect health status and the risk for age-related diseases. Moreover, epigenetic clocks are used in studies of aging to assess the effectiveness of antiaging interventions and to understand the underlying mechanisms of aging and disease. Various epigenetic clocks have been developed using samples from different populations, tissues, and cell types, typically by training high-dimensional linear regression models with an elastic net penalty. While these models can predict mean biological age based on DNAm with high precision, there is a lack of uncertainty quantification which is important for interpreting the precision of age estimations and for clinical decision-making. To understand the distribution of a biological age clock beyond its mean, we propose a general pipeline for training epigenetic clocks, based on an integration of high-dimensional quantile regression and conformal prediction, to effectively reveal population heterogeneity and construct prediction intervals. Our approach produces adaptive prediction intervals not only achieving nominal coverage but also accounting for the inherent variability across individuals. By using the data collected from 728 blood samples in 11 DNAm data sets from children, we find that our quantile regression-based prediction intervals are narrower than those derived from conventional mean regression-based epigenetic clocks. This observation demonstrates an improved statistical efficiency over the existing pipeline for training epigenetic clocks. In addition, the resulting intervals have a synchronized varying pattern to age acceleration, effectively revealing cellular evolutionary heterogeneity in age patterns in different developmental stages during individual childhoods and adolescent cohort. Our findings suggest that conformalized high-dimensional quantile regression can produce valid prediction intervals and uncover underlying population heterogeneity. Although our methodology focuses on the distribution of measures of biological aging in children, it is applicable to a broader range of age groups to improve understanding of epigenetic age beyond the mean. This inference-based toolbox could provide valuable insights for future applications of epigenetic interventions for age-related diseases.

DNA甲基化（DNAm）是DNA的一种化学修饰，可以受到各种因素的影响，包括年龄、环境和生活方式。表观遗传时钟是一种基于dna水平测量生物年龄的预测工具。它可以让我们了解一个人的生理年龄，这可能与他们的实际年龄不同。这种差异被称为表观遗传年龄加速，可能反映了健康状况和患年龄相关疾病的风险。此外，表观遗传时钟被用于衰老研究，以评估抗衰老干预措施的有效性，并了解衰老和疾病的潜在机制。使用来自不同群体、组织和细胞类型的样本开发了各种表观遗传时钟，通常是通过训练具有弹性净惩罚的高维线性回归模型。虽然这些模型可以高精度地预测基于DNAm的平均生物年龄，但缺乏不确定性量化，这对于解释年龄估计的精度和临床决策至关重要。为了了解生物年龄时钟在其平均值之外的分布，我们提出了一种基于高维分位数回归和适形预测相结合的训练表观遗传时钟的通用管道，以有效地揭示群体异质性并构建预测区间。我们的方法产生了自适应的预测区间，不仅实现了名义覆盖率，而且考虑了个体之间的内在变异性。通过使用来自11个儿童DNAm数据集的728份血液样本收集的数据，我们发现基于分位数回归的预测区间比基于传统平均回归的表观遗传时钟的预测区间窄。这一观察结果表明，与现有的训练表观遗传时钟的管道相比，统计效率有所提高。此外，所得时间间隔与年龄加速具有同步变化模式，有效揭示了个体童年和青少年群体不同发育阶段年龄模式的细胞进化异质性。我们的研究结果表明，符合的高维分位数回归可以产生有效的预测区间，并揭示潜在的群体异质性。虽然我们的方法侧重于儿童生物衰老测量的分布，但它适用于更广泛的年龄组，以提高对表观遗传年龄超出平均值的理解。这个基于推理的工具箱可以为未来应用表观遗传干预治疗与年龄有关的疾病提供有价值的见解。

{"title":"Uncertainty Quantification in Epigenetic Clocks via Conformalized Quantile Regression","authors":"Yanping Li, Jaclyn M. Goodrich, Karen E. Peterson, Peter X.-K. Song, Lan Luo","doi":"10.1002/gepi.70008","DOIUrl":"https://doi.org/10.1002/gepi.70008","url":null,"abstract":"DNA methylation (DNAm) is a chemical modification of DNA that can be influenced by various factors, including age, the environment, and lifestyle. An epigenetic clock is a predictive tool that measures biological age based on DNAm levels. It can provide insights into an individual's biological age, which may differ from their chronological age. This difference, known as the epigenetic age acceleration, may reflect health status and the risk for age-related diseases. Moreover, epigenetic clocks are used in studies of aging to assess the effectiveness of antiaging interventions and to understand the underlying mechanisms of aging and disease. Various epigenetic clocks have been developed using samples from different populations, tissues, and cell types, typically by training high-dimensional linear regression models with an elastic net penalty. While these models can predict mean biological age based on DNAm with high precision, there is a lack of uncertainty quantification which is important for interpreting the precision of age estimations and for clinical decision-making. To understand the distribution of a biological age clock beyond its mean, we propose a general pipeline for training epigenetic clocks, based on an integration of high-dimensional quantile regression and conformal prediction, to effectively reveal population heterogeneity and construct prediction intervals. Our approach produces adaptive prediction intervals not only achieving nominal coverage but also accounting for the inherent variability across individuals. By using the data collected from 728 blood samples in 11 DNAm data sets from children, we find that our quantile regression-based prediction intervals are narrower than those derived from conventional mean regression-based epigenetic clocks. This observation demonstrates an improved statistical efficiency over the existing pipeline for training epigenetic clocks. In addition, the resulting intervals have a synchronized varying pattern to age acceleration, effectively revealing cellular evolutionary heterogeneity in age patterns in different developmental stages during individual childhoods and adolescent cohort. Our findings suggest that conformalized high-dimensional quantile regression can produce valid prediction intervals and uncover underlying population heterogeneity. Although our methodology focuses on the distribution of measures of biological aging in children, it is applicable to a broader range of age groups to improve understanding of epigenetic age beyond the mean. This inference-based toolbox could provide valuable insights for future applications of epigenetic interventions for age-related diseases.","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.70008","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143707357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Parent-of-Origin Effects in Childhood Asthma at Seven Years of Age 7岁儿童哮喘的原生父母影响

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2025-03-25 DOI: 10.1002/gepi.70007

Yunsung Lee, Miriam Gjerdevik, Astanand Jugessur, Håkon Kristian Gjessing, Elizabeth Corfield, Alexandra Havdahl, Jennifer Ruth Harris, Maria Christine Magnus, Siri Eldevik Håberg, Per Magnus

Childhood asthma is more common among children whose mothers have asthma than among those whose fathers have asthma. The reasons for this are unknown, and we hypothesize that genomic imprinting may partly explain this observation. Our aim is to assess parent-of-origin (PoO) effects on childhood asthma by analyzing SNP array genotype data from a large population-based cohort. To estimate PoO effects in parent-reported childhood asthma at 7 years of age, we fit a log-linear model implemented in the HAPLIN R package to SNP array genotype data from 915 mother–father–child case triads, 603 mother–child case dyads, and 113 father–child case dyads participating in the Norwegian Mother, Father, and Child Cohort Study (MoBa). We found that alleles at two SNPs—rs3003214 and rs3003211—near the adenylosuccinate synthase 2 gene (ADSS2 on chromosome 1q44) showed significant PoO effects at a false positive rate ≤ 0.05. The ratio of the effect of the maternally and paternally inherited G-allele at rs3003214 was 1.68 (95% CI: 1.41–2.03, p value = 1.13E−08). Our results suggest PoO effects at the ADSS2 gene, particularly the maternally inherited G-allele at rs3003214, may contribute to the maternal effect in childhood asthma.

儿童哮喘在母亲患有哮喘的儿童中比在父亲患有哮喘的儿童中更常见。其原因尚不清楚，我们假设基因组印记可能部分解释了这一观察结果。我们的目的是通过分析来自大型人群队列的SNP阵列基因型数据来评估父母起源（PoO）对儿童哮喘的影响。为了估计PoO对父母报告的7岁儿童哮喘的影响，我们将HAPLIN R软件包中实现的对数线性模型拟合到参与挪威母亲，父亲和儿童队列研究（MoBa）的915名母亲-父亲-儿童三联病例，603名母亲-儿童双体病例和113名父亲-儿童双体病例的SNP阵列基因型数据。我们发现在1q44染色体上的腺苷琥珀酸合酶2基因（ADSS2）附近的两个snp位点rs3003214和rs3003211的等位基因表现出显著的PoO效应，假阳性率≤0.05。母本遗传g等位基因与父本遗传g等位基因在rs3003214位点的影响比为1.68 （95% CI: 1.41 ~ 2.03， p值= 1.13E−08）。我们的研究结果表明，粪便对ADSS2基因的影响，特别是母系遗传的g等位基因rs3003214，可能有助于儿童哮喘的母系效应。

{"title":"Parent-of-Origin Effects in Childhood Asthma at Seven Years of Age","authors":"Yunsung Lee, Miriam Gjerdevik, Astanand Jugessur, Håkon Kristian Gjessing, Elizabeth Corfield, Alexandra Havdahl, Jennifer Ruth Harris, Maria Christine Magnus, Siri Eldevik Håberg, Per Magnus","doi":"10.1002/gepi.70007","DOIUrl":"https://doi.org/10.1002/gepi.70007","url":null,"abstract":"Childhood asthma is more common among children whose mothers have asthma than among those whose fathers have asthma. The reasons for this are unknown, and we hypothesize that genomic imprinting may partly explain this observation. Our aim is to assess parent-of-origin (PoO) effects on childhood asthma by analyzing SNP array genotype data from a large population-based cohort. To estimate PoO effects in parent-reported childhood asthma at 7 years of age, we fit a log-linear model implemented in the HAPLIN R package to SNP array genotype data from 915 mother–father–child case triads, 603 mother–child case dyads, and 113 father–child case dyads participating in the Norwegian Mother, Father, and Child Cohort Study (MoBa). We found that alleles at two SNPs—rs3003214 and rs3003211—near the adenylosuccinate synthase 2 gene (ADSS2 on chromosome 1q44) showed significant PoO effects at a false positive rate ≤ 0.05. The ratio of the effect of the maternally and paternally inherited G-allele at rs3003214 was 1.68 (95% CI: 1.41–2.03, p value = 1.13E−08). Our results suggest PoO effects at the ADSS2 gene, particularly the maternally inherited G-allele at rs3003214, may contribute to the maternal effect in childhood asthma.","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.70007","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143698714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Dimension Reduction Using Local Principal Components for Regression-Based Multi-SNP Analysis in 1000 Genomes and the Canadian Longitudinal Study on Aging (CLSA) 利用局部主成分降低维度，在 1000 基因组和加拿大老龄化纵向研究 (CLSA) 中进行基于回归的多 SNP 分析

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2025-03-01 DOI: 10.1002/gepi.70005

Fatemeh Yavartanoo, Myriam Brossard, Shelley B. Bull, Andrew D. Paterson, Yun Joo Yoo

For genetic association analysis based on multiple SNP regression of genotypes obtained by dense DNA sequencing or array data imputation, multi-collinearity can be a severe issue causing failure to fit the regression model. In this study, we propose a method of Dimension Reduction using Local Principal Components (DRLPC) which aims to resolve multi-collinearity by removing SNPs under the assumption that the remaining SNPs can capture the effect of a removed SNP due to high linear dependency. This approach to dimension reduction is expected to improve the power of regression-based statistical tests. We apply DRLPC to chromosome 22 SNPs of two data sets, the 1000 Genomes Project (phase 3) and the Canadian Longitudinal Study on Aging (CLSA), and calculate variance inflation factors (VIF) in various SNP-sets before and after implementing DRLPC as a metric of collinearity. Notably, DRLPC addresses multi-collinearity by excluding variables with a VIF exceeding a predetermined threshold (VIF = 20), thereby improving applicability for subsequent regression analyses. The number of variables in a final set for regression analysis is reduced to around 20% on average for larger-sized genes, whereas for smaller ones, the proportion is around 48%; suggesting that DRLPC is particularly effective for larger genes. We also compare the power of several multi-SNP statistics constructed for gene-specific analysis to evaluate power gains achieved by DRLPC. In simulation studies based on 100 genes with ≤ 500 SNPs per gene, DRLPC increases the power of the multiple regression Wald test from 60% to around 80%.

对于基于密集DNA测序或阵列数据输入获得的基因型的多重SNP回归的遗传关联分析，多重共线性可能是导致回归模型无法拟合的严重问题。在这项研究中，我们提出了一种使用局部主成分（DRLPC）的降维方法，该方法旨在通过去除SNPs来解决多重共线性问题，假设剩余的SNPs可以捕获由于高度线性依赖而被去除的SNP的影响。这种降维方法有望提高基于回归的统计检验的能力。我们将DRLPC应用于1000基因组计划（第三阶段）和加拿大老龄化纵向研究（CLSA）两个数据集的22号染色体snp，并计算了实施DRLPC之前和之后不同snp集的方差膨胀因子（VIF）作为共线性的度量。值得注意的是，DRLPC通过排除VIF超过预定阈值（VIF = 20）的变量来解决多重共线性问题，从而提高了后续回归分析的适用性。对于较大的基因，用于回归分析的最后一组变量的数量平均减少到20%左右，而对于较小的基因，这一比例约为48%；这表明DRLPC对较大的基因特别有效。我们还比较了为基因特异性分析构建的几个多snp统计数据的功率，以评估DRLPC获得的功率增益。在基于每个基因≤500个snp的100个基因的模拟研究中，DRLPC将多元回归Wald检验的功率从60%提高到80%左右。

{"title":"Dimension Reduction Using Local Principal Components for Regression-Based Multi-SNP Analysis in 1000 Genomes and the Canadian Longitudinal Study on Aging (CLSA)","authors":"Fatemeh Yavartanoo, Myriam Brossard, Shelley B. Bull, Andrew D. Paterson, Yun Joo Yoo","doi":"10.1002/gepi.70005","DOIUrl":"https://doi.org/10.1002/gepi.70005","url":null,"abstract":"<div>\u0000 \u0000 For genetic association analysis based on multiple SNP regression of genotypes obtained by dense DNA sequencing or array data imputation, multi-collinearity can be a severe issue causing failure to fit the regression model. In this study, we propose a method of Dimension Reduction using Local Principal Components (DRLPC) which aims to resolve multi-collinearity by removing SNPs under the assumption that the remaining SNPs can capture the effect of a removed SNP due to high linear dependency. This approach to dimension reduction is expected to improve the power of regression-based statistical tests. We apply DRLPC to chromosome 22 SNPs of two data sets, the 1000 Genomes Project (phase 3) and the Canadian Longitudinal Study on Aging (CLSA), and calculate variance inflation factors (VIF) in various SNP-sets before and after implementing DRLPC as a metric of collinearity. Notably, DRLPC addresses multi-collinearity by excluding variables with a VIF exceeding a predetermined threshold (VIF = 20), thereby improving applicability for subsequent regression analyses. The number of variables in a final set for regression analysis is reduced to around 20% on average for larger-sized genes, whereas for smaller ones, the proportion is around 48%; suggesting that DRLPC is particularly effective for larger genes. We also compare the power of several multi-SNP statistics constructed for gene-specific analysis to evaluate power gains achieved by DRLPC. In simulation studies based on 100 genes with ≤ 500 SNPs per gene, DRLPC increases the power of the multiple regression Wald test from 60% to around 80%.\u0000 </div>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143521945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Sex-Specific Association Between Polymorphisms in Estrogen Receptor Alpha Gene (ESR1) and Depression: A Genome-Wide Association Study of All of Us and UK Biobank Data 雌激素受体α基因（ESR1）多态性与抑郁症之间的性别特异性关联：我们所有人和英国生物银行数据的全基因组关联研究

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2025-02-26 DOI: 10.1002/gepi.70004

Yue Hu, Menglu Che, Heping Zhang

Major depressive disorder (MDD) is prevalent worldwide, substantially and negatively impacting both the quality and length of life of 280 million people globally. The genetic risk factors of MDD have been studied in various previous research, but the findings lack consistency. Sex/gender and racial/ethnic disparities have been reported; however, many previous genetic studies, represented by large-scale genome-wide association studies (GWASs) are known to lack diversity in the study cohorts. All of Us is a biorepository aiming to focus on the historically underrepresented groups. We perform GWASs for the MDD phenotype, using over 200,000 participants' genotypes and carry out sex- and racial/ethnic-specific subgroup studies. We identified a risk locus (chr6:151945242) in Estrogen Receptor Alpha Gene (ESR1) (p = $� � � � � 1.70 � � \times � � �^{10 � � � - � � 9 � �} � � � �$ ), and further confirmed the genetic association is sex-specific. The single-nucleotide polymorphism (SNP) chr6:151945242 was significant only in the male group, but not in the female group. These findings were replicated in the UK Biobank and echo with existing studies on the ESR1 gene and depressive disorders. Our results indicate that the All of Us program is a reliable resource for GWAS, as well as shedding light on further investigation of sex- and racial/ethnic-specific genome association, especially in underrepresented groups of the US population.

重度抑郁症（MDD）在世界范围内普遍存在，对全球2.8亿人的生活质量和寿命产生重大负面影响。以往的各种研究对重度抑郁症的遗传危险因素进行了研究，但研究结果缺乏一致性。已报告性别/性别和种族/族裔差异；然而，以大规模全基因组关联研究（GWASs）为代表的许多先前的遗传研究已知缺乏研究队列的多样性。《我们所有人》是一个生物资料库，旨在关注历史上代表性不足的群体。我们对MDD表型进行了GWASs，使用了超过20万参与者的基因型，并进行了性别和种族/民族特异性亚组研究。我们在雌激素受体α基因（ESR1）中发现了一个危险位点（chr6:151945242）（p = 1.70 × 10−9）$1.70times{10}^{-9}$)，进一步证实了遗传关联是性别特异性的。单核苷酸多态性（SNP） chr6:151945242仅在男性组中显著，在女性组中不显著。这些发现在英国生物银行得到了重复，并与现有的ESR1基因和抑郁症的研究相呼应。我们的研究结果表明，“我们所有人”项目是GWAS的可靠资源，也为进一步研究性别和种族/民族特异性基因组关联提供了线索，特别是在美国人口中代表性不足的群体中。

{"title":"Sex-Specific Association Between Polymorphisms in Estrogen Receptor Alpha Gene (ESR1) and Depression: A Genome-Wide Association Study of All of Us and UK Biobank Data","authors":"Yue Hu, Menglu Che, Heping Zhang","doi":"10.1002/gepi.70004","DOIUrl":"https://doi.org/10.1002/gepi.70004","url":null,"abstract":"<div>\u0000 \u0000 Major depressive disorder (MDD) is prevalent worldwide, substantially and negatively impacting both the quality and length of life of 280 million people globally. The genetic risk factors of MDD have been studied in various previous research, but the findings lack consistency. Sex/gender and racial/ethnic disparities have been reported; however, many previous genetic studies, represented by large-scale genome-wide association studies (GWASs) are known to lack diversity in the study cohorts. All of Us is a biorepository aiming to focus on the historically underrepresented groups. We perform GWASs for the MDD phenotype, using over 200,000 participants' genotypes and carry out sex- and racial/ethnic-specific subgroup studies. We identified a risk locus (chr6:151945242) in Estrogen Receptor Alpha Gene (ESR1) (p = <math>\u0000 <semantics>\u0000 <mrow>\u0000 \u0000 <mrow>\u0000 <mn>1.70</mn>\u0000 \u0000 <mo>×</mo>\u0000 \u0000 <msup>\u0000 <mn>10</mn>\u0000 \u0000 <mrow>\u0000 <mo>−</mo>\u0000 \u0000 <mn>9</mn>\u0000 </mrow>\u0000 </msup>\u0000 </mrow>\u0000 </mrow>\u0000 <annotation> $1.70times {10}^{-9}$</annotation>\u0000 </semantics></math>), and further confirmed the genetic association is sex-specific. The single-nucleotide polymorphism (SNP) chr6:151945242 was significant only in the male group, but not in the female group. These findings were replicated in the UK Biobank and echo with existing studies on the ESR1 gene and depressive disorders. Our results indicate that the All of Us program is a reliable resource for GWAS, as well as shedding light on further investigation of sex- and racial/ethnic-specific genome association, especially in underrepresented groups of the US population.\u0000 </div>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143489994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Reference-Based Standardization Approach Stabilizing Small Batch Risk Prediction via Polygenic Score 基于参考的标准化方法稳定多基因评分的小批量风险预测。

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2025-01-30 DOI: 10.1002/gepi.70002

Yoichi Sutoh, Tsuyoshi Hachiya, Yayoi Otsuka-Yamasaki, Tomoharu Tokutomi, Akiko Yoshida, Yuka Kotozaki, Shohei Komaki, Shiori Minabe, Hideki Ohmomo, Kozo Tanno, Akimune Fukushima, Makoto Sasaki, Atsushi Shimizu

The polygenic score (PGS) holds promise for motivating preventive behavioral changes. However, no clinically validated standardization methodology currently exists. Here, we demonstrate the efficacy of a “reference-based” approach for standardization. This method uses the PGS distribution in the general population as a reference for normalization and percentile determination; however, it has not been validated. We investigated three potential influences on PGS computation: (1) the size of the reference population, (2) biases associated with different genotyping platforms, and (3) inclusion of kinship ties within the reference group. Our results indicate that the reference size affects the bootstrap estimate of standard error for PGS percentiles, peaking around the 50th percentile and diminishing at extreme percentiles (1st or 100th). Discrepancies between genotyping platforms, such as different microarrays and whole-genome sequencing, resulted in deviations in PGS (p < 0.05 in Kolmogorov–Smirnov test). However, these deviations were reduced to a nonsignificant level using shared genetic variants in the calculations when the ancestry of the samples and reference were matched. This approach recovered approximately 9.6% of the positive predictive value of PGS by naïve genotype. Our results provide fundamental insights for establishing clinical guidelines for implementing PGS to communicate reliable risks to individuals.

多基因评分（PGS）有望激励预防性行为改变。然而，目前尚无临床验证的标准化方法。在这里，我们展示了“基于参考”的标准化方法的有效性。该方法以一般人群中的PGS分布作为归一化和百分位数确定的参考；然而，它还没有得到证实。我们研究了对PGS计算的三个潜在影响：(1)参考人群的规模，(2)与不同基因分型平台相关的偏差，(3)参考群体中亲属关系的纳入。我们的研究结果表明，参考大小影响PGS百分位数的标准误差的自举估计，在第50百分位数附近达到峰值，在极端百分位数（第1或第100）处减弱。基因分型平台之间的差异，如不同的微阵列和全基因组测序，导致PGS的偏差(p

{"title":"Reference-Based Standardization Approach Stabilizing Small Batch Risk Prediction via Polygenic Score","authors":"Yoichi Sutoh, Tsuyoshi Hachiya, Yayoi Otsuka-Yamasaki, Tomoharu Tokutomi, Akiko Yoshida, Yuka Kotozaki, Shohei Komaki, Shiori Minabe, Hideki Ohmomo, Kozo Tanno, Akimune Fukushima, Makoto Sasaki, Atsushi Shimizu","doi":"10.1002/gepi.70002","DOIUrl":"10.1002/gepi.70002","url":null,"abstract":"<div>\u0000 \u0000 The polygenic score (PGS) holds promise for motivating preventive behavioral changes. However, no clinically validated standardization methodology currently exists. Here, we demonstrate the efficacy of a “reference-based” approach for standardization. This method uses the PGS distribution in the general population as a reference for normalization and percentile determination; however, it has not been validated. We investigated three potential influences on PGS computation: (1) the size of the reference population, (2) biases associated with different genotyping platforms, and (3) inclusion of kinship ties within the reference group. Our results indicate that the reference size affects the bootstrap estimate of standard error for PGS percentiles, peaking around the 50th percentile and diminishing at extreme percentiles (1st or 100th). Discrepancies between genotyping platforms, such as different microarrays and whole-genome sequencing, resulted in deviations in PGS (p < 0.05 in Kolmogorov–Smirnov test). However, these deviations were reduced to a nonsignificant level using shared genetic variants in the calculations when the ancestry of the samples and reference were matched. This approach recovered approximately 9.6% of the positive predictive value of PGS by naïve genotype. Our results provide fundamental insights for establishing clinical guidelines for implementing PGS to communicate reliable risks to individuals.\u0000 </div>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 2","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143065247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

RetroFun-RVS: A Retrospective Family-Based Framework for Rare Variant Analysis Incorporating Functional Annotations RetroFun-RVS：一个回顾性的基于家族的框架，用于结合功能注释的罕见变体分析。

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2025-01-28 DOI: 10.1002/gepi.70001

Loïc Mangnier, Ingo Ruczinski, Jasmin Ricard, Claudia Moreau, Simon Girard, Michel Maziade, Alexandre Bureau

A large proportion of genetic variations involved in complex diseases are rare and located within noncoding regions, making the interpretation of underlying biological mechanisms a daunting task. Although technical and methodological progress has been made to annotate the genome, current disease-rare-variant association tests incorporating such annotations suffer from two major limitations. First, they are generally restricted to case−control designs of unrelated individuals, which often require tens or hundreds of thousands of individuals to achieve sufficient power. Second, they were not evaluated with region-based annotations needed to interpret the causal regulatory mechanisms. In this work, we propose RetroFun-RVS, a new retrospective family-based score test, incorporating functional annotations. A critical feature of the proposed method is to aggregate genotypes to compare against rare variant-sharing expectations among affected family members. Through extensive simulations, we have demonstrated that RetroFun-RVS integrating networks based on 3D genome contacts as functional annotations reach greater power over the region-wide test, other strategies to include subregions and competing methods. Also, the proposed framework shows robustness to non-informative annotations, maintaining its power when causal variants are spread across regions. Asymptotic p-values are susceptible to Type I error inflation when the number of families with rare variants is small, and a bootstrap procedure is recommended in these instances. Application of RetroFun-RVS is illustrated on whole genome sequence in the Eastern Quebec Schizophrenia and Bipolar Disorder Kindred Study with networks constructed from 3D contacts and epigenetic data on neurons. In summary, the integration of functional annotations corresponding to regions or networks with transcriptional impacts in rare variant tests appears promising to highlight regulatory mechanisms involved in complex diseases.

涉及复杂疾病的很大一部分遗传变异是罕见的，并且位于非编码区，这使得解释潜在的生物学机制成为一项艰巨的任务。尽管在基因组注释方面取得了技术和方法上的进步，但目前包含这种注释的疾病罕见变异关联测试存在两个主要局限性。首先，它们通常局限于不相关个体的病例对照设计，通常需要数万或数十万个体才能获得足够的权力。其次，它们没有使用解释因果调节机制所需的基于区域的注释进行评估。在这项工作中，我们提出了RetroFun-RVS，一个新的基于家庭的回顾性分数测试，包含功能注释。所提出方法的一个关键特征是汇总基因型，以比较受影响家庭成员之间罕见的变异共享期望。通过大量的模拟，我们已经证明RetroFun-RVS基于3D基因组接触作为功能注释的整合网络在区域范围内的测试中具有更大的功能，其他策略包括子区域和竞争方法。此外，所提出的框架对非信息注释具有鲁棒性，当因果变量跨区域分布时保持其功能。当具有罕见变异的家族数量较少时，渐近p值容易受到I型误差膨胀的影响，在这些情况下建议采用自举过程。RetroFun-RVS在东魁北克精神分裂症和双相情感障碍亲缘关系研究中的全基因组序列应用，该研究利用三维接触和神经元表观遗传数据构建网络。综上所述，在罕见变异测试中整合与转录影响相关的区域或网络的功能注释，似乎有望突出复杂疾病的调控机制。

{"title":"RetroFun-RVS: A Retrospective Family-Based Framework for Rare Variant Analysis Incorporating Functional Annotations","authors":"Loïc Mangnier, Ingo Ruczinski, Jasmin Ricard, Claudia Moreau, Simon Girard, Michel Maziade, Alexandre Bureau","doi":"10.1002/gepi.70001","DOIUrl":"10.1002/gepi.70001","url":null,"abstract":"A large proportion of genetic variations involved in complex diseases are rare and located within noncoding regions, making the interpretation of underlying biological mechanisms a daunting task. Although technical and methodological progress has been made to annotate the genome, current disease-rare-variant association tests incorporating such annotations suffer from two major limitations. First, they are generally restricted to case−control designs of unrelated individuals, which often require tens or hundreds of thousands of individuals to achieve sufficient power. Second, they were not evaluated with region-based annotations needed to interpret the causal regulatory mechanisms. In this work, we propose RetroFun-RVS, a new retrospective family-based score test, incorporating functional annotations. A critical feature of the proposed method is to aggregate genotypes to compare against rare variant-sharing expectations among affected family members. Through extensive simulations, we have demonstrated that RetroFun-RVS integrating networks based on 3D genome contacts as functional annotations reach greater power over the region-wide test, other strategies to include subregions and competing methods. Also, the proposed framework shows robustness to non-informative annotations, maintaining its power when causal variants are spread across regions. Asymptotic p-values are susceptible to Type I error inflation when the number of families with rare variants is small, and a bootstrap procedure is recommended in these instances. Application of RetroFun-RVS is illustrated on whole genome sequence in the Eastern Quebec Schizophrenia and Bipolar Disorder Kindred Study with networks constructed from 3D contacts and epigenetic data on neurons. In summary, the integration of functional annotations corresponding to regions or networks with transcriptional impacts in rare variant tests appears promising to highlight regulatory mechanisms involved in complex diseases.","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 2","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11775437/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143058818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Gene−Air Pollution Interaction and Diversity of Genetic Sampling: The Southern California Children's Health Study 基因-空气污染相互作用和遗传样本多样性：南加州儿童健康研究。

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2025-01-25 DOI: 10.1002/gepi.70000

Justine Po, John Morrison, Brittney Marian, Zhanghua Chen, W. James Gauderman, Erika Garcia

Gene−environment interactions have been observed for childhood asthma, however few have been assessed in ethnically diverse populations. Thus, we examined how polygenic risk score (PRS) modifies the association between ambient air pollution exposure (nitrogen dioxide [NO₂], ozone, particulate matter < 2.5 and < 10 μm) and childhood asthma incidence in a diverse cohort. Participants (n = 1794) were drawn from the Southern California Children's Health Study, a multi-wave prospective cohort followed from 4th to 12th grade. PRS was developed using single nucleotide polymorphisms previously associated with childhood asthma. PRS−asthma associations and PRS−air pollutant interactions were estimated using Poisson regression. An interquartile range PRS increase was associated with 36% (95% CI: 9%, 70%) higher asthma incidence among non-Hispanic children, but not associated with asthma among Hispanic children (rate ratio: 0.81 [95% CI: 0.62, 1.04]). NO₂−PRS interaction was borderline significant in the overall sample (coefficient: 0.23 [95% CI: −0.03, 0.49]). Limited evidence was observed for a positive interaction between PRS and NO₂ exposure associated with asthma incidence; however, the literature-based PRS was not associated with asthma among Hispanic participants. Equitable, diverse genetic sampling approaches are needed to better identify clinically relevant SNPs in this population.

已经观察到儿童哮喘的基因-环境相互作用，但很少在不同种族的人群中进行评估。因此，我们研究了多基因风险评分（PRS）如何改变环境空气污染暴露之间的关系(二氧化氮[NO2]，臭氧，颗粒物-PRS相互作用在整个样本中具有临界显著性（系数：0.23 [95% CI: -0.03, 0.49]）。有限的证据表明，PRS和二氧化氮暴露与哮喘发病率之间存在正相互作用；然而，在西班牙裔参与者中，基于文献的PRS与哮喘无关。公平，多样化的遗传采样方法需要更好地识别临床相关的snp在这一人群。

{"title":"Gene−Air Pollution Interaction and Diversity of Genetic Sampling: The Southern California Children's Health Study","authors":"Justine Po, John Morrison, Brittney Marian, Zhanghua Chen, W. James Gauderman, Erika Garcia","doi":"10.1002/gepi.70000","DOIUrl":"10.1002/gepi.70000","url":null,"abstract":"<div>\u0000 \u0000 Gene−environment interactions have been observed for childhood asthma, however few have been assessed in ethnically diverse populations. Thus, we examined how polygenic risk score (PRS) modifies the association between ambient air pollution exposure (nitrogen dioxide [NO2], ozone, particulate matter < 2.5 and < 10 μm) and childhood asthma incidence in a diverse cohort. Participants (n = 1794) were drawn from the Southern California Children's Health Study, a multi-wave prospective cohort followed from 4th to 12th grade. PRS was developed using single nucleotide polymorphisms previously associated with childhood asthma. PRS−asthma associations and PRS−air pollutant interactions were estimated using Poisson regression. An interquartile range PRS increase was associated with 36% (95% CI: 9%, 70%) higher asthma incidence among non-Hispanic children, but not associated with asthma among Hispanic children (rate ratio: 0.81 [95% CI: 0.62, 1.04]). NO2−PRS interaction was borderline significant in the overall sample (coefficient: 0.23 [95% CI: −0.03, 0.49]). Limited evidence was observed for a positive interaction between PRS and NO2 exposure associated with asthma incidence; however, the literature-based PRS was not associated with asthma among Hispanic participants. Equitable, diverse genetic sampling approaches are needed to better identify clinically relevant SNPs in this population.\u0000 </div>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143046386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0