首页 > 最新文献

American journal of human genetics最新文献

英文 中文
The PRIMED Consortium: Reducing disparities in polygenic risk assessment. PRIMED 联合会:减少多基因风险评估中的差异。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-12-05 Epub Date: 2024-11-18 DOI: 10.1016/j.ajhg.2024.10.010
Iftikhar J Kullo, Matthew P Conomos, Sarah C Nelson, Sally N Adebamowo, Ananyo Choudhury, David Conti, Stephanie M Fullerton, Stephanie M Gogarten, Ben Heavner, Whitney E Hornsby, Eimear E Kenny, Alyna Khan, Amit V Khera, Yun Li, Iman Martin, Josep M Mercader, Maggie Ng, Laura M Raffield, Alex Reiner, Robb Rowley, Daniel Schaid, Adrienne Stilp, Ken Wiley, Riley Wilson, John S Witte, Pradeep Natarajan

By improving disease risk prediction, polygenic risk scores (PRSs) could have a significant impact on health promotion and disease prevention. Due to the historical oversampling of populations with European ancestry for genome-wide association studies, PRSs perform less well in other, understudied populations, leading to concerns that clinical use in their current forms could widen health care disparities. The PRIMED Consortium was established to develop methods to improve the performance of PRSs in global populations and individuals of diverse genetic ancestry. To this end, PRIMED is aggregating and harmonizing multiple phenotype and genotype datasets on AnVIL, an interoperable secure cloud-based platform, to perform individual- and summary-level analyses using population and statistical genetics approaches. Study sites, the coordinating center, and representatives from the NIH work alongside other NHGRI and global consortia to achieve these goals. PRIMED is also evaluating ethical and social implications of PRS implementation and investigating the joint modeling of social determinants of health and PRS in computing disease risk. The phenotypes of interest are primarily cardiometabolic diseases and cancer, the leading causes of death and disability worldwide. Early deliverables of the consortium include methods for data sharing on AnVIL, development of a common data model to harmonize phenotype and genotype data from cohort studies as well as electronic health records, adaptation of recent guidelines for population descriptors to global cohorts, and sharing of PRS methods/tools. As a multisite collaboration, PRIMED aims to foster equity in the development and use of polygenic risk assessment.

通过改善疾病风险预测,多基因风险评分(PRS)可对促进健康和预防疾病产生重大影响。由于历史上全基因组关联研究对欧洲血统人群的过度采样,PRS 在其他未充分研究的人群中表现较差,导致人们担心目前形式的临床使用可能会扩大医疗差距。PRIMED 联合会的成立旨在开发各种方法,提高 PRS 在全球人群和不同基因血统个体中的表现。为此,PRIMED 正在 AnVIL(一个可互操作的安全云平台)上汇总和统一多个表型和基因型数据集,以便使用群体遗传学和统计遗传学方法进行个体和汇总级分析。研究机构、协调中心和来自美国国立卫生研究院的代表与其他 NHGRI 和全球联盟一起努力实现这些目标。PRIMED 还在评估实施 PRS 的伦理和社会影响,并调查健康的社会决定因素和 PRS 在计算疾病风险方面的联合建模。关注的表型主要是心脏代谢疾病和癌症,它们是全球死亡和残疾的主要原因。该联盟的早期成果包括 AnVIL 数据共享方法、开发通用数据模型以协调队列研究和电子健康记录中的表型和基因型数据、将最新的人群描述符指南适用于全球队列,以及共享 PRS 方法/工具。作为一项多站点合作项目,PRIMED 的目标是促进多基因风险评估开发和使用的公平性。
{"title":"The PRIMED Consortium: Reducing disparities in polygenic risk assessment.","authors":"Iftikhar J Kullo, Matthew P Conomos, Sarah C Nelson, Sally N Adebamowo, Ananyo Choudhury, David Conti, Stephanie M Fullerton, Stephanie M Gogarten, Ben Heavner, Whitney E Hornsby, Eimear E Kenny, Alyna Khan, Amit V Khera, Yun Li, Iman Martin, Josep M Mercader, Maggie Ng, Laura M Raffield, Alex Reiner, Robb Rowley, Daniel Schaid, Adrienne Stilp, Ken Wiley, Riley Wilson, John S Witte, Pradeep Natarajan","doi":"10.1016/j.ajhg.2024.10.010","DOIUrl":"10.1016/j.ajhg.2024.10.010","url":null,"abstract":"<p><p>By improving disease risk prediction, polygenic risk scores (PRSs) could have a significant impact on health promotion and disease prevention. Due to the historical oversampling of populations with European ancestry for genome-wide association studies, PRSs perform less well in other, understudied populations, leading to concerns that clinical use in their current forms could widen health care disparities. The PRIMED Consortium was established to develop methods to improve the performance of PRSs in global populations and individuals of diverse genetic ancestry. To this end, PRIMED is aggregating and harmonizing multiple phenotype and genotype datasets on AnVIL, an interoperable secure cloud-based platform, to perform individual- and summary-level analyses using population and statistical genetics approaches. Study sites, the coordinating center, and representatives from the NIH work alongside other NHGRI and global consortia to achieve these goals. PRIMED is also evaluating ethical and social implications of PRS implementation and investigating the joint modeling of social determinants of health and PRS in computing disease risk. The phenotypes of interest are primarily cardiometabolic diseases and cancer, the leading causes of death and disability worldwide. Early deliverables of the consortium include methods for data sharing on AnVIL, development of a common data model to harmonize phenotype and genotype data from cohort studies as well as electronic health records, adaptation of recent guidelines for population descriptors to global cohorts, and sharing of PRS methods/tools. As a multisite collaboration, PRIMED aims to foster equity in the development and use of polygenic risk assessment.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2594-2606"},"PeriodicalIF":8.1,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639095/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142674908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Allele frequency impacts the cross-ancestry portability of gene expression prediction in lymphoblastoid cell lines. 等位基因频率影响淋巴母细胞系基因表达预测的跨种可移植性。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-12-05 Epub Date: 2024-11-15 DOI: 10.1016/j.ajhg.2024.10.009
Marie Saitou, Andy Dahl, Qingbo Wang, Xuanyao Liu

Population-level genetic studies are overwhelmingly biased toward European ancestries. Transferring genetic predictions from European ancestries to other ancestries results in a substantial loss of accuracy. Yet, it remains unclear how much various genetic factors, such as causal effect differences, linkage disequilibrium (LD) differences, or allele frequency differences, contribute to the loss of prediction accuracy across ancestries. In this study, we used gene expression levels in lymphoblastoid cell lines to understand how much each genetic factor contributes to lowered portability of gene expression prediction from European to African ancestries. We found that cis-genetic effects on gene expression are highly similar between European and African individuals. However, we found that allele frequency differences of causal variants have a striking impact on prediction portability. For example, portability is reduced by more than 32% when the causal cis-variant is common (minor allele frequency, MAF >5%) in European samples (training population) but is rarer (MAF <5%) in African samples (prediction population). While large allele frequency differences can decrease portability through increasing LD differences, we also determined that causal allele frequency can significantly impact portability when the impact from LD is substantially controlled. This observation suggests that improving statistical fine-mapping alone does not overcome the loss of portability resulting from differences in causal allele frequency. We conclude that causal cis-eQTL effects are highly similar in European and African individuals, and allele frequency differences have a large impact on the accuracy of gene expression prediction.

人口层面的基因研究绝大多数偏重于欧洲血统。将欧洲血统的遗传预测转移到其他血统会导致准确性大幅下降。然而,各种遗传因素(如因果效应差异、连锁不平衡(LD)差异或等位基因频率差异)在多大程度上导致了跨血统预测准确性的损失,目前仍不清楚。在这项研究中,我们利用淋巴母细胞样细胞系的基因表达水平来了解每种遗传因素对降低从欧洲血统到非洲血统的基因表达预测可移植性的贡献程度。我们发现,顺式遗传对基因表达的影响在欧洲人和非洲人之间非常相似。然而,我们发现因果变异的等位基因频率差异对预测的可移植性有显著影响。例如,当因果顺式变异体在欧洲样本(训练人群)中常见(小等位基因频率,MAF >5%),而在非洲样本(训练人群)中较少见(MAF >5%)时,可移植性会降低 32% 以上。
{"title":"Allele frequency impacts the cross-ancestry portability of gene expression prediction in lymphoblastoid cell lines.","authors":"Marie Saitou, Andy Dahl, Qingbo Wang, Xuanyao Liu","doi":"10.1016/j.ajhg.2024.10.009","DOIUrl":"10.1016/j.ajhg.2024.10.009","url":null,"abstract":"<p><p>Population-level genetic studies are overwhelmingly biased toward European ancestries. Transferring genetic predictions from European ancestries to other ancestries results in a substantial loss of accuracy. Yet, it remains unclear how much various genetic factors, such as causal effect differences, linkage disequilibrium (LD) differences, or allele frequency differences, contribute to the loss of prediction accuracy across ancestries. In this study, we used gene expression levels in lymphoblastoid cell lines to understand how much each genetic factor contributes to lowered portability of gene expression prediction from European to African ancestries. We found that cis-genetic effects on gene expression are highly similar between European and African individuals. However, we found that allele frequency differences of causal variants have a striking impact on prediction portability. For example, portability is reduced by more than 32% when the causal cis-variant is common (minor allele frequency, MAF >5%) in European samples (training population) but is rarer (MAF <5%) in African samples (prediction population). While large allele frequency differences can decrease portability through increasing LD differences, we also determined that causal allele frequency can significantly impact portability when the impact from LD is substantially controlled. This observation suggests that improving statistical fine-mapping alone does not overcome the loss of portability resulting from differences in causal allele frequency. We conclude that causal cis-eQTL effects are highly similar in European and African individuals, and allele frequency differences have a large impact on the accuracy of gene expression prediction.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2814-2825"},"PeriodicalIF":8.1,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639078/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142643243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The methylomic landscape of human articular cartilage development contains epigenetic signatures of osteoarthritis risk. 人类关节软骨发育的甲基组图谱包含骨关节炎风险的表观遗传学特征。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-12-05 Epub Date: 2024-11-22 DOI: 10.1016/j.ajhg.2024.10.017
Euan McDonnell, Sarah E Orr, Matthew J Barter, Danielle Rux, Abby Brumwell, Nicola Wrobel, Lee Murphy, Lynne M Overman, Antony K Sorial, David A Young, Jamie Soul, Sarah J Rice

Increasing evidence is emerging to link age-associated complex musculoskeletal diseases, including osteoarthritis (OA), to developmental factors. Multiple studies have shown a functional role for DNA methylation in the genetic mechanisms of OA risk using articular cartilage samples taken from aged individuals, yet knowledge of temporal changes to the methylome during human cartilage development is limited. We quantified DNA methylation at ∼700,000 individual CpGs across the epigenome of developing human chondrocytes in 72 samples ranging from 7 to 21 post-conception weeks. We identified significant changes in 3% of all CpGs and >8,200 developmental differentially methylated regions. We further identified 24 loci at which OA genetic variants colocalize with methylation quantitative trait loci. Through integrating developmental and mature human chondrocyte datasets, we find evidence for functional effects exerted solely in development or throughout the life course. This will have profound impacts on future approaches to translating genetic pathways for therapeutic intervention.

越来越多的证据表明,包括骨关节炎(OA)在内的与年龄相关的复杂肌肉骨骼疾病与发育因素有关。多项研究表明,DNA甲基化在老年关节软骨样本的OA风险遗传机制中发挥着功能性作用,但人们对人类软骨发育过程中甲基组的时间变化了解有限。我们对发育中的人类软骨细胞表观基因组中 700,000 个 CpGs 的 DNA 甲基化进行了量化,共采集了 72 个样本,时间跨度从怀孕后 7 周到 21 周不等。我们发现了所有 CpGs 中的 3% 发生了重大变化,以及超过 8,200 个发育差异甲基化区域。我们进一步确定了 24 个 OA 遗传变异与甲基化定量性状位点共定位的位点。通过整合发育期和成熟期人类软骨细胞数据集,我们发现了仅在发育期或整个生命过程中产生功能效应的证据。这将对未来转化遗传途径进行治疗干预的方法产生深远影响。
{"title":"The methylomic landscape of human articular cartilage development contains epigenetic signatures of osteoarthritis risk.","authors":"Euan McDonnell, Sarah E Orr, Matthew J Barter, Danielle Rux, Abby Brumwell, Nicola Wrobel, Lee Murphy, Lynne M Overman, Antony K Sorial, David A Young, Jamie Soul, Sarah J Rice","doi":"10.1016/j.ajhg.2024.10.017","DOIUrl":"10.1016/j.ajhg.2024.10.017","url":null,"abstract":"<p><p>Increasing evidence is emerging to link age-associated complex musculoskeletal diseases, including osteoarthritis (OA), to developmental factors. Multiple studies have shown a functional role for DNA methylation in the genetic mechanisms of OA risk using articular cartilage samples taken from aged individuals, yet knowledge of temporal changes to the methylome during human cartilage development is limited. We quantified DNA methylation at ∼700,000 individual CpGs across the epigenome of developing human chondrocytes in 72 samples ranging from 7 to 21 post-conception weeks. We identified significant changes in 3% of all CpGs and >8,200 developmental differentially methylated regions. We further identified 24 loci at which OA genetic variants colocalize with methylation quantitative trait loci. Through integrating developmental and mature human chondrocyte datasets, we find evidence for functional effects exerted solely in development or throughout the life course. This will have profound impacts on future approaches to translating genetic pathways for therapeutic intervention.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2756-2772"},"PeriodicalIF":8.1,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639090/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142695101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Monoallelic pathogenic variants in LEPR do not cause obesity. LEPR 的单倍致病变体不会导致肥胖。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-12-05 Epub Date: 2024-11-18 DOI: 10.1016/j.ajhg.2024.10.014
Jérôme Delplanque, Lauriane Le Collen, Hélène Loiselle, Audrey Leloire, Bénédicte Toussaint, Emmanuel Vaillant, Guillaume Charpentier, Sylvia Franc, Beverley Balkau, Michel Marre, Emma Henriques, Emmanuel Buse Falay, Mehdi Derhourhi, Philippe Froguel, Amélie Bonnefond

Individuals with obesity caused by biallelic pathogenic LEPR (leptin receptor) variants can benefit from setmelanotide, the novel MC4R agonist. An ongoing phase 3 clinical trial (NCT05093634) includes individuals with obesity who carry a heterozygous LEPR variant, although the obesogenic impact of these variants remains incompletely evaluated. The aim of this study was to functionally assess heterozygous variants in LEPR and to evaluate their effect on obesity. We sequenced LEPR in ∼10,000 participants from the French RaDiO study. We found 86 rare heterozygous variants. Each identified variant was then investigated in vitro using luciferase and western blot assays. Using the criteria of the American College of Medical Genetics and Genomics (ACMG), including the strong criterion related to functional assays, we found 12 pathogenic LEPR variants. Most heterozygotes did not present with obesity, and we found no association between these pathogenic variants and body mass index (BMI). This lack of association between pathogenic LEPR variants and obesity risk or BMI was confirmed using exome data from 200,000 individuals in the UK Biobank. In the literature, among 55 reported heterozygotes for of a rare pathogenic LEPR variant, only 27% had obesity. In conclusion, monoallelic pathogenic LEPR variants were functionally tested, and they do not elevate the risk of obesity or BMI levels. This raises questions about the use of setmelanotide, a costly drug with potential side effects, based solely on the presence of a heterozygous LEPR variant.

由双倍性致病性 LEPR(瘦素受体)变体引起的肥胖症患者可以从新型 MC4R 激动剂 setmelanotide 中获益。目前正在进行的一项三期临床试验(NCT05093634)包括了携带杂合子LEPR变异体的肥胖症患者,但这些变异体对肥胖症的影响仍未得到全面评估。本研究旨在对 LEPR 杂合变异进行功能评估,并评估其对肥胖的影响。我们对法国 RaDiO 研究中 10,000 名参与者的 LEPR 进行了测序。我们发现了 86 个罕见的杂合变异体。然后,我们使用荧光素酶和 Western 印迹检测法对每个已确定的变异体进行了体外研究。根据美国医学遗传学和基因组学学会(ACMG)的标准,包括与功能检测相关的严格标准,我们发现了 12 个致病性 LEPR 变异。大多数杂合子并不伴有肥胖症,我们也没有发现这些致病变异与体重指数(BMI)之间存在关联。这种致病性 LEPR 变体与肥胖风险或体重指数之间缺乏关联的情况通过英国生物库中 20 万人的外显子组数据得到了证实。在文献中,55 个报告的罕见致病性 LEPR 变异的杂合子中,只有 27% 患有肥胖症。总之,对单等位致病性 LEPR 变体进行了功能测试,它们不会提高肥胖风险或 BMI 水平。这就对仅凭存在杂合 LEPR 变异就使用具有潜在副作用的昂贵药物 Setmelanotide 提出了质疑。
{"title":"Monoallelic pathogenic variants in LEPR do not cause obesity.","authors":"Jérôme Delplanque, Lauriane Le Collen, Hélène Loiselle, Audrey Leloire, Bénédicte Toussaint, Emmanuel Vaillant, Guillaume Charpentier, Sylvia Franc, Beverley Balkau, Michel Marre, Emma Henriques, Emmanuel Buse Falay, Mehdi Derhourhi, Philippe Froguel, Amélie Bonnefond","doi":"10.1016/j.ajhg.2024.10.014","DOIUrl":"10.1016/j.ajhg.2024.10.014","url":null,"abstract":"<p><p>Individuals with obesity caused by biallelic pathogenic LEPR (leptin receptor) variants can benefit from setmelanotide, the novel MC4R agonist. An ongoing phase 3 clinical trial (NCT05093634) includes individuals with obesity who carry a heterozygous LEPR variant, although the obesogenic impact of these variants remains incompletely evaluated. The aim of this study was to functionally assess heterozygous variants in LEPR and to evaluate their effect on obesity. We sequenced LEPR in ∼10,000 participants from the French RaDiO study. We found 86 rare heterozygous variants. Each identified variant was then investigated in vitro using luciferase and western blot assays. Using the criteria of the American College of Medical Genetics and Genomics (ACMG), including the strong criterion related to functional assays, we found 12 pathogenic LEPR variants. Most heterozygotes did not present with obesity, and we found no association between these pathogenic variants and body mass index (BMI). This lack of association between pathogenic LEPR variants and obesity risk or BMI was confirmed using exome data from 200,000 individuals in the UK Biobank. In the literature, among 55 reported heterozygotes for of a rare pathogenic LEPR variant, only 27% had obesity. In conclusion, monoallelic pathogenic LEPR variants were functionally tested, and they do not elevate the risk of obesity or BMI levels. This raises questions about the use of setmelanotide, a costly drug with potential side effects, based solely on the presence of a heterozygous LEPR variant.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2668-2674"},"PeriodicalIF":8.1,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639077/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142674907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inherited infertility: Mapping loci associated with impaired female reproduction. 遗传性不孕症:绘制与女性生殖能力受损有关的基因座图谱
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-12-05 Epub Date: 2024-11-19 DOI: 10.1016/j.ajhg.2024.10.018
Sanni Ruotsalainen, Juha Karjalainen, Mitja Kurki, Elisa Lahtela, Matti Pirinen, Juha Riikonen, Jarmo Ritari, Silja Tammi, Jukka Partanen, Hannele Laivuori, Aarno Palotie, Henrike Heyne, Mark Daly, Elisabeth Widen

Female infertility is a common and complex health problem affecting millions of women worldwide. While multiple factors can contribute to this condition, the underlying cause remains elusive in up to 15%-30% of affected individuals. In our large genome-wide association study (GWAS) of 22,849 women with infertility and 198,989 control individuals from the Finnish population cohort FinnGen, we unveil a landscape of genetic factors associated with the disorder. Our recessive analysis identified a low-frequency stop-gained mutation in TATA-box binding protein-like 2 (TBPL2; c.895A>T [p.Arg299Ter]; minor-allele frequency [MAF] = 1.2%) with an impact comparable to highly penetrant monogenic mutations (odds ratio [OR] = 650, p = 4.1 × 10-25). While previous studies have linked the orthologous gene to anovulation and sterility in knockout mice, the severe consequence of the p.Arg299Ter variant was evidenced by individuals carrying two copies of that variant having significantly fewer offspring (average of 0.16) compared to women belonging to the other genotype groups (average of 1.75 offspring, p = 1.4 × 10-15). Notably, all homozygous women who had given birth had received infertility therapy. Moreover, our age-stratified analyses identified three additional genome-wide significant loci. Two loci were associated with early-onset infertility (diagnosed before age 30), located near CHEK2 and within the major histocompatibility complex (MHC) region. The third locus, associated with late-onset infertility, had its lead SNP located in an intron of a long non-coding RNA (lncRNA) gene. Taken together, our data highlight the significance of rare recessive alleles in shaping female infertility risk. The results further provide evidence supporting specific age-dependent mechanisms underlying this complex disorder.

女性不孕是一个常见而复杂的健康问题,影响着全球数百万女性。虽然有多种因素可导致这种情况,但高达 15%-30%的患者仍无法找到根本原因。我们对来自芬兰人群队列 FinnGen 中的 22,849 名不孕症女性和 198,989 名对照个体进行了大型全基因组关联研究(GWAS),揭示了与该疾病相关的遗传因素。我们的隐性分析发现了TATA盒结合蛋白样2(TBPL2;c.895A>T [p.Arg299Ter];小等位基因频率[MAF] = 1.2%)中的一个低频停止-获得突变,其影响与高渗透性单基因突变相当(几率比[OR] = 650,p = 4.1 × 10-25)。尽管之前的研究已将该同源基因与基因敲除小鼠的无排卵和不育症联系起来,但 p.Arg299Ter 变异的严重后果表现在,与其他基因型组的女性(平均 1.75 个后代,p = 1.4 × 10-15)相比,携带两个该变异拷贝的个体的后代数量明显较少(平均 0.16 个)。值得注意的是,所有生育过孩子的同型女性都接受过不孕症治疗。此外,我们的年龄分层分析还发现了另外三个全基因组重要位点。其中两个基因位点与早发不孕症(30 岁前确诊)有关,位于 CHEK2 附近和主要组织相容性复合体(MHC)区域内。第三个位点与晚发不孕症有关,其前导 SNP 位于长非编码 RNA(lncRNA)基因的一个内含子中。综上所述,我们的数据凸显了罕见隐性等位基因在影响女性不孕风险方面的重要性。这些结果进一步提供了证据,支持这一复杂疾病的特定年龄依赖机制。
{"title":"Inherited infertility: Mapping loci associated with impaired female reproduction.","authors":"Sanni Ruotsalainen, Juha Karjalainen, Mitja Kurki, Elisa Lahtela, Matti Pirinen, Juha Riikonen, Jarmo Ritari, Silja Tammi, Jukka Partanen, Hannele Laivuori, Aarno Palotie, Henrike Heyne, Mark Daly, Elisabeth Widen","doi":"10.1016/j.ajhg.2024.10.018","DOIUrl":"10.1016/j.ajhg.2024.10.018","url":null,"abstract":"<p><p>Female infertility is a common and complex health problem affecting millions of women worldwide. While multiple factors can contribute to this condition, the underlying cause remains elusive in up to 15%-30% of affected individuals. In our large genome-wide association study (GWAS) of 22,849 women with infertility and 198,989 control individuals from the Finnish population cohort FinnGen, we unveil a landscape of genetic factors associated with the disorder. Our recessive analysis identified a low-frequency stop-gained mutation in TATA-box binding protein-like 2 (TBPL2; c.895A>T [p.Arg299Ter]; minor-allele frequency [MAF] = 1.2%) with an impact comparable to highly penetrant monogenic mutations (odds ratio [OR] = 650, p = 4.1 × 10<sup>-25</sup>). While previous studies have linked the orthologous gene to anovulation and sterility in knockout mice, the severe consequence of the p.Arg299Ter variant was evidenced by individuals carrying two copies of that variant having significantly fewer offspring (average of 0.16) compared to women belonging to the other genotype groups (average of 1.75 offspring, p = 1.4 × 10<sup>-15</sup>). Notably, all homozygous women who had given birth had received infertility therapy. Moreover, our age-stratified analyses identified three additional genome-wide significant loci. Two loci were associated with early-onset infertility (diagnosed before age 30), located near CHEK2 and within the major histocompatibility complex (MHC) region. The third locus, associated with late-onset infertility, had its lead SNP located in an intron of a long non-coding RNA (lncRNA) gene. Taken together, our data highlight the significance of rare recessive alleles in shaping female infertility risk. The results further provide evidence supporting specific age-dependent mechanisms underlying this complex disorder.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2789-2798"},"PeriodicalIF":8.1,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639076/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142680622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative analysis of predicted DNA secondary structures infers complex human centromere topology. 对预测的 DNA 二级结构进行比较分析,推断出复杂的人类中心粒拓扑结构。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-12-05 Epub Date: 2024-11-18 DOI: 10.1016/j.ajhg.2024.10.016
Sai Swaroop Chittoor, Simona Giunta

Secondary structures are non-canonical arrangements of nucleic acids due to intra-strand interactions, including base pairing, stacking, or other higher-order features that deviate from the standard double-helical conformation. While these structures are extensively studied in RNA, they can also form when DNA becomes single stranded, creating topological roadblocks that can impact essential DNA-based processes such as replication, transcription, and repair, ultimately affecting genome stability. The availability of a complete linear sequence of human genomes, including repetitive loci, enables the prediction of DNA secondary structures comparing across various regions. Here, we evaluate the intrinsic properties of linear single-stranded DNA sequences derived from sampling specialized human loci such as centromeres, pericentromeres, ribosomal DNA (rDNA), and coding regions from the CHM13 genome. Our comparative analysis of predicted secondary structures across human chromosomes revealed the heightened presence, complexity, and instability of secondary structures within the centromere, which gradually decreased toward the pericentromere onto chromosomes' arms, on average lowest in coding regions. Notably, centromeric repeats exhibited the highest level of topological complexity within both the active and divergent domains, even when compared to other repetitive tandem satellites, such as rDNA in acrocentric chromosomes. Our findings provide evidence of the intrinsic self-hybridizing properties of centromere repeats, which are capable of generating complex topological structures that may functionally correlate with chromosome missegregation, especially when centromeric chromatin is disrupted. Processes such as long non-coding RNA transcription, recombination, and other mechanisms that dechromatinize and unwind stretches of linear DNA in these regions create in vivo opportunities for the DNA acrobatics hereby predicted.

二级结构是核酸因链内相互作用(包括碱基配对、堆叠或其他偏离标准双螺旋构象的高阶特征)而形成的非规范排列。虽然这些结构在 RNA 中被广泛研究,但当 DNA 变为单链时也会形成这些结构,从而产生拓扑路障,影响以 DNA 为基础的基本过程,如复制、转录和修复,最终影响基因组的稳定性。人类基因组包括重复位点在内的完整线性序列的出现,使得对不同区域的 DNA 二级结构进行比较预测成为可能。在这里,我们评估了线性单链 DNA 序列的内在特性,这些序列来自于取样专门的人类基因座,如中心粒、周中心粒、核糖体 DNA (rDNA) 和 CHM13 基因组的编码区。我们对人类染色体上预测的二级结构进行了比较分析,发现中心粒内二级结构的存在性、复杂性和不稳定性都很高,向染色体臂的近中心粒方向逐渐降低,平均而言,编码区的二级结构最低。值得注意的是,即使与其他重复串联卫星(如非中心染色体中的 rDNA)相比,中心粒重复序列在活跃域和发散域内都表现出最高的拓扑复杂性。我们的研究结果提供了中心粒重复序列内在自杂交特性的证据,它能够产生复杂的拓扑结构,在功能上可能与染色体错分离相关,尤其是当中心粒染色质被破坏时。长非编码 RNA 转录、重组等过程,以及其他使这些区域的线性 DNA 片段脱染色质和解旋的机制,为此处预测的 DNA 杂技表演创造了活体机会。
{"title":"Comparative analysis of predicted DNA secondary structures infers complex human centromere topology.","authors":"Sai Swaroop Chittoor, Simona Giunta","doi":"10.1016/j.ajhg.2024.10.016","DOIUrl":"10.1016/j.ajhg.2024.10.016","url":null,"abstract":"<p><p>Secondary structures are non-canonical arrangements of nucleic acids due to intra-strand interactions, including base pairing, stacking, or other higher-order features that deviate from the standard double-helical conformation. While these structures are extensively studied in RNA, they can also form when DNA becomes single stranded, creating topological roadblocks that can impact essential DNA-based processes such as replication, transcription, and repair, ultimately affecting genome stability. The availability of a complete linear sequence of human genomes, including repetitive loci, enables the prediction of DNA secondary structures comparing across various regions. Here, we evaluate the intrinsic properties of linear single-stranded DNA sequences derived from sampling specialized human loci such as centromeres, pericentromeres, ribosomal DNA (rDNA), and coding regions from the CHM13 genome. Our comparative analysis of predicted secondary structures across human chromosomes revealed the heightened presence, complexity, and instability of secondary structures within the centromere, which gradually decreased toward the pericentromere onto chromosomes' arms, on average lowest in coding regions. Notably, centromeric repeats exhibited the highest level of topological complexity within both the active and divergent domains, even when compared to other repetitive tandem satellites, such as rDNA in acrocentric chromosomes. Our findings provide evidence of the intrinsic self-hybridizing properties of centromere repeats, which are capable of generating complex topological structures that may functionally correlate with chromosome missegregation, especially when centromeric chromatin is disrupted. Processes such as long non-coding RNA transcription, recombination, and other mechanisms that dechromatinize and unwind stretches of linear DNA in these regions create in vivo opportunities for the DNA acrobatics hereby predicted.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2707-2719"},"PeriodicalIF":8.1,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639080/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142674906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genomic medicine year in review: 2024.
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-12-05 DOI: 10.1016/j.ajhg.2024.11.002
Teri A Manolio, Jahnavi Narula, Alauna Rupert, Carol J Bult, Rex L Chisholm, Geoffrey S Ginsburg, Eric D Green, Gillian Hooker, Gail P Jarvik, George A Mensah, Erin M Ramos, Dan M Roden, Robb Rowley, Casey Overby Taylor, Marc S Williams
{"title":"Genomic medicine year in review: 2024.","authors":"Teri A Manolio, Jahnavi Narula, Alauna Rupert, Carol J Bult, Rex L Chisholm, Geoffrey S Ginsburg, Eric D Green, Gillian Hooker, Gail P Jarvik, George A Mensah, Erin M Ramos, Dan M Roden, Robb Rowley, Casey Overby Taylor, Marc S Williams","doi":"10.1016/j.ajhg.2024.11.002","DOIUrl":"10.1016/j.ajhg.2024.11.002","url":null,"abstract":"","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":"111 12","pages":"2585-2588"},"PeriodicalIF":8.1,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639092/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142790957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prequalification of genome-based newborn screening for severe childhood genetic diseases through federated training based on purifying hyperselection.
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-12-05 DOI: 10.1016/j.ajhg.2024.10.021
Stephen F Kingsmore, Meredith Wright, Laurie D Smith, Yupu Liang, William R Mowrey, Liana Protopsaltis, Matthew Bainbridge, Mei Baker, Sergey Batalov, Eric Blincow, Bryant Cao, Sara Caylor, Christina Chambers, Katarzyna Ellsworth, Annette Feigenbaum, Erwin Frise, Lucia Guidugli, Kevin P Hall, Christian Hansen, Mark Kiel, Lucita Van Der Kraan, Chad Krilow, Hugh Kwon, Lakshminarasimha Madhavrao, Sebastien Lefebvre, Jeremy Leipzig, Rebecca Mardach, Barry Moore, Danny Oh, Lauren Olsen, Eric Ontiveros, Mallory J Owen, Rebecca Reimers, Gunter Scharer, Jennifer Schleit, Seth Shelnutt, Shyamal S Mehtalia, Albert Oriol, Erica Sanford, Steve Schwartz, Kristen Wigby, Mary J Willis, Mark Yandell, Chris M Kunard, Thomas Defay

Genome-sequence-based newborn screening (gNBS) has substantial potential to improve outcomes in hundreds of severe childhood genetic disorders (SCGDs). However, a major impediment to gNBS is imprecision due to variants classified as pathogenic (P) or likely pathogenic (LP) that are not SCGD causal. gNBS with 53,855 P/LP variants, 342 genes, 412 SCGDs, and 1,603 therapies was positive in 74% of UK Biobank (UKB470K) adults, suggesting 97% false positives. We used the phenomenon of purifying hyperselection, which acts to decrease the frequency of SCGD causal diplotypes, to reduce false positives. Training of gene-disease-inheritance mode-diplotype tetrads in 618,290 control and affected subjects identified 293 variants or haplotypes and seven genes with variable inheritance contributing higher positive diplotype counts than consistent with purifying hyperselection and with little or no evidence of SCGD causality. With these changes, 2.0% of UKB470K adults were positive. In contrast, gNBS was positive in 7.2% of 3,118 critically ill children with suspected SCGDs and 7.9% of 705 infant deaths. When compared with rapid diagnostic genome sequencing (RDGS), gNBS had 99.1% recall. In eight true-positive children, gNBS was projected to decrease time to diagnosis by a median of 121 days and avoid life-threatening disease presentations in four children, organ damage in six children, ∼$1.25 million in healthcare cost, and ten (1.4%) infant deaths. Federated training predicated on purifying hyperselection provides a general framework to attain high precision in population screening. Federated training across many biobanks and clinical trials can provide a privacy-preserving mechanism for qualification of gNBS in diverse genetic ancestries.

{"title":"Prequalification of genome-based newborn screening for severe childhood genetic diseases through federated training based on purifying hyperselection.","authors":"Stephen F Kingsmore, Meredith Wright, Laurie D Smith, Yupu Liang, William R Mowrey, Liana Protopsaltis, Matthew Bainbridge, Mei Baker, Sergey Batalov, Eric Blincow, Bryant Cao, Sara Caylor, Christina Chambers, Katarzyna Ellsworth, Annette Feigenbaum, Erwin Frise, Lucia Guidugli, Kevin P Hall, Christian Hansen, Mark Kiel, Lucita Van Der Kraan, Chad Krilow, Hugh Kwon, Lakshminarasimha Madhavrao, Sebastien Lefebvre, Jeremy Leipzig, Rebecca Mardach, Barry Moore, Danny Oh, Lauren Olsen, Eric Ontiveros, Mallory J Owen, Rebecca Reimers, Gunter Scharer, Jennifer Schleit, Seth Shelnutt, Shyamal S Mehtalia, Albert Oriol, Erica Sanford, Steve Schwartz, Kristen Wigby, Mary J Willis, Mark Yandell, Chris M Kunard, Thomas Defay","doi":"10.1016/j.ajhg.2024.10.021","DOIUrl":"10.1016/j.ajhg.2024.10.021","url":null,"abstract":"<p><p>Genome-sequence-based newborn screening (gNBS) has substantial potential to improve outcomes in hundreds of severe childhood genetic disorders (SCGDs). However, a major impediment to gNBS is imprecision due to variants classified as pathogenic (P) or likely pathogenic (LP) that are not SCGD causal. gNBS with 53,855 P/LP variants, 342 genes, 412 SCGDs, and 1,603 therapies was positive in 74% of UK Biobank (UKB470K) adults, suggesting 97% false positives. We used the phenomenon of purifying hyperselection, which acts to decrease the frequency of SCGD causal diplotypes, to reduce false positives. Training of gene-disease-inheritance mode-diplotype tetrads in 618,290 control and affected subjects identified 293 variants or haplotypes and seven genes with variable inheritance contributing higher positive diplotype counts than consistent with purifying hyperselection and with little or no evidence of SCGD causality. With these changes, 2.0% of UKB470K adults were positive. In contrast, gNBS was positive in 7.2% of 3,118 critically ill children with suspected SCGDs and 7.9% of 705 infant deaths. When compared with rapid diagnostic genome sequencing (RDGS), gNBS had 99.1% recall. In eight true-positive children, gNBS was projected to decrease time to diagnosis by a median of 121 days and avoid life-threatening disease presentations in four children, organ damage in six children, ∼$1.25 million in healthcare cost, and ten (1.4%) infant deaths. Federated training predicated on purifying hyperselection provides a general framework to attain high precision in population screening. Federated training across many biobanks and clinical trials can provide a privacy-preserving mechanism for qualification of gNBS in diverse genetic ancestries.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":"111 12","pages":"2618-2642"},"PeriodicalIF":8.1,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639087/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142790961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GPS-Net: Discovering prognostic pathway modules based on network regularized kernel learning. GPS-Net:基于网络正则化核学习发现预后路径模块
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-12-05 Epub Date: 2024-11-06 DOI: 10.1016/j.ajhg.2024.10.004
Sijie Yao, Kaiqiao Li, Tingyi Li, Xiaoqing Yu, Pei Fen Kuan, Xuefeng Wang

The search for prognostic biomarkers capable of predicting patient outcomes, by analyzing gene expression in tissue samples and other molecular profiles, remains largely focused on single-gene-based or global-gene-search approaches. Gene-centric approaches, while foundational, fail to capture the higher-order dependencies that reflect the activities of co-regulated processes, pathway alterations, and regulatory networks, all of which are crucial in determining the patient outcomes in complex diseases like cancer. Here, we introduce GPS-Net, a computational framework that fills the gap in efficiently identifying prognostic modules by incorporating the holistic pathway structures and the network of gene interactions. By innovatively incorporating advanced multiple kernel learning techniques and network-based regularization, the proposed method not only enhances the accuracy of biomarker and pathway identification but also significantly reduces computational complexity, as demonstrated by extensive simulation studies. Applying GPS-Net, we identified key pathways that are predictive of patient outcomes in a cancer immunotherapy study. Overall, our approach provides a novel framework that renders genome-wide pathway-level prognostic analysis both feasible and scalable, synergizing both mechanism-driven and data-driven methodologies for precision genomics.

通过分析组织样本中的基因表达和其他分子特征,寻找能够预测患者预后的预后生物标志物,在很大程度上仍然集中在基于单基因或全局基因的研究方法上。以基因为中心的方法虽然具有基础性,但却无法捕捉到反映共调过程、通路改变和调控网络活动的高阶依赖性,而所有这些都是决定癌症等复杂疾病患者预后的关键。在此,我们介绍一种计算框架 GPS-Net,该框架通过整合整体通路结构和基因相互作用网络,填补了有效识别预后模块的空白。通过创新性地结合先进的多核学习技术和基于网络的正则化,所提出的方法不仅提高了生物标记物和通路识别的准确性,还显著降低了计算复杂性,这一点已在大量的模拟研究中得到证实。应用 GPS-Net,我们在一项癌症免疫疗法研究中确定了可预测患者预后的关键通路。总之,我们的方法提供了一个新颖的框架,使全基因组通路级预后分析变得既可行又可扩展,为精准基因组学的机制驱动和数据驱动方法提供了协同效应。
{"title":"GPS-Net: Discovering prognostic pathway modules based on network regularized kernel learning.","authors":"Sijie Yao, Kaiqiao Li, Tingyi Li, Xiaoqing Yu, Pei Fen Kuan, Xuefeng Wang","doi":"10.1016/j.ajhg.2024.10.004","DOIUrl":"10.1016/j.ajhg.2024.10.004","url":null,"abstract":"<p><p>The search for prognostic biomarkers capable of predicting patient outcomes, by analyzing gene expression in tissue samples and other molecular profiles, remains largely focused on single-gene-based or global-gene-search approaches. Gene-centric approaches, while foundational, fail to capture the higher-order dependencies that reflect the activities of co-regulated processes, pathway alterations, and regulatory networks, all of which are crucial in determining the patient outcomes in complex diseases like cancer. Here, we introduce GPS-Net, a computational framework that fills the gap in efficiently identifying prognostic modules by incorporating the holistic pathway structures and the network of gene interactions. By innovatively incorporating advanced multiple kernel learning techniques and network-based regularization, the proposed method not only enhances the accuracy of biomarker and pathway identification but also significantly reduces computational complexity, as demonstrated by extensive simulation studies. Applying GPS-Net, we identified key pathways that are predictive of patient outcomes in a cancer immunotherapy study. Overall, our approach provides a novel framework that renders genome-wide pathway-level prognostic analysis both feasible and scalable, synergizing both mechanism-driven and data-driven methodologies for precision genomics.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2826-2838"},"PeriodicalIF":8.1,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639089/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142602781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Local genetic correlation via knockoffs reduces confounding due to cross-trait assortative mating. 通过 "山寨 "产生的局部遗传相关性减少了跨性状同配所造成的混杂。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-12-05 Epub Date: 2024-11-14 DOI: 10.1016/j.ajhg.2024.10.012
Shiyang Ma, Fan Wang, Richard Border, Joseph Buxbaum, Noah Zaitlen, Iuliana Ionita-Laza

Local genetic correlation analysis is an important tool for identifying genetic loci with shared biology across traits. Recently, Border et al. have shown that the results of these analyses are confounded by cross-trait assortative mating (xAM), leading to many false-positive findings. Here, we describe LAVA-Knock, a local genetic correlation method that builds off an existing genetic correlation method, LAVA, and augments it by generating synthetic data in a way that preserves local and long-range linkage disequilibrium (LD), allowing us to reduce the confounding induced by xAM. We show in simulations based on a realistic xAM model and in genome-wide association study (GWAS) applications for 630 trait pairs that LAVA-Knock can greatly reduce the bias due to xAM relative to LAVA. Furthermore, we show a significant positive correlation between the reduction in local genetic correlations and estimates in the literature of cross-mate phenotype correlations; in particular, pairs of traits that are known to have high cross-mate phenotype correlation values have a significantly higher reduction in the number of local genetic correlations compared with other trait pairs. A few representative examples include education and intelligence, education and alcohol consumption, and attention-deficit hyperactivity disorder and depression. These results suggest that LAVA-Knock can reduce confounding due to both short-range LD and long-range LD induced by xAM.

局部遗传相关性分析是确定具有跨性状共同生物学特性的遗传位点的重要工具。最近,Border 等人的研究表明,这些分析的结果会受到跨性状同配(xAM)的干扰,从而导致许多假阳性结果。在这里,我们介绍一种局部遗传相关方法 LAVA-Knock,它以现有的遗传相关方法 LAVA 为基础,并通过生成合成数据的方式对其进行增强,从而保留局部和长程连锁不平衡(LD),使我们能够减少 xAM 引起的混杂。我们在基于现实 xAM 模型的模拟和针对 630 个性状对的全基因组关联研究(GWAS)应用中表明,相对于 LAVA,LAVA-Knock 能大大减少 xAM 带来的偏差。此外,我们还发现,局部遗传相关性的降低与文献中对跨配偶表型相关性的估计之间存在显著的正相关;特别是,与其他性状对相比,已知具有较高跨配偶表型相关性值的性状对的局部遗传相关性数量的降低幅度明显更高。一些有代表性的例子包括教育与智力、教育与饮酒、注意力缺陷多动障碍与抑郁。这些结果表明,LAVA-Knock 可以减少由 xAM 引起的短程 LD 和长程 LD 所造成的混杂。
{"title":"Local genetic correlation via knockoffs reduces confounding due to cross-trait assortative mating.","authors":"Shiyang Ma, Fan Wang, Richard Border, Joseph Buxbaum, Noah Zaitlen, Iuliana Ionita-Laza","doi":"10.1016/j.ajhg.2024.10.012","DOIUrl":"10.1016/j.ajhg.2024.10.012","url":null,"abstract":"<p><p>Local genetic correlation analysis is an important tool for identifying genetic loci with shared biology across traits. Recently, Border et al. have shown that the results of these analyses are confounded by cross-trait assortative mating (xAM), leading to many false-positive findings. Here, we describe LAVA-Knock, a local genetic correlation method that builds off an existing genetic correlation method, LAVA, and augments it by generating synthetic data in a way that preserves local and long-range linkage disequilibrium (LD), allowing us to reduce the confounding induced by xAM. We show in simulations based on a realistic xAM model and in genome-wide association study (GWAS) applications for 630 trait pairs that LAVA-Knock can greatly reduce the bias due to xAM relative to LAVA. Furthermore, we show a significant positive correlation between the reduction in local genetic correlations and estimates in the literature of cross-mate phenotype correlations; in particular, pairs of traits that are known to have high cross-mate phenotype correlation values have a significantly higher reduction in the number of local genetic correlations compared with other trait pairs. A few representative examples include education and intelligence, education and alcohol consumption, and attention-deficit hyperactivity disorder and depression. These results suggest that LAVA-Knock can reduce confounding due to both short-range LD and long-range LD induced by xAM.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2839-2848"},"PeriodicalIF":8.1,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639086/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142638545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
American journal of human genetics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1