Rare coding variant analysis for human diseases across biobanks and ancestries

IF 31.7 1区 生物学 Q1 GENETICS & HEREDITY Nature genetics Pub Date : 2024-08-29 DOI:10.1038/s41588-024-01894-5
Sean J. Jurgens, Xin Wang, Seung Hoan Choi, Lu-Chen Weng, Satoshi Koyama, James P. Pirruccello, Trang Nguyen, Patrick Smadbeck, Dongkeun Jang, Mark Chaffin, Roddy Walsh, Carolina Roselli, Amanda L. Elliott, Leonoor F. J. M. Wijdeveld, Kiran J. Biddinger, Shinwan Kany, Joel T. Rämö, Pradeep Natarajan, Krishna G. Aragam, Jason Flannick, Noël P. Burtt, Connie R. Bezzina, Steven A. Lubitz, Kathryn L. Lunetta, Patrick T. Ellinor
{"title":"Rare coding variant analysis for human diseases across biobanks and ancestries","authors":"Sean J. Jurgens, Xin Wang, Seung Hoan Choi, Lu-Chen Weng, Satoshi Koyama, James P. Pirruccello, Trang Nguyen, Patrick Smadbeck, Dongkeun Jang, Mark Chaffin, Roddy Walsh, Carolina Roselli, Amanda L. Elliott, Leonoor F. J. M. Wijdeveld, Kiran J. Biddinger, Shinwan Kany, Joel T. Rämö, Pradeep Natarajan, Krishna G. Aragam, Jason Flannick, Noël P. Burtt, Connie R. Bezzina, Steven A. Lubitz, Kathryn L. Lunetta, Patrick T. Ellinor","doi":"10.1038/s41588-024-01894-5","DOIUrl":null,"url":null,"abstract":"Large-scale sequencing has enabled unparalleled opportunities to investigate the role of rare coding variation in human phenotypic variability. Here, we present a pan-ancestry analysis of sequencing data from three large biobanks, including the All of Us research program. Using mixed-effects models, we performed gene-based rare variant testing for 601 diseases across 748,879 individuals, including 155,236 with ancestry dissimilar to European. We identified 363 significant associations, which highlighted core genes for the human disease phenome and identified potential novel associations, including UBR3 for cardiometabolic disease and YLPM1 for psychiatric disease. Pan-ancestry burden testing represented an inclusive and useful approach for discovery in diverse datasets, although we also highlight the importance of ancestry-specific sensitivity analyses in this setting. Finally, we found that effect sizes for rare protein-disrupting variants were concordant between samples similar to European ancestry and other genetic ancestries (βDeming = 0.7–1.0). Our results have implications for multi-ancestry and cross-biobank approaches in sequencing association studies for human disease. Gene-based rare variant analyses for 601 diseases across 748,879 individuals from three biobanks identify 363 significant associations and highlight important considerations for multi-ancestry and cross-biobank sequencing studies.","PeriodicalId":18985,"journal":{"name":"Nature genetics","volume":"56 9","pages":"1811-1820"},"PeriodicalIF":31.7000,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature genetics","FirstCategoryId":"99","ListUrlMain":"https://www.nature.com/articles/s41588-024-01894-5","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Large-scale sequencing has enabled unparalleled opportunities to investigate the role of rare coding variation in human phenotypic variability. Here, we present a pan-ancestry analysis of sequencing data from three large biobanks, including the All of Us research program. Using mixed-effects models, we performed gene-based rare variant testing for 601 diseases across 748,879 individuals, including 155,236 with ancestry dissimilar to European. We identified 363 significant associations, which highlighted core genes for the human disease phenome and identified potential novel associations, including UBR3 for cardiometabolic disease and YLPM1 for psychiatric disease. Pan-ancestry burden testing represented an inclusive and useful approach for discovery in diverse datasets, although we also highlight the importance of ancestry-specific sensitivity analyses in this setting. Finally, we found that effect sizes for rare protein-disrupting variants were concordant between samples similar to European ancestry and other genetic ancestries (βDeming = 0.7–1.0). Our results have implications for multi-ancestry and cross-biobank approaches in sequencing association studies for human disease. Gene-based rare variant analyses for 601 diseases across 748,879 individuals from three biobanks identify 363 significant associations and highlight important considerations for multi-ancestry and cross-biobank sequencing studies.

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
跨生物库和祖先的人类疾病罕见编码变异分析
大规模测序为研究罕见编码变异在人类表型变异中的作用提供了无与伦比的机会。在此,我们对来自三个大型生物库(包括 "我们所有人 "研究计划)的测序数据进行了泛种群分析。利用混合效应模型,我们对 748,879 人的 601 种疾病进行了基于基因的罕见变异测试,其中包括 155,236 名祖先与欧洲人不同的个体。我们发现了 363 个重大关联,突出了人类疾病表型的核心基因,并发现了潜在的新型关联,包括与心脏代谢疾病相关的 UBR3 和与精神疾病相关的 YLPM1。尽管我们也强调了在这种情况下进行特定祖先敏感性分析的重要性,但泛祖先负担测试是在不同数据集中进行发现的一种包容而有用的方法。最后,我们发现,在类似欧洲血统的样本和其他遗传血统的样本之间,罕见蛋白质干扰变体的效应大小是一致的(βDeming = 0.7-1.0)。我们的研究结果对人类疾病测序关联研究中的多血统和跨生物库方法具有重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Nature genetics
Nature genetics 生物-遗传学
CiteScore
43.00
自引率
2.60%
发文量
241
审稿时长
3 months
期刊介绍: Nature Genetics publishes the very highest quality research in genetics. It encompasses genetic and functional genomic studies on human and plant traits and on other model organisms. Current emphasis is on the genetic basis for common and complex diseases and on the functional mechanism, architecture and evolution of gene networks, studied by experimental perturbation. Integrative genetic topics comprise, but are not limited to: -Genes in the pathology of human disease -Molecular analysis of simple and complex genetic traits -Cancer genetics -Agricultural genomics -Developmental genetics -Regulatory variation in gene expression -Strategies and technologies for extracting function from genomic data -Pharmacological genomics -Genome evolution
期刊最新文献
Genome-wide association analysis provides insights into the molecular etiology of dilated cardiomyopathy Genome-wide association study reveals mechanisms underlying dilated cardiomyopathy and myocardial resilience Author Correction: Brca1 haploinsufficiency promotes early tumor onset and epigenetic alterations in a mouse model of hereditary breast cancer Epigenetic scars of Brca1 loss point toward breast cancer cell of origin A temporal cortex cell atlas highlights gene expression dynamics during human brain maturation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1