Sources of gene expression variation in a globally diverse human cohort

IF 50.5 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Nature Pub Date : 2024-07-17 DOI:10.1038/s41586-024-07708-2
Dylan J. Taylor, Surya B. Chhetri, Michael G. Tassia, Arjun Biddanda, Stephanie M. Yan, Genevieve L. Wojcik, Alexis Battle, Rajiv C. McCoy
{"title":"Sources of gene expression variation in a globally diverse human cohort","authors":"Dylan J. Taylor, Surya B. Chhetri, Michael G. Tassia, Arjun Biddanda, Stephanie M. Yan, Genevieve L. Wojcik, Alexis Battle, Rajiv C. McCoy","doi":"10.1038/s41586-024-07708-2","DOIUrl":null,"url":null,"abstract":"Genetic variation that influences gene expression and splicing is a key source of phenotypic diversity1–5. Although invaluable, studies investigating these links in humans have been strongly biased towards participants of European ancestries, which constrains generalizability and hinders evolutionary research. Here to address these limitations, we developed MAGE, an open-access RNA sequencing dataset of lymphoblastoid cell lines from 731 individuals from the 1000 Genomes Project6, spread across 5 continental groups and 26 populations. Most variation in gene expression (92%) and splicing (95%) was distributed within versus between populations, which mirrored the variation in DNA sequence. We mapped associations between genetic variants and expression and splicing of nearby genes (cis-expression quantitative trait loci (eQTLs) and cis-splicing QTLs (sQTLs), respectively). We identified more than 15,000 putatively causal eQTLs and more than 16,000 putatively causal sQTLs that are enriched for relevant epigenomic signatures. These include 1,310 eQTLs and 1,657 sQTLs that are largely private to underrepresented populations. Our data further indicate that the magnitude and direction of causal eQTL effects are highly consistent across populations. Moreover, the apparent ‘population-specific’ effects observed in previous studies were largely driven by low resolution or additional independent eQTLs of the same genes that were not detected. Together, our study expands our understanding of human gene expression diversity and provides an inclusive resource for studying the evolution and function of human genomes. A new open-access RNA sequencing dataset, MAGE, of 731 individuals across geographically diverse human populations provides a valuable resource to study genetic diversity and evolution and expands the capacity to identify new genetic associations.","PeriodicalId":18787,"journal":{"name":"Nature","volume":null,"pages":null},"PeriodicalIF":50.5000,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11291278/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature","FirstCategoryId":"103","ListUrlMain":"https://www.nature.com/articles/s41586-024-07708-2","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Genetic variation that influences gene expression and splicing is a key source of phenotypic diversity1–5. Although invaluable, studies investigating these links in humans have been strongly biased towards participants of European ancestries, which constrains generalizability and hinders evolutionary research. Here to address these limitations, we developed MAGE, an open-access RNA sequencing dataset of lymphoblastoid cell lines from 731 individuals from the 1000 Genomes Project6, spread across 5 continental groups and 26 populations. Most variation in gene expression (92%) and splicing (95%) was distributed within versus between populations, which mirrored the variation in DNA sequence. We mapped associations between genetic variants and expression and splicing of nearby genes (cis-expression quantitative trait loci (eQTLs) and cis-splicing QTLs (sQTLs), respectively). We identified more than 15,000 putatively causal eQTLs and more than 16,000 putatively causal sQTLs that are enriched for relevant epigenomic signatures. These include 1,310 eQTLs and 1,657 sQTLs that are largely private to underrepresented populations. Our data further indicate that the magnitude and direction of causal eQTL effects are highly consistent across populations. Moreover, the apparent ‘population-specific’ effects observed in previous studies were largely driven by low resolution or additional independent eQTLs of the same genes that were not detected. Together, our study expands our understanding of human gene expression diversity and provides an inclusive resource for studying the evolution and function of human genomes. A new open-access RNA sequencing dataset, MAGE, of 731 individuals across geographically diverse human populations provides a valuable resource to study genetic diversity and evolution and expands the capacity to identify new genetic associations.

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
全球不同人群基因表达变异的来源。
影响基因表达和剪接的遗传变异是表型多样性的一个重要来源1-5。尽管这些研究很有价值,但调查人类中这些联系的研究却严重偏向于欧洲血统的参与者,这限制了研究的普遍性并阻碍了进化研究。为了解决这些局限性,我们开发了 MAGE,这是一个开放存取的 RNA 测序数据集,来自 1000 基因组计划6 的 731 个个体的淋巴母细胞系,分布在 5 个大陆组和 26 个人群中。基因表达(92%)和剪接(95%)的大多数变异分布在人群内部和人群之间,这反映了 DNA 序列的变异。我们绘制了遗传变异与附近基因的表达和剪接之间的关联图(分别为顺式表达数量性状位点(eQTLs)和顺式剪接 QTLs(sQTLs))。我们发现了超过 15,000 个推测为因果关系的 eQTL 和超过 16,000 个推测为因果关系的 sQTL,它们富集了相关的表观基因组特征。其中包括 1,310 个 eQTL 和 1,657 个 sQTL,它们在很大程度上不属于代表性不足的人群。我们的数据进一步表明,eQTL 的因果效应的大小和方向在不同人群中高度一致。此外,以往研究中观察到的明显的 "种群特异性 "效应在很大程度上是由低分辨率或未检测到的相同基因的额外独立 eQTLs 驱动的。总之,我们的研究拓展了我们对人类基因表达多样性的理解,为研究人类基因组的进化和功能提供了一个包容性资源。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Nature
Nature 综合性期刊-综合性期刊
CiteScore
90.00
自引率
1.20%
发文量
3652
审稿时长
3 months
期刊介绍: Nature is a prestigious international journal that publishes peer-reviewed research in various scientific and technological fields. The selection of articles is based on criteria such as originality, importance, interdisciplinary relevance, timeliness, accessibility, elegance, and surprising conclusions. In addition to showcasing significant scientific advances, Nature delivers rapid, authoritative, insightful news, and interpretation of current and upcoming trends impacting science, scientists, and the broader public. The journal serves a dual purpose: firstly, to promptly share noteworthy scientific advances and foster discussions among scientists, and secondly, to ensure the swift dissemination of scientific results globally, emphasizing their significance for knowledge, culture, and daily life.
期刊最新文献
The Amazon’s gargantuan gardeners: manatees Publisher Correction: Single-crystalline metal-oxide dielectrics for top-gate 2D transistors The baseless stat that could be harming Indigenous conservation efforts Cough or sneeze? How the brain knows what to unleash Fur farming a ‘viral highway’ that could spark next pandemic, say scientists
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1