Independent expansion, selection and hypervariability of the TBC1D3 gene family in humans

IF 6.2 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Genome research Pub Date : 2024-08-06 DOI:10.1101/gr.279299.124
Xavi Guitart, David Porubsky, DongAhn Yoo, Max L Dougherty, Philip Dishuck, Katherine M. Munson, Alexandra P. Lewis, Kendra Hoekzema, Jordan Knuth, Stephen Chang, Tomi Pastinen, Evan E. Eichler
{"title":"Independent expansion, selection and hypervariability of the TBC1D3 gene family in humans","authors":"Xavi Guitart, David Porubsky, DongAhn Yoo, Max L Dougherty, Philip Dishuck, Katherine M. Munson, Alexandra P. Lewis, Kendra Hoekzema, Jordan Knuth, Stephen Chang, Tomi Pastinen, Evan E. Eichler","doi":"10.1101/gr.279299.124","DOIUrl":null,"url":null,"abstract":"<em>TBC1D3</em> is a primate-specific gene family that has expanded in the human lineage and has been implicated in neuronal progenitor proliferation and expansion of the frontal cortex. The gene family and its expression have been challenging to investigate because it is embedded in high-identity and highly variable segmental duplications. We sequenced and assembled the gene family using long-read sequencing data from 34 humans and 11 non-human primate species. Our analysis shows that this particular gene family has independently duplicated in at least five primate lineages, and the duplicated loci are enriched at sites of large-scale chromosomal rearrangements on Chromosome 17. We find that all human copy number variation maps to two distinct clusters located at Chromosome 17q12 and that humans are highly structurally variable at this locus, differing by as many as 20 copies and ~1 Mbp in length depending on haplotypes. We also show evidence of positive selection, as well as a significant change in the predicted human TBC1D3 protein sequence. Lastly, we find that, despite multiple duplications, human <em>TBC1D3</em> expression is limited to a subset of copies and, most notably, from a single paralog group: <em>TBC1D3-CDKL</em>. These observations may help explain why a gene potentially important in cortical development can be so variable in the human population.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"46 1","pages":""},"PeriodicalIF":6.2000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1101/gr.279299.124","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

TBC1D3 is a primate-specific gene family that has expanded in the human lineage and has been implicated in neuronal progenitor proliferation and expansion of the frontal cortex. The gene family and its expression have been challenging to investigate because it is embedded in high-identity and highly variable segmental duplications. We sequenced and assembled the gene family using long-read sequencing data from 34 humans and 11 non-human primate species. Our analysis shows that this particular gene family has independently duplicated in at least five primate lineages, and the duplicated loci are enriched at sites of large-scale chromosomal rearrangements on Chromosome 17. We find that all human copy number variation maps to two distinct clusters located at Chromosome 17q12 and that humans are highly structurally variable at this locus, differing by as many as 20 copies and ~1 Mbp in length depending on haplotypes. We also show evidence of positive selection, as well as a significant change in the predicted human TBC1D3 protein sequence. Lastly, we find that, despite multiple duplications, human TBC1D3 expression is limited to a subset of copies and, most notably, from a single paralog group: TBC1D3-CDKL. These observations may help explain why a gene potentially important in cortical development can be so variable in the human population.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
人类 TBC1D3 基因家族的独立扩展、选择和高变异性
TBC1D3 是一个灵长类特有的基因家族,它在人类血统中扩展,并与神经元祖细胞的增殖和额叶皮层的扩展有关。该基因家族及其表达一直是研究的难点,因为它包含在高同一性和高度可变的节段重复中。我们利用来自 34 个人类和 11 个非人灵长类物种的长线程测序数据对该基因家族进行了测序和组装。我们的分析表明,这个特殊的基因家族在至少五个灵长类物种中发生了独立的重复,重复的基因位点富集在 17 号染色体上大规模染色体重排的位置。我们发现,人类所有的拷贝数变异都映射到位于染色体 17q12 的两个不同群组上,而且人类在该基因座上的结构变异很大,根据单倍型的不同,拷贝数相差多达 20 个,长度相差约 1 Mbp。我们还显示了正选择的证据,以及预测的人类 TBC1D3 蛋白序列的显著变化。最后,我们发现,尽管存在多个重复,但人类 TBC1D3 的表达仅限于一部分拷贝,而且最明显的是,只来自一个旁系组:TBC1D3-CDKL。这些观察结果可能有助于解释为什么一个可能对大脑皮层发育很重要的基因在人类群体中会如此多变。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Genome research
Genome research 生物-生化与分子生物学
CiteScore
12.40
自引率
1.40%
发文量
140
审稿时长
6 months
期刊介绍: Launched in 1995, Genome Research is an international, continuously published, peer-reviewed journal that focuses on research that provides novel insights into the genome biology of all organisms, including advances in genomic medicine. Among the topics considered by the journal are genome structure and function, comparative genomics, molecular evolution, genome-scale quantitative and population genetics, proteomics, epigenomics, and systems biology. The journal also features exciting gene discoveries and reports of cutting-edge computational biology and high-throughput methodologies. New data in these areas are published as research papers, or methods and resource reports that provide novel information on technologies or tools that will be of interest to a broad readership. Complete data sets are presented electronically on the journal''s web site where appropriate. The journal also provides Reviews, Perspectives, and Insight/Outlook articles, which present commentary on the latest advances published both here and elsewhere, placing such progress in its broader biological context.
期刊最新文献
Global identification of mammalian host and nested gene pairs reveal tissue-specific transcriptional interplay Convergent relaxation of molecular constraint in herbivores reveals the changing role of liver and kidney functions across mammalian diets KAS-ATAC reveals the genome-wide single-stranded accessible chromatin landscape of the human genome Advancements in prospective single-cell lineage barcoding and their applications in research The chromatin landscape of the histone-possessing Bacteriovorax bacteria
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1