Structural polymorphism and diversity of human segmental duplications

IF 31.7 1区 生物学 Q1 GENETICS & HEREDITY Nature genetics Pub Date : 2025-01-08 DOI:10.1038/s41588-024-02051-8
Hyeonsoo Jeong, Philip C. Dishuck, DongAhn Yoo, William T. Harvey, Katherine M. Munson, Alexandra P. Lewis, Jennifer Kordosky, Gage H. Garcia,  Human Genome Structural Variation Consortium (HGSVC), Feyza Yilmaz, Pille Hallast, Charles Lee, Tomi Pastinen, Evan E. Eichler
{"title":"Structural polymorphism and diversity of human segmental duplications","authors":"Hyeonsoo Jeong, Philip C. Dishuck, DongAhn Yoo, William T. Harvey, Katherine M. Munson, Alexandra P. Lewis, Jennifer Kordosky, Gage H. Garcia,  Human Genome Structural Variation Consortium (HGSVC), Feyza Yilmaz, Pille Hallast, Charles Lee, Tomi Pastinen, Evan E. Eichler","doi":"10.1038/s41588-024-02051-8","DOIUrl":null,"url":null,"abstract":"Segmental duplications (SDs) contribute significantly to human disease, evolution and diversity but have been difficult to resolve at the sequence level. We present a population genetics survey of SDs by analyzing 170 human genome assemblies (from 85 samples representing 38 Africans and 47 non-Africans) in which the majority of autosomal SDs are fully resolved using long-read sequence assembly. Excluding the acrocentric short arms and sex chromosomes, we identify 173.2 Mb of duplicated sequence (47.4 Mb not present in the telomere-to-telomere reference) distinguishing fixed from structurally polymorphic events. We find that intrachromosomal SDs are among the most variable, with rare events mapping near their progenitor sequences. African genomes harbor significantly more intrachromosomal SDs and are more likely to have recently duplicated gene families with higher copy numbers than non-African samples. Comparison to a resource of 563 million full-length isoform sequencing reads identifies 201 novel, potentially protein-coding genes corresponding to these copy number polymorphic SDs. Analysis of 170 human genomes assembled using long-read sequencing provides a map of structural variation within regions of segmental duplication and identifies novel candidate protein-coding genes supported by full-length Iso-Seq reads.","PeriodicalId":18985,"journal":{"name":"Nature genetics","volume":"57 2","pages":"390-401"},"PeriodicalIF":31.7000,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s41588-024-02051-8.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature genetics","FirstCategoryId":"99","ListUrlMain":"https://www.nature.com/articles/s41588-024-02051-8","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Segmental duplications (SDs) contribute significantly to human disease, evolution and diversity but have been difficult to resolve at the sequence level. We present a population genetics survey of SDs by analyzing 170 human genome assemblies (from 85 samples representing 38 Africans and 47 non-Africans) in which the majority of autosomal SDs are fully resolved using long-read sequence assembly. Excluding the acrocentric short arms and sex chromosomes, we identify 173.2 Mb of duplicated sequence (47.4 Mb not present in the telomere-to-telomere reference) distinguishing fixed from structurally polymorphic events. We find that intrachromosomal SDs are among the most variable, with rare events mapping near their progenitor sequences. African genomes harbor significantly more intrachromosomal SDs and are more likely to have recently duplicated gene families with higher copy numbers than non-African samples. Comparison to a resource of 563 million full-length isoform sequencing reads identifies 201 novel, potentially protein-coding genes corresponding to these copy number polymorphic SDs. Analysis of 170 human genomes assembled using long-read sequencing provides a map of structural variation within regions of segmental duplication and identifies novel candidate protein-coding genes supported by full-length Iso-Seq reads.

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
人类片段复制的结构多态性和多样性
片段重复(SDs)对人类疾病、进化和多样性有重要贡献,但难以在序列水平上解决。我们通过分析170个人类基因组组合(来自85个样本,代表38个非洲人和47个非非洲人),对SDs进行了群体遗传学调查,其中大多数常染色体SDs使用长读序列组装完全解决。排除顶心短臂和性染色体,我们鉴定了173.2 Mb的重复序列(47.4 Mb不存在于端粒到端粒的参考中),以区分固定和结构多态性事件。我们发现染色体内的SDs是最可变的,罕见的事件映射在它们的祖序列附近。与非非洲样本相比,非洲基因组含有更多的染色体内SDs,并且更有可能具有较高拷贝数的近期重复基因家族。与5.63亿个全长同种异构体测序的资源相比,鉴定出201个新的、潜在的蛋白质编码基因,与这些拷贝数多态性SDs相对应。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Nature genetics
Nature genetics 生物-遗传学
CiteScore
43.00
自引率
2.60%
发文量
241
审稿时长
3 months
期刊介绍: Nature Genetics publishes the very highest quality research in genetics. It encompasses genetic and functional genomic studies on human and plant traits and on other model organisms. Current emphasis is on the genetic basis for common and complex diseases and on the functional mechanism, architecture and evolution of gene networks, studied by experimental perturbation. Integrative genetic topics comprise, but are not limited to: -Genes in the pathology of human disease -Molecular analysis of simple and complex genetic traits -Cancer genetics -Agricultural genomics -Developmental genetics -Regulatory variation in gene expression -Strategies and technologies for extracting function from genomic data -Pharmacological genomics -Genome evolution
期刊最新文献
Mass spectrometry-based mapping of plasma protein QTLs in children and adolescents Safeguard repressor locks hepatocyte identity and blocks liver cancer Plasma proteome variation and its genetic determinants in children and adolescents ImmuneLENS characterizes systemic immune dysregulation in aging and cancer Large-scale genome-wide association analyses identify novel genetic loci and mechanisms in hypertrophic cardiomyopathy
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1