Highly accurate Korean draft genomes reveal structural variation highlighting human telomere evolution

IF 13.1 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Nucleic Acids Research Pub Date : 2025-01-08 DOI:10.1093/nar/gkae1294
Jun Kim, Jong Lyul Park, Jin Ok Yang, Sangok Kim, Soobok Joe, Gunwoo Park, Taeyeon Hwang, Mun-Jeong Cho, Seungjae Lee, Jong-Eun Lee, Ji-Hwan Park, Min-Kyung Yeo, Seon-Young Kim
{"title":"Highly accurate Korean draft genomes reveal structural variation highlighting human telomere evolution","authors":"Jun Kim, Jong Lyul Park, Jin Ok Yang, Sangok Kim, Soobok Joe, Gunwoo Park, Taeyeon Hwang, Mun-Jeong Cho, Seungjae Lee, Jong-Eun Lee, Ji-Hwan Park, Min-Kyung Yeo, Seon-Young Kim","doi":"10.1093/nar/gkae1294","DOIUrl":null,"url":null,"abstract":"Given the presence of highly repetitive genomic regions such as subtelomeric regions, understanding human genomic evolution remains challenging. Recently, long-read sequencing technology has facilitated the identification of complex genetic variants, including structural variants (SVs), at the single-nucleotide level. Here, we resolved SVs and their underlying DNA damage–repair mechanisms in subtelomeric regions, which are among the most uncharted genomic regions. We generated ∼20 × high-fidelity long-read sequencing data from three Korean individuals and their partially phased high-quality de novo genome assemblies (contig N50: 6.3–58.2 Mb). We identified 131 138 deletion and 121 461 insertion SVs, 41.6% of which were prevalent in the East Asian population. The commonality of the SVs identified among the Korean population was examined by short-read sequencing data from 103 Korean individuals, providing the first comprehensive SV set representing the population based on the long-read assemblies. Manual investigation of 19 large subtelomeric SVs (≥5 kb) and their associated repair signatures revealed the potential repair mechanisms leading to the formation of these SVs. Our study provides mechanistic insight into human telomere evolution and can facilitate our understanding of human SV formation.","PeriodicalId":19471,"journal":{"name":"Nucleic Acids Research","volume":"1 1","pages":""},"PeriodicalIF":13.1000,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nucleic Acids Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/nar/gkae1294","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Given the presence of highly repetitive genomic regions such as subtelomeric regions, understanding human genomic evolution remains challenging. Recently, long-read sequencing technology has facilitated the identification of complex genetic variants, including structural variants (SVs), at the single-nucleotide level. Here, we resolved SVs and their underlying DNA damage–repair mechanisms in subtelomeric regions, which are among the most uncharted genomic regions. We generated ∼20 × high-fidelity long-read sequencing data from three Korean individuals and their partially phased high-quality de novo genome assemblies (contig N50: 6.3–58.2 Mb). We identified 131 138 deletion and 121 461 insertion SVs, 41.6% of which were prevalent in the East Asian population. The commonality of the SVs identified among the Korean population was examined by short-read sequencing data from 103 Korean individuals, providing the first comprehensive SV set representing the population based on the long-read assemblies. Manual investigation of 19 large subtelomeric SVs (≥5 kb) and their associated repair signatures revealed the potential repair mechanisms leading to the formation of these SVs. Our study provides mechanistic insight into human telomere evolution and can facilitate our understanding of human SV formation.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
高度精确的韩国基因组草案揭示了人类端粒进化的结构变异
鉴于存在高度重复的基因组区域,如亚端粒区域,了解人类基因组进化仍然具有挑战性。最近,长读测序技术促进了在单核苷酸水平上识别复杂的遗传变异,包括结构变异(sv)。在这里,我们在亚端粒区域解决了sv及其潜在的DNA损伤修复机制,这是最未知的基因组区域之一。我们从三个韩国人及其部分分期的高质量从头基因组组装(N50: 6.3-58.2 Mb)中生成了~ 20 ×高保真长读测序数据。我们确定了131 138个缺失sv和121 461个插入sv,其中41.6%在东亚人群中普遍存在。通过103名韩国人的短读测序数据,研究了韩国人群中发现的SV的共性,提供了第一个基于长读序列代表人群的综合SV集。对19个大亚端粒sv(≥5 kb)及其相关修复特征的人工研究揭示了导致这些sv形成的潜在修复机制。我们的研究提供了人类端粒进化的机制,并有助于我们理解人类SV的形成。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Nucleic Acids Research
Nucleic Acids Research 生物-生化与分子生物学
CiteScore
27.10
自引率
4.70%
发文量
1057
审稿时长
2 months
期刊介绍: Nucleic Acids Research (NAR) is a scientific journal that publishes research on various aspects of nucleic acids and proteins involved in nucleic acid metabolism and interactions. It covers areas such as chemistry and synthetic biology, computational biology, gene regulation, chromatin and epigenetics, genome integrity, repair and replication, genomics, molecular biology, nucleic acid enzymes, RNA, and structural biology. The journal also includes a Survey and Summary section for brief reviews. Additionally, each year, the first issue is dedicated to biological databases, and an issue in July focuses on web-based software resources for the biological community. Nucleic Acids Research is indexed by several services including Abstracts on Hygiene and Communicable Diseases, Animal Breeding Abstracts, Agricultural Engineering Abstracts, Agbiotech News and Information, BIOSIS Previews, CAB Abstracts, and EMBASE.
期刊最新文献
FOXN3 integrates the KU70/KU80/SREBP-1 complex to regulate lipid metabolism in non-alcoholic fatty liver disease Glycine max Jumonji C domain-containing proteins JMJ19/20 exhibit endopeptidase activity and interact with LUXs to mediate flowering time Structural basis of Cas8-independent Cas3 recruitment in Type I-F2 CRISPR–Cas Spatially concentrated adenine base editors efficiently correct PLP1 mutations in oligodendrocytes Engineering of tRNAPro1E2 anticodon stem enhances multiple/consecutive ribosomal incorporation of N-methyl-l-α-amino acids and d-α-amino acids.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1