Expanding the Russian allele frequency reference via cross-laboratory data integration: insights from 7,452 exome samples

IF 16.3 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES National Science Review Pub Date : 2024-09-13 DOI:10.1093/nsr/nwae326
Yury A Barbitoff, Darya N Khmelkova, Ekaterina A Pomerantseva, Aleksandr V Slepchenkov, Nikita A Zubashenko, Irina V Mironova, Vladimir S Kaimonov, Dmitrii E Polev, Victoria V Tsay, Andrey S Glotov, Mikhail V Aseev, Sergey G Shcherbak, Oleg S Glotov, Arthur A Isaev, Alexander V Predeus
{"title":"Expanding the Russian allele frequency reference via cross-laboratory data integration: insights from 7,452 exome samples","authors":"Yury A Barbitoff, Darya N Khmelkova, Ekaterina A Pomerantseva, Aleksandr V Slepchenkov, Nikita A Zubashenko, Irina V Mironova, Vladimir S Kaimonov, Dmitrii E Polev, Victoria V Tsay, Andrey S Glotov, Mikhail V Aseev, Sergey G Shcherbak, Oleg S Glotov, Arthur A Isaev, Alexander V Predeus","doi":"10.1093/nsr/nwae326","DOIUrl":null,"url":null,"abstract":"Population allele frequency is crucially important for accurate interpretation of known and novel variants in medical genetics. Recently, several large allele frequency databases, such as Genome Aggregation Database (gnomAD), have been created to serve as a global reference for such studies. However, frequencies of many rare alleles vary dramatically between populations, and population-specific allele frequency is often more informative than the global one. Many countries and regions, including Russia, remain poorly studied from the genetic perspective. Here, we report the first successful attempt to integrate genetic information between major medical genetic laboratories in Russia. We construct RUSeq, an open, large-scale reference set of genetic variants by analyzing 7 492 exome samples collected in two major Russian cities of Moscow and St. Petersburg. An approximately tenfold increase in sample size compared to previous studies allowed us to characterize extensive genetic diversity within the admixed Russian population with contributions from several major ancestral groups. We highlight 51 known pathogenic variants that are overrepresented in Russia compared to other European countries. We also identify several dozen high-impact variants that are present in healthy donors despite either being annotated as pathogenic in ClinVar and falling within genes associated with autosomal dominant disorders. The constructed database of genetic variant frequencies in Russia has been made available to the medical genetics community through a variant browser available at http://ruseq.ru.","PeriodicalId":18842,"journal":{"name":"National Science Review","volume":"16 1","pages":""},"PeriodicalIF":16.3000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"National Science Review","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1093/nsr/nwae326","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Population allele frequency is crucially important for accurate interpretation of known and novel variants in medical genetics. Recently, several large allele frequency databases, such as Genome Aggregation Database (gnomAD), have been created to serve as a global reference for such studies. However, frequencies of many rare alleles vary dramatically between populations, and population-specific allele frequency is often more informative than the global one. Many countries and regions, including Russia, remain poorly studied from the genetic perspective. Here, we report the first successful attempt to integrate genetic information between major medical genetic laboratories in Russia. We construct RUSeq, an open, large-scale reference set of genetic variants by analyzing 7 492 exome samples collected in two major Russian cities of Moscow and St. Petersburg. An approximately tenfold increase in sample size compared to previous studies allowed us to characterize extensive genetic diversity within the admixed Russian population with contributions from several major ancestral groups. We highlight 51 known pathogenic variants that are overrepresented in Russia compared to other European countries. We also identify several dozen high-impact variants that are present in healthy donors despite either being annotated as pathogenic in ClinVar and falling within genes associated with autosomal dominant disorders. The constructed database of genetic variant frequencies in Russia has been made available to the medical genetics community through a variant browser available at http://ruseq.ru.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过跨实验室数据整合扩展俄罗斯等位基因频率参考:从 7452 个外显子组样本中获得的启示
人群等位基因频率对于准确解释医学遗传学中的已知变异和新型变异至关重要。最近,人们创建了几个大型等位基因频率数据库,如基因组聚合数据库(gnomAD),作为此类研究的全球参考。然而,许多罕见等位基因的频率在不同人群之间存在巨大差异,特定人群的等位基因频率往往比全球等位基因频率更有参考价值。从遗传学角度来看,包括俄罗斯在内的许多国家和地区仍然缺乏研究。在此,我们首次成功尝试整合俄罗斯主要医学遗传实验室之间的遗传信息。我们通过分析在莫斯科和圣彼得堡两个俄罗斯主要城市收集的 7 492 份外显子组样本,构建了一个开放的大规模遗传变异参考集 RUSeq。与之前的研究相比,样本量增加了约十倍,这使我们能够描述来自几个主要祖先群体的混血俄罗斯人口的广泛遗传多样性。我们重点研究了 51 个已知的致病变体,与其他欧洲国家相比,这些变体在俄罗斯的比例较高。我们还发现了几十个影响较大的变异体,这些变异体存在于健康捐献者中,尽管它们在 ClinVar 中被注释为致病性,并且属于常染色体显性遗传疾病的相关基因。所构建的俄罗斯遗传变异频率数据库已通过 http://ruseq.ru 上的变异浏览器提供给医学遗传学界。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
National Science Review
National Science Review MULTIDISCIPLINARY SCIENCES-
CiteScore
24.10
自引率
1.90%
发文量
249
审稿时长
13 weeks
期刊介绍: National Science Review (NSR; ISSN abbreviation: Natl. Sci. Rev.) is an English-language peer-reviewed multidisciplinary open-access scientific journal published by Oxford University Press under the auspices of the Chinese Academy of Sciences.According to Journal Citation Reports, its 2021 impact factor was 23.178. National Science Review publishes both review articles and perspectives as well as original research in the form of brief communications and research articles.
期刊最新文献
Ultraslow spreading ridges: slowest but locally thickest. Origin of sulfate in post-snowball-Earth oceans: river inputs vs. shelf-derived H2S. Deciphering decadal urban ozone trends from historical records since 1980. Contribution of irrigation to the production of maize, wheat, and rice in the major global producing countries. Fossil evidence for silica biomineralization in Permian lycophytes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1