Yury A Barbitoff, Darya N Khmelkova, Ekaterina A Pomerantseva, Aleksandr V Slepchenkov, Nikita A Zubashenko, Irina V Mironova, Vladimir S Kaimonov, Dmitrii E Polev, Victoria V Tsay, Andrey S Glotov, Mikhail V Aseev, Sergey G Shcherbak, Oleg S Glotov, Arthur A Isaev, Alexander V Predeus
{"title":"通过跨实验室数据整合扩展俄罗斯等位基因频率参考:从 7452 个外显子组样本中获得的启示","authors":"Yury A Barbitoff, Darya N Khmelkova, Ekaterina A Pomerantseva, Aleksandr V Slepchenkov, Nikita A Zubashenko, Irina V Mironova, Vladimir S Kaimonov, Dmitrii E Polev, Victoria V Tsay, Andrey S Glotov, Mikhail V Aseev, Sergey G Shcherbak, Oleg S Glotov, Arthur A Isaev, Alexander V Predeus","doi":"10.1093/nsr/nwae326","DOIUrl":null,"url":null,"abstract":"Population allele frequency is crucially important for accurate interpretation of known and novel variants in medical genetics. Recently, several large allele frequency databases, such as Genome Aggregation Database (gnomAD), have been created to serve as a global reference for such studies. However, frequencies of many rare alleles vary dramatically between populations, and population-specific allele frequency is often more informative than the global one. Many countries and regions, including Russia, remain poorly studied from the genetic perspective. Here, we report the first successful attempt to integrate genetic information between major medical genetic laboratories in Russia. We construct RUSeq, an open, large-scale reference set of genetic variants by analyzing 7 492 exome samples collected in two major Russian cities of Moscow and St. Petersburg. An approximately tenfold increase in sample size compared to previous studies allowed us to characterize extensive genetic diversity within the admixed Russian population with contributions from several major ancestral groups. We highlight 51 known pathogenic variants that are overrepresented in Russia compared to other European countries. We also identify several dozen high-impact variants that are present in healthy donors despite either being annotated as pathogenic in ClinVar and falling within genes associated with autosomal dominant disorders. The constructed database of genetic variant frequencies in Russia has been made available to the medical genetics community through a variant browser available at http://ruseq.ru.","PeriodicalId":18842,"journal":{"name":"National Science Review","volume":"16 1","pages":""},"PeriodicalIF":16.3000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Expanding the Russian allele frequency reference via cross-laboratory data integration: insights from 7,452 exome samples\",\"authors\":\"Yury A Barbitoff, Darya N Khmelkova, Ekaterina A Pomerantseva, Aleksandr V Slepchenkov, Nikita A Zubashenko, Irina V Mironova, Vladimir S Kaimonov, Dmitrii E Polev, Victoria V Tsay, Andrey S Glotov, Mikhail V Aseev, Sergey G Shcherbak, Oleg S Glotov, Arthur A Isaev, Alexander V Predeus\",\"doi\":\"10.1093/nsr/nwae326\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Population allele frequency is crucially important for accurate interpretation of known and novel variants in medical genetics. Recently, several large allele frequency databases, such as Genome Aggregation Database (gnomAD), have been created to serve as a global reference for such studies. However, frequencies of many rare alleles vary dramatically between populations, and population-specific allele frequency is often more informative than the global one. Many countries and regions, including Russia, remain poorly studied from the genetic perspective. Here, we report the first successful attempt to integrate genetic information between major medical genetic laboratories in Russia. We construct RUSeq, an open, large-scale reference set of genetic variants by analyzing 7 492 exome samples collected in two major Russian cities of Moscow and St. Petersburg. An approximately tenfold increase in sample size compared to previous studies allowed us to characterize extensive genetic diversity within the admixed Russian population with contributions from several major ancestral groups. We highlight 51 known pathogenic variants that are overrepresented in Russia compared to other European countries. We also identify several dozen high-impact variants that are present in healthy donors despite either being annotated as pathogenic in ClinVar and falling within genes associated with autosomal dominant disorders. The constructed database of genetic variant frequencies in Russia has been made available to the medical genetics community through a variant browser available at http://ruseq.ru.\",\"PeriodicalId\":18842,\"journal\":{\"name\":\"National Science Review\",\"volume\":\"16 1\",\"pages\":\"\"},\"PeriodicalIF\":16.3000,\"publicationDate\":\"2024-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"National Science Review\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1093/nsr/nwae326\",\"RegionNum\":1,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"National Science Review","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1093/nsr/nwae326","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Expanding the Russian allele frequency reference via cross-laboratory data integration: insights from 7,452 exome samples
Population allele frequency is crucially important for accurate interpretation of known and novel variants in medical genetics. Recently, several large allele frequency databases, such as Genome Aggregation Database (gnomAD), have been created to serve as a global reference for such studies. However, frequencies of many rare alleles vary dramatically between populations, and population-specific allele frequency is often more informative than the global one. Many countries and regions, including Russia, remain poorly studied from the genetic perspective. Here, we report the first successful attempt to integrate genetic information between major medical genetic laboratories in Russia. We construct RUSeq, an open, large-scale reference set of genetic variants by analyzing 7 492 exome samples collected in two major Russian cities of Moscow and St. Petersburg. An approximately tenfold increase in sample size compared to previous studies allowed us to characterize extensive genetic diversity within the admixed Russian population with contributions from several major ancestral groups. We highlight 51 known pathogenic variants that are overrepresented in Russia compared to other European countries. We also identify several dozen high-impact variants that are present in healthy donors despite either being annotated as pathogenic in ClinVar and falling within genes associated with autosomal dominant disorders. The constructed database of genetic variant frequencies in Russia has been made available to the medical genetics community through a variant browser available at http://ruseq.ru.
期刊介绍:
National Science Review (NSR; ISSN abbreviation: Natl. Sci. Rev.) is an English-language peer-reviewed multidisciplinary open-access scientific journal published by Oxford University Press under the auspices of the Chinese Academy of Sciences.According to Journal Citation Reports, its 2021 impact factor was 23.178.
National Science Review publishes both review articles and perspectives as well as original research in the form of brief communications and research articles.