Mahmoud Koko, Laurie Fabian, Iaroslav Popov, Ruth Y Eberhardt, Gennadii Zakharov, Qin Qin Huang, Emma E Wade, Rafaq Azad, Petr Danecek, Karen Ho, Amy Hough, Wei Huang, Sarah J Lindsay, Daniel S Malawsky, Davide Bonfanti, Dan Mason, Deborah Plowman, Michael A Quail, Susan M Ring, Gemma Shireby, Sara Widaa, Emla Fitzsimons, Vivek Iyer, David Bann, Nicholas J Timpson, John Wright, Matthew E Hurles, Hilary C Martin
{"title":"英国出生队列的外显子组测序。","authors":"Mahmoud Koko, Laurie Fabian, Iaroslav Popov, Ruth Y Eberhardt, Gennadii Zakharov, Qin Qin Huang, Emma E Wade, Rafaq Azad, Petr Danecek, Karen Ho, Amy Hough, Wei Huang, Sarah J Lindsay, Daniel S Malawsky, Davide Bonfanti, Dan Mason, Deborah Plowman, Michael A Quail, Susan M Ring, Gemma Shireby, Sara Widaa, Emla Fitzsimons, Vivek Iyer, David Bann, Nicholas J Timpson, John Wright, Matthew E Hurles, Hilary C Martin","doi":"10.12688/wellcomeopenres.22697.2","DOIUrl":null,"url":null,"abstract":"<p><p>Birth cohort studies involve repeated surveys of large numbers of individuals from birth and throughout their lives. They collect information useful for a wide range of life course research domains, and biological samples which can be used to derive data from an increasing collection of omic technologies. This rich source of longitudinal data, when combined with genomic data, offers the scientific community valuable insights ranging from population genetics to applications across the social sciences. Here we present quality-controlled whole exome sequencing data from three UK birth cohorts: the Avon Longitudinal Study of Parents and Children (8,436 children and 3,215 parents), the Millenium Cohort Study (7,667 children and 6,925 parents) and Born in Bradford (8,784 children and 2,875 parents). The overall objective of this coordinated effort is to make the resulting high-quality data widely accessible to the global research community in a timely manner. We describe how the datasets were generated and subjected to quality control at the sample, variant and genotype level. We then present some preliminary analyses to illustrate the quality of the datasets and probe potential sources of bias. We introduce measures of ultra-rare variant burden to the variables available for researchers working on these cohorts, and show that the exome-wide burden of deleterious protein-truncating variants, <i>S</i> <sub>het</sub> burden, is associated with educational attainment and cognitive test scores. The whole exome sequence data from these birth cohorts (CRAM & VCF files) are available through the European Genome-Phenome Archive, and here we provide guidance for their use.</p>","PeriodicalId":23677,"journal":{"name":"Wellcome Open Research","volume":"9 ","pages":"390"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11747307/pdf/","citationCount":"0","resultStr":"{\"title\":\"Exome sequencing of UK birth cohorts.\",\"authors\":\"Mahmoud Koko, Laurie Fabian, Iaroslav Popov, Ruth Y Eberhardt, Gennadii Zakharov, Qin Qin Huang, Emma E Wade, Rafaq Azad, Petr Danecek, Karen Ho, Amy Hough, Wei Huang, Sarah J Lindsay, Daniel S Malawsky, Davide Bonfanti, Dan Mason, Deborah Plowman, Michael A Quail, Susan M Ring, Gemma Shireby, Sara Widaa, Emla Fitzsimons, Vivek Iyer, David Bann, Nicholas J Timpson, John Wright, Matthew E Hurles, Hilary C Martin\",\"doi\":\"10.12688/wellcomeopenres.22697.2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Birth cohort studies involve repeated surveys of large numbers of individuals from birth and throughout their lives. They collect information useful for a wide range of life course research domains, and biological samples which can be used to derive data from an increasing collection of omic technologies. This rich source of longitudinal data, when combined with genomic data, offers the scientific community valuable insights ranging from population genetics to applications across the social sciences. Here we present quality-controlled whole exome sequencing data from three UK birth cohorts: the Avon Longitudinal Study of Parents and Children (8,436 children and 3,215 parents), the Millenium Cohort Study (7,667 children and 6,925 parents) and Born in Bradford (8,784 children and 2,875 parents). The overall objective of this coordinated effort is to make the resulting high-quality data widely accessible to the global research community in a timely manner. We describe how the datasets were generated and subjected to quality control at the sample, variant and genotype level. We then present some preliminary analyses to illustrate the quality of the datasets and probe potential sources of bias. We introduce measures of ultra-rare variant burden to the variables available for researchers working on these cohorts, and show that the exome-wide burden of deleterious protein-truncating variants, <i>S</i> <sub>het</sub> burden, is associated with educational attainment and cognitive test scores. The whole exome sequence data from these birth cohorts (CRAM & VCF files) are available through the European Genome-Phenome Archive, and here we provide guidance for their use.</p>\",\"PeriodicalId\":23677,\"journal\":{\"name\":\"Wellcome Open Research\",\"volume\":\"9 \",\"pages\":\"390\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-12-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11747307/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Wellcome Open Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12688/wellcomeopenres.22697.2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wellcome Open Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12688/wellcomeopenres.22697.2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
Birth cohort studies involve repeated surveys of large numbers of individuals from birth and throughout their lives. They collect information useful for a wide range of life course research domains, and biological samples which can be used to derive data from an increasing collection of omic technologies. This rich source of longitudinal data, when combined with genomic data, offers the scientific community valuable insights ranging from population genetics to applications across the social sciences. Here we present quality-controlled whole exome sequencing data from three UK birth cohorts: the Avon Longitudinal Study of Parents and Children (8,436 children and 3,215 parents), the Millenium Cohort Study (7,667 children and 6,925 parents) and Born in Bradford (8,784 children and 2,875 parents). The overall objective of this coordinated effort is to make the resulting high-quality data widely accessible to the global research community in a timely manner. We describe how the datasets were generated and subjected to quality control at the sample, variant and genotype level. We then present some preliminary analyses to illustrate the quality of the datasets and probe potential sources of bias. We introduce measures of ultra-rare variant burden to the variables available for researchers working on these cohorts, and show that the exome-wide burden of deleterious protein-truncating variants, Shet burden, is associated with educational attainment and cognitive test scores. The whole exome sequence data from these birth cohorts (CRAM & VCF files) are available through the European Genome-Phenome Archive, and here we provide guidance for their use.
Wellcome Open ResearchBiochemistry, Genetics and Molecular Biology-Biochemistry, Genetics and Molecular Biology (all)
CiteScore
5.50
自引率
0.00%
发文量
426
审稿时长
1 weeks
期刊介绍:
Wellcome Open Research publishes scholarly articles reporting any basic scientific, translational and clinical research that has been funded (or co-funded) by Wellcome. Each publication must have at least one author who has been, or still is, a recipient of a Wellcome grant. Articles must be original (not duplications). All research, including clinical trials, systematic reviews, software tools, method articles, and many others, is welcome and will be published irrespective of the perceived level of interest or novelty; confirmatory and negative results, as well as null studies are all suitable. See the full list of article types here. All articles are published using a fully transparent, author-driven model: the authors are solely responsible for the content of their article. Invited peer review takes place openly after publication, and the authors play a crucial role in ensuring that the article is peer-reviewed by independent experts in a timely manner. Articles that pass peer review will be indexed in PubMed and elsewhere. Wellcome Open Research is an Open Research platform: all articles are published open access; the publishing and peer-review processes are fully transparent; and authors are asked to include detailed descriptions of methods and to provide full and easy access to source data underlying the results to improve reproducibility.