{"title":"Quality Control for the Illumina HumanExome BeadChip","authors":"Robert P. Igo Jr., Jessica N. Cooke Bailey, Jane Romm, Jonathan L. Haines, Janey L. Wiggs","doi":"10.1002/cphg.15","DOIUrl":null,"url":null,"abstract":"<p>The Illumina HumanExome BeadChip and other exome-based genotyping arrays offer inexpensive genotyping of some 240,000 mostly nonsynonymous coding variants across the human genome. The HumanExome chip, with its highly non-uniform distribution of markers and emphasis on rare coding variants, presents some unique challenges for quality control (QC) and data cleaning. Here, we describe QC procedures for HumanExome data, with examples of challenges specific to exome arrays from our experience cleaning a data set of ∼7,500 samples from the NEIGHBORHOOD Consortium. We focus on standard procedures for QC of genome-wide array data including genotype calling, sex verification, sample identity verification, relationship checking, and population structure that are complicated by the HumanExome panel's enrichment in rare, exonic variation. © 2016 by John Wiley & Sons, Inc.</p>","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"90 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.15","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Protocols in Human Genetics","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cphg.15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
The Illumina HumanExome BeadChip and other exome-based genotyping arrays offer inexpensive genotyping of some 240,000 mostly nonsynonymous coding variants across the human genome. The HumanExome chip, with its highly non-uniform distribution of markers and emphasis on rare coding variants, presents some unique challenges for quality control (QC) and data cleaning. Here, we describe QC procedures for HumanExome data, with examples of challenges specific to exome arrays from our experience cleaning a data set of ∼7,500 samples from the NEIGHBORHOOD Consortium. We focus on standard procedures for QC of genome-wide array data including genotype calling, sex verification, sample identity verification, relationship checking, and population structure that are complicated by the HumanExome panel's enrichment in rare, exonic variation. © 2016 by John Wiley & Sons, Inc.
Illumina HumanExome芯片的质量控制
Illumina HumanExome BeadChip和其他基于外显子组的基因分型阵列提供了大约24万个人类基因组非同义编码变体的廉价基因分型。HumanExome芯片由于其高度不均匀的标记分布和强调罕见的编码变体,对质量控制(QC)和数据清理提出了一些独特的挑战。在这里,我们描述了HumanExome数据的QC程序,并举例说明了我们清理NEIGHBORHOOD Consortium的约7,500个样本数据集的经验中特定于外显子组阵列的挑战。我们专注于全基因组阵列数据的QC标准程序,包括基因型调用,性别验证,样本身份验证,关系检查和种群结构,这些数据因HumanExome面板中罕见外显子变异的丰富而变得复杂。©2016 by John Wiley &儿子,Inc。
本文章由计算机程序翻译,如有差异,请以英文原文为准。