首页 > 最新文献

Current Protocols in Human Genetics最新文献

英文 中文
Genotype Imputation in Genome-Wide Association Studies. 全基因组关联研究中的基因型插入。
Pub Date : 2019-06-01 DOI: 10.1002/cphg.84
Adam C Naj

Genotype imputation infers missing genotypes in silico using haplotype information from reference samples with genotypes from denser genotyping arrays or sequencing. This approach can confer a number of improvements on genome-wide association studies: it can improve statistical power to detect associations by reducing the number of missing genotypes; it can simplify data harmonization for meta-analyses by improving overlap of genomic variants between differently-genotyped sample sets; and it can increase the overall number and density of genomic variants available for association testing. This article reviews the general concepts behind imputation, describes imputation approaches and methods for various types of genotype data, including family-based data, and identifies web-based resources that can be used in different steps of the imputation process. For practical application, it provides a step-by-step guide to implementation of a two-step imputation process consisting of phasing of the study genotypes and the imputation of reference panel genotypes into the study haplotypes. In addition, this review describes recently developed haplotype reference panel resources and online imputation servers that are capable of remotely and securely implementing an imputation workflow on uploaded genotype array data. © 2019 by John Wiley & Sons, Inc.

基因型插补利用参考样本的单倍型信息在计算机上推断缺失的基因型,参考样本的基因型来自密集的基因型阵列或测序。这种方法可以为全基因组关联研究带来许多改进:它可以通过减少缺失基因型的数量来提高检测关联的统计能力;它可以通过改善不同基因型样本集之间基因组变异的重叠,简化meta分析的数据协调;它可以增加可用于关联测试的基因组变异的总数和密度。本文回顾了归算背后的一般概念,描述了各种类型的基因型数据(包括基于家庭的数据)的归算方法和方法,并确定了可用于归算过程不同步骤的基于网络的资源。对于实际应用,它提供了一个分步指南,以实现两步的插入过程,包括研究基因型的分阶段和参考面板基因型插入到研究单倍型中。此外,本文还介绍了最近开发的单倍型参考面板资源和在线插补服务器,它们能够远程安全地对上传的基因型阵列数据实施插补工作流程。©2019 by John Wiley & Sons, Inc。
{"title":"Genotype Imputation in Genome-Wide Association Studies.","authors":"Adam C Naj","doi":"10.1002/cphg.84","DOIUrl":"https://doi.org/10.1002/cphg.84","url":null,"abstract":"<p><p>Genotype imputation infers missing genotypes in silico using haplotype information from reference samples with genotypes from denser genotyping arrays or sequencing. This approach can confer a number of improvements on genome-wide association studies: it can improve statistical power to detect associations by reducing the number of missing genotypes; it can simplify data harmonization for meta-analyses by improving overlap of genomic variants between differently-genotyped sample sets; and it can increase the overall number and density of genomic variants available for association testing. This article reviews the general concepts behind imputation, describes imputation approaches and methods for various types of genotype data, including family-based data, and identifies web-based resources that can be used in different steps of the imputation process. For practical application, it provides a step-by-step guide to implementation of a two-step imputation process consisting of phasing of the study genotypes and the imputation of reference panel genotypes into the study haplotypes. In addition, this review describes recently developed haplotype reference panel resources and online imputation servers that are capable of remotely and securely implementing an imputation workflow on uploaded genotype array data. © 2019 by John Wiley & Sons, Inc.</p>","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"102 1","pages":"e84"},"PeriodicalIF":0.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.84","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37084177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Issue Information TOC 发布信息TOC
Pub Date : 2019-06-01 DOI: 10.1002/cphg.77
{"title":"Issue Information TOC","authors":"","doi":"10.1002/cphg.77","DOIUrl":"https://doi.org/10.1002/cphg.77","url":null,"abstract":"","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"27 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80841603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Issue Information TOC 发布信息TOC
Pub Date : 2019-03-19 DOI: 10.1002/cphg.76

Cover: In Rasooly and Patel (https://doi.org/10.1002/cphg.82), the image shows the scatterplot suggests a positive causal relationship of the SNP effects on BMI against the SNP effects on type 2 diabetes. Each point represents a single genetic variant. The horizontal and vertical lines extending from each point represent 95% confidence intervals for the genetic associations. The x-axis displays the estimated genetic associations with the exposure (BMI), and the y-axis displays the estimated genetic associations with the outcome (type 2 diabetes). The color of the lines indicate the type of MR test used (light blue for IVW, dark blue for MR Egger, light green for simple mode, dark green for weighted median, and red for weighted mode).

封面:Rasooly和Patel (https://doi.org/10.1002/cphg.82)的图片显示,散点图显示SNP对BMI的影响与SNP对2型糖尿病的影响呈正相关。每个点代表一个单一的基因变异。从每个点延伸出来的水平线和垂直线代表遗传关联的95%置信区间。x轴显示与暴露(BMI)的估计遗传关联,y轴显示与结果(2型糖尿病)的估计遗传关联。线条的颜色表示所使用的MR测试的类型(浅蓝色表示IVW,深蓝色表示MR Egger,浅绿色表示简单模式,深绿色表示加权中位数,红色表示加权模式)。
{"title":"Issue Information TOC","authors":"","doi":"10.1002/cphg.76","DOIUrl":"10.1002/cphg.76","url":null,"abstract":"<p><b>Cover</b>: In Rasooly and Patel (https://doi.org/10.1002/cphg.82), the image shows the scatterplot suggests a positive causal relationship of the SNP effects on BMI against the SNP effects on type 2 diabetes. Each point represents a single genetic variant. The horizontal and vertical lines extending from each point represent 95% confidence intervals for the genetic associations. The x-axis displays the estimated genetic associations with the exposure (BMI), and the y-axis displays the estimated genetic associations with the outcome (type 2 diabetes). The color of the lines indicate the type of MR test used (light blue for IVW, dark blue for MR Egger, light green for simple mode, dark green for weighted median, and red for weighted mode).\u0000\u0000 <figure>\u0000 <div><picture>\u0000 <source></source></picture><p></p>\u0000 </div>\u0000 </figure></p>","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"101 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.76","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85242550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Methods for the Analysis and Interpretation for Rare Variants Associated with Complex Traits 复杂性状相关罕见变异的分析与解释方法
Pub Date : 2019-03-08 DOI: 10.1002/cphg.83
J. Dylan Weissenkampen, Yu Jiang, Scott Eckert, Bibo Jiang, Bingshan Li, Dajiang J. Liu

With the advent of Next Generation Sequencing (NGS) technologies, whole genome and whole exome DNA sequencing has become affordable for routine genetic studies. Coupled with improved genotyping arrays and genotype imputation methodologies, it is increasingly feasible to obtain rare genetic variant information in large datasets. Such datasets allow researchers to gain a more complete understanding of the genetic architecture of complex traits caused by rare variants. State-of-the-art statistical methods for the statistical genetics analysis of sequence-based association, including efficient algorithms for association analysis in biobank-scale datasets, gene-association tests, meta-analysis, fine mapping methods that integrate functional genomic dataset, and phenome-wide association studies (PheWAS), are reviewed here. These methods are expected to be highly useful for next generation statistical genetics analysis in the era of precision medicine. © 2019 by John Wiley & Sons, Inc.

随着下一代测序(NGS)技术的出现,全基因组和全外显子组DNA测序已经成为常规遗传研究的负担得起的方法。随着基因分型阵列和基因型插补方法的改进,在大型数据集中获取罕见遗传变异信息越来越可行。这样的数据集使研究人员能够更全面地了解由罕见变异引起的复杂性状的遗传结构。本文综述了基于序列关联的统计遗传学分析的最新统计方法,包括生物库规模数据集关联分析的高效算法、基因关联测试、元分析、集成功能基因组数据集的精细定位方法和全表型关联研究(PheWAS)。这些方法有望在精准医学时代的下一代统计遗传学分析中发挥重要作用。©2019 by John Wiley &儿子,Inc。
{"title":"Methods for the Analysis and Interpretation for Rare Variants Associated with Complex Traits","authors":"J. Dylan Weissenkampen,&nbsp;Yu Jiang,&nbsp;Scott Eckert,&nbsp;Bibo Jiang,&nbsp;Bingshan Li,&nbsp;Dajiang J. Liu","doi":"10.1002/cphg.83","DOIUrl":"10.1002/cphg.83","url":null,"abstract":"<p>With the advent of Next Generation Sequencing (NGS) technologies, whole genome and whole exome DNA sequencing has become affordable for routine genetic studies. Coupled with improved genotyping arrays and genotype imputation methodologies, it is increasingly feasible to obtain rare genetic variant information in large datasets. Such datasets allow researchers to gain a more complete understanding of the genetic architecture of complex traits caused by rare variants. State-of-the-art statistical methods for the statistical genetics analysis of sequence-based association, including efficient algorithms for association analysis in biobank-scale datasets, gene-association tests, meta-analysis, fine mapping methods that integrate functional genomic dataset, and phenome-wide association studies (PheWAS), are reviewed here. These methods are expected to be highly useful for next generation statistical genetics analysis in the era of precision medicine. © 2019 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"101 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.83","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37036747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Conducting a Reproducible Mendelian Randomization Analysis Using the R Analytic Statistical Environment 利用R分析统计环境进行可重复的孟德尔随机化分析
Pub Date : 2019-01-15 DOI: 10.1002/cphg.82
Danielle Rasooly, Chirag J. Patel

Mendelian randomization (MR) is defined as the utilization of genetic variants as instrumental variables to assess the causal relationship between an exposure and an outcome. By leveraging genetic polymorphisms as proxy for an exposure, the causal effect of an exposure on an outcome can be assessed while addressing susceptibility to biases prone to conventional observational studies, including confounding and reverse causation, where the outcome causes the exposure. Analogous to a randomized controlled trial where patients are randomly assigned to subgroups based on different treatments, in an MR analysis, the random allocation of alleles during meiosis from parent to offspring assigns individuals to different subgroups based on genetic variants. Recent methods use summary statistics from genome-wide association studies to perform MR, bypassing the need for individual-level data. Here, we provide a straightforward protocol for using summary-level data to perform MR and provide guidance for utilizing available software. © 2019 by John Wiley & Sons, Inc.

孟德尔随机化(MR)被定义为利用遗传变异作为工具变量来评估暴露与结果之间的因果关系。通过利用遗传多态性作为暴露的代理,可以评估暴露对结果的因果关系,同时解决传统观察性研究中容易出现的偏见的易感性,包括混淆和反向因果关系,其中结果导致暴露。类似于随机对照试验,患者根据不同的治疗方法被随机分配到亚组,在MR分析中,减数分裂期间等位基因从亲本到后代的随机分配将个体根据遗传变异分配到不同的亚组。最近的方法使用来自全基因组关联研究的汇总统计数据来执行MR,绕过了对个人水平数据的需要。在这里,我们提供了一个使用摘要级数据执行MR的简单协议,并提供了使用可用软件的指导。©2019 by John Wiley &儿子,Inc。
{"title":"Conducting a Reproducible Mendelian Randomization Analysis Using the R Analytic Statistical Environment","authors":"Danielle Rasooly,&nbsp;Chirag J. Patel","doi":"10.1002/cphg.82","DOIUrl":"10.1002/cphg.82","url":null,"abstract":"<p>Mendelian randomization (MR) is defined as the utilization of genetic variants as instrumental variables to assess the causal relationship between an exposure and an outcome. By leveraging genetic polymorphisms as proxy for an exposure, the causal effect of an exposure on an outcome can be assessed while addressing susceptibility to biases prone to conventional observational studies, including confounding and reverse causation, where the outcome causes the exposure. Analogous to a randomized controlled trial where patients are randomly assigned to subgroups based on different treatments, in an MR analysis, the random allocation of alleles during meiosis from parent to offspring assigns individuals to different subgroups based on genetic variants. Recent methods use summary statistics from genome-wide association studies to perform MR, bypassing the need for individual-level data. Here, we provide a straightforward protocol for using summary-level data to perform MR and provide guidance for utilizing available software. © 2019 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"101 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.82","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36865548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Simultaneous Targeted Methylation Sequencing (sTM-Seq) 同步靶向甲基化测序(sTM-Seq)
Pub Date : 2019-01-08 DOI: 10.1002/cphg.81
Natalie Asmus, Ligia A. Papale, Andy Madrid, Reid S. Alisch

Mapping patterns of DNA methylation throughout the epigenome are critical to our understanding of several important biological and regulatory functions, such as transcriptional regulation, genomic imprinting, and embryonic development. The development and rapid advancement of next-generation sequencing (NGS) technologies have provided clinicians and researchers with accurate and reliable read-outs of genomic and epigenomic information at the nucleotide level. Such improvements have significantly lowered the cost required for genome-wide sequencing, facilitating the vast acquisition of data that has led to many improvements in patient care. However, the torrid rate of NGS data generation has left targeted validation approaches behind, including the confirmation of epigenetic marks such as DNA methylation. To overcome these shortcomings, we present a rapid and robust protocol for the parallel examination of multiple methylated sequences that we have termed simultaneous targeted methylation sequencing (sTM-Seq). Key features of this technique include the elimination of the need for large amounts of high-molecular weight DNA and the nucleotide specific distinction of both 5-methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC). Moreover, sTM-Seq is scalable and can be used to investigate multiple loci in dozens of samples within a single sequencing run. By utilizing freely available web-based software and universal primers for multipurpose barcoding, library preparation, and customized sequencing, sTM-Seq is affordable, efficient, and widely applicable. Together, these features enable sTM-Seq to have wide-reaching clinical applications that will greatly improve turnaround rates for same-day procedures and allow clinicians to collect high-resolution data that can be used in a variety of patient settings. © 2019 by John Wiley & Sons, Inc.

DNA甲基化在整个表观基因组中的定位模式对我们理解一些重要的生物学和调控功能至关重要,如转录调控、基因组印记和胚胎发育。新一代测序(NGS)技术的发展和快速进步为临床医生和研究人员提供了准确可靠的核苷酸水平的基因组和表观基因组信息。这些改进大大降低了全基因组测序所需的成本,促进了大量数据的获取,从而大大改善了患者护理。然而,NGS数据生成的惊人速度使有针对性的验证方法落后,包括DNA甲基化等表观遗传标记的确认。为了克服这些缺点,我们提出了一种快速而强大的方案,用于平行检查多个甲基化序列,我们称之为同步靶向甲基化测序(sTM-Seq)。该技术的主要特点包括消除了对大量高分子量DNA的需要,以及对5-甲基胞嘧啶(5mC)和5-羟甲基胞嘧啶(5hmC)的核苷酸特异性区分。此外,sTM-Seq具有可扩展性,可用于在单次测序运行中调查数十个样品中的多个位点。sTM-Seq利用免费的网络软件和通用引物进行多用途条形码,文库制备和定制测序,价格合理,效率高,应用广泛。总之,这些功能使sTM-Seq具有广泛的临床应用,这将大大提高当天手术的周转速度,并允许临床医生收集可用于各种患者环境的高分辨率数据。©2019 by John Wiley &儿子,Inc。
{"title":"Simultaneous Targeted Methylation Sequencing (sTM-Seq)","authors":"Natalie Asmus,&nbsp;Ligia A. Papale,&nbsp;Andy Madrid,&nbsp;Reid S. Alisch","doi":"10.1002/cphg.81","DOIUrl":"10.1002/cphg.81","url":null,"abstract":"<p>Mapping patterns of DNA methylation throughout the epigenome are critical to our understanding of several important biological and regulatory functions, such as transcriptional regulation, genomic imprinting, and embryonic development. The development and rapid advancement of next-generation sequencing (NGS) technologies have provided clinicians and researchers with accurate and reliable read-outs of genomic and epigenomic information at the nucleotide level. Such improvements have significantly lowered the cost required for genome-wide sequencing, facilitating the vast acquisition of data that has led to many improvements in patient care. However, the torrid rate of NGS data generation has left targeted validation approaches behind, including the confirmation of epigenetic marks such as DNA methylation. To overcome these shortcomings, we present a rapid and robust protocol for the parallel examination of multiple methylated sequences that we have termed simultaneous targeted methylation sequencing (sTM-Seq). Key features of this technique include the elimination of the need for large amounts of high-molecular weight DNA and the nucleotide specific distinction of both 5-methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC). Moreover, sTM-Seq is scalable and can be used to investigate multiple loci in dozens of samples within a single sequencing run. By utilizing freely available web-based software and universal primers for multipurpose barcoding, library preparation, and customized sequencing, sTM-Seq is affordable, efficient, and widely applicable. Together, these features enable sTM-Seq to have wide-reaching clinical applications that will greatly improve turnaround rates for same-day procedures and allow clinicians to collect high-resolution data that can be used in a variety of patient settings. © 2019 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"101 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.81","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36843310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Issue Information TOC 发布信息TOC
Pub Date : 2019-01-03 DOI: 10.1002/cphg.74
{"title":"Issue Information TOC","authors":"","doi":"10.1002/cphg.74","DOIUrl":"10.1002/cphg.74","url":null,"abstract":"","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"100 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.74","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84898828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Electronic Health Records To Generate Phenotypes For Research 使用电子健康记录为研究生成表型
Pub Date : 2018-12-05 DOI: 10.1002/cphg.80
Sarah A. Pendergrass, Dana C. Crawford

Electronic health records contain patient-level data collected during and for clinical care. Data within the electronic health record include diagnostic billing codes, procedure codes, vital signs, laboratory test results, clinical imaging, and physician notes. With repeated clinic visits, these data are longitudinal, providing important information on disease development, progression, and response to treatment or intervention strategies. The near universal adoption of electronic health records nationally has the potential to provide population-scale real-world clinical data accessible for biomedical research, including genetic association studies. For this research potential to be realized, high-quality research-grade variables must be extracted from these clinical data warehouses. We describe here common and emerging electronic phenotyping approaches applied to electronic health records, as well as current limitations of both the approaches and the biases associated with these clinically collected data that impact their use in research. © 2018 by John Wiley & Sons, Inc.

电子健康记录包含在临床护理期间和为临床护理收集的患者级别数据。电子健康记录中的数据包括诊断账单代码、程序代码、生命体征、实验室测试结果、临床成像和医生记录。通过反复的临床访问,这些数据是纵向的,提供了关于疾病发展、进展和对治疗或干预策略的反应的重要信息。电子健康记录在全国范围内的几乎普遍采用,有可能为生物医学研究提供人口规模的真实世界临床数据,包括遗传关联研究。为了实现这一研究潜力,必须从这些临床数据仓库中提取高质量的研究级变量。我们在这里描述了应用于电子健康记录的常见和新兴的电子表型方法,以及这两种方法的当前局限性和与这些临床收集的数据相关的影响其在研究中使用的偏差。©2018 by John Wiley &儿子,Inc。
{"title":"Using Electronic Health Records To Generate Phenotypes For Research","authors":"Sarah A. Pendergrass,&nbsp;Dana C. Crawford","doi":"10.1002/cphg.80","DOIUrl":"10.1002/cphg.80","url":null,"abstract":"<p>Electronic health records contain patient-level data collected during and for clinical care. Data within the electronic health record include diagnostic billing codes, procedure codes, vital signs, laboratory test results, clinical imaging, and physician notes. With repeated clinic visits, these data are longitudinal, providing important information on disease development, progression, and response to treatment or intervention strategies. The near universal adoption of electronic health records nationally has the potential to provide population-scale real-world clinical data accessible for biomedical research, including genetic association studies. For this research potential to be realized, high-quality research-grade variables must be extracted from these clinical data warehouses. We describe here common and emerging electronic phenotyping approaches applied to electronic health records, as well as current limitations of both the approaches and the biases associated with these clinically collected data that impact their use in research. © 2018 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"100 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.80","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36797696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 45
Strategies for Pathway Analysis Using GWAS and WGS Data 利用GWAS和WGS数据进行通路分析的策略
Pub Date : 2018-11-02 DOI: 10.1002/cphg.79
Marquitta J. White, Brian L. Yaspan, Olivia J. Veatch, Pagé Goddard, Oona S. Risse-Adams, Maria G. Contreras

Single-allele study designs, commonly used in genome-wide association studies (GWAS) as well as the more recently developed whole genome sequencing (WGS) studies, are a standard approach for investigating the relationship of common variation within the human genome to a given phenotype of interest. However, single-allele association results published for many GWAS studies represent only the tip of the iceberg for the information that can be extracted from these datasets. The primary analysis strategy for GWAS entails association analysis in which only the single nucleotide polymorphisms (SNPs) with the strongest p-values are declared statistically significant due to issues arising from multiple testing and type I errors. Factors such as locus heterogeneity, epistasis, and multiple genes conferring small effects contribute to the complexity of the genetic models underlying phenotype expression. Thus, many biologically meaningful associations having lower effect sizes at individual genes are overlooked, making it difficult to separate true associations from a sea of false-positive associations. Organizing these individual SNPs into biologically meaningful groups to look at the overall effects of minor perturbations to genes and pathways is desirable. This pathway-based approach provides researchers with insight into the functional foundations of the phenotype being studied and allows testing of various genetic scenarios. © 2018 by John Wiley & Sons, Inc.

单等位基因研究设计通常用于全基因组关联研究(GWAS)以及最近开发的全基因组测序(WGS)研究,是研究人类基因组内常见变异与特定表型之间关系的标准方法。然而,许多GWAS研究发表的单等位基因关联结果只代表了从这些数据集中可以提取的信息的冰山一角。GWAS的主要分析策略需要关联分析,其中只有具有最强p值的单核苷酸多态性(snp)被宣布为统计显著,这是由于多次测试和I型错误引起的问题。基因座异质性、上位性和多基因赋予的小影响等因素导致了表型表达遗传模型的复杂性。因此,许多在单个基因上具有较低效应大小的具有生物学意义的关联被忽视了,这使得很难将真正的关联从假阳性关联的海洋中分离出来。将这些单个snp组织成具有生物学意义的群体,以观察轻微扰动对基因和途径的总体影响是可取的。这种基于途径的方法为研究人员提供了对正在研究的表型的功能基础的见解,并允许对各种遗传情景进行测试。©2018 by John Wiley &儿子,Inc。
{"title":"Strategies for Pathway Analysis Using GWAS and WGS Data","authors":"Marquitta J. White,&nbsp;Brian L. Yaspan,&nbsp;Olivia J. Veatch,&nbsp;Pagé Goddard,&nbsp;Oona S. Risse-Adams,&nbsp;Maria G. Contreras","doi":"10.1002/cphg.79","DOIUrl":"10.1002/cphg.79","url":null,"abstract":"<p>Single-allele study designs, commonly used in genome-wide association studies (GWAS) as well as the more recently developed whole genome sequencing (WGS) studies, are a standard approach for investigating the relationship of common variation within the human genome to a given phenotype of interest. However, single-allele association results published for many GWAS studies represent only the tip of the iceberg for the information that can be extracted from these datasets. The primary analysis strategy for GWAS entails association analysis in which only the single nucleotide polymorphisms (SNPs) with the strongest <i>p-values</i> are declared statistically significant due to issues arising from multiple testing and type I errors. Factors such as locus heterogeneity, epistasis, and multiple genes conferring small effects contribute to the complexity of the genetic models underlying phenotype expression. Thus, many biologically meaningful associations having lower effect sizes at individual genes are overlooked, making it difficult to separate true associations from a sea of false-positive associations. Organizing these individual SNPs into biologically meaningful groups to look at the overall effects of minor perturbations to genes and pathways is desirable. This pathway-based approach provides researchers with insight into the functional foundations of the phenotype being studied and allows testing of various genetic scenarios. © 2018 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"100 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.79","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36629806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Obtaining a Genetic Family History Using Computer-Based Tools 使用基于计算机的工具获取遗传家族史
Pub Date : 2018-10-18 DOI: 10.1002/cphg.72
Weilong Li, Michael F. Murray, Monica A. Giovanni

Family health history has long been known to be a powerful predictor of individual disease risk. It can be obtained prior to DNA sequencing in order to examine inheritance patterns, to be used as a proxy for genetic information, or as a tool to guide decision-making on the utility of diagnostic genetic testing. Increasingly, it is also being obtained retrospectively from sequenced individuals to examine familial disease penetrance and to identify at-risk relatives for cascade testing. The collection of adequate family history information to screen patients for disease risk and guide decision-making is a time-consuming process that is difficult to accomplish exclusively through discussion between patients and their providers. Engaging individuals and families in data collection and data entry has the potential to improve data accuracy through re-iterative review with family members and health care providers, and to empower patients in their healthcare. In addition, electronic datasets can be shared amongst relatives and stored in electronic health records or personal files, enabling portability of family history information. The U.S. Surgeon General, the Centers for Disease Control and Prevention (CDC), and others have developed tools for electronic family history collection to help families and providers obtain this useful information in an efficient manner. This unit describes the utility of the web-based My Family Health Portrait (https://familyhistory.hhs.gov) as the prototype for patient-entered family history. © 2018 by John Wiley & Sons, Inc.

长期以来,家族健康史一直被认为是个体疾病风险的有力预测指标。它可以在DNA测序之前获得,以便检查遗传模式,用作遗传信息的代理,或作为指导诊断基因测试效用决策的工具。越来越多地,它也被从测序个体中回顾性地获得,以检查家族疾病的外显率,并确定有风险的亲属进行级联检测。收集足够的家族史信息来筛查患者的疾病风险和指导决策是一个耗时的过程,很难完全通过患者和他们的提供者之间的讨论来完成。让个人和家庭参与数据收集和数据输入有可能通过与家庭成员和医疗保健提供者进行反复审查来提高数据准确性,并增强患者的医疗保健能力。此外,电子数据集可以在亲属之间共享,并存储在电子健康记录或个人档案中,从而实现家族史信息的可移植性。美国卫生局局长、疾病控制和预防中心(CDC)和其他机构已经开发了电子家族史收集工具,以帮助家庭和提供者以有效的方式获得这些有用的信息。本单元描述了基于网络的“我的家庭健康画像”(https://familyhistory.hhs.gov)作为患者输入家族史的原型的效用。©2018 by John Wiley &儿子,Inc。
{"title":"Obtaining a Genetic Family History Using Computer-Based Tools","authors":"Weilong Li,&nbsp;Michael F. Murray,&nbsp;Monica A. Giovanni","doi":"10.1002/cphg.72","DOIUrl":"10.1002/cphg.72","url":null,"abstract":"<p>Family health history has long been known to be a powerful predictor of individual disease risk. It can be obtained prior to DNA sequencing in order to examine inheritance patterns, to be used as a proxy for genetic information, or as a tool to guide decision-making on the utility of diagnostic genetic testing. Increasingly, it is also being obtained retrospectively from sequenced individuals to examine familial disease penetrance and to identify at-risk relatives for cascade testing. The collection of adequate family history information to screen patients for disease risk and guide decision-making is a time-consuming process that is difficult to accomplish exclusively through discussion between patients and their providers. Engaging individuals and families in data collection and data entry has the potential to improve data accuracy through re-iterative review with family members and health care providers, and to empower patients in their healthcare. In addition, electronic datasets can be shared amongst relatives and stored in electronic health records or personal files, enabling portability of family history information. The U.S. Surgeon General, the Centers for Disease Control and Prevention (CDC), and others have developed tools for electronic family history collection to help families and providers obtain this useful information in an efficient manner. This unit describes the utility of the web-based My Family Health Portrait (https://familyhistory.hhs.gov) as the prototype for patient-entered family history. © 2018 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"100 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.72","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36639028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Current Protocols in Human Genetics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1