PLINK:数据分析的关键功能。

Susan H Slifer
{"title":"PLINK:数据分析的关键功能。","authors":"Susan H Slifer","doi":"10.1002/cphg.59","DOIUrl":null,"url":null,"abstract":"<p><p>Genetic data analysis of large numbers of single nucleotide variants (SNVs), including genome-wide association studies (GWAS), exome chips, and whole exome (WES) or whole-genome (WGS) sequencing data, requires well defined processing steps. As a result, several freely available analytic toolkits have been developed to streamline these processes. Among these, PLINK is the most comprehensive in terms of its quality control and analytic modules, although its focus remains on SNVs. PLINK fulfills two analytic needs-aiding the process of performing quality control (QC) on large data sets and providing basic statistical tools to analyze the variants in genetic models. The current version of PLINK (v1.90b) has incorporated several sophisticated statistical modeling features, such as those that were introduced by GCTA (genome-wide complex trait analysis), including mixed-model association analysis and cluster-based algorithms. Although PLINK is diverse in its applicability to data management and analysis, in some instances, other available tools offer more optimal options. Here we provide a practical overview of major PLINK features with respect to QC, data management, and association mapping, along with learned shortcuts and limitations to be considered. In cases where PLINK features are limited, we provide alternative approaches using additional freely available pipelines. © 2018 by John Wiley & Sons, Inc.</p>","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"97 1","pages":"e59"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.59","citationCount":"34","resultStr":"{\"title\":\"PLINK: Key Functions for Data Analysis.\",\"authors\":\"Susan H Slifer\",\"doi\":\"10.1002/cphg.59\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Genetic data analysis of large numbers of single nucleotide variants (SNVs), including genome-wide association studies (GWAS), exome chips, and whole exome (WES) or whole-genome (WGS) sequencing data, requires well defined processing steps. As a result, several freely available analytic toolkits have been developed to streamline these processes. Among these, PLINK is the most comprehensive in terms of its quality control and analytic modules, although its focus remains on SNVs. PLINK fulfills two analytic needs-aiding the process of performing quality control (QC) on large data sets and providing basic statistical tools to analyze the variants in genetic models. The current version of PLINK (v1.90b) has incorporated several sophisticated statistical modeling features, such as those that were introduced by GCTA (genome-wide complex trait analysis), including mixed-model association analysis and cluster-based algorithms. Although PLINK is diverse in its applicability to data management and analysis, in some instances, other available tools offer more optimal options. Here we provide a practical overview of major PLINK features with respect to QC, data management, and association mapping, along with learned shortcuts and limitations to be considered. In cases where PLINK features are limited, we provide alternative approaches using additional freely available pipelines. © 2018 by John Wiley & Sons, Inc.</p>\",\"PeriodicalId\":40007,\"journal\":{\"name\":\"Current Protocols in Human Genetics\",\"volume\":\"97 1\",\"pages\":\"e59\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1002/cphg.59\",\"citationCount\":\"34\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Current Protocols in Human Genetics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/cphg.59\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Protocols in Human Genetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/cphg.59","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 34

摘要

大量单核苷酸变异(snv)的遗传数据分析,包括全基因组关联研究(GWAS)、外显子组芯片、全外显子组(WES)或全基因组(WGS)测序数据,需要明确的处理步骤。因此,已经开发了几个免费的分析工具包来简化这些过程。其中,PLINK在质量控制和分析模块方面是最全面的,尽管它的重点仍然是snv。PLINK满足了两个分析需求:帮助在大数据集上执行质量控制(QC)的过程,并提供基本的统计工具来分析遗传模型中的变异。PLINK的当前版本(v1.90b)包含了几个复杂的统计建模功能,例如GCTA(全基因组复杂性状分析)引入的功能,包括混合模型关联分析和基于聚类的算法。尽管PLINK在数据管理和分析方面的适用性多种多样,但在某些情况下,其他可用的工具提供了更优的选择。在这里,我们提供了一个关于质量控制、数据管理和关联映射的PLINK主要特性的实用概述,以及需要考虑的快捷方式和限制。在PLINK功能有限的情况下,我们提供了使用额外的免费管道的替代方法。©2018 by John Wiley & Sons, Inc。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
PLINK: Key Functions for Data Analysis.

Genetic data analysis of large numbers of single nucleotide variants (SNVs), including genome-wide association studies (GWAS), exome chips, and whole exome (WES) or whole-genome (WGS) sequencing data, requires well defined processing steps. As a result, several freely available analytic toolkits have been developed to streamline these processes. Among these, PLINK is the most comprehensive in terms of its quality control and analytic modules, although its focus remains on SNVs. PLINK fulfills two analytic needs-aiding the process of performing quality control (QC) on large data sets and providing basic statistical tools to analyze the variants in genetic models. The current version of PLINK (v1.90b) has incorporated several sophisticated statistical modeling features, such as those that were introduced by GCTA (genome-wide complex trait analysis), including mixed-model association analysis and cluster-based algorithms. Although PLINK is diverse in its applicability to data management and analysis, in some instances, other available tools offer more optimal options. Here we provide a practical overview of major PLINK features with respect to QC, data management, and association mapping, along with learned shortcuts and limitations to be considered. In cases where PLINK features are limited, we provide alternative approaches using additional freely available pipelines. © 2018 by John Wiley & Sons, Inc.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Current Protocols in Human Genetics
Current Protocols in Human Genetics Biochemistry, Genetics and Molecular Biology-Genetics
自引率
0.00%
发文量
0
期刊介绍: Current Protocols in Human Genetics is the resource for designing and running successful research projects in all branches of human genetics.
期刊最新文献
Issue Information Resolving Breakpoints of Chromosomal Rearrangements at the Nucleotide Level Using Sanger Sequencing Informed Consent for Genetic and Genomic Research A Guide to Using ClinTAD for Interpretation of DNA Copy Number Variants in the Context of Topologically Associated Domains The AD Knowledge Portal: A Repository for Multi-Omic Data on Alzheimer's Disease and Aging
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1