涅槃:临床级变异注释器

Michael P. Strömberg, R. Roy, J. Lajugie, Yu Jiang, Haochen Li, E. Margulies
{"title":"涅槃:临床级变异注释器","authors":"Michael P. Strömberg, R. Roy, J. Lajugie, Yu Jiang, Haochen Li, E. Margulies","doi":"10.1145/3107411.3108204","DOIUrl":null,"url":null,"abstract":"Sequencing an individual genome typically produces approximately three million variants compared to the human reference genome. The consequence for each variant depends on the location and nature of the variant and is a key question for genetic analysts performing clinical diagnosis. Variant annotation describes how a variant affects the sample's genome. These annotations include the functional consequence on the different transcripts for a gene or in proximal regulatory regions. Annotation also includes additional data on what is known about a given variant that can help in understanding its relevance to a given line of investigation. Often this data is provided by different sources and contain allele frequencies for different populations, clinical implications, relevance to cancer types, additional studies, etc. Ultimately this information helps clinicians interpret variants when providing a diagnosis. The three most widely used open source annotation tools are VEP, SnpEff and AnnoVar. VEP is widely considered to be most accurate of the three, but is also slower than both SnpEff and AnnoVar. When annotating the variants from a 30x genome (NA12878), VEP finished in 18 hours whereas SnpEff 4.3g and AnnoVar finish in 15 min and 67 min respectively using one core. We present Nirvana, an open source clinical variant annotator, that is both accurate (over 99.9% concordance with VEP) and fast (takes 7 min to annotate NA12878). Nirvana is used in all of Illumina's relevant analysis pipelines and is tested rigorously to ensure adherence to clinical standards.","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Nirvana: Clinical Grade Variant Annotator\",\"authors\":\"Michael P. Strömberg, R. Roy, J. Lajugie, Yu Jiang, Haochen Li, E. Margulies\",\"doi\":\"10.1145/3107411.3108204\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sequencing an individual genome typically produces approximately three million variants compared to the human reference genome. The consequence for each variant depends on the location and nature of the variant and is a key question for genetic analysts performing clinical diagnosis. Variant annotation describes how a variant affects the sample's genome. These annotations include the functional consequence on the different transcripts for a gene or in proximal regulatory regions. Annotation also includes additional data on what is known about a given variant that can help in understanding its relevance to a given line of investigation. Often this data is provided by different sources and contain allele frequencies for different populations, clinical implications, relevance to cancer types, additional studies, etc. Ultimately this information helps clinicians interpret variants when providing a diagnosis. The three most widely used open source annotation tools are VEP, SnpEff and AnnoVar. VEP is widely considered to be most accurate of the three, but is also slower than both SnpEff and AnnoVar. When annotating the variants from a 30x genome (NA12878), VEP finished in 18 hours whereas SnpEff 4.3g and AnnoVar finish in 15 min and 67 min respectively using one core. We present Nirvana, an open source clinical variant annotator, that is both accurate (over 99.9% concordance with VEP) and fast (takes 7 min to annotate NA12878). Nirvana is used in all of Illumina's relevant analysis pipelines and is tested rigorously to ensure adherence to clinical standards.\",\"PeriodicalId\":246388,\"journal\":{\"name\":\"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3107411.3108204\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3107411.3108204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

与人类参考基因组相比,单个基因组的测序通常会产生大约300万个变体。每种变异的结果取决于变异的位置和性质,这是遗传分析人员进行临床诊断的关键问题。变体注释描述了一个变体如何影响样本的基因组。这些注释包括对基因或近端调控区域的不同转录本的功能后果。注释还包括关于给定变体的已知情况的附加数据,这些数据可以帮助理解其与给定调查线的相关性。这些数据通常来自不同的来源,包含不同人群的等位基因频率、临床意义、与癌症类型的相关性、其他研究等。最终,这些信息有助于临床医生在提供诊断时解释变异。三个最广泛使用的开源注释工具是VEP、SnpEff和AnnoVar。VEP被普遍认为是三者中最准确的,但也比SnpEff和AnnoVar慢。当对来自30x基因组(NA12878)的变体进行注释时,VEP在18小时内完成,而SnpEff 4.3g和AnnoVar分别在15分钟和67分钟内完成。我们介绍了Nirvana,一个开源的临床变异注释器,它既准确(与VEP的一致性超过99.9%)又快速(注释NA12878需要7分钟)。Nirvana用于Illumina所有相关的分析管道,并经过严格的测试,以确保符合临床标准。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Nirvana: Clinical Grade Variant Annotator
Sequencing an individual genome typically produces approximately three million variants compared to the human reference genome. The consequence for each variant depends on the location and nature of the variant and is a key question for genetic analysts performing clinical diagnosis. Variant annotation describes how a variant affects the sample's genome. These annotations include the functional consequence on the different transcripts for a gene or in proximal regulatory regions. Annotation also includes additional data on what is known about a given variant that can help in understanding its relevance to a given line of investigation. Often this data is provided by different sources and contain allele frequencies for different populations, clinical implications, relevance to cancer types, additional studies, etc. Ultimately this information helps clinicians interpret variants when providing a diagnosis. The three most widely used open source annotation tools are VEP, SnpEff and AnnoVar. VEP is widely considered to be most accurate of the three, but is also slower than both SnpEff and AnnoVar. When annotating the variants from a 30x genome (NA12878), VEP finished in 18 hours whereas SnpEff 4.3g and AnnoVar finish in 15 min and 67 min respectively using one core. We present Nirvana, an open source clinical variant annotator, that is both accurate (over 99.9% concordance with VEP) and fast (takes 7 min to annotate NA12878). Nirvana is used in all of Illumina's relevant analysis pipelines and is tested rigorously to ensure adherence to clinical standards.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Mapping Free Text into MedDRA by Natural Language Processing: A Modular Approach in Designing and Evaluating Software Extensions Evolving Conformation Paths to Model Protein Structural Transitions Supervised Machine Learning Approaches Predict and Characterize Nanomaterial Exposures: MWCNT Markers in Lung Lavage Fluid. Geometry Analysis for Protein Secondary Structures Matching Problem Geometric Sampling Framework for Exploring Molecular Walker Energetics and Dynamics
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1