UNMASC: tumor-only variant calling with unmatched normal controls.

NAR Cancer Pub Date : 2021-10-06 eCollection Date: 2021-12-01 DOI:10.1093/narcan/zcab040
Paul Little, Heejoon Jo, Alan Hoyle, Angela Mazul, Xiaobei Zhao, Ashley H Salazar, Douglas Farquhar, Siddharth Sheth, Maheer Masood, Michele C Hayward, Joel S Parker, Katherine A Hoadley, Jose Zevallos, D Neil Hayes
{"title":"UNMASC: tumor-only variant calling with unmatched normal controls.","authors":"Paul Little,&nbsp;Heejoon Jo,&nbsp;Alan Hoyle,&nbsp;Angela Mazul,&nbsp;Xiaobei Zhao,&nbsp;Ashley H Salazar,&nbsp;Douglas Farquhar,&nbsp;Siddharth Sheth,&nbsp;Maheer Masood,&nbsp;Michele C Hayward,&nbsp;Joel S Parker,&nbsp;Katherine A Hoadley,&nbsp;Jose Zevallos,&nbsp;D Neil Hayes","doi":"10.1093/narcan/zcab040","DOIUrl":null,"url":null,"abstract":"<p><p>Despite years of progress, mutation detection in cancer samples continues to require significant manual review as a final step. Expert review is particularly challenging in cases where tumors are sequenced without matched normal control DNA. Attempts have been made to call somatic point mutations without a matched normal sample by removing well-known germline variants, utilizing unmatched normal controls, and constructing decision rules to classify sequencing errors and private germline variants. With budgetary constraints related to computational and sequencing costs, finding the appropriate number of controls is a crucial step to identifying somatic variants. Our approach utilizes public databases for canonical somatic variants as well as germline variants and leverages information gathered about nearby positions in the normal controls. Drawing from our cohort of targeted capture panel sequencing of tumor and normal samples with varying tumortypes and demographics, these served as a benchmark for our tumor-only variant calling pipeline to observe the relationship between our ability to correctly classify variants against a number of unmatched normals. With our benchmarked samples, approximately ten normal controls were needed to maintain 94% sensitivity, 99% specificity and 76% positive predictive value, far outperforming comparable methods. Our approach, called UNMASC, also serves as a supplement to traditional tumor with matched normal variant calling workflows and can potentially extend to other concerns arising from analyzing next generation sequencing data.</p>","PeriodicalId":18879,"journal":{"name":"NAR Cancer","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8494212/pdf/","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NAR Cancer","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/narcan/zcab040","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/12/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Despite years of progress, mutation detection in cancer samples continues to require significant manual review as a final step. Expert review is particularly challenging in cases where tumors are sequenced without matched normal control DNA. Attempts have been made to call somatic point mutations without a matched normal sample by removing well-known germline variants, utilizing unmatched normal controls, and constructing decision rules to classify sequencing errors and private germline variants. With budgetary constraints related to computational and sequencing costs, finding the appropriate number of controls is a crucial step to identifying somatic variants. Our approach utilizes public databases for canonical somatic variants as well as germline variants and leverages information gathered about nearby positions in the normal controls. Drawing from our cohort of targeted capture panel sequencing of tumor and normal samples with varying tumortypes and demographics, these served as a benchmark for our tumor-only variant calling pipeline to observe the relationship between our ability to correctly classify variants against a number of unmatched normals. With our benchmarked samples, approximately ten normal controls were needed to maintain 94% sensitivity, 99% specificity and 76% positive predictive value, far outperforming comparable methods. Our approach, called UNMASC, also serves as a supplement to traditional tumor with matched normal variant calling workflows and can potentially extend to other concerns arising from analyzing next generation sequencing data.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
UNMASC:只有肿瘤的变种呼叫与不匹配的正常对照。
尽管取得了多年的进展,但癌症样本中的突变检测仍然需要大量的人工审查作为最后一步。在没有匹配的正常对照DNA进行肿瘤测序的情况下,专家审查尤其具有挑战性。通过去除已知的种系变异,利用不匹配的正常对照,以及构建决策规则来分类测序错误和私人种系变异,已经尝试在没有匹配正常样本的情况下调用体细胞点突变。由于与计算和测序成本相关的预算限制,找到适当数量的控制是识别体细胞变异的关键一步。我们的方法利用了典型体细胞变异和种系变异的公共数据库,并利用了在正常对照中收集的有关附近位置的信息。从我们针对不同肿瘤类型和人口统计学的肿瘤和正常样本的靶向捕获面板测序队列中提取,这些作为我们仅肿瘤变体调用管道的基准,以观察我们正确分类变体与许多不匹配的正常之间的关系。对于我们的基准样本,大约需要10个正常对照才能保持94%的灵敏度,99%的特异性和76%的阳性预测值,远远优于同类方法。我们的方法,称为UNMASC,也可以作为传统肿瘤的补充,具有匹配的正常变异调用工作流程,并且可以潜在地扩展到分析下一代测序数据所引起的其他问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Pan-cancer analysis of promoter activity quantitative trait loci Large-scale phenogenomic analysis of human cancers uncovers frequent alterations affecting SMC5/6 complex components in breast cancer. Inhibition of nonsense-mediated mRNA decay reduces the tumorigenicity of human fibrosarcoma cells. CDK2 regulates collapsed replication fork repair in CCNE1-amplified ovarian cancer cells via homologous recombination. Editorial: DNA repair and nucleic acid therapeutics in cancer.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1