Evaluating probabilistic genotyping for low-pass DNA sequencing

Sammed N. Mandape , Kapema Bupe Kapema , Tiffany Duque , Amy Smuts , Jonathan L. King , Benjamin Crysup , Jianye Ge , Bruce Budowle , August E. Woerner
{"title":"Evaluating probabilistic genotyping for low-pass DNA sequencing","authors":"Sammed N. Mandape ,&nbsp;Kapema Bupe Kapema ,&nbsp;Tiffany Duque ,&nbsp;Amy Smuts ,&nbsp;Jonathan L. King ,&nbsp;Benjamin Crysup ,&nbsp;Jianye Ge ,&nbsp;Bruce Budowle ,&nbsp;August E. Woerner","doi":"10.1016/j.fsigss.2022.10.001","DOIUrl":null,"url":null,"abstract":"<div><p>Most genomic methods consider the sample genotype. Data are evaluated at some location, and if the signal strength is sufficient, a genotype call is made. Conversely, sites that lack sufficient signal are treated as missing data. Such methods for genotype calling are binary, and this dichotomy limits genomic analyses to relatively high-coverage (and high-cost) massively parallel sequencing (MPS) data. It follows that bioinformatic methods that rely on genotypes may not be ideal for trace DNA samples, such as those sometimes encountered in forensic investigations, but even when applicable such analyses can be expensive. However, there are some genomic analyses where having many uncertain genotypes (with measured uncertainty) assayed over the entirety of the genome may be more powerful than current multi-locus approaches that consider a limited number of well-characterized markers. Methods for such problems may rely on genotype likelihood, which expresses the likelihood of alternative genotype calls in addition to the most likely call. One application that can benefit from genotype likelihoods is kinship analysis. NgsRelate is a bioinformatic tool that infers pairwise relatedness using a probabilistic genotyping framework, which accommodates the uncertainty associated with genotype calls for low-pass MPS data. Here, NgsRelate was used to infer kinship coefficients from low-pass whole genome sequencing data from a known pedigree. Multiple samples in a titration series (ranging from 50 ng to 0.5 ng) on a single MPS S4 flow cell were assessed. A reproducible scientific bioinformatic workflow was developed to evaluate kinship coefficients considering up to 3rd degree relatives. NgsRelate was found to provide robust assessments of kinship. Further, the use of low-pass MPS data provides a more cost-effective way to conduct forensic investigations.</p></div>","PeriodicalId":56262,"journal":{"name":"Forensic Science International: Genetics Supplement Series","volume":"8 ","pages":"Pages 112-114"},"PeriodicalIF":0.5000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1875176822000403/pdfft?md5=0847e9204784eab5cfa832f2a0e3c290&pid=1-s2.0-S1875176822000403-main.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Forensic Science International: Genetics Supplement Series","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1875176822000403","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 1

Abstract

Most genomic methods consider the sample genotype. Data are evaluated at some location, and if the signal strength is sufficient, a genotype call is made. Conversely, sites that lack sufficient signal are treated as missing data. Such methods for genotype calling are binary, and this dichotomy limits genomic analyses to relatively high-coverage (and high-cost) massively parallel sequencing (MPS) data. It follows that bioinformatic methods that rely on genotypes may not be ideal for trace DNA samples, such as those sometimes encountered in forensic investigations, but even when applicable such analyses can be expensive. However, there are some genomic analyses where having many uncertain genotypes (with measured uncertainty) assayed over the entirety of the genome may be more powerful than current multi-locus approaches that consider a limited number of well-characterized markers. Methods for such problems may rely on genotype likelihood, which expresses the likelihood of alternative genotype calls in addition to the most likely call. One application that can benefit from genotype likelihoods is kinship analysis. NgsRelate is a bioinformatic tool that infers pairwise relatedness using a probabilistic genotyping framework, which accommodates the uncertainty associated with genotype calls for low-pass MPS data. Here, NgsRelate was used to infer kinship coefficients from low-pass whole genome sequencing data from a known pedigree. Multiple samples in a titration series (ranging from 50 ng to 0.5 ng) on a single MPS S4 flow cell were assessed. A reproducible scientific bioinformatic workflow was developed to evaluate kinship coefficients considering up to 3rd degree relatives. NgsRelate was found to provide robust assessments of kinship. Further, the use of low-pass MPS data provides a more cost-effective way to conduct forensic investigations.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
评估低通DNA测序的概率基因分型
大多数基因组方法都考虑样本基因型。在某个位置评估数据,如果信号强度足够,则进行基因型调用。相反,缺乏足够信号的位点被视为缺失数据。这种基因型调用方法是二元的,这种二分法将基因组分析限制在相对高覆盖率(和高成本)的大规模平行测序(MPS)数据上。因此,依赖基因型的生物信息学方法可能不适合痕量DNA样本,例如法医调查中有时遇到的方法,但即使在适用的情况下,这种分析也可能很昂贵。然而,在一些基因组分析中,在整个基因组中分析许多不确定的基因型(具有测量的不确定性)可能比目前考虑有限数量的良好表征标记的多基因座方法更有效。解决此类问题的方法可能依赖于基因型可能性,该可能性表示除了最可能的呼叫之外,还有其他基因型呼叫的可能性。从基因型可能性中受益的一个应用是亲属关系分析。NgsLate是一种生物信息学工具,使用概率基因分型框架推断成对相关性,该框架适应了与基因型相关的不确定性,需要低通MPS数据。在这里,NgsLate被用来从已知谱系的低通全基因组测序数据中推断亲缘系数。在单个MPS S4流动池上评估滴定系列中的多个样品(范围从50ng到0.5ng)。开发了一种可重复的科学生物信息学工作流程,以评估考虑三级亲属的亲属关系系数。NgsLate被发现可以提供强有力的亲属关系评估。此外,低通MPS数据的使用为进行法医调查提供了一种更具成本效益的方式。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Forensic Science International: Genetics Supplement Series
Forensic Science International: Genetics Supplement Series Medicine-Pathology and Forensic Medicine
CiteScore
0.40
自引率
0.00%
发文量
122
审稿时长
25 days
期刊介绍: The Journal of Forensic Science International Genetics Supplement Series is the perfect publication vehicle for the proceedings of a scientific symposium, commissioned thematic issues, or for disseminating a selection of invited articles. The Forensic Science International Genetics Supplement Series is part of a duo of publications on forensic genetics, published by Elsevier on behalf of the International Society for Forensic Genetics.
期刊最新文献
The collapse of an Italian cemetery into the sea: Forensic approach to human remains identification Examination of pretreatment methods for DNA extraction from nails Evaluating probabilistic genotyping for low-pass DNA sequencing The ForAPP: Forensic Ancestry Prediction Pipeline for the interpretation of ancestry informative markers Whole-genome sequencing of degraded DNA for investigative genetic genealogy
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1