Ivar Grytten, Knut Dagestad Rand, Geir Kjetil Sandve
{"title":"KAGE: fast alignment-free graph-based genotyping of SNPs and short indels.","authors":"Ivar Grytten, Knut Dagestad Rand, Geir Kjetil Sandve","doi":"10.1186/s13059-022-02771-2","DOIUrl":null,"url":null,"abstract":"<p><p>Genotyping is a core application of high-throughput sequencing. We present KAGE, a genotyper for SNPs and short indels that is inspired by recent developments within graph-based genome representations and alignment-free methods. KAGE uses a pan-genome representation of the population to efficiently and accurately predict genotypes. Two novel ideas improve both the speed and accuracy: a Bayesian model incorporates genotypes from thousands of individuals to improve prediction accuracy, and a computationally efficient method leverages correlation between variants. We show that the accuracy of KAGE is at par with the best existing alignment-free genotypers, while being an order of magnitude faster.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"23 1","pages":"209"},"PeriodicalIF":12.3000,"publicationDate":"2022-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9531401/pdf/","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13059-022-02771-2","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
引用次数: 1
Abstract
Genotyping is a core application of high-throughput sequencing. We present KAGE, a genotyper for SNPs and short indels that is inspired by recent developments within graph-based genome representations and alignment-free methods. KAGE uses a pan-genome representation of the population to efficiently and accurately predict genotypes. Two novel ideas improve both the speed and accuracy: a Bayesian model incorporates genotypes from thousands of individuals to improve prediction accuracy, and a computationally efficient method leverages correlation between variants. We show that the accuracy of KAGE is at par with the best existing alignment-free genotypers, while being an order of magnitude faster.
期刊介绍:
Genome Biology is a leading research journal that focuses on the study of biology and biomedicine from a genomic and post-genomic standpoint. The journal consistently publishes outstanding research across various areas within these fields.
With an impressive impact factor of 12.3 (2022), Genome Biology has earned its place as the 3rd highest-ranked research journal in the Genetics and Heredity category, according to Thomson Reuters. Additionally, it is ranked 2nd among research journals in the Biotechnology and Applied Microbiology category. It is important to note that Genome Biology is the top-ranking open access journal in this category.
In summary, Genome Biology sets a high standard for scientific publications in the field, showcasing cutting-edge research and earning recognition among its peers.