Michael K B Ford, Ananth Hari, Qinghui Zhou, Ibrahim Numanagić, S Cenk Sahinalp
{"title":"基于生物学信息的杀伤细胞免疫球蛋白样受体(KIR)基因注释工具。","authors":"Michael K B Ford, Ananth Hari, Qinghui Zhou, Ibrahim Numanagić, S Cenk Sahinalp","doi":"10.1093/bioinformatics/btae622","DOIUrl":null,"url":null,"abstract":"<p><strong>Summary: </strong>Natural killer (NK) cells are essential components of the innate immune system, with their activity significantly regulated by Killer cell Immunoglobulin-like Receptors (KIRs). The diversity and structural complexity of KIR genes present significant challenges for accurate genotyping, essential for understanding NK cell functions and their implications in health and disease. Traditional genotyping methods struggle with the variable nature of KIR genes, leading to inaccuracies that can impede immunogenetic research. These challenges extend to high-quality phased assemblies, which have been recently popularized by the Human Pangenome Consortium. This article introduces BAKIR (Biologically informed Annotator for KIR locus), a tailored computational tool designed to overcome the challenges of KIR genotyping and annotation on high-quality, phased genome assemblies. BAKIR aims to enhance the accuracy of KIR gene annotations by structuring its annotation pipeline around identifying key functional mutations, thereby improving the identification and subsequent relevance of gene and allele calls. It uses a multi-stage mapping, alignment, and variant calling process to ensure high-precision gene and allele identification, while also maintaining high recall for sequences that are significantly mutated or truncated relative to the known allele database. BAKIR has been evaluated on a subset of the HPRC assemblies, where BAKIR was able to improve many of the associated annotations and call novel variants. BAKIR is freely available on GitHub, offering ease of access and use through multiple installation methods, including pip, conda, and singularity container, and is equipped with a user-friendly command-line interface, thereby promoting its adoption in the scientific community.</p><p><strong>Availability and implementation: </strong>BAKIR is available at github.com/algo-cancer/bakir.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11549020/pdf/","citationCount":"0","resultStr":"{\"title\":\"Biologically-informed killer cell immunoglobulin-like receptor gene annotation tool.\",\"authors\":\"Michael K B Ford, Ananth Hari, Qinghui Zhou, Ibrahim Numanagić, S Cenk Sahinalp\",\"doi\":\"10.1093/bioinformatics/btae622\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Summary: </strong>Natural killer (NK) cells are essential components of the innate immune system, with their activity significantly regulated by Killer cell Immunoglobulin-like Receptors (KIRs). The diversity and structural complexity of KIR genes present significant challenges for accurate genotyping, essential for understanding NK cell functions and their implications in health and disease. Traditional genotyping methods struggle with the variable nature of KIR genes, leading to inaccuracies that can impede immunogenetic research. These challenges extend to high-quality phased assemblies, which have been recently popularized by the Human Pangenome Consortium. This article introduces BAKIR (Biologically informed Annotator for KIR locus), a tailored computational tool designed to overcome the challenges of KIR genotyping and annotation on high-quality, phased genome assemblies. BAKIR aims to enhance the accuracy of KIR gene annotations by structuring its annotation pipeline around identifying key functional mutations, thereby improving the identification and subsequent relevance of gene and allele calls. It uses a multi-stage mapping, alignment, and variant calling process to ensure high-precision gene and allele identification, while also maintaining high recall for sequences that are significantly mutated or truncated relative to the known allele database. BAKIR has been evaluated on a subset of the HPRC assemblies, where BAKIR was able to improve many of the associated annotations and call novel variants. BAKIR is freely available on GitHub, offering ease of access and use through multiple installation methods, including pip, conda, and singularity container, and is equipped with a user-friendly command-line interface, thereby promoting its adoption in the scientific community.</p><p><strong>Availability and implementation: </strong>BAKIR is available at github.com/algo-cancer/bakir.</p>\",\"PeriodicalId\":93899,\"journal\":{\"name\":\"Bioinformatics (Oxford, England)\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11549020/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinformatics (Oxford, England)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/bioinformatics/btae622\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btae622","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
摘要:自然杀伤(NK)细胞是先天性免疫系统的重要组成部分,其活性受杀伤细胞免疫球蛋白样受体(KIR)的重要调节。KIR 基因的多样性和结构复杂性给准确的基因分型带来了巨大挑战,而准确的基因分型对于了解 NK 细胞的功能及其对健康和疾病的影响至关重要。传统的基因分型方法难以应对 KIR 基因的多变性,从而导致不准确性,阻碍了免疫遗传学的研究。这些挑战延伸到了高质量的分阶段组装,最近人类泛基因组联盟(Human Pangenome Consortium)推广了这种组装方法。本文介绍了 BAKIR(Biologically-informed Annotator for KIR locus),这是一种量身定制的计算工具,旨在克服在高质量分阶段基因组组装上进行 KIR 基因分型和注释所面临的挑战。BAKIR 的目标是通过围绕识别关键功能突变来构建其注释管道,从而提高 KIR 基因注释的准确性,从而改善基因和等位基因调用的识别和后续相关性。它采用多阶段映射、比对和变异调用过程,确保高精度的基因和等位基因鉴定,同时还能对相对于已知等位基因数据库有明显突变或截断的序列保持较高的召回率。BAKIR 已在 HPRC 集合的一个子集上进行了评估,BAKIR 能够改进许多相关注释并调用新的变异。BAKIR 可在 GitHub 上免费获取,通过多种安装方法(包括 pip、conda 和 singularity container)轻松访问和使用,并配备了用户友好的命令行界面,从而促进了其在科学界的应用:BAKIR 可在 github.com/algo-cancer/bakir 上获取:补充数据可在 Bioinformatics online 上获取。
Summary: Natural killer (NK) cells are essential components of the innate immune system, with their activity significantly regulated by Killer cell Immunoglobulin-like Receptors (KIRs). The diversity and structural complexity of KIR genes present significant challenges for accurate genotyping, essential for understanding NK cell functions and their implications in health and disease. Traditional genotyping methods struggle with the variable nature of KIR genes, leading to inaccuracies that can impede immunogenetic research. These challenges extend to high-quality phased assemblies, which have been recently popularized by the Human Pangenome Consortium. This article introduces BAKIR (Biologically informed Annotator for KIR locus), a tailored computational tool designed to overcome the challenges of KIR genotyping and annotation on high-quality, phased genome assemblies. BAKIR aims to enhance the accuracy of KIR gene annotations by structuring its annotation pipeline around identifying key functional mutations, thereby improving the identification and subsequent relevance of gene and allele calls. It uses a multi-stage mapping, alignment, and variant calling process to ensure high-precision gene and allele identification, while also maintaining high recall for sequences that are significantly mutated or truncated relative to the known allele database. BAKIR has been evaluated on a subset of the HPRC assemblies, where BAKIR was able to improve many of the associated annotations and call novel variants. BAKIR is freely available on GitHub, offering ease of access and use through multiple installation methods, including pip, conda, and singularity container, and is equipped with a user-friendly command-line interface, thereby promoting its adoption in the scientific community.
Availability and implementation: BAKIR is available at github.com/algo-cancer/bakir.