Using bioinformatics to investigate functional diversity: a case study of MHC diversity in koalas.

IF 2.9 4区 医学 Q2 GENETICS & HEREDITY Immunogenetics Pub Date : 2024-12-01 Epub Date: 2024-10-05 DOI:10.1007/s00251-024-01356-6
Luke W Silver, Elspeth A McLennan, Julian Beaman, Karen Burke da Silva, Peter Timms, Carolyn J Hogg, Katherine Belov
{"title":"Using bioinformatics to investigate functional diversity: a case study of MHC diversity in koalas.","authors":"Luke W Silver, Elspeth A McLennan, Julian Beaman, Karen Burke da Silva, Peter Timms, Carolyn J Hogg, Katherine Belov","doi":"10.1007/s00251-024-01356-6","DOIUrl":null,"url":null,"abstract":"<p><p>Conservation genomics can greatly improve conservation outcomes of threatened populations, including those impacted by disease. Understanding diversity within immune gene families, including the major histocompatibility complex (MHC) and toll-like receptors (TLR), is important due to the role they play in disease resilience and susceptibility. With recent advancements in sequencing technologies and bioinformatic tools, the cost of generating high-quality sequence data has significantly decreased and made it possible to investigate diversity across entire gene families in large numbers of individuals compared to investigating only a few genes or a few populations previously. Here, we use the koala as a case study for investigating functional diversity across populations. We utilised previous target enrichment data and 438 whole genomes to firstly, determine the level of sequencing depth required to investigate MHC diversity and, secondly, determine the current level of diversity in MHC genes in koala populations. We determined for low complexity, conserved genes such as TLR genes 10 × sequencing depth is sufficient to reliably genotype more than 90% of variants, whereas for complex genes such as the MHC greater than 20 × and preferably 30 × sequencing depth is required. We used whole genome data to identify 270 biallelic SNPs across 24 MHC genes as well as copy number variation (CNV) within class I and class II genes and conduct supertype analysis. Overall, we have provided a bioinformatic workflow for investigating variation in a complex immune gene family from whole genome sequencing data and determined current levels of diversity within koala MHC genes.</p>","PeriodicalId":13446,"journal":{"name":"Immunogenetics","volume":" ","pages":"381-395"},"PeriodicalIF":2.9000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11496358/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Immunogenetics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00251-024-01356-6","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/5 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Conservation genomics can greatly improve conservation outcomes of threatened populations, including those impacted by disease. Understanding diversity within immune gene families, including the major histocompatibility complex (MHC) and toll-like receptors (TLR), is important due to the role they play in disease resilience and susceptibility. With recent advancements in sequencing technologies and bioinformatic tools, the cost of generating high-quality sequence data has significantly decreased and made it possible to investigate diversity across entire gene families in large numbers of individuals compared to investigating only a few genes or a few populations previously. Here, we use the koala as a case study for investigating functional diversity across populations. We utilised previous target enrichment data and 438 whole genomes to firstly, determine the level of sequencing depth required to investigate MHC diversity and, secondly, determine the current level of diversity in MHC genes in koala populations. We determined for low complexity, conserved genes such as TLR genes 10 × sequencing depth is sufficient to reliably genotype more than 90% of variants, whereas for complex genes such as the MHC greater than 20 × and preferably 30 × sequencing depth is required. We used whole genome data to identify 270 biallelic SNPs across 24 MHC genes as well as copy number variation (CNV) within class I and class II genes and conduct supertype analysis. Overall, we have provided a bioinformatic workflow for investigating variation in a complex immune gene family from whole genome sequencing data and determined current levels of diversity within koala MHC genes.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用生物信息学研究功能多样性:考拉的 MHC 多样性案例研究。
保护基因组学可以大大改善受威胁种群(包括受疾病影响的种群)的保护成果。了解免疫基因家族(包括主要组织相容性复合体(MHC)和收费样受体(TLR))的多样性非常重要,因为它们在疾病的恢复力和易感性方面发挥着重要作用。随着最近测序技术和生物信息学工具的进步,生成高质量序列数据的成本大大降低,这使得研究大量个体整个基因家族的多样性成为可能,而以前只能研究几个基因或几个种群。在这里,我们以考拉为案例研究跨种群的功能多样性。我们利用之前的目标富集数据和 438 个全基因组,首先确定了研究 MHC 多样性所需的测序深度,其次确定了考拉种群中 MHC 基因目前的多样性水平。我们确定,对于 TLR 基因等低复杂度的保守基因,10 倍的测序深度足以可靠地对 90% 以上的变异基因进行基因分型,而对于 MHC 等复杂基因,测序深度需要大于 20 倍,最好是 30 倍。我们利用全基因组数据鉴定了 24 个 MHC 基因中的 270 个双侧 SNP,以及 I 类和 II 类基因中的拷贝数变异 (CNV),并进行了超级类型分析。总之,我们提供了一种生物信息学工作流程,可利用全基因组测序数据研究复杂免疫基因家族的变异,并确定考拉 MHC 基因目前的多样性水平。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Immunogenetics
Immunogenetics 医学-免疫学
CiteScore
6.20
自引率
6.20%
发文量
48
审稿时长
1 months
期刊介绍: Immunogenetics publishes original papers, brief communications, and reviews on research in the following areas: genetics and evolution of the immune system; genetic control of immune response and disease susceptibility; bioinformatics of the immune system; structure of immunologically important molecules; and immunogenetics of reproductive biology, tissue differentiation, and development.
期刊最新文献
The sufficiency of genetic diagnosis in managing patients with inborn errors of immunity during prenatal care and childbearing. Bioinformatic analysis predicts the regulatory function of noncoding SNPs associated with Long COVID-19 syndrome. Decoding the genetic landscape of juvenile dermatomyositis: insights from phosphorylation-associated single nucleotide polymorphisms. The characteristic of HLA-A, HLA-B, HLA-C, HLA-DRB1, HLA-DRB3/4/5, HLA-DQA1, HLA-DQB1, HLA-DPA1, and HLA-DPB1 alleles in Zhejiang Han population. The effect of circulating cytokines on the risk of systemic lupus erythematosus: Mendelian randomization and observational study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1