{"title":"MOIRE:从多等位基因数据中估算等位基因频率和有效感染倍数的软件包。","authors":"Maxwell Murphy, Bryan Greenhouse","doi":"10.1093/bioinformatics/btae619","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Malaria parasite genetic data can provide insight into parasite phenotypes, evolution, and transmission. However, estimating key parameters such as allele frequencies, multiplicity of infection (MOI), and within-host relatedness from genetic data is challenging, particularly in the presence of multiple related coinfecting strains. Existing methods often rely on single nucleotide polymorphism (SNP) data and do not account for within-host relatedness.</p><p><strong>Results: </strong>We present Multiplicity Of Infection and allele frequency REcovery (MOIRE), a Bayesian approach to estimate allele frequencies, MOI, and within-host relatedness from genetic data subject to experimental error. MOIRE accommodates both polyallelic and SNP data, making it applicable to diverse genotyping panels. We also introduce a novel metric, the effective MOI (eMOI), which integrates MOI and within-host relatedness, providing a robust and interpretable measure of genetic diversity. Extensive simulations and real-world data from a malaria study in Namibia demonstrate the superior performance of MOIRE over naive estimation methods, accurately estimating MOI up to seven with moderate-sized panels of diverse loci (e.g. microhaplotypes). MOIRE also revealed substantial heterogeneity in population mean MOI and mean relatedness across health districts in Namibia, suggesting detectable differences in transmission dynamics. Notably, eMOI emerges as a portable metric of within-host diversity, facilitating meaningful comparisons across settings when allele frequencies or genotyping panels differ. Compared to existing software, MOIRE enables more comprehensive insights into within-host diversity and population structure.</p><p><strong>Availability and implementation: </strong>MOIRE is available as an R package at https://eppicenter.github.io/moire/.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11524891/pdf/","citationCount":"0","resultStr":"{\"title\":\"MOIRE: a software package for the estimation of allele frequencies and effective multiplicity of infection from polyallelic data.\",\"authors\":\"Maxwell Murphy, Bryan Greenhouse\",\"doi\":\"10.1093/bioinformatics/btae619\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Motivation: </strong>Malaria parasite genetic data can provide insight into parasite phenotypes, evolution, and transmission. However, estimating key parameters such as allele frequencies, multiplicity of infection (MOI), and within-host relatedness from genetic data is challenging, particularly in the presence of multiple related coinfecting strains. Existing methods often rely on single nucleotide polymorphism (SNP) data and do not account for within-host relatedness.</p><p><strong>Results: </strong>We present Multiplicity Of Infection and allele frequency REcovery (MOIRE), a Bayesian approach to estimate allele frequencies, MOI, and within-host relatedness from genetic data subject to experimental error. MOIRE accommodates both polyallelic and SNP data, making it applicable to diverse genotyping panels. We also introduce a novel metric, the effective MOI (eMOI), which integrates MOI and within-host relatedness, providing a robust and interpretable measure of genetic diversity. Extensive simulations and real-world data from a malaria study in Namibia demonstrate the superior performance of MOIRE over naive estimation methods, accurately estimating MOI up to seven with moderate-sized panels of diverse loci (e.g. microhaplotypes). MOIRE also revealed substantial heterogeneity in population mean MOI and mean relatedness across health districts in Namibia, suggesting detectable differences in transmission dynamics. Notably, eMOI emerges as a portable metric of within-host diversity, facilitating meaningful comparisons across settings when allele frequencies or genotyping panels differ. Compared to existing software, MOIRE enables more comprehensive insights into within-host diversity and population structure.</p><p><strong>Availability and implementation: </strong>MOIRE is available as an R package at https://eppicenter.github.io/moire/.</p>\",\"PeriodicalId\":93899,\"journal\":{\"name\":\"Bioinformatics (Oxford, England)\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11524891/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinformatics (Oxford, England)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/bioinformatics/btae619\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btae619","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
动机:疟原虫基因数据可以让我们深入了解寄生虫的表型、进化和传播。然而,从遗传数据中估算等位基因频率、感染倍率(MOI)和宿主内相关性等关键参数具有挑战性,尤其是在存在多个相关共感染菌株的情况下。现有方法通常依赖于单核苷酸多态性(SNP)数据,并不考虑宿主内相关性:结果:我们提出了 MOIRE(感染多重性和等位基因频率恢复),这是一种贝叶斯方法,可从受实验误差影响的基因数据中估算等位基因频率、感染多重性和宿主内相关性。MOIRE 同时适用于多等位基因和 SNP 数据,因此适用于不同的基因分型面板。我们还引入了一种新的指标--有效MOI(eMOI),它整合了MOI和宿主内相关性,为遗传多样性提供了一种稳健且可解释的衡量标准。来自纳米比亚疟疾研究的大量模拟和实际数据表明,MOIRE 的性能优于传统的估算方法,它能准确估算出中等规模的不同基因位点(如微组型)的 MOI,最高可达 7。MOIRE 还揭示了纳米比亚各卫生区人口平均 MOI 和平均亲缘关系的巨大异质性,表明在传播动态中存在可检测到的差异。值得注意的是,eMOI 是一种可移植的宿主内多样性指标,在等位基因频率或基因分型面板不同的情况下,便于进行有意义的跨环境比较。与现有软件相比,MOIRE 能够更全面地揭示宿主内多样性和种群结构:MOIRE是一个R软件包,可在https://eppicenter.github.io/moire/.Supplementary:补充数据可在 Bioinformatics online 上获取。
MOIRE: a software package for the estimation of allele frequencies and effective multiplicity of infection from polyallelic data.
Motivation: Malaria parasite genetic data can provide insight into parasite phenotypes, evolution, and transmission. However, estimating key parameters such as allele frequencies, multiplicity of infection (MOI), and within-host relatedness from genetic data is challenging, particularly in the presence of multiple related coinfecting strains. Existing methods often rely on single nucleotide polymorphism (SNP) data and do not account for within-host relatedness.
Results: We present Multiplicity Of Infection and allele frequency REcovery (MOIRE), a Bayesian approach to estimate allele frequencies, MOI, and within-host relatedness from genetic data subject to experimental error. MOIRE accommodates both polyallelic and SNP data, making it applicable to diverse genotyping panels. We also introduce a novel metric, the effective MOI (eMOI), which integrates MOI and within-host relatedness, providing a robust and interpretable measure of genetic diversity. Extensive simulations and real-world data from a malaria study in Namibia demonstrate the superior performance of MOIRE over naive estimation methods, accurately estimating MOI up to seven with moderate-sized panels of diverse loci (e.g. microhaplotypes). MOIRE also revealed substantial heterogeneity in population mean MOI and mean relatedness across health districts in Namibia, suggesting detectable differences in transmission dynamics. Notably, eMOI emerges as a portable metric of within-host diversity, facilitating meaningful comparisons across settings when allele frequencies or genotyping panels differ. Compared to existing software, MOIRE enables more comprehensive insights into within-host diversity and population structure.
Availability and implementation: MOIRE is available as an R package at https://eppicenter.github.io/moire/.