An almost infinite sites model

Theoretical Population Biology · Pub Date: 2024-10-23 · DOI: 10.1016/j.tpb.2024.10.001 · Impact factor: 1.2 · CAS Zone 4 (Biology) · JCR Q4 (Ecology)
Alejandra Avalos-Pacheco, Mathias C. Cronjäger, Paul A. Jenkins, Jotun Hein
Theoretical Population Biology, volume 160, pages 49-61. Full text: https://www.sciencedirect.com/science/article/pii/S0040580924000935
Citations: 0

Abstract

Motivation:

A main challenge in molecular evolution is to find computationally efficient mutation models whose assumptions are flexible enough to reflect genetic variation properly. The infinite sites model assumes that each mutation event occurs at a site that has never mutated before, i.e. it does not allow recurrent mutations. This assumption is reasonable for low mutation rates and makes statistical inference much more tractable. However, recurrent mutations are common enough to be observable in genetic variation data, even in species with low per-site mutation rates such as humans. The finite sites model, on the other hand, allows for recurrent mutations but is computationally infeasible to work with in most cases. In this work, we bridge these two approaches by developing a novel molecular evolution model, the almost infinite sites model (AISM), that both admits recurrent mutations and is tractable. We provide a recursive characterization of the likelihood of our proposed model under complete linkage and outline a parsimonious approximation scheme for computing it.
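
As an illustration of why recurrent mutations can be observable from data, the sketch below applies the classical four-gamete test to binary haplotypes. Under complete linkage and the infinite sites model, no pair of sites can show all four combinations 00, 01, 10 and 11, so any such pair points to at least one recurrent mutation. This is not the authors' method or code: the function name `four_gamete_violations` and the encoding of haplotypes as equal-length strings of '0' and '1' are assumptions made for this example.

```python
from itertools import combinations


def four_gamete_violations(haplotypes):
    """Return the pairs of site indices (i, j) at which all four
    gametes 00, 01, 10 and 11 are present in the sample.

    Assumes every haplotype is an equal-length string over {'0', '1'}
    (hypothetical encoding chosen for this sketch).
    """
    if not haplotypes:
        return []
    n_sites = len(haplotypes[0])
    if any(len(h) != n_sites for h in haplotypes):
        raise ValueError("all haplotypes must have the same length")
    if any(set(h) - {"0", "1"} for h in haplotypes):
        raise ValueError("haplotypes must contain only '0' and '1'")

    violations = []
    for i, j in combinations(range(n_sites), 2):
        # Collect the distinct two-site gametes seen at sites i and j.
        gametes = {(h[i], h[j]) for h in haplotypes}
        if len(gametes) == 4:
            violations.append((i, j))
    return violations


# Compatible with the infinite sites model: no pair shows all four gametes.
print(four_gamete_violations(["000", "100", "110", "001"]))
# Incompatible under complete linkage: sites 0 and 1 show 00, 01, 10, 11.
print(four_gamete_violations(["00", "01", "10", "11"]))
```

An empty result means the sample is consistent with the infinite sites model under complete linkage; a non-empty result is the kind of signal that motivates a model admitting recurrent mutations.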

Results:

We demonstrate the usefulness of our model on simulated and human mitochondrial data. Our results show that the AISM, combined with a constraint on the total number of mutation events, can recover accurate approximations to the maximum likelihood estimator of the mutation rate.

Availability and implementation:

An implementation of our model is freely available along with code for reproducing our computational experiments at https://github.com/Cronjaeger/almost-infinite-sites-recursions.
Source journal
Theoretical Population Biology
Category: Biology - Evolutionary Biology
CiteScore: 2.50
Self-citation rate: 14.30%
Articles published: 43
Review time: 6-12 weeks
Journal description: An interdisciplinary journal, Theoretical Population Biology presents articles on theoretical aspects of the biology of populations, particularly in the areas of demography, ecology, epidemiology, evolution, and genetics. Emphasis is on the development of mathematical theory and models that enhance the understanding of biological phenomena. Articles highlight the motivation and significance of the work for advancing progress in biology, relying on a substantial mathematical effort to obtain biological insight. The journal also presents empirical results and computational and statistical methods directly impinging on theoretical problems in population biology.