Torchtree:使用 PyTorch 进行灵活的系统发生模型开发和推断

Mathieu Fourment, Matthew Macaulay, Christiaan J Swanepoel, Xiang Ji, Marc A Suchard, Frederick A Matsen IV
{"title":"Torchtree:使用 PyTorch 进行灵活的系统发生模型开发和推断","authors":"Mathieu Fourment, Matthew Macaulay, Christiaan J Swanepoel, Xiang Ji, Marc A Suchard, Frederick A Matsen IV","doi":"arxiv-2406.18044","DOIUrl":null,"url":null,"abstract":"Bayesian inference has predominantly relied on the Markov chain Monte Carlo\n(MCMC) algorithm for many years. However, MCMC is computationally laborious,\nespecially for complex phylogenetic models of time trees. This bottleneck has\nled to the search for alternatives, such as variational Bayes, which can scale\nbetter to large datasets. In this paper, we introduce torchtree, a framework\nwritten in Python that allows developers to easily implement rich phylogenetic\nmodels and algorithms using a fixed tree topology. One can either use automatic\ndifferentiation, or leverage torchtree's plug-in system to compute gradients\nanalytically for model components for which automatic differentiation is slow.\nWe demonstrate that the torchtree variational inference framework performs\nsimilarly to BEAST in terms of speed and approximation accuracy. Furthermore,\nwe explore the use of the forward KL divergence as an optimizing criterion for\nvariational inference, which can handle discontinuous and non-differentiable\nmodels. Our experiments show that inference using the forward KL divergence\ntends to be faster per iteration compared to the evidence lower bound (ELBO)\ncriterion, although the ELBO-based inference may converge faster in some cases.\nOverall, torchtree provides a flexible and efficient framework for phylogenetic\nmodel development and inference using PyTorch.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"43 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Torchtree: flexible phylogenetic model development and inference using PyTorch\",\"authors\":\"Mathieu Fourment, Matthew Macaulay, Christiaan J Swanepoel, Xiang Ji, Marc A Suchard, Frederick A Matsen IV\",\"doi\":\"arxiv-2406.18044\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Bayesian inference has predominantly relied on the Markov chain Monte Carlo\\n(MCMC) algorithm for many years. However, MCMC is computationally laborious,\\nespecially for complex phylogenetic models of time trees. This bottleneck has\\nled to the search for alternatives, such as variational Bayes, which can scale\\nbetter to large datasets. In this paper, we introduce torchtree, a framework\\nwritten in Python that allows developers to easily implement rich phylogenetic\\nmodels and algorithms using a fixed tree topology. One can either use automatic\\ndifferentiation, or leverage torchtree's plug-in system to compute gradients\\nanalytically for model components for which automatic differentiation is slow.\\nWe demonstrate that the torchtree variational inference framework performs\\nsimilarly to BEAST in terms of speed and approximation accuracy. Furthermore,\\nwe explore the use of the forward KL divergence as an optimizing criterion for\\nvariational inference, which can handle discontinuous and non-differentiable\\nmodels. Our experiments show that inference using the forward KL divergence\\ntends to be faster per iteration compared to the evidence lower bound (ELBO)\\ncriterion, although the ELBO-based inference may converge faster in some cases.\\nOverall, torchtree provides a flexible and efficient framework for phylogenetic\\nmodel development and inference using PyTorch.\",\"PeriodicalId\":501215,\"journal\":{\"name\":\"arXiv - STAT - Computation\",\"volume\":\"43 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Computation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2406.18044\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.18044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

多年来,贝叶斯推断主要依赖于马尔科夫链蒙特卡罗(MCMC)算法。然而,MCMC 计算起来非常费力,尤其是对于复杂的时间树系统发育模型。这一瓶颈导致人们开始寻找能更好地扩展到大型数据集的替代算法,如变异贝叶斯算法。在本文中,我们介绍了 torchtree,这是一个用 Python 编写的框架,允许开发人员使用固定的树拓扑结构轻松实现丰富的系统发育模型和算法。我们证明了 torchtree 变分推理框架在速度和近似精度方面的表现与 BEAST 相似。此外,我们还探索了使用前向 KL 发散作为变量推理的优化准则,它可以处理不连续和不可微分模型。我们的实验表明,与证据下限(ELBO)准则相比,使用前向 KL 发散进行推理的每次迭代速度更快,尽管基于 ELBO 的推理在某些情况下收敛得更快。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Torchtree: flexible phylogenetic model development and inference using PyTorch
Bayesian inference has predominantly relied on the Markov chain Monte Carlo (MCMC) algorithm for many years. However, MCMC is computationally laborious, especially for complex phylogenetic models of time trees. This bottleneck has led to the search for alternatives, such as variational Bayes, which can scale better to large datasets. In this paper, we introduce torchtree, a framework written in Python that allows developers to easily implement rich phylogenetic models and algorithms using a fixed tree topology. One can either use automatic differentiation, or leverage torchtree's plug-in system to compute gradients analytically for model components for which automatic differentiation is slow. We demonstrate that the torchtree variational inference framework performs similarly to BEAST in terms of speed and approximation accuracy. Furthermore, we explore the use of the forward KL divergence as an optimizing criterion for variational inference, which can handle discontinuous and non-differentiable models. Our experiments show that inference using the forward KL divergence tends to be faster per iteration compared to the evidence lower bound (ELBO) criterion, although the ELBO-based inference may converge faster in some cases. Overall, torchtree provides a flexible and efficient framework for phylogenetic model development and inference using PyTorch.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Model-Embedded Gaussian Process Regression for Parameter Estimation in Dynamical System Effects of the entropy source on Monte Carlo simulations A Robust Approach to Gaussian Processes Implementation HJ-sampler: A Bayesian sampler for inverse problems of a stochastic process by leveraging Hamilton-Jacobi PDEs and score-based generative models Reducing Shape-Graph Complexity with Application to Classification of Retinal Blood Vessels and Neurons
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1