{"title":"利用随机优化和方差缩小改进树概率估计","authors":"Tianyu Xie, Musu Yuan, Minghua Deng, Cheng Zhang","doi":"arxiv-2409.05282","DOIUrl":null,"url":null,"abstract":"Probability estimation of tree topologies is one of the fundamental tasks in\nphylogenetic inference. The recently proposed subsplit Bayesian networks (SBNs)\nprovide a powerful probabilistic graphical model for tree topology probability\nestimation by properly leveraging the hierarchical structure of phylogenetic\ntrees. However, the expectation maximization (EM) method currently used for\nlearning SBN parameters does not scale up to large data sets. In this paper, we\nintroduce several computationally efficient methods for training SBNs and show\nthat variance reduction could be the key for better performance. Furthermore,\nwe also introduce the variance reduction technique to improve the optimization\nof SBN parameters for variational Bayesian phylogenetic inference (VBPI).\nExtensive synthetic and real data experiments demonstrate that our methods\noutperform previous baseline methods on the tasks of tree topology probability\nestimation as well as Bayesian phylogenetic inference using SBNs.","PeriodicalId":501340,"journal":{"name":"arXiv - STAT - Machine Learning","volume":"58 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving Tree Probability Estimation with Stochastic Optimization and Variance Reduction\",\"authors\":\"Tianyu Xie, Musu Yuan, Minghua Deng, Cheng Zhang\",\"doi\":\"arxiv-2409.05282\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Probability estimation of tree topologies is one of the fundamental tasks in\\nphylogenetic inference. The recently proposed subsplit Bayesian networks (SBNs)\\nprovide a powerful probabilistic graphical model for tree topology probability\\nestimation by properly leveraging the hierarchical structure of phylogenetic\\ntrees. However, the expectation maximization (EM) method currently used for\\nlearning SBN parameters does not scale up to large data sets. In this paper, we\\nintroduce several computationally efficient methods for training SBNs and show\\nthat variance reduction could be the key for better performance. Furthermore,\\nwe also introduce the variance reduction technique to improve the optimization\\nof SBN parameters for variational Bayesian phylogenetic inference (VBPI).\\nExtensive synthetic and real data experiments demonstrate that our methods\\noutperform previous baseline methods on the tasks of tree topology probability\\nestimation as well as Bayesian phylogenetic inference using SBNs.\",\"PeriodicalId\":501340,\"journal\":{\"name\":\"arXiv - STAT - Machine Learning\",\"volume\":\"58 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.05282\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.05282","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving Tree Probability Estimation with Stochastic Optimization and Variance Reduction
Probability estimation of tree topologies is one of the fundamental tasks in
phylogenetic inference. The recently proposed subsplit Bayesian networks (SBNs)
provide a powerful probabilistic graphical model for tree topology probability
estimation by properly leveraging the hierarchical structure of phylogenetic
trees. However, the expectation maximization (EM) method currently used for
learning SBN parameters does not scale up to large data sets. In this paper, we
introduce several computationally efficient methods for training SBNs and show
that variance reduction could be the key for better performance. Furthermore,
we also introduce the variance reduction technique to improve the optimization
of SBN parameters for variational Bayesian phylogenetic inference (VBPI).
Extensive synthetic and real data experiments demonstrate that our methods
outperform previous baseline methods on the tasks of tree topology probability
estimation as well as Bayesian phylogenetic inference using SBNs.