快速拟合系统发育混合效应模型

Bert van der Veen, Robert Brian O'Hara
{"title":"快速拟合系统发育混合效应模型","authors":"Bert van der Veen, Robert Brian O'Hara","doi":"arxiv-2408.05333","DOIUrl":null,"url":null,"abstract":"Mixed effects models are among the most commonly used statistical methods for\nthe exploration of multispecies data. In recent years, also Joint Species\nDistribution Models and Generalized Linear Latent Variale Models have gained in\npopularity when the goal is to incorporate residual covariation between species\nthat cannot be explained due to measured environmental covariates. Few software\nimplementations of such models exist that can additionally incorporate\nphylogenetic information, and those that exist tend to utilize Markov chain\nMonte Carlo methods for estimation, so that model fitting takes a long time. In\nthis article we develop new methods for quickly and flexibly fitting\nphylogenetic mixed models, potentially incorporating residual covariation\nbetween species using latent variables, with the possibility to estimate the\nstrength of phylogenetic structuring in species responses per environmental\ncovariate, and while incorporating correlation between different covariate\neffects. By combining Variational approximations with a reduced rank matrix\nnormal covariance structure, Nearest Neighbours Gaussian Processes, and\nparallel computation, phylogenetic mixed models can be fitted much more quickly\nthan the current state-of-the-art. Two simulation studies demonstrate that the\nproposed combination of approximations is not only fast, but also enjoys high\naccuracy. Finally, we demonstrate use of the method with a real world dataset\nof wood-decaying fungi.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"73 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fast fitting of phylogenetic mixed effects models\",\"authors\":\"Bert van der Veen, Robert Brian O'Hara\",\"doi\":\"arxiv-2408.05333\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Mixed effects models are among the most commonly used statistical methods for\\nthe exploration of multispecies data. In recent years, also Joint Species\\nDistribution Models and Generalized Linear Latent Variale Models have gained in\\npopularity when the goal is to incorporate residual covariation between species\\nthat cannot be explained due to measured environmental covariates. Few software\\nimplementations of such models exist that can additionally incorporate\\nphylogenetic information, and those that exist tend to utilize Markov chain\\nMonte Carlo methods for estimation, so that model fitting takes a long time. In\\nthis article we develop new methods for quickly and flexibly fitting\\nphylogenetic mixed models, potentially incorporating residual covariation\\nbetween species using latent variables, with the possibility to estimate the\\nstrength of phylogenetic structuring in species responses per environmental\\ncovariate, and while incorporating correlation between different covariate\\neffects. By combining Variational approximations with a reduced rank matrix\\nnormal covariance structure, Nearest Neighbours Gaussian Processes, and\\nparallel computation, phylogenetic mixed models can be fitted much more quickly\\nthan the current state-of-the-art. Two simulation studies demonstrate that the\\nproposed combination of approximations is not only fast, but also enjoys high\\naccuracy. Finally, we demonstrate use of the method with a real world dataset\\nof wood-decaying fungi.\",\"PeriodicalId\":501215,\"journal\":{\"name\":\"arXiv - STAT - Computation\",\"volume\":\"73 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Computation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.05333\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.05333","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

混合效应模型是探索多物种数据最常用的统计方法之一。近年来,当目标是纳入物种间的残差时,联合物种分布模型和广义线性潜变量模型也越来越受欢迎。现有的此类模型软件很少能够额外纳入系统发育信息,而且现有的软件往往使用马尔可夫链蒙特卡罗方法进行估计,因此模型拟合需要很长时间。在这篇文章中,我们开发了快速灵活拟合系统发育混合模型的新方法,有可能利用潜变量纳入物种间的残差协变,并有可能估计每个环境变量在物种响应中的系统发育结构强度,同时纳入不同协变效应之间的相关性。通过将变分近似与秩矩阵正态协方差结构、近邻高斯过程和并行计算相结合,系统发育混合模型的拟合速度远远超过目前的先进水平。两项模拟研究表明,建议的近似值组合不仅速度快,而且精度高。最后,我们展示了该方法在实际木材腐朽真菌数据集中的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Fast fitting of phylogenetic mixed effects models
Mixed effects models are among the most commonly used statistical methods for the exploration of multispecies data. In recent years, also Joint Species Distribution Models and Generalized Linear Latent Variale Models have gained in popularity when the goal is to incorporate residual covariation between species that cannot be explained due to measured environmental covariates. Few software implementations of such models exist that can additionally incorporate phylogenetic information, and those that exist tend to utilize Markov chain Monte Carlo methods for estimation, so that model fitting takes a long time. In this article we develop new methods for quickly and flexibly fitting phylogenetic mixed models, potentially incorporating residual covariation between species using latent variables, with the possibility to estimate the strength of phylogenetic structuring in species responses per environmental covariate, and while incorporating correlation between different covariate effects. By combining Variational approximations with a reduced rank matrix normal covariance structure, Nearest Neighbours Gaussian Processes, and parallel computation, phylogenetic mixed models can be fitted much more quickly than the current state-of-the-art. Two simulation studies demonstrate that the proposed combination of approximations is not only fast, but also enjoys high accuracy. Finally, we demonstrate use of the method with a real world dataset of wood-decaying fungi.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Model-Embedded Gaussian Process Regression for Parameter Estimation in Dynamical System Effects of the entropy source on Monte Carlo simulations A Robust Approach to Gaussian Processes Implementation HJ-sampler: A Bayesian sampler for inverse problems of a stochastic process by leveraging Hamilton-Jacobi PDEs and score-based generative models Reducing Shape-Graph Complexity with Application to Classification of Retinal Blood Vessels and Neurons
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1