首页 > 最新文献

Systematic Biology最新文献

英文 中文
Cophylogeny Reconstruction Allowing for Multiple Associations Through Approximate Bayesian Computation. 通过近似贝叶斯计算重建允许多重关联的同源关系
IF 6.5 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2023-12-30 DOI: 10.1093/sysbio/syad058
Blerina Sinaimeri, Laura Urbini, Marie-France Sagot, Catherine Matias

Phylogenetic tree reconciliation is extensively employed for the examination of coevolution between host and symbiont species. An important concern is the requirement for dependable cost values when selecting event-based parsimonious reconciliation. Although certain approaches deduce event probabilities unique to each pair of host and symbiont trees, which can subsequently be converted into cost values, a significant limitation lies in their inability to model the invasion of diverse host species by the same symbiont species (termed as a spread event), which is believed to occur in symbiotic relationships. Invasions lead to the observation of multiple associations between symbionts and their hosts (indicating that a symbiont is no longer exclusive to a single host), which are incompatible with the existing methods of coevolution. Here, we present a method called AmoCoala (an enhanced version of the tool Coala) that provides a more realistic estimation of cophylogeny event probabilities for a given pair of host and symbiont trees, even in the presence of spread events. We expand the classical 4-event coevolutionary model to include 2 additional outcomes, vertical and horizontal spreads, that lead to multiple associations. In the initial step, we estimate the probabilities of spread events using heuristic frequencies. Subsequently, in the second step, we employ an approximate Bayesian computation approach to infer the probabilities of the remaining 4 classical events (cospeciation, duplication, host switch, and loss) based on these values. By incorporating spread events, our reconciliation model enables a more accurate consideration of multiple associations. This improvement enhances the precision of estimated cost sets, paving the way to a more reliable reconciliation of host and symbiont trees. To validate our method, we conducted experiments on synthetic datasets and demonstrated its efficacy using real-world examples. Our results showcase that AmoCoala produces biologically plausible reconciliation scenarios, further emphasizing its effectiveness.

在研究宿主与共生物种之间的共同进化时,系统发生树调和被广泛采用。一个重要的问题是,在选择基于事件的拟和时,需要可靠的成本值。虽然某些方法可以推导出每对宿主树和共生体树所特有的事件概率,并可将其转换为成本值,但它们的一个重要局限在于无法模拟同一共生体物种入侵不同宿主物种的情况(称为扩散事件),而这种情况被认为会在共生关系中发生。入侵会导致观察到共生体与其宿主之间的多重关联(表明共生体不再专属于单一宿主),这与现有的共同进化方法不相容。在这里,我们提出了一种名为 AmoCoala 的方法(Coala 工具的增强版),即使在存在传播事件的情况下,该方法也能对给定的一对宿主树和共生体树的共生事件概率进行更现实的估计。我们扩展了经典的 4 事件协同进化模型,增加了纵向和横向传播这两种导致多重关联的结果。第一步,我们使用启发式频率估计传播事件的概率。随后,在第二步中,我们采用近似贝叶斯计算方法,根据这些值推断出其余 4 个经典事件(共生、复制、宿主转换和丢失)的概率。通过纳入扩散事件,我们的调和模型能够更准确地考虑多重关联。这一改进提高了估计成本集的精确度,为更可靠地调节宿主树和共生体树铺平了道路。为了验证我们的方法,我们在合成数据集上进行了实验,并用实际例子证明了它的功效。我们的结果表明,AmoCoala 能产生生物学上可信的调和方案,进一步强调了它的有效性。
{"title":"Cophylogeny Reconstruction Allowing for Multiple Associations Through Approximate Bayesian Computation.","authors":"Blerina Sinaimeri, Laura Urbini, Marie-France Sagot, Catherine Matias","doi":"10.1093/sysbio/syad058","DOIUrl":"10.1093/sysbio/syad058","url":null,"abstract":"<p><p>Phylogenetic tree reconciliation is extensively employed for the examination of coevolution between host and symbiont species. An important concern is the requirement for dependable cost values when selecting event-based parsimonious reconciliation. Although certain approaches deduce event probabilities unique to each pair of host and symbiont trees, which can subsequently be converted into cost values, a significant limitation lies in their inability to model the invasion of diverse host species by the same symbiont species (termed as a spread event), which is believed to occur in symbiotic relationships. Invasions lead to the observation of multiple associations between symbionts and their hosts (indicating that a symbiont is no longer exclusive to a single host), which are incompatible with the existing methods of coevolution. Here, we present a method called AmoCoala (an enhanced version of the tool Coala) that provides a more realistic estimation of cophylogeny event probabilities for a given pair of host and symbiont trees, even in the presence of spread events. We expand the classical 4-event coevolutionary model to include 2 additional outcomes, vertical and horizontal spreads, that lead to multiple associations. In the initial step, we estimate the probabilities of spread events using heuristic frequencies. Subsequently, in the second step, we employ an approximate Bayesian computation approach to infer the probabilities of the remaining 4 classical events (cospeciation, duplication, host switch, and loss) based on these values. By incorporating spread events, our reconciliation model enables a more accurate consideration of multiple associations. This improvement enhances the precision of estimated cost sets, paving the way to a more reliable reconciliation of host and symbiont trees. To validate our method, we conducted experiments on synthetic datasets and demonstrated its efficacy using real-world examples. Our results showcase that AmoCoala produces biologically plausible reconciliation scenarios, further emphasizing its effectiveness.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10579267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nucleotide Substitution Model Selection Is Not Necessary for Bayesian Inference of Phylogeny With Well-Behaved Priors. 核苷酸替换模型选择对具有良好先验的系统发育贝叶斯推断并非必要。
IF 6.5 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2023-12-30 DOI: 10.1093/sysbio/syad041
Luiza Guimarães Fabreti, Sebastian Höhna

Model selection aims to choose the most adequate model for the statistical analysis at hand. The model must be complex enough to capture the complexity of the data but should be simple enough not to overfit. In phylogenetics, the most common model selection scenario concerns selecting an adequate substitution and partition model for sequence evolution to infer a phylogenetic tree. Previously, several studies showed that substitution model under-parameterization can bias phylogenetic studies. Here, we explored the impact of substitution model over-parameterization in a Bayesian statistical framework. We performed simulations under the simplest substitution model, the Jukes-Cantor model, and compare posterior estimates of phylogenetic tree topologies and tree length under the true model to the most complex model, the $text{GTR}+Gamma+text{I}$ substitution model, including over-splitting the data into additional subsets (i.e., applying partitioned models). We explored 4 choices of prior distributions: the default substitution model priors of MrBayes, BEAST2, and RevBayes and a newly devised prior choice (Tame). Our results show that Bayesian inference of phylogeny is robust to substitution model over-parameterization and over-partitioning but only under our new prior settings. All 3 current default priors introduced biases for the estimated tree length. We conclude that substitution and partition model selection are superfluous steps in Bayesian phylogenetic inference pipelines if well-behaved prior distributions are applied and more effort should focus on more complex and biologically realistic substitution models.

模型选择的目的是为当前的统计分析选择最合适的模型。模型必须足够复杂,以捕捉数据的复杂性,但也应足够简单,不至于过度拟合。在系统发育学中,最常见的模型选择情况是为序列进化选择一个合适的替换和分割模型,以推断系统发生树。之前的一些研究表明,替换模型参数化不足会使系统发育研究出现偏差。在此,我们在贝叶斯统计框架下探讨了替换模型参数过高的影响。我们在最简单的替换模型--Jukes-Cantor模型下进行了模拟,并比较了真实模型与最复杂模型--$text{GTR}+Gamma+text{I}$替换模型下的系统发生树拓扑和树长的后验估计值,包括将数据过度分割成额外的子集(即应用分区模型)。我们探索了 4 种先验分布选择:MrBayes、BEAST2 和 RevBayes 的默认替换模型先验,以及一种新设计的先验选择(Tame)。我们的研究结果表明,贝叶斯系统发育推断对替代模型过度参数化和过度分区具有鲁棒性,但只有在我们新的先验设置下才具有这种鲁棒性。目前所有 3 个默认先验都会对估计的树长产生偏差。我们的结论是,如果应用良好的先验分布,替代和分区模型选择是贝叶斯系统发育推断流水线中多余的步骤,更多的精力应集中在更复杂和更符合生物现实的替代模型上。
{"title":"Nucleotide Substitution Model Selection Is Not Necessary for Bayesian Inference of Phylogeny With Well-Behaved Priors.","authors":"Luiza Guimarães Fabreti, Sebastian Höhna","doi":"10.1093/sysbio/syad041","DOIUrl":"10.1093/sysbio/syad041","url":null,"abstract":"<p><p>Model selection aims to choose the most adequate model for the statistical analysis at hand. The model must be complex enough to capture the complexity of the data but should be simple enough not to overfit. In phylogenetics, the most common model selection scenario concerns selecting an adequate substitution and partition model for sequence evolution to infer a phylogenetic tree. Previously, several studies showed that substitution model under-parameterization can bias phylogenetic studies. Here, we explored the impact of substitution model over-parameterization in a Bayesian statistical framework. We performed simulations under the simplest substitution model, the Jukes-Cantor model, and compare posterior estimates of phylogenetic tree topologies and tree length under the true model to the most complex model, the $text{GTR}+Gamma+text{I}$ substitution model, including over-splitting the data into additional subsets (i.e., applying partitioned models). We explored 4 choices of prior distributions: the default substitution model priors of MrBayes, BEAST2, and RevBayes and a newly devised prior choice (Tame). Our results show that Bayesian inference of phylogeny is robust to substitution model over-parameterization and over-partitioning but only under our new prior settings. All 3 current default priors introduced biases for the estimated tree length. We conclude that substitution and partition model selection are superfluous steps in Bayesian phylogenetic inference pipelines if well-behaved prior distributions are applied and more effort should focus on more complex and biologically realistic substitution models.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10157050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Artefactual Branch Effect and Phylogenetic Conflict: Species Delimitation with Gene Flow in Mangrove Pit Vipers (Trimeresurus purpureomaculatus-erythrurus Complex). 人工分支效应与系统发育冲突:红树林蝮蛇(Trimeresurus purpureomaculatus-erythrurus Complex)基因流动的物种划分。
IF 6.5 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2023-12-30 DOI: 10.1093/sysbio/syad043
Kin Onn Chan, Daniel G Mulcahy, Shahrul Anuar

Mangrove pit vipers of the Trimeresurus purpureomaculatus-erythrurus complex are the only species of viper known to naturally inhabit mangroves. Despite serving integral ecological functions in mangrove ecosystems, the evolutionary history, distribution, and species boundaries of mangrove pit vipers remain poorly understood, partly due to overlapping distributions, confusing phenotypic variations, and the lack of focused studies. Here, we present the first genomic study on mangrove pit vipers and introduce a robust hypothesis-driven species delimitation framework that considers gene flow and phylogenetic uncertainty in conjunction with a novel application of a new class of speciation-based delimitation model implemented through the program Delineate. Our results showed that gene flow produced phylogenetic conflict in our focal species and substantiates the artefactual branch effect where highly admixed populations appear as divergent nonmonophyletic lineages arranged in a stepwise manner at the basal position of clades. Despite the confounding effects of gene flow, we were able to obtain unequivocal support for the recognition of a new species based on the intersection and congruence of multiple lines of evidence. This study demonstrates that an integrative hypothesis-driven approach predicated on the consideration of multiple plausible evolutionary histories, population structure/differentiation, gene flow, and the implementation of a speciation-based delimitation model can effectively delimit species in the presence of gene flow and phylogenetic conflict.

红树林蝮蛇(Trimeresurus purpureomaculatus-erythrurus complex)是已知唯一自然栖息于红树林的蝮蛇物种。尽管在红树林生态系统中发挥着不可或缺的生态功能,但人们对红树林蝮蛇的进化史、分布和物种界限仍然知之甚少,部分原因是分布重叠、表型变异混乱以及缺乏重点研究。在本文中,我们首次对红树林蝮蛇进行了基因组研究,并引入了一个稳健的假设驱动的物种划分框架,该框架考虑了基因流和系统发育的不确定性,并结合了通过 Delineate 程序实现的一类基于物种划分模型的新应用。我们的研究结果表明,基因流在我们的重点物种中产生了系统发育冲突,并证实了人工分支效应,即高度混杂的种群以分化的非单系方式出现在支系的基部位置。尽管存在基因流的干扰效应,我们还是能够在多种证据交叉和一致的基础上,明确支持对一个新物种的确认。这项研究表明,在考虑多种可能的进化历史、种群结构/分化、基因流的基础上,采用综合假说驱动的方法,并实施基于物种划分的模式,可以在存在基因流和系统发育冲突的情况下有效地划分物种。
{"title":"The Artefactual Branch Effect and Phylogenetic Conflict: Species Delimitation with Gene Flow in Mangrove Pit Vipers (Trimeresurus purpureomaculatus-erythrurus Complex).","authors":"Kin Onn Chan, Daniel G Mulcahy, Shahrul Anuar","doi":"10.1093/sysbio/syad043","DOIUrl":"10.1093/sysbio/syad043","url":null,"abstract":"<p><p>Mangrove pit vipers of the Trimeresurus purpureomaculatus-erythrurus complex are the only species of viper known to naturally inhabit mangroves. Despite serving integral ecological functions in mangrove ecosystems, the evolutionary history, distribution, and species boundaries of mangrove pit vipers remain poorly understood, partly due to overlapping distributions, confusing phenotypic variations, and the lack of focused studies. Here, we present the first genomic study on mangrove pit vipers and introduce a robust hypothesis-driven species delimitation framework that considers gene flow and phylogenetic uncertainty in conjunction with a novel application of a new class of speciation-based delimitation model implemented through the program Delineate. Our results showed that gene flow produced phylogenetic conflict in our focal species and substantiates the artefactual branch effect where highly admixed populations appear as divergent nonmonophyletic lineages arranged in a stepwise manner at the basal position of clades. Despite the confounding effects of gene flow, we were able to obtain unequivocal support for the recognition of a new species based on the intersection and congruence of multiple lines of evidence. This study demonstrates that an integrative hypothesis-driven approach predicated on the consideration of multiple plausible evolutionary histories, population structure/differentiation, gene flow, and the implementation of a speciation-based delimitation model can effectively delimit species in the presence of gene flow and phylogenetic conflict.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9904897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast and Accurate Maximum-Likelihood Estimation of Multi-Type Birth-Death Epidemiological Models from Phylogenetic Trees. 从系统发生树快速、准确地估计多类型出生-死亡流行病学模型的最大似然法
IF 6.5 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2023-12-30 DOI: 10.1093/sysbio/syad059
Anna Zhukova, Frédéric Hecht, Yvon Maday, Olivier Gascuel

Multi-type birth-death (MTBD) models are phylodynamic analogies of compartmental models in classical epidemiology. They serve to infer such epidemiological parameters as the average number of secondary infections Re and the infectious time from a phylogenetic tree (a genealogy of pathogen sequences). The representatives of this model family focus on various aspects of pathogen epidemics. For instance, the birth-death exposed-infectious (BDEI) model describes the transmission of pathogens featuring an incubation period (when there is a delay between the moment of infection and becoming infectious, as for Ebola and SARS-CoV-2), and permits its estimation along with other parameters. With constantly growing sequencing data, MTBD models should be extremely useful for unravelling information on pathogen epidemics. However, existing implementations of these models in a phylodynamic framework have not yet caught up with the sequencing speed. Computing time and numerical instability issues limit their applicability to medium data sets (≤ 500 samples), while the accuracy of estimations should increase with more data. We propose a new highly parallelizable formulation of ordinary differential equations for MTBD models. We also extend them to forests to represent situations when a (sub-)epidemic started from several cases (e.g., multiple introductions to a country). We implemented it for the BDEI model in a maximum likelihood framework using a combination of numerical analysis methods for efficient equation resolution. Our implementation estimates epidemiological parameter values and their confidence intervals in two minutes on a phylogenetic tree of 10,000 samples. Comparison to the existing implementations on simulated data shows that it is not only much faster but also more accurate. An application of our tool to the 2014 Ebola epidemic in Sierra-Leone is also convincing, with very fast calculation and precise estimates. As MTBD models are closely related to Cladogenetic State Speciation and Extinction (ClaSSE)-like models, our findings could also be easily transferred to the macroevolution domain.

多类型出生-死亡(MTBD)模型是经典流行病学中分区模型的系统动力学类似物。它们可以从系统发育树(病原体序列的系谱)中推断出二次感染的平均数量和感染时间等流行病学参数。该模型系列的代表侧重于病原体流行的各个方面。例如,"出生-死亡-暴露-感染"(BDEI)模型描述了具有潜伏期的病原体传播(从感染到具有传染性之间存在延迟,如埃博拉病毒和 SARS-CoV-2),并允许与其他参数一起进行估算。随着测序数据的不断增加,MTBD 模型对于揭示病原体流行信息应该非常有用。然而,这些模型在系统动力学框架下的现有实现还跟不上测序速度。计算时间和数值不稳定性问题限制了它们对中等数据集(≤ 500 个样本)的适用性,而估计的准确性应随着数据量的增加而提高。我们为 MTBD 模型提出了一种新的高度可并行化的常微分方程公式。我们还将其扩展到森林,以表示一种(亚)流行病从多个病例开始的情况(如一个国家的多次引入)。我们在最大似然法框架内对 BDEI 模型实施了这一方法,并结合使用了数值分析方法来有效解决方程问题。我们的实施方案能在两分钟内估算出包含 10,000 个样本的系统发生树上的流行病学参数值及其置信区间。与现有的模拟数据实施方案相比,我们的实施方案不仅速度更快,而且更加准确。我们的工具在 2014 年塞拉利昂埃博拉疫情中的应用也令人信服,计算速度非常快,估计值也很精确。由于 MTBD 模型与类似于 Cladogenetic State Speciation and Extinction(ClaSSE)的模型密切相关,我们的发现也可以很容易地转移到宏观进化领域。
{"title":"Fast and Accurate Maximum-Likelihood Estimation of Multi-Type Birth-Death Epidemiological Models from Phylogenetic Trees.","authors":"Anna Zhukova, Frédéric Hecht, Yvon Maday, Olivier Gascuel","doi":"10.1093/sysbio/syad059","DOIUrl":"10.1093/sysbio/syad059","url":null,"abstract":"<p><p>Multi-type birth-death (MTBD) models are phylodynamic analogies of compartmental models in classical epidemiology. They serve to infer such epidemiological parameters as the average number of secondary infections Re and the infectious time from a phylogenetic tree (a genealogy of pathogen sequences). The representatives of this model family focus on various aspects of pathogen epidemics. For instance, the birth-death exposed-infectious (BDEI) model describes the transmission of pathogens featuring an incubation period (when there is a delay between the moment of infection and becoming infectious, as for Ebola and SARS-CoV-2), and permits its estimation along with other parameters. With constantly growing sequencing data, MTBD models should be extremely useful for unravelling information on pathogen epidemics. However, existing implementations of these models in a phylodynamic framework have not yet caught up with the sequencing speed. Computing time and numerical instability issues limit their applicability to medium data sets (≤ 500 samples), while the accuracy of estimations should increase with more data. We propose a new highly parallelizable formulation of ordinary differential equations for MTBD models. We also extend them to forests to represent situations when a (sub-)epidemic started from several cases (e.g., multiple introductions to a country). We implemented it for the BDEI model in a maximum likelihood framework using a combination of numerical analysis methods for efficient equation resolution. Our implementation estimates epidemiological parameter values and their confidence intervals in two minutes on a phylogenetic tree of 10,000 samples. Comparison to the existing implementations on simulated data shows that it is not only much faster but also more accurate. An application of our tool to the 2014 Ebola epidemic in Sierra-Leone is also convincing, with very fast calculation and precise estimates. As MTBD models are closely related to Cladogenetic State Speciation and Extinction (ClaSSE)-like models, our findings could also be easily transferred to the macroevolution domain.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10924745/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10232359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gene Transfer-Based Phylogenetics: Analytical Expressions and Additivity via Birth-Death Theory. 基于基因转移的系统发育遗传学:通过出生-死亡理论的分析表达和可加性。
IF 6.5 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2023-12-30 DOI: 10.1093/sysbio/syad060
Guy Katriel, Udi Mahanaymi, Shelly Brezner, Noor Kezel, Christoph Koutschan, Doron Zeilberger, Mike Steel, Sagi Snir

The genomic era has opened up vast opportunities in molecular systematics, one of which is deciphering the evolutionary history in fine detail. Under this mass of data, analyzing the point mutations of standard markers is often too crude and slow for fine-scale phylogenetics. Nevertheless, genome dynamics (GD) events provide alternative, often richer information. The synteny index (SI) between a pair of genomes combines gene order and gene content information, allowing the comparison of genomes of unequal gene content, together with order considerations of their common genes. Recently, genome dynamics has been modeled as a continuous-time Markov process, and gene distance in the genome as a birth-death-immigration process. Nevertheless, due to complexities arising in this setting, no precise and provably consistent estimators could be derived, resulting in heuristic solutions. Here, we extend this modeling approach by using techniques from birth-death theory to derive explicit expressions of the system's probabilistic dynamics in the form of rational functions of the model parameters. This, in turn, allows us to infer analytically accurate distances between organisms based on their SI. Subsequently, we establish additivity of this estimated evolutionary distance (a desirable property yielding phylogenetic consistency). Applying the new measure in simulation studies shows that it provides accurate results in realistic settings and even under model extensions such as gene gain/loss or over a tree structure. In the real-data realm, we applied the new formulation to unique data structure that we constructed-the ordered orthology DB-based on a new version of the EggNOG database, to construct a tree with more than 4.5K taxa. To the best of our knowledge, this is the largest gene-order-based tree constructed and it overcomes shortcomings found in previous approaches. Constructing a GD-based tree allows to confirm and contrast findings based on other phylogenetic approaches, as we show.

基因组时代为分子系统学开辟了巨大的机遇,其中之一就是详细解读进化史。在这样大量的数据下,分析标准标记的点突变对于精细规模的系统发育学来说往往过于粗糙和缓慢。然而,基因组动力学(GD)事件提供了替代的、通常更丰富的信息。一对基因组之间的同源性指数(SI)结合了基因顺序和基因含量信息,允许对基因含量不等的基因组进行比较,并考虑其共同基因的顺序。最近,基因组动力学被建模为连续时间马尔可夫过程,基因组中的基因距离被建模为出生-死亡迁移过程。然而,由于这种情况下出现的复杂性,无法导出精确且可证明一致的估计量,从而产生启发式解决方案。在这里,我们通过使用出生-死亡理论中的技术来扩展这种建模方法,以导出模型参数的有理函数形式的系统概率动力学的显式表达式。这反过来又使我们能够根据生物体之间的SI推断出分析上准确的距离。随后,我们建立了这一估计进化距离的可加性(产生系统发育一致性的理想特性)。将新的测量方法应用于模拟研究表明,它在现实环境中,甚至在模型扩展下,如基因增益/损失或在树结构上,都能提供准确的结果。在实际数据领域,我们将新的公式应用于我们基于EggNOG数据库的新版本构建的独特数据结构——有序正交数据库,以构建一个具有4.5K以上分类群的树。据我们所知,这是构建的最大的基于基因顺序的树,它克服了以前方法中发现的缺点。如我们所示,构建基于GD的树可以基于其他系统发育方法来确认和对比研究结果。
{"title":"Gene Transfer-Based Phylogenetics: Analytical Expressions and Additivity via Birth-Death Theory.","authors":"Guy Katriel, Udi Mahanaymi, Shelly Brezner, Noor Kezel, Christoph Koutschan, Doron Zeilberger, Mike Steel, Sagi Snir","doi":"10.1093/sysbio/syad060","DOIUrl":"10.1093/sysbio/syad060","url":null,"abstract":"<p><p>The genomic era has opened up vast opportunities in molecular systematics, one of which is deciphering the evolutionary history in fine detail. Under this mass of data, analyzing the point mutations of standard markers is often too crude and slow for fine-scale phylogenetics. Nevertheless, genome dynamics (GD) events provide alternative, often richer information. The synteny index (SI) between a pair of genomes combines gene order and gene content information, allowing the comparison of genomes of unequal gene content, together with order considerations of their common genes. Recently, genome dynamics has been modeled as a continuous-time Markov process, and gene distance in the genome as a birth-death-immigration process. Nevertheless, due to complexities arising in this setting, no precise and provably consistent estimators could be derived, resulting in heuristic solutions. Here, we extend this modeling approach by using techniques from birth-death theory to derive explicit expressions of the system's probabilistic dynamics in the form of rational functions of the model parameters. This, in turn, allows us to infer analytically accurate distances between organisms based on their SI. Subsequently, we establish additivity of this estimated evolutionary distance (a desirable property yielding phylogenetic consistency). Applying the new measure in simulation studies shows that it provides accurate results in realistic settings and even under model extensions such as gene gain/loss or over a tree structure. In the real-data realm, we applied the new formulation to unique data structure that we constructed-the ordered orthology DB-based on a new version of the EggNOG database, to construct a tree with more than 4.5K taxa. To the best of our knowledge, this is the largest gene-order-based tree constructed and it overcomes shortcomings found in previous approaches. Constructing a GD-based tree allows to confirm and contrast findings based on other phylogenetic approaches, as we show.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49682532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Burma Terrane Amber Fauna Shows Connections to Gondwana and Transported Gondwanan Lineages to the Northern Hemisphere (Araneae: Palpimanoidea). 缅甸地层琥珀动物群显示出与冈瓦纳的联系,并将冈瓦纳种系带到了北半球(鹤形目:Palpimanoidea)。
IF 6.5 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2023-12-30 DOI: 10.1093/sysbio/syad047
Hannah M Wood, Jörg Wunderlich

Burmese amber is a significant source of fossils that documents the mid-Cretaceous biota. This deposit was formed around 99 Ma on the Burma Terrane, which broke away from Gondwana and later collided with Asia, although the timing is disputed. Palpimanoidea is a dispersal-limited group that was a dominant element of the Mesozoic spider fauna, and has an extensive fossil record, particularly from Burmese amber. Using morphological and molecular data, evolutionary relationships of living and fossil Palpimanoidea are examined. Divergence dating with fossils as terminal tips shows timing of diversification is contemporaneous with continental breakup.Ancestral range estimations show widespread ancestral ranges that divide into lineages that inherit different Pangean fragments, consistent with vicariance. Our results suggest that the Burmese amber fauna has ties to Gondwana due to a historical connection in the Early Cretaceous, and that the Burma Terrane facilitated biotic exchange by transporting lineages from Gondwana into the Holarctic in the Cretaceous.

缅甸琥珀是记录白垩纪中期生物群化石的重要来源。该矿床形成于缅甸地层(Burma Terrane),形成时间约为 99 Ma,缅甸地层脱离冈瓦纳群,后来与亚洲相撞,但时间尚有争议。Palpimanoidea 是一个扩散受限的类群,是中生代蜘蛛动物群的主要组成部分,有大量的化石记录,尤其是来自缅甸琥珀的化石记录。利用形态学和分子数据,研究了生活和化石棕蛛属的进化关系。以化石为顶端的分化年代测定显示,其分化时间与大陆断裂同时发生。我们的研究结果表明,由于早白垩世的历史联系,缅甸琥珀动物群与冈瓦纳大陆有联系,缅甸地层在白垩世将冈瓦纳大陆的世系迁移到南北半球,促进了生物交流。
{"title":"Burma Terrane Amber Fauna Shows Connections to Gondwana and Transported Gondwanan Lineages to the Northern Hemisphere (Araneae: Palpimanoidea).","authors":"Hannah M Wood, Jörg Wunderlich","doi":"10.1093/sysbio/syad047","DOIUrl":"10.1093/sysbio/syad047","url":null,"abstract":"<p><p>Burmese amber is a significant source of fossils that documents the mid-Cretaceous biota. This deposit was formed around 99 Ma on the Burma Terrane, which broke away from Gondwana and later collided with Asia, although the timing is disputed. Palpimanoidea is a dispersal-limited group that was a dominant element of the Mesozoic spider fauna, and has an extensive fossil record, particularly from Burmese amber. Using morphological and molecular data, evolutionary relationships of living and fossil Palpimanoidea are examined. Divergence dating with fossils as terminal tips shows timing of diversification is contemporaneous with continental breakup.Ancestral range estimations show widespread ancestral ranges that divide into lineages that inherit different Pangean fragments, consistent with vicariance. Our results suggest that the Burmese amber fauna has ties to Gondwana due to a historical connection in the Early Cretaceous, and that the Burma Terrane facilitated biotic exchange by transporting lineages from Gondwana into the Holarctic in the Cretaceous.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9920283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speciation-by-Extinction. 物种灭绝
IF 6.5 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2023-12-30 DOI: 10.1093/sysbio/syad049
Glenn F Seeholzer, Robb T Brumfield

Extinction is a dominant force shaping patterns of biodiversity through time; however its role as a catalyst of speciation through its interaction with intraspecific variation has been overlooked. Here, we synthesize ideas alluded to by Darwin and others into the model of "speciation-by-extinction" in which speciation results from the extinction of intermediate populations within a single geographically variable species. We explore the properties and distinguishing features of speciation-by-extinction with respect to other established speciation models. We demonstrate its plausibility by showing that the experimental extinction of populations within variable species can result in speciation. The prerequisites for speciation-by-extinction, geographically structured intraspecific variation and local extinction, are ubiquitous in nature. We propose that speciation-by-extinction may be a prevalent, but underappreciated, speciation mechanism.

随着时间的推移,灭绝是影响生物多样性模式的主导力量;然而,灭绝通过与种内变异的相互作用而催化物种变异的作用却一直被忽视。在这里,我们将达尔文等人的观点归纳为 "物种灭绝 "模式,即物种灭绝是单一地理变异物种内部中间种群灭绝的结果。我们探讨了 "物种灭绝演化 "模式与其他已确立的物种演化模式的不同之处。我们通过证明可变物种内种群的实验性灭绝可以导致物种的形成来证明其合理性。物种灭绝模式的先决条件--地理结构上的种内变异和局部灭绝--在自然界中无处不在。我们认为灭绝物种可能是一种普遍存在但未被充分重视的物种形成机制。
{"title":"Speciation-by-Extinction.","authors":"Glenn F Seeholzer, Robb T Brumfield","doi":"10.1093/sysbio/syad049","DOIUrl":"10.1093/sysbio/syad049","url":null,"abstract":"<p><p>Extinction is a dominant force shaping patterns of biodiversity through time; however its role as a catalyst of speciation through its interaction with intraspecific variation has been overlooked. Here, we synthesize ideas alluded to by Darwin and others into the model of \"speciation-by-extinction\" in which speciation results from the extinction of intermediate populations within a single geographically variable species. We explore the properties and distinguishing features of speciation-by-extinction with respect to other established speciation models. We demonstrate its plausibility by showing that the experimental extinction of populations within variable species can result in speciation. The prerequisites for speciation-by-extinction, geographically structured intraspecific variation and local extinction, are ubiquitous in nature. We propose that speciation-by-extinction may be a prevalent, but underappreciated, speciation mechanism.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9994984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning from Phylogenies for Diversification Analyses. 从系统发生学中进行深度学习,以进行多样化分析。
IF 6.5 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2023-12-30 DOI: 10.1093/sysbio/syad044
Sophia Lambert, Jakub Voznica, Hélène Morlon

Birth-death (BD) models are widely used in combination with species phylogenies to study past diversification dynamics. Current inference approaches typically rely on likelihood-based methods. These methods are not generalizable, as a new likelihood formula must be established each time a new model is proposed; for some models, such a formula is not even tractable. Deep learning can bring solutions in such situations, as deep neural networks can be trained to learn the relation between simulations and parameter values as a regression problem. In this paper, we adapt a recently developed deep learning method from pathogen phylodynamics to the case of diversification inference, and we extend its applicability to the case of the inference of state-dependent diversification models from phylogenies associated with trait data. We demonstrate the accuracy and time efficiency of the approach for the time-constant homogeneous BD model and the Binary-State Speciation and Extinction model. Finally, we illustrate the use of the proposed inference machinery by reanalyzing a phylogeny of primates and their associated ecological role as seed dispersers. Deep learning inference provides at least the same accuracy as likelihood-based inference while being faster by several orders of magnitude, offering a promising new inference approach for the deployment of future models in the field.

出生-死亡(BD)模型与物种系统发生相结合,被广泛用于研究过去的物种多样化动态。目前的推断方法通常依赖于基于似然法的方法。这些方法不具有通用性,因为每次提出一个新模型时,都必须建立一个新的似然公式;对于某些模型,这样的公式甚至是不可行的。在这种情况下,深度学习可以带来解决方案,因为深度神经网络可以通过训练来学习模拟与参数值之间的关系,将其作为一个回归问题。在本文中,我们将最近从病原体系统动力学中开发的一种深度学习方法应用于多样化推断,并将其适用性扩展到从与性状数据相关的系统发育中推断与状态相关的多样化模型。我们展示了该方法在时间恒定同质 BD 模型和二元状态物种分化与灭绝模型中的准确性和时间效率。最后,我们通过重新分析灵长类动物的系统发育及其作为种子传播者的相关生态角色,说明了所提出的推断机制的用途。深度学习推断至少提供了与基于似然法的推断相同的准确性,同时速度快了几个数量级,为在该领域部署未来模型提供了一种前景广阔的新推断方法。
{"title":"Deep Learning from Phylogenies for Diversification Analyses.","authors":"Sophia Lambert, Jakub Voznica, Hélène Morlon","doi":"10.1093/sysbio/syad044","DOIUrl":"10.1093/sysbio/syad044","url":null,"abstract":"<p><p>Birth-death (BD) models are widely used in combination with species phylogenies to study past diversification dynamics. Current inference approaches typically rely on likelihood-based methods. These methods are not generalizable, as a new likelihood formula must be established each time a new model is proposed; for some models, such a formula is not even tractable. Deep learning can bring solutions in such situations, as deep neural networks can be trained to learn the relation between simulations and parameter values as a regression problem. In this paper, we adapt a recently developed deep learning method from pathogen phylodynamics to the case of diversification inference, and we extend its applicability to the case of the inference of state-dependent diversification models from phylogenies associated with trait data. We demonstrate the accuracy and time efficiency of the approach for the time-constant homogeneous BD model and the Binary-State Speciation and Extinction model. Finally, we illustrate the use of the proposed inference machinery by reanalyzing a phylogeny of primates and their associated ecological role as seed dispersers. Deep learning inference provides at least the same accuracy as likelihood-based inference while being faster by several orders of magnitude, offering a promising new inference approach for the deployment of future models in the field.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10020465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Phylogenomics reveals patterns of ancient hybridization and differential diversification that contribute to phylogenetic conflict in willows, poplars, and close relatives. 系统发生组学揭示了导致柳树、杨树和近缘植物系统发生冲突的古老杂交和差异分化模式。
IF 6.5 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2023-12-30 DOI: 10.1093/sysbio/syad042
Brian J Sanderson, Diksha Gambhir, Guanqiao Feng, Nan Hu, Quentin C Cronk, Diana M Percy, Francisco Molina Freaner, Matthew G Johnson, Lawrence B Smart, Ken Keefover-Ring, Tongming Yin, Tao Ma, Stephen P DiFazio, Jianquan Liu, Matthew S Olson

Despite the economic, ecological, and scientific importance of the genera Salix L. (willows) and Populus L. (poplars, cottonwoods, and aspens) Salicaceae, we know little about the sources of differences in species diversity between the genera and of the phylogenetic conflict that often confounds estimating phylogenetic trees. Salix subgenera and sections, in particular, have been difficult to classify, with one recent attempt termed a "spectacular failure" due to a speculated radiation of the subgenera Vetrix and Chamaetia. Here, we use targeted sequence capture to understand the evolutionary history of this portion of the Salicaceae plant family. Our phylogenetic hypothesis was based on 787 gene regions and identified extensive phylogenetic conflict among genes. Our analysis supported some previously described subgeneric relationships and confirmed the polyphyly of others. Using an fbranch analysis, we identified several cases of hybridization in deep branches of the phylogeny, which likely contributed to discordance among gene trees. In addition, we identified a rapid increase in diversification rate near the origination of the Vetrix-Chamaetia clade in Salix. This region of the tree coincided with several nodes that lacked strong statistical support, indicating a possible increase in incomplete lineage sorting due to rapid diversification. The extraordinary level of both recent and ancient hybridization in both Salix and Populus have played important roles in the diversification and diversity in these two genera.

尽管柳树属(Salix L.)和杨树属(Populus L.)(杨树、木棉和杨树)在经济、生态和科学方面具有重要意义,但我们对这两个属之间物种多样性差异的来源以及系统发育冲突的来源知之甚少,而系统发育树的估算经常受到这种冲突的困扰。尤其是柳杉亚属和科的分类一直很困难,最近的一次尝试由于推测Vetrix亚属和Chamaetia亚属的辐射而被称为 "惊人的失败"。在这里,我们利用定向序列捕获来了解这部分莎草科植物的进化历史。我们的系统发育假说基于 787 个基因区,发现了基因间广泛的系统发育冲突。我们的分析支持了之前描述的一些亚属关系,并证实了其他亚属的多型性。利用 fbranch 分析,我们在系统发生的深枝中发现了几种杂交情况,这很可能是造成基因树之间不一致的原因。此外,我们还在 Salix 的 Vetrix-Chamaetia 支系起源附近发现了分化率的快速增长。基因树的这一区域与几个缺乏强有力统计支持的节点相吻合,这表明由于快速分化,可能会增加不完全的世系分类。沙柳属和杨属近期和远古杂交的超常水平对这两个属的多样化和多样性起到了重要作用。
{"title":"Phylogenomics reveals patterns of ancient hybridization and differential diversification that contribute to phylogenetic conflict in willows, poplars, and close relatives.","authors":"Brian J Sanderson, Diksha Gambhir, Guanqiao Feng, Nan Hu, Quentin C Cronk, Diana M Percy, Francisco Molina Freaner, Matthew G Johnson, Lawrence B Smart, Ken Keefover-Ring, Tongming Yin, Tao Ma, Stephen P DiFazio, Jianquan Liu, Matthew S Olson","doi":"10.1093/sysbio/syad042","DOIUrl":"10.1093/sysbio/syad042","url":null,"abstract":"<p><p>Despite the economic, ecological, and scientific importance of the genera Salix L. (willows) and Populus L. (poplars, cottonwoods, and aspens) Salicaceae, we know little about the sources of differences in species diversity between the genera and of the phylogenetic conflict that often confounds estimating phylogenetic trees. Salix subgenera and sections, in particular, have been difficult to classify, with one recent attempt termed a \"spectacular failure\" due to a speculated radiation of the subgenera Vetrix and Chamaetia. Here, we use targeted sequence capture to understand the evolutionary history of this portion of the Salicaceae plant family. Our phylogenetic hypothesis was based on 787 gene regions and identified extensive phylogenetic conflict among genes. Our analysis supported some previously described subgeneric relationships and confirmed the polyphyly of others. Using an fbranch analysis, we identified several cases of hybridization in deep branches of the phylogeny, which likely contributed to discordance among gene trees. In addition, we identified a rapid increase in diversification rate near the origination of the Vetrix-Chamaetia clade in Salix. This region of the tree coincided with several nodes that lacked strong statistical support, indicating a possible increase in incomplete lineage sorting due to rapid diversification. The extraordinary level of both recent and ancient hybridization in both Salix and Populus have played important roles in the diversification and diversity in these two genera.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9768582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Skyline Fossilized Birth-Death Model is Robust to Violations of Sampling Assumptions in Total-Evidence Dating. 天际线生死化石模型对违反全证据定年中的取样假设具有稳健性。
IF 6.5 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2023-12-30 DOI: 10.1093/sysbio/syad054
Chi Zhang, Fredrik Ronquist, Tanja Stadler

Several total-evidence dating studies under the fossilized birth-death (FBD) model have produced very old age estimates, which are not supported by the fossil record. This phenomenon has been termed "deep root attraction (DRA)." For two specific data sets, involving divergence time estimation for the early radiations of ants, bees, and wasps (Hymenoptera) and of placental mammals (Eutheria), it has been shown that the DRA effect can be greatly reduced by accommodating the fact that extant species in these trees have been sampled to maximize diversity, so-called diversified sampling. Unfortunately, current methods to accommodate diversified sampling only consider the extreme case where it is possible to identify a cut-off time such that all splits occurring before this time are represented in the sampled tree but none of the younger splits. In reality, the sampling bias is rarely this extreme and may be difficult to model properly. Similar modeling challenges apply to the sampling of the fossil record. This raises the question of whether it is possible to find dating methods that are more robust to sampling biases. Here, we show that the skyline FBD (SFBD) process, where the diversification and fossil-sampling rates can vary over time in a piecewise fashion, provides age estimates that are more robust to inadequacies in the modeling of the sampling process and less sensitive to DRA effects. In the SFBD model we consider, rates in different time intervals are either considered to be independent and identically distributed or assumed to be autocorrelated following an Ornstein-Uhlenbeck (OU) process. Through simulations and reanalyses of Hymenoptera and Eutheria data, we show that both variants of the SFBD model unify age estimates under random and diversified sampling assumptions. The SFBD model can resolve DRA by absorbing the deviations from the sampling assumptions into the inferred dynamics of the diversification process over time. Although this means that the inferred diversification dynamics must be interpreted with caution, taking sampling biases into account, we conclude that the SFBD model represents the most robust approach currently available for addressing DRA in total-evidence dating.

根据化石出生-死亡(FBD)模式进行的几项全证据年代测定研究得出了非常古老的年龄估计,但化石记录却不支持这种估计。这种现象被称为 "深根吸引(DRA)"。对于涉及蚂蚁、蜜蜂和黄蜂(膜翅目)以及胎盘哺乳动物(Eutheria)早期演化的分化时间估计的两个特定数据集,研究表明,如果考虑到这些树中的现存物种已被采样以最大限度地提高多样性这一事实,即所谓的多样化采样,就可以大大降低 DRA 效应。遗憾的是,目前适应多样化取样的方法只考虑了一种极端情况,即有可能确定一个截止时间,使在此时间之前发生的所有分裂都能在取样树中得到体现,而较年轻的分裂则无一体现。在现实中,取样偏差很少达到这种极端情况,因此可能难以正确建模。类似的建模挑战也适用于化石记录的取样。这就提出了一个问题:是否有可能找到对取样偏差更稳健的年代测定方法?在这里,我们展示了天际线 FBD(SFBD)过程,在这个过程中,多样化率和化石取样率可以随时间以片断方式变化,它提供的年龄估计值对取样过程建模的不足更为稳健,对 DRA 的影响也不那么敏感。在我们所考虑的 SFBD 模型中,不同时间间隔内的采样率要么被认为是独立且同分布的,要么被假定为自相关的奥恩斯坦-乌伦贝克(OU)过程。通过模拟和重新分析膜翅目和杜鹃目数据,我们发现 SFBD 模型的两种变体都能在随机和多样化取样假设下统一年龄估计值。SFBD 模型可以通过将取样假设的偏差吸收到随时间变化的多样化过程的推断动态中来解决 DRA 问题。虽然这意味着必须考虑到取样偏差,谨慎解释推断的多样化动态,但我们得出的结论是,SFBD 模型是目前解决全证据年代测定中 DRA 问题的最可靠方法。
{"title":"Skyline Fossilized Birth-Death Model is Robust to Violations of Sampling Assumptions in Total-Evidence Dating.","authors":"Chi Zhang, Fredrik Ronquist, Tanja Stadler","doi":"10.1093/sysbio/syad054","DOIUrl":"10.1093/sysbio/syad054","url":null,"abstract":"<p><p>Several total-evidence dating studies under the fossilized birth-death (FBD) model have produced very old age estimates, which are not supported by the fossil record. This phenomenon has been termed \"deep root attraction (DRA).\" For two specific data sets, involving divergence time estimation for the early radiations of ants, bees, and wasps (Hymenoptera) and of placental mammals (Eutheria), it has been shown that the DRA effect can be greatly reduced by accommodating the fact that extant species in these trees have been sampled to maximize diversity, so-called diversified sampling. Unfortunately, current methods to accommodate diversified sampling only consider the extreme case where it is possible to identify a cut-off time such that all splits occurring before this time are represented in the sampled tree but none of the younger splits. In reality, the sampling bias is rarely this extreme and may be difficult to model properly. Similar modeling challenges apply to the sampling of the fossil record. This raises the question of whether it is possible to find dating methods that are more robust to sampling biases. Here, we show that the skyline FBD (SFBD) process, where the diversification and fossil-sampling rates can vary over time in a piecewise fashion, provides age estimates that are more robust to inadequacies in the modeling of the sampling process and less sensitive to DRA effects. In the SFBD model we consider, rates in different time intervals are either considered to be independent and identically distributed or assumed to be autocorrelated following an Ornstein-Uhlenbeck (OU) process. Through simulations and reanalyses of Hymenoptera and Eutheria data, we show that both variants of the SFBD model unify age estimates under random and diversified sampling assumptions. The SFBD model can resolve DRA by absorbing the deviations from the sampling assumptions into the inferred dynamics of the diversification process over time. Although this means that the inferred diversification dynamics must be interpreted with caution, taking sampling biases into account, we conclude that the SFBD model represents the most robust approach currently available for addressing DRA in total-evidence dating.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10040045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Systematic Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1