
Systematic Biology: Latest Articles

Benefits and Limits of Phasing Alleles for Network Inference of Allopolyploid Complexes.
IF 6.1, Zone 1 (Biology), Q1 EVOLUTIONARY BIOLOGY, Pub Date: 2024-10-25, DOI: 10.1093/sysbio/syae024
George P Tiley, Andrew A Crowl, Paul S Manos, Emily B Sessa, Claudia Solís-Lemus, Anne D Yoder, J Gordon Burleigh

Accurately reconstructing the reticulate histories of polyploids remains a central challenge for understanding plant evolution. Although phylogenetic networks can provide insights into relationships among polyploid lineages, inferring networks may be hindered by the complexities of homology determination in polyploid taxa. We use simulations to show that phasing alleles from allopolyploid individuals can improve phylogenetic network inference under the multispecies coalescent by obtaining the true network with fewer loci compared with haplotype consensus sequences or sequences with heterozygous bases represented as ambiguity codes. Phased allelic data can also improve divergence time estimates for networks, which is helpful for evaluating allopolyploid speciation hypotheses and proposing mechanisms of speciation. To achieve these outcomes in empirical data, we present a novel pipeline that leverages a recently developed phasing algorithm to reliably phase alleles from polyploids. This pipeline is especially appropriate for target enrichment data, where the depth of coverage is typically high enough to phase entire loci. We provide an empirical example in the North American Dryopteris fern complex that demonstrates insights from phased data as well as the challenges of network inference. We establish that our pipeline (PATÉ: Phased Alleles from Target Enrichment data) is capable of recovering a high proportion of phased loci from both diploids and polyploids. These data may improve network estimates compared with using haplotype consensus assemblies by accurately inferring the direction of gene flow, but statistical nonidentifiability of phylogenetic networks poses a barrier to inferring the evolutionary history of reticulate complexes.

Sequential Bayesian Phylogenetic Inference.
IF 6.1, Zone 1 (Biology), Q1 EVOLUTIONARY BIOLOGY, Pub Date: 2024-10-25, DOI: 10.1093/sysbio/syae020
Sebastian Höhna, Allison Y Hsiang

The ideal approach to Bayesian phylogenetic inference is to estimate all parameters of interest jointly in a single hierarchical model. However, this is often not feasible in practice due to the high computational cost. Instead, phylogenetic pipelines generally consist of sequential analyses, whereby a single point estimate from a given analysis is used as input for the next analysis (e.g., a single multiple sequence alignment is used to estimate a gene tree). In this framework, uncertainty is not propagated from step to step, which can lead to inaccurate or spuriously confident results. Here, we formally develop and test a sequential inference approach for Bayesian phylogenetic inference, which uses importance sampling to generate observations for the next step of an analysis pipeline from the posterior distribution produced in the previous step. Our sequential inference approach presented here not only accounts for uncertainty between analysis steps but also allows for greater flexibility in software choice (and hence model availability) and can be computationally more efficient than the traditional joint inference approach when multiple models are being tested. We show that our sequential inference approach is identical in practice to the joint inference approach only if sufficient information in the data is present (a narrow posterior distribution) and/or sufficiently many importance samples are used. Conversely, we show that the common practice of using a single point estimate can be biased, for example, a single phylogeny estimate can transform an unrooted phylogeny into a time-calibrated phylogeny. We demonstrate the theory of sequential Bayesian inference using both a toy example and an empirical case study of divergence-time estimation in insects using a relaxed clock model from transcriptome data. In the empirical example, we estimate 3 posterior distributions of branch lengths from the same data (DNA character matrix with a GTR+Γ+I substitution model, an amino acid data matrix with empirical substitution models, and an amino acid data matrix with the PhyloBayes CAT-GTR model). Finally, we apply 3 different node-calibration strategies and show that divergence time estimates are affected by both the data source and underlying substitution process to estimate branch lengths as well as the node-calibration strategies. Thus, our new sequential Bayesian phylogenetic inference provides the opportunity to efficiently test different approaches for divergence time estimation, including branch-length estimation from other software.
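The importance-sampling step mentioned in this abstract can be illustrated with a generic toy sketch. It is not the authors' implementation: the function names, the normal densities, and the sample size are all assumptions chosen for illustration. Draws standing in for a first-step posterior act as proposals, and each draw is reweighted by the ratio of a second-step target density to the proposal density.

```python
import math
import random


def log_normal_pdf(x, mean, sd):
    """Log density of a normal distribution (toy stand-in for a posterior)."""
    return -0.5 * math.log(2.0 * math.pi) - math.log(sd) - 0.5 * ((x - mean) / sd) ** 2


def importance_estimate(samples, log_target, log_proposal, f=lambda x: x):
    """Self-normalized importance-sampling estimate of E_target[f(x)].

    Also returns the effective sample size, which shrinks when the
    proposal (previous-step posterior) poorly covers the target.
    """
    log_w = [log_target(x) - log_proposal(x) for x in samples]
    shift = max(log_w)  # subtract the maximum for numerical stability
    w = [math.exp(lw - shift) for lw in log_w]
    total = sum(w)
    estimate = sum(wi * f(x) for wi, x in zip(w, samples)) / total
    ess = total ** 2 / sum(wi * wi for wi in w)
    return estimate, ess


# Hypothetical setup: step-1 draws come from N(0, 2), the step-2 target is N(1, 1).
rng = random.Random(1)
step1_draws = [rng.gauss(0.0, 2.0) for _ in range(20000)]
estimate, ess = importance_estimate(
    step1_draws,
    log_target=lambda x: log_normal_pdf(x, 1.0, 1.0),
    log_proposal=lambda x: log_normal_pdf(x, 0.0, 2.0),
)
print(round(estimate, 2), round(ess))
```

The effective sample size gives a rough sense of why the abstract ties accuracy to a narrow posterior or to sufficiently many importance samples: when few draws carry most of the weight, the reweighted estimate becomes unreliable.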

Assessing the Adequacy of Morphological Models using Posterior Predictive Simulations
IF 6.5, Zone 1 (Biology), Q1 EVOLUTIONARY BIOLOGY, Pub Date: 2024-10-07, DOI: 10.1093/sysbio/syae055
Laura P A Mulvey, Michael R May, Jeremy M Brown, Sebastian Höhna, April M Wright, Rachel C M Warnock
Reconstructing the evolutionary history of different groups of organisms provides insight into how life originated and diversified on Earth. Phylogenetic trees are commonly used to estimate this evolutionary history. Within Bayesian phylogenetics a major step in estimating a tree is in choosing an appropriate model of character evolution. While the most common character data used is molecular sequence data, morphological data remains a vital source of information. The use of morphological characters allows for the incorporation of fossil taxa, and despite advances in molecular sequencing, continues to play a significant role in neontology. Moreover, it is the main data source that allows us to unite extinct and extant taxa directly under the same generating process. We therefore require suitable models of morphological character evolution, the most common being the Mk Lewis model. While it is frequently used in both palaeobiology and neontology, it is not known whether the simple Mk substitution model, or any extensions to it, provide a sufficiently good description of the process of morphological evolution. In this study we investigate the impact of different morphological models on empirical tetrapod data sets. Specifically, we compare unpartitioned Mk models with those where characters are partitioned by the number of observed states, both with and without allowing for rate variation across sites and accounting for ascertainment bias. We show that the choice of substitution model has an impact on both topology and branch lengths, highlighting the importance of model choice. Through simulations, we validate the use of the model adequacy approach, posterior predictive simulations, for choosing an appropriate model. Additionally, we compare the performance of model adequacy with Bayesian model selection. We demonstrate how model selection approaches based on marginal likelihoods are not appropriate for choosing between models with partition schemes that vary in character state space (i.e., that vary in Q-matrix state size). Using posterior predictive simulations, we found that current variations of the Mk model are often performing adequately in capturing the evolutionary dynamics that generated our data. We do not find any preference for a particular model extension across multiple data sets, indicating that there is no ‘one size fits all’ when it comes to morphological data and that careful consideration should be given to choosing models of discrete character evolution. By using suitable models of character evolution, we can increase our confidence in our phylogenetic estimates, which should in turn allow us to gain more accurate insights into the evolutionary history of both extinct and extant taxa.
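For readers unfamiliar with the Mk model named in this abstract, the sketch below computes its standard transition probabilities for a character with k states, assuming equal rates between all states and branch lengths measured in expected changes per character. The function name and example values are illustrative assumptions, not taken from the study. It shows how the state-space size k (the Q-matrix size that the partitioning schemes vary) changes the probabilities.

```python
import math


def mk_transition_probs(k, t):
    """Return (p_same, p_diff) under a k-state Mk model.

    p_same is the probability of ending in the starting state after branch
    length t; p_diff is the probability of ending in one specific other state.
    Rates are scaled so that t is the expected number of changes.
    """
    if k < 2:
        raise ValueError("the Mk model needs at least two states")
    if t < 0:
        raise ValueError("branch length must be non-negative")
    decay = math.exp(-k * t / (k - 1))
    p_same = 1.0 / k + (k - 1) / k * decay
    p_diff = 1.0 / k - decay / k
    return p_same, p_diff


# Same branch length, different state-space sizes.
for k in (2, 3, 4):
    p_same, p_diff = mk_transition_probs(k, 0.3)
    print(k, round(p_same, 4), round(p_diff, 4))
```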
The limits of the metapopulation: Lineage fragmentation in a widespread terrestrial salamander (Plethodon cinereus)
IF 6.5, Zone 1 (Biology), Q1 EVOLUTIONARY BIOLOGY, Pub Date: 2024-09-09, DOI: 10.1093/sysbio/syae053
Brian P Waldron, Emily F Watts, Donald J Morgan, Maggie M Hantak, Alan R Lemmon, Emily Moriarty Lemmon, Shawn R Kuchta
In vicariant species formation, divergence results primarily from periods of allopatry and restricted gene flow. Widespread species harboring differentiated, geographically distinct sublineages offer a window into what may be a common mode of species formation, whereby a species originates, spreads across the landscape, then fragments into multiple units. However, incipient lineages usually lack reproductive barriers that prevent their fusion upon secondary contact, blurring the boundaries between a single, large metapopulation-level lineage and multiple independent species. Here we explore this model of species formation in the Eastern Red-backed Salamander (Plethodon cinereus), a widespread terrestrial vertebrate with at least six divergent mitochondrial clades throughout its range. Using anchored hybrid enrichment data, we applied phylogenomic and population genomic approaches to investigate patterns of divergence, gene flow, and secondary contact. Genomic data broadly match most mitochondrial groups but reveal mitochondrial introgression and extensive admixture at several contact zones. While species delimitation analyses in BPP supported five lineages of P. cinereus, genealogical divergence indices (gdi) were highly sensitive to the inclusion of admixed samples and the geographic representation of candidate species, with increasing support for multiple species when removing admixed samples or limiting sampling to a single locality per group. An analysis of morphometric data revealed differences in body size and limb proportions among groups, with a reduction of forelimb length among warmer and drier localities consistent with increased fossoriality. We conclude that P. cinereus is a single species, but one with highly structured component lineages of various degrees of independence.
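The genealogical divergence index (gdi) mentioned in this abstract is commonly computed from multispecies-coalescent parameters as gdi = 1 - exp(-2τ/θ), where τ is the divergence time and θ the population size parameter of the candidate lineage. The sketch below is a minimal illustration of that formula. The cutoffs of 0.2 and 0.7 are a widely used rule of thumb and an assumption here, since the abstract does not state which thresholds the authors applied, and the function names are hypothetical.

```python
import math


def gdi(tau, theta):
    """Genealogical divergence index from divergence time tau and theta."""
    if tau < 0 or theta <= 0:
        raise ValueError("tau must be >= 0 and theta must be > 0")
    return 1.0 - math.exp(-2.0 * tau / theta)


def interpret_gdi(value, low=0.2, high=0.7):
    """Heuristic reading of a gdi value (cutoffs are rule-of-thumb assumptions)."""
    if value < low:
        return "single species"
    if value > high:
        return "distinct species"
    return "ambiguous"


# Illustrative parameter values only, not estimates from the study.
value = gdi(tau=0.005, theta=0.01)
print(round(value, 3), interpret_gdi(value))
```

Because gdi depends on θ, anything that inflates the estimated population size of a candidate lineage, such as admixed samples, lowers the index, which is consistent with the sensitivity the abstract reports.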
Dating in the Dark: Elevated Substitution Rates in Cave Cockroaches (Blattodea: Nocticolidae) Have Negative Impacts on Molecular Date Estimates.
IF 6.1, Zone 1 (Biology), Q1 EVOLUTIONARY BIOLOGY, Pub Date: 2024-09-05, DOI: 10.1093/sysbio/syae002
Toby G L Kovacs, James Walker, Simon Hellemans, Thomas Bourguignon, Nikolai J Tatarnic, Jane M McRae, Simon Y W Ho, Nathan Lo

Rates of nucleotide substitution vary substantially across the Tree of Life, with potentially confounding effects on phylogenetic and evolutionary analyses. A large acceleration in mitochondrial substitution rate occurs in the cockroach family Nocticolidae, which predominantly inhabit subterranean environments. To evaluate the impacts of this among-lineage rate heterogeneity on estimates of phylogenetic relationships and evolutionary timescales, we analyzed nuclear ultraconserved elements (UCEs) and mitochondrial genomes from nocticolids and other cockroaches. Substitution rates were substantially elevated in nocticolid lineages compared with other cockroaches, especially in mitochondrial protein-coding genes. This disparity in evolutionary rates is likely to have led to different evolutionary relationships being supported by phylogenetic analyses of mitochondrial genomes and UCE loci. Furthermore, Bayesian dating analyses using relaxed-clock models inferred much deeper divergence times compared with a flexible local clock. Our phylogenetic analysis of UCEs, which is the first genome-scale study to include all 13 major cockroach families, unites Corydiidae and Nocticolidae and places Anaplectidae as the sister lineage to the rest of Blattoidea. We uncover an extraordinary level of genetic divergence in Nocticolidae, including two highly distinct clades that separated ~115 million years ago despite both containing representatives of the genus Nocticola. The results of our study highlight the potential impacts of high among-lineage rate variation on estimates of phylogenetic relationships and evolutionary timescales.

Phylogenomics of Neogastropoda: The Backbone Hidden in the Bush.
IF 6.1, Zone 1 (Biology), Q1 EVOLUTIONARY BIOLOGY, Pub Date: 2024-09-05, DOI: 10.1093/sysbio/syae010
Alexander E Fedosov, Paul Zaharias, Thomas Lemarcis, Maria Vittoria Modica, Mandë Holford, Marco Oliverio, Yuri I Kantor, Nicolas Puillandre

The molluskan order Neogastropoda encompasses over 15,000 almost exclusively marine species playing important roles in benthic communities and in the economies of coastal countries. Neogastropoda underwent intensive cladogenesis in the early stages of diversification, generating a "bush" at the base of their evolutionary tree, which has been hard to resolve even with high throughput molecular data. In the present study, to resolve the bush, we use a variety of phylogenetic inference methods and a comprehensive exon capture dataset of 1817 loci (79.6% data occupancy) comprising 112 taxa of 48 out of 60 Neogastropoda families. Our results show consistent topologies and high support in all analyses at (super)family level, supporting monophyly of Muricoidea, Mitroidea, Conoidea, and, with some reservations, Olivoidea and Buccinoidea. Volutoidea and Turbinelloidea as currently circumscribed are clearly paraphyletic. Despite our analyses consistently resolving most backbone nodes, 3 prove problematic: First, the uncertain placement of Cancellariidae, as the sister group to either a Ficoidea-Tonnoidea clade or to the rest of Neogastropoda, leaves monophyly of Neogastropoda unresolved. Second, relationships are contradictory at the base of the major "core Neogastropoda" grouping. Third, coalescence-based analyses reject monophyly of the Buccinoidea in relation to Vasidae. We analyzed phylogenetic signal of targeted loci in relation to potential biases, and we propose the most probable resolutions in the latter 2 recalcitrant nodes. The uncertain placement of Cancellariidae may be explained by orthology violations due to differential paralog loss shortly after the whole genome duplication, which should be resolved with a curated set of longer loci.

Systematic Biology, pp. 521-531. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11377187/pdf/
Citations: 0
The Rediscovery of a Relict Unlocks the First Global Phylogeny of Whip Spiders (Amblypygi).
IF 6.1 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2024-09-05 DOI: 10.1093/sysbio/syae021
Gustavo S de Miranda, Siddharth S Kulkarni, Jéssica Tagliatela, Caitlin M Baker, Alessandro P L Giupponi, Facundo M Labarque, Efrat Gavish-Regev, Michael G Rix, Leonardo S Carvalho, Lívia Maria Fusari, Mark S Harvey, Hannah M Wood, Prashant P Sharma

Asymmetrical rates of cladogenesis and extinction abound in the tree of life, resulting in numerous minute clades that are dwarfed by larger sister groups. Such taxa are commonly regarded as phylogenetic relicts or "living fossils" when they exhibit an ancient first appearance in the fossil record and prolonged external morphological stasis, particularly in comparison to their more diversified sister groups. Due to their special status, various phylogenetic relicts tend to be well-studied and prioritized for conservation. A notable exception to this trend is found within Amblypygi ("whip spiders"), a visually striking order of functionally hexapodous arachnids that are notable for their antenniform first walking leg pair (the eponymous "whips"). Paleoamblypygi, the putative sister group to the remaining Amblypygi, is known from Late Carboniferous and Eocene deposits but is survived by a single living species, Paracharon caecus Hansen (1921), that was last collected in 1899. Due to the absence of genomic sequence-grade tissue for this vital taxon, there is no global molecular phylogeny for Amblypygi to date, nor a fossil-calibrated estimation of divergences within the group. Here, we report a previously unknown species of Paleoamblypygi from a cave site in Colombia. Capitalizing upon this discovery, we generated the first molecular phylogeny of Amblypygi, integrating ultraconserved element sequencing with legacy Sanger datasets and including described extant genera. To quantify the impact of sampling Paleoamblypygi on divergence time estimation, we performed in silico experiments with pruning of Paracharon. We demonstrate that the omission of relicts has a significant impact on the accuracy of node dating approaches that outweighs the impact of excluding ingroup fossils, which bears upon the ancestral range reconstruction for the group. 
Our results underscore the imperative for biodiversity discovery efforts in elucidating the phylogenetic relationships of "dark taxa," and especially phylogenetic relicts in tropical and subtropical habitats. The lack of reciprocal monophyly for Charontidae and Charinidae leads us to subsume them into one family, Charontidae, new synonymy.

Systematic Biology, pp. 495-505.
Citations: 0
Random-Effects Substitution Models for Phylogenetics via Scalable Gradient Approximations.
IF 6.1 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2024-09-05 DOI: 10.1093/sysbio/syae019
Andrew F Magee, Andrew J Holbrook, Jonathan E Pekar, Itzue W Caviedes-Solis, Fredrick A Matsen IV, Guy Baele, Joel O Wertheim, Xiang Ji, Philippe Lemey, Marc A Suchard

Phylogenetic and discrete-trait evolutionary inference depend heavily on an appropriate characterization of the underlying character substitution process. In this paper, we present random-effects substitution models that extend common continuous-time Markov chain models into a richer class of processes capable of capturing a wider variety of substitution dynamics. As these random-effects substitution models often require many more parameters than their usual counterparts, inference can be both statistically and computationally challenging. Thus, we also propose an efficient approach to compute an approximation to the gradient of the data likelihood with respect to all unknown substitution model parameters. We demonstrate that this approximate gradient enables scaling of sampling-based inference, namely Bayesian inference via Hamiltonian Monte Carlo, under random-effects substitution models across large trees and state-spaces. Applied to a dataset of 583 SARS-CoV-2 sequences, an HKY model with random-effects shows strong signals of nonreversibility in the substitution process, and posterior predictive model checks clearly show that it is a more adequate model than a reversible model. When analyzing the pattern of phylogeographic spread of 1441 influenza A virus (H3N2) sequences between 14 regions, a random-effects phylogeographic substitution model infers that air travel volume adequately predicts almost all dispersal rates. A random-effects state-dependent substitution model reveals no evidence for an effect of arboreality on the swimming mode in the tree frog subfamily Hylinae. Simulations reveal that random-effects substitution models can accommodate both negligible and radical departures from the underlying base substitution model. We show that our gradient-based inference approach is over an order of magnitude more time efficient than conventional approaches.

Systematic Biology, pp. 562-578. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11498053/pdf/
Citations: 0
Inferring the Evolutionary Model of Community-Structuring Traits with Convolutional Kitchen Sinks.
IF 6.1 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2024-09-05 DOI: 10.1093/sysbio/syae026
Avery Kruger, Vaishaal Shankar, T Jonathan Davies

When communities are assembled through processes such as filtering or limiting similarity acting on phylogenetically conserved traits, the evolutionary signature of those traits may be reflected in patterns of community membership. We show how the model of trait evolution underlying community-structuring traits can be inferred from community membership data using both a variation of a traditional eco-phylogenetic metric, the mean pairwise phylogenetic distance (MPD) between taxa, and a recent machine learning tool, Convolutional Kitchen Sinks (CKS). Both methods perform well across a range of phylogenetically informative evolutionary models, but CKS outperforms MPD as tree size increases. We demonstrate CKS by inferring the evolutionary history of freeze tolerance in angiosperms. Our analysis is consistent with a late burst model, suggesting freeze tolerance evolved recently. We suggest that multiple data types that are ordered on phylogenies, such as trait values, species interactions, or community presence/absence, are good candidates for CKS modeling because the generative models produce structured differences between neighboring points that CKS is well-suited for. We introduce the R package kitchen to perform CKS for generic application of the technique.
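
For readers unfamiliar with the MPD metric named in this abstract, the following is a minimal sketch of the standard, unweighted mean pairwise phylogenetic distance for one community. It is not the variation the authors use, which the abstract does not specify. The function name, the four-taxon distance table, and the community sets are all hypothetical, chosen only for illustration.

```python
from itertools import combinations


def mean_pairwise_distance(dist, community):
    """Average phylogenetic distance over all pairs of taxa present in a community.

    dist: maps frozenset({taxon_a, taxon_b}) to their pairwise (patristic) distance.
    community: iterable of taxon names present in the community.
    """
    taxa = sorted(set(community))
    if len(taxa) < 2:
        raise ValueError("MPD is undefined for fewer than two taxa")
    pairs = list(combinations(taxa, 2))
    return sum(dist[frozenset(pair)] for pair in pairs) / len(pairs)


# Hypothetical distances for a toy tree ((A,B),(C,D)); values are illustrative only.
toy_dist = {
    frozenset(("A", "B")): 2.0,
    frozenset(("C", "D")): 4.0,
    frozenset(("A", "C")): 10.0,
    frozenset(("A", "D")): 10.0,
    frozenset(("B", "C")): 10.0,
    frozenset(("B", "D")): 10.0,
}

clustered = mean_pairwise_distance(toy_dist, {"A", "B"})       # close relatives only
dispersed = mean_pairwise_distance(toy_dist, {"A", "B", "C"})  # adds a distant relative
print(clustered, dispersed)
```

In this toy case the community of close relatives has a lower MPD than the one that includes a distant relative, which is the kind of contrast the metric is meant to capture.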

Systematic Biology, pp. 546-561. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11377182/pdf/
Citations: 0
The Sequential Direct and Indirect Effects of Mountain Uplift, Climatic Niche, and Floral Trait Evolution on Diversification Dynamics in an Andean Plant Clade.
IF 6.1 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2024-09-05 DOI: 10.1093/sysbio/syae011
Agnes S Dellinger, Laura Lagomarsino, Fabián Michelangeli, Stefan Dullinger, Stacey D Smith

Why and how organismal lineages radiate is commonly studied through either assessing abiotic factors (biogeography, geomorphological processes, and climate) or biotic factors (traits and interactions). Despite increasing awareness that both abiotic and biotic processes may have important joint effects on diversification dynamics, few attempts have been made to quantify the relative importance and timing of these factors, and their potentially interlinked direct and indirect effects, on lineage diversification. We here combine assessments of historical biogeography, geomorphology, climatic niche, vegetative, and floral trait evolution to test whether these factors jointly, or in isolation, explain diversification dynamics of a Neotropical plant clade (Merianieae, Melastomataceae). After estimating ancestral areas and the changes in niche and trait disparity over time, we employ Phylogenetic Path Analyses as a synthesis tool to test eleven hypotheses on the individual direct and indirect effects of these factors on diversification rates. We find strongest support for interlinked effects of colonization of the uplifting Andes during the mid-Miocene and rapid abiotic climatic niche evolution in explaining a burst in diversification rate in Merianieae. Within Andean habitats, later increases in floral disparity allowed for the exploitation of wider pollination niches (i.e., shifts from bee to vertebrate pollinators), but did not affect diversification rates. Our approach of including both vegetative and floral trait evolution, rare in assessments of plant diversification in general, highlights that the evolution of woody habit and larger flowers preceded the colonization of the Andes, but was likely critical in enabling the rapid radiation in montane environments. 
Overall, and in concert with the idea that ecological opportunity is a key element of evolutionary radiations, our results suggest that a combination of rapid niche evolution and trait shifts was critical for the exploitation of newly available niche space in the Andes in the mid-Miocene. Further, our results emphasize the importance of incorporating both abiotic and biotic factors into the same analytical framework if we aim to quantify the relative and interlinked effects of these processes on diversification.

Systematic Biology, pp. 594-612. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11377192/pdf/
Citations: 0