Systematic Biology最新文献_第2页

The Explosive Radiation of the Neotropical Tillandsia Subgenus Tillandsia (Bromeliaceae) Has Been Accompanied by Pervasive Hybridization. 新热带凤梨亚属（凤梨科）的爆炸辐射伴随着普遍的杂交。

IF 5.7 1区生物学 Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2026-01-15 DOI: 10.1093/sysbio/syaf039

Gil Yardeni, Michael H J Barfuss, Walter Till, Matthew R Thornton, Clara Groot Crego, Christian Lexer, Thibault Leroy, Ovidiu Paun

The recent rapid radiation of Tillandsia subgenus Tillandsia (Bromeliaceae) provides an attractive system to study the drivers and constraints of species diversification. This species-rich Neotropical monocot clade includes predominantly epiphytic species displaying vast phenotypic diversity. Recent in-depth phylogenomic work revealed that the subgenus originated within the last 7 myr, with one major expansion from South into Central America within the last 5 myr. However, disagreements between phylogenies and lack of resolution at shallow nodes suggest that hybridization may have occurred throughout the radiation, together with frequent incomplete lineage sorting and rapid gene family evolution. We used whole-genome resequencing data to explore the evolutionary history of representative ingroup species employing both tree-based and network approaches. Our results indicate that lineage co-occurrence does not predict relatedness and confirm significant deviations from a tree-like structure, coupled with pervasive gene-tree discordance. Focusing on hybridization, ABBA-BABA and related statistics were used to infer the rates and relative timing of introgression, whereas topology weighting uncovered high heterogeneity of the phylogenetic signal along the genome. High rates of hybridization within and among subclades suggest that, contrary to previous hypotheses, the expansion of subgenus Tillandsia into Central America proceeded through several dispersal events, punctuated by episodes of diversification and gene flow. Network analysis revealed reticulation as a plausible propeller during radiation and establishment across different ecological niches. This work contributes a plant example of prevalent hybridization during rapid species diversification, supporting the hypothesis that interspecific gene flow facilitates explosive diversification.

凤梨科凤梨亚属的快速辐射为研究凤梨物种多样化的驱动因素和制约因素提供了一个有吸引力的系统。这个物种丰富的新热带单子叶枝主要包括附生物种，表现出巨大的表型多样性。最近深入的系统基因组学研究表明，该亚属起源于过去7个世纪，在过去5个世纪内从南美洲向中美洲进行了一次大规模扩张。然而，系统发育的分歧和浅节点缺乏分辨率表明，杂交可能发生在整个辐射过程中，以及频繁的不完整谱系分选和快速的基因家族进化。我们使用全基因组重测序数据，采用基于树和网络的方法来探索具有代表性的群内物种的进化史。我们的研究结果表明，谱系共现并不能预测亲缘关系，并证实了树状结构的显著偏差，以及普遍存在的基因树不一致。聚焦于杂交，ABBA-BABA和相关统计数据被用来推断基因渗入的速率和相对时间，而拓扑加权揭示了沿基因组的系统发育信号的高度异质性。亚分支内部和之间的高杂交率表明，与先前的假设相反，Tillandsia亚属向中美洲的扩张是通过几次分散事件进行的，其间不时出现多样化和基因流动。网络分析表明，在不同生态位的辐射和建立过程中，网状结构可能是一个推进器。这项工作提供了一个在快速物种多样化过程中普遍存在杂交的植物例子，支持了种间基因流动促进爆炸性多样化的假设。

{"title":"The Explosive Radiation of the Neotropical Tillandsia Subgenus Tillandsia (Bromeliaceae) Has Been Accompanied by Pervasive Hybridization.","authors":"Gil Yardeni, Michael H J Barfuss, Walter Till, Matthew R Thornton, Clara Groot Crego, Christian Lexer, Thibault Leroy, Ovidiu Paun","doi":"10.1093/sysbio/syaf039","DOIUrl":"10.1093/sysbio/syaf039","url":null,"abstract":"The recent rapid radiation of Tillandsia subgenus Tillandsia (Bromeliaceae) provides an attractive system to study the drivers and constraints of species diversification. This species-rich Neotropical monocot clade includes predominantly epiphytic species displaying vast phenotypic diversity. Recent in-depth phylogenomic work revealed that the subgenus originated within the last 7 myr, with one major expansion from South into Central America within the last 5 myr. However, disagreements between phylogenies and lack of resolution at shallow nodes suggest that hybridization may have occurred throughout the radiation, together with frequent incomplete lineage sorting and rapid gene family evolution. We used whole-genome resequencing data to explore the evolutionary history of representative ingroup species employing both tree-based and network approaches. Our results indicate that lineage co-occurrence does not predict relatedness and confirm significant deviations from a tree-like structure, coupled with pervasive gene-tree discordance. Focusing on hybridization, ABBA-BABA and related statistics were used to infer the rates and relative timing of introgression, whereas topology weighting uncovered high heterogeneity of the phylogenetic signal along the genome. High rates of hybridization within and among subclades suggest that, contrary to previous hypotheses, the expansion of subgenus Tillandsia into Central America proceeded through several dispersal events, punctuated by episodes of diversification and gene flow. Network analysis revealed reticulation as a plausible propeller during radiation and establishment across different ecological niches. This work contributes a plant example of prevalent hybridization during rapid species diversification, supporting the hypothesis that interspecific gene flow facilitates explosive diversification.","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"22-38"},"PeriodicalIF":5.7,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805668/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144498008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Correction to: The Fossilized Birth-Death Model Is Identifiable. 更正：化石的出生-死亡模型是可识别的。

IF 5.7 1区生物学 Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2026-01-15 DOI: 10.1093/sysbio/syaf074

引用次数: 0

torchtree: Flexible Phylogenetic Model Development and Inference Using PyTorch. torchtree：使用PyTorch开发和推断灵活的系统发育模型。

IF 5.7 1区生物学 Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2026-01-15 DOI: 10.1093/sysbio/syaf047

Mathieu Fourment, Matthew Macaulay, Christiaan J Swanepoel, Xiang Ji, Marc A Suchard, Frederick A Matsen Iv

Bayesian inference has predominantly relied on the Markov chain Monte Carlo (MCMC) algorithm for many years. However, MCMC is computationally laborious, especially for complex phylogenetic models of time trees. This bottleneck has led to the search for alternatives, such as variational Bayes, which can scale better to large data sets. In this paper, we introduce torchtree, a framework written in Python that allows developers to easily implement rich phylogenetic models and algorithms using a fixed tree topology. One can either use automatic differentiation or leverage torchtree's plug-in system to compute gradients analytically for model components for which automatic differentiation is slow. We demonstrate that the torchtree variational inference framework performs similarly to BEAST in terms of speed, and delivers promising approximation results, though accuracy varies across scenarios. Furthermore, we explore the use of the forward Kullback-Leibler (KL) divergence as an optimizing criterion for variational inference, which can handle discontinuous and nondifferentiable models. Our experiments show that inference using the forward KL divergence is frequently faster per iteration compared with the evidence lower bound (ELBO) criterion, although the ELBO-based inference may converge faster in some cases. Overall, torchtree provides a flexible and efficient framework for phylogenetic model development and inference using PyTorch.

多年来，贝叶斯推理主要依赖于马尔可夫链蒙特卡罗（MCMC）算法。然而，MCMC在计算上很费力，特别是对于时间树的复杂系统发育模型。这一瓶颈导致人们寻找替代方法，比如变分贝叶斯，它可以更好地扩展到大型数据集。在本文中，我们介绍了torchtree，这是一个用Python编写的框架，允许开发人员使用固定的树拓扑结构轻松实现丰富的系统发育模型和算法。您可以使用自动微分，或者利用torchtree的插件系统来分析地计算自动微分缓慢的模型组件的梯度。我们证明了火炬树变分推理框架在速度方面与BEAST相似，并提供了有希望的近似结果，尽管准确率因场景而异。此外，我们探索了将前向KL散度作为变分推理的优化准则，该准则可以处理不连续和不可微模型。我们的实验表明，与证据下界（ELBO）标准相比，使用前向KL散度的推理每次迭代通常更快，尽管基于ELBO的推理在某些情况下可能收敛得更快。总的来说，torchtree为使用PyTorch开发和推断系统发育模型提供了一个灵活而高效的框架。系统发育，贝叶斯推理，变分贝叶斯，PyTorch。

{"title":"torchtree: Flexible Phylogenetic Model Development and Inference Using PyTorch.","authors":"Mathieu Fourment, Matthew Macaulay, Christiaan J Swanepoel, Xiang Ji, Marc A Suchard, Frederick A Matsen Iv","doi":"10.1093/sysbio/syaf047","DOIUrl":"10.1093/sysbio/syaf047","url":null,"abstract":"Bayesian inference has predominantly relied on the Markov chain Monte Carlo (MCMC) algorithm for many years. However, MCMC is computationally laborious, especially for complex phylogenetic models of time trees. This bottleneck has led to the search for alternatives, such as variational Bayes, which can scale better to large data sets. In this paper, we introduce torchtree, a framework written in Python that allows developers to easily implement rich phylogenetic models and algorithms using a fixed tree topology. One can either use automatic differentiation or leverage torchtree's plug-in system to compute gradients analytically for model components for which automatic differentiation is slow. We demonstrate that the torchtree variational inference framework performs similarly to BEAST in terms of speed, and delivers promising approximation results, though accuracy varies across scenarios. Furthermore, we explore the use of the forward Kullback-Leibler (KL) divergence as an optimizing criterion for variational inference, which can handle discontinuous and nondifferentiable models. Our experiments show that inference using the forward KL divergence is frequently faster per iteration compared with the evidence lower bound (ELBO) criterion, although the ELBO-based inference may converge faster in some cases. Overall, torchtree provides a flexible and efficient framework for phylogenetic model development and inference using PyTorch.","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"39-51"},"PeriodicalIF":5.7,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805669/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144561212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Too Many Numbers? 数字太多？

IF 5.7 1区生物学 Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2026-01-15 DOI: 10.1093/sysbio/syaf076

F James Rohlf

A somewhat personal account of the development and acceptance of numerical taxonomic methods during the early years of the journal Systematic Zoology. Includes a few perspectives on the changes in taxonomy and the journal after 75 years.

在《系统动物学》杂志早期，对数字分类学方法的发展和接受程度的个人叙述。包括对75年后分类学和期刊变化的一些观点。

引用次数: 0

PhyloFusion-Fast and Easy Fusion of Rooted Phylogenetic Trees into Rooted Phylogenetic Networks. 快速和容易的融合根系统发育树到根系统发育网络。

IF 5.7 1区生物学 Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2026-01-15 DOI: 10.1093/sysbio/syaf049

Louxin Zhang, Banu Cetinkaya, Daniel H Huson

Unrooted phylogenetic networks are commonly used to represent evolutionary data in the presence of incompatibilities. Although rooted phylogenetic networks offer a more explicit framework for depicting evolutionary histories involving reticulate events, they are reported less frequently, probably due to a lack of tools that are as easily applicable as those for unrooted networks. Here, we introduce PhyloFusion, a fast and user-friendly method for constructing rooted phylogenetic networks from sets of rooted phylogenetic trees. The resulting networks have the tree-child property. The algorithm accommodates trees with unresolved nodes-often resulting from the contraction of low-support edges-as well as some degree of missing taxa. We demonstrate its application to the analysis of functionally related gene groups and show that it can efficiently handle data sets comprising tens of trees or hundreds of taxa. An open source implementation of PhyloFusion is available as part of the SplitsTree app: https://www.github.com/husonlab/splitstree6. All data available here: https://doi.org/10.5061/dryad.k3j9kd5h5.

无根系统发育网络通常用于表示存在不兼容性的进化数据。虽然有根的系统发育网络为描述涉及网状事件的进化历史提供了一个更明确的框架，但它们的报道频率较低，可能是因为缺乏像无根网络那样容易适用的工具。在这里，我们介绍了PhyloFusion，一种快速且用户友好的方法，用于从有根的系统发生树集构建有根的系统发生网络。由此产生的网络具有树-子属性。该算法适用于具有未解析节点的树——通常是由于低支持边的收缩——以及某种程度上缺失的分类群。我们演示了它在功能相关基因群分析中的应用，并表明它可以有效地处理包含数十棵树或数百个分类群的数据集。PhyloFusion的开源实现是splittree应用程序的一部分：https://www.github.com/husonlab/splitstree6所有数据可在这里获得：https://doi.org/10.5061/dryad.k3j9kd5h5。

{"title":"PhyloFusion-Fast and Easy Fusion of Rooted Phylogenetic Trees into Rooted Phylogenetic Networks.","authors":"Louxin Zhang, Banu Cetinkaya, Daniel H Huson","doi":"10.1093/sysbio/syaf049","DOIUrl":"10.1093/sysbio/syaf049","url":null,"abstract":"Unrooted phylogenetic networks are commonly used to represent evolutionary data in the presence of incompatibilities. Although rooted phylogenetic networks offer a more explicit framework for depicting evolutionary histories involving reticulate events, they are reported less frequently, probably due to a lack of tools that are as easily applicable as those for unrooted networks. Here, we introduce PhyloFusion, a fast and user-friendly method for constructing rooted phylogenetic networks from sets of rooted phylogenetic trees. The resulting networks have the tree-child property. The algorithm accommodates trees with unresolved nodes-often resulting from the contraction of low-support edges-as well as some degree of missing taxa. We demonstrate its application to the analysis of functionally related gene groups and show that it can efficiently handle data sets comprising tens of trees or hundreds of taxa. An open source implementation of PhyloFusion is available as part of the SplitsTree app: https://www.github.com/husonlab/splitstree6. All data available here: https://doi.org/10.5061/dryad.k3j9kd5h5.","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"88-99"},"PeriodicalIF":5.7,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805670/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144650538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An evolving view of species tree inference. 物种树推理的进化观点。

IF 6.5 1区生物学 Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2026-01-15 DOI: 10.1093/sysbio/syag002

Laura Kubatko

Nearly all modern studies that address evolutionary questions require consideration of the phylogenetic relationships among species. But what, exactly, is a species tree? And how do we go about estimating such a tree from genomic data? In this Evolving View, I consider the historical development of the field of species tree inference and discuss both progress and controversies within the field at present. I conclude by suggesting future directions and highlighting challenges the field is likely to face in the coming years.

几乎所有解决进化问题的现代研究都需要考虑物种之间的系统发育关系。但是，物种树到底是什么呢？我们如何从基因组数据中估计这样一棵树呢？在这个进化的观点中，我考虑了物种树推理领域的历史发展，并讨论了目前该领域的进展和争议。最后，我提出了未来的发展方向，并强调了该领域在未来几年可能面临的挑战。

引用次数: 0

Coping with Ineffective Overlap in Multilocus Phylogenetics. 多位点系统发育中无效重叠的处理。

IF 5.7 1区生物学 Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2026-01-15 DOI: 10.1093/sysbio/syaf044

Ana Serra Silva, Karen Siu-Ting, Christopher J Creevey, Davide Pisani, Mark Wilkinson

Missing data is a long-standing issue in phylogenetic inference, which often results in high levels of taxonomic instability, obscuring otherwise well-supported relationships. Multiple approaches have been developed to deal with the negative effects of ineffective overlap on tree resolution, often by identifying taxa for removal. Here, we repurpose a heuristic method developed to identify unstable taxa in morphological data matrices, concatabominations, and combine it with a novel gene-tree jackknifing on matrix representation of trees to identify candidates for targeted sequencing. Using a multilocus caecilian data set, we illustrate the method's capacity to identify candidate taxa and loci for additional sequencing, compare the results with those of the mathematics-based gene sampling sufficiency approach, and explore the terrace space associated with the multilocus data set. We show that our approach yields tractable numbers of loci/taxa for targeted sequencing that successfully mitigate topological instability due to ineffective overlap, even when modest amounts of data are added.

在系统发育推断中，数据缺失是一个长期存在的问题，它经常导致高度的分类不稳定性，模糊了其他得到良好支持的关系。已经开发了多种方法来处理无效重叠对树分辨率的负面影响，通常是通过确定要移除的分类群。在这里，我们重新利用一种启发式方法来识别形态学数据矩阵中不稳定的分类群，并将其与一种新的基于树的矩阵表示的基因树jackkning相结合，以确定目标测序的候选物种。利用一个多位点的蛭形生物数据集，我们说明了该方法识别候选分类群和基因座的能力，并将结果与基于数学的基因采样充分性方法进行了比较，并探索了与多位点数据集相关的平台空间。我们表明，即使添加适量的数据，我们的方法也能产生可处理的位点/分类群数量，用于靶向测序，成功地减轻了由于无效重叠造成的拓扑不稳定性。

{"title":"Coping with Ineffective Overlap in Multilocus Phylogenetics.","authors":"Ana Serra Silva, Karen Siu-Ting, Christopher J Creevey, Davide Pisani, Mark Wilkinson","doi":"10.1093/sysbio/syaf044","DOIUrl":"10.1093/sysbio/syaf044","url":null,"abstract":"Missing data is a long-standing issue in phylogenetic inference, which often results in high levels of taxonomic instability, obscuring otherwise well-supported relationships. Multiple approaches have been developed to deal with the negative effects of ineffective overlap on tree resolution, often by identifying taxa for removal. Here, we repurpose a heuristic method developed to identify unstable taxa in morphological data matrices, concatabominations, and combine it with a novel gene-tree jackknifing on matrix representation of trees to identify candidates for targeted sequencing. Using a multilocus caecilian data set, we illustrate the method's capacity to identify candidate taxa and loci for additional sequencing, compare the results with those of the mathematics-based gene sampling sufficiency approach, and explore the terrace space associated with the multilocus data set. We show that our approach yields tractable numbers of loci/taxa for targeted sequencing that successfully mitigate topological instability due to ineffective overlap, even when modest amounts of data are added.","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"52-71"},"PeriodicalIF":5.7,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805666/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144561211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Genomic Perspective on Cryptic Species Reveals Complex Evolutionary Dynamics in the Gray Zone of the Speciation Continuum. 基因组学视角下的隐种揭示了物种形成连续体灰色地带的复杂进化动力学。

IF 6.5 1区生物学 Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2026-01-14 DOI: 10.1093/sysbio/syag001

Kin Onn Chan,Dario N Neokleous,Shahrul Anuar,Rafe M Brown,Carl R Hutter,Indraneil Das,Stefan T Hertwig

The evolutionary dynamics of cryptic species remain poorly understood, and their detection relies primarily on methods that quantify divergence, assuming that gene flow is absent. Here, we examine how gene flow shapes the evolutionary trajectories and species boundaries in Bornean Fanged Frogs, a renowned example of cryptic diversity where a single species has been split into 18 genetically divergent yet morphologically indistinguishable species. We employed target-capture data from over 13,000 loci to assess lineage independence of 14 nominal species distributed across Malaysian Borneo by evaluating both divergence and cohesion using network multispecies coalescent (NMSC) and MSC + migration approaches. Under the Unified Species Concept, only six of the 14 nominal species unambiguously form independently evolving lineages; the remainder represent cohesive metapopulation lineages nested within those six species. While mitochondrial p-distances varied substantially (up to 10%), genome-wide net divergences (Da) were more consistent, ranging from 0.5-2%, placing all the hypothesized "cryptic species" within the empirical gray zone of the speciation continuum. We show that diversification in the gray zone is unpredictable and heavily impacted by gene flow, leading to two key phenomena that confound species delimitation: (1) the artifactual branch effect, where admixed lineages are inferred as long, early-diverging branches, creating an illusion of deep divergence; and (2) the species-definition anomaly zone, where intraspecific pairwise sequence distances exceed interspecific ones. We further demonstrate that divergence in the gray zone varies among metrics and genomic regions, reflecting heterogeneity in evolutionary dynamics across the genome. Different genomic markers also vary considerably in phylogenetic discordance and their ability to retain signatures of gene flow. Loci from anchored hybrid enrichment (AHE) and ultraconserved elements (UCE) produced less phylogenetic discordance and retained signals of older introgression but failed to detect recent migration, making them suitable for phylogenetic reconstruction and inferring ancient introgression, but not ongoing gene flow. Recognizing the central role of gene flow reframes our understanding of cryptic species; rather than being considered as genetically distinct units that failed to evolve morphological differentiation, they are manifestations of continuous diversification in the gray zone. This shift in perspective offers a new and dynamic evolutionary framework for identifying and interpreting cryptic biodiversity across the Tree of Life.

隐物种的进化动力学仍然知之甚少，它们的检测主要依赖于量化分化的方法，假设基因流不存在。在这里，我们研究了基因流如何塑造婆罗洲尖牙蛙的进化轨迹和物种边界，这是一个著名的隐性多样性的例子，一个物种被分成18个遗传上不同但形态上无法区分的物种。我们利用来自13000多个位点的目标捕获数据，利用网络多物种聚结（NMSC）和MSC +迁移方法评估了分布在马来西亚婆罗洲的14个名义物种的分化和内聚，从而评估了它们的谱系独立性。在统一物种概念下，14个名义物种中只有6个明确地形成独立进化的谱系；其余的代表了在这六个物种中嵌套的有凝聚力的超种群谱系。虽然线粒体p-距离差异很大（高达10%），但全基因组净差异（Da）更为一致，范围在0.5-2%之间，将所有假设的“隐物种”置于物种形成连续体的经验灰色地带。我们发现，灰色地带的多样化是不可预测的，并且受到基因流的严重影响，导致了混淆物种划分的两个关键现象：(1)人工分支效应，其中混合谱系被推断为长而早分化的分支，造成了深度分化的错觉；(2)种内配对序列距离大于种间配对序列距离的种定义异常带。我们进一步证明，灰色地带的差异在指标和基因组区域之间有所不同，反映了整个基因组进化动力学的异质性。不同的基因组标记在系统发育不一致性和保留基因流动特征的能力方面也有很大差异。来自锚定杂交富集（AHE）和超保守元件（UCE）的位点产生的系统发育不一致较少，保留了较早渗入的信号，但不能检测到最近的迁移，这使得它们适合系统发育重建和推断古代渗入，但不适合持续的基因流动。认识到基因流的核心作用重新构建了我们对隐物种的理解；而不是被认为是遗传上不同的单位，未能进化形态分化，他们是在灰色地带不断多样化的表现。这种观点的转变为识别和解释生命之树上的隐性生物多样性提供了一个新的、动态的进化框架。

{"title":"A Genomic Perspective on Cryptic Species Reveals Complex Evolutionary Dynamics in the Gray Zone of the Speciation Continuum.","authors":"Kin Onn Chan,Dario N Neokleous,Shahrul Anuar,Rafe M Brown,Carl R Hutter,Indraneil Das,Stefan T Hertwig","doi":"10.1093/sysbio/syag001","DOIUrl":"https://doi.org/10.1093/sysbio/syag001","url":null,"abstract":"The evolutionary dynamics of cryptic species remain poorly understood, and their detection relies primarily on methods that quantify divergence, assuming that gene flow is absent. Here, we examine how gene flow shapes the evolutionary trajectories and species boundaries in Bornean Fanged Frogs, a renowned example of cryptic diversity where a single species has been split into 18 genetically divergent yet morphologically indistinguishable species. We employed target-capture data from over 13,000 loci to assess lineage independence of 14 nominal species distributed across Malaysian Borneo by evaluating both divergence and cohesion using network multispecies coalescent (NMSC) and MSC + migration approaches. Under the Unified Species Concept, only six of the 14 nominal species unambiguously form independently evolving lineages; the remainder represent cohesive metapopulation lineages nested within those six species. While mitochondrial p-distances varied substantially (up to 10%), genome-wide net divergences (Da) were more consistent, ranging from 0.5-2%, placing all the hypothesized \"cryptic species\" within the empirical gray zone of the speciation continuum. We show that diversification in the gray zone is unpredictable and heavily impacted by gene flow, leading to two key phenomena that confound species delimitation: (1) the artifactual branch effect, where admixed lineages are inferred as long, early-diverging branches, creating an illusion of deep divergence; and (2) the species-definition anomaly zone, where intraspecific pairwise sequence distances exceed interspecific ones. We further demonstrate that divergence in the gray zone varies among metrics and genomic regions, reflecting heterogeneity in evolutionary dynamics across the genome. Different genomic markers also vary considerably in phylogenetic discordance and their ability to retain signatures of gene flow. Loci from anchored hybrid enrichment (AHE) and ultraconserved elements (UCE) produced less phylogenetic discordance and retained signals of older introgression but failed to detect recent migration, making them suitable for phylogenetic reconstruction and inferring ancient introgression, but not ongoing gene flow. Recognizing the central role of gene flow reframes our understanding of cryptic species; rather than being considered as genetically distinct units that failed to evolve morphological differentiation, they are manifestations of continuous diversification in the gray zone. This shift in perspective offers a new and dynamic evolutionary framework for identifying and interpreting cryptic biodiversity across the Tree of Life.","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":"15 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145971938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Phylogenetic Inference from Atomised 3D Morphometric Data: a Case Study using Kangaroos. 从原子化三维形态测量数据的系统发育推断：袋鼠的案例研究。

IF 5.7 1区生物学 Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2025-12-31 DOI: 10.1093/sysbio/syaf091

Mélina A Celik, Carmelo Fruciano, Kaylene Butler, Vera Weisbecker, Matthew J Phillips

Reconstructing phylogeny from morphological data remains mired in investigator biases, including subjective inclusion and discretisation of phenotypic variation. Geometric morphometrics and multivariate statistical analyses provide an alternative array of tools for studying variation in morphological traits. However, direct analysis of landmark data is often unreliable for phylogeny reconstruction. Morphological variation is typically highly correlated among nearby landmarks and may evolve saltationally between adaptive peaks instead of gradually, thereby violating the assumptions of typical continuous models. To address these concerns, we developed an approach to more objectively discretise morphometric data and applied it to 3D surface scans of mandibles and postcranial elements of Macropodiformes (kangaroos, bettongs and rat-kangaroos). The scanned elements were partitioned into sets of locally co-varying landmarks which approximate functional units. These subregions were discretised into "atomised" characters using novel approaches to combine the objectivity of continuous shape variation for delineating discrete states with the model flexibility offered for multistate and binary characters. This allows us to (1) potentially reduce the influence of non-independence among neighbouring landmarks, (2) accommodate multimodal variation from saltational evolution, (3) accommodate missing data, such as from fragmentary fossils, and (4) promote tree-search efficiency. We built discrete morphological character matrices using three alternative approaches: commonly used clustering algorithms (UPGMA, k-means, k-medoids, Gaussian mixture modelling), a minimum evolution branch length criterion, and a tree sampling procedure. Our phylogenetic analyses with these novel matrices generally succeeded in recovering genera and several deep-level macropodiform clades, but failed to accurately reconstruct intergeneric relationships within the rapid diversification of the macropodine sub-family; those relationships were also not recovered with continuous morphological data or traditionally discretised characters and are the most poorly resolved with DNA data. On balance, our atomised characters, which derive from only mandibular and three postcranial elements, show promise for improving objectivity, accuracy and clocklikeness in morphological phylogenetics and provide pathways for accommodating correlated homoplasy and for more accurately estimating rates of morphological evolution, and thereby better integrating phenotypic and genomic data for phylogenetic inference.

从形态学数据重建系统发育仍然陷入研究者偏见的困境，包括主观纳入和表型变异的离散化。几何形态计量学和多元统计分析为研究形态特征的变异提供了另一种方法。然而，直接分析里程碑数据对于系统发育重建通常是不可靠的。形态学变化通常在附近的地标之间高度相关，并且可能在自适应峰值之间跳跃式进化，而不是逐渐进化，从而违反了典型连续模型的假设。为了解决这些问题，我们开发了一种更客观地离散形态测量数据的方法，并将其应用于大足类动物（袋鼠、袋鼠和鼠袋鼠）下颌骨和颅后元素的3D表面扫描。扫描的元素被分割成局部共变的标记集，这些标记集近似于功能单元。这些子区域被离散为“原子化”字符，使用新方法将描述离散状态的连续形状变化的客观性与多状态和二元字符提供的模型灵活性相结合。这使我们能够(1)潜在地减少相邻地标之间不独立性的影响，(2)适应来自salsaltevolution的多模态变化，(3)适应缺失的数据，例如来自碎片化石的数据，以及(4)提高树搜索效率。我们使用三种可选方法构建离散形态特征矩阵：常用的聚类算法（UPGMA, k-means, k-medoids，高斯混合建模），最小进化分支长度标准和树采样程序。我们用这些新基质进行的系统发育分析通常成功地恢复了属和几个深层大足类分支，但未能准确地重建大足类亚科快速多样化中的属间关系；这些关系也不能用连续的形态数据或传统上离散的特征来恢复，并且用DNA数据最难解决。总的来说，我们的原子化特征，仅来自下颌和三个后颅元素，显示出在形态系统发育中提高客观性，准确性和顺时针性的希望，并为适应相关的同源性和更准确地估计形态进化率提供了途径，从而更好地整合表型和基因组数据进行系统发育推断。

{"title":"Phylogenetic Inference from Atomised 3D Morphometric Data: a Case Study using Kangaroos.","authors":"Mélina A Celik, Carmelo Fruciano, Kaylene Butler, Vera Weisbecker, Matthew J Phillips","doi":"10.1093/sysbio/syaf091","DOIUrl":"https://doi.org/10.1093/sysbio/syaf091","url":null,"abstract":"Reconstructing phylogeny from morphological data remains mired in investigator biases, including subjective inclusion and discretisation of phenotypic variation. Geometric morphometrics and multivariate statistical analyses provide an alternative array of tools for studying variation in morphological traits. However, direct analysis of landmark data is often unreliable for phylogeny reconstruction. Morphological variation is typically highly correlated among nearby landmarks and may evolve saltationally between adaptive peaks instead of gradually, thereby violating the assumptions of typical continuous models. To address these concerns, we developed an approach to more objectively discretise morphometric data and applied it to 3D surface scans of mandibles and postcranial elements of Macropodiformes (kangaroos, bettongs and rat-kangaroos). The scanned elements were partitioned into sets of locally co-varying landmarks which approximate functional units. These subregions were discretised into \"atomised\" characters using novel approaches to combine the objectivity of continuous shape variation for delineating discrete states with the model flexibility offered for multistate and binary characters. This allows us to (1) potentially reduce the influence of non-independence among neighbouring landmarks, (2) accommodate multimodal variation from saltational evolution, (3) accommodate missing data, such as from fragmentary fossils, and (4) promote tree-search efficiency. We built discrete morphological character matrices using three alternative approaches: commonly used clustering algorithms (UPGMA, k-means, k-medoids, Gaussian mixture modelling), a minimum evolution branch length criterion, and a tree sampling procedure. Our phylogenetic analyses with these novel matrices generally succeeded in recovering genera and several deep-level macropodiform clades, but failed to accurately reconstruct intergeneric relationships within the rapid diversification of the macropodine sub-family; those relationships were also not recovered with continuous morphological data or traditionally discretised characters and are the most poorly resolved with DNA data. On balance, our atomised characters, which derive from only mandibular and three postcranial elements, show promise for improving objectivity, accuracy and clocklikeness in morphological phylogenetics and provide pathways for accommodating correlated homoplasy and for more accurately estimating rates of morphological evolution, and thereby better integrating phenotypic and genomic data for phylogenetic inference.","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145865621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

GroupStruct2: A User-Friendly Graphical User Interface for Statistical and Visual Support in Species Diagnosis GroupStruct2：一个用户友好的图形用户界面，用于统计和可视化支持物种诊断

IF 6.5 1区生物学 Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2025-12-19 DOI: 10.1093/sysbio/syaf090

Kin Onn Chan, L Lee Grismer

Statistically defensible species diagnoses are essential for producing robust taxonomies, which underpin much of biological research. Yet, most species descriptions remain largely descriptive, often lack rigorous statistical validation, and suffer from confounding factors stemming from sampling bias, geographic variation, and intraspecific diversity. These limitations are further compounded by the steep learning curve of advanced statistical and data visualization tools, which poses a major barrier for early-career scientists and researchers in under-resourced regions who may lack programming experience or access to expert support. To address these challenges, we developed GroupStruct2, a powerful yet accessible R-based Shiny application that democratizes robust statistical analysis and data visualization for species diagnosis across any taxonomic group. GroupStruct2 is implemented through a user-friendly graphical user interface (GUI) that requires no coding experience. The user-friendly interface intuitively guides users through a comprehensive workflow, from raw data upload and outlier detection to assumption testing, adaptive statistical analyses, and the generation of highly customizable, publication-ready visualizations based on the ggplot2 architecture. It supports allometric body-size correction and widely used dimension-reduction techniques, including Principal Component Analysis (PCA), Discriminant Analysis of Principal Components (DAPC), and, critically, Multiple Factor Analysis (MFA), which enables the joint analysis of meristic, morphometric, and categorical trait data within a single integrative taxonomic framework. We showcase GroupStruct2’s capabilities using two empirical datasets, demonstrating how to conduct robust statistical analyses and produce publication-quality visualizations in just a few clicks. By lowering technical barriers without compromising analytical rigor, GroupStruct2 empowers researchers of all backgrounds working on any taxonomic group to conduct statistically sound and produce visually compelling species diagnoses, thereby advancing both the accessibility and the scientific rigor of taxonomy, species delimitation, and biodiversity research.

在统计学上站得住脚的物种诊断对于产生稳健的分类学至关重要，而分类学是许多生物学研究的基础。然而，大多数物种描述仍然在很大程度上是描述性的，往往缺乏严格的统计验证，并受到来自采样偏差、地理差异和种内多样性的混杂因素的影响。高级统计和数据可视化工具的陡峭学习曲线进一步加剧了这些限制，这对资源不足地区的早期职业科学家和研究人员构成了主要障碍，他们可能缺乏编程经验或无法获得专家支持。为了应对这些挑战，我们开发了GroupStruct2，这是一个功能强大且易于访问的基于r的Shiny应用程序，它使强大的统计分析和数据可视化民主化，可以跨任何分类组进行物种诊断。GroupStruct2是通过用户友好的图形用户界面（GUI）实现的，不需要编码经验。用户友好的界面直观地引导用户完成一个全面的工作流程，从原始数据上传和离群值检测到假设测试、自适应统计分析，以及基于ggplot2架构生成高度可定制的、可发布的可视化。它支持异速体尺寸校正和广泛使用的降维技术，包括主成分分析（PCA），主成分判别分析（DAPC），以及重要的多因素分析（MFA），它可以在一个单一的综合分类框架内联合分析分生，形态计量和分类性状数据。我们使用两个经验数据集展示了GroupStruct2的功能，演示了如何进行可靠的统计分析，并只需点击几下即可生成出版物质量的可视化。通过降低技术障碍而不影响分析的严谨性，GroupStruct2使所有背景的研究人员都能进行统计上合理的研究，并产生视觉上引人注目的物种诊断，从而提高分类学、物种划分和生物多样性研究的可及性和科学严谨性。

{"title":"GroupStruct2: A User-Friendly Graphical User Interface for Statistical and Visual Support in Species Diagnosis","authors":"Kin Onn Chan, L Lee Grismer","doi":"10.1093/sysbio/syaf090","DOIUrl":"https://doi.org/10.1093/sysbio/syaf090","url":null,"abstract":"Statistically defensible species diagnoses are essential for producing robust taxonomies, which underpin much of biological research. Yet, most species descriptions remain largely descriptive, often lack rigorous statistical validation, and suffer from confounding factors stemming from sampling bias, geographic variation, and intraspecific diversity. These limitations are further compounded by the steep learning curve of advanced statistical and data visualization tools, which poses a major barrier for early-career scientists and researchers in under-resourced regions who may lack programming experience or access to expert support. To address these challenges, we developed GroupStruct2, a powerful yet accessible R-based Shiny application that democratizes robust statistical analysis and data visualization for species diagnosis across any taxonomic group. GroupStruct2 is implemented through a user-friendly graphical user interface (GUI) that requires no coding experience. The user-friendly interface intuitively guides users through a comprehensive workflow, from raw data upload and outlier detection to assumption testing, adaptive statistical analyses, and the generation of highly customizable, publication-ready visualizations based on the ggplot2 architecture. It supports allometric body-size correction and widely used dimension-reduction techniques, including Principal Component Analysis (PCA), Discriminant Analysis of Principal Components (DAPC), and, critically, Multiple Factor Analysis (MFA), which enables the joint analysis of meristic, morphometric, and categorical trait data within a single integrative taxonomic framework. We showcase GroupStruct2’s capabilities using two empirical datasets, demonstrating how to conduct robust statistical analyses and produce publication-quality visualizations in just a few clicks. By lowering technical barriers without compromising analytical rigor, GroupStruct2 empowers researchers of all backgrounds working on any taxonomic group to conduct statistically sound and produce visually compelling species diagnoses, thereby advancing both the accessibility and the scientific rigor of taxonomy, species delimitation, and biodiversity research.","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":"46 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145801174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0