Systematic Biology: Latest Articles, Page 10

Unravelling the Effects of Ecology and Evolutionary History in the Phenotypic Convergence of Fishes.

IF 6.5 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2025-05-13 DOI: 10.1093/sysbio/syaf034

Jennifer R Hodge,Danielle S Adams,Keiffer L Williams,Laura R V Alencar,Benjamin Camper,Olivier Larouche,Mason A Thurman,Katerina Zapfe,Samantha A Price

Understanding the ecological drivers and limitations of adaptive convergence is a fundamental challenge. Here, we explore how adaptive convergence of planktivorous fishes has been influenced by multiple ecological factors, evolutionary history, and chance. Using ecomorphological data for over 1600 marine species, we integrate pattern-based metrics of convergence with evolutionary model fitting to test whether phenotypic similarities among specialist planktivores exceed expectations under null models and whether ecology, evolutionary history, or their combined effects best explain trait evolution. We find that planktivores are significantly more similar in phenotype than expected. Traits with functional relevance for prey detection and capture, such as eye diameter and lower jaw length, are strongly convergent, while general body size and shape are constrained by deep divisions between clades where the effects of evolutionary history are most pronounced. Since not all traits undergo strong selection toward a convergent ecomorph, their evolutionary trajectories have not entirely overcome ancestral differences in the multivariate trait space, resulting in a specific form of convergence termed conservatism. We show how adaptive responses to feeding ecology intertwine with other ecological pressures (i.e., light environment) and historical contingency to shape fish phenotype evolution over deep time, offering key insights into the generality of phenotypic evolution.

Citations: 0

Phylogenomic and Population Genomic Analyses of Ultraconserved Elements Reveal Deep Coalescence and Introgression Shaped Diversification Patterns in Lamprologine Cichlids of the Congo River.

IF 6.5 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2025-05-13 DOI: 10.1093/sysbio/syaf032

Fernando Alda,S Elizabeth Alter,Naoko P Kurata,Prosanta Chakrabarty,Melanie L J Stiassny

Understanding the drivers of diversification is a central goal in evolutionary biology but can be challenging when lineages radiate quickly and/or hybridize frequently. Cichlids in the tribe Lamprologini, an exceptionally diverse clade found in the Congo basin, exemplify these issues: their evolutionary history has been difficult to untangle with previous datasets, particularly with regard to river-dwelling lineages in the genus Lamprologus. This clade notably includes the only known blind and depigmented cichlid, L. lethops. Here, we reconstructed the evolutionary, population, and biogeographic history of a Lamprologus clade from the Congo River by leveraging genomic data and sampling over 50 lamprologine species from the entire Lake Tanganyika radiation. This study provides the most comprehensive species-level coverage to date of the riverine taxa within this lacustrine-origin clade. We found that in the mid-late Pliocene, two lineages of Lake Tanganyika lamprologines independently colonized the Congo River, where they subsequently hybridized and diversified, forming the current monophyletic group of riverine Lamprologus. Our estimates for divergence time and introgression align with the region's geological history and suggest rapid speciation in Lamprologus species from the Congo River marked by rapids-driven vicariance and water level fluctuations, and repeated episodes of secondary contact and reticulation. As a result of our analyses, we propose the taxonomic restriction of the genus Lamprologus to Congo River taxa only. The complex evolutionary history of this group, characterized by introgressive hybridization followed by a rapid series of isolation and reconnection, illustrates the multifaceted dynamics of speciation that have shaped the rich biodiversity of this region. [African cichlids; Congo River; diversification; hybridization; Lamprologini; phylogenomics; UCEs; ultraconserved elements].

Citations: 0

CAnDI: a new tool to investigate conflict in homologous gene trees and explain convergent trait evolution

IF 6.5 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2025-05-08 DOI: 10.1093/sysbio/syaf028

Holly M Robertson, Joseph F Walker, Edwige Moyroud

Phenotypic convergence is found across the tree of life, and morphological similarities in distantly related species are often presumed to have evolved independently. However, clarifying the origins of traits has recently highlighted the complex nature of evolution, as apparent convergent features often share similar genetic foundations. Hence, the tree topology of genes that underlie such traits frequently conflicts with the overall history of species relationships. This conflict, which usually results from incomplete lineage sorting, introgression or horizontal gene transfer, creates both a challenge for systematists and an exciting opportunity to investigate the rich, complex network of information that connects molecular trajectories with trait evolution. Here we present a novel conflict identification program named CAnDI (Conflict And Duplication Identifier), which enables the analysis of conflict in homologous gene trees rather than inferred orthologs. We demonstrate that the analysis of conflicts in homologous trees using CAnDI yields more comparisons than in ortholog trees in six datasets from across the eukaryotic tree of life. Using the carnivorous trap of Caryophyllales, a charismatic group of flowering plants, as a case study, we demonstrate that analysing conflict on entire homolog trees can aid in inferring the contribution of standing genetic variation to trait evolution: by dissecting all gene relationships within homolog trees, we find genomic evidence that the molecular basis of the plesiomorphic mucilaginous sticky trap was likely present in the ancestor of all carnivorous Caryophyllales. We also show that many genes whose evolutionary trajectories group species with similar trap devices code for proteins contributing to plant carnivory and identify a LATERAL ORGAN BOUNDARY DOMAIN transcription factor as a possible candidate for regulating sticky trap development.

Citations: 0

Introgression across narrow contact zones shapes the genomic landscape of phylogenetic variation in an African bird clade

IF 6.5 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2025-05-07 DOI: 10.1093/sysbio/syaf033

Loïs Rancilhac, Stacey G de Souza, Sifiso M Lukhele, Matteo Sebastianelli, Bridget O Ogolowa, Michaella Moysi, Christos Nikiforou, Tsyon Asfaw, Colleen T Downs, Alan Brelsford, Bridgett M vonHoldt, Alexander N G Kirschel

Genomic analyses of hybrid zones provide excellent opportunities to investigate the consequences of introgression in nature. In combination with phylogenomics analyses, hybrid zone studies may illuminate the role of ancient and contemporary gene flow in shaping variation of phylogenetic signals across the genome, but this avenue has not been explored yet. We combined phylogenomic and geographic cline analyses in a Pogoniulus tinkerbird clade to determine whether contemporary introgression through hybrid zones contributes to gene-tree heterogeneity across the species ranges. We found diverse phylogenetic signals across the genome with the most common topologies supporting monophyly among taxa connected by secondary contact zones. Remarkably, these systematic conflicts were also recovered when selecting only individuals from each taxon's core range. Using analyses of derived allele sharing and “recombination aware” phylogenomics, we found that introgression shapes gene-tree heterogeneity, and the species tree most likely supports monophyletic red-fronted tinkerbirds, as recovered in previous reconstructions based on mitochondrial DNA. Furthermore, by fitting geographic clines across two secondary contact zones, we found that introgression rates were lower in genomic regions supporting the putative species tree compared to those supporting the two taxa in contact as monophyletic. This demonstrates that introgression through narrow contact zones shapes gene-tree heterogeneity even in allopatric populations. Finally, we did not find evidence that mitochondria-interacting nuclear genes acted as barrier loci. Our results show that species can withstand important amounts of introgression while maintaining their phenotypic integrity and ecological separation, raising questions regarding the genomic architecture of adaptation and barriers to gene flow.

Citations: 0

Estimating Genome-wide Phylogenies Using Probabilistic Topic Modeling

IF 6.5 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2025-05-05 DOI: 10.1093/sysbio/syaf015

Marzieh Khodaei, Scott V Edwards, Peter Beerli

Methods for rapidly inferring the evolutionary history of species or populations with genome-wide data are progressing, but computational constraints still limit our abilities in this area. We developed an alignment-free method to infer genome-wide phylogenies and implemented it in the Python package TopicContml. The method uses probabilistic topic modeling (specifically, Latent Dirichlet Allocation or LDA) to extract ‘topic’ frequencies from k-mers, which are derived from multilocus DNA sequences. These extracted frequencies then serve as an input for the program Contml in the PHYLIP package, which is used to generate a species tree. We evaluated the performance of TopicContml on simulated datasets with gaps and three biological datasets: (1) 14 DNA sequence loci from two Australian bird species distributed across nine populations, (2) 5162 loci from 80 mammal species, and (3) raw, unaligned, non-orthologous PacBio sequences from 12 bird species. We also assessed the uncertainty of the estimated relationships among clades using a bootstrap procedure. Our empirical results and simulated data suggest that our method is efficient and statistically robust.
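
The first step the abstract describes, turning unaligned sequences into k-mer counts that a topic model can consume, can be sketched as follows. This is a minimal illustration only, not the TopicContml implementation: the function names, the toy input, and the choice to skip windows containing gaps or ambiguous bases are all assumptions made here.

```python
from collections import Counter


def kmer_counts(sequence, k):
    """Count overlapping k-mers, skipping windows with gaps or ambiguous bases."""
    seq = sequence.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1
    return counts


def document_term_matrix(documents, k):
    """Pool k-mer counts per document and return (vocabulary, rows).

    `documents` maps a label (for example a population name) to a list of
    sequences; each row holds that label's counts in vocabulary order.
    """
    pooled = {}
    for name, sequences in documents.items():
        total = Counter()
        for seq in sequences:
            total.update(kmer_counts(seq, k))
        pooled[name] = total
    vocabulary = sorted(set().union(*pooled.values()))
    rows = {name: [c[kmer] for kmer in vocabulary] for name, c in pooled.items()}
    return vocabulary, rows


# Hypothetical toy input: two populations with one short locus each.
vocab, rows = document_term_matrix({"pop1": ["ACGT"], "pop2": ["ACAC"]}, k=2)
```

A count matrix of this shape is the usual input to an LDA implementation; the resulting per-document topic frequencies would then play the role of the frequencies that the abstract says are passed to Contml.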

Citations: 0

PickMe: Sample selection for species tree reconstruction using coalescent weighted quartets

IF 6.5 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2025-05-05 DOI: 10.1093/sysbio/syaf017

Joseph Rusinko, Yu Cai, Allison Crysler, Katherine Thompson, Julien Boutte, Mark Fishbein, Shannon C K Straub

After collecting large data sets for phylogenomics studies, researchers must decide which genes or samples to include when reconstructing a species tree. Incomplete or unreliable data sets make the empiricist’s decision more difficult. Researchers rely on ad hoc strategies to maximize sampling while ensuring sufficient data for accurate inferences. An algorithm called PickMe formalizes the sample selection process, assuming that the samples evolved under the Tree Multispecies Coalescent model. We propose a Bayesian framework for selecting samples for species tree analysis. Given a collection of gene trees, we compute a posterior probability for each quartet, describing the likelihood that the species tree displays this topology. From this, we assign individual samples reliability scores computed as the average of a scaled version of the posterior probabilities. PickMe uses these weights to recommend which samples to include in a species tree analysis. Analysis of simulated data showed that including the samples suggested by PickMe produced species trees closer to the true species trees than both unfiltered data sets and data sets with ad hoc gene occupancy cut-offs applied. To further illustrate the efficacy of this tool, we apply PickMe to gene trees generated from target capture data from milkweeds. PickMe indicates more samples could have reliably been included in a previous milkweed phylogenomic analysis than the authors analyzed without access to a formal methodology for sample selection. Using simulated and empirical data, we also compare PickMe to existing sample selection methods. Inclusion of PickMe will enhance phylogenomics data analysis pipelines by providing a formal structure for sample selection.
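
The scoring step in the abstract (a per-sample reliability score taken as the average of scaled quartet posteriors) can be sketched roughly as below. This is not PickMe's code: the abstract does not say how the posteriors are computed or scaled, so the posteriors are taken as given, the `scale` argument defaults to the identity as a placeholder, and the function name and toy values are hypothetical.

```python
from collections import defaultdict


def sample_reliability(quartet_posteriors, scale=lambda p: p):
    """Average the (scaled) posterior of every quartet a sample appears in.

    `quartet_posteriors` maps a frozenset of four sample names to the
    posterior probability assigned to that quartet. The identity `scale`
    is an assumption; the real scaling is not described in the abstract.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for quartet, posterior in quartet_posteriors.items():
        for sample in quartet:
            totals[sample] += scale(posterior)
            counts[sample] += 1
    return {sample: totals[sample] / counts[sample] for sample in totals}


# Hypothetical toy values: sample D sits only in a well-supported quartet,
# sample E only in a poorly supported one.
scores = sample_reliability({
    frozenset("ABCD"): 0.75,
    frozenset("ABCE"): 0.25,
})
```

Samples could then be ranked by score and those below some threshold flagged for exclusion; how PickMe turns scores into recommendations is not specified in the abstract.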

Citations: 0

Evolutionary Tempo, Supertaxa and Living Fossils

IF 6.5 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2025-05-05 DOI: 10.1093/sysbio/syaf020

Graham E Budd, Richard P Mann

A relationship between the rate of molecular change and diversification has long been discussed, on both theoretical and empirical grounds. However, the effect on our understanding of evolutionary patterns is yet to be fully explored. Here we develop a new model, the Covariant Evolutionary Tempo (CET) model, with the aim of integrating patterns of diversification and molecular evolution within a framework of a continuously changing ‘tempo’ variable that acts as a master control for molecular, morphological and diversification rates. Importantly, tempo itself is treated as being variable at a rate proportional to its own value. This model predicts that diversity is dominated by a small number of extremely large clades at any historical epoch including the present; that these large clades are expected to be characterised by explosive early radiations accompanied by elevated rates of molecular evolution; and that extant organisms are likely to have evolved from species with unusually fast evolutionary rates. Under such a model, the amount of molecular change along a particular lineage is essentially independent of its height, which weakens the molecular clock hypothesis. Finally, our model explains the existence of ‘living fossil’ sister groups to large clades that are species poor and exhibit slow rates of morphological and molecular change. Our results demonstrate that the observed historical patterns of evolution can be modelled without invoking special evolutionary mechanisms or innovations that are unique to specific times or taxa, even when they are highly non-uniform.
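
One possible reading of "variable at a rate proportional to its own value" is a driftless multiplicative random walk (geometric Brownian motion), in which the size of each change in tempo scales with the current tempo. The sketch below simulates that reading along a single lineage. It is an assumption made for illustration, not the CET model as specified by the authors, and the function name and parameters `sigma`, `dt`, and `seed` are hypothetical.

```python
import math
import random


def simulate_tempo(initial_tempo, sigma, dt, steps, seed=0):
    """Simulate a driftless geometric Brownian motion for a tempo variable.

    Each step multiplies tempo by a log-normal factor, so fluctuations are
    proportional to the current value and tempo stays strictly positive.
    """
    rng = random.Random(seed)
    tempo = initial_tempo
    path = [tempo]
    for _ in range(steps):
        # Gaussian step on the log scale; the -0.5*sigma^2*dt term removes
        # drift so the expected tempo stays at its current value.
        shock = sigma * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        tempo *= math.exp(shock - 0.5 * sigma ** 2 * dt)
        path.append(tempo)
    return path


# Hypothetical parameters chosen only to produce a visible spread.
path = simulate_tempo(initial_tempo=1.0, sigma=0.5, dt=0.1, steps=100)
```

Under this kind of process most simulated lineages drift toward low tempo while a few reach very high values, which is at least qualitatively in line with the abstract's picture of a few explosive clades alongside slow-changing ones.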



Citations: 0

Non-adaptive Radiation Promotes Phenotypic Diversification and Convergent Evolution of Aposematic Mimicry in a Highly Diverse Genus of Megaloptera

IF 6.5, Zone 1 (Biology), Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2025-05-04 DOI: 10.1093/sysbio/syaf030

Yuezheng Tu, Xuankun Li, Fumio Hayashi, Feng Zhang, Ding Yang, Fabien L Condamine, Xingyue Liu

Evolutionary radiations are considered key processes underlying the origin of biodiversity. Notably, the mechanisms driving these radiations can vary across organisms and often involve a complex interplay of abiotic and biotic factors. Empirical studies of evolutionary history are crucial for validating multiple hypotheses regarding the mode of evolutionary radiations. Within the aquatic insect order Megaloptera, the genus Protohermes is the most speciose clade, with 90 described species accounting for around 22% of the total ordinal diversity. Protohermes species are characterised by limited dispersal ability, occur primarily across the Oriental region, and show a range of diversified phenotypes, e.g., highly divergent genital characters and mimetic coloration alongside shifts in biological rhythm from nocturnal to diurnal activity. Here we infer the spatiotemporal mode of diversification and associated driving factors of the Protohermes radiation as a test case for exploring the processes and potential mechanisms of evolutionary radiations. We present the first time-calibrated phylogeny of Protohermes using genome-scale data of ultraconserved elements (UCEs) and mitochondrial genes with a comprehensive taxon sampling. Our results reveal a mid-Cretaceous stem age of Protohermes, followed by a recent and steady diversification during the Neogene. Estimation of historical biogeography suggests the genus likely originated from a broad range including the Himalayas-Hengduan Mountains + Indochina + Borneo, with the first two areas serving as the center of early diversification. Our results further suggest that vicariance events, likely attributed to the Cenozoic Himalayan orogeny as well as climate change in East Asia, triggered speciation that coincided with the accumulation of genital divergence. Further enhancement of genital and phenotypic diversification may have been promoted by secondary contacts of allopatric or parapatric lineages following the build-up of species richness, likely facilitating species coexistence and lineage accumulation. We argue that the current species diversity of Protohermes likely resulted from a non-adaptive radiation. Our results highlight the role of geographic vicariance and sexual selection in driving the species and phenotypic diversification in insects.



Citations: 0

Rampant Reticulation in a Rapid Radiation of Tropical Trees - Insights from Inga (Fabaceae)

IF 6.5, Zone 1 (Biology), Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2025-05-04 DOI: 10.1093/sysbio/syaf027

Rowan J Schley, Rosalía Piñeiro, James A Nicholls, Flávia Fonseca Pezzini, Audrey Farbos, Gwilym P Lewis, Jens J Ringelberg, Catherine Kidner, Alex D Twyford, Kyle G Dexter, R Toby Pennington

Evolutionary radiations underlie much of the species diversity of life on Earth, particularly within the world’s most species-rich tree flora – that of the Amazon rainforest. Hybridisation occurs in many radiations, with effects ranging from homogenisation of divergent species to the generation of genetic and phenotypic novelty that fuels speciation. However, the influence of hybridisation on Amazonian tree radiations has been little studied. We address this using the ubiquitous, species-rich, neotropical tree genus Inga, which typifies rapid radiations of rainforest trees. We assess patterns of gene tree incongruence to ascertain whether hybridisation was associated with rapid radiation in Inga. Given the importance of insect herbivory in structuring rainforest tree communities (and hence the potential for hybridisation to promote adaptation through admixture of defence traits), we also test whether introgression of loci underlying chemical defences against herbivory occurred during the radiation of Inga. Our phylogenomic analyses of 189/288 Inga species using >1300 target capture loci showed widespread introgression in Inga. Specifically, we found widespread phylogenetic incongruence explained by introgression, with phylogenetic networks recovering multiple introgression events across Inga and up to 20% of shared, likely introgressed, genetic variation between some species. In addition, most defence chemistry loci showed evidence of positive selection and marginally higher levels of introgression. Overall, our results suggest that introgression has occurred widely over the course of Inga’s history, possibly in a syngameon scenario, likely facilitated by extensive dispersal across Amazonia. Furthermore, in some cases introgression of chemical defence loci may influence adaptation in Inga.
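The abstract reports introgression inferred from gene tree incongruence and phylogenetic networks, but does not describe the tests in detail. As a generic illustration of how excess allele sharing between non-sister taxa can be quantified, here is a minimal sketch of Patterson's D (the ABBA-BABA statistic). This is a swapped-in textbook technique, not necessarily anything the authors used. The function name `abba_baba_d` and the toy sequences are hypothetical.

```python
def abba_baba_d(p1, p2, p3, outgroup):
    """Patterson's D for four aligned sequences with topology (((P1, P2), P3), O).

    Returns (D, n_abba, n_baba); D is None when no informative site exists.
    """
    if not (len(p1) == len(p2) == len(p3) == len(outgroup)):
        raise ValueError("sequences must be aligned to the same length")
    n_abba = n_baba = 0
    for a, b, c, o in zip(p1, p2, p3, outgroup):
        # Skip gaps and ambiguity codes.
        if any(base not in "ACGT" for base in (a, b, c, o)):
            continue
        if a == o and b == c and b != o:
            n_abba += 1  # P2 and P3 share the derived allele
        elif b == o and a == c and a != o:
            n_baba += 1  # P1 and P3 share the derived allele
    total = n_abba + n_baba
    if total == 0:
        return None, n_abba, n_baba
    return (n_abba - n_baba) / total, n_abba, n_baba


if __name__ == "__main__":
    # Hypothetical toy alignment: two ABBA sites, one BABA site, one uninformative site.
    d, n_abba, n_baba = abba_baba_d("AATA", "TGAA", "TGTC", "AAAA")
    print(d, n_abba, n_baba)
```

Under incomplete lineage sorting alone, ABBA and BABA sites are expected in roughly equal numbers, so D stays near zero. A D that differs consistently from zero suggests gene flow between P3 and one of the sister taxa. Real analyses need genome-scale data and a significance test such as a block jackknife, both of which are omitted here.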



Citations: 0

Site-specific structure and stability constrained substitution models improve phylogenetic inference

IF 6.5, Zone 1 (Biology), Q1 EVOLUTIONARY BIOLOGY

Systematic Biology

Pub Date : 2025-04-24 DOI: 10.1093/sysbio/syaf007

Ivan Lorca-Alonso, Otero-de-Navascues Fernando, Miguel Arenas, Ugo Bastolla

In previous studies, we presented our site-specific Stability Constrained substitution models of Protein Evolution (Stab-CPE), which define fitness as the probability of finding a protein folded in its native state but ignore changes in the native structure. Stab-CPE models can be used to predict a more realistic evolutionary variability across protein sites; nevertheless, they still differ qualitatively from observed data and appear too tolerant to mutations. Here we present novel structurally constrained substitution models (Str-CPE) that define fitness based on the structural deformation produced by a mutation, which we predict by adopting an extension of Julián Echave's linearly forced elastic network model. Compared to our previous Stab-CPE models, the novel Str-CPE models are more stringent (they predict lower sequence entropy and substitution rate), assign higher likelihood to multiple sequence alignments (MSAs) that include one or more known structures, and better predict the observed conservation across sites. The models that combine Str-CPE and Stab-CPE models are even more stringent and fit the empirical MSAs better. We collectively refer to our models as Structure and Stability Constrained substitution models of Protein Evolution (SSCPE). When using distantly related proteins, we find that phylogenies inferred under the SSCPE models are more similar to the corresponding reference phylogenies inferred using structural distances than those inferred under traditional empirical substitution models. Therefore, SSCPE models seem to be much better-fitting substitution models for deep phylogeny inference. The SSCPE models have been implemented in the PERL-based program SSCPE.pl, which uses RAxML-NG to infer phylogenies under the SSCPE model given a concatenated MSA and a list of protein structures that match the sequences in the MSA.
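The abstract compares models by the sequence entropy they predict at each site. As a rough illustration of that quantity, the sketch below computes the observed per-column Shannon entropy of a toy alignment. It is not code from SSCPE.pl. The function name `site_entropies`, the gap handling, and the example sequences are assumptions made for illustration.

```python
import math
from collections import Counter


def site_entropies(msa, gap_chars="-."):
    """Shannon entropy (in nats) of each alignment column, ignoring gap characters."""
    if not msa:
        return []
    length = len(msa[0])
    if any(len(seq) != length for seq in msa):
        raise ValueError("all aligned sequences must have the same length")
    entropies = []
    for column in zip(*msa):
        counts = Counter(res for res in column if res not in gap_chars)
        total = sum(counts.values())
        if total == 0:
            # Column made only of gaps: report zero entropy by convention.
            entropies.append(0.0)
            continue
        h = 0.0
        for count in counts.values():
            p = count / total
            h -= p * math.log(p)
        entropies.append(h)
    return entropies


if __name__ == "__main__":
    # Hypothetical toy alignment of four short protein fragments.
    toy_msa = ["MKVL", "MKIL", "MRVL", "MKAL"]
    for position, h in enumerate(site_entropies(toy_msa), start=1):
        print(position, round(h, 3))
```

Fully conserved columns have zero entropy, and columns that tolerate many residues score higher. A substitution model described as "more stringent" predicts site distributions with lower entropy, which can then be compared against observed values like these.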



Citations: 0