
Systematic Biology: latest articles

Improved robustness to gene tree incompleteness, estimation errors, and systematic homology errors with weighted TREE-QMC
IF 6.5 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2025-02-26 DOI: 10.1093/sysbio/syaf009
Yunheng Han, Erin K Molloy
Summary methods are widely used to reconstruct species trees from gene trees while accommodating discordance from incomplete lineage sorting; however, it is increasingly recognized that their accuracy can be negatively impacted by incomplete and/or error-ridden gene trees. To address the latter, Zhang and Mirarab (2022) updated the popular summary method ASTRAL so that it weights quartets based on gene tree branch lengths and support values. The implementation of these weighting schemes presented computational challenges, leading Zhang and Mirarab (2022) to replace ASTRAL’s original algorithm (i.e., computing an exact solution within a constrained search space) with search heuristics based on phylogenetic placement. Here, we show that these weighting schemes can be effectively leveraged within the Quartet Max Cut framework of Snir and Rao (2010), introducing weighted TREE-QMC. The incorporation of weighting schemes into TREE-QMC required only a small increase in time complexity compared to the unweighted algorithm; fortunately, the increase in runtime was also small, behaving more like a constant factor in our simulation study. Moreover, weighted TREE-QMC was fast and highly competitive with weighted ASTRAL, even outperforming it in terms of species tree accuracy on some challenging simulation conditions, such as large numbers of taxa. In reanalyzing two avian data sets, we found that weighting quartets by gene tree branch lengths can improve robustness to systematic homology errors and can be as effective as removing the impacted taxa from individual gene trees or removing the impacted gene trees entirely. Lastly, our study revealed that TREE-QMC was robust to extreme rates of missing taxa, suggesting its utility as a supertree method. Source code for weighted TREE-QMC is available on GitHub (https://github.com/molloy-lab/TREE-QMC).
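To make the idea of quartet weighting concrete, here is a minimal Python sketch. It is not the TREE-QMC or ASTRAL implementation, and it does not reproduce the branch-length or support formulas of Zhang and Mirarab (2022). It only illustrates, under assumed inputs, how per-gene-tree weights can change which of the three possible topologies on four taxa receives the most support. The input format, the function names (`canonical`, `weighted_quartet_votes`, `best_topology`), and the example weights are all hypothetical.

```python
from collections import defaultdict


def canonical(topology):
    """Normalize a quartet topology so that ab|cd and dc|ba compare equal."""
    left, right = topology
    return tuple(sorted((tuple(sorted(left)), tuple(sorted(right)))))


def weighted_quartet_votes(observations):
    """Sum the weights contributed to each quartet topology.

    observations: iterable of (topology, weight) pairs, one per gene tree.
    The weight is assumed to be precomputed (e.g. from branch support).
    """
    votes = defaultdict(float)
    for topology, weight in observations:
        votes[canonical(topology)] += weight
    return dict(votes)


def best_topology(observations):
    """Return the topology with the largest total weight."""
    votes = weighted_quartet_votes(observations)
    return max(votes, key=votes.get)


# Hypothetical data: three gene trees restricted to taxa a, b, c, d.
# One well-supported tree shows ab|cd; two weakly supported trees show ac|bd.
observations = [
    ((("a", "b"), ("c", "d")), 0.95),
    ((("a", "c"), ("b", "d")), 0.30),
    ((("d", "b"), ("c", "a")), 0.40),
]
# The unweighted analysis counts every gene tree equally.
unweighted = [(topology, 1.0) for topology, _ in observations]

print(best_topology(unweighted))    # majority topology by simple counting
print(best_topology(observations))  # topology favored once weights are applied
```

In this toy input the unweighted count favors ac|bd (two trees against one), while the weighted sum favors ab|cd, which is the kind of behavior weighting is meant to provide when low-support gene trees are likely to be estimation errors.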
Citations: 0
Recent, Intricate Speciation in Amazonia Uncovered by a Multilayered Genomic Analysis of Tree Squirrels
IF 6.5 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2025-02-18 DOI: 10.1093/sysbio/syaf013
Edson F Abreu, Joyce R Prado, Jesús E Maldonado, Don E Wilson, Alexandre R Percequillo, Silvia E Pavan
Accurate estimates of species diversity are essential for all biodiversity research. Delimiting species and understanding the underlying processes of speciation are also central components of systematic biology that frame our comprehension of the evolutionary mechanisms generating biodiversity. The South American tree squirrels (genus Guerlinguetus) are keystone mammals, widely distributed, that are critical in tree-seed predation and dispersal in one of the most fragile and threatened ecosystems of the world, the tropical rainforests of South America. We obtained genomic data (ultraconserved elements and single nucleotide polymorphisms) to explore alternative hypotheses on species limits of this genus and to clarify recent and rapid speciation on continental-scale and dynamically evolving landscapes. Using a multilayered genomic approach that integrates fine-scale population genetic analyses with quantitative molecular species delimitation methods, we observed that (i) the most likely number of species within Guerlinguetus is six, contrasting with both classic morphological revision and mitochondrial species delimitation; (ii) incongruencies in species relationships still persist, which might be a response to population migration and gene flow taking place in the lowlands of eastern Amazonia and/or to the extremely rapid successive speciation events; and (iii) effective migration surfaces detected important geographic barriers associated with the major Amazonian riverine systems and the mountain ranges of the Guiana Shield. In conclusion, we uncovered unexpected and higher species diversity in Guerlinguetus and corroborate recent findings suggesting that much of the extant species-level diversity in Amazonia is young, dating back to the Quaternary. We also reinforce long-established hypotheses on the role of rivers and climate-driven forest dynamics in triggering Amazonian speciation.
[gene flow; Guerlinguetus; migration; mito-nuclear discordance; Neotropics; quantitative species delimitation; Sciuridae; ultraconserved elements]
Citations: 0
Global Patterns of Taxonomic Uncertainty and its Impacts on Biodiversity Research
IF 6.5 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2025-02-15 DOI: 10.1093/sysbio/syaf010
Jhonny J M Guedes, Mario R Moura, Lucas Jardim, José Alexandre F Diniz-Filho
Over two million species have been named so far, but many will be invalidated due to redundant descriptions. Undetected invalid species (i.e., synonyms) can impair inferences we make in biodiversity research and hamper the implementation of effective conservation strategies. However, the processes leading to the accumulation of invalid names remain largely unknown. Using multi-model inferences, we investigated the patterns and potential drivers of species- and assemblage-level variation in synonym counts across terrestrial vertebrates globally. We also explored how taxonomic uncertainty (i.e., instability in species identities) can affect latitudinal variation of diversification rates. The average number of synonyms was higher for species described earlier, better represented in scientific collections, with larger geographic ranges, occurring in temperate regions, and in areas of high biodiversity attention. In assemblage-level models, a higher average number of synonyms was associated with temperate regions harbouring more early-described species. Areas of high endemism richness showed fewer synonyms across amphibians and reptiles but had an inverse effect for birds and mammals. Other predictor-response relationships varied across taxonomic groups, biogeographical realm, and spatial grain. Assuming that more synonyms indicate more stable species that have been thoroughly studied and reviewed, high synonym numbers in temperate species and assemblages support claims of a potential latitudinal taxonomy gradient, where geographic variation in taxonomic practice could hinder the proper recognition of tropical species. We show that the accumulation of invalid names is not random and discuss how invalid hidden names can affect biodiversity inferences. 
A potential approach to address this problem would be developing a taxonomic uncertainty metric that could be incorporated into models (i.e., as weights to account for varying degrees of uncertainty during the fitting process). Our study provides an initial approximation and highlights the often-neglected issue of uncertainty and instability in species identities from a macroecological perspective.
Citations: 0
UnFATE: A Comprehensive Probe Set and Bioinformatics Pipeline for Phylogeny Reconstruction and Multilocus Barcoding of Filamentous Ascomycetes (Ascomycota, Pezizomycotina)
IF 6.5 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2025-02-15 DOI: 10.1093/sysbio/syaf011
Claudio G Ametrano, Jacob Jensen, H Thorsten Lumbsch, Felix Grewe
The subphylum Pezizomycotina (filamentous ascomycetes) is the largest clade within Ascomycota. Despite the importance of this group of fungi, our understanding of their evolution is still limited due to insufficient taxon sampling. Although next-generation sequencing technology allows us to obtain complete genomes for phylogenetic analyses, generating complete genomes of fungal species can be challenging, especially when fungi occur in symbiotic relationships or when the DNA of rare herbarium specimens is degraded or contaminated. Additionally, assembly, annotation, and gene extraction of whole-genome sequencing data require bioinformatics skills and computational power, resulting in a substantial data burden. To overcome these obstacles, we designed a universal target enrichment probe set to reconstruct the phylogenetic relationships of filamentous ascomycetes at different phylogenetic levels. From a pool of single-copy orthologous genes extracted from available Pezizomycotina genomes, we identified the smallest subset of genetic markers that can reliably reconstruct a robust phylogeny. We used a clustering approach to identify a sequence set that could provide an optimal trade-off between potential missing data and probe set cost. We incorporated this probe set into a user-friendly wrapper script named UnFATE (https://github.com/claudioametrano/UnFATE) that allows phylogenomic inferences without requiring expert bioinformatics knowledge. In addition to phylogenetic results, the software provides a powerful multilocus alternative to ITS-based barcoding. Phylogeny and barcoding approaches can be complemented by an integrated, pre-processed, and periodically updated database of all publicly available Pezizomycotina genomes. The UnFATE pipeline, using the 195 selected marker genes, consistently performed well across various phylogenetic depths, generating trees consistent with the reference phylogenomic inferences. 
The topological distance between the reference trees from the literature and the best tree produced by UnFATE ranged between 0.10 and 0.14 (normalized Robinson-Foulds distance, nRF) for phylogenies from family to subphylum level. We also tested the in vitro success of the universal bait set in a target capture approach on 25 herbarium specimens from ten representative classes in Pezizomycotina, which recovered a topology congruent with recent phylogenomic inferences for this group of fungi. The discriminating power of our gene set was also assessed by the multilocus barcoding approach, which outperformed the barcoding approach based on ITS. With these tools, we aim to provide a framework for a collaborative approach to build robust, conclusive phylogenies of this important fungal clade.
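For readers unfamiliar with the nRF values quoted above, the following Python sketch shows one common way to compute a normalized Robinson-Foulds distance between two trees on the same taxon set. It is an illustration only and is not taken from the UnFATE pipeline. The nested-tuple tree encoding and the function names are assumptions, and so is the normalization (dividing by the total number of nontrivial splits in both trees), since the abstract does not state which normalization was used.

```python
def leaf_set(tree):
    """Collect the leaf names of a tree given as nested tuples of strings."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))


def splits(tree):
    """Return the nontrivial bipartitions of the tree, viewed as unrooted.

    Each split is stored as the side that does not contain the anchor taxon,
    so the two children of a bifurcating root yield the same split only once.
    """
    taxa = leaf_set(tree)
    anchor = min(taxa)
    found = set()

    def visit(node):
        if isinstance(node, str):
            return frozenset([node])
        below = frozenset().union(*(visit(child) for child in node))
        side = taxa - below if anchor in below else below
        # Keep only splits with at least two taxa on each side.
        if 2 <= len(side) <= len(taxa) - 2:
            found.add(side)
        return below

    visit(tree)
    return found


def normalized_rf(tree_a, tree_b):
    """Symmetric difference of splits divided by the total number of splits."""
    a, b = splits(tree_a), splits(tree_b)
    total = len(a) + len(b)
    return len(a ^ b) / total if total else 0.0


# Hypothetical six-taxon trees that differ in the placement of b and c.
reference = ((("a", "b"), "c"), ("d", ("e", "f")))
estimate = ((("a", "c"), "b"), ("d", ("e", "f")))
print(normalized_rf(reference, estimate))
```

A value of 0 means the two trees share every split, and a value of 1 means they share none, so distances of 0.10 to 0.14 correspond to trees that agree on most internal branches.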
Citations: 0
The consequences of budding speciation on trees
IF 6.5 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2025-02-12 DOI: 10.1093/sysbio/syaf012
C Tomomi Parins-Fukuchi, James G Saulsbury
Paleobiologists have long sought to explain how alternative modes of speciation, including budding and bifurcating cladogenesis, shape patterns of evolution. Methods introduced over the past decade have paved the way for a renewed enthusiasm for exploring modes of speciation in the fossil record. However, the field does not yet have a strong intuition for how ancestor-descendant relationships, especially those that arise from budding speciation, might influence the shape of trees reconstructed for fossil or living clades. We developed a simulation approach based on classic paleobiological theory to ask what proportion of ancestral nodes in paleontological phylogenies are expected to correspond to sampled taxa under a range of preservational regimes. We compared our simulated results to empirical estimates of absolute fossil record completeness gathered from the literature and found that many fossilized clades of marine invertebrates are likely to display upwards of 80% sampled ancestors. Under a primarily budding model, phylogenies where 100% of the internal nodes correspond to named species are very possible for well-sampled clades at local and regional scales. We also leveraged our simulation approach to ask how budding might shape extant clades. We found that the ancestral signature of budding causes rampant hard polytomies (i.e., multifurcations), greatly impacting the shape of extant clades. Our results highlight how budding can yield dramatic and unrecognized effects on phylogenetic reconstruction of clades of both living and extinct organisms.
Citations: 0
The Fossilized Birth-Death Model Is Identifiable.
IF 5.7 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2025-02-10 DOI: 10.1093/sysbio/syae058
Kate Truman, Timothy G Vaughan, Alex Gavryushkin, Alexandra Sasha Gavryushkina

Time-dependent birth-death sampling models have been used in numerous studies to infer past evolutionary dynamics in different biological contexts, for example, speciation and extinction rates in macroevolutionary studies, or effective reproductive number in epidemiological studies. These models are branching processes where lineages can bifurcate, die, or be sampled with time-dependent birth, death, and sampling rates, generating phylogenetic trees. It has been shown that in some subclasses of such models, different sets of rates can result in the same distributions of reconstructed phylogenetic trees, and therefore, the rates become unidentifiable from the trees regardless of their size. Here, we show that widely used time-dependent fossilized birth-death (FBD) models are identifiable. This subclass of models makes more realistic assumptions about the fossilization process and certain infectious disease transmission processes than the unidentifiable birth-death sampling models. Namely, FBD models assume that sampled lineages stay in the process rather than being immediately removed upon sampling. The identifiability of the time-dependent FBD model justifies using statistical methods that implement this model to infer the underlying temporal diversification or epidemiological dynamics from phylogenetic trees or directly from molecular or other comparative data. We further show that the time-dependent FBD model with an extra parameter, the removal-after-sampling probability, is unidentifiable. This implies that in scenarios where we do not know how sampling affects lineages, we are unable to infer this extra parameter together with birth, death, and sampling rates solely from trees.
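The distinction between sampled lineages staying in the process and being removed upon sampling can be illustrated with a small simulation. The Python sketch below is a simplification written for this listing, not the authors' method: it uses constant rather than time-dependent rates, tracks only lineage counts rather than building trees, and says nothing about identifiability. The function name `simulate_sampling_process` and all parameter values are hypothetical. Setting `removal_prob` to 0 corresponds to the FBD assumption that sampled lineages persist, and setting it to 1 corresponds to removing every lineage when it is sampled.

```python
import random


def simulate_sampling_process(birth, death, sampling, removal_prob,
                              t_max, seed=0, max_lineages=10_000):
    """Gillespie-style simulation of lineage counts under constant rates.

    Returns (lineages alive at the end, sampling events, samples that
    removed their lineage). Rates are per lineage per unit time.
    """
    rng = random.Random(seed)
    t, alive, samples, removed = 0.0, 1, 0, 0
    total_rate = birth + death + sampling
    while 0 < alive < max_lineages:
        # Waiting time to the next event across all living lineages.
        t += rng.expovariate(alive * total_rate)
        if t >= t_max:
            break
        u = rng.random() * total_rate
        if u < birth:
            alive += 1            # a lineage bifurcates
        elif u < birth + death:
            alive -= 1            # a lineage dies unsampled
        else:
            samples += 1          # a lineage is sampled
            if rng.random() < removal_prob:
                removed += 1      # sampling ends the lineage
                alive -= 1
    return alive, samples, removed


# Hypothetical rates, chosen only for illustration.
print(simulate_sampling_process(1.0, 0.5, 0.3, removal_prob=0.0, t_max=3.0, seed=1))
print(simulate_sampling_process(1.0, 0.5, 0.3, removal_prob=1.0, t_max=3.0, seed=1))
```

With `removal_prob=0.0` no sampling event ever reduces the lineage count, whereas with `removal_prob=1.0` every sampling event does. The abstract's final result concerns the intermediate case, where this probability is an unknown parameter to be inferred alongside the three rates.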

{"title":"The Fossilized Birth-Death Model Is Identifiable.","authors":"Kate Truman, Timothy G Vaughan, Alex Gavryushkin, Alexandra Sasha Gavryushkina","doi":"10.1093/sysbio/syae058","DOIUrl":"10.1093/sysbio/syae058","url":null,"abstract":"<p><p>Time-dependent birth-death sampling models have been used in numerous studies to infer past evolutionary dynamics in different biological contexts, for example,  speciation and extinction rates in macroevolutionary studies, or effective reproductive number in epidemiological studies. These models are branching processes where lineages can bifurcate, die, or be sampled with time-dependent birth, death, and sampling rates, generating phylogenetic trees. It has been shown that in some subclasses of such models, different sets of rates can result in the same distributions of reconstructed phylogenetic trees, and therefore, the rates become unidentifiable from the trees regardless of their size. Here, we show that widely used time-dependent fossilized birth-death (FBD) models are identifiable. This subclass of models makes more realistic assumptions about the fossilization process and certain infectious disease transmission processes than the unidentifiable birth-death sampling models. Namely, FBD models assume that sampled lineages stay in the process rather than being immediately removed upon sampling. The identifiability of the time-dependent FBD model justifies using statistical methods that implement this model to infer the underlying temporal diversification or epidemiological dynamics from phylogenetic trees or directly from molecular or other comparative data. We further show that the time-dependent FBD model with an extra parameter, the removal after sampling probability, is unidentifiable. 
This implies that in scenarios where we do not know how sampling affects lineages, we are unable to infer this extra parameter together with birth, death, and sampling rates solely from trees.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"112-123"},"PeriodicalIF":5.7,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11997801/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142475252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Phylogenomics of Bivalvia Using Ultraconserved Elements Reveal New Topologies for Pteriomorphia and Imparidentia.
IF 6.1 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2025-02-10 DOI: 10.1093/sysbio/syae052
Yi-Xuan Li, Jack Chi-Ho Ip, Chong Chen, Ting Xu, Qian Zhang, Yanan Sun, Pei-Zhen Ma, Jian-Wen Qiu

Despite significant advances in phylogenetics over the past decades, the deep relationships within Bivalvia (phylum Mollusca) remain inconclusive. Previous efforts based on morphology or several genes have failed to resolve many key nodes in the phylogeny of Bivalvia. Advances have been made recently using transcriptome data, but the phylogenetic relationships within Bivalvia historically lacked consensus, especially within Pteriomorphia and Imparidentia. Here, we inferred the relationships of key lineages within Bivalvia using matrices generated from specifically designed ultraconserved elements (UCEs) with 16 available genomic resources and 85 newly sequenced specimens from 55 families. Our new probes (Bivalve UCE 2k v.1) for target sequencing captured an average of 849 UCEs with 1085 bp in mean length from in vitro experiments. Our results introduced novel schemes from 6 major clades (Protobranchina, Pteriomorphia, Palaeoheterodonta, Archiheterodonta, Anomalodesmata, and Imparidentia), though some inner nodes were poorly resolved, such as paraphyletic Heterodonta in some topologies potentially due to insufficient taxon sampling. The resolution increased when analyzing specific matrices for Pteriomorphia and Imparidentia. We recovered 3 Pteriomorphia topologies different from previously published trees, with the strongest support for ((Ostreida + (Arcida + Mytilida)) + (Pectinida + (Limida + Pectinida))). Limida were nested within Pectinida, warranting further studies. For Imparidentia, our results strongly supported the new hypothesis of (Galeommatida + (Adapedonta + Cardiida)), while the possible non-monophyly of Lucinida was inferred but poorly supported. Overall, our results provide important insights into the phylogeny of Bivalvia and show that target enrichment sequencing of UCEs can be broadly applied to study both deep and shallow phylogenetic relationships.

Citations: 0
Complex Models of Sequence Evolution Improve Fit, But Not Gene Tree Discordance, for Tetrapod Mitogenomes.
IF 6.1 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2025-02-10 DOI: 10.1093/sysbio/syae056
Benjamin S Toups, Robert C Thomson, Jeremy M Brown

Variation in gene tree estimates is widely observed in empirical phylogenomic data and is often assumed to be the result of biological processes. However, a recent study using tetrapod mitochondrial genomes to control for biological sources of variation due to their haploid, uniparentally inherited, and non-recombining nature found that levels of discordance among mitochondrial gene trees were comparable to those found in studies that assume only biological sources of variation. Additionally, they found that several of the models of sequence evolution chosen to infer gene trees were doing an inadequate job of fitting the sequence data. These results indicated that significant amounts of gene tree discordance in empirical data may be due to poor fit of sequence evolution models and that more complex and biologically realistic models may be needed. To test how the fit of sequence evolution models relates to gene tree discordance, we analyzed the same mitochondrial data sets as the previous study using 2 additional, more complex models of sequence evolution that each include a different biologically realistic aspect of the evolutionary process: A covarion model to incorporate site-specific rate variation across lineages (heterotachy), and a partitioned model to incorporate variable evolutionary patterns by codon position. Our results show that both additional models fit the data better than the models used in the previous study, with the covarion being consistently and strongly preferred as tree size increases. However, even these more preferred models still inferred highly discordant mitochondrial gene trees, thus deepening the mystery around what we label the "Mito-Phylo Paradox" and leading us to ask whether the observed variation could, in fact, be biological in nature after all.

Citations: 0
How to Validate a Bayesian Evolutionary Model.
IF 6.1 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2025-02-10 DOI: 10.1093/sysbio/syae064
Fábio K Mendes, Remco Bouckaert, Luiz M Carvalho, Alexei J Drummond

Biology has become a highly mathematical discipline in which probabilistic models play a central role. As a result, research in the biological sciences is now dependent on computational tools capable of carrying out complex analyses. These tools must be validated before they can be used, but what is understood as validation varies widely among methodological contributions. This may be a consequence of the still embryonic stage of the literature on statistical software validation for computational biology. Our manuscript aims to advance this literature. Here, we describe, illustrate, and introduce new good practices for assessing the correctness of a model implementation with an emphasis on Bayesian methods. We also introduce a suite of functionalities for automating validation protocols. It is our hope that the guidelines presented here help sharpen the focus of discussions on (as well as elevate) expected standards of statistical software for biology.

Citations: 0
Rapid Evolution of Host Repertoire and Geographic Range in a Young and Diverse Genus of Montane Butterflies.
IF 6.1 Zone 1 (Biology) Q1 EVOLUTIONARY BIOLOGY Pub Date: 2025-02-10 DOI: 10.1093/sysbio/syae061
Shifang Mo, Yaowei Zhu, Mariana P Braga, David J Lohman, Sören Nylin, Ashraf Moumou, Christopher W Wheat, Niklas Wahlberg, Min Wang, Fangzhou Ma, Peng Zhang, Houshuai Wang

Evolutionary changes in geographic distribution and larval host plants may promote the rapid diversification of montane insects, but this scenario has been rarely investigated. We studied the rapid radiation of the butterfly genus Colias, which has diversified in mountain ecosystems in Eurasia, Africa, and the Americas. Based on a data set of 150 nuclear protein-coding genetic loci and mitochondrial genomes, we constructed a time-calibrated phylogenetic tree of Colias species with broad taxon sampling. We then inferred their ancestral geographic ranges, historical diversification rates, and the evolution of host use. We found that the most recent common ancestor of Colias was likely geographically widespread and originated ~3.5 Ma. The group subsequently diversified in different regions across the world, often in tandem with geographic expansion events. No aspect of elevation was found to have a direct effect on diversification. The genus underwent a burst of diversification soon after the divergence of the Neotropical lineage, followed by an exponential decline in diversification rate toward the present. The ancestral host repertoire included the legume genera Astragalus and Trifolium but later expanded to include a wide range of Fabaceae genera and plants in more distantly related families, punctuated with periods of host range expansion and contraction. We suggest that the widespread distribution of the ancestor of all extant Colias lineages set the stage for diversification by isolation of populations that locally adapted to the various different environments they encountered, including different host plants. In this scenario, elevation is not the main driver but might have accelerated diversification by isolating populations.

Citations: 0