首页 > 最新文献

Systematic Biology最新文献

英文 中文
Alpine Extremophytes in Evolutionary Turmoil: Complex Diversification Patterns and Demographic Responses of a Halophilic Grass in a Central Asian Biodiversity Hotspot. 进化动荡中的高山极端植物:中亚生物多样性热点地区嗜卤禾本科植物的复杂多样性模式和人口响应。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syad073
Anna Wróbel, Ewelina Klichowska, Arkadiusz Nowak, Marcin Nobis

Diversification and demographic responses are key processes shaping species evolutionary history. Yet we still lack a full understanding of ecological mechanisms that shape genetic diversity at different spatial scales upon rapid environmental changes. In this study, we examined genetic differentiation in an extremophilic grass Puccinellia pamirica and factors affecting its population dynamics among the occupied hypersaline alpine wetlands on the arid Pamir Plateau in Central Asia. Using genomic data, we found evidence of fine-scale population structure and gene flow among the localities established across the high-elevation plateau as well as fingerprints of historical demographic expansion. We showed that an increase in the effective population size could coincide with the Last Glacial Period, which was followed by the species demographic decline during the Holocene. Geographic distance plays a vital role in shaping the spatial genetic structure of P. pamirica alongside with isolation-by-environment and habitat fragmentation. Our results highlight a complex history of divergence and gene flow in this species-poor alpine region during the Late Quaternary. We demonstrate that regional climate specificity and a shortage of nonclimate data largely impede predictions of future range changes of the alpine extremophile using ecological niche modeling. This study emphasizes the importance of fine-scale environmental heterogeneity for population dynamics and species distribution shifts.

多样性和人口反应是影响物种进化史的关键过程。然而,我们对环境快速变化时在不同空间尺度上形成遗传多样性的生态机制仍缺乏全面了解。在这项研究中,我们考察了中亚干旱的帕米尔高原上一种嗜极端水草 Puccinellia pamirica 的遗传分化以及影响其种群动态的因素。利用基因组数据,我们发现了在高海拔高原各地建立的精细种群结构和基因流动的证据,以及历史上人口扩张的痕迹。我们的研究表明,有效种群数量的增加可能与末次冰川期相吻合,而在全新世期间,种群数量随之减少。除了环境隔离和栖息地破碎化之外,地理距离对 P. pamirica 的空间遗传结构的形成也起着至关重要的作用。我们的研究结果突显了第四纪晚期这一物种贫乏的高山地区复杂的分化和基因流动历史。我们的研究结果表明,地区气候的特殊性和非气候数据的缺乏在很大程度上阻碍了利用生态位建模预测这种高山极端物种未来分布范围的变化。这项研究强调了细尺度环境异质性对种群动态和物种分布变化的重要性。
{"title":"Alpine Extremophytes in Evolutionary Turmoil: Complex Diversification Patterns and Demographic Responses of a Halophilic Grass in a Central Asian Biodiversity Hotspot.","authors":"Anna Wróbel, Ewelina Klichowska, Arkadiusz Nowak, Marcin Nobis","doi":"10.1093/sysbio/syad073","DOIUrl":"10.1093/sysbio/syad073","url":null,"abstract":"<p><p>Diversification and demographic responses are key processes shaping species evolutionary history. Yet we still lack a full understanding of ecological mechanisms that shape genetic diversity at different spatial scales upon rapid environmental changes. In this study, we examined genetic differentiation in an extremophilic grass Puccinellia pamirica and factors affecting its population dynamics among the occupied hypersaline alpine wetlands on the arid Pamir Plateau in Central Asia. Using genomic data, we found evidence of fine-scale population structure and gene flow among the localities established across the high-elevation plateau as well as fingerprints of historical demographic expansion. We showed that an increase in the effective population size could coincide with the Last Glacial Period, which was followed by the species demographic decline during the Holocene. Geographic distance plays a vital role in shaping the spatial genetic structure of P. pamirica alongside with isolation-by-environment and habitat fragmentation. Our results highlight a complex history of divergence and gene flow in this species-poor alpine region during the Late Quaternary. We demonstrate that regional climate specificity and a shortage of nonclimate data largely impede predictions of future range changes of the alpine extremophile using ecological niche modeling. This study emphasizes the importance of fine-scale environmental heterogeneity for population dynamics and species distribution shifts.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11282368/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139032576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Phylogenetic Analysis on Multi-Core Compute Architectures: Implementation and Evaluation of BEAGLE in RevBayes With MPI. 多核计算架构上的贝叶斯系统发育分析:使用 MPI 实现和评估 RevBayes 中的 BEAGLE。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syae005
Killian Smith, Daniel Ayres, René Neumaier, Gert Wörheide, Sebastian Höhna

Phylogenies are central to many research areas in biology and commonly estimated using likelihood-based methods. Unfortunately, any likelihood-based method, including Bayesian inference, can be restrictively slow for large datasets-with many taxa and/or many sites in the sequence alignment-or complex substitutions models. The primary limiting factor when using large datasets and/or complex models in probabilistic phylogenetic analyses is the likelihood calculation, which dominates the total computation time. To address this bottleneck, we incorporated the high-performance phylogenetic library BEAGLE into RevBayes, which enables multi-threading on multi-core CPUs and GPUs, as well as hardware specific vectorized instructions for faster likelihood calculations. Our new implementation of RevBayes+BEAGLE retains the flexibility and dynamic nature that users expect from vanilla RevBayes. In addition, we implemented native parallelization within RevBayes without an external library using the message passing interface (MPI); RevBayes+MPI. We evaluated our new implementation of RevBayes+BEAGLE using multi-threading on CPUs and 2 different powerful GPUs (NVidia Titan V and NVIDIA A100) against our native implementation of RevBayes+MPI. We found good improvements in speedup when multiple cores were used, with up to 20-fold speedup when using multiple CPU cores and over 90-fold speedup when using multiple GPU cores. The improvement depended on the data type used, DNA or amino acids, and the size of the alignment, but less on the size of the tree. We additionally investigated the cost of rescaling partial likelihoods to avoid numerical underflow and showed that unnecessarily frequent and inefficient rescaling can increase runtimes up to 4-fold. Finally, we presented and compared a new approach to store partial likelihoods on branches instead of nodes that can speed up computations up to 1.7 times but comes at twice the memory requirements.

系统发生是生物学许多研究领域的核心,通常使用基于似然法的方法进行估算。遗憾的是,任何基于似然法的方法,包括贝叶斯推断法,对于大型数据集--序列排列中有许多类群和/或许多位点--或复杂的替换模型来说,速度都会非常缓慢。在概率系统发育分析中使用大型数据集和/或复杂模型时,主要的限制因素是似然法计算,它在总计算时间中占主导地位。为了解决这个瓶颈问题,我们将高性能系统发育库 BEAGLE 纳入了 RevBayes,它可以在多核 CPU 和 GPU 上实现多线程,并提供硬件特定的矢量化指令,以加快似然计算速度。我们新的 RevBayes+BEAGLE 实现保留了用户期望从 vanilla RevBayes 中获得的灵活性和动态性。此外,我们还使用消息传递接口(MPI)在 RevBayes 中实现了本地并行化,而无需使用外部库;即 RevBayes+MPI。我们在 CPU 和两种不同的强大 GPU(NVidia Titan V 和 NVIDIA A100)上使用多线程对 RevBayes+BEAGLE 的新实现与 RevBayes+MPI 的本机实现进行了评估。我们发现,在使用多核的情况下,速度提高了很多,使用多 CPU 核时速度提高了 20 倍,使用多 GPU 核时速度提高了 90 多倍。速度的提高取决于所使用的数据类型(DNA 或氨基酸)和排列的大小,但与树的大小关系不大。此外,我们还研究了为避免数值下溢而重新调整部分似然的成本,结果表明,不必要的频繁、低效的重新调整会使运行时间增加多达 4 倍。最后,我们介绍并比较了一种将部分似然存储在分支而非节点上的新方法,这种方法可将计算速度提高 1.7 倍,但内存需求却是原来的两倍。
{"title":"Bayesian Phylogenetic Analysis on Multi-Core Compute Architectures: Implementation and Evaluation of BEAGLE in RevBayes With MPI.","authors":"Killian Smith, Daniel Ayres, René Neumaier, Gert Wörheide, Sebastian Höhna","doi":"10.1093/sysbio/syae005","DOIUrl":"10.1093/sysbio/syae005","url":null,"abstract":"<p><p>Phylogenies are central to many research areas in biology and commonly estimated using likelihood-based methods. Unfortunately, any likelihood-based method, including Bayesian inference, can be restrictively slow for large datasets-with many taxa and/or many sites in the sequence alignment-or complex substitutions models. The primary limiting factor when using large datasets and/or complex models in probabilistic phylogenetic analyses is the likelihood calculation, which dominates the total computation time. To address this bottleneck, we incorporated the high-performance phylogenetic library BEAGLE into RevBayes, which enables multi-threading on multi-core CPUs and GPUs, as well as hardware specific vectorized instructions for faster likelihood calculations. Our new implementation of RevBayes+BEAGLE retains the flexibility and dynamic nature that users expect from vanilla RevBayes. In addition, we implemented native parallelization within RevBayes without an external library using the message passing interface (MPI); RevBayes+MPI. We evaluated our new implementation of RevBayes+BEAGLE using multi-threading on CPUs and 2 different powerful GPUs (NVidia Titan V and NVIDIA A100) against our native implementation of RevBayes+MPI. We found good improvements in speedup when multiple cores were used, with up to 20-fold speedup when using multiple CPU cores and over 90-fold speedup when using multiple GPU cores. The improvement depended on the data type used, DNA or amino acids, and the size of the alignment, but less on the size of the tree. We additionally investigated the cost of rescaling partial likelihoods to avoid numerical underflow and showed that unnecessarily frequent and inefficient rescaling can increase runtimes up to 4-fold. Finally, we presented and compared a new approach to store partial likelihoods on branches instead of nodes that can speed up computations up to 1.7 times but comes at twice the memory requirements.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139571417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convergent Adaptation of True Crabs (Decapoda: Brachyura) to a Gradient of Terrestrial Environments. 真正的螃蟹(十足目:Brachyuna)对陆地环境梯度的趋同适应。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syad066
Joanna M Wolfe, Lauren Ballou, Javier Luque, Victoria M Watson-Zink, Shane T Ahyong, Joëlle Barido-Sottani, Tin-Yam Chan, Ka Hou Chu, Keith A Crandall, Savel R Daniels, Darryl L Felder, Harrison Mancke, Joel W Martin, Peter K L Ng, Javier Ortega-Hernández, Emma Palacios Theil, N Dean Pentcheff, Rafael Robles, Brent P Thoma, Ling Ming Tsang, Regina Wetzer, Amanda M Windsor, Heather D Bracken-Grissom

For much of terrestrial biodiversity, the evolutionary pathways of adaptation from marine ancestors are poorly understood and have usually been viewed as a binary trait. True crabs, the decapod crustacean infraorder Brachyura, comprise over 7600 species representing a striking diversity of morphology and ecology, including repeated adaptation to non-marine habitats. Here, we reconstruct the evolutionary history of Brachyura using new and published sequences of 10 genes for 344 tips spanning 88 of 109 brachyuran families. Using 36 newly vetted fossil calibrations, we infer that brachyurans most likely diverged in the Triassic, with family-level splits in the late Cretaceous and early Paleogene. By contrast, the root age is underestimated with automated sampling of 328 fossil occurrences explicitly incorporated into the tree prior, suggesting such models are a poor fit under heterogeneous fossil preservation. We apply recently defined trait-by-environment associations to classify a gradient of transitions from marine to terrestrial lifestyles. We estimate that crabs left the marine environment at least 7 and up to 17 times convergently, and returned to the sea from non-marine environments at least twice. Although the most highly terrestrial- and many freshwater-adapted crabs are concentrated in Thoracotremata, Bayesian threshold models of ancestral state reconstruction fail to identify shifts to higher terrestrial grades due to the degree of underlying change required. Lineages throughout our tree inhabit intertidal and marginal marine environments, corroborating the inference that the early stages of terrestrial adaptation have a lower threshold to evolve. Our framework and extensive new fossil and natural history datasets will enable future comparisons of non-marine adaptation at the morphological and molecular level. Crabs provide an important window into the early processes of adaptation to novel environments, and different degrees of evolutionary constraint that might help predict these pathways. [Brachyura; convergent evolution; crustaceans; divergence times; fossil calibration; molecular phylogeny; terrestrialization; threshold model.].

对于大多数陆地生物多样性来说,海洋祖先的适应进化途径知之甚少,通常被视为一种二元特征。真正的螃蟹,即十足目甲壳类动物Brachyuna,包括7600多个物种,代表了惊人的形态和生态多样性,包括对非海洋栖息地的反复适应。在这里,我们使用新的和已发表的10个基因的序列重建了Brachyuna的进化史,这些基因来自109个brachyuran家族中的88个家族的344个尖端。使用36个新审查的化石校准,我们推断短吻龙最有可能在三叠纪分化,在白垩纪晚期和古近纪早期出现家族级分裂。相比之下,由于对328个化石的自动采样明确纳入了树木先验,因此低估了根系年龄,这表明这种模型在异质化石保存下不太适合。我们应用最近定义的环境特征关联来分类从海洋到陆地生活方式的转变梯度。我们估计,螃蟹至少有7次和17次会聚地离开海洋环境,至少有两次从非海洋环境返回大海。尽管最高度适应陆地和许多淡水的螃蟹都集中在胸肌,但由于所需的潜在变化程度,祖先状态重建的贝叶斯阈值模型无法识别向更高陆地等级的转变。我们整个树木的谱系都栖息在潮间带和边缘海洋环境中,这证实了陆地适应的早期阶段进化门槛较低的推断。我们的框架和广泛的新化石和自然史数据集将使未来能够在形态学和分子水平上对非海洋适应进行比较。螃蟹为了解适应新环境的早期过程以及可能有助于预测这些途径的不同程度的进化约束提供了一个重要的窗口。
{"title":"Convergent Adaptation of True Crabs (Decapoda: Brachyura) to a Gradient of Terrestrial Environments.","authors":"Joanna M Wolfe, Lauren Ballou, Javier Luque, Victoria M Watson-Zink, Shane T Ahyong, Joëlle Barido-Sottani, Tin-Yam Chan, Ka Hou Chu, Keith A Crandall, Savel R Daniels, Darryl L Felder, Harrison Mancke, Joel W Martin, Peter K L Ng, Javier Ortega-Hernández, Emma Palacios Theil, N Dean Pentcheff, Rafael Robles, Brent P Thoma, Ling Ming Tsang, Regina Wetzer, Amanda M Windsor, Heather D Bracken-Grissom","doi":"10.1093/sysbio/syad066","DOIUrl":"10.1093/sysbio/syad066","url":null,"abstract":"<p><p>For much of terrestrial biodiversity, the evolutionary pathways of adaptation from marine ancestors are poorly understood and have usually been viewed as a binary trait. True crabs, the decapod crustacean infraorder Brachyura, comprise over 7600 species representing a striking diversity of morphology and ecology, including repeated adaptation to non-marine habitats. Here, we reconstruct the evolutionary history of Brachyura using new and published sequences of 10 genes for 344 tips spanning 88 of 109 brachyuran families. Using 36 newly vetted fossil calibrations, we infer that brachyurans most likely diverged in the Triassic, with family-level splits in the late Cretaceous and early Paleogene. By contrast, the root age is underestimated with automated sampling of 328 fossil occurrences explicitly incorporated into the tree prior, suggesting such models are a poor fit under heterogeneous fossil preservation. We apply recently defined trait-by-environment associations to classify a gradient of transitions from marine to terrestrial lifestyles. We estimate that crabs left the marine environment at least 7 and up to 17 times convergently, and returned to the sea from non-marine environments at least twice. Although the most highly terrestrial- and many freshwater-adapted crabs are concentrated in Thoracotremata, Bayesian threshold models of ancestral state reconstruction fail to identify shifts to higher terrestrial grades due to the degree of underlying change required. Lineages throughout our tree inhabit intertidal and marginal marine environments, corroborating the inference that the early stages of terrestrial adaptation have a lower threshold to evolve. Our framework and extensive new fossil and natural history datasets will enable future comparisons of non-marine adaptation at the morphological and molecular level. Crabs provide an important window into the early processes of adaptation to novel environments, and different degrees of evolutionary constraint that might help predict these pathways. [Brachyura; convergent evolution; crustaceans; divergence times; fossil calibration; molecular phylogeny; terrestrialization; threshold model.].</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11282366/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71522557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artifactual Orthologs and the Need for Diligent Data Exploration in Complex Phylogenomic Datasets: A Museomic Case Study from the Andean Flora. 在复杂的系统发生组数据集中伪造直系同源物和勤奋数据探索的必要性:来自安第斯植物区系的博物学案例研究。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syad076
Laura A Frost, Ana M Bedoya, Laura P Lagomarsino

The Andes mountains of western South America are a globally important biodiversity hotspot, yet there is a paucity of resolved phylogenies for plant clades from this region. Filling an important gap in our understanding of the World's richest flora, we present the first phylogeny of Freziera (Pentaphylacaceae), an Andean-centered, cloud forest radiation. Our dataset was obtained via hybrid-enriched target sequence capture of Angiosperms353 universal loci for 50 of the ca. 75 spp., obtained almost entirely from herbarium specimens. We identify high phylogenomic complexity in Freziera, including the presence of data artifacts. Via by-eye observation of gene trees, detailed examination of warnings from recently improved assembly pipelines, and gene tree filtering, we identified that artifactual orthologs (i.e., the presence of only one copy of a multicopy gene due to differential assembly) were an important source of gene tree heterogeneity that had a negative impact on phylogenetic inference and support. These artifactual orthologs may be common in plant phylogenomic datasets, where multiple instances of genome duplication are common. After accounting for artifactual orthologs as source of gene tree error, we identified a significant, but nonspecific signal of introgression using Patterson's D and f4 statistics. Despite phylogenomic complexity, we were able to resolve Freziera into 9 well-supported subclades whose evolution has been shaped by multiple evolutionary processes, including incomplete lineage sorting, historical gene flow, and gene duplication. Our results highlight the complexities of plant phylogenomics, which are heightened in Andean radiations, and show the impact of filtering data processing artifacts and standard filtering approaches on phylogenetic inference.

南美洲西部的安第斯山脉是全球重要的生物多样性热点地区,但该地区植物支系的系统发生却很少。我们首次提出了以安第斯山脉为中心的云林辐射植物--Freziera(五枫香科)的系统发生,填补了我们对世界上最丰富植物区系了解的一个重要空白。我们的数据集是通过rid-enriched target sequence capture of Angiosperms获得的。这些数据几乎全部来自标本馆标本。我们在 Freziera 中发现了高度的系统发生复杂性,包括数据伪造的存在。通过亲眼观察基因树、详细检查最近改进的组装管道发出的警告以及基因树过滤,我们发现伪造的直系同源物(即由于差异组装导致多拷贝基因只有一个拷贝)是基因树异质性的一个重要来源,对系统发生推断和支持有负面影响。在植物系统发生组数据集中,这些人为的直向同源物可能很常见,因为在植物系统发生组数据集中,多个基因组重复的情况很普遍。在考虑了作为基因树误差来源的伪造直系同源物之后,我们利用 Patterson's D 和 f4 统计发现了一个显著但非特异性的引种信号。尽管系统发生组十分复杂,但我们仍能将 Freziera 分解为 9 个支持度较高的亚支系,其进化受多种进化过程的影响,包括不完全的世系分类、历史基因流和基因复制。我们的研究结果凸显了植物系统发生组学的复杂性,而安第斯地区的辐射则使这种复杂性更加突出,同时也显示了过滤数据处理人工痕迹和标准过滤方法对系统发生推断的影响。
{"title":"Artifactual Orthologs and the Need for Diligent Data Exploration in Complex Phylogenomic Datasets: A Museomic Case Study from the Andean Flora.","authors":"Laura A Frost, Ana M Bedoya, Laura P Lagomarsino","doi":"10.1093/sysbio/syad076","DOIUrl":"10.1093/sysbio/syad076","url":null,"abstract":"<p><p>The Andes mountains of western South America are a globally important biodiversity hotspot, yet there is a paucity of resolved phylogenies for plant clades from this region. Filling an important gap in our understanding of the World's richest flora, we present the first phylogeny of Freziera (Pentaphylacaceae), an Andean-centered, cloud forest radiation. Our dataset was obtained via hybrid-enriched target sequence capture of Angiosperms353 universal loci for 50 of the ca. 75 spp., obtained almost entirely from herbarium specimens. We identify high phylogenomic complexity in Freziera, including the presence of data artifacts. Via by-eye observation of gene trees, detailed examination of warnings from recently improved assembly pipelines, and gene tree filtering, we identified that artifactual orthologs (i.e., the presence of only one copy of a multicopy gene due to differential assembly) were an important source of gene tree heterogeneity that had a negative impact on phylogenetic inference and support. These artifactual orthologs may be common in plant phylogenomic datasets, where multiple instances of genome duplication are common. After accounting for artifactual orthologs as source of gene tree error, we identified a significant, but nonspecific signal of introgression using Patterson's D and f4 statistics. Despite phylogenomic complexity, we were able to resolve Freziera into 9 well-supported subclades whose evolution has been shaped by multiple evolutionary processes, including incomplete lineage sorting, historical gene flow, and gene duplication. Our results highlight the complexities of plant phylogenomics, which are heightened in Andean radiations, and show the impact of filtering data processing artifacts and standard filtering approaches on phylogenetic inference.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139088586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DateLife: Leveraging Databases and Analytical Tools to Reveal the Dated Tree of Life. DateLife:利用数据库和分析工具揭示年代久远的生命之树。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syae015
Luna L Sánchez Reyes, Emily Jane McTavish, Brian O'Meara

Chronograms-phylogenies with branch lengths proportional to time-represent key data on timing of evolutionary events, allowing us to study natural processes in many areas of biological research. Chronograms also provide valuable information that can be used for education, science communication, and conservation policy decisions. Yet, achieving a high-quality reconstruction of a chronogram is a difficult and resource-consuming task. Here we present DateLife, a phylogenetic software implemented as an R package and an R Shiny web application available at www.datelife.org, that provides services for efficient and easy discovery, summary, reuse, and reanalysis of node age data mined from a curated database of expert, peer-reviewed, and openly available chronograms. The main DateLife workflow starts with one or more scientific taxon names provided by a user. Names are processed and standardized to a unified taxonomy, allowing DateLife to run a name match across its local chronogram database that is curated from Open Tree of Life's phylogenetic repository, and extract all chronograms that contain at least two queried taxon names, along with their metadata. Finally, node ages from matching chronograms are mapped using the congruification algorithm to corresponding nodes on a tree topology, either extracted from Open Tree of Life's synthetic phylogeny or one provided by the user. Congruified node ages are used as secondary calibrations to date the chosen topology, with or without initial branch lengths, using different phylogenetic dating methods such as BLADJ, treePL, PATHd8, and MrBayes. We performed a cross-validation test to compare node ages resulting from a DateLife analysis (i.e, phylogenetic dating using secondary calibrations) to those from the original chronograms (i.e, obtained with primary calibrations), and found that DateLife's node age estimates are consistent with the age estimates from the original chronograms, with the largest variation in ages occurring around topologically deeper nodes. Because the results from any software for scientific analysis can only be as good as the data used as input, we highlight the importance of considering the results of a DateLife analysis in the context of the input chronograms. DateLife can help to increase awareness of the existing disparities among alternative hypotheses of dates for the same diversification events, and to support exploration of the effect of alternative chronogram hypotheses on downstream analyses, providing a framework for a more informed interpretation of evolutionary results.

年表--分支长度与时间成正比的系统发生图--是生物研究许多领域中研究自然过程的进化事件发生时间的关键数据。年表还提供了宝贵的信息,可用于教育、科学交流和保护政策决策。然而,实现高质量的年表重建是一项困难且耗费资源的任务。DateLife 是一个系统发育软件,以 R 软件包和 R Shiny 网络应用程序的形式实现,可在 www .datelife.org 网站上下载。DateLife 可为高效、轻松地发现、汇总、重用和重新分析节点年龄数据提供服务,这些节点年龄数据是从专家、同行评审和公开的年表数据库中挖掘出来的。DateLife 的主要工作流程始于用户提供的一个或多个科学类群名称。名称经过处理并标准化为统一的分类标准后,DateLife 就可以在其本地年表数据库中进行名称匹配,该数据库由开放生命树的系统发育资料库整理而成,并提取所有包含至少两个被查询分类群名称的年表及其元数据。最后,使用一致性算法将匹配年表中的节点年龄映射到树状拓扑上的相应节点上,树状拓扑可以从开放生命树的合成系统发生库中提取,也可以由用户提供。同化后的节点年龄将作为辅助校准,使用不同的系统发育年代测定方法(如 BLADJ、treePL、PATHd8 和 MrBayes)对所选拓扑进行年代测定,无论是否有初始分支长度。我们进行了交叉验证测试,将 DateLife 分析得出的节点年龄(即使用二级定标进行系统发育定年)与原始年代图得出的节点年龄(即使用一级定标得出的节点年龄)进行比较,结果发现 DateLife 的节点年龄估计值与原始年代图的年龄估计值一致,最大的年龄差异出现在拓扑较深的节点周围。由于任何科学分析软件的结果只能与作为输入的数据一样好,我们强调了在考虑 DateLife 分析结果时输入年表的重要性。DateLife 可以帮助人们更好地认识到同一多样化事件的其他日期假说之间存在的差异,并支持探索其他年表假说对下游分析的影响,为更明智地解释进化结果提供了一个框架。
{"title":"DateLife: Leveraging Databases and Analytical Tools to Reveal the Dated Tree of Life.","authors":"Luna L Sánchez Reyes, Emily Jane McTavish, Brian O'Meara","doi":"10.1093/sysbio/syae015","DOIUrl":"10.1093/sysbio/syae015","url":null,"abstract":"<p><p>Chronograms-phylogenies with branch lengths proportional to time-represent key data on timing of evolutionary events, allowing us to study natural processes in many areas of biological research. Chronograms also provide valuable information that can be used for education, science communication, and conservation policy decisions. Yet, achieving a high-quality reconstruction of a chronogram is a difficult and resource-consuming task. Here we present DateLife, a phylogenetic software implemented as an R package and an R Shiny web application available at www.datelife.org, that provides services for efficient and easy discovery, summary, reuse, and reanalysis of node age data mined from a curated database of expert, peer-reviewed, and openly available chronograms. The main DateLife workflow starts with one or more scientific taxon names provided by a user. Names are processed and standardized to a unified taxonomy, allowing DateLife to run a name match across its local chronogram database that is curated from Open Tree of Life's phylogenetic repository, and extract all chronograms that contain at least two queried taxon names, along with their metadata. Finally, node ages from matching chronograms are mapped using the congruification algorithm to corresponding nodes on a tree topology, either extracted from Open Tree of Life's synthetic phylogeny or one provided by the user. Congruified node ages are used as secondary calibrations to date the chosen topology, with or without initial branch lengths, using different phylogenetic dating methods such as BLADJ, treePL, PATHd8, and MrBayes. We performed a cross-validation test to compare node ages resulting from a DateLife analysis (i.e, phylogenetic dating using secondary calibrations) to those from the original chronograms (i.e, obtained with primary calibrations), and found that DateLife's node age estimates are consistent with the age estimates from the original chronograms, with the largest variation in ages occurring around topologically deeper nodes. Because the results from any software for scientific analysis can only be as good as the data used as input, we highlight the importance of considering the results of a DateLife analysis in the context of the input chronograms. DateLife can help to increase awareness of the existing disparities among alternative hypotheses of dates for the same diversification events, and to support exploration of the effect of alternative chronogram hypotheses on downstream analyses, providing a framework for a more informed interpretation of evolutionary results.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11282365/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140176511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tropical Origin, Global Diversification, and Dispersal in the Pond Damselflies (Coenagrionoidea) Revealed by a New Molecular Phylogeny. 新的分子系统发育揭示了池塘豆娘(Coenagrionoidea)的热带起源、全球多样性和传播。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syae004
Beatriz Willink, Jessica L Ware, Erik I Svensson

The processes responsible for the formation of Earth's most conspicuous diversity pattern, the latitudinal diversity gradient (LDG), remain unexplored for many clades in the Tree of Life. Here, we present a densely sampled and dated molecular phylogeny for the most speciose clade of damselflies worldwide (Odonata: Coenagrionoidea) and investigate the role of time, macroevolutionary processes, and biome-shift dynamics in shaping the LDG in this ancient insect superfamily. We used process-based biogeographic models to jointly infer ancestral ranges and speciation times and to characterize within-biome dispersal and biome-shift dynamics across the cosmopolitan distribution of Coenagrionoidea. We also investigated temporal and biome-dependent variation in diversification rates. Our results uncover a tropical origin of pond damselflies and featherlegs ~105 Ma, while highlighting the uncertainty of ancestral ranges within the tropics in deep time. Even though diversification rates have declined since the origin of this clade, global climate change and biome-shifts have slowly increased diversity in warm- and cold-temperate areas, where lineage turnover rates have been relatively higher. This study underscores the importance of biogeographic origin and time to diversify as important drivers of the LDG in pond damselflies and their relatives, while diversification dynamics have instead resulted in the formation of ephemeral species in temperate regions. Biome-shifts, although limited by tropical niche conservatism, have been the main factor reducing the steepness of the LDG in the last 30 Myr. With ongoing climate change and increasing northward range expansions of many damselfly taxa, the LDG may become less pronounced. Our results support recent calls to unify biogeographic and macroevolutionary approaches to improve our understanding of how latitudinal diversity gradients are formed and why they vary across time and among taxa.

对于生命之树上的许多支系来说,地球上最明显的多样性模式--纬度多样性梯度(LDG)的形成过程仍有待探索。在本文中,我们展示了一个取样密集且年代久远的分子系统发育,该系统发育针对的是世界上种类最多的豆娘科(Odonata: Coenagrionoidea),并研究了时间、宏观进化过程和生物迁移动力学在形成这一古老昆虫超科的纬度多样性梯度中的作用。我们利用基于过程的生物地理学模型来共同推断祖先的分布范围和物种形成时间,并描述 Coenagrionoidea 在世界性分布中的生物群内扩散和生物群转移动态。我们还研究了多样化率随时间和生物群的变化。我们的研究结果揭示了池袋豆娘和羽腿豆娘在距今约 105 Ma 时起源于热带,同时强调了热带祖先分布范围在深部时间的不确定性。尽管自该支系起源以来其分化率有所下降,但全球气候变化和生物迁移缓慢地增加了暖温带和寒温带地区的多样性,这些地区的种系更替率相对较高。这项研究强调了生物地理起源和分化时间的重要性,它们是池塘豆娘及其近缘种的 LDG 的重要驱动因素,而分化动态反而导致了温带地区短暂物种的形成。尽管生物迁移受到热带生态位保守性的限制,但在过去的30 Myr中,生物迁移是降低LDG陡度的主要因素。随着气候变化的持续和许多豆娘类群向北扩展,LDG可能会变得不那么明显。我们的研究结果支持了最近的呼吁,即统一生物地理学和宏观进化方法,以加深我们对纬度多样性梯度如何形成及其为何在不同时期和不同类群之间变化的理解。
{"title":"Tropical Origin, Global Diversification, and Dispersal in the Pond Damselflies (Coenagrionoidea) Revealed by a New Molecular Phylogeny.","authors":"Beatriz Willink, Jessica L Ware, Erik I Svensson","doi":"10.1093/sysbio/syae004","DOIUrl":"10.1093/sysbio/syae004","url":null,"abstract":"<p><p>The processes responsible for the formation of Earth's most conspicuous diversity pattern, the latitudinal diversity gradient (LDG), remain unexplored for many clades in the Tree of Life. Here, we present a densely sampled and dated molecular phylogeny for the most speciose clade of damselflies worldwide (Odonata: Coenagrionoidea) and investigate the role of time, macroevolutionary processes, and biome-shift dynamics in shaping the LDG in this ancient insect superfamily. We used process-based biogeographic models to jointly infer ancestral ranges and speciation times and to characterize within-biome dispersal and biome-shift dynamics across the cosmopolitan distribution of Coenagrionoidea. We also investigated temporal and biome-dependent variation in diversification rates. Our results uncover a tropical origin of pond damselflies and featherlegs ~105 Ma, while highlighting the uncertainty of ancestral ranges within the tropics in deep time. Even though diversification rates have declined since the origin of this clade, global climate change and biome-shifts have slowly increased diversity in warm- and cold-temperate areas, where lineage turnover rates have been relatively higher. This study underscores the importance of biogeographic origin and time to diversify as important drivers of the LDG in pond damselflies and their relatives, while diversification dynamics have instead resulted in the formation of ephemeral species in temperate regions. Biome-shifts, although limited by tropical niche conservatism, have been the main factor reducing the steepness of the LDG in the last 30 Myr. With ongoing climate change and increasing northward range expansions of many damselfly taxa, the LDG may become less pronounced. Our results support recent calls to unify biogeographic and macroevolutionary approaches to improve our understanding of how latitudinal diversity gradients are formed and why they vary across time and among taxa.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11282367/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139542962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Evolution of Multiple Color Mechanisms Is Correlated with Diversification in Sunbirds (Nectariniidae). 多种色彩机制的进化与太阳鸟(Nectariniidae)的多样化有关。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syae006
Michaël P J Nicolaï, Bert Van Hecke, Svana Rogalla, Gerben Debruyn, Rauri C K Bowie, Nicholas J Matzke, Shannon J Hackett, Liliana D'Alba, Matthew D Shawkey

How and why certain groups become speciose is a key question in evolutionary biology. Novel traits that enable diversification by opening new ecological niches are likely important mechanisms. However, ornamental traits can also promote diversification by opening up novel sensory niches and thereby creating novel inter-specific interactions. More specifically, ornamental colors may enable more precise and/or easier species recognition and may act as key innovations by increasing the number of species-specific patterns and promoting diversification. While the influence of coloration on diversification is well-studied, the influence of the mechanisms that produce those colors (e.g., pigmentary, nanostructural) is less so, even though the ontogeny and evolution of these mechanisms differ. We estimated a new phylogenetic tree for 121 sunbird species and combined color data of 106 species with a range of phylogenetic tools to test the hypothesis that the evolution of novel color mechanisms increases diversification in sunbirds, one of the most colorful bird clades. Results suggest that: (1) the evolution of novel color mechanisms expands the visual sensory niche, increasing the number of achievable colors, (2) structural coloration diverges more readily across the body than pigment-based coloration, enabling an increase in color complexity, (3) novel color mechanisms might minimize trade-offs between natural and sexual selection such that color can function both as camouflage and conspicuous signal, and (4) despite structural colors being more colorful and mobile, only melanin-based coloration is positively correlated with net diversification. Together, these findings explain why color distances increase with an increasing number of sympatric species, even though packing of color space predicts otherwise.

某些群体如何以及为什么会成为物种是生物进化中的一个关键问题。通过开辟新的生态位来实现多样化的新性状可能是重要的机制。然而,装饰性特征也可以通过开辟新的感官龛位,从而产生新的种间相互作用来促进物种多样化。更具体地说,观赏色彩可以使物种识别更准确和/或更容易,并通过增加物种特异性模式的数量和促进物种多样化而成为关键的创新。虽然对色彩对物种多样化的影响研究得很多,但对产生这些色彩的机制(如色素、纳米结构)的影响研究得较少,尽管这些机制的本体和进化过程各不相同。我们为121个太阳鸟物种估算了一棵新的系统发生树,并将106个物种的颜色数据与一系列系统发生工具结合起来,以验证一个假设,即新颜色机制的进化增加了太阳鸟的多样化,而太阳鸟是最多彩的鸟类支系之一。结果表明:(1) 新型色彩机制的进化扩大了视觉感官范围,增加了可实现的色彩数量。(2)结构色彩比基于色素的色彩更容易在整个身体上分化,从而增加了色彩的复杂性。(3)新的色彩机制可能会最大限度地减少自然选择和性选择之间的权衡,从而使色彩既能起到伪装的作用,又能起到显眼信号的作用。(4) 尽管结构色的色彩更丰富、流动性更强,但只有基于黑色素的色彩与净多样化呈正相关。这些发现共同解释了为什么颜色距离会随着同域物种数量的增加而增加,尽管颜色空间的包装会预测出相反的结果。
{"title":"The Evolution of Multiple Color Mechanisms Is Correlated with Diversification in Sunbirds (Nectariniidae).","authors":"Michaël P J Nicolaï, Bert Van Hecke, Svana Rogalla, Gerben Debruyn, Rauri C K Bowie, Nicholas J Matzke, Shannon J Hackett, Liliana D'Alba, Matthew D Shawkey","doi":"10.1093/sysbio/syae006","DOIUrl":"10.1093/sysbio/syae006","url":null,"abstract":"<p><p>How and why certain groups become speciose is a key question in evolutionary biology. Novel traits that enable diversification by opening new ecological niches are likely important mechanisms. However, ornamental traits can also promote diversification by opening up novel sensory niches and thereby creating novel inter-specific interactions. More specifically, ornamental colors may enable more precise and/or easier species recognition and may act as key innovations by increasing the number of species-specific patterns and promoting diversification. While the influence of coloration on diversification is well-studied, the influence of the mechanisms that produce those colors (e.g., pigmentary, nanostructural) is less so, even though the ontogeny and evolution of these mechanisms differ. We estimated a new phylogenetic tree for 121 sunbird species and combined color data of 106 species with a range of phylogenetic tools to test the hypothesis that the evolution of novel color mechanisms increases diversification in sunbirds, one of the most colorful bird clades. Results suggest that: (1) the evolution of novel color mechanisms expands the visual sensory niche, increasing the number of achievable colors, (2) structural coloration diverges more readily across the body than pigment-based coloration, enabling an increase in color complexity, (3) novel color mechanisms might minimize trade-offs between natural and sexual selection such that color can function both as camouflage and conspicuous signal, and (4) despite structural colors being more colorful and mobile, only melanin-based coloration is positively correlated with net diversification. Together, these findings explain why color distances increase with an increasing number of sympatric species, even though packing of color space predicts otherwise.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139642990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ecological Predictors of Organelle Genome Evolution: Phylogenetic Correlations with Taxonomically Broad, Sparse, Unsystematized Data. 细胞器基因组进化的生态学预测因素:与分类广泛、稀疏、非系统化数据的系统发育相关性。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syae009
Konstantinos Giannakis, Luke Richards, Iain G Johnston

Comparative analysis of variables across phylogenetically linked observations can reveal mechanisms and insights in evolutionary biology. As the taxonomic breadth of the sample of interest increases, challenges of data sparsity, poor phylogenetic resolution, and complicated evolutionary dynamics emerge. Here, we investigate a cross-eukaryotic question where all these problems exist: which organismal ecology features are correlated with gene retention in mitochondrial and chloroplast DNA (organelle DNA or oDNA). Through a wide palette of synthetic control studies, we first characterize the specificity and sensitivity of a collection of parametric and non-parametric phylogenetic comparative approaches to identify relationships in the face of such sparse and awkward datasets. This analysis is not directly focused on oDNA, and so provides generalizable insights into comparative approaches with challenging data. We then combine and curate ecological data coupled to oDNA genome information across eukaryotes, including a new semi-automated approach for gathering data on organismal traits from less systematized open-access resources including encyclopedia articles on species and taxa. The curation process also involved resolving several issues with existing datasets, including enforcing the clade-specificity of several ecological features and fixing incorrect annotations. Combining this unique dataset with our benchmarked comparative approaches, we confirm support for several known links between organismal ecology and organelle gene retention, identify several previously unidentified relationships constituting possible ecological contributors to oDNA genome evolution, and provide support for a recently hypothesized link between environmental demand and oDNA retention. We, with caution, discuss the implications of these findings for organelle evolution and of this pipeline for broad comparative analyses in other fields.

对具有系统发育联系的观测数据中的变量进行比较分析,可以揭示生物进化的机制和规律。随着感兴趣样本的分类广度增加,数据稀少、系统发育分辨率低和复杂的进化动态等挑战也随之出现。在这里,我们研究了一个存在所有这些问题的跨真核生物问题:哪些生物生态学特征与线粒体和叶绿体 DNA(细胞器 DNA 或 oDNA)中的基因保留相关。通过广泛的合成对照研究,我们首先确定了一系列参数和非参数系统发育比较方法的特异性和灵敏度,以便在面对如此稀少和笨拙的数据集时识别各种关系。这项分析并不直接针对 oDNA,因此能为具有挑战性数据的比较方法提供具有普遍意义的见解。然后,我们将生态学数据与真核生物的 oDNA 基因组信息结合起来并进行整理,包括采用一种新的半自动化方法,从系统化程度较低的开放获取资源(包括物种和类群的百科全书文章)中收集有关生物特征的数据。整理过程还涉及解决现有数据集的几个问题,包括强制执行几个生态特征的支系特异性和修正不正确的注释。通过将这一独特的数据集与我们的基准比较方法相结合,我们证实了生物生态学与细胞器基因保留之间的几种已知联系,确定了构成 oDNA 基因组进化的可能生态因素的几种以前未确定的关系,并为最近假设的环境需求与 oDNA 保留之间的联系提供了支持。我们谨慎地讨论了这些发现对细胞器进化的影响,以及这一管道对其他领域广泛比较分析的影响。
{"title":"Ecological Predictors of Organelle Genome Evolution: Phylogenetic Correlations with Taxonomically Broad, Sparse, Unsystematized Data.","authors":"Konstantinos Giannakis, Luke Richards, Iain G Johnston","doi":"10.1093/sysbio/syae009","DOIUrl":"10.1093/sysbio/syae009","url":null,"abstract":"<p><p>Comparative analysis of variables across phylogenetically linked observations can reveal mechanisms and insights in evolutionary biology. As the taxonomic breadth of the sample of interest increases, challenges of data sparsity, poor phylogenetic resolution, and complicated evolutionary dynamics emerge. Here, we investigate a cross-eukaryotic question where all these problems exist: which organismal ecology features are correlated with gene retention in mitochondrial and chloroplast DNA (organelle DNA or oDNA). Through a wide palette of synthetic control studies, we first characterize the specificity and sensitivity of a collection of parametric and non-parametric phylogenetic comparative approaches to identify relationships in the face of such sparse and awkward datasets. This analysis is not directly focused on oDNA, and so provides generalizable insights into comparative approaches with challenging data. We then combine and curate ecological data coupled to oDNA genome information across eukaryotes, including a new semi-automated approach for gathering data on organismal traits from less systematized open-access resources including encyclopedia articles on species and taxa. The curation process also involved resolving several issues with existing datasets, including enforcing the clade-specificity of several ecological features and fixing incorrect annotations. Combining this unique dataset with our benchmarked comparative approaches, we confirm support for several known links between organismal ecology and organelle gene retention, identify several previously unidentified relationships constituting possible ecological contributors to oDNA genome evolution, and provide support for a recently hypothesized link between environmental demand and oDNA retention. We, with caution, discuss the implications of these findings for organelle evolution and of this pipeline for broad comparative analyses in other fields.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11282362/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140068682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Blouch: Bayesian Linear Ornstein-Uhlenbeck Models for Comparative Hypotheses. Blouch:比较假设的贝叶斯线性奥恩斯坦-乌伦贝克模型。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-07-24 DOI: 10.1093/sysbio/syae044
Mark Grabowski

Relationships among species in the tree of life can complicate comparative methods and testing adaptive hypotheses. Models based on the Ornstein-Uhlenbeck process permit hypotheses about adaptation to be tested by allowing traits to either evolve towards fixed adaptive optima (e.g., regimes or niches) or track continuously changing optima that can be influenced by other traits. These models allow estimation of the effects of both adaptation and phylogenetic inertia - resistance to adaptation due to any source - on trait evolution, an approach known as the "adaptation-inertia" framework. However, previous applications of this framework, and most approaches suggested to deal with the issue of species non-independence, are based on a maximum likelihood approach and thus it is difficult to include information based on prior biological knowledge in the analysis, which can affect resulting inferences. Here I present Blouch, (Bayesian Linear Ornstein-Uhlenbeck Models for Comparative Hypotheses), which fits allometric and adaptive models of continuous trait evolution in a Bayesian framework based on fixed or continuous predictors and incorporates measurement error. I first briefly discuss the models implemented in Blouch, and then the new applications for these models provided by a Bayesian framework. This includes the advantages of assigning biologically meaningful priors when compared to non-Bayesian approaches, allowing for varying effects (intercepts and slopes), and multilevel modeling. Validations on simulated data show good performance in recovering the true evolutionary parameters for all models. To demonstrate the workflow of Blouch on an empirical dataset, I test the hypothesis that the relatively larger antlers of larger bodied deer are the result of more intense sexual selection that comes along with their tendency to live in larger breeding groups. While results show that larger bodied deer that live in larger breeding groups have relatively larger antlers, deer living in the smallest groups appear to have a different and steeper scaling pattern of antler size to body size than other groups. These results are contrary to previous findings and may argue that a different type of sexual selection or other selective pressures govern optimum antler size in the smallest breeding groups.

生命树中物种之间的关系会使比较方法和适应性假设检验变得复杂。基于奥恩斯坦-乌伦贝克过程(Ornstein-Uhlenbeck process)的模型允许性状向固定的适应性最佳值(如制度或龛位)演化,或追踪可受其他性状影响的持续变化的最佳值,从而对适应性假说进行检验。这些模型可以估算适应性和系统发育惯性(任何原因导致的适应阻力)对性状进化的影响,这种方法被称为 "适应-惯性 "框架。然而,以往对这一框架的应用,以及为处理物种非独立性问题而提出的大多数方法,都是基于最大似然法,因此很难在分析中纳入基于先验生物学知识的信息,这可能会影响推论结果。在此,我将介绍Blouch(用于比较假设的贝叶斯线性奥恩斯坦-乌伦贝克模型),它在贝叶斯框架内基于固定或连续预测因子拟合连续性状进化的异速和适应模型,并纳入测量误差。我首先简要讨论了 Blouch 中实现的模型,然后讨论了贝叶斯框架为这些模型提供的新应用。与非贝叶斯方法相比,这包括分配有生物意义的先验、允许不同效应(截距和斜率)以及多层次建模等优势。模拟数据的验证结果表明,该方法在恢复所有模型的真实进化参数方面性能良好。为了展示 Blouch 在经验数据集上的工作流程,我检验了这样一个假设:体型较大的鹿的鹿角相对较大,这是由于它们倾向于生活在较大的繁殖群体中,因而性选择更为强烈。结果表明,生活在较大繁殖群体中的体型较大的鹿的鹿角相对较大,但生活在最小群体中的鹿的鹿角大小与体型的比例模式似乎与其他群体不同,而且更陡峭。这些结果与之前的研究结果相反,可能说明在最小的繁殖群体中,不同类型的性选择或其他选择压力会影响鹿角的最佳尺寸。
{"title":"Blouch: Bayesian Linear Ornstein-Uhlenbeck Models for Comparative Hypotheses.","authors":"Mark Grabowski","doi":"10.1093/sysbio/syae044","DOIUrl":"https://doi.org/10.1093/sysbio/syae044","url":null,"abstract":"<p><p>Relationships among species in the tree of life can complicate comparative methods and testing adaptive hypotheses. Models based on the Ornstein-Uhlenbeck process permit hypotheses about adaptation to be tested by allowing traits to either evolve towards fixed adaptive optima (e.g., regimes or niches) or track continuously changing optima that can be influenced by other traits. These models allow estimation of the effects of both adaptation and phylogenetic inertia - resistance to adaptation due to any source - on trait evolution, an approach known as the \"adaptation-inertia\" framework. However, previous applications of this framework, and most approaches suggested to deal with the issue of species non-independence, are based on a maximum likelihood approach and thus it is difficult to include information based on prior biological knowledge in the analysis, which can affect resulting inferences. Here I present Blouch, (Bayesian Linear Ornstein-Uhlenbeck Models for Comparative Hypotheses), which fits allometric and adaptive models of continuous trait evolution in a Bayesian framework based on fixed or continuous predictors and incorporates measurement error. I first briefly discuss the models implemented in Blouch, and then the new applications for these models provided by a Bayesian framework. This includes the advantages of assigning biologically meaningful priors when compared to non-Bayesian approaches, allowing for varying effects (intercepts and slopes), and multilevel modeling. Validations on simulated data show good performance in recovering the true evolutionary parameters for all models. To demonstrate the workflow of Blouch on an empirical dataset, I test the hypothesis that the relatively larger antlers of larger bodied deer are the result of more intense sexual selection that comes along with their tendency to live in larger breeding groups. While results show that larger bodied deer that live in larger breeding groups have relatively larger antlers, deer living in the smallest groups appear to have a different and steeper scaling pattern of antler size to body size than other groups. These results are contrary to previous findings and may argue that a different type of sexual selection or other selective pressures govern optimum antler size in the smallest breeding groups.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141752780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inferring Taxonomic Affinities and Genetic Distances Using Morphological Features Extracted from Specimen Images: a Case Study with a Bivalve dataset. 利用从标本图像中提取的形态学特征推断分类亲缘关系和遗传距离:双壳类动物数据集案例研究。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-07-24 DOI: 10.1093/sysbio/syae042
Martin Hofmann, Steffen Kiel, Lara M Kösters, Jana Wäldchen, Patrick Mäder

Reconstructing the tree of life and understanding the relationships of taxa are core questions in evolutionary and systematic biology. The main advances in this field in the last decades were derived from molecular phylogenetics; however, for most species, molecular data are not available. Here, we explore the applicability of two deep learning methods - supervised classification approaches and unsupervised similarity learning - to infer organism relationships from specimen images. As a basis, we assembled an image dataset covering 4144 bivalve species belonging to 74 families across all orders and subclasses of the extant Bivalvia, with molecular phylogenetic data being available for all families and a complete taxonomic hierarchy for all species. The suitability of this dataset for deep learning experiments was evidenced by an ablation study resulting in almost 80% accuracy for identifications on the species level. Three sets of experiments were performed using our dataset. First, we included taxonomic hierarchy and genetic distances in a supervised learning approach to obtain predictions on several taxonomic levels simultaneously. Here, we stimulated the model to consider features shared between closely related taxa to be more critical for their classification than features shared with distantly related taxa, imprinting phylogenetic and taxonomic affinities into the architecture and training procedure. Second, we used transfer learning and similarity learning approaches for zero-shot experiments to identify the higher-level taxonomic affinities of test species that the models had not been trained on. The models assigned the unknown species to their respective genera with approximately 48% and 67% accuracy. Lastly, we used unsupervised similarity learning to infer the relatedness of the images without prior knowledge of their taxonomic or phylogenetic affinities. The results clearly showed similarities between visual appearance and genetic relationships at the higher taxonomic levels. The correlation was 0.6 for the most species-rich subclass (Imparidentia), ranging from 0.5 to 0.7 for the orders with the most images. Overall, the correlation between visual similarity and genetic distances at the family level was 0.78. However, fine-grained reconstructions based on these observed correlations, such as sister-taxa relationships, require further work. Overall, our results broaden the applicability of automated taxon identification systems and provide a new avenue for estimating phylogenetic relationships from specimen images.

重建生命树和了解类群关系是进化生物学和系统生物学的核心问题。过去几十年来,这一领域的主要进展来自分子系统学;然而,对于大多数物种来说,分子数据是不可用的。在这里,我们探索了两种深度学习方法--监督分类方法和无监督相似性学习--在从标本图像推断生物关系方面的适用性。在此基础上,我们建立了一个图像数据集,涵盖了现存双壳纲所有目和亚目的 74 个科的 4144 个双壳类物种,所有科都有分子系统发生学数据,所有物种都有完整的分类层次结构。通过一项消融研究,该数据集的物种识别准确率接近 80%,这证明了该数据集适合进行深度学习实验。使用我们的数据集进行了三组实验。首先,我们在监督学习方法中加入了分类层次和遗传距离,以同时获得多个分类层次的预测结果。在此,我们激励模型考虑近缘类群之间共享的特征比远缘类群共享的特征对其分类更为重要,从而将系统发生学和分类学亲缘关系印刻到结构和训练程序中。其次,我们利用迁移学习和相似性学习方法进行了零次实验,以确定模型未训练过的测试物种的高层分类亲缘关系。模型将未知物种归入各自属的准确率分别为 48% 和 67%。最后,我们使用无监督相似性学习来推断图像的亲缘关系,而无需事先了解其分类学或系统发育亲缘关系。结果清楚地表明,在较高的分类水平上,视觉外观与遗传关系之间存在相似性。物种最丰富的亚纲(Imparidentia)的相关性为 0.6,图像最多的目为 0.5 至 0.7。总体而言,视觉相似性与科级遗传距离之间的相关性为 0.78。然而,基于这些观察到的相关性进行细粒度重建,如姐妹-同属关系,还需要进一步的工作。总之,我们的研究结果拓宽了自动分类鉴定系统的适用范围,并为从标本图像中估计系统发育关系提供了一条新途径。
{"title":"Inferring Taxonomic Affinities and Genetic Distances Using Morphological Features Extracted from Specimen Images: a Case Study with a Bivalve dataset.","authors":"Martin Hofmann, Steffen Kiel, Lara M Kösters, Jana Wäldchen, Patrick Mäder","doi":"10.1093/sysbio/syae042","DOIUrl":"https://doi.org/10.1093/sysbio/syae042","url":null,"abstract":"<p><p>Reconstructing the tree of life and understanding the relationships of taxa are core questions in evolutionary and systematic biology. The main advances in this field in the last decades were derived from molecular phylogenetics; however, for most species, molecular data are not available. Here, we explore the applicability of two deep learning methods - supervised classification approaches and unsupervised similarity learning - to infer organism relationships from specimen images. As a basis, we assembled an image dataset covering 4144 bivalve species belonging to 74 families across all orders and subclasses of the extant Bivalvia, with molecular phylogenetic data being available for all families and a complete taxonomic hierarchy for all species. The suitability of this dataset for deep learning experiments was evidenced by an ablation study resulting in almost 80% accuracy for identifications on the species level. Three sets of experiments were performed using our dataset. First, we included taxonomic hierarchy and genetic distances in a supervised learning approach to obtain predictions on several taxonomic levels simultaneously. Here, we stimulated the model to consider features shared between closely related taxa to be more critical for their classification than features shared with distantly related taxa, imprinting phylogenetic and taxonomic affinities into the architecture and training procedure. Second, we used transfer learning and similarity learning approaches for zero-shot experiments to identify the higher-level taxonomic affinities of test species that the models had not been trained on. The models assigned the unknown species to their respective genera with approximately 48% and 67% accuracy. Lastly, we used unsupervised similarity learning to infer the relatedness of the images without prior knowledge of their taxonomic or phylogenetic affinities. The results clearly showed similarities between visual appearance and genetic relationships at the higher taxonomic levels. The correlation was 0.6 for the most species-rich subclass (Imparidentia), ranging from 0.5 to 0.7 for the orders with the most images. Overall, the correlation between visual similarity and genetic distances at the family level was 0.78. However, fine-grained reconstructions based on these observed correlations, such as sister-taxa relationships, require further work. Overall, our results broaden the applicability of automated taxon identification systems and provide a new avenue for estimating phylogenetic relationships from specimen images.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":null,"pages":null},"PeriodicalIF":6.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141752782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Systematic Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1