首页 > 最新文献

Systematic Biology最新文献

英文 中文
Adaptive Radiation Without Independent Stages of Trait Evolution in a Group of Caribbean Anoles. 加勒比鼹鼠群中没有独立性状进化阶段的适应性辐射。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-10-30 DOI: 10.1093/sysbio/syae041
Brooke Bodensteiner, Edward D Burress, Martha M Muñoz

Adaptive radiation involves diversification along multiple trait axes, producing phenotypically diverse, species-rich lineages. Theory generally predicts that multi-trait evolution occurs via a "stages" model, with some traits saturating early in a lineage's history, and others diversifying later. Despite its multidimensional nature, however, we know surprisingly little about how different suites of traits evolve during adaptive radiation. Here, we investigated the rate, pattern, and timing of morphological and physiological evolution in the anole lizard adaptive radiation from the Caribbean island of Hispaniola. Rates and patterns of morphological and physiological diversity are largely unaligned, corresponding to independent selective pressures associated with structural and thermal niches. Cold tolerance evolution reflects parapatric divergence across elevation, rather than niche partitioning within communities. Heat tolerance evolution and the preferred temperature evolve more slowly than cold tolerance, reflecting behavioral buffering, particularly in edge-habitat species (a pattern associated with the Bogert effect). In contrast to the nearby island of Puerto Rico, closely related anoles on Hispaniola do not sympatrically partition thermal niche space. Instead, allopatric and parapatric separation across biogeographic and environmental boundaries serves to keep morphologically similar close relatives apart. The phenotypic diversity of this island's adaptive radiation accumulated largely as a by-product of time, with surprisingly few exceptional pulses of trait evolution. A better understanding of the processes that guide multidimensional trait evolution (and nuance therein) will prove key in determining whether the stages model should be considered a common theme of adaptive radiation.

适应性辐射涉及沿多个性状轴的多样化,从而产生表型多样、物种丰富的品系。理论通常预测,多性状进化是通过一个 "阶段 "模型发生的,一些性状在一个品系历史的早期达到饱和,而另一些则在后期多样化。尽管多性状进化具有多维性,但我们对适应性辐射过程中不同性状组合如何进化却知之甚少。在这里,我们研究了来自加勒比海伊斯帕尼奥拉岛的无尾蜥适应性辐射中形态和生理进化的速度、模式和时间。形态和生理多样性的速率和模式在很大程度上是不一致的,这与结构和热环境相关的独立选择压力相对应。耐寒性的进化反映了跨海拔的同域分化,而不是群落内部的生态位划分。耐热性和喜好温度的进化比耐寒性慢,这反映了行为缓冲,特别是在边缘栖息地物种中(与博格特效应相关的模式)。与附近的波多黎各岛不同,伊斯帕尼奥拉岛上亲缘关系密切的鼹鼠并不以同域方式划分热生态位空间。相反,跨越生物地理学和环境边界的同域和旁域分离使形态上相似的近缘种保持分离。该岛适应性辐射的表型多样性在很大程度上是随着时间的推移而积累起来的,其性状进化的特殊脉冲少得令人吃惊。更好地了解引导多维性状进化的过程(以及其中的细微差别)将被证明是确定阶段模型是否应被视为适应性辐射共同主题的关键。
{"title":"Adaptive Radiation Without Independent Stages of Trait Evolution in a Group of Caribbean Anoles.","authors":"Brooke Bodensteiner, Edward D Burress, Martha M Muñoz","doi":"10.1093/sysbio/syae041","DOIUrl":"10.1093/sysbio/syae041","url":null,"abstract":"<p><p>Adaptive radiation involves diversification along multiple trait axes, producing phenotypically diverse, species-rich lineages. Theory generally predicts that multi-trait evolution occurs via a \"stages\" model, with some traits saturating early in a lineage's history, and others diversifying later. Despite its multidimensional nature, however, we know surprisingly little about how different suites of traits evolve during adaptive radiation. Here, we investigated the rate, pattern, and timing of morphological and physiological evolution in the anole lizard adaptive radiation from the Caribbean island of Hispaniola. Rates and patterns of morphological and physiological diversity are largely unaligned, corresponding to independent selective pressures associated with structural and thermal niches. Cold tolerance evolution reflects parapatric divergence across elevation, rather than niche partitioning within communities. Heat tolerance evolution and the preferred temperature evolve more slowly than cold tolerance, reflecting behavioral buffering, particularly in edge-habitat species (a pattern associated with the Bogert effect). In contrast to the nearby island of Puerto Rico, closely related anoles on Hispaniola do not sympatrically partition thermal niche space. Instead, allopatric and parapatric separation across biogeographic and environmental boundaries serves to keep morphologically similar close relatives apart. The phenotypic diversity of this island's adaptive radiation accumulated largely as a by-product of time, with surprisingly few exceptional pulses of trait evolution. A better understanding of the processes that guide multidimensional trait evolution (and nuance therein) will prove key in determining whether the stages model should be considered a common theme of adaptive radiation.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"743-757"},"PeriodicalIF":6.1,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141879499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Complex Hybridization in a Clade of Polytypic Salamanders (Plethodontidae: Desmognathus) Uncovered by Estimating Higher-Level Phylogenetic Networks. 通过估算更高层次的系统发育网络发现多型蝾螈(Plethodontidae: Desmognathus)支系中的复杂杂交。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-10-28 DOI: 10.1093/sysbio/syae060
R Alexander Pyron, Kyle A O'Connell, Edward A Myers, David A Beamer, Hector Baños

Reticulation between incipient lineages is a common feature of diversification. We examine these phenomena in the Pisgah clade of Desmognathus salamanders from the southern Appalachian Mountains of the eastern United States. The group contains four to seven species exhibiting two discrete phenotypes, aquatic "shovel-nosed" and semi-aquatic "black-bellied" forms. These ecomorphologies are ancient and have apparently been transmitted repeatedly between lineages through introgression. Geographically proximate populations of both phenotypes exhibit admixture, and at least two black-bellied lineages have been produced via reticulations between shovel-nosed parentals, suggesting potential hybrid speciation dynamics. However, computational constraints currently limit our ability to reconstruct network radiations from gene-tree data. Available methods are limited to level-1 networks wherein reticulations do not share edges, and higher-level networks may be non-identifiable in many cases. We present a heuristic approach to recover information from higher-level networks across a range of potentially identifiable empirical scenarios, supported by theory and simulation. When extrinsic information indicates the location and direction of reticulations, our method can successfully estimate a reduced possible set of non-level-1 networks. Phylogenomic data support a single backbone topology with up to five overlapping hybrid edges in the Pisgah clade. These results suggest an unusual mechanism of ecomorphological hybrid speciation, wherein a binary threshold trait causes some hybrid populations to shift between microhabitat niches, promoting ecological divergence between sympatric hybrids and parentals. This contrasts with other well-known systems in which hybrids exhibit intermediate, novel, or transgressive phenotypes. The genetic basis of these phenotypes is unclear and further data are needed to clarify the evolutionary basis of morphological changes with ecological consequences.

初生品系之间的网状结构是生物多样化的一个常见特征。我们研究了美国东部阿巴拉契亚山脉南部 Desmognathus 大鲵 Pisgah 支系的这些现象。该类包含四到七个物种,表现出两种不同的表型,即水生的 "铲鼻 "型和半水生的 "黑腹 "型。这些非形态是古老的,显然是通过引种在不同品系之间反复传播的。这两种表型在地理上相近的种群表现出混杂现象,至少有两个黑腹系是通过铲鼻亲本之间的网状结构产生的,这表明存在潜在的杂交物种动态。然而,目前计算方面的制约因素限制了我们从基因树数据重建网络辐射的能力。现有的方法仅限于网状结构不共享边的一级网络,而更高级别的网络在很多情况下可能无法识别。我们提出了一种启发式方法,在理论和模拟的支持下,从一系列可能可识别的经验场景中恢复更高层次网络的信息。当外在信息表明网状结构的位置和方向时,我们的方法可以成功地估算出减少的非一级网络的可能集合。系统发生组数据支持皮斯加支系中具有多达五个重叠杂交边缘的单一主干拓扑结构。这些结果表明,二元阈值性状导致一些杂交种群在微生境壁龛之间转移,从而促进同域杂交种与亲本之间的生态分化,这是一种不寻常的非形态杂交物种分化机制。这与其他众所周知的系统形成鲜明对比,在这些系统中,杂交种表现出中间型、新颖型或跃迁型表型。这些表型的遗传基础尚不清楚,需要进一步的数据来澄清具有生态后果的形态变化的进化基础。
{"title":"Complex Hybridization in a Clade of Polytypic Salamanders (Plethodontidae: Desmognathus) Uncovered by Estimating Higher-Level Phylogenetic Networks.","authors":"R Alexander Pyron, Kyle A O'Connell, Edward A Myers, David A Beamer, Hector Baños","doi":"10.1093/sysbio/syae060","DOIUrl":"https://doi.org/10.1093/sysbio/syae060","url":null,"abstract":"<p><p>Reticulation between incipient lineages is a common feature of diversification. We examine these phenomena in the Pisgah clade of Desmognathus salamanders from the southern Appalachian Mountains of the eastern United States. The group contains four to seven species exhibiting two discrete phenotypes, aquatic \"shovel-nosed\" and semi-aquatic \"black-bellied\" forms. These ecomorphologies are ancient and have apparently been transmitted repeatedly between lineages through introgression. Geographically proximate populations of both phenotypes exhibit admixture, and at least two black-bellied lineages have been produced via reticulations between shovel-nosed parentals, suggesting potential hybrid speciation dynamics. However, computational constraints currently limit our ability to reconstruct network radiations from gene-tree data. Available methods are limited to level-1 networks wherein reticulations do not share edges, and higher-level networks may be non-identifiable in many cases. We present a heuristic approach to recover information from higher-level networks across a range of potentially identifiable empirical scenarios, supported by theory and simulation. When extrinsic information indicates the location and direction of reticulations, our method can successfully estimate a reduced possible set of non-level-1 networks. Phylogenomic data support a single backbone topology with up to five overlapping hybrid edges in the Pisgah clade. These results suggest an unusual mechanism of ecomorphological hybrid speciation, wherein a binary threshold trait causes some hybrid populations to shift between microhabitat niches, promoting ecological divergence between sympatric hybrids and parentals. This contrasts with other well-known systems in which hybrids exhibit intermediate, novel, or transgressive phenotypes. The genetic basis of these phenotypes is unclear and further data are needed to clarify the evolutionary basis of morphological changes with ecological consequences.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":""},"PeriodicalIF":6.1,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142523144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Caught in the Act: Incipient Speciation at the Southern Limit of Viburnum in the Central Andes. 被逮个正着:安第斯山脉中部紫云英南缘的物种萌芽。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-10-25 DOI: 10.1093/sysbio/syae023
Carlos A Maya-Lastra, Patrick W Sweeney, Deren A R Eaton, Vania Torrez, Carla Maldonado, Malu I Ore-Rengifo, Mónica Arakaki, Michael J Donoghue, Erika J Edwards

A fundamental objective of evolutionary biology is to understand the origin of independently evolving species. Phylogenetic studies of species radiations rarely are able to document ongoing speciation; instead, modes of speciation, entailing geographic separation and/or ecological differentiation, are posited retrospectively. The Oreinotinus clade of Viburnum has radiated recently from north to south through the cloud forests of Mexico and Central America to the Central Andes. Our analyses support a hypothesis of incipient speciation in Oreinotinus at the southern edge of its geographic range, from central Peru to northern Argentina. Although several species and infraspecific taxa have been recognized in this area, multiple lines of evidence and analytical approaches (including analyses of phylogenetic relationships, genetic structure, leaf morphology, and climatic envelopes) favor the recognition of just a single species, V. seemenii. We show that what has previously been recognized as V. seemenii f. minor has recently occupied the drier Tucuman-Bolivian forest region from Samaipata in Bolivia to Salta in northern Argentina. Plants in these populations form a well-supported clade with a distinctive genetic signature and they have evolved smaller, narrower leaves. We interpret this as the beginning of a within-species divergence process that has elsewhere in the neotropics resulted repeatedly in Viburnum species with a particular set of leaf ecomorphs. Specifically, the southern populations are in the process of evolving the small, glabrous, and entire leaf ecomorph that has evolved in four other montane areas of endemism. As predicted based on our studies of leaf ecomorphs in Chiapas, Mexico, these southern populations experience generally drier conditions, with large diurnal temperature fluctuations. In a central portion of the range of V. seemenii, characterized by wetter climatic conditions, we also document what may be the initial differentiation of the leaf ecomorph with larger, pubescent, and toothy leaves. The emergence of these ecomorphs thus appears to be driven by adaptation to subtly different climatic conditions in separate geographic regions, as opposed to parapatric differentiation along elevational gradients as suggested by Viburnum species distributions in other parts of the neotropics.

进化生物学的一个基本目标是了解独立进化物种的起源。物种辐射的系统发育研究很少能够记录正在进行的物种分化;相反,物种分化的模式,包括地理分离和/或生态分化,都是回溯性的假设。最近,紫云英的 Oreinotinus 支系从北向南辐射,穿过墨西哥和中美洲的云雾林,到达中安第斯山脉。我们的分析支持一种假设,即在秘鲁中部到阿根廷北部的 Oreinotinus 地理分布范围的南部边缘,其物种刚刚开始分化。尽管在这一地区已经发现了多个物种和种下类群,但多种证据和分析方法(包括系统发育关系、遗传结构、叶片形态学和气候包络分析)都支持只承认一个物种,即 V. seemenii。我们的研究表明,以前被认为是 V. seemenii f. minor 的物种最近占据了从玻利维亚的萨马伊帕塔到阿根廷北部萨尔塔的图库曼-玻利维亚森林地区。这些种群中的植物形成了一个具有独特遗传特征的支持良好的支系,它们进化出了更小、更窄的叶片。我们将此解释为种内分化过程的开始,这种分化过程在新热带地区的其他地方曾多次导致紫云英物种具有一套特殊的叶片非形态。具体来说,南方种群正在进化出小的、无毛的、全缘叶的叶片异形,而这种叶片异形已经在其他四个山地特有地区进化出来了。根据我们对墨西哥恰帕斯州叶片异形的研究预测,这些南部种群通常较为干燥,昼夜温度波动较大。在V. seemenii分布区的中部,气候条件较为潮湿,我们还记录了叶片的最初分化,叶片较大、有短柔毛和齿状突起。因此,这些非形态的出现似乎是为了适应不同地理区域微妙不同的气候条件,而不是像新热带地区其他地方的紫云英物种分布所表明的那样,沿着海拔梯度进行同域分化。
{"title":"Caught in the Act: Incipient Speciation at the Southern Limit of Viburnum in the Central Andes.","authors":"Carlos A Maya-Lastra, Patrick W Sweeney, Deren A R Eaton, Vania Torrez, Carla Maldonado, Malu I Ore-Rengifo, Mónica Arakaki, Michael J Donoghue, Erika J Edwards","doi":"10.1093/sysbio/syae023","DOIUrl":"10.1093/sysbio/syae023","url":null,"abstract":"<p><p>A fundamental objective of evolutionary biology is to understand the origin of independently evolving species. Phylogenetic studies of species radiations rarely are able to document ongoing speciation; instead, modes of speciation, entailing geographic separation and/or ecological differentiation, are posited retrospectively. The Oreinotinus clade of Viburnum has radiated recently from north to south through the cloud forests of Mexico and Central America to the Central Andes. Our analyses support a hypothesis of incipient speciation in Oreinotinus at the southern edge of its geographic range, from central Peru to northern Argentina. Although several species and infraspecific taxa have been recognized in this area, multiple lines of evidence and analytical approaches (including analyses of phylogenetic relationships, genetic structure, leaf morphology, and climatic envelopes) favor the recognition of just a single species, V. seemenii. We show that what has previously been recognized as V. seemenii f. minor has recently occupied the drier Tucuman-Bolivian forest region from Samaipata in Bolivia to Salta in northern Argentina. Plants in these populations form a well-supported clade with a distinctive genetic signature and they have evolved smaller, narrower leaves. We interpret this as the beginning of a within-species divergence process that has elsewhere in the neotropics resulted repeatedly in Viburnum species with a particular set of leaf ecomorphs. Specifically, the southern populations are in the process of evolving the small, glabrous, and entire leaf ecomorph that has evolved in four other montane areas of endemism. As predicted based on our studies of leaf ecomorphs in Chiapas, Mexico, these southern populations experience generally drier conditions, with large diurnal temperature fluctuations. In a central portion of the range of V. seemenii, characterized by wetter climatic conditions, we also document what may be the initial differentiation of the leaf ecomorph with larger, pubescent, and toothy leaves. The emergence of these ecomorphs thus appears to be driven by adaptation to subtly different climatic conditions in separate geographic regions, as opposed to parapatric differentiation along elevational gradients as suggested by Viburnum species distributions in other parts of the neotropics.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"629-643"},"PeriodicalIF":6.1,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141238062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stochastic Character Mapping, Bayesian Model Selection, and Biosynthetic Pathways Shed New Light on the Evolution of Habitat Preference in Cyanobacteria. 随机特征映射、贝叶斯模型选择和生物合成途径为蓝藻栖息地偏好的进化提供了新线索
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-10-25 DOI: 10.1093/sysbio/syae025
Giorgio Bianchini, Martin Hagemann, Patricia Sánchez-Baracaldo

Cyanobacteria are the only prokaryotes to have evolved oxygenic photosynthesis paving the way for complex life. Studying the evolution and ecological niche of cyanobacteria and their ancestors is crucial for understanding the intricate dynamics of biosphere evolution. These organisms frequently deal with environmental stressors such as salinity and drought, and they employ compatible solutes as a mechanism to cope with these challenges. Compatible solutes are small molecules that help maintain cellular osmotic balance in high-salinity environments, such as marine waters. Their production plays a crucial role in salt tolerance, which, in turn, influences habitat preference. Among the 5 known compatible solutes produced by cyanobacteria (sucrose, trehalose, glucosylglycerol, glucosylglycerate, and glycine betaine), their synthesis varies between individual strains. In this study, we work in a Bayesian stochastic mapping framework, integrating multiple sources of information about compatible solute biosynthesis in order to predict the ancestral habitat preference of Cyanobacteria. Through extensive model selection analyses and statistical tests for correlation, we identify glucosylglycerol and glucosylglycerate as the most significantly correlated with habitat preference, while trehalose exhibits the weakest correlation. Additionally, glucosylglycerol, glucosylglycerate, and glycine betaine show high loss/gain rate ratios, indicating their potential role in adaptability, while sucrose and trehalose are less likely to be lost due to their additional cellular functions. Contrary to previous findings, our analyses predict that the last common ancestor of Cyanobacteria (living at around 3180 Ma) had a 97% probability of a high salinity habitat preference and was likely able to synthesize glucosylglycerol and glucosylglycerate. Nevertheless, cyanobacteria likely colonized low-salinity environments shortly after their origin, with an 89% probability of the first cyanobacterium with low-salinity habitat preference arising prior to the Great Oxygenation Event (2460 Ma). Stochastic mapping analyses provide evidence of cyanobacteria inhabiting early marine habitats, aiding in the interpretation of the geological record. Our age estimate of ~2590 Ma for the divergence of 2 major cyanobacterial clades (Macro- and Microcyanobacteria) suggests that these were likely significant contributors to primary productivity in marine habitats in the lead-up to the Great Oxygenation Event, and thus played a pivotal role in triggering the sudden increase in atmospheric oxygen.

蓝藻是唯一进化出含氧光合作用的原核生物,为复杂生命铺平了道路。研究蓝藻及其祖先的进化和生态位对于了解生物圈进化的复杂动态至关重要。这些生物经常要应对盐度和干旱等环境压力,它们采用相容溶质作为应对这些挑战的机制。相容溶质是一种小分子,有助于在海水等高盐度环境中维持细胞渗透平衡。它们的产生对耐盐性起着至关重要的作用,而耐盐性反过来又会影响对栖息地的偏好。在蓝藻产生的五种已知相容溶质(蔗糖、曲哈糖、葡萄糖基甘油、葡萄糖基甘油酸和甘氨酸甜菜碱)中,不同菌株的合成情况各不相同。在本研究中,我们采用贝叶斯随机映射框架,整合了兼容溶质生物合成的多种信息来源,以预测蓝藻的祖先栖息地偏好。通过大量的模型选择分析和相关性统计检验,我们发现葡萄糖基甘油和葡萄糖基甘油酸酯与栖息地偏好的相关性最显著,而三卤糖的相关性最弱。此外,葡萄糖基甘油、葡萄糖基甘油酸和甘氨酸甜菜碱显示出较高的损失/增加比率,表明它们在适应性方面的潜在作用,而蔗糖和三卤糖由于具有额外的细胞功能,损失的可能性较小。与之前的研究结果相反,我们的分析预测蓝藻的最后一个共同祖先(生活在约 3180 马年)有 97% 的可能性偏好高盐度生境,并且很可能能够合成葡萄糖基甘油和葡萄糖基甘油酸。然而,蓝藻很可能在起源后不久就在低盐度环境中定殖,第一种具有低盐度生境偏好的蓝藻出现在大富氧事件(2460 Ma)之前的概率为 89%。随机绘图分析提供了蓝藻栖息于早期海洋生境的证据,有助于解释地质记录。我们对两大蓝藻支系(巨蓝藻和微蓝藻)分化的年龄估计为约 2590 Ma,这表明这些蓝藻支系很可能是大富氧作用之前海洋生境初级生产力的重要贡献者,因此在引发大气中氧气的突然增加方面发挥了关键作用。
{"title":"Stochastic Character Mapping, Bayesian Model Selection, and Biosynthetic Pathways Shed New Light on the Evolution of Habitat Preference in Cyanobacteria.","authors":"Giorgio Bianchini, Martin Hagemann, Patricia Sánchez-Baracaldo","doi":"10.1093/sysbio/syae025","DOIUrl":"10.1093/sysbio/syae025","url":null,"abstract":"<p><p>Cyanobacteria are the only prokaryotes to have evolved oxygenic photosynthesis paving the way for complex life. Studying the evolution and ecological niche of cyanobacteria and their ancestors is crucial for understanding the intricate dynamics of biosphere evolution. These organisms frequently deal with environmental stressors such as salinity and drought, and they employ compatible solutes as a mechanism to cope with these challenges. Compatible solutes are small molecules that help maintain cellular osmotic balance in high-salinity environments, such as marine waters. Their production plays a crucial role in salt tolerance, which, in turn, influences habitat preference. Among the 5 known compatible solutes produced by cyanobacteria (sucrose, trehalose, glucosylglycerol, glucosylglycerate, and glycine betaine), their synthesis varies between individual strains. In this study, we work in a Bayesian stochastic mapping framework, integrating multiple sources of information about compatible solute biosynthesis in order to predict the ancestral habitat preference of Cyanobacteria. Through extensive model selection analyses and statistical tests for correlation, we identify glucosylglycerol and glucosylglycerate as the most significantly correlated with habitat preference, while trehalose exhibits the weakest correlation. Additionally, glucosylglycerol, glucosylglycerate, and glycine betaine show high loss/gain rate ratios, indicating their potential role in adaptability, while sucrose and trehalose are less likely to be lost due to their additional cellular functions. Contrary to previous findings, our analyses predict that the last common ancestor of Cyanobacteria (living at around 3180 Ma) had a 97% probability of a high salinity habitat preference and was likely able to synthesize glucosylglycerol and glucosylglycerate. Nevertheless, cyanobacteria likely colonized low-salinity environments shortly after their origin, with an 89% probability of the first cyanobacterium with low-salinity habitat preference arising prior to the Great Oxygenation Event (2460 Ma). Stochastic mapping analyses provide evidence of cyanobacteria inhabiting early marine habitats, aiding in the interpretation of the geological record. Our age estimate of ~2590 Ma for the divergence of 2 major cyanobacterial clades (Macro- and Microcyanobacteria) suggests that these were likely significant contributors to primary productivity in marine habitats in the lead-up to the Great Oxygenation Event, and thus played a pivotal role in triggering the sudden increase in atmospheric oxygen.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"644-665"},"PeriodicalIF":6.1,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11505929/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141459410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Phylogenetic tree instability after taxon addition: empirical frequency, predictability, and consequences for online inference 分类群添加后系统发生树的不稳定性:经验频率、可预测性和在线推断的后果
IF 6.5 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-10-25 DOI: 10.1093/sysbio/syae059
Lena Collienne, Mary Barker, Marc A Suchard, Frederick A Matsen IV
Online phylogenetic inference methods add sequentially arriving sequences to an inferred phylogeny without the need to recompute the entire tree from scratch. Some online method implementations exist already, but there remains concern that additional sequences may change the topological relationship among the original set of taxa. We call such a change in tree topology a lack of stability for the inferred tree. In this paper, we analyze the stability of single taxon addition in a Maximum Likelihood framework across 1, 000 empirical datasets. We find that instability occurs in almost 90% of our examples, although observed topological differences do not always reach significance under the AU-test. Changes in tree topology after addition of a taxon rarely occur close to its attachment location, and are more frequently observed in more distant tree locations carrying low bootstrap support. To investigate whether instability is predictable, we hypothesize sources of instability and design summary statistics addressing these hypotheses. Using these summary statistics as input features for machine learning under random forests, we are able to predict instability and can identify the most influential features. In summary, it does not appear that a strict insertion-only online inference method will deliver globally optimal trees, although relaxing insertion strictness by allowing for a small number of final tree rearrangements or accepting slightly suboptimal solutions appears feasible.
在线系统发育推断方法可将连续到达的序列添加到推断的系统发育中,而无需从头开始重新计算整棵树。目前已经有一些在线方法的实现,但人们仍然担心额外的序列可能会改变原始分类群之间的拓扑关系。我们将这种树拓扑结构的变化称为推断树缺乏稳定性。在本文中,我们在最大似然法框架下分析了 1,000 个经验数据集中单个分类群增加的稳定性。我们发现几乎 90% 的实例都存在不稳定性,尽管在 AU 检验中观察到的拓扑差异并不总是达到显著性。加入一个分类群后,树拓扑结构的变化很少发生在其附着位置附近,而更多地发生在较远的树位置,且引导支持率较低。为了研究不稳定性是否可以预测,我们假设了不稳定性的来源,并针对这些假设设计了汇总统计量。使用这些汇总统计作为随机森林下机器学习的输入特征,我们能够预测不稳定性,并能识别出最有影响力的特征。总之,严格的只插入在线推理方法似乎无法提供全局最优树,不过通过允许少量最终树重新排列或接受略微次优的解决方案来放宽插入的严格性似乎是可行的。
{"title":"Phylogenetic tree instability after taxon addition: empirical frequency, predictability, and consequences for online inference","authors":"Lena Collienne, Mary Barker, Marc A Suchard, Frederick A Matsen IV","doi":"10.1093/sysbio/syae059","DOIUrl":"https://doi.org/10.1093/sysbio/syae059","url":null,"abstract":"Online phylogenetic inference methods add sequentially arriving sequences to an inferred phylogeny without the need to recompute the entire tree from scratch. Some online method implementations exist already, but there remains concern that additional sequences may change the topological relationship among the original set of taxa. We call such a change in tree topology a lack of stability for the inferred tree. In this paper, we analyze the stability of single taxon addition in a Maximum Likelihood framework across 1, 000 empirical datasets. We find that instability occurs in almost 90% of our examples, although observed topological differences do not always reach significance under the AU-test. Changes in tree topology after addition of a taxon rarely occur close to its attachment location, and are more frequently observed in more distant tree locations carrying low bootstrap support. To investigate whether instability is predictable, we hypothesize sources of instability and design summary statistics addressing these hypotheses. Using these summary statistics as input features for machine learning under random forests, we are able to predict instability and can identify the most influential features. In summary, it does not appear that a strict insertion-only online inference method will deliver globally optimal trees, although relaxing insertion strictness by allowing for a small number of final tree rearrangements or accepting slightly suboptimal solutions appears feasible.","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":"31 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142490398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Phylogenomic Backbone for Acoelomorpha Inferred from Transcriptomic Data 从转录组数据中推断出Acoelomorpha的系统进化骨架
IF 6.5 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-10-25 DOI: 10.1093/sysbio/syae057
Samuel Abalde, Ulf Jondelius
Xenacoelomorpha are mostly microscopic, morphologically simple worms, lacking many structures typical of other bilaterians. Xenacoelomorphs –which include three main groups: Acoela, Nemertodermatida, and Xenoturbella– have been proposed to be an early diverging Bilateria, sister to protostomes and deuterostomes, but other phylogenomic analyses have recovered this clade nested within the deuterostomes, as sister to Ambulacraria. The position of Xenacoelomorpha within the metazoan tree has understandably attracted a lot of attention, overshadowing the study of phylogenetic relationships within this group. Given that Xenoturbella includes only six species whose relationships are well understood, we decided to focus on the most speciose Acoelomorpha (Acoela + Nemertodermatida). Here, we have sequenced 29 transcriptomes, doubling the number of sequenced species, to infer a backbone tree for Acoelomorpha based on genomic data. The recovered topology is mostly congruent with previous studies. The most important difference is the recovery of Paratomella as the first off-shoot within Acoela, dramatically changing the reconstruction of the ancestral acoel. Besides, we have detected incongruence between the gene trees and the species tree, likely linked to incomplete lineage sorting, and some signal of introgression between the families Dakuidae and Mecynostomidae, which hampers inferring the correct placement of this family and, particularly, of the genus Notocelis. We have also used this dataset to infer for the first time diversification times within Acoelomorpha, which coincide with known bilaterian diversification and extinction events. Given the importance of morphological data in acoelomorph phylogenetics, we tested several partitions and models. Although morphological data failed to recover a robust phylogeny, phylogenetic placement has proven to be a suitable alternative when a reference phylogeny is available.
Xenacoelomorpha大多是微小的、形态简单的蠕虫,缺乏其他双脊类动物的许多典型结构。Xenacoelomorphs包括三大类:Acoela、Nemertodermatida和Xenoturbella:Acoela、Nemertodermatida和Xenoturbella--曾被认为是早期分化的双鞭毛目动物,是原肠动物和去壳动物的姊妹,但其他系统发生学分析发现,这一支系嵌套在去壳动物中,是Ambulacraria的姊妹。可以理解的是,Xenacoelomorpha 在后生动物树中的位置引起了广泛的关注,这给该类群内部的系统发育关系研究蒙上了阴影。鉴于Xenoturbella仅包括6个物种,而这些物种之间的关系已十分清楚,我们决定重点研究物种最多的Acoelomorpha(Acoela + Nemertodermatida)。在这里,我们对 29 个转录组进行了测序,使测序物种的数量增加了一倍,从而根据基因组数据推断出 Acoelomorpha 的骨干树。所恢复的拓扑结构与之前的研究基本一致。最重要的不同之处在于,我们发现 Paratomella 是 Acoela 的第一个分支,这极大地改变了祖先 acoel 的重建。此外,我们还发现了基因树与物种树之间的不一致性,这很可能与不完全的世系分类有关,而且 Dakuidae 科与 Mecynostomidae 科之间也存在一些外来入侵信号,这妨碍了推断该科,尤其是 Notocelis 属的正确位置。我们还利用该数据集首次推断了Acoelomorpha科的分化时间,该时间与已知的两栖动物分化和灭绝事件相吻合。鉴于形态学数据在无尾目系统发生学中的重要性,我们测试了多个分区和模型。尽管形态学数据未能恢复一个稳健的系统发生,但事实证明,在有参考系统发生的情况下,系统发生定位是一个合适的替代方法。
{"title":"A Phylogenomic Backbone for Acoelomorpha Inferred from Transcriptomic Data","authors":"Samuel Abalde, Ulf Jondelius","doi":"10.1093/sysbio/syae057","DOIUrl":"https://doi.org/10.1093/sysbio/syae057","url":null,"abstract":"Xenacoelomorpha are mostly microscopic, morphologically simple worms, lacking many structures typical of other bilaterians. Xenacoelomorphs –which include three main groups: Acoela, Nemertodermatida, and Xenoturbella– have been proposed to be an early diverging Bilateria, sister to protostomes and deuterostomes, but other phylogenomic analyses have recovered this clade nested within the deuterostomes, as sister to Ambulacraria. The position of Xenacoelomorpha within the metazoan tree has understandably attracted a lot of attention, overshadowing the study of phylogenetic relationships within this group. Given that Xenoturbella includes only six species whose relationships are well understood, we decided to focus on the most speciose Acoelomorpha (Acoela + Nemertodermatida). Here, we have sequenced 29 transcriptomes, doubling the number of sequenced species, to infer a backbone tree for Acoelomorpha based on genomic data. The recovered topology is mostly congruent with previous studies. The most important difference is the recovery of Paratomella as the first off-shoot within Acoela, dramatically changing the reconstruction of the ancestral acoel. Besides, we have detected incongruence between the gene trees and the species tree, likely linked to incomplete lineage sorting, and some signal of introgression between the families Dakuidae and Mecynostomidae, which hampers inferring the correct placement of this family and, particularly, of the genus Notocelis. We have also used this dataset to infer for the first time diversification times within Acoelomorpha, which coincide with known bilaterian diversification and extinction events. Given the importance of morphological data in acoelomorph phylogenetics, we tested several partitions and models. Although morphological data failed to recover a robust phylogeny, phylogenetic placement has proven to be a suitable alternative when a reference phylogeny is available.","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":"97 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142489582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Benefits and Limits of Phasing Alleles for Network Inference of Allopolyploid Complexes. 异源多倍体复合体网络推断中分阶段等位基因的优势与局限性
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-10-25 DOI: 10.1093/sysbio/syae024
George P Tiley, Andrew A Crowl, Paul S Manos, Emily B Sessa, Claudia Solís-Lemus, Anne D Yoder, J Gordon Burleigh

Accurately reconstructing the reticulate histories of polyploids remains a central challenge for understanding plant evolution. Although phylogenetic networks can provide insights into relationships among polyploid lineages, inferring networks may be hindered by the complexities of homology determination in polyploid taxa. We use simulations to show that phasing alleles from allopolyploid individuals can improve phylogenetic network inference under the multispecies coalescent by obtaining the true network with fewer loci compared with haplotype consensus sequences or sequences with heterozygous bases represented as ambiguity codes. Phased allelic data can also improve divergence time estimates for networks, which is helpful for evaluating allopolyploid speciation hypotheses and proposing mechanisms of speciation. To achieve these outcomes in empirical data, we present a novel pipeline that leverages a recently developed phasing algorithm to reliably phase alleles from polyploids. This pipeline is especially appropriate for target enrichment data, where the depth of coverage is typically high enough to phase entire loci. We provide an empirical example in the North American Dryopteris fern complex that demonstrates insights from phased data as well as the challenges of network inference. We establish that our pipeline (PATÉ: Phased Alleles from Target Enrichment data) is capable of recovering a high proportion of phased loci from both diploids and polyploids. These data may improve network estimates compared with using haplotype consensus assemblies by accurately inferring the direction of gene flow, but statistical nonidentifiability of phylogenetic networks poses a barrier to inferring the evolutionary history of reticulate complexes.

准确重建多倍体的网状历史仍然是了解植物进化的核心挑战。虽然系统发育网络可以让人们深入了解多倍体系之间的关系,但推断网络可能会受到多倍体类群同源性测定复杂性的阻碍。我们通过模拟实验表明,与单倍型共识序列或以模糊代码表示杂合碱基的序列相比,从异源多倍体个体中分期等位基因可以用较少的位点获得真正的网络,从而改善多物种聚合下的系统发生网络推断。分阶段等位基因数据还能改善网络的分歧时间估计,这有助于评估全多倍体物种形成假说和提出物种形成机制。为了在实证数据中取得这些成果,我们提出了一种新的方法,利用最近开发的相位算法对来自多倍体的等位基因进行可靠的相位分析。该管道尤其适用于目标富集数据,因为目标富集数据的覆盖深度通常很高,足以对整个基因座进行分期。我们提供了一个北美蕨类植物干蕨复合体的经验实例,展示了分阶段数据的启示以及网络推断所面临的挑战。我们发现,我们的管道(PATÉ:从目标富集数据中分期等位基因)能够从二倍体和多倍体中恢复很高比例的分期基因座。与使用单倍型共识组装相比,这些数据可以通过准确推断基因流的方向来改进网络估计,但系统发生网络的统计不可识别性对推断网状复合体的进化历史构成了障碍。
{"title":"Benefits and Limits of Phasing Alleles for Network Inference of Allopolyploid Complexes.","authors":"George P Tiley, Andrew A Crowl, Paul S Manos, Emily B Sessa, Claudia Solís-Lemus, Anne D Yoder, J Gordon Burleigh","doi":"10.1093/sysbio/syae024","DOIUrl":"10.1093/sysbio/syae024","url":null,"abstract":"<p><p>Accurately reconstructing the reticulate histories of polyploids remains a central challenge for understanding plant evolution. Although phylogenetic networks can provide insights into relationships among polyploid lineages, inferring networks may be hindered by the complexities of homology determination in polyploid taxa. We use simulations to show that phasing alleles from allopolyploid individuals can improve phylogenetic network inference under the multispecies coalescent by obtaining the true network with fewer loci compared with haplotype consensus sequences or sequences with heterozygous bases represented as ambiguity codes. Phased allelic data can also improve divergence time estimates for networks, which is helpful for evaluating allopolyploid speciation hypotheses and proposing mechanisms of speciation. To achieve these outcomes in empirical data, we present a novel pipeline that leverages a recently developed phasing algorithm to reliably phase alleles from polyploids. This pipeline is especially appropriate for target enrichment data, where the depth of coverage is typically high enough to phase entire loci. We provide an empirical example in the North American Dryopteris fern complex that demonstrates insights from phased data as well as the challenges of network inference. We establish that our pipeline (PATÉ: Phased Alleles from Target Enrichment data) is capable of recovering a high proportion of phased loci from both diploids and polyploids. These data may improve network estimates compared with using haplotype consensus assemblies by accurately inferring the direction of gene flow, but statistical nonidentifiability of phylogenetic networks poses a barrier to inferring the evolutionary history of reticulate complexes.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"666-682"},"PeriodicalIF":6.1,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140908806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sequential Bayesian Phylogenetic Inference. 序列贝叶斯系统发育推论
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-10-25 DOI: 10.1093/sysbio/syae020
Sebastian Höhna, Allison Y Hsiang

The ideal approach to Bayesian phylogenetic inference is to estimate all parameters of interest jointly in a single hierarchical model. However, this is often not feasible in practice due to the high computational cost. Instead, phylogenetic pipelines generally consist of sequential analyses, whereby a single point estimate from a given analysis is used as input for the next analysis (e.g., a single multiple sequence alignment is used to estimate a gene tree). In this framework, uncertainty is not propagated from step to step, which can lead to inaccurate or spuriously confident results. Here, we formally develop and test a sequential inference approach for Bayesian phylogenetic inference, which uses importance sampling to generate observations for the next step of an analysis pipeline from the posterior distribution produced in the previous step. Our sequential inference approach presented here not only accounts for uncertainty between analysis steps but also allows for greater flexibility in software choice (and hence model availability) and can be computationally more efficient than the traditional joint inference approach when multiple models are being tested. We show that our sequential inference approach is identical in practice to the joint inference approach only if sufficient information in the data is present (a narrow posterior distribution) and/or sufficiently many important samples are used. Conversely, we show that the common practice of using a single point estimate can be biased, for example, a single phylogeny estimate can transform an unrooted phylogeny into a time-calibrated phylogeny. We demonstrate the theory of sequential Bayesian inference using both a toy example and an empirical case study of divergence-time estimation in insects using a relaxed clock model from transcriptome data. In the empirical example, we estimate 3 posterior distributions of branch lengths from the same data (DNA character matrix with a GTR+Γ+I substitution model, an amino acid data matrix with empirical substitution models, and an amino acid data matrix with the PhyloBayes CAT-GTR model). Finally, we apply 3 different node-calibration strategies and show that divergence time estimates are affected by both the data source and underlying substitution process to estimate branch lengths as well as the node-calibration strategies. Thus, our new sequential Bayesian phylogenetic inference provides the opportunity to efficiently test different approaches for divergence time estimation, including branch-length estimation from other software.

贝叶斯系统发育推断的理想方法是在单一分层模型中联合估计所有相关参数。然而,由于计算成本较高,这在实践中往往并不可行。取而代之的是,系统发育管道一般由连续分析组成,即把给定分析中的单点估计值作为下一步分析的输入(例如,用单个多序列比对来估计基因树)。在这个框架中,不确定性不会从一个步骤传播到另一个步骤,这可能导致不准确或虚假的可信结果。在这里,我们正式开发并测试了一种贝叶斯系统发育推断的顺序推断方法,该方法使用重要性采样从上一步产生的后验分布中为下一步分析流水线生成观测值。我们在此介绍的顺序推断方法不仅考虑了分析步骤之间的不确定性,而且在软件选择(从而模型可用性)方面具有更大的灵活性,并且在测试多个模型时比传统的联合推断方法计算效率更高。我们的研究表明,只有当数据中存在足够的信息(窄后验分布)和/或使用了足够多的重要性样本时,我们的顺序推断方法在实践中才与联合推断方法相同。相反,我们证明了使用单点估计的常见做法可能存在偏差,例如,使用单个系统发育估计将未根系统发育转化为时间校准系统发育。我们通过一个玩具示例和一个实证案例研究证明了序列贝叶斯推断理论,即利用转录组数据中的松弛时钟模型对昆虫的分化时间进行估计。在经验示例中,我们从相同的数据(采用 GTR+Γ+I 替代模型的 DNA 特征矩阵、采用经验替代模型的氨基酸数据矩阵和采用 PhyloBayes CAT-GTR 模型的氨基酸数据矩阵)中估计了三个分支长度的后验分布。最后,我们应用了三种不同的节点校准策略,结果表明分歧时间估计值既受数据源和基础替代过程的影响,也受估计分支长度的节点校准策略的影响。因此,我们新的序列贝叶斯系统发育推断方法为有效测试不同的分歧时间估计方法(包括其他软件的分支长度估计方法)提供了机会。
{"title":"Sequential Bayesian Phylogenetic Inference.","authors":"Sebastian Höhna, Allison Y Hsiang","doi":"10.1093/sysbio/syae020","DOIUrl":"10.1093/sysbio/syae020","url":null,"abstract":"<p><p>The ideal approach to Bayesian phylogenetic inference is to estimate all parameters of interest jointly in a single hierarchical model. However, this is often not feasible in practice due to the high computational cost. Instead, phylogenetic pipelines generally consist of sequential analyses, whereby a single point estimate from a given analysis is used as input for the next analysis (e.g., a single multiple sequence alignment is used to estimate a gene tree). In this framework, uncertainty is not propagated from step to step, which can lead to inaccurate or spuriously confident results. Here, we formally develop and test a sequential inference approach for Bayesian phylogenetic inference, which uses importance sampling to generate observations for the next step of an analysis pipeline from the posterior distribution produced in the previous step. Our sequential inference approach presented here not only accounts for uncertainty between analysis steps but also allows for greater flexibility in software choice (and hence model availability) and can be computationally more efficient than the traditional joint inference approach when multiple models are being tested. We show that our sequential inference approach is identical in practice to the joint inference approach only if sufficient information in the data is present (a narrow posterior distribution) and/or sufficiently many important samples are used. Conversely, we show that the common practice of using a single point estimate can be biased, for example, a single phylogeny estimate can transform an unrooted phylogeny into a time-calibrated phylogeny. We demonstrate the theory of sequential Bayesian inference using both a toy example and an empirical case study of divergence-time estimation in insects using a relaxed clock model from transcriptome data. In the empirical example, we estimate 3 posterior distributions of branch lengths from the same data (DNA character matrix with a GTR+Γ+I substitution model, an amino acid data matrix with empirical substitution models, and an amino acid data matrix with the PhyloBayes CAT-GTR model). Finally, we apply 3 different node-calibration strategies and show that divergence time estimates are affected by both the data source and underlying substitution process to estimate branch lengths as well as the node-calibration strategies. Thus, our new sequential Bayesian phylogenetic inference provides the opportunity to efficiently test different approaches for divergence time estimation, including branch-length estimation from other software.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"704-721"},"PeriodicalIF":6.1,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141071866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Fossilised Birth-Death Model is Identifiable. 化石化的出生-死亡模型是可识别的。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-10-22 DOI: 10.1093/sysbio/syae058
Kate Truman, Timothy G Vaughan, Alex Gavryushkin, Alexandra Sasha Gavryushkina

Time-dependent birth-death sampling models have been used in numerous studies for inferring past evolutionary dynamics in different biological contexts, e.g. speciation and extinction rates in macroevolutionary studies, or effective reproductive number in epidemiological studies. These models are branching processes where lineages can bifurcate, die, or be sampled with time-dependent birth, death, and sampling rates, generating phylogenetic trees. It has been shown that in some subclasses of such models, different sets of rates can result in the same distributions of reconstructed phylogenetic trees, and therefore the rates become unidentifiable from the trees regardless of their size. Here we show that widely used time-dependent fossilised birth-death (FBD) models are identifiable. This subclass of models makes more realistic assumptions about the fossilisation process and certain infectious disease transmission processes than the unidentifiable birth-death sampling models. Namely, FBD models assume that sampled lineages stay in the process rather than being immediately removed upon sampling. Identifiability of the time-dependent FBD model justifies using statistical methods that implement this model to infer the underlying temporal diversification or epidemiological dynamics from phylogenetic trees or directly from molecular or other comparative data. We further show that the time-dependent fossilised-birth-death model with an extra parameter, the removal after sampling probability, is unidentifiable. This implies that in scenarios where we do not know how sampling affects lineages we are unable to infer this extra parameter together with birth, death, and sampling rates solely from trees.

依赖时间的出生-死亡采样模型已被大量研究用于推断不同生物背景下过去的进化动态,例如宏观进化研究中的物种分化和灭绝率,或流行病学研究中的有效繁殖数量。这些模型是一个分支过程,在这个过程中,世系可以分叉、死亡,或以随时间变化的出生率、死亡率和采样率进行采样,从而生成系统进化树。有研究表明,在这类模型的某些子类中,不同的速率集会导致重建的系统发生树分布相同,因此,无论系统发生树的大小如何,速率都无法从系统发生树中识别出来。在这里,我们证明了广泛使用的时间依赖化石出生-死亡(FBD)模型是可识别的。与无法识别的出生-死亡抽样模型相比,这一子类模型对化石化过程和某些传染病的传播过程做出了更现实的假设。也就是说,FBD 模型假定采样的世系会留在化石过程中,而不是在采样后立即消失。时间依赖性 FBD 模型的可识别性使我们有理由使用实现该模型的统计方法,从系统发生树或直接从分子或其他比较数据中推断潜在的时间多样化或流行病学动态。我们进一步证明,具有额外参数(取样后移除概率)的时间依赖性化石出生-死亡模型是不可识别的。这意味着,在我们不知道采样如何影响世系的情况下,我们无法仅从树中推断出这个额外参数以及出生率、死亡率和采样率。
{"title":"The Fossilised Birth-Death Model is Identifiable.","authors":"Kate Truman, Timothy G Vaughan, Alex Gavryushkin, Alexandra Sasha Gavryushkina","doi":"10.1093/sysbio/syae058","DOIUrl":"10.1093/sysbio/syae058","url":null,"abstract":"<p><p>Time-dependent birth-death sampling models have been used in numerous studies for inferring past evolutionary dynamics in different biological contexts, e.g. speciation and extinction rates in macroevolutionary studies, or effective reproductive number in epidemiological studies. These models are branching processes where lineages can bifurcate, die, or be sampled with time-dependent birth, death, and sampling rates, generating phylogenetic trees. It has been shown that in some subclasses of such models, different sets of rates can result in the same distributions of reconstructed phylogenetic trees, and therefore the rates become unidentifiable from the trees regardless of their size. Here we show that widely used time-dependent fossilised birth-death (FBD) models are identifiable. This subclass of models makes more realistic assumptions about the fossilisation process and certain infectious disease transmission processes than the unidentifiable birth-death sampling models. Namely, FBD models assume that sampled lineages stay in the process rather than being immediately removed upon sampling. Identifiability of the time-dependent FBD model justifies using statistical methods that implement this model to infer the underlying temporal diversification or epidemiological dynamics from phylogenetic trees or directly from molecular or other comparative data. We further show that the time-dependent fossilised-birth-death model with an extra parameter, the removal after sampling probability, is unidentifiable. This implies that in scenarios where we do not know how sampling affects lineages we are unable to infer this extra parameter together with birth, death, and sampling rates solely from trees.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":""},"PeriodicalIF":6.1,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142475252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Complex Models of Sequence Evolution Improve Fit, but not Gene Tree Discordance, for Tetrapod Mitogenomes. 序列进化的复杂模型改善了四足有丝分裂基因组的拟合度,但没有改善基因树的不一致性。
IF 6.1 1区 生物学 Q1 EVOLUTIONARY BIOLOGY Pub Date : 2024-10-11 DOI: 10.1093/sysbio/syae056
Benjamin S Toups, Robert C Thomson, Jeremy M Brown

Variation in gene tree estimates is widely observed in empirical phylogenomic data and is often assumed to be the result of biological processes. However, a recent study using tetrapod mitochondrial genomes to control for biological sources of variation due to their haploid, uniparentally inherited, and non-recombining nature found that levels of discordance among mitochondrial gene trees were comparable to those found in studies that assume only biological sources of variation. Additionally, they found that several of the models of sequence evolution chosen to infer gene trees were doing an inadequate job fitting the sequence data. These results indicated that significant amounts of gene tree discordance in empirical data may be due to poor fit of sequence evolution models, and that more complex and biologically realistic models may be needed. To test how the fit of sequence evolution models relates to gene tree discordance, we analyzed the same mitochondrial datasets as the previous study using two additional, more complex models of sequence evolution that each includes a different biologically realistic aspect of the evolutionary process: a covarion model to incorporate site-specific rate variation across lineages (heterotachy), and a partitioned model to incorporate variable evolutionary patterns by codon position. Our results show that both additional models fit the data better than the models used in the previous study, with the covarion being consistently and strongly preferred as tree size increases. However, even these more preferred models still inferred highly discordant mitochondrial gene trees, thus deepening the mystery around what we label the "Mito-Phylo Paradox" and leading us to ask whether the observed variation could, in fact, be biological in nature after all.

在经验性的系统发生组数据中,基因树估计值的变异被广泛观察到,并且通常被认为是生物过程的结果。然而,最近一项使用四足动物线粒体基因组来控制生物变异来源的研究发现,线粒体基因树之间的不一致程度与仅假定生物变异来源的研究中发现的不一致程度相当。此外,他们还发现,用于推断基因树的几个序列进化模型与序列数据的拟合程度不够。这些结果表明,经验数据中存在的大量基因树不一致现象可能是由于序列进化模型的拟合效果不佳造成的,因此可能需要更复杂、更符合生物学实际的模型。为了检验序列进化模型的拟合度与基因树不一致性之间的关系,我们使用了两个额外的、更复杂的序列进化模型来分析与前一项研究相同的线粒体数据集,这两个模型分别包含了进化过程中不同的生物学现实方面:一个是科瓦里翁模型(covarion model),它包含了不同世系中特定位点的速率变化(heterotachy);另一个是分区模型(partitioned model),它包含了不同密码子位置的进化模式。我们的研究结果表明,这两个额外的模型都比之前研究中使用的模型更适合数据,随着树规模的增加,共线性模型一直是首选。然而,即使是这些更受青睐的模型,仍然推断出了高度不和谐的线粒体基因树,从而加深了我们所称的 "Mito-Phylo 悖论 "的神秘性,并使我们提出了这样一个问题:所观察到的变异实际上是否具有生物学性质?
{"title":"Complex Models of Sequence Evolution Improve Fit, but not Gene Tree Discordance, for Tetrapod Mitogenomes.","authors":"Benjamin S Toups, Robert C Thomson, Jeremy M Brown","doi":"10.1093/sysbio/syae056","DOIUrl":"https://doi.org/10.1093/sysbio/syae056","url":null,"abstract":"<p><p>Variation in gene tree estimates is widely observed in empirical phylogenomic data and is often assumed to be the result of biological processes. However, a recent study using tetrapod mitochondrial genomes to control for biological sources of variation due to their haploid, uniparentally inherited, and non-recombining nature found that levels of discordance among mitochondrial gene trees were comparable to those found in studies that assume only biological sources of variation. Additionally, they found that several of the models of sequence evolution chosen to infer gene trees were doing an inadequate job fitting the sequence data. These results indicated that significant amounts of gene tree discordance in empirical data may be due to poor fit of sequence evolution models, and that more complex and biologically realistic models may be needed. To test how the fit of sequence evolution models relates to gene tree discordance, we analyzed the same mitochondrial datasets as the previous study using two additional, more complex models of sequence evolution that each includes a different biologically realistic aspect of the evolutionary process: a covarion model to incorporate site-specific rate variation across lineages (heterotachy), and a partitioned model to incorporate variable evolutionary patterns by codon position. Our results show that both additional models fit the data better than the models used in the previous study, with the covarion being consistently and strongly preferred as tree size increases. However, even these more preferred models still inferred highly discordant mitochondrial gene trees, thus deepening the mystery around what we label the \"Mito-Phylo Paradox\" and leading us to ask whether the observed variation could, in fact, be biological in nature after all.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":""},"PeriodicalIF":6.1,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142406814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Systematic Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1