Systematic Biology: Latest Articles

Bayesian Inference Under the Multispecies Coalescent with Ancient DNA Sequences.
IF 6.1 | Zone 1 (Biology) | Q1 Evolutionary Biology | Pub Date: 2024-11-29 | DOI: 10.1093/sysbio/syae047
Anna A Nagel, Tomáš Flouri, Ziheng Yang, Bruce Rannala

Ancient DNA (aDNA) is increasingly being used to investigate questions such as the phylogenetic relationships and divergence times of extant and extinct species. If aDNA samples are sufficiently old, expected branch lengths (in units of nucleotide substitutions) are reduced relative to contemporary samples. This can be accounted for by incorporating sample ages into phylogenetic analyses. Existing methods that use tip (sample) dates infer gene trees rather than species trees, which can lead to incorrect or biased inferences of the species tree. Methods using a multispecies coalescent (MSC) model overcome these issues. We developed an MSC model with tip dates and implemented it in the program BPP. The method performed well for a range of biologically realistic scenarios, estimating calibrated divergence times and mutation rates precisely. Simulations suggest that estimation precision can be best improved by prioritizing sampling of many loci and more ancient samples. Incorrectly treating ancient samples as contemporary in analyzing simulated data, mimicking a common practice of empirical analyses, led to large systematic biases in model parameters, including divergence times. Two genomic datasets of mammoths and elephants were analyzed, demonstrating the method's empirical utility.
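
The abstract's point that sufficiently old samples have shorter expected branch lengths can be illustrated with a minimal sketch. It assumes a strict clock with a constant rate; the function name, the rate, and the ages are hypothetical illustration values, not taken from the paper or from BPP.

```python
def expected_branch_length(rate, divergence_time, sample_age=0.0):
    """Expected substitutions per site from a divergence event to a sampled tip.

    Assumes a strict clock (constant rate, hypothetical simplification):
    the lineage only accumulates substitutions between the divergence
    and the time the sample died, i.e. for (divergence_time - sample_age).
    """
    if sample_age < 0 or sample_age > divergence_time:
        raise ValueError("sample_age must lie between 0 and divergence_time")
    return rate * (divergence_time - sample_age)


# Hypothetical values: 1e-9 substitutions/site/year, divergence 1 Myr ago.
modern = expected_branch_length(1e-9, 1_000_000)
ancient = expected_branch_length(1e-9, 1_000_000, sample_age=50_000)

# The ancient tip's branch is shorter by rate * sample_age; treating it as
# contemporary would misattribute the missing substitutions to other parameters.
print(modern, ancient, modern - ancient)
```

Under these assumed numbers the ancient sample's branch is about 5% shorter, which is the kind of discrepancy that tip dates are meant to absorb.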

Citations: 0
A Double-edged Sword: Evolutionary Novelty along Deep-time Diversity Oscillation in An Iconic Group of Predatory Insects (Neuroptera: Mantispoidea)
IF 6.5 | Zone 1 (Biology) | Q1 Evolutionary Biology | Pub Date: 2024-11-23 | DOI: 10.1093/sysbio/syae068
Hongyu Li, De Zhuo, Bo Wang, Hiroshi Nakamine, Shûhei Yamamoto, Weiwei Zhang, James E Jepson, Michael Ohl, Ulrike Aspöck, Horst Aspöck, Thet Tin Nyunt, Michael S Engel, Michael J Benton, Philip Donoghue, Xingyue Liu
Evolutionary novelties are commonly identified as drivers of lineage diversification, with key innovations potentially triggering adaptive radiation. Nevertheless, testing hypotheses on the role of evolutionary novelties in promoting diversification through deep time has proven challenging. Here we unravel the role of the raptorial appendages, with evolutionary novelties for predation, in the macroevolution of a predatory insect lineage, the Superfamily Mantispoidea (mantidflies, beaded lacewings, thorny lacewings, and dipteromantispids), based on a new dated phylogeny and quantitative evolutionary analyses on modern and fossil species. We demonstrate a single origin of the raptorial foreleg and its associated novelties as key innovations triggering an early radiation of raptorial mantispoids from the Late Triassic to the Early Jurassic. Subsequently, the evolution of the raptorial foreleg influenced the diversification in different modes among lineages. At times, it might have limited the morphological diversity of other body parts and led to lineage constraint by intensifying competition and lowering environmental resilience, e.g., in thorny lacewings, whose extant diversity is meagre. Conversely, in mantidflies, reduced emphasis on foreleg novelties and increased plasticity in other body parts may lead to better adaptation to predator-prey interactions and environmental shifts, thus maintaining a stable or accelerated level of diversification. We also reveal how major environmental change and lineage interactions interplayed with raptorial novelties in shaping the significant oscillations of mantispoid diversification over deep time, especially the abrupt shift near the mid-Cretaceous. However, by excluding a substantial portion of samples from the mid-Cretaceous of Myanmar, these shifts of some evolutionary parameters, such as morphological disparity, body size, and diversification rates, became inconspicuous and might be overestimated due to sampling bias. 
Our results uncover the intricate evolutionary patterns and profound significance of raptorial specializations, providing new insights into the role of novelties in forming evolutionary trajectories, both for the better and worse. [evolutionary novelty; macroevolution; diversification; raptorial foreleg; fossil; insect; Mantispoidea]
Citations: 0
Are Modern Cryptic Species Detectable in the Fossil Record? A Case Study on Agamid Lizards.
IF 6.1 | Zone 1 (Biology) | Q1 Evolutionary Biology | Pub Date: 2024-11-22 | DOI: 10.1093/sysbio/syae067
Till Ramm, Jaimi A Gray, Christy A Hipsley, Scott Hocknull, Jane Melville, Johannes Müller

Comparisons of extant and extinct biodiversity are often dependent on objective morphology-based identifications of fossils and assume a well-established and comparable taxonomy for both fossil and modern taxa. However, since many modern (cryptic) species are delimitated mainly via external morphology and / or molecular data, it is often unclear to what degree fossilized (osteological) remains allow classification to a similar level. When intraspecific morphological variation in extant taxa is poorly known, the definition of extinct species as well as the referral of fossils to extant species can be heavily biased, particularly if fossils are represented by incomplete isolated skeletal elements. This problem is especially pronounced in squamates (lizards and snakes) owing to a lack of osteological comparative knowledge for many lower taxonomic groups, concomitant with a recent increase of molecular studies revealing great cryptic diversity. Here, we apply a quantitative approach using 3D geometric morphometrics on 238 individuals of 14 genera of extant Australian and Papua New Guinean agamid lizards to test the value of two isolated skull bones (frontals and maxillae) for inferring taxonomic and ecological affinities. We further test for the consistency of intra- and interspecific morphological variability of these elements as a proxy for extinct taxonomic richness. We show that both bones are diagnostic at the generic level, and both can infer microhabitat and are of palaeoecological utility. However, species-level diversity is likely underestimated by both elements, with ~30-40% of species pairs showing no significant differences in shape. Mean intraspecific morphological variability is largely consistent across species and bones and thus a useful proxy for extinct species diversity. 
Reducing sample size and landmark completeness to approximate fossil specimens led to decreased classification accuracy and increased variance of morphological disparity, raising further doubts on the transferability of modern species borders to the fossil record of agamids. Our results highlight the need to establish appropriate levels of morphology-based taxonomic or ecological groupings prior to comparing extant and extinct biodiversity.

Citations: 0
Bayesian Selection of Relaxed-clock Models: Distinguishing Between Independent and Autocorrelated Rates.
IF 6.1 | Zone 1 (Biology) | Q1 Evolutionary Biology | Pub Date: 2024-11-21 | DOI: 10.1093/sysbio/syae066
Muthukumaran Panchaksaram, Lucas Freitas, Mario Dos Reis

In Bayesian molecular-clock dating of species divergences, rate models are used to construct the prior on the molecular evolutionary rates for branches in the phylogeny, with independent and autocorrelated rate models being commonly used. The two classes of models, however, can result in markedly different divergence time estimates for the same dataset, and thus selecting the best rate model appears important for obtaining reliable inferences of divergence times. However, the properties of Bayesian rate model selection are not well understood, in particular when the number of sequence partitions analysed increases and when age calibrations (such as fossil calibrations) are misspecified. Furthermore, Bayesian rate model selection is computationally expensive as it requires calculation of marginal likelihoods by MCMC sampling, and therefore methods that can speed up the model selection procedure without compromising its accuracy are desirable. In this study, we use a combination of computer simulations and real data analysis to investigate the statistical behaviour of Bayesian rate model selection and we also explore approximations of the likelihood to improve computational efficiency in large phylogenomic datasets. Our simulations demonstrate that the posterior probability for the correct rate model converges to one as more molecular sequence partitions are analysed and when no calibrations are used, as expected due to asymptotic Bayesian model selection theory. Furthermore, we also show the model selection procedure is robust to slight misspecification of calibrations, and reliable inference of the correct rate model is possible in this case. However, we show that when calibrations are seriously misspecified, calculated model probabilities are completely wrong and may converge to one for the wrong rate model. 
Finally, we demonstrate that approximating the phylogenetic likelihood under an arcsine branch-length transform can dramatically reduce the computational cost of rate model selection without compromising accuracy. We test the approximate procedure on two large phylogenies of primates (372 species) and flowering plants (644 species), replicating results obtained on smaller datasets using exact likelihood. Our findings and methodology can assist users in selecting the optimal rate model for estimating times and rates along the Tree of Life.
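
The step from marginal likelihoods to posterior model probabilities mentioned in the abstract can be sketched as follows. This is a generic illustration of Bayes' rule over a discrete set of models, not the authors' software; the function name and the log marginal likelihood values are hypothetical, and equal prior model probabilities are assumed unless others are supplied.

```python
import math


def posterior_model_probs(log_marginal_liks, priors=None):
    """Posterior model probabilities from log marginal likelihoods.

    Applies P(M_i | D) proportional to P(D | M_i) * P(M_i), computed on the
    log scale and shifted by the maximum to avoid floating-point underflow.
    """
    n = len(log_marginal_liks)
    if priors is None:
        priors = [1.0 / n] * n  # assumption: equal prior weight per model
    log_weights = [lml + math.log(p) for lml, p in zip(log_marginal_liks, priors)]
    shift = max(log_weights)
    weights = [math.exp(w - shift) for w in log_weights]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical log marginal likelihoods for two competing rate models,
# e.g. an independent-rates model and an autocorrelated-rates model.
probs = posterior_model_probs([-1000.0, -1003.0])
print(probs)
```

A difference of 3 log units gives the first model a posterior probability of roughly 0.95 under equal priors. Because log marginal likelihoods typically add across partitions, such differences tend to grow as more partitions are analysed, which is consistent with the convergence behaviour the abstract describes.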

Citations: 0
Testing relationships between multiple regional features and biogeographic processes of speciation, extinction, and dispersal
IF 6.5 | Zone 1 (Biology) | Q1 Evolutionary Biology | Pub Date: 2024-11-20 | DOI: 10.1093/sysbio/syae062
Sarah K Swiston, Michael J Landis
The spatial and environmental features of regions where clades are evolving are expected to impact biogeographic processes such as speciation, extinction, and dispersal. Any number of regional features (such as elevation, distance, area, etc.) may be directly or indirectly related to these processes. For example, it may be that distances or differences in elevation or both may limit dispersal rates. However, it is difficult to disentangle which features are most strongly related to rates of different processes. Here, we present an extensible Multi-feature Feature-Informed GeoSSE (MultiFIG) model that allows for the simultaneous investigation of any number of regional features. MultiFIG provides a conceptual framework for incorporating large numbers of features of different types, including categorical, quantitative, within-region, and between-region features, along with a mathematical framework for translating those features into biogeographic rates for statistical hypothesis testing. Using traditional Bayesian parameter estimation and reversible-jump Markov chain Monte Carlo, MultiFIG allows for the exploration of models with different numbers and combinations of feature-effect parameters, and generates estimates for the strengths of relationships between each regional feature and core process. We validate this model with a simulation study covering a range of scenarios with different numbers of regions, tree sizes, and feature values. We also demonstrate the application of MultiFIG with an empirical case study of the South American lizard genus Liolaemus, investigating sixteen regional features related to area, distance, and elevation. Our results show two important feature-process relationships: a negative distance/dispersal relationship, and a negative area/extinction relationship. 
Interestingly, although speciation rates were found to be higher in Andean versus non-Andean regions, the model did not assign significance to Andean- or elevation-related parameters. These results highlight the need to consider multiple regional features in biogeographic hypothesis testing.
Citations: 0
Robustness of Divergence Time Estimation Despite Gene Tree Estimation Error: A Case Study of Fireflies (Coleoptera: Lampyridae)
IF 6.5 | Zone 1 (Biology) | Q1 Evolutionary Biology | Pub Date: 2024-11-13 | DOI: 10.1093/sysbio/syae065
Sebastian Höhna, Sarah E Lower, Pablo Duchen, Ana Catalán
Genomic data have become ubiquitous in phylogenomic studies, including divergence time estimation, but provide new challenges. These challenges include, amongst others, biological gene tree discordance, methodological gene tree estimation error, and computational limitations on performing full Bayesian inference under complex models. In this study, we use a recently published firefly (Coleoptera: Lampyridae) anchored hybrid enrichment dataset (AHE; 436 loci for 88 Lampyridae species and 10 outgroup species) as a case study to explore gene tree estimation error and the robustness of divergence time estimation. First, we explored the amount of model violation using posterior predictive simulations because model violations are likely to bias phylogenetic inferences and produce gene tree estimation error. We specifically focused on missing data (either uniformly distributed or systematically) and the distribution of highly variable and conserved sites (either uniformly distributed or clustered). Our assessment of model adequacy showed that standard phylogenetic substitution models are not adequate for any of the 436 AHE loci. We tested whether the model violations and alignment errors indeed resulted in gene tree estimation error by comparing the observed gene tree discordance to simulated gene tree discordance under the multispecies coalescent model. Thus, we show that the inferred gene tree discordance is not only due to biological mechanisms but primarily due to inference errors. Lastly, we explored whether divergence time estimation is robust despite the observed gene tree estimation error. We selected four subsets of the full AHE dataset, concatenated each subset and performed a Bayesian relaxed clock divergence estimation in RevBayes. The estimated divergence times overlapped for all nodes that are shared between the topologies. Thus, divergence time estimation is robust using any well-selected data subset as long as the topology inference is robust.
Citations: 0
Evolution of Large Eyes in Stromboidea (Gastropoda): Impact of Photic Environment and Life History Traits.
IF 6.1 Zone 1 Biology Q1 EVOLUTIONARY BIOLOGY Pub Date: 2024-11-05 DOI: 10.1093/sysbio/syae063
Alison R Irwin, Nicholas W Roberts, Ellen E Strong, Yasunori Kano, Daniel I Speiser, Elizabeth M Harper, Suzanne T Williams

Eyes within the marine gastropod superfamily Stromboidea range widely in size, from 0.2 to 2.3 mm - the largest eyes known in any gastropod. Despite this interesting variation, the underlying evolutionary pressures remain unknown. Here, we use the wealth of material available in museum collections to explore the evolution of stromboid eye size and structure. Our results suggest that depth is a key light-limiting factor in stromboid eye evolution; here, increasing water depth is correlated with increasing aperture width relative to lens diameter, and therefore an increasing investment in sensitivity in dim light environments. In the major clade containing all large-eyed stromboid families, species observed active during the day and the night had wider eye apertures relative to lens sizes than species observed active during the day only, thereby prioritising sensitivity over resolution. Species with no consistent diel activity pattern also had smaller body sizes than exclusively day-active species, which may suggest that smaller animals are more vulnerable to shell-crushing predators, and avoid the higher predation pressure experienced by animals active during the day. Within the same major clade, ancestral state reconstruction suggests that absolute eye size increased above 1 mm twice. The unresolved position of Varicospira, however, weakens this hypothesis and further work with additional markers is needed to confirm this result.

Citations: 0
Topology Testing and Demographic Modeling Illuminate a Novel Speciation Pathway in the Greater Caribbean Sea Following the Formation of the Isthmus of Panama.
IF 6.1 Zone 1 Biology Q1 EVOLUTIONARY BIOLOGY Pub Date: 2024-10-30 DOI: 10.1093/sysbio/syae045
Benjamin M Titus, H Lisle Gibbs, Nuno Simões, Marymegan Daly

Recent genomic analyses have highlighted the prevalence of speciation with gene flow in many taxa and have underscored the importance of accounting for these reticulate evolutionary processes when constructing species trees and generating parameter estimates. This is especially important for deepening our understanding of speciation in the sea where fast-moving ocean currents, expanses of deep water, and periodic episodes of sea level rise and fall act as soft and temporary allopatric barriers that facilitate both divergence and secondary contact. Under these conditions, gene flow is not expected to cease completely while contemporary distributions are expected to differ from historical ones. Here, we conduct range-wide sampling for Pederson's cleaner shrimp (Ancylomenes pedersoni), a species complex from the Greater Caribbean that contains three clearly delimited mitochondrial lineages with both allopatric and sympatric distributions. Using mtDNA barcodes and a genomic ddRADseq approach, we combine classic phylogenetic analyses with extensive topology testing and demographic modeling (10 site frequency replicates × 45 evolutionary models × 50 model simulations/replicate = 22,500 simulations) to test species boundaries and reconstruct the evolutionary history of what was expected to be a simple case study. Instead, our results indicate a history of allopatric divergence, secondary contact, introgression, and endemic hybrid speciation that we hypothesize was driven by the final closure of the Isthmus of Panama and the strengthening of the Gulf Stream Current ~3.5 Ma. The history of this species complex recovered by model-based methods that allow reticulation differs from that recovered by standard phylogenetic analyses and is unexpected given contemporary distributions. The geologically and biologically meaningful insights gained by our model selection analyses illuminate what is likely a novel pathway of species formation not previously documented that resulted from one of the most biogeographically significant events in Earth's history.

Citations: 0
The Fundamental Role of Character Coding in Bayesian Morphological Phylogenetics.
IF 6.1 Zone 1 Biology Q1 EVOLUTIONARY BIOLOGY Pub Date: 2024-10-30 DOI: 10.1093/sysbio/syae033
Basanta Khakurel, Courtney Grigsby, Tyler D Tran, Juned Zariwala, Sebastian Höhna, April M Wright

Phylogenetic trees establish a historical context for the study of organismal form and function. Most phylogenetic trees are estimated using a model of evolution. For molecular data, modeling evolution is often based on biochemical observations about changes between character states. For example, there are 4 nucleotides, and we can make assumptions about the probability of transitions between them. By contrast, for morphological characters, we may not know a priori how many character states there are per character, as both extant sampling and the fossil record may be highly incomplete, which leads to an observer bias. For a given character, the state space may be larger than what has been observed in the sample of taxa collected by the researcher. In this case, how many evolutionary rates are needed to even describe transitions between morphological character states may not be clear, potentially leading to model misspecification. To explore the impact of this model misspecification, we simulated character data with varying numbers of character states per character. We then used the data to estimate phylogenetic trees using models of evolution with the correct number of character states and an incorrect number of character states. The results of this study indicate that this observer bias may lead to phylogenetic error, particularly in the branch lengths of trees. If the state space is wrongly assumed to be too large, then we underestimate the branch lengths, and the opposite occurs when the state space is wrongly assumed to be too small.

Citations: 0
Whole Genomes Reveal Evolutionary Relationships and Mechanisms Underlying Gene-Tree Discordance in Neodiprion Sawflies.
IF 6.1 Zone 1 Biology Q1 EVOLUTIONARY BIOLOGY Pub Date: 2024-10-30 DOI: 10.1093/sysbio/syae036
Danielle K Herrig, Ryan D Ridenbaugh, Kim L Vertacnik, Kathryn M Everson, Sheina B Sim, Scott M Geib, David W Weisrock, Catherine R Linnen

Rapidly evolving taxa are excellent models for understanding the mechanisms that give rise to biodiversity. However, developing an accurate historical framework for comparative analysis of such lineages remains a challenge due to ubiquitous incomplete lineage sorting (ILS) and introgression. Here, we use a whole-genome alignment, multiple locus-sampling strategies, and summary-tree and single nucleotide polymorphism-based species-tree methods to infer a species tree for eastern North American Neodiprion species, a clade of pine-feeding sawflies (Order: Hymenoptera; Family: Diprionidae). We recovered a well-supported species tree that, except for three uncertain relationships, was robust to different strategies for analyzing whole-genome data. Nevertheless, underlying gene-tree discordance was high. To understand this genealogical variation, we used multiple linear regression to model site concordance factors estimated in 50-kb windows as a function of several genomic predictor variables. We found that site concordance factors tended to be higher in regions of the genome with more parsimony-informative sites, fewer singletons, less missing data, lower GC content, more genes, lower recombination rates, and lower D-statistics (less introgression). Together, these results suggest that ILS, introgression, and genotyping error all shape the genomic landscape of gene-tree discordance in Neodiprion. More generally, our findings demonstrate how combining phylogenomic analysis with knowledge of local genomic features can reveal mechanisms that produce topological heterogeneity across genomes.

Citations: 0