Molecular biology and evolution最新文献_第3页

Phylodynamics of Somatic Evolution: A Likelihood-Based Approach for Cellular Reproduction. 体细胞进化的系统动力学：基于可能性的细胞繁殖方法。

IF 5.3 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular biology and evolution

Pub Date : 2026-02-02 DOI: 10.1093/molbev/msag002

Tobias Dieselhorst, Johannes Berg

Understanding the evolutionary dynamics of cell populations requires models that link observed phylogenetic patterns to the underlying processes of cell division, death, and mutation. Classical phylodynamic inference methods-developed primarily for macroevolutionary settings-assume that mutations accrue in calendar time and often rely on a molecular clock. Here, we introduce a framework that ties mutations to discrete birth (division) events. In this setting, mutations accumulate via a compound Poisson process, capturing both visible and hidden cell divisions within the reconstructed phylogenetic tree. We present a computationally efficient dynamic programming algorithm to compute the likelihood based on tree topologies with associated mutations, integrating over latent variables such as branch durations and unobserved cell divisions. Our method is applicable to large-scale single-cell datasets, and we demonstrate its utility on simulated data and on single-cell phylogenies of hematopoietic stem cells.

理解细胞群体的进化动力学需要将观察到的系统发育模式与细胞分裂、死亡和突变的潜在过程联系起来的模型。经典的系统动力学推断方法——主要是为宏观进化设置而开发的——假设突变在日历时间内累积，并且通常依赖于分子钟。在这里，我们引入一个框架，将突变与离散的出生（分裂）事件联系起来。在这种情况下，突变通过复合泊松过程积累，在重建的系统发育树中捕获可见和隐藏的细胞分裂。我们提出了一种计算效率高的动态规划算法，该算法基于具有相关突变的树拓扑计算似然，对潜在变量（如分支持续时间和未观察到的细胞分裂）进行积分。我们的方法适用于大规模的单细胞数据集，我们在模拟数据和造血干细胞的单细胞系统发育上证明了它的实用性。

引用次数: 0

TreeProfiler: Large-scale metadata profiling along gene and species trees. TreeProfiler：沿着基因和物种树进行大规模元数据分析。

IF 5.3 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular biology and evolution

Pub Date : 2026-02-02 DOI: 10.1093/molbev/msag028

Ziqi Deng, Claudia Sanchis-López, Ana Hernández-Plaza, Adrián A Davín, Jaime Huerta-Cepas

Profiling biological traits along gene or species tree topologies is a well-established approach in comparative genomics, widely employed to infer gene function from co-evolutionary patterns (phylogenetic profiling), reconstruct ancestral states, and uncover ecological associations. However, existing profiling tools are typically tailored to specific use cases, have limited scalability for large datasets, and lack robust methods to aggregate or summarize traits at internal tree nodes. Here, we present TreeProfiler, a tool for automated annotation and interactive exploration of hundreds of features along large gene and species trees, with seamless summarization of mapped traits at internal nodes. TreeProfiler supports the profiling of custom continuous and discrete traits, as well as ancestral character reconstruction and phylogenetic signal tests. It also integrates commonly used genomic features, including multiple sequence alignments, protein domain architectures, and functional annotations. We demonstrate TreeProfiler's utility beyond traditional phylogenetic profiling, as well as its ability to efficiently handle massive datasets, by analyzing the functional diversification of the methyl-accepting chemotaxis protein (MCP) family comprising over 400,000 genomic and metagenomic sequences, and by profiling the relative abundance of 124,295 bacterial and archaeal species across 51 biomes. TreeProfiler is open-source and freely available at https://github.com/compgenomicslab/TreeProfiler.

沿着基因或物种树拓扑分析生物学性状是比较基因组学中一种成熟的方法，广泛用于从共同进化模式（系统发育分析）推断基因功能，重建祖先状态，并揭示生态关联。然而，现有的分析工具通常是针对特定的用例量身定制的，对于大型数据集的可扩展性有限，并且缺乏在内部树节点上聚合或总结特征的健壮方法。在这里，我们介绍了TreeProfiler，这是一个自动注释和交互式探索大型基因和物种树的数百个特征的工具，可以在内部节点上无缝地总结映射的特征。TreeProfiler支持自定义连续和离散特征的分析，以及祖先特征重建和系统发育信号测试。它还集成了常用的基因组特征，包括多序列比对、蛋白质结构域结构和功能注释。通过分析包含超过400,000个基因组和宏基因组序列的甲基接受趋化蛋白（MCP）家族的功能多样化，以及通过分析51个生物群落中124,295种细菌和古细菌的相对丰度，我们展示了TreeProfiler超越传统系统发育分析的实用性，以及它有效处理大量数据集的能力。TreeProfiler是开源的，可以在https://github.com/compgenomicslab/TreeProfiler免费获得。

{"title":"TreeProfiler: Large-scale metadata profiling along gene and species trees.","authors":"Ziqi Deng, Claudia Sanchis-López, Ana Hernández-Plaza, Adrián A Davín, Jaime Huerta-Cepas","doi":"10.1093/molbev/msag028","DOIUrl":"https://doi.org/10.1093/molbev/msag028","url":null,"abstract":"Profiling biological traits along gene or species tree topologies is a well-established approach in comparative genomics, widely employed to infer gene function from co-evolutionary patterns (phylogenetic profiling), reconstruct ancestral states, and uncover ecological associations. However, existing profiling tools are typically tailored to specific use cases, have limited scalability for large datasets, and lack robust methods to aggregate or summarize traits at internal tree nodes. Here, we present TreeProfiler, a tool for automated annotation and interactive exploration of hundreds of features along large gene and species trees, with seamless summarization of mapped traits at internal nodes. TreeProfiler supports the profiling of custom continuous and discrete traits, as well as ancestral character reconstruction and phylogenetic signal tests. It also integrates commonly used genomic features, including multiple sequence alignments, protein domain architectures, and functional annotations. We demonstrate TreeProfiler's utility beyond traditional phylogenetic profiling, as well as its ability to efficiently handle massive datasets, by analyzing the functional diversification of the methyl-accepting chemotaxis protein (MCP) family comprising over 400,000 genomic and metagenomic sequences, and by profiling the relative abundance of 124,295 bacterial and archaeal species across 51 biomes. TreeProfiler is open-source and freely available at https://github.com/compgenomicslab/TreeProfiler.","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146100411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Germline de novo mutation rate of the highly heterozygous amphioxus genome. 文昌鱼高杂合基因组的种系从头突变率。

IF 5.3 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular biology and evolution

Pub Date : 2026-02-02 DOI: 10.1093/molbev/msag017

Jing Xue, Lei Tao, Junwei Cao, Liang Wu, Guang Li, Cai Li

Germline de novo mutations (DNMs) are the ultimate source of heritable variation, yet their patterns in highly heterozygous genomes remain poorly understood. Amphioxus, an early-branching chordate with exceptionally high genomic heterozygosity (3.2% to 4.2% in sequenced species), offers a unique model to explore mutational dynamics in such contexts. It is unclear whether high heterozygosity in amphioxus is due to a large effective population size, an increased mutation rate, or both. Here, we perform deep short-read whole-genome sequencing of a two-generation pedigree of the amphioxus Branchiostoma floridae comprising two parents and 104 offspring and develop a framework based on allele-aware parental assemblies as the reference to accurately identify DNMs. We detect 242 high-confidence DNMs, yielding a genome-wide mutation rate of 5.89 × 10-9 per base per generation, which is comparable to that of vertebrates. Combining this estimate with observed nucleotide diversity, we obtain an effective population size of ∼1.7 million, indicating that the elevated heterozygosity mainly results from a large effective population size. We observe no sex bias when considering all DNMs but a paternal-origin bias for early-occurring ones. Amphioxus harbors a much smaller fraction of CpG>TpG DNMs relative to vertebrates, attributable to its low methylation levels. We also investigate putative postzygotic mutations in the offspring, revealing an unexpected paternal-origin bias. These suggest some distinct mutational mechanisms in amphioxus. Our study not only provides the first DNM measurement for amphioxus but also offers a generalizable strategy for studying DNMs in highly heterozygous genomes, facilitating mutation rate studies across chordates and other lineages.

种系从头突变（dnm）是遗传变异的最终来源，但它们在高杂合基因组中的模式仍然知之甚少。文昌鱼是一种早期分支脊索动物，具有极高的基因组杂合性（测序物种中为3.2 ~ 4.2%），为探索这种背景下的突变动力学提供了独特的模型。文昌鱼的高杂合性是由于有效种群规模大，突变率增加，还是两者兼而有之，目前尚不清楚。本研究对佛罗里达文昌鱼（amphioxus Branchiostoma floridae）两代家谱进行了深度短读全基因组测序，包括两个亲本和104个后代，并建立了一个基于等位基因感知亲本序列的框架，作为准确识别dnm的参考。我们检测到242个高置信度的dnm，每代每个碱基的全基因组突变率为5.89 × 10-9，与脊椎动物相当。将这一估计值与观察到的核苷酸多样性相结合，我们得到了有效种群规模约为170万，这表明杂合性的升高主要是由于有效种群规模较大。在考虑所有的dnm时，我们没有观察到性别偏见，但对早期发生的dnm存在父系起源偏见。与脊椎动物相比，文昌鱼的CpG>TpG dnm的比例要小得多，这是由于其甲基化水平较低。我们还调查了假定的后代的合子后突变，揭示了意想不到的父系起源偏见。这表明文昌鱼有一些不同的突变机制。我们的研究不仅为文昌鱼提供了第一个DNM测量，而且为研究高杂合基因组中的DNM提供了一种通用策略，促进了脊索动物和其他谱系的突变率研究。

{"title":"Germline de novo mutation rate of the highly heterozygous amphioxus genome.","authors":"Jing Xue, Lei Tao, Junwei Cao, Liang Wu, Guang Li, Cai Li","doi":"10.1093/molbev/msag017","DOIUrl":"10.1093/molbev/msag017","url":null,"abstract":"Germline de novo mutations (DNMs) are the ultimate source of heritable variation, yet their patterns in highly heterozygous genomes remain poorly understood. Amphioxus, an early-branching chordate with exceptionally high genomic heterozygosity (3.2% to 4.2% in sequenced species), offers a unique model to explore mutational dynamics in such contexts. It is unclear whether high heterozygosity in amphioxus is due to a large effective population size, an increased mutation rate, or both. Here, we perform deep short-read whole-genome sequencing of a two-generation pedigree of the amphioxus Branchiostoma floridae comprising two parents and 104 offspring and develop a framework based on allele-aware parental assemblies as the reference to accurately identify DNMs. We detect 242 high-confidence DNMs, yielding a genome-wide mutation rate of 5.89 × 10-9 per base per generation, which is comparable to that of vertebrates. Combining this estimate with observed nucleotide diversity, we obtain an effective population size of ∼1.7 million, indicating that the elevated heterozygosity mainly results from a large effective population size. We observe no sex bias when considering all DNMs but a paternal-origin bias for early-occurring ones. Amphioxus harbors a much smaller fraction of CpG>TpG DNMs relative to vertebrates, attributable to its low methylation levels. We also investigate putative postzygotic mutations in the offspring, revealing an unexpected paternal-origin bias. These suggest some distinct mutational mechanisms in amphioxus. Our study not only provides the first DNM measurement for amphioxus but also offers a generalizable strategy for studying DNMs in highly heterozygous genomes, facilitating mutation rate studies across chordates and other lineages.","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12862219/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145966555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Full spatio-temporal analyses of migration and colonization in evolution-dense 3D mapping of cancer metastases provides new insights. 进化过程中迁移和定植的完整时空分析-癌症转移的密集三维映射提供了新的见解。

IF 5.3 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular biology and evolution

Pub Date : 2026-02-02 DOI: 10.1093/molbev/msag008

Qihang Chen, Senmao Li, Xianrui Wu, Qing Xu, Ranran Zhu, Yongsen Ruan, Ao Lan, Zihan Liu, Jiarui Weng, Yanjiang Zhao, Xiying Xu, Xinyue Qi, Jinhong Lai, Leyi Xiao, Ping Lan, Chung-I Wu, Bingjie Chen

The process of migration and colonization is important in evolution; for example, modern humans experienced multiple waves of migrations out of Africa. However, no data cover the spatio-temporal patterns sufficiently to be truly informative. Metastatic cancer provides a unique in vivo model to study these processes through rapid somatic evolution. Here, we apply the high-resolution sampling technique (Dense 3D Crypt-scale Sampling) to analyze hundreds of spatially mapped micro-samples from the primary colorectal cancer and liver metastases in two representative cases. This would be analogous to recording the "out-of-Africa" events in two repeats. Our results support that liver metastases arise from polyphyletic and polyclonal seeding events where multiple, genetically distinct clones colonize a new site together. Following colonization, these multi-clonal populations can evolve into distinct spatial architectures: segregated territories formed by cells with low motility, or highly intermixed patterns driven by high motility. The colonization (or seeding) process begins within the first third of the primary tumor's progression, creating a large number of widespread but clinically undetectable micrometastatic colonies. These findings support a model where metastatic competence is not an intrinsic trait of a single "winner" clone but an emergent property of multiple concurrent clones. Collectively, our work supports metastasis as a multi-stage process initiated early in tumor development, characterized by continuous polyclonal dissemination and the formation of spatially distinct clonal architectures. This general pattern may echo the ecology of migration and colonization in organismal evolution.

迁移和殖民化过程在进化中是重要的；例如，现代人类经历了多次走出非洲的迁徙浪潮。然而，没有任何数据足以涵盖时空格局，足以提供真正的信息。转移性癌症提供了一个独特的体内模型，通过快速体细胞进化来研究这些过程。在这里，我们应用高分辨率采样技术（Dense 3D Crypt-scale sampling）分析了来自两个代表性病例的原发性结直肠癌和肝转移瘤的数百个空间映射微样本。这类似于将“走出非洲”的事件重复记录两次。我们的研究结果支持肝转移发生于多系和多克隆播种事件，其中多个遗传上不同的克隆一起定殖到一个新的位点。在定植之后，这些多克隆种群可以进化成不同的空间结构：由低运动性细胞形成的隔离区域，或由高运动性驱动的高度混合模式。定植（或播种）过程开始于原发肿瘤进展的前三分之一，产生大量广泛但临床无法检测到的微转移菌落。这些发现支持了一个模型，即转移能力不是单个“赢家”克隆的内在特征，而是多个并发克隆的涌现特性。总的来说，我们的工作支持转移是一个多阶段的过程，始于肿瘤发展的早期，其特征是持续的多克隆传播和空间上不同克隆结构的形成。这种普遍模式可能与生物进化中的迁移和殖民化生态学相呼应。

{"title":"Full spatio-temporal analyses of migration and colonization in evolution-dense 3D mapping of cancer metastases provides new insights.","authors":"Qihang Chen, Senmao Li, Xianrui Wu, Qing Xu, Ranran Zhu, Yongsen Ruan, Ao Lan, Zihan Liu, Jiarui Weng, Yanjiang Zhao, Xiying Xu, Xinyue Qi, Jinhong Lai, Leyi Xiao, Ping Lan, Chung-I Wu, Bingjie Chen","doi":"10.1093/molbev/msag008","DOIUrl":"10.1093/molbev/msag008","url":null,"abstract":"The process of migration and colonization is important in evolution; for example, modern humans experienced multiple waves of migrations out of Africa. However, no data cover the spatio-temporal patterns sufficiently to be truly informative. Metastatic cancer provides a unique in vivo model to study these processes through rapid somatic evolution. Here, we apply the high-resolution sampling technique (Dense 3D Crypt-scale Sampling) to analyze hundreds of spatially mapped micro-samples from the primary colorectal cancer and liver metastases in two representative cases. This would be analogous to recording the \"out-of-Africa\" events in two repeats. Our results support that liver metastases arise from polyphyletic and polyclonal seeding events where multiple, genetically distinct clones colonize a new site together. Following colonization, these multi-clonal populations can evolve into distinct spatial architectures: segregated territories formed by cells with low motility, or highly intermixed patterns driven by high motility. The colonization (or seeding) process begins within the first third of the primary tumor's progression, creating a large number of widespread but clinically undetectable micrometastatic colonies. These findings support a model where metastatic competence is not an intrinsic trait of a single \"winner\" clone but an emergent property of multiple concurrent clones. Collectively, our work supports metastasis as a multi-stage process initiated early in tumor development, characterized by continuous polyclonal dissemination and the formation of spatially distinct clonal architectures. This general pattern may echo the ecology of migration and colonization in organismal evolution.","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12866917/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145952793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

PharaohFUN: phylogenomic analysis for plant protein history and function elucidation. PharaohFUN：植物蛋白历史和功能的系统基因组分析。

IF 5.3 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular biology and evolution

Pub Date : 2026-02-02 DOI: 10.1093/molbev/msag011

Marcos Ramos-González, Víctor Ramos-González, Emma Serrano-Pérez, Christina Arvanitidou, Jorge Hernández-García, Mercedes García-González, Francisco J Romero-Campero

Since DNA sequencing has become commonplace, the development of efficient methods and tools to explore gene sequences has become indispensable. In particular, despite photosynthetic eukaryotes constituting the largest percentage of terrestrial biomass, computational functional characterization of gene sequences in these organisms still predominantly relies on comparisons with Arabidopsis thaliana and other angiosperms. This paper introduces PharaohFUN, a web application designed for the evolutionary and functional analysis of protein sequences in photosynthetic eukaryotes, leveraging orthology relationships between them. PharaohFUN incorporates a homogeneous representative sampling of key species in this group, bridging clades that have traditionally been studied separately, thus establishing a comprehensive evolutionary framework to draw conclusions about sequence evolution and function. For this purpose, it incorporates modules for exploring gene tree evolutionary history, expansion and contraction events, ancestral states, domain identification, multiple sequence alignments, and diverse functional annotation. It also incorporates different search modes to facilitate its use and increase its reach within the community. Tests were performed on the whole transcription factor toolbox of A. thaliana and on CCA1 protein to assess its utility for both large-scale and fine-grained phylogenetic studies. These exemplify how PharaohFUN accurately traces the corresponding evolutionary histories of these proteins by unifying results for land plants, streptophyte and chlorophyte microalgae. Thus, PharaohFUN democratices access to these kinds of analyses in photosynthetic organisms for every user, independently of their prior training in bioinformatics.

由于DNA测序已经变得司空见惯，开发有效的方法和工具来探索基因序列变得必不可少。特别是，尽管光合真核生物占陆地生物量的最大比例，但这些生物基因序列的计算功能表征仍然主要依赖于与拟南芥和其他被子植物的比较。本文介绍PharaohFUN，这是一个web应用程序，用于光合真核生物蛋白质序列的进化和功能分析，利用它们之间的同源关系。PharaohFUN整合了该组关键物种的同质代表性样本，连接了传统上单独研究的分支，从而建立了全面的进化框架，以得出序列进化和功能的结论。为此，它包含了探索基因树进化史、扩展和收缩事件、祖先状态、区域识别、多序列比对和多种功能注释的模块。它还结合了不同的搜索模式，以方便其使用和扩大其在社区内的覆盖范围。对拟南芥的整个转录因子工具箱和CCA1蛋白进行了测试，以评估其在大规模和细粒度系统发育研究中的效用。这些例子说明PharaohFUN是如何通过统一陆地植物、链藻和绿藻微藻的结果，准确地追踪这些蛋白质的相应进化历史的。因此，PharaohFUN为每个用户提供了这种对光合生物的分析，而不依赖于他们之前在生物信息学方面的培训。

{"title":"PharaohFUN: phylogenomic analysis for plant protein history and function elucidation.","authors":"Marcos Ramos-González, Víctor Ramos-González, Emma Serrano-Pérez, Christina Arvanitidou, Jorge Hernández-García, Mercedes García-González, Francisco J Romero-Campero","doi":"10.1093/molbev/msag011","DOIUrl":"10.1093/molbev/msag011","url":null,"abstract":"Since DNA sequencing has become commonplace, the development of efficient methods and tools to explore gene sequences has become indispensable. In particular, despite photosynthetic eukaryotes constituting the largest percentage of terrestrial biomass, computational functional characterization of gene sequences in these organisms still predominantly relies on comparisons with Arabidopsis thaliana and other angiosperms. This paper introduces PharaohFUN, a web application designed for the evolutionary and functional analysis of protein sequences in photosynthetic eukaryotes, leveraging orthology relationships between them. PharaohFUN incorporates a homogeneous representative sampling of key species in this group, bridging clades that have traditionally been studied separately, thus establishing a comprehensive evolutionary framework to draw conclusions about sequence evolution and function. For this purpose, it incorporates modules for exploring gene tree evolutionary history, expansion and contraction events, ancestral states, domain identification, multiple sequence alignments, and diverse functional annotation. It also incorporates different search modes to facilitate its use and increase its reach within the community. Tests were performed on the whole transcription factor toolbox of A. thaliana and on CCA1 protein to assess its utility for both large-scale and fine-grained phylogenetic studies. These exemplify how PharaohFUN accurately traces the corresponding evolutionary histories of these proteins by unifying results for land plants, streptophyte and chlorophyte microalgae. Thus, PharaohFUN democratices access to these kinds of analyses in photosynthetic organisms for every user, independently of their prior training in bioinformatics.","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12866927/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146030207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Viral Simulation Reveals Overestimation Bias in Within-Host Phylodynamic Migration Rate Estimates Under Selection. 病毒模拟揭示了在选择下宿主内系统动力学迁移速率估计的高估偏差。

IF 5.3 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular biology and evolution

Pub Date : 2026-02-02 DOI: 10.1093/molbev/msag014

Nicolas Ochsner, Judith Bouman, Timothy Vaughan, Tanja Stadler, Sebastian Bonhoeffer, Roland Regoes

Phylodynamic methods are widely used to infer the population dynamics of viruses between and within hosts. For HIV-1, these methods have been used to estimate migration rates between different anatomical compartments within a host. These methods typically assume that the genomic regions used for reconstruction are evolving without selective pressure, even though other parts of the viral genome are known to experience strong selection. In this study, we investigate how selection affects phylodynamic migration rate estimates. To this end, we developed a novel agent-based simulation tool, virolution, to simulate the evolution of virus within two anatomical compartments of a host. Using this tool, we generated viral sequences and genealogies assuming both, neutral evolution and purifying selection that is concordant in both compartments. We found that, under the selection regime, migration rates are significantly overestimated with a stochastic mixture model and a structured coalescent model in the Bayesian inference framework BEAST2. Our results reveal that commonly used phylogeographic methods, which assume neutral evolution, can significantly bias migration rate estimates in selective regimes.

系统动力学方法被广泛用于推断宿主之间和宿主内部病毒的种群动态。对于HIV-1，这些方法已用于估计宿主内不同解剖区室之间的迁移率。这些方法通常假设用于重建的基因组区域在没有选择压力的情况下进化，即使已知病毒基因组的其他部分经历了强烈的选择。在这项研究中，我们研究了选择如何影响系统动力学迁移速率估计。为此，我们开发了一种新的基于代理的模拟工具，病毒进化，来模拟病毒在宿主的两个解剖区室中的进化。使用这个工具，我们生成了病毒序列和谱系，假设中性进化和纯化选择在两个室中是一致的。我们发现，在选择机制下，贝叶斯推理框架BEAST2中的随机混合模型和结构化凝聚模型明显高估了迁移率。我们的研究结果表明，通常使用的假设中性进化的系统地理学方法在选择性制度下可能会显着偏差迁移率估计。

{"title":"Viral Simulation Reveals Overestimation Bias in Within-Host Phylodynamic Migration Rate Estimates Under Selection.","authors":"Nicolas Ochsner, Judith Bouman, Timothy Vaughan, Tanja Stadler, Sebastian Bonhoeffer, Roland Regoes","doi":"10.1093/molbev/msag014","DOIUrl":"https://doi.org/10.1093/molbev/msag014","url":null,"abstract":"Phylodynamic methods are widely used to infer the population dynamics of viruses between and within hosts. For HIV-1, these methods have been used to estimate migration rates between different anatomical compartments within a host. These methods typically assume that the genomic regions used for reconstruction are evolving without selective pressure, even though other parts of the viral genome are known to experience strong selection. In this study, we investigate how selection affects phylodynamic migration rate estimates. To this end, we developed a novel agent-based simulation tool, virolution, to simulate the evolution of virus within two anatomical compartments of a host. Using this tool, we generated viral sequences and genealogies assuming both, neutral evolution and purifying selection that is concordant in both compartments. We found that, under the selection regime, migration rates are significantly overestimated with a stochastic mixture model and a structured coalescent model in the Bayesian inference framework BEAST2. Our results reveal that commonly used phylogeographic methods, which assume neutral evolution, can significantly bias migration rate estimates in selective regimes.","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146100580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Genomic consequences of domestication and the diversification of body coloration and morphology in ornamental medaka strains. 观赏藻科植物驯化的基因组结果及体色和形态的多样化。

IF 5.3 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular biology and evolution

Pub Date : 2026-01-30 DOI: 10.1093/molbev/msag021

Tetsuo Kon, Rui Tang, Koto Kon-Nanjo, Soma Tomihara, Soichiro Fushiki, Wakana Fujii, Mifuyu Sera, Yusuke Takehana, Hideki Noguchi, Atsushi Toyoda, Kiyoshi Naruse, Yoshihiro Omori

Ornamental medaka strains derived from wild Japanese medaka (Oryzias latipes species complex) are bred worldwide. Over 200 years of selective breeding have produced over 700 strains with a wide variety of phenotypes, including diverse body coloration, scales, eyeball morphology, and fin and body shapes. In this study, we first identified and described 34 phenotypes in ornamental medaka strains. To understand the genomic basis of this phenotypic diversity and the domestication process, we performed whole genome sequencing on 181 individuals of 86 ornamental Japanese medaka strains. Population genomic analyses revealed that modern ornamental medaka strains are genetically closer to the wild Southern Japan population of the Kansai-Setouchi regions, suggesting the origin of ornamental strains. In addition, the gene loci poc1a, tyr, nme2a, and gabrr2b have undergone selection during domestication. We performed GWAS analysis for 29 phenotypes observed in ornamental medaka strains and identified strong candidate genes for some phenotypes, including kcnq5a for hirenaga and swallow, bmp5 for deme, adcy5 for orochi, and kitlga for aurora, respectively. We found that loss of exon 8 of adcy5 caused melanism, a dark body color phenotype, in medaka, providing a molecular insight on this phenomenon in vertebrates and human inherent dyskinesia. In addition, we uncovered the predominant candidate peaks of GWAS, including a total of 3,328 genes associated with 26 phenotypes. Our findings highlight the potential of population genomics to explore genotype-phenotype correlations and the genomic basis of body coloration and morphogenesis in medaka.

观赏樱草系是由野生日本樱草（Oryzias latipes species complex）衍生而来。经过200多年的选择性育种，已经产生了700多种具有多种表型的菌株，包括各种身体颜色，鳞片，眼球形态，鳍和身体形状。在这项研究中，我们首次鉴定和描述了观赏medaka菌株的34种表型。为了了解这种表型多样性的基因组基础和驯化过程，我们对86个观赏日本medaka品系的181个个体进行了全基因组测序。群体基因组分析表明，现代观赏medaka菌株在遗传上更接近关西濑户内地区的野生日本南部种群，这表明观赏菌株的起源。此外，基因位点poc1a、tyr、nme2a和gabrr2b在驯化过程中也经历了选择。我们对观赏medaka菌株中观察到的29种表型进行了GWAS分析，发现了一些表型的强候选基因，包括hirenaga和swallow的kcnq5a， deme的bmp5， orochi的adcy5和aurora的kitlga。我们发现adcy5外显子8的缺失导致了medaka的黑化，一种深色体色表型，这为脊椎动物和人类固有运动障碍的这种现象提供了分子视角。此外，我们还发现了GWAS的主要候选峰，包括与26种表型相关的3328个基因。我们的研究结果突出了群体基因组学在探索medaka的基因型-表型相关性以及身体颜色和形态发生的基因组基础方面的潜力。

{"title":"Genomic consequences of domestication and the diversification of body coloration and morphology in ornamental medaka strains.","authors":"Tetsuo Kon, Rui Tang, Koto Kon-Nanjo, Soma Tomihara, Soichiro Fushiki, Wakana Fujii, Mifuyu Sera, Yusuke Takehana, Hideki Noguchi, Atsushi Toyoda, Kiyoshi Naruse, Yoshihiro Omori","doi":"10.1093/molbev/msag021","DOIUrl":"https://doi.org/10.1093/molbev/msag021","url":null,"abstract":"Ornamental medaka strains derived from wild Japanese medaka (Oryzias latipes species complex) are bred worldwide. Over 200 years of selective breeding have produced over 700 strains with a wide variety of phenotypes, including diverse body coloration, scales, eyeball morphology, and fin and body shapes. In this study, we first identified and described 34 phenotypes in ornamental medaka strains. To understand the genomic basis of this phenotypic diversity and the domestication process, we performed whole genome sequencing on 181 individuals of 86 ornamental Japanese medaka strains. Population genomic analyses revealed that modern ornamental medaka strains are genetically closer to the wild Southern Japan population of the Kansai-Setouchi regions, suggesting the origin of ornamental strains. In addition, the gene loci poc1a, tyr, nme2a, and gabrr2b have undergone selection during domestication. We performed GWAS analysis for 29 phenotypes observed in ornamental medaka strains and identified strong candidate genes for some phenotypes, including kcnq5a for hirenaga and swallow, bmp5 for deme, adcy5 for orochi, and kitlga for aurora, respectively. We found that loss of exon 8 of adcy5 caused melanism, a dark body color phenotype, in medaka, providing a molecular insight on this phenomenon in vertebrates and human inherent dyskinesia. In addition, we uncovered the predominant candidate peaks of GWAS, including a total of 3,328 genes associated with 26 phenotypes. Our findings highlight the potential of population genomics to explore genotype-phenotype correlations and the genomic basis of body coloration and morphogenesis in medaka.","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146086425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Leveraging Low-Cost Short-Read Sequencing: Revolutionizing Complex Trait Genetics. 利用低成本短读测序：彻底改变复杂性状遗传学。

IF 5.3 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular biology and evolution

Pub Date : 2026-01-28 DOI: 10.1093/molbev/msag025

Sarah N Ruckman, Anthony D Long

The genetics of complex traits has been fundamentally transformed by the dramatic reduction in short-read sequencing costs, leading to a dramatic reversal in the relative costs of genotyping versus phenotyping. We explore this new scientific landscape by examining key experimental strategies that leverage inexpensive sequencing, including low-coverage whole-genome sequencing with imputation (lcWGS+I) for genotyping large cohorts. Although somewhat limited in outbred populations, lcWGS+I can be extremely effective in multi-parent populations (MPPs) and in founder-unknown closed colonies, where imputation accuracy can exceed 98%. We further explore pooled-sequencing (Pool-seq) approaches for dissecting complex traits, such as Evolve and Resequence (E&R) for tracking adaptive changes in allele frequency over several generations, and Extreme QTL (X-QTL) mapping that identifies loci by contrasting pooled samples from phenotypic extremes. We show that X-QTL mapping in MPPs, by testing for shifts in founder haplotype frequencies across small genomic windows, can be extremely powerful and cost-effective. Finally, we discuss methods where sequencing reads serve as the phenotype itself. DNA barcoding enables massive-scale fitness assays, while the "*-seq" toolkit (e.g., RNA-seq, ATAC-seq) allows for mapping molecular QTLs, though this introduces a significant multiple testing burden. Systems leveraging certain breeding designs in concert with low cost sequencing can greatly accelerate progress towards a mechanistic understanding of the genotype-phenotype relationship.

由于短读测序成本的大幅降低，复杂性状的遗传学已经从根本上发生了变化，导致基因分型与表型分型的相对成本发生了戏剧性的逆转。我们通过研究利用廉价测序的关键实验策略来探索这一新的科学前景，包括用于大型队列基因分型的低覆盖率全基因组测序与代入（lcWGS+I）。尽管lcWGS+I在近亲繁殖种群中有一定的局限性，但它在多亲本种群（mpp）和创始人未知的封闭种群中非常有效，其代入精度可超过98%。我们进一步探索了用于解剖复杂性状的混合测序（Pool-seq）方法，例如用于追踪几代等位基因频率适应性变化的进化和重测序（E&R）方法，以及通过对比来自极端表型的混合样本来识别位点的极端QTL （X-QTL）定位方法。我们表明，通过在小基因组窗口中测试创始人单倍型频率的变化，在mpp中进行X-QTL定位可以非常强大且具有成本效益。最后，我们讨论了测序读数作为表型本身的方法。DNA条形码可以实现大规模的适应度分析，而“*-seq”工具包（例如RNA-seq， ATAC-seq）允许绘制分子qtl，尽管这会带来重大的多重测试负担。利用某些育种设计与低成本测序相结合的系统可以大大加快对基因型-表型关系的机制理解的进展。

{"title":"Leveraging Low-Cost Short-Read Sequencing: Revolutionizing Complex Trait Genetics.","authors":"Sarah N Ruckman, Anthony D Long","doi":"10.1093/molbev/msag025","DOIUrl":"https://doi.org/10.1093/molbev/msag025","url":null,"abstract":"The genetics of complex traits has been fundamentally transformed by the dramatic reduction in short-read sequencing costs, leading to a dramatic reversal in the relative costs of genotyping versus phenotyping. We explore this new scientific landscape by examining key experimental strategies that leverage inexpensive sequencing, including low-coverage whole-genome sequencing with imputation (lcWGS+I) for genotyping large cohorts. Although somewhat limited in outbred populations, lcWGS+I can be extremely effective in multi-parent populations (MPPs) and in founder-unknown closed colonies, where imputation accuracy can exceed 98%. We further explore pooled-sequencing (Pool-seq) approaches for dissecting complex traits, such as Evolve and Resequence (E&R) for tracking adaptive changes in allele frequency over several generations, and Extreme QTL (X-QTL) mapping that identifies loci by contrasting pooled samples from phenotypic extremes. We show that X-QTL mapping in MPPs, by testing for shifts in founder haplotype frequencies across small genomic windows, can be extremely powerful and cost-effective. Finally, we discuss methods where sequencing reads serve as the phenotype itself. DNA barcoding enables massive-scale fitness assays, while the \"*-seq\" toolkit (e.g., RNA-seq, ATAC-seq) allows for mapping molecular QTLs, though this introduces a significant multiple testing burden. Systems leveraging certain breeding designs in concert with low cost sequencing can greatly accelerate progress towards a mechanistic understanding of the genotype-phenotype relationship.","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146064862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Evolutionary Genomics of Meiotic Drive. 减数分裂驱动的进化基因组学。

IF 5.3 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular biology and evolution

Pub Date : 2026-01-23 DOI: 10.1093/molbev/msag020

Daven C Presgraves, R Kelly Dawe, Kelly A Dyer, Lila Fishman, Soumitra A Bhide, Sasha L Bradshaw, Meghan J Brady, Alejandro Burga, Cécile Courret, Brandon L Fagen, Ana Beatriz Stein Machado Ferretti, Reka K Kelemen, Jun Kitano, Yiran Liu, Emiliano Martí, Theresa Erlenbach, Josephine A Reinhardt, Laura Ross, Jan-Niklas Runge, Callie M Swanepoel, Beatriz Vicoso, Aaron A Vogan, Anna K Lindholm, Amanda M Larracuente, Robert L Unckless

Meiotic drivers are selfish genetic elements that gain transmission advantages by distorting equal, Mendelian segregation. For decades, biologists have considered meiotic drivers as interesting, albeit esoteric, case studies. It is now clear, however, that meiotic drive is more common and phylogenetically widespread than previously supposed. Indeed, intensive study of a few well-known cases has begun to reveal the evolutionary genomic consequences of meiotic drive. We argue here that many features of genome evolution, content, and organization that are seemingly inexplicable by organismal adaptation or nearly neutral processes are instead best accounted for by recurrent histories of meiotic drive. We review how meiotic drive can affect the evolution of sequences, gene copy numbers, genes with functions in meiosis and gametogenesis, signatures of "selection", chromosome rearrangements, and karyotype evolution. We also explore the interactions of meiotic drive elements with other classes of selfish genetic elements, including satellite DNAs, transposable elements, and with the endogenous host genes involved in drive suppression. Finally, we argue that some aspects of drive-mediated genome evolution are now sufficiently well established that we might reverse the direction of discovery- rather than ask how drive affects genome evolution, we can use genome data to discover new putative drive elements.

减数分裂驱动因子是自私的遗传因子，通过扭曲平等的孟德尔分离来获得遗传优势。几十年来，生物学家一直认为减数分裂驱动因素是有趣的，尽管是深奥的案例研究。然而，现在很清楚，减数分裂驱动比以前认为的更普遍，在系统发育上也更广泛。事实上，对一些著名案例的深入研究已经开始揭示减数分裂驱动的进化基因组后果。我们认为，基因组进化、内容和组织的许多特征似乎无法用有机体适应或近乎中性的过程来解释，相反，减数分裂驱动的循环历史最好地解释了这些特征。我们回顾了减数分裂驱动如何影响序列的进化、基因拷贝数、减数分裂和配子体发生中起作用的基因、“选择”特征、染色体重排和核型进化。我们还探讨了减数分裂驱动元件与其他类型的自私遗传元件的相互作用，包括卫星dna，转座元件，以及与内源性宿主基因参与驱动抑制。最后，我们认为驱动介导的基因组进化的某些方面现在已经充分确立，我们可能会逆转发现的方向-而不是问驱动如何影响基因组进化，我们可以使用基因组数据来发现新的假定的驱动元素。

{"title":"The Evolutionary Genomics of Meiotic Drive.","authors":"Daven C Presgraves, R Kelly Dawe, Kelly A Dyer, Lila Fishman, Soumitra A Bhide, Sasha L Bradshaw, Meghan J Brady, Alejandro Burga, Cécile Courret, Brandon L Fagen, Ana Beatriz Stein Machado Ferretti, Reka K Kelemen, Jun Kitano, Yiran Liu, Emiliano Martí, Theresa Erlenbach, Josephine A Reinhardt, Laura Ross, Jan-Niklas Runge, Callie M Swanepoel, Beatriz Vicoso, Aaron A Vogan, Anna K Lindholm, Amanda M Larracuente, Robert L Unckless","doi":"10.1093/molbev/msag020","DOIUrl":"https://doi.org/10.1093/molbev/msag020","url":null,"abstract":"Meiotic drivers are selfish genetic elements that gain transmission advantages by distorting equal, Mendelian segregation. For decades, biologists have considered meiotic drivers as interesting, albeit esoteric, case studies. It is now clear, however, that meiotic drive is more common and phylogenetically widespread than previously supposed. Indeed, intensive study of a few well-known cases has begun to reveal the evolutionary genomic consequences of meiotic drive. We argue here that many features of genome evolution, content, and organization that are seemingly inexplicable by organismal adaptation or nearly neutral processes are instead best accounted for by recurrent histories of meiotic drive. We review how meiotic drive can affect the evolution of sequences, gene copy numbers, genes with functions in meiosis and gametogenesis, signatures of \"selection\", chromosome rearrangements, and karyotype evolution. We also explore the interactions of meiotic drive elements with other classes of selfish genetic elements, including satellite DNAs, transposable elements, and with the endogenous host genes involved in drive suppression. Finally, we argue that some aspects of drive-mediated genome evolution are now sufficiently well established that we might reverse the direction of discovery- rather than ask how drive affects genome evolution, we can use genome data to discover new putative drive elements.","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146053110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Phylogeny of waterfowl (Anseriformes) constructed using genome sequences provides insights into topological incongruences. 使用基因组序列构建水禽（雁形目）的系统发育提供了对拓扑不一致的见解。

IF 5.3 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular biology and evolution

Pub Date : 2026-01-21 DOI: 10.1093/molbev/msag018

Gang Wang, Tao Zhu, Xinye Zhang, Xufang Ren, Anqi Chen, Zhonghua Ning, Marcel van Tuinen, Lujiang Qu

The evolutionary history of waterfowl (Anseriformes) has long been a focal point of avian research. However, previous phylogenetic investigations have focused primarily on morphology or mitochondrial DNA or have lacked sufficient taxon sampling. Accompanied by observed phylogenetic incongruence and incomplete resolution, waterfowl phylogenetic branching patterns remain uncertain at various taxonomic ranks. To further validate phylogenetic relationships among higher waterfowl taxa and assess presence of conflicting signal, we assembled and analyzed 24 waterfowl genomes representing all waterfowl families and several subfamilies. Utilizing both newly acquired and previously obtained genomes, we constructed and analyzed seven DNA data classes, which yielded highly resolved phylogenetic trees including a time-calibrated tree. Most of these trees consistently and completely resolved the phylogenetic relationships of the included waterfowl species. Despite these efforts, our analysis across chromosomes uncovered four instances of phylogenetic incongruous signal. After minimizing tree estimation error through focus on whole genome alignment (WGA) dataset and by sequence simulation, analyses revealed that ILS and gene introgression essentially contributed to all gene-tree discordance. The variable impact of both factors across distinct waterfowl nodes reflects an underlying complexity that warrants further interpretation. This study not only presents a strongly-supported and well-resolved phylogenetic backbone for the major waterfowl lineages, but also provides foundational data for subsequent comparative genomics studies of a more expanded set of waterfowl taxa.

水禽（雁形目）的进化史一直是鸟类研究的焦点。然而，以前的系统发育研究主要集中在形态学或线粒体DNA上，或者缺乏足够的分类群样本。随着观察到的系统发育不一致和不完全分解，水禽系统发育分支模式在不同的分类等级上仍然不确定。为了进一步验证高等水禽类群之间的系统发育关系并评估冲突信号的存在，我们收集并分析了代表所有水禽科和几个亚科的24个水禽基因组。利用新获得的和以前获得的基因组，我们构建并分析了7个DNA数据类，这些数据类产生了高度分辨率的系统发育树，包括一个时间校准树。大多数这些树一致地和完全地解决了所包括的水禽物种的系统发育关系。尽管有这些努力，我们的跨染色体分析发现了四个系统发育不协调信号的实例。通过全基因组比对（WGA）数据集和序列模拟最小化树估计误差后，分析表明，ILS和基因渗入是所有基因树不一致的主要原因。这两个因素在不同水禽节点上的不同影响反映了一种潜在的复杂性，值得进一步解释。该研究不仅为主要水禽谱系提供了强有力的支持和良好的系统发育主干，而且为后续对更广泛的水禽类群进行比较基因组学研究提供了基础数据。

{"title":"Phylogeny of waterfowl (Anseriformes) constructed using genome sequences provides insights into topological incongruences.","authors":"Gang Wang, Tao Zhu, Xinye Zhang, Xufang Ren, Anqi Chen, Zhonghua Ning, Marcel van Tuinen, Lujiang Qu","doi":"10.1093/molbev/msag018","DOIUrl":"https://doi.org/10.1093/molbev/msag018","url":null,"abstract":"The evolutionary history of waterfowl (Anseriformes) has long been a focal point of avian research. However, previous phylogenetic investigations have focused primarily on morphology or mitochondrial DNA or have lacked sufficient taxon sampling. Accompanied by observed phylogenetic incongruence and incomplete resolution, waterfowl phylogenetic branching patterns remain uncertain at various taxonomic ranks. To further validate phylogenetic relationships among higher waterfowl taxa and assess presence of conflicting signal, we assembled and analyzed 24 waterfowl genomes representing all waterfowl families and several subfamilies. Utilizing both newly acquired and previously obtained genomes, we constructed and analyzed seven DNA data classes, which yielded highly resolved phylogenetic trees including a time-calibrated tree. Most of these trees consistently and completely resolved the phylogenetic relationships of the included waterfowl species. Despite these efforts, our analysis across chromosomes uncovered four instances of phylogenetic incongruous signal. After minimizing tree estimation error through focus on whole genome alignment (WGA) dataset and by sequence simulation, analyses revealed that ILS and gene introgression essentially contributed to all gene-tree discordance. The variable impact of both factors across distinct waterfowl nodes reflects an underlying complexity that warrants further interpretation. This study not only presents a strongly-supported and well-resolved phylogenetic backbone for the major waterfowl lineages, but also provides foundational data for subsequent comparative genomics studies of a more expanded set of waterfowl taxa.","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146011332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0