Ercan Seçkin, Dominique Colinet, Edoardo Sarti, Etienne G J Danchin
Genes that lack identifiable homologs in other species have been an intriguing and interesting topic of research for many years. These so-called orphan genes were first studied in yeast and since then, they have been found in many other species. This has fostered a whole field of research aiming at tracing back their evolutionary origin and functional significance. Orphan genes represent an important part of protein-coding genes in many species. Their presence was initially mainly hypothesized to result from high divergence from a pre-existing gene, with duplications or horizontal gene transfer facilitating their accelerated evolution. More recently, their possible de novo emergence from nongenic regions has gained particular interest. Several orphan genes are predicted to be involved in reproduction, while others are involved in specific developmental stages, in adaptation mechanisms such as freeze protection or even human disease. However, there is currently no unified resource or synthesis that brings together existing knowledge about how prevalent orphan genes are across different species and what their roles might be. In this review, we focus on orphan genes in animals and fungi. We provide a detailed summary of discoveries over time in terms of orphan gene prevalence in genomes, their origins as well as their roles in different biological contexts.
{"title":"Orphan and de novo Genes in Fungi and Animals: Identification, Origins and Functions.","authors":"Ercan Seçkin, Dominique Colinet, Edoardo Sarti, Etienne G J Danchin","doi":"10.1093/gbe/evaf220","DOIUrl":"10.1093/gbe/evaf220","url":null,"abstract":"<p><p>Genes that lack identifiable homologs in other species have been an intriguing and interesting topic of research for many years. These so-called orphan genes were first studied in yeast and since then, they have been found in many other species. This has fostered a whole field of research aiming at tracing back their evolutionary origin and functional significance. Orphan genes represent an important part of protein-coding genes in many species. Their presence was initially mainly hypothesized to result from high divergence from a pre-existing gene, with duplications or horizontal gene transfer facilitating their accelerated evolution. More recently, their possible de novo emergence from nongenic regions has gained particular interest. Several orphan genes are predicted to be involved in reproduction, while others are involved in specific developmental stages, in adaptation mechanisms such as freeze protection or even human disease. However, there is currently no unified resource or synthesis that brings together existing knowledge about how prevalent orphan genes are across different species and what their roles might be. In this review, we focus on orphan genes in animals and fungi. We provide a detailed summary of discoveries over time in terms of orphan gene prevalence in genomes, their origins as well as their roles in different biological contexts.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12684174/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145603937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Wei-Qiao Rao, Esperanza Rivera-de-Torre, Lorenzo Seneci, Min-Hui Shi, Yao-Lei Zhang, Liang Lin, Tian-Ming Lan, Jože Pungerčar, Si-Qi Liu, Andreas H Laustsen
We present a chromosome-level genome assembly of the western nose-horned viper (Vipera ammodytes ammodytes), the most medically important viper in Europe. Using PacBio Sequel and Illumina HiSeq X Ten sequencing, we generated ∼270 Gb of data, achieving ∼131× coverage of the genome. The final assembly spans 1.55 Gb with a contig N50 of 45.9 Mb and a scaffold N50 of 210 Mb, anchored into 18 pseudo-chromosomes. Completeness was supported by recovery of 97.1% of Vertebrata BUSCOs. A total of 20,775 protein-coding genes were predicted, of which 96.6% were functionally annotated. Repetitive sequences accounted for 53.75% of the genome, dominated by LINEs (41.87%) and LTRs (14.35%). We identified 112 venom-related genes across 15 families, with expansions in SVMPs, Snaclecs, sPLA₂s, SPIs, and SVSPs, together comprising 62.5% of the venom repertoire. Chemosensory genes were also expanded, including 448 olfactory receptors, 72 taste receptors, and 29 vomeronasal receptors. This assembly represents the most complete genome resource for a true viper to date and provides a key resource for investigating venom evolution, chemosensory adaptation, and comparative snake genomics.
{"title":"A Chromosome-level Genome Assembly of the Western Nose-Horned Viper (Vipera ammodytes ammodytes).","authors":"Wei-Qiao Rao, Esperanza Rivera-de-Torre, Lorenzo Seneci, Min-Hui Shi, Yao-Lei Zhang, Liang Lin, Tian-Ming Lan, Jože Pungerčar, Si-Qi Liu, Andreas H Laustsen","doi":"10.1093/gbe/evaf210","DOIUrl":"10.1093/gbe/evaf210","url":null,"abstract":"<p><p>We present a chromosome-level genome assembly of the western nose-horned viper (Vipera ammodytes ammodytes), the most medically important viper in Europe. Using PacBio Sequel and Illumina HiSeq X Ten sequencing, we generated ∼270 Gb of data, achieving ∼131× coverage of the genome. The final assembly spans 1.55 Gb with a contig N50 of 45.9 Mb and a scaffold N50 of 210 Mb, anchored into 18 pseudo-chromosomes. Completeness was supported by recovery of 97.1% of Vertebrata BUSCOs. A total of 20,775 protein-coding genes were predicted, of which 96.6% were functionally annotated. Repetitive sequences accounted for 53.75% of the genome, dominated by LINEs (41.87%) and LTRs (14.35%). We identified 112 venom-related genes across 15 families, with expansions in SVMPs, Snaclecs, sPLA₂s, SPIs, and SVSPs, together comprising 62.5% of the venom repertoire. Chemosensory genes were also expanded, including 448 olfactory receptors, 72 taste receptors, and 29 vomeronasal receptors. This assembly represents the most complete genome resource for a true viper to date and provides a key resource for investigating venom evolution, chemosensory adaptation, and comparative snake genomics.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12680307/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145481480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Theresa Erlenbach, Ryan M R Gawryluk, Steve J Perlman, Robert L Unckless, Kelly A Dyer
Comparative genomic analyses among closely related species provide an opportunity to assess their evolutionary history. The relatedness between species can depend on a variety of factors, including reproductive isolation, introgression, and incomplete lineage sorting, and this can impact divergence across the genome. Here, we use a combination of long- and short-read sequencing and HI-C scaffolding to assemble genomes for each of the four species in the testacea species group of Drosophila, including D. testacea, D. orientacea, D. neotestacea, and D. putrida, and its outgroup, D. bizonata. First, among species, we find many structural rearrangements across the genome as well as a large size difference in the dot chromosome that we infer is due to the expansion of repetitive elements. Second, we assess phylogenetic discordance and uncover a difference in the phylogeny inferred from genes on Muller E and the mitogenome relative to the rest of the genome, which may be due to recent hybridization. Lastly, we assess the rate of molecular evolution of genes shared across all species and identify genes evolving at different rates across the phylogeny. Our results present genomic resources for this species group and begin to probe into some of the evolutionary characteristics that contribute to variation in genome structure, while highlighting the need for high-quality genome resources to fully capture and understand the evolutionary history among closely related species.
{"title":"Comparative Genomics of the Testacea Group of Drosophila Reveals Introgression and Variation in Chromosome Size and Structure.","authors":"Theresa Erlenbach, Ryan M R Gawryluk, Steve J Perlman, Robert L Unckless, Kelly A Dyer","doi":"10.1093/gbe/evaf225","DOIUrl":"10.1093/gbe/evaf225","url":null,"abstract":"<p><p>Comparative genomic analyses among closely related species provide an opportunity to assess their evolutionary history. The relatedness between species can depend on a variety of factors, including reproductive isolation, introgression, and incomplete lineage sorting, and this can impact divergence across the genome. Here, we use a combination of long- and short-read sequencing and HI-C scaffolding to assemble genomes for each of the four species in the testacea species group of Drosophila, including D. testacea, D. orientacea, D. neotestacea, and D. putrida, and its outgroup, D. bizonata. First, among species, we find many structural rearrangements across the genome as well as a large size difference in the dot chromosome that we infer is due to the expansion of repetitive elements. Second, we assess phylogenetic discordance and uncover a difference in the phylogeny inferred from genes on Muller E and the mitogenome relative to the rest of the genome, which may be due to recent hybridization. Lastly, we assess the rate of molecular evolution of genes shared across all species and identify genes evolving at different rates across the phylogeny. Our results present genomic resources for this species group and begin to probe into some of the evolutionary characteristics that contribute to variation in genome structure, while highlighting the need for high-quality genome resources to fully capture and understand the evolutionary history among closely related species.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12708344/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145654211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Anthony J Barley, David V Ho, Peter Baumann, Ian J Wang, H Bradley Shaffer, Robert N Fisher, Levi N Gray, Trevor J Krabbenhoft, Robert E Espinoza, Merly Escalona, Erin Toffelmier, Ruta Sahasrabudhe, Oanh Nguyen, Colin W Fairbairn, Eric Beraut, Robert C Thomson
The transition from small genetic to genome-scale datasets for studying biodiversity has revealed that genetic exchange through introgressive hybridization is a widespread phenomenon in nature. Despite this, a lack of high-quality reference genomes for most non-model species limits our understanding of the impact of this process for many taxonomic groups. This restricts the range of insights that genomic tools can provide for conservation biologists, who often hope to employ genomic datasets to accurately identify historically isolated lineages to protect and to predict their evolutionary fate in the face of environmental change. Tiger whiptail lizards (Aspidoscelis tigris complex) are an abundant and important ecological component of ecosystems across the southwestern United States. In this study, we assembled and annotated a chromosome-level reference genome for A. t. stejnegeri from coastal California. We then used this reference genome to reconstruct patterns of speciation and admixture within the larger species complex, finding evidence that gene flow is widespread both geographically and across the genome.
生物多样性研究从小型遗传数据到基因组数据的转变表明,通过渐进杂交进行的遗传交换是自然界中普遍存在的现象。尽管如此,缺乏大多数非模式物种的高质量参考基因组限制了我们对这一过程对许多分类群体的影响的理解。这限制了基因组工具可以为保护生物学家提供的见解范围,保护生物学家通常希望使用基因组数据集来准确识别历史上孤立的谱系,以保护和预测它们在面对环境变化时的进化命运。虎鞭尾蜥蜴(Aspidoscelis tigris complex)是美国西南部生态系统中丰富而重要的生态组成部分。在这项研究中,我们组装并注释了来自加利福尼亚沿海的a. t. stejnegeri染色体水平的参考基因组。然后,我们使用这个参考基因组来重建更大的物种复合体中的物种形成和混合模式,发现基因流动在地理上和整个基因组中都很广泛的证据。
{"title":"Speciation Genomics in the Tiger Whiptail Lizards (Aspidoscelis tigris Complex).","authors":"Anthony J Barley, David V Ho, Peter Baumann, Ian J Wang, H Bradley Shaffer, Robert N Fisher, Levi N Gray, Trevor J Krabbenhoft, Robert E Espinoza, Merly Escalona, Erin Toffelmier, Ruta Sahasrabudhe, Oanh Nguyen, Colin W Fairbairn, Eric Beraut, Robert C Thomson","doi":"10.1093/gbe/evaf218","DOIUrl":"10.1093/gbe/evaf218","url":null,"abstract":"<p><p>The transition from small genetic to genome-scale datasets for studying biodiversity has revealed that genetic exchange through introgressive hybridization is a widespread phenomenon in nature. Despite this, a lack of high-quality reference genomes for most non-model species limits our understanding of the impact of this process for many taxonomic groups. This restricts the range of insights that genomic tools can provide for conservation biologists, who often hope to employ genomic datasets to accurately identify historically isolated lineages to protect and to predict their evolutionary fate in the face of environmental change. Tiger whiptail lizards (Aspidoscelis tigris complex) are an abundant and important ecological component of ecosystems across the southwestern United States. In this study, we assembled and annotated a chromosome-level reference genome for A. t. stejnegeri from coastal California. We then used this reference genome to reconstruct patterns of speciation and admixture within the larger species complex, finding evidence that gene flow is widespread both geographically and across the genome.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":"17 12","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12720011/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145804088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Katharina Sures, Sarah P Esser, Till L V Bornemann, Carrie J Moore, André R Soares, Julia Plewka, Perla Abigail Figueroa-Gonzalez, S Emil Ruff, Cristina Moraru, Alexander J Probst
Clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) systems of bacteria and archaea provide immunities against mobile genetic elements, like viruses. In addition, protospacer analyses revealed a very specific acquisition of CRISPR spacers derived from genomes of related species or from closely interacting episymbiont genomes as recently shown for subsurface archaea. However, the origin of most of the spacers that can be found in CRISPR-Cas systems from natural environments has not been deciphered. Here, by analyzing CRISPR-Cas systems of metagenome-assembled genomes (MAGs) from two subsurface environments spanning more than 1 Tb of sequencing data, we show that a substantial proportion of CRISPR spacers are acquired from DNA of other prokaryotes inhabiting the same environment. As such, we found that the number of respective spacers can be up to three times higher than the number of self-targeting spacers. Statistical analyses demonstrated that the acquisition of CRISPR spacers from other prokaryotic genomes is partly explained by the relative abundance of the MAG containing the protospacer, as well as by other factors, such as the total number of CRISPR arrays present in a MAG with the respective spacers. Further, we found that spacer acquisition from foreign prokaryotic DNA occurs in almost all types of CRISPR-Cas systems, but shows preferences for subtypes of CRISPR-Cas systems that differ across the investigated ecosystems. Taken together, our results shed new light on the diversity of CRISPR spacers in natural microbial communities and provide an explanation for some of the many unmatched spacers in public databases.
{"title":"Acquisition of Spacers from Foreign Prokaryotic Genomes by CRISPR-Cas Systems in Natural Environments.","authors":"Katharina Sures, Sarah P Esser, Till L V Bornemann, Carrie J Moore, André R Soares, Julia Plewka, Perla Abigail Figueroa-Gonzalez, S Emil Ruff, Cristina Moraru, Alexander J Probst","doi":"10.1093/gbe/evaf201","DOIUrl":"10.1093/gbe/evaf201","url":null,"abstract":"<p><p>Clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) systems of bacteria and archaea provide immunities against mobile genetic elements, like viruses. In addition, protospacer analyses revealed a very specific acquisition of CRISPR spacers derived from genomes of related species or from closely interacting episymbiont genomes as recently shown for subsurface archaea. However, the origin of most of the spacers that can be found in CRISPR-Cas systems from natural environments has not been deciphered. Here, by analyzing CRISPR-Cas systems of metagenome-assembled genomes (MAGs) from two subsurface environments spanning more than 1 Tb of sequencing data, we show that a substantial proportion of CRISPR spacers are acquired from DNA of other prokaryotes inhabiting the same environment. As such, we found that the number of respective spacers can be up to three times higher than the number of self-targeting spacers. Statistical analyses demonstrated that the acquisition of CRISPR spacers from other prokaryotic genomes is partly explained by the relative abundance of the MAG containing the protospacer, as well as by other factors, such as the total number of CRISPR arrays present in a MAG with the respective spacers. Further, we found that spacer acquisition from foreign prokaryotic DNA occurs in almost all types of CRISPR-Cas systems, but shows preferences for subtypes of CRISPR-Cas systems that differ across the investigated ecosystems. Taken together, our results shed new light on the diversity of CRISPR spacers in natural microbial communities and provide an explanation for some of the many unmatched spacers in public databases.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":"17 11","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12612501/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145503396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Marina Brasó-Vives, Diego A Hartasánchez, Julien F Ayroles, Christabel F Bucao, Mayra Furlan-Magaril, Paco Majic, Luisa F Pallares, James Phipps-Tan, Thea F Rogers, Aurora Ruiz-Herrera, Gabriela Santos-Rodriguez, Nikolas Vellnow, Justin J S Wilcox, Juan Antonio Rodríguez
Genomic diversity within species encompasses a range of sequence-related, structural, and regulatory features. To illustrate their complexity, we invoke the analogy of a kaleidoscope: while the DNA sequence represents its core, the genome has a dynamic, multidimensional configuration shaped by interactions across these features, generating an array of dimensions of genomic variation. In this perspective, we highlight underexplored dimensions of genomic variation that contribute to phenotypic diversity. We begin by revisiting the existence of noncanonical chromosomes and by emphasizing the role of large-scale structural changes and the 3D genome architecture in modulating genomic function. We then examine the regulatory mechanisms shaping transcriptional activity and genetic variation that, instead of regulating mean trait values, defines the degree of trait variability. Finally, we discuss the influence of sequence composition on its mutational potential. These dimensions, though rooted in sequence, are context dependent, interconnected, and often difficult to disentangle, reflecting a level of structural and regulatory complexity that challenges traditional genotype-phenotype models. By synthesizing recent findings across these dimensions, we argue for a broader framework for studying within-species genomic diversity: one that accounts for the diverse molecular architectures underlying phenotypic output. This expanded view not only deepens our knowledge of the genome itself but also contributes to our understanding of genome evolution and within-species phenotypic variation.
{"title":"The Genomic Kaleidoscope: On the Hidden Dimensions of Within-Species Genomic Diversity.","authors":"Marina Brasó-Vives, Diego A Hartasánchez, Julien F Ayroles, Christabel F Bucao, Mayra Furlan-Magaril, Paco Majic, Luisa F Pallares, James Phipps-Tan, Thea F Rogers, Aurora Ruiz-Herrera, Gabriela Santos-Rodriguez, Nikolas Vellnow, Justin J S Wilcox, Juan Antonio Rodríguez","doi":"10.1093/gbe/evaf204","DOIUrl":"10.1093/gbe/evaf204","url":null,"abstract":"<p><p>Genomic diversity within species encompasses a range of sequence-related, structural, and regulatory features. To illustrate their complexity, we invoke the analogy of a kaleidoscope: while the DNA sequence represents its core, the genome has a dynamic, multidimensional configuration shaped by interactions across these features, generating an array of dimensions of genomic variation. In this perspective, we highlight underexplored dimensions of genomic variation that contribute to phenotypic diversity. We begin by revisiting the existence of noncanonical chromosomes and by emphasizing the role of large-scale structural changes and the 3D genome architecture in modulating genomic function. We then examine the regulatory mechanisms shaping transcriptional activity and genetic variation that, instead of regulating mean trait values, defines the degree of trait variability. Finally, we discuss the influence of sequence composition on its mutational potential. These dimensions, though rooted in sequence, are context dependent, interconnected, and often difficult to disentangle, reflecting a level of structural and regulatory complexity that challenges traditional genotype-phenotype models. By synthesizing recent findings across these dimensions, we argue for a broader framework for studying within-species genomic diversity: one that accounts for the diverse molecular architectures underlying phenotypic output. This expanded view not only deepens our knowledge of the genome itself but also contributes to our understanding of genome evolution and within-species phenotypic variation.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12612680/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145495275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The prevalence and evolutionary importance of inversion polymorphisms in natural populations is poorly known because of limited genome-wide sequence data availability for most species. Inversion studies in wild populations usually target rare cases of major trait polymorphisms or local adaptation whose genetic basis involves inversions, creating a strong impression that inversions in nature are generally maintained by natural selection through links to ecologically relevant phenotypes. By contrast, genome-wide studies in humans and model organisms suggest that inversion polymorphisms are common, subject to highly complex evolutionary processes, and generally difficult to link with clearly observable cases of phenotypic variation. Using a large comparative population genomic dataset generated from 35 codistributed species of birds, we tested the hypothesis that inversions are common even within populations that lack known phenotypic polymorphisms. We leveraged analytical methods suitable for low-coverage whole genome sequencing to reveal evidence for over 170 putative inversion polymorphisms within 28 species. We find that many polymorphisms are large and present at balanced frequencies, and some are shared across species boundaries. Yet, most polymorphisms do not deviate significantly from Hardy-Weinberg Equilibrium, raising the possibility that many of these massive haploblocks could be segregating neutrally. Our results thereby reveal evidence that inversions show a variety of complex yet largely hidden patterns in natural populations, beyond cases where they contribute to known variation in ecologically relevant traits.
{"title":"Large Inversion Polymorphisms are Widespread in North American Songbirds.","authors":"Teresa M Pegan, Benjamin M Winger","doi":"10.1093/gbe/evaf205","DOIUrl":"10.1093/gbe/evaf205","url":null,"abstract":"<p><p>The prevalence and evolutionary importance of inversion polymorphisms in natural populations is poorly known because of limited genome-wide sequence data availability for most species. Inversion studies in wild populations usually target rare cases of major trait polymorphisms or local adaptation whose genetic basis involves inversions, creating a strong impression that inversions in nature are generally maintained by natural selection through links to ecologically relevant phenotypes. By contrast, genome-wide studies in humans and model organisms suggest that inversion polymorphisms are common, subject to highly complex evolutionary processes, and generally difficult to link with clearly observable cases of phenotypic variation. Using a large comparative population genomic dataset generated from 35 codistributed species of birds, we tested the hypothesis that inversions are common even within populations that lack known phenotypic polymorphisms. We leveraged analytical methods suitable for low-coverage whole genome sequencing to reveal evidence for over 170 putative inversion polymorphisms within 28 species. We find that many polymorphisms are large and present at balanced frequencies, and some are shared across species boundaries. Yet, most polymorphisms do not deviate significantly from Hardy-Weinberg Equilibrium, raising the possibility that many of these massive haploblocks could be segregating neutrally. Our results thereby reveal evidence that inversions show a variety of complex yet largely hidden patterns in natural populations, beyond cases where they contribute to known variation in ecologically relevant traits.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12628760/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145431375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Juan Pablo Marczuk-Rojas, Ana D Maldonado, Lorenzo Carretero-Paulet
NUPTs are DNA sequences of plastid origin that are present in plant nuclear genomes in varying, though typically low, amounts. It is assumed that they are continuously formed and, due to their potentially mutagenic effect, they are removed at a constant turnover rate, which should result in an exponential decay of their age distributions and a negative correlation between age and size. However, these assumptions are based on analyses from a limited number of species and have never been explicitly tested. To gain insight into the mechanisms driving the origin and evolution of NUPTs, here we surveyed the plastid and nuclear genomes of 30 species representing the main angiosperm (flowering plants) lineages. By modeling the distribution of ages and sizes, examining their linear arrangement across the plastid genome, and statistically assessing spatial biases with respect to other genomic features, we showed that NUPTs are: (i) formed by both continuous and episodic mechanisms; (ii) unevenly represented across the plastid genome; (iii) consistently associated with certain classes of RNA genes, in particular rRNA, tRNA, and regulatory RNA genes; (iv) differentially contributing to structural genes; and (v) closer than expected to different superfamilies of transposons in a species-specific manner. Our results reveal the unexpected complexity in the mechanisms driving the origin of NUPTs, which not only involve their continuous formation but also episodic events, highlight their role as a major source of noncoding RNA genes and other genomic features, and provide a more complete picture of the different drivers of evolutionary change at the genome level.
{"title":"Episodic and Ongoing Mechanisms Drive Plastid-Derived Nuclear DNA Evolution in Angiosperms.","authors":"Juan Pablo Marczuk-Rojas, Ana D Maldonado, Lorenzo Carretero-Paulet","doi":"10.1093/gbe/evaf194","DOIUrl":"10.1093/gbe/evaf194","url":null,"abstract":"<p><p>NUPTs are DNA sequences of plastid origin that are present in plant nuclear genomes in varying, though typically low, amounts. It is assumed that they are continuously formed and, due to their potentially mutagenic effect, they are removed at a constant turnover rate, which should result in an exponential decay of their age distributions and a negative correlation between age and size. However, these assumptions are based on analyses from a limited number of species and have never been explicitly tested. To gain insight into the mechanisms driving the origin and evolution of NUPTs, here we surveyed the plastid and nuclear genomes of 30 species representing the main angiosperm (flowering plants) lineages. By modeling the distribution of ages and sizes, examining their linear arrangement across the plastid genome, and statistically assessing spatial biases with respect to other genomic features, we showed that NUPTs are: (i) formed by both continuous and episodic mechanisms; (ii) unevenly represented across the plastid genome; (iii) consistently associated with certain classes of RNA genes, in particular rRNA, tRNA, and regulatory RNA genes; (iv) differentially contributing to structural genes; and (v) closer than expected to different superfamilies of transposons in a species-specific manner. Our results reveal the unexpected complexity in the mechanisms driving the origin of NUPTs, which not only involve their continuous formation but also episodic events, highlight their role as a major source of noncoding RNA genes and other genomic features, and provide a more complete picture of the different drivers of evolutionary change at the genome level.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12572781/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145279956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Joseph Jalinsky, Kyle E McElroy, Joel Sharbrough, Laura Bankers, Peter D Fields, Chelsea Higgins, Cynthia Toll, Jeffrey L Boore, John M Logsdon, Maurine Neiman
Key unanswered questions in biology center on the causes, consequences, and maintenance of sexual reproduction ("sex"). Genome-driven processes are central to the evolutionary and genetic mechanisms inherent to sex, making genomic resources a fundamental part of answering these questions. We present the first genome assembly for a species that is uniquely well-suited for the study of (a)sex in nature, Potamopyrgus antipodarum. This New Zealand snail is unusual in featuring multiple separate transitions from obligately sexual to obligately asexual reproduction, leading to the coexistence of phenotypically similar sexual and asexual forms, a feature that is required to directly study the maintenance of sex. These separately derived asexual lineages constitute separate evolutionary experiments, providing a powerful means of characterizing how the absence of sex affects genome evolution. Our genome assembly provides critical steps toward understanding the causes and consequences of sex in this system and important resources for the rapidly growing P. antipodarum and molluscan genomics research community. In characterizing this genome, we uncovered unexpected evidence for a recent whole-genome duplication (WGD) in P. antipodarum. This discovery sets the stage for using P. antipodarum to evaluate processes of rediploidization following WGD and to assess whether WGD might drive transitions to asexuality.
{"title":"Whole-Genome Sequence of Potamopyrgus antipodarum-A Model System for the Maintenance of Sexual Reproduction-Reveals a Recent Whole-Genome Duplication.","authors":"Joseph Jalinsky, Kyle E McElroy, Joel Sharbrough, Laura Bankers, Peter D Fields, Chelsea Higgins, Cynthia Toll, Jeffrey L Boore, John M Logsdon, Maurine Neiman","doi":"10.1093/gbe/evaf192","DOIUrl":"10.1093/gbe/evaf192","url":null,"abstract":"<p><p>Key unanswered questions in biology center on the causes, consequences, and maintenance of sexual reproduction (\"sex\"). Genome-driven processes are central to the evolutionary and genetic mechanisms inherent to sex, making genomic resources a fundamental part of answering these questions. We present the first genome assembly for a species that is uniquely well-suited for the study of (a)sex in nature, Potamopyrgus antipodarum. This New Zealand snail is unusual in featuring multiple separate transitions from obligately sexual to obligately asexual reproduction, leading to the coexistence of phenotypically similar sexual and asexual forms, a feature that is required to directly study the maintenance of sex. These separately derived asexual lineages constitute separate evolutionary experiments, providing a powerful means of characterizing how the absence of sex affects genome evolution. Our genome assembly provides critical steps toward understanding the causes and consequences of sex in this system and important resources for the rapidly growing P. antipodarum and molluscan genomics research community. In characterizing this genome, we uncovered unexpected evidence for a recent whole-genome duplication (WGD) in P. antipodarum. This discovery sets the stage for using P. antipodarum to evaluate processes of rediploidization following WGD and to assess whether WGD might drive transitions to asexuality.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":"17 11","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12596079/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145476964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shahan Derkarabetian, Arianna Lord, Katherine Angier, Julia G Cosgrove, Ella Frigyik, Sara González-Delgado, Breanna Jordan, Paula C Rodríguez-Flores, Shoyo Sato, Lily Shapiro, Gonzalo Giribet
Biology is in a genomics era, but many researchers may still be alienated from these techniques as a lack of genomic resources remains for many animal groups, necessitating further democratization. Hybrid capture is a popular phylogenomic approach in molecular systematics, with ultraconserved elements being the most popular. However, access to ultraconserved element data is more expensive per sample relative to other commonly used genetic/genomic approaches. Using published genomes, we developed a metazoan ultraconserved element probe set, the universality of which allows multiple research groups working on vastly different animal lineages to share costs and resources across labs. We demonstrated the utility of this probe set both in silico against 58 published metazoan genomes and in vitro with 130 samples representing 33 metazoan phyla, showing that these probes target loci useful for both deep and shallow level relationships. The proportion of target ultraconserved elements sequenced by the Metazoa probe is equivalent to that of taxon-specific probe sets, but from across all animal phyla. We explored general patterns of ultraconserved element recovery across Metazoa using published genomes and the majority of publicly available ultraconserved element probe sets. The Metazoa probe set is available in three forms: the full set, containing 19,986 probes targeting 2,146 loci, a set containing 10,749 probes targeting 1,022 loci, and the reduced cost set, containing 5,098 probes targeting 466 loci. The development of a universal ultraconserved element probe set should expand the use of genomic data to a much larger segment of the zoological community, with strong potential for broad applications in phylogenomics across all animal phyla.
{"title":"Covering All Bases: A Universal Metazoan UCE Probe Set to Democratize Phylogenomics.","authors":"Shahan Derkarabetian, Arianna Lord, Katherine Angier, Julia G Cosgrove, Ella Frigyik, Sara González-Delgado, Breanna Jordan, Paula C Rodríguez-Flores, Shoyo Sato, Lily Shapiro, Gonzalo Giribet","doi":"10.1093/gbe/evaf193","DOIUrl":"10.1093/gbe/evaf193","url":null,"abstract":"<p><p>Biology is in a genomics era, but many researchers may still be alienated from these techniques as a lack of genomic resources remains for many animal groups, necessitating further democratization. Hybrid capture is a popular phylogenomic approach in molecular systematics, with ultraconserved elements being the most popular. However, access to ultraconserved element data is more expensive per sample relative to other commonly used genetic/genomic approaches. Using published genomes, we developed a metazoan ultraconserved element probe set, the universality of which allows multiple research groups working on vastly different animal lineages to share costs and resources across labs. We demonstrated the utility of this probe set both in silico against 58 published metazoan genomes and in vitro with 130 samples representing 33 metazoan phyla, showing that these probes target loci useful for both deep and shallow level relationships. The proportion of target ultraconserved elements sequenced by the Metazoa probe is equivalent to that of taxon-specific probe sets, but from across all animal phyla. We explored general patterns of ultraconserved element recovery across Metazoa using published genomes and the majority of publicly available ultraconserved element probe sets. The Metazoa probe set is available in three forms: the full set, containing 19,986 probes targeting 2,146 loci, a set containing 10,749 probes targeting 1,022 loci, and the reduced cost set, containing 5,098 probes targeting 466 loci. The development of a universal ultraconserved element probe set should expand the use of genomic data to a much larger segment of the zoological community, with strong potential for broad applications in phylogenomics across all animal phyla.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12571155/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145279958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}