Jasmine Gamblin, Amaury Lambert, François Blanquart
The pangenome of a species is the set of all genes carried by at least one member of the species. In bacteria, pangenomes can be much larger than the set of genes carried by a single organism. Many questions remain unanswered regarding the evolutionary forces shaping the patterns of presence/absence of genes in pangenomes of a given species. We introduce a new model for bacterial pangenome evolution along a species phylogeny that explicitly describes the timing of appearance of each gene in the species and accounts for three generic types of gene evolutionary dynamics: persistent genes that are present in the ancestral genome, private genes that are specific to a given clade, and mobile genes that are imported once into the gene pool and then undergo frequent horizontal gene transfers. We call this model the Persistent-Private-Mobile (PPM) model. We develop an algorithm fitting the PPM model and apply it to a dataset of 902 Salmonella enterica genomes. We show that the best fitting model is able to reproduce the global pattern of some multivariate statistics like the gene frequency spectrum and the parsimony vs. frequency plot. Moreover, the gene classification induced by the PPM model allows us to study the position of accessory genes on the chromosome depending on their category, as well as the gene functions that are most present in each category. This work paves the way for a mechanistic understanding of pangenome evolution, and the PPM model developed here could be used for dynamics-aware gene classification.
{"title":"Persistent, Private and Mobile genes: a model for gene dynamics in evolving pangenomes.","authors":"Jasmine Gamblin, Amaury Lambert, François Blanquart","doi":"10.1093/molbev/msaf001","DOIUrl":"https://doi.org/10.1093/molbev/msaf001","url":null,"abstract":"<p><p>The pangenome of a species is the set of all genes carried by at least one member of the species. In bacteria, pangenomes can be much larger than the set of genes carried by a single organism. Many questions remain unanswered regarding the evolutionary forces shaping the patterns of presence/absence of genes in pangenomes of a given species. We introduce a new model for bacterial pangenome evolution along a species phylogeny that explicitly describes the timing of appearance of each gene in the species and accounts for three generic types of gene evolutionary dynamics: persistent genes that are present in the ancestral genome, private genes that are specific to a given clade, and mobile genes that are imported once into the gene pool and then undergo frequent horizontal gene transfers. We call this model the Persistent-Private-Mobile (PPM) model. We develop an algorithm fitting the PPM model and apply it to a dataset of 902 Salmonella enterica genomes. We show that the best fitting model is able to reproduce the global pattern of some multivariate statistics like the gene frequency spectrum and the parsimony vs. frequency plot. Moreover, the gene classification induced by the PPM model allows us to study the position of accessory genes on the chromosome depending on their category, as well as the gene functions that are most present in each category. This work paves the way for a mechanistic understanding of pangenome evolution, and the PPM model developed here could be used for dynamics-aware gene classification.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142984071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tom Parée, Luke Noble, Denis Roze, Henrique Teotónio
Modifiers of recombination rates have been described but the selective pressures acting on them and their effect on adaptation to novel environments remain unclear. We performed experimental evolution in the nematode Caenorhabditis elegans using alternative rec-1 alleles modifying the position of meiotic crossovers along chromosomes without detectable direct fitness effects. We show that adaptation to a novel environment is impaired by the allele that decreases recombination rates in the genomic regions containing fitness variation. However, the allele that impairs adaptation is indirectly favored by selection because it increases recombination rates and reduces the associations among beneficial and deleterious variation located in its chromosomal vicinity. These results validate theoretical expectations about the evolution of recombination but suggest that genome-wide polygenic adaptation is of little consequence to indirect selection on recombination rate modifiers.
{"title":"Selection can favor a recombination landscape that limits polygenic adaptation.","authors":"Tom Parée, Luke Noble, Denis Roze, Henrique Teotónio","doi":"10.1093/molbev/msae273","DOIUrl":"https://doi.org/10.1093/molbev/msae273","url":null,"abstract":"<p><p>Modifiers of recombination rates have been described but the selective pressures acting on them and their effect on adaptation to novel environments remain unclear. We performed experimental evolution in the nematode Caenorhabditis elegans using alternative rec-1 alleles modifying the position of meiotic crossovers along chromosomes without detectable direct fitness effects. We show that adaptation to a novel environment is impaired by the allele that decreases recombination rates in the genomic regions containing fitness variation. However, the allele that impairs adaptation is indirectly favored by selection because it increases recombination rates and reduces the associations among beneficial and deleterious variation located in its chromosomal vicinity. These results validate theoretical expectations about the evolution of recombination but suggest that genome-wide polygenic adaptation is of little consequence to indirect selection on recombination rate modifiers.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142951586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The molecular basis of adaptive evolution and cancer progression are both complex processes that share many striking similarities. The potential adaptive significance of environmentally-induced epigenetic changes is currently an area of great interest in both evolutionary and cancer biology. In the field of cancer biology intense effort has been focused on the contribution of stress-induced non-coding RNAs (ncRNAs) in the activation of epigenetic changes associated with elevated mutation rates and the acquisition of environmentally adaptive traits. Examples of this process are presented and combined with more recent findings demonstrating that stress-induced ncRNAs are transferable from somatic to germline cells leading to cross-generational inheritance of acquired adaptive traits. The fact that ncRNAs have been implicated in the transient adaptive response of various plants and animals to environmental stress is consistent with findings in cancer biology. Based on these collective observations, a general model as well as specific and testable hypotheses are proposed on how transient ncRNA-mediated adaptive responses may facilitate the transition to long-term biological adaptation in both cancer and evolution.
{"title":"Adaptive Significance of Non-coding RNAs: Insights from Cancer Biology.","authors":"John F McDonald","doi":"10.1093/molbev/msae269","DOIUrl":"10.1093/molbev/msae269","url":null,"abstract":"<p><p>The molecular basis of adaptive evolution and cancer progression are both complex processes that share many striking similarities. The potential adaptive significance of environmentally-induced epigenetic changes is currently an area of great interest in both evolutionary and cancer biology. In the field of cancer biology intense effort has been focused on the contribution of stress-induced non-coding RNAs (ncRNAs) in the activation of epigenetic changes associated with elevated mutation rates and the acquisition of environmentally adaptive traits. Examples of this process are presented and combined with more recent findings demonstrating that stress-induced ncRNAs are transferable from somatic to germline cells leading to cross-generational inheritance of acquired adaptive traits. The fact that ncRNAs have been implicated in the transient adaptive response of various plants and animals to environmental stress is consistent with findings in cancer biology. Based on these collective observations, a general model as well as specific and testable hypotheses are proposed on how transient ncRNA-mediated adaptive responses may facilitate the transition to long-term biological adaptation in both cancer and evolution.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11725524/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142979419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Moran Meir, Arielle Kahn, Carmel Farage, Yael Maoz, Noam Harel, Adi Ben Zvi, Shir Segev, Maria Volkov, Ravit Yahud, Uri Gophna, Adi Stern
Cheater viruses cannot replicate on their own yet replicate faster than the wild type (WT) when the 2 viruses coinfect the same cell. Cheaters must possess dual genetic features: a defect, which leads to their inability to infect cells on their own, and a selective advantage over WT during coinfection. Previously, we have discovered 2 point-mutant cheaters of the MS2 bacteriophage. Here, we set out to discover the possible repertoire of cheater MS2 viruses by performing experimental evolution at a very high multiplicity of infection. Our results revealed a third point-mutant cheater that arose in 8 biological replicas. Each of the 3 primary cheaters disrupts the fine balance necessary for phage replication, in different ways that create a defect + advantage. We found that over time, the point-mutant cheaters accumulate additional secondary mutations, which alter other stages of the viral replication cycle, complementing the disruptions created by the original cheater. Intriguingly, cheater and secondary mutations almost always reside in very close proximity on the genome. This region encodes for multiple functions: overlapping reading frames as well as overlapping RNA structures critical for transitioning from one stage to another in the viral replication cycle. This region of overlap explains the dual functions of cheaters, as one mutation can have pleiotropic effects. Overall, these findings underscore how viruses, whose dense genomes often have overlapping functions, can easily evolve point-mutant cheaters, and how cheaters can evolve to alter the intricate balance of the viral replication cycle.
{"title":"Navigating a Fine Balance: Point-Mutant Cheater Viruses Disrupt the Viral Replication Cycle.","authors":"Moran Meir, Arielle Kahn, Carmel Farage, Yael Maoz, Noam Harel, Adi Ben Zvi, Shir Segev, Maria Volkov, Ravit Yahud, Uri Gophna, Adi Stern","doi":"10.1093/molbev/msae258","DOIUrl":"10.1093/molbev/msae258","url":null,"abstract":"<p><p>Cheater viruses cannot replicate on their own yet replicate faster than the wild type (WT) when the 2 viruses coinfect the same cell. Cheaters must possess dual genetic features: a defect, which leads to their inability to infect cells on their own, and a selective advantage over WT during coinfection. Previously, we have discovered 2 point-mutant cheaters of the MS2 bacteriophage. Here, we set out to discover the possible repertoire of cheater MS2 viruses by performing experimental evolution at a very high multiplicity of infection. Our results revealed a third point-mutant cheater that arose in 8 biological replicas. Each of the 3 primary cheaters disrupts the fine balance necessary for phage replication, in different ways that create a defect + advantage. We found that over time, the point-mutant cheaters accumulate additional secondary mutations, which alter other stages of the viral replication cycle, complementing the disruptions created by the original cheater. Intriguingly, cheater and secondary mutations almost always reside in very close proximity on the genome. This region encodes for multiple functions: overlapping reading frames as well as overlapping RNA structures critical for transitioning from one stage to another in the viral replication cycle. This region of overlap explains the dual functions of cheaters, as one mutation can have pleiotropic effects. Overall, these findings underscore how viruses, whose dense genomes often have overlapping functions, can easily evolve point-mutant cheaters, and how cheaters can evolve to alter the intricate balance of the viral replication cycle.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142864802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The origin of genes from noncoding sequences is a long-term and fundamental biological question. However, how de novo genes originate and integrate into the existing pathways to regulate phenotypic variations is largely unknown. Here, we selected 7 genes from 782 de novo genes for functional exploration based on transcriptional and translational evidence. Subsequently, we revealed that Sun Wu-Kong (SWK), a de novo gene that originated from a noncoding sequence in Arabidopsis thaliana, plays a role in seed germination under osmotic stress. SWK is primarily expressed in dry seed, imbibing seed and silique. SWK can be fully translated into an 8 kDa protein, which is mainly located in the nucleus. Intriguingly, SWK was integrated into an extant pathway of hydrogen peroxide content (folate synthesis pathway) via the upstream gene cytHPPK/DHPS, an Arabidopsis-specific gene that originated from the duplication of mitHPPK/DHPS, and downstream gene GSTF9, to improve seed germination in osmotic stress. In addition, we demonstrated that the presence of SWK may be associated with drought tolerance in natural populations of Arabidopsis. Overall, our study highlights how a de novo gene originated and integrated into the existing pathways to regulate stress adaptation.
{"title":"A de novo Gene Promotes Seed Germination Under Drought Stress in Arabidopsis.","authors":"Guang-Teng Jin, Yong-Chao Xu, Xing-Hui Hou, Juan Jiang, Xin-Xin Li, Jia-Hui Xiao, Yu-Tao Bian, Yan-Bo Gong, Ming-Yu Wang, Zhi-Qin Zhang, Yong E Zhang, Wang-Sheng Zhu, Yong-Xiu Liu, Ya-Long Guo","doi":"10.1093/molbev/msae262","DOIUrl":"10.1093/molbev/msae262","url":null,"abstract":"<p><p>The origin of genes from noncoding sequences is a long-term and fundamental biological question. However, how de novo genes originate and integrate into the existing pathways to regulate phenotypic variations is largely unknown. Here, we selected 7 genes from 782 de novo genes for functional exploration based on transcriptional and translational evidence. Subsequently, we revealed that Sun Wu-Kong (SWK), a de novo gene that originated from a noncoding sequence in Arabidopsis thaliana, plays a role in seed germination under osmotic stress. SWK is primarily expressed in dry seed, imbibing seed and silique. SWK can be fully translated into an 8 kDa protein, which is mainly located in the nucleus. Intriguingly, SWK was integrated into an extant pathway of hydrogen peroxide content (folate synthesis pathway) via the upstream gene cytHPPK/DHPS, an Arabidopsis-specific gene that originated from the duplication of mitHPPK/DHPS, and downstream gene GSTF9, to improve seed germination in osmotic stress. In addition, we demonstrated that the presence of SWK may be associated with drought tolerance in natural populations of Arabidopsis. Overall, our study highlights how a de novo gene originated and integrated into the existing pathways to regulate stress adaptation.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11721784/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142886062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Bats are considered natural hosts for numerous viruses. Their ability to carry viruses that cause severe diseases or even death in other mammals without falling ill themselves has attracted widespread research attention. Toll-like receptor 2 forms heterodimers with Toll-like receptor 1 or Toll-like receptor 6 on cell membranes, recognizing specific pathogen-associated molecular patterns and playing a key role in innate immune responses. Previous studies have shown that moderate Toll-like receptor 2-mediated immune signals aid in pathogen clearance, while excessive or inappropriate Toll-like receptor 2-mediated immune signals can cause self-damage. In this study, we observed that TLR2, unlike TLR1 or TLR6, has undergone relaxed selection in bats compared with other mammals, indicating a reduced functional constraint on TLR2 specifically in bats. Indeed, our cell-based functional assays demonstrated that the ability of Toll-like receptor 2 to bind with Toll-like receptor 1 or Toll-like receptor 6 was significantly reduced in bats, leading to dampened inflammatory signaling. We identified mutations unique to bats that were responsible for this observation. Additionally, we found that mutations at residues 375 and 376 of Toll-like receptor 2 in the common ancestor of bats also resulted in reduced inflammatory response, suggesting that this reduction occurred early in bat evolution. Together, our study reveals that the Toll-like receptor 2-mediated inflammatory response has been specifically dampened in bats, which may be one of the reasons why they could harbor many viruses without falling ill.
{"title":"Dampened TLR2-mediated Inflammatory Signaling in Bats.","authors":"Jiaming Zeng, Xiangyi Zhang, Chen Huang, Shilin Tian, Huabin Zhao","doi":"10.1093/molbev/msae253","DOIUrl":"10.1093/molbev/msae253","url":null,"abstract":"<p><p>Bats are considered natural hosts for numerous viruses. Their ability to carry viruses that cause severe diseases or even death in other mammals without falling ill themselves has attracted widespread research attention. Toll-like receptor 2 forms heterodimers with Toll-like receptor 1 or Toll-like receptor 6 on cell membranes, recognizing specific pathogen-associated molecular patterns and playing a key role in innate immune responses. Previous studies have shown that moderate Toll-like receptor 2-mediated immune signals aid in pathogen clearance, while excessive or inappropriate Toll-like receptor 2-mediated immune signals can cause self-damage. In this study, we observed that TLR2, unlike TLR1 or TLR6, has undergone relaxed selection in bats compared with other mammals, indicating a reduced functional constraint on TLR2 specifically in bats. Indeed, our cell-based functional assays demonstrated that the ability of Toll-like receptor 2 to bind with Toll-like receptor 1 or Toll-like receptor 6 was significantly reduced in bats, leading to dampened inflammatory signaling. We identified mutations unique to bats that were responsible for this observation. Additionally, we found that mutations at residues 375 and 376 of Toll-like receptor 2 in the common ancestor of bats also resulted in reduced inflammatory response, suggesting that this reduction occurred early in bat evolution. Together, our study reveals that the Toll-like receptor 2-mediated inflammatory response has been specifically dampened in bats, which may be one of the reasons why they could harbor many viruses without falling ill.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11702297/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142813579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Huaiyan Ren, Thomas K F Wong, Bui Quang Minh, Robert Lanfear
In phylogenetic studies, both partitioned models and mixture models are used to account for heterogeneity in molecular evolution among the sites of DNA sequence alignments. Partitioned models require the user to specify the grouping of sites into subsets, and then assume that each subset of sites can be modeled by a single common process. Mixture models do not require users to prespecify subsets of sites, and instead calculate the likelihood of every site under every model, while co-estimating the model weights and parameters. While much research has gone into the optimization of partitioned models by merging user-specified subsets, there has been less attention paid to the optimization of mixture models for DNA sequence alignments. In this study, we first ask whether a key assumption of partitioned models-that each user-specified subset can be modeled by a single common process-is supported by the data. Having shown that this is not the case, we then design, implement, test, and apply an algorithm, MixtureFinder, to select the optimum number of classes for a mixture model of Q-matrices for the standard models of DNA sequence evolution. We show this algorithm performs well on simulated and empirical datasets and suggest that it may be useful for future empirical studies. MixtureFinder is available in IQ-TREE2, and a tutorial for using MixtureFinder can be found here: http://www.iqtree.org/doc/Complex-Models#mixture-models.
{"title":"MixtureFinder: Estimating DNA Mixture Models for Phylogenetic Analyses.","authors":"Huaiyan Ren, Thomas K F Wong, Bui Quang Minh, Robert Lanfear","doi":"10.1093/molbev/msae264","DOIUrl":"10.1093/molbev/msae264","url":null,"abstract":"<p><p>In phylogenetic studies, both partitioned models and mixture models are used to account for heterogeneity in molecular evolution among the sites of DNA sequence alignments. Partitioned models require the user to specify the grouping of sites into subsets, and then assume that each subset of sites can be modeled by a single common process. Mixture models do not require users to prespecify subsets of sites, and instead calculate the likelihood of every site under every model, while co-estimating the model weights and parameters. While much research has gone into the optimization of partitioned models by merging user-specified subsets, there has been less attention paid to the optimization of mixture models for DNA sequence alignments. In this study, we first ask whether a key assumption of partitioned models-that each user-specified subset can be modeled by a single common process-is supported by the data. Having shown that this is not the case, we then design, implement, test, and apply an algorithm, MixtureFinder, to select the optimum number of classes for a mixture model of Q-matrices for the standard models of DNA sequence evolution. We show this algorithm performs well on simulated and empirical datasets and suggest that it may be useful for future empirical studies. MixtureFinder is available in IQ-TREE2, and a tutorial for using MixtureFinder can be found here: http://www.iqtree.org/doc/Complex-Models#mixture-models.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11704958/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142881819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
T Brock Wooldridge, Sarah M Ford, Holland C Conwell, John Hyde, Kelley Harris, Beth Shapiro
The rate at which mutations arise is a fundamental parameter of biology. Despite progress in measuring germline mutation rates across diverse taxa, such estimates are missing for much of Earth's biodiversity. Here, we present the first estimate of a germline mutation rate from the phylum Mollusca. We sequenced three pedigreed families of the white abalone Haliotis sorenseni, a long-lived, large-bodied, and critically endangered mollusk, and estimated a de novo mutation rate of 8.60 × 10-9 single nucleotide mutations per site per generation. This mutation rate is similar to rates measured in vertebrates with comparable generation times and longevity to abalone, and higher than mutation rates measured in faster-reproducing invertebrates. The spectrum of de novo mutations is also similar to that seen in vertebrate species, although an excess of rare C > A polymorphisms in wild individuals suggests that a modifier allele or environmental exposure may have once increased C > A mutation rates. We use our rate to infer baseline effective population sizes (Ne) across multiple Pacific abalone and find that abalone persisted over most of their evolutionary history as large and stable populations, in contrast to extreme fluctuations over recent history and small census sizes in the present day. We then use our mutation rate to infer the timing and pattern of evolution of the abalone genus Haliotis, which was previously unknown due to few fossil calibrations. Our findings are an important step toward understanding mutation rate evolution and they establish a key parameter for conservation and evolutionary genomics research in mollusks.
{"title":"Direct Measurement of the Mutation Rate and Its Evolutionary Consequences in a Critically Endangered Mollusk.","authors":"T Brock Wooldridge, Sarah M Ford, Holland C Conwell, John Hyde, Kelley Harris, Beth Shapiro","doi":"10.1093/molbev/msae266","DOIUrl":"10.1093/molbev/msae266","url":null,"abstract":"<p><p>The rate at which mutations arise is a fundamental parameter of biology. Despite progress in measuring germline mutation rates across diverse taxa, such estimates are missing for much of Earth's biodiversity. Here, we present the first estimate of a germline mutation rate from the phylum Mollusca. We sequenced three pedigreed families of the white abalone Haliotis sorenseni, a long-lived, large-bodied, and critically endangered mollusk, and estimated a de novo mutation rate of 8.60 × 10-9 single nucleotide mutations per site per generation. This mutation rate is similar to rates measured in vertebrates with comparable generation times and longevity to abalone, and higher than mutation rates measured in faster-reproducing invertebrates. The spectrum of de novo mutations is also similar to that seen in vertebrate species, although an excess of rare C > A polymorphisms in wild individuals suggests that a modifier allele or environmental exposure may have once increased C > A mutation rates. We use our rate to infer baseline effective population sizes (Ne) across multiple Pacific abalone and find that abalone persisted over most of their evolutionary history as large and stable populations, in contrast to extreme fluctuations over recent history and small census sizes in the present day. We then use our mutation rate to infer the timing and pattern of evolution of the abalone genus Haliotis, which was previously unknown due to few fossil calibrations. Our findings are an important step toward understanding mutation rate evolution and they establish a key parameter for conservation and evolutionary genomics research in mollusks.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":"42 1","pages":""},"PeriodicalIF":11.0,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11704959/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142951710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kaustubh Amritkar, Bruno Cuevas-Zuviría, Betül Kaçar
Ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO) is an ancient protein critical for CO2-fixation and global biogeochemistry. Form-I RuBisCO complexes uniquely harbor small subunits that form a hexadecameric complex together with their large subunits. The small subunit protein is thought to have significantly contributed to RuBisCO's response to the atmospheric rise of O2 ∼2.5 billion years ago, marking a pivotal point in the enzyme's evolutionary history. Here, we performed a comprehensive evolutionary analysis of extant and ancestral RuBisCO sequences and structures to explore the impact of the small subunit's earliest integration on the molecular dynamics of the overall complex. Our simulations suggest that the small subunit restricted the conformational flexibility of the large subunit early in its history, impacting the evolutionary trajectory of the Form-I RuBisCO complex. Molecular dynamics investigations of CO2 and O2 gas distribution around predicted ancient RuBisCO complexes suggest that a proposed "CO2-reservoir" role for the small subunit is not conserved throughout the enzyme's evolutionary history. The evolutionary and biophysical response of RuBisCO to changing atmospheric conditions on ancient Earth showcase multi-level and trackable responses of enzymes to environmental shifts over long timescales.
{"title":"Evolutionary Dynamics of RuBisCO: Emergence of the Small Subunit and its Impact Through Time.","authors":"Kaustubh Amritkar, Bruno Cuevas-Zuviría, Betül Kaçar","doi":"10.1093/molbev/msae268","DOIUrl":"10.1093/molbev/msae268","url":null,"abstract":"<p><p>Ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO) is an ancient protein critical for CO2-fixation and global biogeochemistry. Form-I RuBisCO complexes uniquely harbor small subunits that form a hexadecameric complex together with their large subunits. The small subunit protein is thought to have significantly contributed to RuBisCO's response to the atmospheric rise of O2 ∼2.5 billion years ago, marking a pivotal point in the enzyme's evolutionary history. Here, we performed a comprehensive evolutionary analysis of extant and ancestral RuBisCO sequences and structures to explore the impact of the small subunit's earliest integration on the molecular dynamics of the overall complex. Our simulations suggest that the small subunit restricted the conformational flexibility of the large subunit early in its history, impacting the evolutionary trajectory of the Form-I RuBisCO complex. Molecular dynamics investigations of CO2 and O2 gas distribution around predicted ancient RuBisCO complexes suggest that a proposed \"CO2-reservoir\" role for the small subunit is not conserved throughout the enzyme's evolutionary history. The evolutionary and biophysical response of RuBisCO to changing atmospheric conditions on ancient Earth showcase multi-level and trackable responses of enzymes to environmental shifts over long timescales.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":"42 1","pages":""},"PeriodicalIF":11.0,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11707681/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142951723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Plant cells have two major organelles with their own genomes: chloroplasts and mitochondria. While chloroplast genomes tend to be structurally conserved, the mitochondrial genomes of plants, which are much larger than those of animals, are characterized by complex structural variation. We introduce TIPPo, a user-friendly, reference-free assembly tool that uses PacBio high-fidelity long-read data and that does not rely on genomes from related species or nuclear genome information for the assembly of organellar genomes. TIPPo employs a deep learning model for initial read classification and leverages k-mer counting for further refinement, significantly reducing the impact of nuclear insertions of organellar DNA on the assembly process. We used TIPPo to completely assemble a set of 54 complete chloroplast genomes. No other tool was able to completely assemble this set. TIPPo is comparable with PMAT in assembling mitochondrial genomes from most species but does achieve even higher completeness for several species. We also used the assembled organelle genomes to identify instances of nuclear plastid DNA (NUPTs) and nuclear mitochondrial DNA (NUMTs) insertions. The cumulative length of NUPTs/NUMTs positively correlates with the size of the nuclear genome, suggesting that insertions occur stochastically. NUPTs/NUMTs show predominantly C:G to T:A changes, with the mutated cytosines typically found in CG and CHG contexts, suggesting that degradation of NUPT and NUMT sequences is driven by the known elevated mutation rate of methylated cytosines. Small interfering RNA loci are enriched in NUPTs and NUMTs, consistent with the RdDM pathway mediating DNA methylation in these sequences.
{"title":"TIPPo: A User-Friendly Tool for De Novo Assembly of Organellar Genomes with High-Fidelity Data.","authors":"Wenfei Xian, Ilja Bezrukov, Zhigui Bao, Sebastian Vorbrugg, Anupam Gautam, Detlef Weigel","doi":"10.1093/molbev/msae247","DOIUrl":"10.1093/molbev/msae247","url":null,"abstract":"<p><p>Plant cells have two major organelles with their own genomes: chloroplasts and mitochondria. While chloroplast genomes tend to be structurally conserved, the mitochondrial genomes of plants, which are much larger than those of animals, are characterized by complex structural variation. We introduce TIPPo, a user-friendly, reference-free assembly tool that uses PacBio high-fidelity long-read data and that does not rely on genomes from related species or nuclear genome information for the assembly of organellar genomes. TIPPo employs a deep learning model for initial read classification and leverages k-mer counting for further refinement, significantly reducing the impact of nuclear insertions of organellar DNA on the assembly process. We used TIPPo to completely assemble a set of 54 complete chloroplast genomes. No other tool was able to completely assemble this set. TIPPo is comparable with PMAT in assembling mitochondrial genomes from most species but does achieve even higher completeness for several species. We also used the assembled organelle genomes to identify instances of nuclear plastid DNA (NUPTs) and nuclear mitochondrial DNA (NUMTs) insertions. The cumulative length of NUPTs/NUMTs positively correlates with the size of the nuclear genome, suggesting that insertions occur stochastically. NUPTs/NUMTs show predominantly C:G to T:A changes, with the mutated cytosines typically found in CG and CHG contexts, suggesting that degradation of NUPT and NUMT sequences is driven by the known elevated mutation rate of methylated cytosines. Small interfering RNA loci are enriched in NUPTs and NUMTs, consistent with the RdDM pathway mediating DNA methylation in these sequences.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":"42 1","pages":""},"PeriodicalIF":11.0,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11725521/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142971642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}