Pub Date : 2025-10-01Epub Date: 2025-09-29DOI: 10.1007/s00239-025-10273-5
Gary J Ren, Frances E Hauser, Elissa Khodikian, Nathan R Lovejoy, Cosima S Porteus
Hydrogen sulfide (H2S) plays a crucial role in various physiological processes and has been implicated in modulating aging through its impacts on oxidative stress and cellular health. This study investigates the evolutionary dynamics of the cystathionine-β-synthase (CBS), the cystathionine-ƴ-lyase (CSE), and the 3-mercaptopyruvate sulfurtransferase (3-MST) genes, involved in endogenous H2S production. We investigated CBS, CSE, and 3-MST in long-lived mammals, focusing on the naked mole-rat (Heterocephalus glaber). We hypothesized that these genes would exhibit signs of positive selection linked to extended lifespan. Codon-based evolutionary models were used to compare these genes across 114 mammalian species. Our results revealed a relaxation of purifying selection instead of positive selection in CSE, but not CBS and 3-MST, in long-lived mammals, suggesting an accumulation of mutations that enhance physiological functions related to longevity. In the naked mole-rat, CSE exhibited clear signs of positive selection, particularly in residues G116A and T118V, associated with increased protein stability and enhanced binding affinity to the cofactor pyridoxal-5'-phosphate (PLP). These mutations possibly improve CSE catalytic efficiency and potentially H2S production, supporting the naked mole-rat's unique adaptations to its subterranean habitat and contributing to its remarkable lifespan. Our findings indicate a potential correlation between longevity and selection on the CSE gene, but not the CBS and 3-MST genes, highlighting the complex role of H2S in aging and cellular health. The study provides new insights into the molecular mechanisms underlying lifespan extension in long-lived mammals and underscores the potential adaptive significance of H2S biogenesis in the naked mole-rat.
{"title":"Cystathionine Gamma-Lyase: A Potential Longevity Gene Under Selection?","authors":"Gary J Ren, Frances E Hauser, Elissa Khodikian, Nathan R Lovejoy, Cosima S Porteus","doi":"10.1007/s00239-025-10273-5","DOIUrl":"10.1007/s00239-025-10273-5","url":null,"abstract":"<p><p>Hydrogen sulfide (H<sub>2</sub>S) plays a crucial role in various physiological processes and has been implicated in modulating aging through its impacts on oxidative stress and cellular health. This study investigates the evolutionary dynamics of the cystathionine-β-synthase (CBS), the cystathionine-ƴ-lyase (CSE), and the 3-mercaptopyruvate sulfurtransferase (3-MST) genes, involved in endogenous H<sub>2</sub>S production. We investigated CBS, CSE, and 3-MST in long-lived mammals, focusing on the naked mole-rat (Heterocephalus glaber). We hypothesized that these genes would exhibit signs of positive selection linked to extended lifespan. Codon-based evolutionary models were used to compare these genes across 114 mammalian species. Our results revealed a relaxation of purifying selection instead of positive selection in CSE, but not CBS and 3-MST, in long-lived mammals, suggesting an accumulation of mutations that enhance physiological functions related to longevity. In the naked mole-rat, CSE exhibited clear signs of positive selection, particularly in residues G116A and T118V, associated with increased protein stability and enhanced binding affinity to the cofactor pyridoxal-5'-phosphate (PLP). These mutations possibly improve CSE catalytic efficiency and potentially H<sub>2</sub>S production, supporting the naked mole-rat's unique adaptations to its subterranean habitat and contributing to its remarkable lifespan. Our findings indicate a potential correlation between longevity and selection on the CSE gene, but not the CBS and 3-MST genes, highlighting the complex role of H<sub>2</sub>S in aging and cellular health. The study provides new insights into the molecular mechanisms underlying lifespan extension in long-lived mammals and underscores the potential adaptive significance of H<sub>2</sub>S biogenesis in the naked mole-rat.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"665-681"},"PeriodicalIF":1.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145191747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01Epub Date: 2025-10-27DOI: 10.1007/s00239-025-10275-3
Qinglin Liu, Lisiteng Luo, Qinglin Liang, Xiaolian Wu, Ming Zhao, Xi Yuan, Yifeng Lin, Chunmei Shen, Bofeng Zhu
Inferring the biogeographic origin of unknown individual is crucial in forensic practice. To enhance the efficacy of biogeographic origin inference, we previously developed a novel panel containing 56 ancestry-informative insertion/deletions (AI-InDels), three Y-InDels, and the Amelogenin gene, all with amplicons less than 200 bp, facilitating DNA analysis of degraded samples. In this research, we investigated the forensic performance of the InDel panel in the Kazakh group in China, elucidated the genetic structure of Kazakh group, and verified the panel's efficacy in assigning unknown individuals to the appropriate intercontinental regions. The findings demonstrated that the novel panel showed relatively high genetic polymorphisms of autosomal InDels and could serve as an efficient tool for forensic individual identification in Kazakh group. Furthermore, the multiple results of population genetic analyses indicated that the Kazakh group has a mixture of ancestral components from East Asian and European populations, with East Asian ancestry predominating. Using different machine learning models, we found that the novel panel assigned unknown individuals to their intercontinental regions with at least 99% accuracy at the three-continental level and at least 90% accuracy at the five-continental level. In conclusion, we demonstrated that the novel panel can effectively reveal the genetic structure of the group with mixed genetic background and infer the biogeographic origins of unknown individuals with high accuracy.
{"title":"Efficiency of Ancestral Inference and Characterization of Genetic Structure of Chinese Kazakh Group with a Novel InDel Panel.","authors":"Qinglin Liu, Lisiteng Luo, Qinglin Liang, Xiaolian Wu, Ming Zhao, Xi Yuan, Yifeng Lin, Chunmei Shen, Bofeng Zhu","doi":"10.1007/s00239-025-10275-3","DOIUrl":"10.1007/s00239-025-10275-3","url":null,"abstract":"<p><p>Inferring the biogeographic origin of unknown individual is crucial in forensic practice. To enhance the efficacy of biogeographic origin inference, we previously developed a novel panel containing 56 ancestry-informative insertion/deletions (AI-InDels), three Y-InDels, and the Amelogenin gene, all with amplicons less than 200 bp, facilitating DNA analysis of degraded samples. In this research, we investigated the forensic performance of the InDel panel in the Kazakh group in China, elucidated the genetic structure of Kazakh group, and verified the panel's efficacy in assigning unknown individuals to the appropriate intercontinental regions. The findings demonstrated that the novel panel showed relatively high genetic polymorphisms of autosomal InDels and could serve as an efficient tool for forensic individual identification in Kazakh group. Furthermore, the multiple results of population genetic analyses indicated that the Kazakh group has a mixture of ancestral components from East Asian and European populations, with East Asian ancestry predominating. Using different machine learning models, we found that the novel panel assigned unknown individuals to their intercontinental regions with at least 99% accuracy at the three-continental level and at least 90% accuracy at the five-continental level. In conclusion, we demonstrated that the novel panel can effectively reveal the genetic structure of the group with mixed genetic background and infer the biogeographic origins of unknown individuals with high accuracy.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"682-697"},"PeriodicalIF":1.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145377631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01Epub Date: 2025-08-27DOI: 10.1007/s00239-025-10264-6
Alexandre Sita, Gabriela E Birlem, Deivid S da Silva, Larissa Mallmann, Meriane Demoliner, Mariana S da Silva, Juliana S Gularte, Vyctoria M A G Pereira, Paula R de Almeida, Alana W Hansen, André A Witt, Juliane D Fleck, Fernando R Spilki, Daniela T da Rocha, Matheus N Weber
Bats are considered a unique group of mammals and are recognized as natural hosts for a variety of emerging viral agents of One Health importance. In Brazil, nine bat families are described, of which the family Phyllostomidae contains the highest number of cataloged species. Within them, the Pallas's long-tongued bat (Glossophaga soricina) is one of the most widely distributed species in South America. G. soricina are specialized in the use of nectar and pollen as food and therefore are extremely important in the dynamics of ecosystems. In this way, the present study aimed to evaluate the presence of viruses from the families Coronaviridae, Paramyxoviridae, Orthomyxoviridae, Reoviridae, and Rhabdoviridae in G. soricina bats. Samples of oral and rectal swabs from 40 bats in four bat colonies were collected in Southern Brazil. Four samples tested positive for coronavirus by PCR. Following DNA sequencing and phylogenetic analyses, they were classified into the Amalacovirus subgenus, within the alphacoronavirus (α-CoV) genus. In addition, it was possible to observe two distinct strains within the same bat colony. The results obtained reinforce that phyllostomids are the main reservoirs of amalacoviruses. Eco-vigilance in bats with close access to urban environments is essential for monitoring possible future spillover events.
{"title":"Screening for Viruses in Pallas's Long-Tongued Bats (Glossophaga soricina) in Brazil.","authors":"Alexandre Sita, Gabriela E Birlem, Deivid S da Silva, Larissa Mallmann, Meriane Demoliner, Mariana S da Silva, Juliana S Gularte, Vyctoria M A G Pereira, Paula R de Almeida, Alana W Hansen, André A Witt, Juliane D Fleck, Fernando R Spilki, Daniela T da Rocha, Matheus N Weber","doi":"10.1007/s00239-025-10264-6","DOIUrl":"10.1007/s00239-025-10264-6","url":null,"abstract":"<p><p>Bats are considered a unique group of mammals and are recognized as natural hosts for a variety of emerging viral agents of One Health importance. In Brazil, nine bat families are described, of which the family Phyllostomidae contains the highest number of cataloged species. Within them, the Pallas's long-tongued bat (Glossophaga soricina) is one of the most widely distributed species in South America. G. soricina are specialized in the use of nectar and pollen as food and therefore are extremely important in the dynamics of ecosystems. In this way, the present study aimed to evaluate the presence of viruses from the families Coronaviridae, Paramyxoviridae, Orthomyxoviridae, Reoviridae, and Rhabdoviridae in G. soricina bats. Samples of oral and rectal swabs from 40 bats in four bat colonies were collected in Southern Brazil. Four samples tested positive for coronavirus by PCR. Following DNA sequencing and phylogenetic analyses, they were classified into the Amalacovirus subgenus, within the alphacoronavirus (α-CoV) genus. In addition, it was possible to observe two distinct strains within the same bat colony. The results obtained reinforce that phyllostomids are the main reservoirs of amalacoviruses. Eco-vigilance in bats with close access to urban environments is essential for monitoring possible future spillover events.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"600-609"},"PeriodicalIF":1.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144957415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Serpins, characterized by a conserved structural fold, serve diverse biological roles. Protein Z-dependent protease inhibitor (ZPI), a serpin superfamily member, acts as an endogenous anticoagulant by inhibiting clotting factors Xa (fXa) and XIa (fXIa). Beyond anticoagulation, ZPI has roles in inflammation, cancer, and immune regulation. However, its exact pathophysiological role is yet to be fully characterized. To elucidate ZPI's evolutionary trajectory and non-haemostatic roles, we conducted a comprehensive phylogenetic analysis integrating sequence, gene structure, and synteny data. We identified a lamprey-specific serpin, ZPIL_AGTL_PMA, containing both an inhibitory reactive center loop (RCL) and an angiotensin II (Ang II) motif. This finding suggests that ZPIL_AGTL_PMA represents an ancestral bifunctional serpin from which ZPI and angiotensinogen (AGT), a non-inhibitory serpin involved in blood pressure regulation, evolved by sub-functionalization in jawed vertebrates. This bifunctionality within a single gene in lamprey likely reflects an ancestral vertebrate trait. Gene cluster analyses showed serpinA10 (ZPI) as possibly the earliest member, with other Clade A serpins arising via subsequent duplication. The chromosomal location of this gene cluster is conserved in most vertebrates, except Carnivores and Suidea. Sequence analysis indicated potential non-inhibitory ZPI variants in certain species with atypical non-serine residues at the P1' position within its RCL, a critical determinant of inhibitory serpin activity. The close evolutionary relationship between ZPI and AGT further suggests mechanistic interplay between coagulation and blood pressure regulation, highlighting shared regulatory pathways involving these serpins. Together, these findings expand the functional landscape of ZPI and underscore the dynamic evolution of serpin-mediated physiological processes.
{"title":"Unveiling the Evolutionary History and Functional Significance of the Protein Z-Dependent Protease Inhibitor (ZPI) Across Vertebrates.","authors":"Chennakesavan Suganthi, Parthiban Akshayaa, Tanusree Sengupta, Narayanan Manoj","doi":"10.1007/s00239-025-10267-3","DOIUrl":"10.1007/s00239-025-10267-3","url":null,"abstract":"<p><p>Serpins, characterized by a conserved structural fold, serve diverse biological roles. Protein Z-dependent protease inhibitor (ZPI), a serpin superfamily member, acts as an endogenous anticoagulant by inhibiting clotting factors Xa (fXa) and XIa (fXIa). Beyond anticoagulation, ZPI has roles in inflammation, cancer, and immune regulation. However, its exact pathophysiological role is yet to be fully characterized. To elucidate ZPI's evolutionary trajectory and non-haemostatic roles, we conducted a comprehensive phylogenetic analysis integrating sequence, gene structure, and synteny data. We identified a lamprey-specific serpin, ZPIL_AGTL_PMA, containing both an inhibitory reactive center loop (RCL) and an angiotensin II (Ang II) motif. This finding suggests that ZPIL_AGTL_PMA represents an ancestral bifunctional serpin from which ZPI and angiotensinogen (AGT), a non-inhibitory serpin involved in blood pressure regulation, evolved by sub-functionalization in jawed vertebrates. This bifunctionality within a single gene in lamprey likely reflects an ancestral vertebrate trait. Gene cluster analyses showed serpinA10 (ZPI) as possibly the earliest member, with other Clade A serpins arising via subsequent duplication. The chromosomal location of this gene cluster is conserved in most vertebrates, except Carnivores and Suidea. Sequence analysis indicated potential non-inhibitory ZPI variants in certain species with atypical non-serine residues at the P1' position within its RCL, a critical determinant of inhibitory serpin activity. The close evolutionary relationship between ZPI and AGT further suggests mechanistic interplay between coagulation and blood pressure regulation, highlighting shared regulatory pathways involving these serpins. Together, these findings expand the functional landscape of ZPI and underscore the dynamic evolution of serpin-mediated physiological processes.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"636-649"},"PeriodicalIF":1.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145064954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01Epub Date: 2025-09-15DOI: 10.1007/s00239-025-10266-4
Mackenzie C Poirier, Roberta Wright, Marina Cvetkovska
Viridiplantae, a diverse group of green plants and algae that have evolved from a common ancestor, are unified in their ability to produce and use two types of chlorophyll (chlorophyll a and chlorophyll b) to capture light energy. In addition to playing a role in light harvesting, chlorophyll b is required at the appropriate level for the accumulation, assembly, and stability of light-harvesting complexes within the photosynthetic apparatus. Chlorophyll b is synthesized from chlorophyll a by the enzyme chlorophyllide a oxygenase (CAO), a Rieske-type mononuclear non-heme iron oxygenase. A regulatory degron sequence, described in detail only in land plants, regulates the stability of CAO proteins based on the availability of chlorophyll b. Recent identification of CAO gene duplication in bryophyte and green algal species, combined with expanded availability of sequenced genomes within the Viridiplantae, prompted further investigation into the role of gene duplication in the evolution of chlorophyll b biosynthesis. Examination of genomes from 246 plant and algae species revealed independently occurring CAO duplications throughout the Viridiplantae, with a higher prevalence of duplication in land plants compared to their algal relatives. Additionally, we demonstrate that the degron sequence is poorly conserved in chlorophytes, but first appears as a conserved sequence in charophytes, and is very highly conserved among the embryophytes. The evolutionary history and functional role of CAO throughout the Viridiplantae lineage is discussed based on these key observations, adding to our understanding of chlorophyll b biosynthesis and the role of CAO in photosynthetic species.
{"title":"Chlorophyllide a Oxygenase (CAO) Gene Duplication Across the Viridiplantae.","authors":"Mackenzie C Poirier, Roberta Wright, Marina Cvetkovska","doi":"10.1007/s00239-025-10266-4","DOIUrl":"10.1007/s00239-025-10266-4","url":null,"abstract":"<p><p>Viridiplantae, a diverse group of green plants and algae that have evolved from a common ancestor, are unified in their ability to produce and use two types of chlorophyll (chlorophyll a and chlorophyll b) to capture light energy. In addition to playing a role in light harvesting, chlorophyll b is required at the appropriate level for the accumulation, assembly, and stability of light-harvesting complexes within the photosynthetic apparatus. Chlorophyll b is synthesized from chlorophyll a by the enzyme chlorophyllide a oxygenase (CAO), a Rieske-type mononuclear non-heme iron oxygenase. A regulatory degron sequence, described in detail only in land plants, regulates the stability of CAO proteins based on the availability of chlorophyll b. Recent identification of CAO gene duplication in bryophyte and green algal species, combined with expanded availability of sequenced genomes within the Viridiplantae, prompted further investigation into the role of gene duplication in the evolution of chlorophyll b biosynthesis. Examination of genomes from 246 plant and algae species revealed independently occurring CAO duplications throughout the Viridiplantae, with a higher prevalence of duplication in land plants compared to their algal relatives. Additionally, we demonstrate that the degron sequence is poorly conserved in chlorophytes, but first appears as a conserved sequence in charophytes, and is very highly conserved among the embryophytes. The evolutionary history and functional role of CAO throughout the Viridiplantae lineage is discussed based on these key observations, adding to our understanding of chlorophyll b biosynthesis and the role of CAO in photosynthetic species.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"620-635"},"PeriodicalIF":1.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145069770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Codon usage serves as a fundamental viral signature, influencing survival, adaptation, and pathogenicity. Viruses exhibit distinct codon usage patterns shaped by genome composition, host interactions, and evolutionary pressures. The differences between DNA and RNA viruses in codon usage reflect their replication strategies, host preferences, and genome constraints. Viral adaptation to host codon usage, genome size, and lifestyle further shapes translational efficiency and immune evasion mechanisms. Host tRNA abundance plays a crucial role in viral translation rates, while codon deoptimization is a strategy used by viruses to evade immune detection. Additionally, codon bias is linked to viral virulence, replication rates, and pathogenicity. Building on these concepts, this review synthesizes current knowledge on the interplay between virus-host translational interactions, codon bias-driven viral evolution, and their implications for pathogenesis, immune evasion, and epidemiology, while also outlining their practical applications in vaccine development, antiviral strategies, and viral diagnostics. We discuss current challenges in codon usage studies, including context-dependent variations and limited experimental validation, and propose future research directions that integrate computational and experimental approaches to deepen our understanding of viral codon bias and its role in evolution, host adaptation, and disease control.
{"title":"Codon Usage Evolution in Viruses: Implications for Survival and Pathogenicity.","authors":"Shaikh Kaleem, Ujwal Dahal, Shivani Devi, Bhumandeep Kour, Sharanpreet Kour","doi":"10.1007/s00239-025-10263-7","DOIUrl":"10.1007/s00239-025-10263-7","url":null,"abstract":"<p><p>Codon usage serves as a fundamental viral signature, influencing survival, adaptation, and pathogenicity. Viruses exhibit distinct codon usage patterns shaped by genome composition, host interactions, and evolutionary pressures. The differences between DNA and RNA viruses in codon usage reflect their replication strategies, host preferences, and genome constraints. Viral adaptation to host codon usage, genome size, and lifestyle further shapes translational efficiency and immune evasion mechanisms. Host tRNA abundance plays a crucial role in viral translation rates, while codon deoptimization is a strategy used by viruses to evade immune detection. Additionally, codon bias is linked to viral virulence, replication rates, and pathogenicity. Building on these concepts, this review synthesizes current knowledge on the interplay between virus-host translational interactions, codon bias-driven viral evolution, and their implications for pathogenesis, immune evasion, and epidemiology, while also outlining their practical applications in vaccine development, antiviral strategies, and viral diagnostics. We discuss current challenges in codon usage studies, including context-dependent variations and limited experimental validation, and propose future research directions that integrate computational and experimental approaches to deepen our understanding of viral codon bias and its role in evolution, host adaptation, and disease control.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"553-580"},"PeriodicalIF":1.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144992834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01Epub Date: 2025-08-27DOI: 10.1007/s00239-025-10265-5
M S Borges, M M A Salles, E F Oliveira, F M Magalhães, D C Silva, H F M Oliveira, K C Faria, V N Laine, T M Lilley, J M Silva, F M C B Domingos
Molecular data are irreplaceable resources for reconstructing the tree of life. Gene-specific substitution rates are essential for estimating divergence times in the absence of fossil calibration or converting coalescent units into absolute time in phylogeographic approaches, among other uses. However, substitution rate estimates are often derived from limited genomic loci, narrow taxonomic comparisons, and model organisms, hindering their applicability to understudied taxa. Among mammals, bats (Order Chiroptera)-despite their ecological diversity and evolutionary significance-remain underrepresented in substitution rate studies, particularly within the family Vespertilionidae, the third largest mammal family. Here, we investigate mitochondrial genome (mitogenome) evolutionary rates in this group, while also describing the first complete mitogenomes of three Neoeptesicus species: N. brasiliensis, N. diminutus, and N. furinalis. Using fossil-calibrated Bayesian phylogenetic analyses, we estimated that protein-coding genes evolve at rates between 0.0055-0.0089 substitutions per site per million years (subs/site/Ma), while ribosomal RNA genes evolve at rates between 0.0035-0.0049 subs/site/Ma. Notably, the ND4, ND4L, and ND5 genes exhibited the highest rates, whereas non-coding regions showed the lowest, suggesting that gene-specific evolutionary constraints influence these rates. These findings provide the first comprehensive substitution rate framework for Vespertilionidae mitogenomes, addressing a critical gap in genomic resources for this taxonomically complex group. By integrating novel mitogenomic data with rigorous rate estimation, this study advances our capacity to resolve evolutionary patterns in bats, offering a benchmark for future phylogenetic and phylogeographic studies in non-model mammals.
分子数据是重建生命之树不可替代的资源。基因特异性替代率对于在没有化石校准的情况下估计分化时间或在系统地理方法中将聚结单位转换为绝对时间以及其他用途至关重要。然而,替代率估计通常来自有限的基因组位点,狭窄的分类比较和模式生物,阻碍了它们对未充分研究的分类群的适用性。在哺乳动物中,蝙蝠(翼目)尽管具有生态多样性和进化意义,但在替代率研究中仍然缺乏代表性,特别是在第三大哺乳动物科蝙蝠科中。在这里,我们研究了这一群体的线粒体基因组(有丝分裂基因组)进化速率,同时也描述了三种新eptesicus物种:N. brasiliensis, N. diminutus和N. furinalis的第一个完整的有丝分裂基因组。利用化石校准的贝叶斯系统发育分析,我们估计蛋白质编码基因的进化速率为每百万年每位点(亚数/位点/Ma) 0.0055-0.0089个替换,而核糖体RNA基因的进化速率为0.0035-0.0049个亚数/位点/Ma。值得注意的是,ND4、ND4L和ND5基因表现出最高的速率,而非编码区表现出最低的速率,这表明基因特异性进化约束影响了这些速率。这些发现为狐尾虫科有丝分裂基因组提供了第一个全面的替代率框架,解决了这一分类复杂群体基因组资源的关键空白。通过整合新的有丝分裂基因组数据和严格的速率估计,本研究提高了我们解决蝙蝠进化模式的能力,为未来非模式哺乳动物的系统发育和系统地理研究提供了一个基准。
{"title":"Gene-Specific Substitution Rates for the Vespertilionidae (Chiroptera: Mammalia) Mitochondrial Genome, with the Description of Three new Mitogenomes.","authors":"M S Borges, M M A Salles, E F Oliveira, F M Magalhães, D C Silva, H F M Oliveira, K C Faria, V N Laine, T M Lilley, J M Silva, F M C B Domingos","doi":"10.1007/s00239-025-10265-5","DOIUrl":"10.1007/s00239-025-10265-5","url":null,"abstract":"<p><p>Molecular data are irreplaceable resources for reconstructing the tree of life. Gene-specific substitution rates are essential for estimating divergence times in the absence of fossil calibration or converting coalescent units into absolute time in phylogeographic approaches, among other uses. However, substitution rate estimates are often derived from limited genomic loci, narrow taxonomic comparisons, and model organisms, hindering their applicability to understudied taxa. Among mammals, bats (Order Chiroptera)-despite their ecological diversity and evolutionary significance-remain underrepresented in substitution rate studies, particularly within the family Vespertilionidae, the third largest mammal family. Here, we investigate mitochondrial genome (mitogenome) evolutionary rates in this group, while also describing the first complete mitogenomes of three Neoeptesicus species: N. brasiliensis, N. diminutus, and N. furinalis. Using fossil-calibrated Bayesian phylogenetic analyses, we estimated that protein-coding genes evolve at rates between 0.0055-0.0089 substitutions per site per million years (subs/site/Ma), while ribosomal RNA genes evolve at rates between 0.0035-0.0049 subs/site/Ma. Notably, the ND4, ND4L, and ND5 genes exhibited the highest rates, whereas non-coding regions showed the lowest, suggesting that gene-specific evolutionary constraints influence these rates. These findings provide the first comprehensive substitution rate framework for Vespertilionidae mitogenomes, addressing a critical gap in genomic resources for this taxonomically complex group. By integrating novel mitogenomic data with rigorous rate estimation, this study advances our capacity to resolve evolutionary patterns in bats, offering a benchmark for future phylogenetic and phylogeographic studies in non-model mammals.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"610-619"},"PeriodicalIF":1.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144957393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01Epub Date: 2025-09-24DOI: 10.1007/s00239-025-10262-8
Taner Karagöl, Alper Karagöl, Shuguang Zhang
Membrane proteins play fundamental roles in cellular function, yet the evolutionary dynamics of their amino acid composition remain poorly understood. Our current study investigates the substitutional landscape and evolutionary patterns of hydrophilic and hydrophobic residues in membrane α-helical proteins, addressing a significant gap in our knowledge of protein evolution. We analyzed 2277 high-resolution protein structures from the RCSB Protein Data Bank corresponding to 458 unique PDB structures, 504 UniProt transmembrane entries and their AlphaMissense predicted mutational libraries including more than 5.8 million amino acid substitutions, focusing on known transmembrane α-helical proteins in Homo sapiens. Our analysis showed that the pathological outcome of the substitutions is diverse, as nonpolar to polar changes showed higher pathological scores in general. Notably, F <=> Y substitutions showed significantly lower pathological scores. Our further analysis revealed a significant asymmetry in the evolutionary frequencies of polar and nonpolar amino acids. We identified key residue pairs driving this asymmetry, with F <=> Y, A <=> T, V <=> T and A <=> S co-evolution diverging from the expected negative correlations (Spearman's rho > 0.20, p < 0.001). The V <=> T substitution via an alanine intermediate and the G <=> N substitution via a serine intermediate lower their statistical barrier, which would otherwise require two sequential base changes. We propose two evolutionary game theory (EGT) based models to explain their diversification, with partial correlation analysis on residue frequencies in homolog sequences. These mathematical insights suggest a previously unrecognized evolutionary pressure, potentially linked to functional diversification, which could be targeted to combat drug resistance. Our results offer insights into membrane protein evolution and may inform improved methods for protein structure prediction and design.
{"title":"Co-evolution of alpha-helical transmembrane protein residues: large-scale variant profiling and complete mutational landscape of 2277 known PDB entries representing 504 unique human protein sequences.","authors":"Taner Karagöl, Alper Karagöl, Shuguang Zhang","doi":"10.1007/s00239-025-10262-8","DOIUrl":"10.1007/s00239-025-10262-8","url":null,"abstract":"<p><p>Membrane proteins play fundamental roles in cellular function, yet the evolutionary dynamics of their amino acid composition remain poorly understood. Our current study investigates the substitutional landscape and evolutionary patterns of hydrophilic and hydrophobic residues in membrane α-helical proteins, addressing a significant gap in our knowledge of protein evolution. We analyzed 2277 high-resolution protein structures from the RCSB Protein Data Bank corresponding to 458 unique PDB structures, 504 UniProt transmembrane entries and their AlphaMissense predicted mutational libraries including more than 5.8 million amino acid substitutions, focusing on known transmembrane α-helical proteins in Homo sapiens. Our analysis showed that the pathological outcome of the substitutions is diverse, as nonpolar to polar changes showed higher pathological scores in general. Notably, F <=> Y substitutions showed significantly lower pathological scores. Our further analysis revealed a significant asymmetry in the evolutionary frequencies of polar and nonpolar amino acids. We identified key residue pairs driving this asymmetry, with F <=> Y, A <=> T, V <=> T and A <=> S co-evolution diverging from the expected negative correlations (Spearman's rho > 0.20, p < 0.001). The V <=> T substitution via an alanine intermediate and the G <=> N substitution via a serine intermediate lower their statistical barrier, which would otherwise require two sequential base changes. We propose two evolutionary game theory (EGT) based models to explain their diversification, with partial correlation analysis on residue frequencies in homolog sequences. These mathematical insights suggest a previously unrecognized evolutionary pressure, potentially linked to functional diversification, which could be targeted to combat drug resistance. Our results offer insights into membrane protein evolution and may inform improved methods for protein structure prediction and design.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"581-599"},"PeriodicalIF":1.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12579659/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145137830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01Epub Date: 2025-09-19DOI: 10.1007/s00239-025-10269-1
Yaohui Chen, Jung-Hyun Kim, Hee-Sheung Lee, Sergey Koren, Vladimir Larionov, Adam M Phillippy, David Schlessinger, Ramaiah Nagaraja
An open reading frame in the intergenic spacer of human ribosomal (r)DNA codes for a 190 amino acid, 22 kDa protein that we have named ORF3. It comprises a 5'AluSx repeat sequence encoding 96 amino acids followed by a stretch of 94 amino acids containing a unique repeated stretch of 5 hydrophobic residues. Full copies of ORF3 have been isolated as transformation-associated recombination clones from mouse:human hybrid cell lines containing human chromosomes 21 or 22. In initial instances where the chromosome complement of rDNA repeats is fully resolved in whole genomes, in CHM13 cells, complete copies of ORF3 are mainly concentrated in a tandem cluster on chromosome 21, while other chromosomes contain 1 or 2 full copies, with the sequence in other rDNA repeats interrupted by a frameshift mutation. A diploid cell complement (HG002) again has both complete open reading frames (ORFs) and other copies with the frameshift or deletions. In searches among non-human primate sequences to assess the evolutionary history of ORF3, a > 93% conserved copy of the full sequence of the ORF, as well as copies with in-frame deletions, was found in bonobo, but only fragments homologous to the ORF were seen in chimpanzee, orangutan, and gorilla rDNA examined thus far. ORF3 was expressed as a V5-tagged chimeric protein in human kidney epithelial HEK293 cells, and both ORF3-V5 and endogenous ORF3 were detected with a newly generated antibody. The protein is found in both cytoplasm and nucleus. However, upon treatment of cells with RNase A, the protein is excluded from the nucleus, suggesting that it is in complexes with RNA. Although any function is currently unknown, the ORF3 protein is upregulated, speculatively associated with changes in chromatin, in viral-transformed HEK293 cells and in human diploid fibroblast cells rendered senescent by treatment with etoposide, ionizing radiation, or an oxidant (H2O2).
{"title":"Analysis of a Novel Human Protein, ORF3, Encoded by Spacer rDNA.","authors":"Yaohui Chen, Jung-Hyun Kim, Hee-Sheung Lee, Sergey Koren, Vladimir Larionov, Adam M Phillippy, David Schlessinger, Ramaiah Nagaraja","doi":"10.1007/s00239-025-10269-1","DOIUrl":"10.1007/s00239-025-10269-1","url":null,"abstract":"<p><p>An open reading frame in the intergenic spacer of human ribosomal (r)DNA codes for a 190 amino acid, 22 kDa protein that we have named ORF3. It comprises a 5'AluSx repeat sequence encoding 96 amino acids followed by a stretch of 94 amino acids containing a unique repeated stretch of 5 hydrophobic residues. Full copies of ORF3 have been isolated as transformation-associated recombination clones from mouse:human hybrid cell lines containing human chromosomes 21 or 22. In initial instances where the chromosome complement of rDNA repeats is fully resolved in whole genomes, in CHM13 cells, complete copies of ORF3 are mainly concentrated in a tandem cluster on chromosome 21, while other chromosomes contain 1 or 2 full copies, with the sequence in other rDNA repeats interrupted by a frameshift mutation. A diploid cell complement (HG002) again has both complete open reading frames (ORFs) and other copies with the frameshift or deletions. In searches among non-human primate sequences to assess the evolutionary history of ORF3, a > 93% conserved copy of the full sequence of the ORF, as well as copies with in-frame deletions, was found in bonobo, but only fragments homologous to the ORF were seen in chimpanzee, orangutan, and gorilla rDNA examined thus far. ORF3 was expressed as a V5-tagged chimeric protein in human kidney epithelial HEK293 cells, and both ORF3-V5 and endogenous ORF3 were detected with a newly generated antibody. The protein is found in both cytoplasm and nucleus. However, upon treatment of cells with RNase A, the protein is excluded from the nucleus, suggesting that it is in complexes with RNA. Although any function is currently unknown, the ORF3 protein is upregulated, speculatively associated with changes in chromatin, in viral-transformed HEK293 cells and in human diploid fibroblast cells rendered senescent by treatment with etoposide, ionizing radiation, or an oxidant (H<sub>2</sub>O<sub>2</sub>).</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"650-664"},"PeriodicalIF":1.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12579671/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145086309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-01Epub Date: 2025-09-18DOI: 10.1007/s00239-025-10270-8
Ziyi Wang, Yuange Duan
Adenosine-to-inosine RNA editing is a widespread RNA modification that recodes genetic information and expands proteomic diversity. Exploring its evolutionary and adaptive roles requires multi-dimensional analysis, from sequence conservation to functional impact. In this article, we highlight RNA editing as a fascinating bridge between molecular biology and evolutionary theory-two domains used to be treated separately. Remarkably, RNA editing embodies both diversity (as a consequence of mutation, drift, and selection) and conservation (common ancestor), the two foundational principles of Darwin's theory and modern evolutionary research. While some evolutionary studies emphasize what has changed over time, RNA editing additionally draws attention to conserved signatures that persist across lineages, offering a more integrative perspective on how species evolve while remaining connected through shared molecular ancestry.
{"title":"Fascination with RNA Editing: In the Lights of Evolution and Biology.","authors":"Ziyi Wang, Yuange Duan","doi":"10.1007/s00239-025-10270-8","DOIUrl":"10.1007/s00239-025-10270-8","url":null,"abstract":"<p><p>Adenosine-to-inosine RNA editing is a widespread RNA modification that recodes genetic information and expands proteomic diversity. Exploring its evolutionary and adaptive roles requires multi-dimensional analysis, from sequence conservation to functional impact. In this article, we highlight RNA editing as a fascinating bridge between molecular biology and evolutionary theory-two domains used to be treated separately. Remarkably, RNA editing embodies both diversity (as a consequence of mutation, drift, and selection) and conservation (common ancestor), the two foundational principles of Darwin's theory and modern evolutionary research. While some evolutionary studies emphasize what has changed over time, RNA editing additionally draws attention to conserved signatures that persist across lineages, offering a more integrative perspective on how species evolve while remaining connected through shared molecular ancestry.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"543-552"},"PeriodicalIF":1.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145085221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}