首页 > 最新文献

Journal of Molecular Evolution最新文献

英文 中文
Structural and Evolutionary Analysis of Proteins Endowed with a Nucleotidyltransferase, or Non-canonical Palm, Catalytic Domain 具有核苷酸转移酶或非典型棕榈催化结构域的蛋白质的结构和进化分析
IF 3.9 3区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-09-19 DOI: 10.1007/s00239-024-10207-7
Rodrigo Jácome

Many polymerases and other proteins are endowed with a catalytic domain belonging to the nucleotidyltransferase fold, which has also been deemed the non-canonical palm domain, in which three conserved acidic residues coordinate two divalent metal ions. Tertiary structure-based evolutionary analyses provide valuable information when the phylogenetic signal contained in the primary structure is blurry or has been lost, as is the case with these proteins. Pairwise structural comparisons of proteins with a nucleotidyltransferase fold were performed in the PDBefold web server: the RMSD, the number of superimposed residues, and the Qscore were obtained. The structural alignment score (RMSD × 100/number of superimposed residues) and the 1-Qscore were calculated, and distance matrices were constructed, from which a dendogram and a phylogenetic network were drawn for each score. The dendograms and the phylogenetic networks display well-defined clades, reflecting high levels of structural conservation within each clade, not mirrored by primary sequence. The conserved structural core between all these proteins consists of the catalytic nucleotidyltransferase fold, which is surrounded by different functional domains. Hence, many of the clades include proteins that bind different substrates or partake in non-related functions. Enzymes endowed with a nucleotidyltransferase fold are present in all domains of life, and participate in essential cellular and viral functions, which suggests that this domain is very ancient. Despite the loss of evolutionary traces in their primary structure, tertiary structure-based analyses allow us to delve into the evolution and functional diversification of the NT fold.

许多聚合酶和其他蛋白质都有一个属于核苷酸转移酶折叠的催化结构域,该结构域也被认为是非典型的棕榈结构域,其中三个保守的酸性残基协调两个二价金属离子。当一级结构中包含的系统发生学信号模糊或丢失时,基于三级结构的进化分析就能提供有价值的信息。我们在 PDBefold 网络服务器上对具有核苷酸转移酶折叠结构的蛋白质进行了配对结构比较:得出了 RMSD、叠加残基数和 Qscore。计算结构比对得分(RMSD × 100/叠加残基数)和1-Qscore,并构建距离矩阵,根据每个得分绘制树枝图和系统发生网络。树枝图和系统发育网络显示了明确的支系,反映了每个支系内部高度的结构保守性,而主序列并不反映这种保守性。所有这些蛋白质的保守结构核心包括催化核苷酸转移酶折叠,其周围有不同的功能域。因此,许多支系包括结合不同底物或参与非相关功能的蛋白质。具有核苷酸转移酶折叠的酶存在于生命的所有领域,并参与重要的细胞和病毒功能,这表明这一领域非常古老。尽管核苷酸转移酶的一级结构失去了进化的痕迹,但基于三级结构的分析使我们能够深入研究核苷酸转移酶折叠的进化和功能多样化。
{"title":"Structural and Evolutionary Analysis of Proteins Endowed with a Nucleotidyltransferase, or Non-canonical Palm, Catalytic Domain","authors":"Rodrigo Jácome","doi":"10.1007/s00239-024-10207-7","DOIUrl":"https://doi.org/10.1007/s00239-024-10207-7","url":null,"abstract":"<p>Many polymerases and other proteins are endowed with a catalytic domain belonging to the nucleotidyltransferase fold, which has also been deemed the non-canonical palm domain, in which three conserved acidic residues coordinate two divalent metal ions. Tertiary structure-based evolutionary analyses provide valuable information when the phylogenetic signal contained in the primary structure is blurry or has been lost, as is the case with these proteins. Pairwise structural comparisons of proteins with a nucleotidyltransferase fold were performed in the PDBefold web server: the RMSD, the number of superimposed residues, and the Qscore were obtained. The structural alignment score (RMSD × 100/number of superimposed residues) and the 1-Qscore were calculated, and distance matrices were constructed, from which a dendogram and a phylogenetic network were drawn for each score. The dendograms and the phylogenetic networks display well-defined clades, reflecting high levels of structural conservation within each clade, not mirrored by primary sequence. The conserved structural core between all these proteins consists of the catalytic nucleotidyltransferase fold, which is surrounded by different functional domains. Hence, many of the clades include proteins that bind different substrates or partake in non-related functions. Enzymes endowed with a nucleotidyltransferase fold are present in all domains of life, and participate in essential cellular and viral functions, which suggests that this domain is very ancient. Despite the loss of evolutionary traces in their primary structure, tertiary structure-based analyses allow us to delve into the evolution and functional diversification of the NT fold.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":"32 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Untangling Zebrafish Genetic Annotation: Addressing Complexities and Nomenclature Issues in Orthologous Evaluation of TCOF1 and NOLC1 解开斑马鱼基因注释:解决 TCOF1 和 NOLC1 同源物评估中的复杂性和命名问题
IF 3.9 3区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-09-13 DOI: 10.1007/s00239-024-10200-0
Guillermina Hill-Terán, Julieta Petrich, Maria Lorena Falcone Ferreyra, Manuel J. Aybar, Gabriela Coux

Treacher Collins syndrome (TCS) is a genetic disorder affecting facial development, primarily caused by mutations in the TCOF1 gene. TCOF1, along with NOLC1, play important roles in ribosomal RNA transcription and processing. Previously, a zebrafish model of TCS successfully recapitulated the main characteristics of the syndrome by knocking down the expression of a gene on chromosome 13 (coding for Uniprot ID B8JIY2), which was identified as the TCOF1 orthologue. However, database updates renamed this gene as nolc1 and the zebrafish database (ZFIN) identified a different gene on chromosome 14 as the TCOF1 orthologue (coding for Uniprot ID E7F9D9). NOLC1 and TCOF1 are large proteins with unstructured regions and repetitive sequences that complicate alignments and comparisons. Also, the additional whole genome duplication of teleosts sets further difficulty. In this study, we present evidence that endorses that NOLC1 and TCOF1 are paralogs, and that the zebrafish gene on chromosome 14 is a low-complexity LisH domain-containing factor that displays homology to NOLC1 but lacks essential sequence features to accomplish TCOF1 nucleolar functions. Our analysis also supports the idea that zebrafish, as has been suggested for other non-tetrapod vertebrates, lack the TCOF1 gene that is associated with tripartite nucleolus. Using BLAST searches in a group of teleost genomes, we identified fish-specific sequences similar to E7F9D9 zebrafish protein. We propose naming them “LisH-containing Low Complexity Proteins” (LLCP). Interestingly, the gene on chromosome 13 (nolc1) displays the sequence features, developmental expression patterns, and phenotypic impact of depletion that are characteristic of TCOF1 functions. These findings suggest that in teleost fish, the nucleolar functions described for both NOLC1 and TCOF1 mediated by their repeated motifs, are carried out by a single gene, nolc1. Our study, which is mainly based on computational tools available as free web-based algorithms, could help to solve similar conflicts regarding gene orthology in zebrafish.

特雷撤-科林斯综合征(TCS)是一种影响面部发育的遗传性疾病,主要由 TCOF1 基因突变引起。TCOF1 和 NOLC1 在核糖体 RNA 的转录和处理中发挥着重要作用。此前,通过敲除 13 号染色体上一个基因(编码为 Uniprot ID B8JIY2)的表达,一个 TCS 斑马鱼模型成功重现了该综合征的主要特征。然而,数据库更新后将该基因重新命名为 nolc1,斑马鱼数据库(ZFIN)则将 14 号染色体上的另一个基因确定为 TCOF1 同源物(编码为 Uniprot ID E7F9D9)。NOLC1 和 TCOF1 都是大型蛋白质,其非结构化区域和重复序列使比对和比较变得复杂。此外,长臂猿额外的全基因组复制也增加了难度。在本研究中,我们提出的证据支持 NOLC1 和 TCOF1 是旁系亲属,14 号染色体上的斑马鱼基因是一种低复杂度的含 LisH 结构域的因子,与 NOLC1 同源,但缺乏完成 TCOF1 核极功能的基本序列特征。我们的分析还支持这样一种观点,即斑马鱼与其他非四足类脊椎动物一样,缺乏与三方核仁相关的 TCOF1 基因。通过在一组远志动物基因组中进行 BLAST 搜索,我们发现了与 E7F9D9 斑马鱼蛋白相似的鱼类特异性序列。我们建议将它们命名为 "含 LisH 的低复杂性蛋白"(LLCP)。有趣的是,第 13 号染色体上的基因(nolc1)显示了 TCOF1 功能所特有的序列特征、发育表达模式和缺失的表型影响。这些研究结果表明,在长臂猿鱼类中,NOLC1 和 TCOF1 通过其重复基团介导的核小体功能是由一个基因 nolc1 实现的。我们的研究主要是基于免费网络算法的计算工具,可以帮助解决斑马鱼中类似的基因同源冲突。
{"title":"Untangling Zebrafish Genetic Annotation: Addressing Complexities and Nomenclature Issues in Orthologous Evaluation of TCOF1 and NOLC1","authors":"Guillermina Hill-Terán, Julieta Petrich, Maria Lorena Falcone Ferreyra, Manuel J. Aybar, Gabriela Coux","doi":"10.1007/s00239-024-10200-0","DOIUrl":"https://doi.org/10.1007/s00239-024-10200-0","url":null,"abstract":"<p>Treacher Collins syndrome (TCS) is a genetic disorder affecting facial development, primarily caused by mutations in the <i>TCOF1</i> gene. TCOF1, along with NOLC1, play important roles in ribosomal RNA transcription and processing. Previously, a zebrafish model of TCS successfully recapitulated the main characteristics of the syndrome by knocking down the expression of a gene on chromosome 13 (coding for Uniprot ID B8JIY2), which was identified as the <i>TCOF1</i> orthologue. However, database updates renamed this gene as <i>nolc1</i> and the zebrafish database (ZFIN) identified a different gene on chromosome 14 as the <i>TCOF1</i> orthologue (coding for Uniprot ID E7F9D9). NOLC1 and TCOF1 are large proteins with unstructured regions and repetitive sequences that complicate alignments and comparisons. Also, the additional whole genome duplication of teleosts sets further difficulty. In this study, we present evidence that endorses that <i>NOLC1</i> and <i>TCOF1</i> are paralogs, and that the zebrafish gene on chromosome 14 is a low-complexity LisH domain-containing factor that displays homology to NOLC1 but lacks essential sequence features to accomplish TCOF1 nucleolar functions. Our analysis also supports the idea that zebrafish, as has been suggested for other non-tetrapod vertebrates, lack the <i>TCOF1</i> gene that is associated with tripartite nucleolus. Using BLAST searches in a group of teleost genomes, we identified fish-specific sequences similar to E7F9D9 zebrafish protein. We propose naming them “LisH-containing Low Complexity Proteins” (LLCP). Interestingly, the gene on chromosome 13 <i>(nolc1</i>) displays the sequence features, developmental expression patterns, and phenotypic impact of depletion that are characteristic of <i>TCOF1</i> functions. These findings suggest that in teleost fish, the nucleolar functions described for both <i>NOLC1</i> and <i>TCOF1</i> mediated by their repeated motifs, are carried out by a single gene, <i>nolc1</i>. Our study, which is mainly based on computational tools available as free web-based algorithms, could help to solve similar conflicts regarding gene orthology in zebrafish.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":"5 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recurrent Independent Pseudogenization Events of the Sperm Fertilization Gene ZP3r in Apes and Monkeys 猿猴精子受精基因 ZP3r 的反复独立假基因化事件
IF 3.9 3区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-09-12 DOI: 10.1007/s00239-024-10192-x
J. A. Carlisle, D. H. Gurbuz, W. J. Swanson

Many reproductive proteins show signatures of rapid evolution through sequence divergence and duplication. These features of reproductive genes may complicate the detection of orthologs across taxa, making it difficult to connect studies in model systems to human biology. In mice, ZP3r/sp56 is a binding partner to the egg coat protein ZP3 and may mediate induction of the acrosome reaction, a crucial step in fertilization. In rodents, ZP3r, as a member of the Regulators of Complement Activation cluster, is surrounded by paralogs, some of which have been shown to be evolving under positive selection. Although primate egg coats also contain ZP3, sequence divergence paired with paralogous relationships with neighboring genes has complicated the accurate identification of the human ZP3r ortholog. Here, we phylogenetically and syntenically resolve that the human ortholog of ZP3r is the pseudogene C4BPAP1. We investigate the evolution of this gene within primates. We observe independent pseudogenization events of ZP3r in all Apes with the exception of Orangutans, and independent pseudogenization events in many monkey species. ZP3r in both primates that retain ZP3r and in rodents contains positively selected sites. We hypothesize that redundant mechanisms mediate ZP3 recognition in mammals and ZP3r’s relative importance to ZP recognition varies across species.

许多生殖蛋白通过序列分化和复制显示出快速进化的特征。生殖基因的这些特点可能会使跨类群的同源物检测变得复杂,从而难以将模型系统中的研究与人类生物学联系起来。在小鼠中,ZP3r/sp56 是卵壳蛋白 ZP3 的结合伙伴,可能介导顶体反应的诱导,而顶体反应是受精的关键步骤。在啮齿类动物中,ZP3r 作为补体激活调节因子集群的成员,被一些旁系亲属所包围,其中一些旁系亲属已被证明是在正向选择下进化的。虽然灵长类动物的卵膜也含有 ZP3,但由于序列差异以及与邻近基因的旁系亲属关系,使得准确鉴定人类 ZP3r 直向同源物的工作变得复杂。在这里,我们从系统发育和同源关系上确定了人类 ZP3r 的直向同源物是假基因 C4BPAP1。我们研究了该基因在灵长类动物中的进化。我们在除猩猩以外的所有类人猿中观察到了 ZP3r 的独立假基因事件,在许多猴类中也观察到了独立的假基因事件。在保留 ZP3r 的灵长类动物和啮齿类动物中,ZP3r 都含有正选位点。我们假设,哺乳动物的 ZP3 识别机制是冗余的,ZP3r 对 ZP 识别的相对重要性因物种而异。
{"title":"Recurrent Independent Pseudogenization Events of the Sperm Fertilization Gene ZP3r in Apes and Monkeys","authors":"J. A. Carlisle, D. H. Gurbuz, W. J. Swanson","doi":"10.1007/s00239-024-10192-x","DOIUrl":"https://doi.org/10.1007/s00239-024-10192-x","url":null,"abstract":"<p>Many reproductive proteins show signatures of rapid evolution through sequence divergence and duplication. These features of reproductive genes may complicate the detection of orthologs across taxa, making it difficult to connect studies in model systems to human biology. In mice, ZP3r/sp56 is a binding partner to the egg coat protein ZP3 and may mediate induction of the acrosome reaction, a crucial step in fertilization. In rodents, ZP3r, as a member of the Regulators of Complement Activation cluster, is surrounded by paralogs, some of which have been shown to be evolving under positive selection. Although primate egg coats also contain ZP3, sequence divergence paired with paralogous relationships with neighboring genes has complicated the accurate identification of the human ZP3r ortholog. Here, we phylogenetically and syntenically resolve that the human ortholog of ZP3r is the pseudogene <i>C4BPAP1</i>. We investigate the evolution of this gene within primates. We observe independent pseudogenization events of ZP3r in all Apes with the exception of Orangutans, and independent pseudogenization events in many monkey species. ZP3r in both primates that retain ZP3r and in rodents contains positively selected sites. We hypothesize that redundant mechanisms mediate ZP3 recognition in mammals and ZP3r’s relative importance to ZP recognition varies across species.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":"25 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unveiling the Genomic Symphony: Identification Cultivar-Specific Genes and Enhanced Insights on Sweet Sorghum Genomes Through Comprehensive superTranscriptomic Analysis 揭开基因组交响乐的神秘面纱:通过全面的超转录组分析鉴定栽培品种特异基因并加深对甜高粱基因组的了解
IF 3.9 3区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-09-11 DOI: 10.1007/s00239-024-10198-5
Shinde Nikhil, Habeeb Shaikh Mohideen, Raja Natesan Sella

Sorghum (Sorghum bicolor (L.) Moench) is a multipurpose crop grown for food, fodder, and bioenergy production. Its cultivated varieties, along with their wild counterparts, contribute to the core genetic pool. Despite the availability of several re-sequenced sorghum genomes, a variable portion of sorghum genomes is not reported during reference genome assembly and annotation. The present analysis used 223 publicly available RNA-seq datasets from seven sweet sorghum cultivars to construct superTranscriptome. This approach yielded 45,864 Representative Transcript Assemblies (RTAs) that showcased intriguing Presence/Absence Variation (PAV) across 15 published sorghum genomes. We found 301 superTranscripts were exclusive to sweet sorghum, including 58 de novo genes encoded core and linker histones, zinc finger domains, glucosyl transferases, cellulose synthase, etc. The superTranscriptome added 2,802 new protein-coding genes to the Sweet Sorghum Reference Genome (SSRG), of which 559 code for different transcription factors (TFs). Our analysis revealed that MULE-like transposases were abundant in the sweet sorghum genome and could play a hidden role in the evolution of sweet sorghum. We observed large deletions in the D locus and terminal deletions in four other NAC encoding loci in the SSRG compared to its wild progenitor (353) suggesting non-functional NAC genes contributed to trait development in sweet sorghum. Moreover, superTranscript-based methods for Differential Exon Usage (DEU) and Differential Gene Expression (DGE) analyses were more accurate than those based on the SSRG. This study demonstrates that the superTranscriptome can enhance our understanding of fundamental sorghum mechanisms, improve genome annotations, and potentially even replace the reference genome.

高粱(Sorghum bicolor (L.) Moench)是一种多用途作物,用于生产粮食、饲料和生物能源。高粱的栽培品种及其野生品种为核心基因库做出了贡献。尽管已经有了几个重新测序的高粱基因组,但在参考基因组的组装和注释过程中,高粱基因组的不同部分并没有被报告。本分析利用来自 7 个甜高粱栽培品种的 223 个公开可用的 RNA-seq 数据集构建了 superTranscriptome。这种方法产生了 45,864 个代表性转录本组装(RTA),展示了 15 个已发表的高粱基因组中耐人寻味的存在/不存在变异(PAV)。我们发现 301 个超级转录本是甜高粱独有的,其中包括 58 个编码核心和连接组蛋白、锌指结构域、葡糖基转移酶、纤维素合成酶等的新基因。超级转录组为甜高粱参考基因组(SSRG)增加了 2,802 个新的蛋白质编码基因,其中 559 个编码不同的转录因子(TFs)。我们的分析表明,甜高粱基因组中存在大量类似 MULE 的转座酶,它们可能在甜高粱的进化过程中扮演着隐秘的角色。与甜高粱的野生祖先(353)相比,我们在 SSRG 中观察到 D 基因座的大量缺失和其他四个 NAC 编码基因座的末端缺失,这表明非功能性 NAC 基因对甜高粱的性状发展起到了促进作用。此外,基于超级转录本的差异外显子使用(DEU)和差异基因表达(DGE)分析方法比基于 SSRG 的分析方法更为准确。这项研究表明,超级转录组可以增强我们对高粱基本机制的了解,改进基因组注释,甚至有可能取代参考基因组。
{"title":"Unveiling the Genomic Symphony: Identification Cultivar-Specific Genes and Enhanced Insights on Sweet Sorghum Genomes Through Comprehensive superTranscriptomic Analysis","authors":"Shinde Nikhil, Habeeb Shaikh Mohideen, Raja Natesan Sella","doi":"10.1007/s00239-024-10198-5","DOIUrl":"https://doi.org/10.1007/s00239-024-10198-5","url":null,"abstract":"<p>Sorghum (<i>Sorghum bicolor (L.) Moench</i>) is a multipurpose crop grown for food, fodder, and bioenergy production. Its cultivated varieties, along with their wild counterparts, contribute to the core genetic pool. Despite the availability of several re-sequenced sorghum genomes, a variable portion of sorghum genomes is not reported during reference genome assembly and annotation. The present analysis used 223 publicly available RNA-seq datasets from seven sweet sorghum cultivars to construct superTranscriptome. This approach yielded 45,864 Representative Transcript Assemblies (RTAs) that showcased intriguing Presence/Absence Variation (PAV) across 15 published sorghum genomes. We found 301 superTranscripts were exclusive to sweet sorghum, including 58 de novo genes encoded core and linker histones, zinc finger domains, glucosyl transferases, cellulose synthase, etc. The superTranscriptome added 2,802 new protein-coding genes to the Sweet Sorghum Reference Genome (SSRG), of which 559 code for different transcription factors (TFs). Our analysis revealed that MULE-like transposases were abundant in the sweet sorghum genome and could play a hidden role in the evolution of sweet sorghum. We observed large deletions in the D locus and terminal deletions in four other NAC encoding loci in the SSRG compared to its wild progenitor (353) suggesting non-functional NAC genes contributed to trait development in sweet sorghum. Moreover, superTranscript-based methods for Differential Exon Usage (DEU) and Differential Gene Expression (DGE) analyses were more accurate than those based on the SSRG. This study demonstrates that the superTranscriptome can enhance our understanding of fundamental sorghum mechanisms, improve genome annotations, and potentially even replace the reference genome.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":"12 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Nature of the Last Common Ancestor: A Story from its Translation Machinery 论最后共同祖先的性质:翻译机械的故事
IF 3.9 3区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-09-11 DOI: 10.1007/s00239-024-10199-4
Mario Rivas, George E. Fox

The Last Common Ancestor (LCA) is understood as a hypothetical population of organisms from which all extant living creatures are thought to have descended. Its biology and environment have been and continue to be the subject of discussions within the scientific community. Since the first bacterial genomes were obtained, multiple attempts to reconstruct the genetic content of the LCA have been made. In this review, we compare 10 of the most extensive reconstructions of the gene content possessed by the LCA as they relate to aspects of the translation machinery. Although each reconstruction has its own methodological biases and many disagree in the metabolic nature of the LCA all, to some extent, indicate that several components of the translation machinery are among the most conserved genetic elements. The datasets from each reconstruction clearly show that the LCA already had a largely complete translational system with a genetic code already in place and therefore was not a progenote. Among these features several ribosomal proteins, transcription factors like IF2, EF-G, and EF-Tu and both class I and class II aminoacyl tRNA synthetases were found in essentially all reconstructions. Due to the limitations of the various methodologies, some features such as the occurrence of rRNA posttranscriptional modified bases are not fully addressed. However, conserved as it is, non-universal ribosomal features found in various reconstructions indicate that LCA’s translation machinery was still evolving, thereby acquiring the domain specific features in the process. Although progenotes from the pre-LCA likely no longer exist recent results obtained by unraveling the early history of the ribosome and other genetic processes can provide insight to the nature of the pre-LCA world.

最后共同祖先(LCA)被理解为一个假设的生物种群,所有现存生物都被认为是它的后裔。其生物学和环境一直是科学界讨论的主题。自首次获得细菌基因组以来,人们已多次尝试重建 LCA 的基因内容。在这篇综述中,我们比较了 10 种最广泛的 LCA 基因内容重建,因为它们都与翻译机器的各个方面有关。尽管每种重建都有其方法上的偏差,而且许多重建在 LCA 的新陈代谢性质方面存在分歧,但所有重建都在一定程度上表明,翻译机器的几个组成部分是最保守的遗传元素之一。每次重建的数据集都清楚地表明,LCA 已经有了一个基本完整的翻译系统,遗传密码已经到位,因此不是原生生物。在这些特征中,一些核糖体蛋白、转录因子(如 IF2、EF-G 和 EF-Tu)以及 I 类和 II 类氨基酰 tRNA 合成酶基本上在所有的重建中都能找到。由于各种方法的局限性,一些特征如 rRNA 转录后修饰碱基的出现并没有得到充分的研究。然而,在各种重建中发现的非通用核糖体特征表明,LCA 的翻译机制仍在不断进化,从而在这一过程中获得了特定领域的特征。尽管前 LCA 的祖先很可能已不复存在,但通过揭示核糖体的早期历史和其他遗传过程所获得的最新结果,可以让我们深入了解前 LCA 世界的本质。
{"title":"On the Nature of the Last Common Ancestor: A Story from its Translation Machinery","authors":"Mario Rivas, George E. Fox","doi":"10.1007/s00239-024-10199-4","DOIUrl":"https://doi.org/10.1007/s00239-024-10199-4","url":null,"abstract":"<p>The Last Common Ancestor (LCA) is understood as a hypothetical population of organisms from which all extant living creatures are thought to have descended. Its biology and environment have been and continue to be the subject of discussions within the scientific community. Since the first bacterial genomes were obtained, multiple attempts to reconstruct the genetic content of the LCA have been made. In this review, we compare 10 of the most extensive reconstructions of the gene content possessed by the LCA as they relate to aspects of the translation machinery. Although each reconstruction has its own methodological biases and many disagree in the metabolic nature of the LCA all, to some extent, indicate that several components of the translation machinery are among the most conserved genetic elements. The datasets from each reconstruction clearly show that the LCA already had a largely complete translational system with a genetic code already in place and therefore was not a <i>progenote</i>. Among these features several ribosomal proteins, transcription factors like IF2, EF-G, and EF-Tu and both class I and class II aminoacyl tRNA synthetases were found in essentially all reconstructions. Due to the limitations of the various methodologies, some features such as the occurrence of rRNA posttranscriptional modified bases are not fully addressed. However, conserved as it is, non-universal ribosomal features found in various reconstructions indicate that LCA’s translation machinery was still evolving, thereby acquiring the domain specific features in the process. Although progenotes from the pre-LCA likely no longer exist recent results obtained by unraveling the early history of the ribosome and other genetic processes can provide insight to the nature of the pre-LCA world.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":"55 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of Cancer-Resisting Evolutionary Adaptations in Wild Animals and Applications for Human Oncology 野生动物抗癌进化适应性分析及在人类肿瘤学中的应用
IF 3.9 3区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-09-11 DOI: 10.1007/s00239-024-10204-w
Bokai K. Zhang, Leonard Gines

This literature review is to present a new direction in developing better treatment or preventive measures. The larger the body of an organism, the more numerous the cells, which theoretically lead to a higher risk of cancer. However, observational studies suggest the lack of correlation between body size and cancer risk, which is known as Peto’s paradox. The corollary of Peto’s paradox is that large organisms must be cancer-resistant. Further investigation of the anti-cancer mechanisms in each species could be potentially rewarding, and how the anti-cancer mechanisms found in wild animals can help influence and develop more effective cancer treatment in humans is the main focus of this literature review. Due to a lack of research and understanding of the exact molecular mechanisms of the researched species, only a few (Elephants and rodents) that have been extensively researched have made substantive contributions to human oncology. A new research direction is to investigate the positively selective genes that are related to cancer resistance and see if homologous genes are presented in humans. Despite the great obstacle of applying anti-cancer mechanisms to the human body from phylogenetically distant species, this research direction of gaining insights through investigating cancer-resisting evolutionary adaptations in wild animals has great potential in human oncology research.

本文献综述旨在为制定更好的治疗或预防措施提供新的方向。生物体越大,细胞数量越多,理论上患癌症的风险就越高。然而,观察性研究表明,体型与癌症风险之间缺乏相关性,这就是所谓的佩托悖论(Peto's paradox)。佩托悖论的推论是,大型生物一定具有抗癌能力。进一步研究每个物种的抗癌机制可能会有所收获,而在野生动物中发现的抗癌机制如何有助于影响和开发更有效的人类癌症治疗方法,正是本文献综述的重点。由于缺乏对被研究物种确切分子机制的研究和了解,只有少数被广泛研究的物种(大象和啮齿类动物)对人类肿瘤学做出了实质性贡献。一个新的研究方向是调查与抗癌有关的正选择基因,看看人类是否存在同源基因。尽管将系统发育遥远的物种的抗癌机制应用于人体存在巨大障碍,但通过调查野生动物的抗癌进化适应性来获得启示的这一研究方向在人类肿瘤学研究中具有巨大潜力。
{"title":"Analysis of Cancer-Resisting Evolutionary Adaptations in Wild Animals and Applications for Human Oncology","authors":"Bokai K. Zhang, Leonard Gines","doi":"10.1007/s00239-024-10204-w","DOIUrl":"https://doi.org/10.1007/s00239-024-10204-w","url":null,"abstract":"<p>This literature review is to present a new direction in developing better treatment or preventive measures. The larger the body of an organism, the more numerous the cells, which theoretically lead to a higher risk of cancer. However, observational studies suggest the lack of correlation between body size and cancer risk, which is known as Peto’s paradox. The corollary of Peto’s paradox is that large organisms must be cancer-resistant. Further investigation of the anti-cancer mechanisms in each species could be potentially rewarding, and how the anti-cancer mechanisms found in wild animals can help influence and develop more effective cancer treatment in humans is the main focus of this literature review. Due to a lack of research and understanding of the exact molecular mechanisms of the researched species, only a few (Elephants and rodents) that have been extensively researched have made substantive contributions to human oncology. A new research direction is to investigate the positively selective genes that are related to cancer resistance and see if homologous genes are presented in humans. Despite the great obstacle of applying anti-cancer mechanisms to the human body from phylogenetically distant species, this research direction of gaining insights through investigating cancer-resisting evolutionary adaptations in wild animals has great potential in human oncology research.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":"4 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diversification of von Willebrand Factor A and Chitin-Binding Domains in Pif/BMSPs Among Mollusks. 软体动物中 Pif/BMSP 中 von Willebrand Factor A 和几丁质结合域的多样化。
IF 2.1 3区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-08-01 Epub Date: 2024-06-12 DOI: 10.1007/s00239-024-10180-1
Keisuke Shimizu, Lumi Negishi, Hitoshi Kurumizaka, Michio Suzuki

Pif is a shell matrix protein (SMP) identified in the nacreous layer of Pinctada fucata (Pfu) comprised two proteins, Pif97 and Pif 80. Pif97 contains a von Willebrand factor A (VWA) and chitin-binding domains, whereas Pif80 can bind calcium carbonate crystals. The VWA domain is conserved in the SMPs of various mollusk species; however, their phylogenetic relationship remains obscure. Furthermore, although the VWA domain participates in protein-protein interactions, its role in shell formation has not been established. Accordingly, in the current study, we investigate the phylogenetic relationship between PfuPif and other VWA domain-containing proteins in major mollusk species. The shell-related proteins containing VWA domains formed a large clade (the Pif/BMSP family) and were classified into eight subfamilies with unique sequential features, expression patterns, and taxa diversity. Furthermore, a pull-down assay using recombinant proteins containing the VWA domain of PfuPif 97 revealed that the VWA domain interacts with five nacreous layer-related SMPs of P. fucata, including Pif 80 and nacrein. Collectively, these results suggest that the VWA domain is important in the formation of organic complexes and participates in shell mineralisation.

Pif 是一种贝壳基质蛋白(SMP),已在 Pinctada fucata(Pfu)的珍珠层中发现,由 Pif97 和 Pif 80 两种蛋白组成。Pif97 含有一个 von Willebrand 因子 A(VWA)和几丁质结合结构域,而 Pif80 可结合碳酸钙晶体。VWA 结构域在各种软体动物的 SMPs 中都是保守的,但它们之间的系统发育关系仍然不清楚。此外,尽管 VWA 结构域参与蛋白质之间的相互作用,但其在贝壳形成过程中的作用尚未确定。因此,在本研究中,我们研究了主要软体动物物种中 PfuPif 与其他含 VWA 结构域的蛋白质之间的系统发育关系。含有 VWA 结构域的贝壳相关蛋白形成了一个大支系(Pif/BMSP 家族),并被分为八个亚家族,这些亚家族具有独特的序列特征、表达模式和类群多样性。此外,利用含有 PfuPif 97 的 VWA 结构域的重组蛋白进行的牵引试验发现,VWA 结构域与五种与珍珠层相关的岩浆蛋白(包括 Pif 80 和珍珠蛋白)相互作用。总之,这些结果表明,VWA结构域在有机复合物的形成过程中非常重要,并参与了贝壳矿化过程。
{"title":"Diversification of von Willebrand Factor A and Chitin-Binding Domains in Pif/BMSPs Among Mollusks.","authors":"Keisuke Shimizu, Lumi Negishi, Hitoshi Kurumizaka, Michio Suzuki","doi":"10.1007/s00239-024-10180-1","DOIUrl":"10.1007/s00239-024-10180-1","url":null,"abstract":"<p><p>Pif is a shell matrix protein (SMP) identified in the nacreous layer of Pinctada fucata (Pfu) comprised two proteins, Pif97 and Pif 80. Pif97 contains a von Willebrand factor A (VWA) and chitin-binding domains, whereas Pif80 can bind calcium carbonate crystals. The VWA domain is conserved in the SMPs of various mollusk species; however, their phylogenetic relationship remains obscure. Furthermore, although the VWA domain participates in protein-protein interactions, its role in shell formation has not been established. Accordingly, in the current study, we investigate the phylogenetic relationship between PfuPif and other VWA domain-containing proteins in major mollusk species. The shell-related proteins containing VWA domains formed a large clade (the Pif/BMSP family) and were classified into eight subfamilies with unique sequential features, expression patterns, and taxa diversity. Furthermore, a pull-down assay using recombinant proteins containing the VWA domain of PfuPif 97 revealed that the VWA domain interacts with five nacreous layer-related SMPs of P. fucata, including Pif 80 and nacrein. Collectively, these results suggest that the VWA domain is important in the formation of organic complexes and participates in shell mineralisation.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"415-431"},"PeriodicalIF":2.1,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11291548/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141306068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sampling Strategies for Experimentally Mapping Molecular Fitness Landscapes Using High-Throughput Methods. 使用高通量方法实验绘制分子健壮性景观的取样策略
IF 2.1 3区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-08-01 Epub Date: 2024-06-17 DOI: 10.1007/s00239-024-10179-8
Steven K Chen, Jing Liu, Alexander Van Nynatten, Benjamin M Tudor-Price, Belinda S W Chang

Empirical studies of genotype-phenotype-fitness maps of proteins are fundamental to understanding the evolutionary process, in elucidating the space of possible genotypes accessible through mutations in a landscape of phenotypes and fitness effects. Yet, comprehensively mapping molecular fitness landscapes remains challenging since all possible combinations of amino acid substitutions for even a few protein sites are encoded by an enormous genotype space. High-throughput mapping of genotype space can be achieved using large-scale screening experiments known as multiplexed assays of variant effect (MAVEs). However, to accommodate such multi-mutational studies, the size of MAVEs has grown to the point where a priori determination of sampling requirements is needed. To address this problem, we propose calculations and simulation methods to approximate minimum sampling requirements for multi-mutational MAVEs, which we combine with a new library construction protocol to experimentally validate our approximation approaches. Analysis of our simulated data reveals how sampling trajectories differ between simulations of nucleotide versus amino acid variants and among mutagenesis schemes. For this, we show quantitatively that marginal gains in sampling efficiency demand increasingly greater sampling effort when sampling for nucleotide sequences over their encoded amino acid equivalents. We present a new library construction protocol that efficiently maximizes sequence variation, and demonstrate using ultradeep sequencing that the library encodes virtually all possible combinations of mutations within the experimental design. Insights learned from our analyses together with the methodological advances reported herein are immediately applicable toward pooled experimental screens of arbitrary design, enabling further assay upscaling and expanded testing of genotype space.

蛋白质基因型-表型-适配性图谱的实证研究是了解进化过程的基础,它阐明了在表型和适配性效应景观中通过突变可获得的可能基因型空间。然而,全面绘制分子适配性景观图仍然具有挑战性,因为即使是几个蛋白质位点的所有可能的氨基酸替换组合都包含在一个巨大的基因型空间中。基因型空间的高通量图谱可以通过被称为变异效应多重检测(MAVE)的大规模筛选实验来实现。然而,为了适应这种多重变异研究,MAVE 的规模已经发展到需要先验确定取样要求的地步。为了解决这个问题,我们提出了计算和模拟方法来近似确定多变异 MAVE 的最低取样要求,并将其与新的文库构建方案相结合,通过实验验证了我们的近似方法。对模拟数据的分析揭示了核苷酸变体与氨基酸变体模拟之间以及不同诱变方案之间采样轨迹的差异。为此,我们从数量上表明,在对核苷酸序列进行采样时,采样效率的边际收益要求采样工作量越来越大,而对氨基酸等价物进行采样时则要求采样工作量越来越大。我们提出了一种新的文库构建方案,它能有效地将序列变异最大化,并利用超深度测序证明文库编码了实验设计中几乎所有可能的突变组合。从我们的分析中获得的启示以及本文所报告的方法学进展可立即应用于任意设计的集合实验筛选,从而实现进一步的检测升级和扩大基因型空间的测试。
{"title":"Sampling Strategies for Experimentally Mapping Molecular Fitness Landscapes Using High-Throughput Methods.","authors":"Steven K Chen, Jing Liu, Alexander Van Nynatten, Benjamin M Tudor-Price, Belinda S W Chang","doi":"10.1007/s00239-024-10179-8","DOIUrl":"10.1007/s00239-024-10179-8","url":null,"abstract":"<p><p>Empirical studies of genotype-phenotype-fitness maps of proteins are fundamental to understanding the evolutionary process, in elucidating the space of possible genotypes accessible through mutations in a landscape of phenotypes and fitness effects. Yet, comprehensively mapping molecular fitness landscapes remains challenging since all possible combinations of amino acid substitutions for even a few protein sites are encoded by an enormous genotype space. High-throughput mapping of genotype space can be achieved using large-scale screening experiments known as multiplexed assays of variant effect (MAVEs). However, to accommodate such multi-mutational studies, the size of MAVEs has grown to the point where a priori determination of sampling requirements is needed. To address this problem, we propose calculations and simulation methods to approximate minimum sampling requirements for multi-mutational MAVEs, which we combine with a new library construction protocol to experimentally validate our approximation approaches. Analysis of our simulated data reveals how sampling trajectories differ between simulations of nucleotide versus amino acid variants and among mutagenesis schemes. For this, we show quantitatively that marginal gains in sampling efficiency demand increasingly greater sampling effort when sampling for nucleotide sequences over their encoded amino acid equivalents. We present a new library construction protocol that efficiently maximizes sequence variation, and demonstrate using ultradeep sequencing that the library encodes virtually all possible combinations of mutations within the experimental design. Insights learned from our analyses together with the methodological advances reported herein are immediately applicable toward pooled experimental screens of arbitrary design, enabling further assay upscaling and expanded testing of genotype space.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"402-414"},"PeriodicalIF":2.1,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141419508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction: Untangling the Evolution of the Receptor-Binding Motif of SARS-CoV-2. 更正:解开 SARS-CoV-2 的受体结合基团的演变。
IF 2.1 3区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-08-01 DOI: 10.1007/s00239-024-10183-y
Luis Delaye, Lizbeth Román-Padilla
{"title":"Correction: Untangling the Evolution of the Receptor-Binding Motif of SARS-CoV-2.","authors":"Luis Delaye, Lizbeth Román-Padilla","doi":"10.1007/s00239-024-10183-y","DOIUrl":"10.1007/s00239-024-10183-y","url":null,"abstract":"","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"525-526"},"PeriodicalIF":2.1,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11291600/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141446311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Expansion and Functional Diversification of Long-Wavelength-Sensitive Opsin in Anabantoid Fishes. 无颌鱼类中长波长敏感光蛋白的扩展和功能多样化
IF 2.1 3区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-08-01 Epub Date: 2024-06-11 DOI: 10.1007/s00239-024-10181-0
Jan Gerwin, Julián Torres-Dowdall, Thomas F Brown, Axel Meyer

Gene duplication is one of the most important sources of novel genotypic diversity and the subsequent evolution of phenotypic diversity. Determining the evolutionary history and functional changes of duplicated genes is crucial for a comprehensive understanding of adaptive evolution. The evolutionary history of visual opsin genes is very dynamic, with repeated duplication events followed by sub- or neofunctionalization. While duplication of the green-sensitive opsins rh2 is common in teleost fish, fewer cases of multiple duplication events of the red-sensitive opsin lws are known. In this study, we investigate the visual opsin gene repertoire of the anabantoid fishes, focusing on the five lws opsin genes found in the genus Betta. We determine the evolutionary history of the lws opsin gene by taking advantage of whole-genome sequences of nine anabantoid species, including the newly assembled genome of Betta imbellis. Our results show that at least two independent duplications of lws occurred in the Betta lineage. The analysis of amino acid sequences of the lws paralogs of Betta revealed high levels of diversification in four of the seven transmembrane regions of the lws protein. Amino acid substitutions at two key-tuning sites are predicted to lead to differentiation of absorption maxima (λmax) between the paralogs within Betta. Finally, eye transcriptomics of B. splendens at different developmental stages revealed expression shifts between paralogs for all cone opsin classes. The lws genes are expressed according to their relative position in the lws opsin cluster throughout ontogeny. We conclude that temporal collinearity of lws expression might have facilitated subfunctionalization of lws in Betta and teleost opsins in general.

基因复制是新基因型多样性和随后表型多样性进化的最重要来源之一。确定重复基因的进化历史和功能变化对于全面了解适应性进化至关重要。视蛋白基因的进化史非常动态,重复复制事件之后会出现亚功能化或新功能化。绿色敏感视蛋白 rh2 的重复在远洋鱼类中很常见,而红色敏感视蛋白 lws 的多次重复事件却鲜为人知。在这项研究中,我们调查了无尾目鱼类的视觉蛋白基因谱系,重点研究了贝塔鱼属(Betta)中的五个 lws 蛋白基因。我们利用九种无尾目鱼类的全基因组序列(包括新组装的Betta imbellis基因组)确定了lws视蛋白基因的进化历史。我们的研究结果表明,在 Betta 系中至少发生了两次独立的 lws 复制。对 Betta 的 lws 准同源物氨基酸序列的分析表明,在 lws 蛋白的七个跨膜区中,有四个跨膜区出现了高度多样化。据预测,两个关键调谐位点上的氨基酸置换导致了 Betta 内各准同系物之间吸收最大值(λmax)的差异。最后,白芨在不同发育阶段的眼转录组学发现,所有视锥蛋白类的旁系亲属之间都存在表达变化。在整个本体发育过程中,lws基因根据其在lws视蛋白簇中的相对位置进行表达。我们的结论是,lws表达的时间共线性可能促进了lws在Betta和一般远足类动物视蛋白中的亚功能化。
{"title":"Expansion and Functional Diversification of Long-Wavelength-Sensitive Opsin in Anabantoid Fishes.","authors":"Jan Gerwin, Julián Torres-Dowdall, Thomas F Brown, Axel Meyer","doi":"10.1007/s00239-024-10181-0","DOIUrl":"10.1007/s00239-024-10181-0","url":null,"abstract":"<p><p>Gene duplication is one of the most important sources of novel genotypic diversity and the subsequent evolution of phenotypic diversity. Determining the evolutionary history and functional changes of duplicated genes is crucial for a comprehensive understanding of adaptive evolution. The evolutionary history of visual opsin genes is very dynamic, with repeated duplication events followed by sub- or neofunctionalization. While duplication of the green-sensitive opsins rh2 is common in teleost fish, fewer cases of multiple duplication events of the red-sensitive opsin lws are known. In this study, we investigate the visual opsin gene repertoire of the anabantoid fishes, focusing on the five lws opsin genes found in the genus Betta. We determine the evolutionary history of the lws opsin gene by taking advantage of whole-genome sequences of nine anabantoid species, including the newly assembled genome of Betta imbellis. Our results show that at least two independent duplications of lws occurred in the Betta lineage. The analysis of amino acid sequences of the lws paralogs of Betta revealed high levels of diversification in four of the seven transmembrane regions of the lws protein. Amino acid substitutions at two key-tuning sites are predicted to lead to differentiation of absorption maxima (λ<sub>max</sub>) between the paralogs within Betta. Finally, eye transcriptomics of B. splendens at different developmental stages revealed expression shifts between paralogs for all cone opsin classes. The lws genes are expressed according to their relative position in the lws opsin cluster throughout ontogeny. We conclude that temporal collinearity of lws expression might have facilitated subfunctionalization of lws in Betta and teleost opsins in general.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":"432-448"},"PeriodicalIF":2.1,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11291592/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141300795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Molecular Evolution
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1