Andrey V. Machulin , Evgeniya I. Deryusheva , Oxana V. Galzitskaya
{"title":"细菌 rpsA 基因碱基组成的变异、结构-功能关系和结构重复的起源。","authors":"Andrey V. Machulin , Evgeniya I. Deryusheva , Oxana V. Galzitskaya","doi":"10.1016/j.biosystems.2024.105196","DOIUrl":null,"url":null,"abstract":"<div><p>Protein domain repeats are known to arise due to tandem duplications of internal genes. However, the understanding of the underlying mechanisms of this process is incomplete. The goal of this work was to investigate the mechanism of occurrence of repeat expansion based on studying the sequences of 1324 <em>rpsA</em> genes of bacterial S1 ribosomal proteins containing different numbers of S1 structural domains. The <em>rpsA</em> gene encodes ribosomal S1 protein, which is essential for cell viability as it interacts with both mRNA and proteins. Gene ontology (GO) analysis of S1 domains in ribosomal S1 proteins revealed that bacterial protein sequences in S1 mainly have 3 types of molecular functions: RNA binding activity, nucleic acid activity, and ribosome structural component. Our results show that the maximum value of <em>rpsA</em> gene identity for full-length proteins was found for S1 proteins containing six structural domains (58%). Analysis of consensus sequences showed that parts of the <em>rpsA</em> gene encoding separate S1 domains have no a strictly repetitive structure between groups containing different numbers of S1 domains. At the same time, gene regions encoding some conserved residues that form the RNA-binding site remain conserved. The detected phylogenetic similarity suggests that the proposed fold of the <em>rpsA</em> translation initiation region of <em>Escherichia coli</em> has functional value and is important for translational control of <em>rpsA</em> gene expression in other bacterial phyla, but not only in gamma Proteobacteria.</p></div>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Variation in base composition, structure-function relationships, and origins of structural repetition in bacterial rpsA gene\",\"authors\":\"Andrey V. Machulin , Evgeniya I. Deryusheva , Oxana V. Galzitskaya\",\"doi\":\"10.1016/j.biosystems.2024.105196\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Protein domain repeats are known to arise due to tandem duplications of internal genes. However, the understanding of the underlying mechanisms of this process is incomplete. The goal of this work was to investigate the mechanism of occurrence of repeat expansion based on studying the sequences of 1324 <em>rpsA</em> genes of bacterial S1 ribosomal proteins containing different numbers of S1 structural domains. The <em>rpsA</em> gene encodes ribosomal S1 protein, which is essential for cell viability as it interacts with both mRNA and proteins. Gene ontology (GO) analysis of S1 domains in ribosomal S1 proteins revealed that bacterial protein sequences in S1 mainly have 3 types of molecular functions: RNA binding activity, nucleic acid activity, and ribosome structural component. Our results show that the maximum value of <em>rpsA</em> gene identity for full-length proteins was found for S1 proteins containing six structural domains (58%). Analysis of consensus sequences showed that parts of the <em>rpsA</em> gene encoding separate S1 domains have no a strictly repetitive structure between groups containing different numbers of S1 domains. At the same time, gene regions encoding some conserved residues that form the RNA-binding site remain conserved. The detected phylogenetic similarity suggests that the proposed fold of the <em>rpsA</em> translation initiation region of <em>Escherichia coli</em> has functional value and is important for translational control of <em>rpsA</em> gene expression in other bacterial phyla, but not only in gamma Proteobacteria.</p></div>\",\"PeriodicalId\":2,\"journal\":{\"name\":\"ACS Applied Bio Materials\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Applied Bio Materials\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0303264724000819\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, BIOMATERIALS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0303264724000819","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
Variation in base composition, structure-function relationships, and origins of structural repetition in bacterial rpsA gene
Protein domain repeats are known to arise due to tandem duplications of internal genes. However, the understanding of the underlying mechanisms of this process is incomplete. The goal of this work was to investigate the mechanism of occurrence of repeat expansion based on studying the sequences of 1324 rpsA genes of bacterial S1 ribosomal proteins containing different numbers of S1 structural domains. The rpsA gene encodes ribosomal S1 protein, which is essential for cell viability as it interacts with both mRNA and proteins. Gene ontology (GO) analysis of S1 domains in ribosomal S1 proteins revealed that bacterial protein sequences in S1 mainly have 3 types of molecular functions: RNA binding activity, nucleic acid activity, and ribosome structural component. Our results show that the maximum value of rpsA gene identity for full-length proteins was found for S1 proteins containing six structural domains (58%). Analysis of consensus sequences showed that parts of the rpsA gene encoding separate S1 domains have no a strictly repetitive structure between groups containing different numbers of S1 domains. At the same time, gene regions encoding some conserved residues that form the RNA-binding site remain conserved. The detected phylogenetic similarity suggests that the proposed fold of the rpsA translation initiation region of Escherichia coli has functional value and is important for translational control of rpsA gene expression in other bacterial phyla, but not only in gamma Proteobacteria.