Bacterial phylogenetic tree construction based on genomic translation stop signals.

Lijing Xu, Jimmy Kuo, Jong-Kang Liu, Tit-Yee Wong
{"title":"Bacterial phylogenetic tree construction based on genomic translation stop signals.","authors":"Lijing Xu,&nbsp;Jimmy Kuo,&nbsp;Jong-Kang Liu,&nbsp;Tit-Yee Wong","doi":"10.1186/2042-5783-2-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The efficiencies of the stop codons TAA, TAG, and TGA in protein synthesis termination are not the same. These variations could allow many genes to be regulated. There are many similar nucleotide trimers found on the second and third reading-frames of a gene. They are called premature stop codons (PSC). Like stop codons, the PSC in bacterial genomes are also highly bias in terms of their quantities and qualities on the genes. Phylogenetically related species often share a similar PSC profile. We want to know whether the selective forces that influence the stop codons and the PSC usage biases in a genome are related. We also wish to know how strong these trimers in a genome are related to the natural history of the bacterium. Knowing these relations may provide better knowledge in the phylogeny of bacteria</p><p><strong>Results: </strong>A 16SrRNA-alignment tree of 19 well-studied α-, β- and γ-Proteobacteria Type species is used as standard reference for bacterial phylogeny. The genomes of sixty-one bacteria, belonging to the α-, β- and γ-Proteobacteria subphyla, are used for this study. The stop codons and PSC are collectively termed \"Translation Stop Signals\" (TSS). A gene is represented by nine scalars corresponding to the numbers of counts of TAA, TAG, and TGA on each of the three reading-frames of that gene. \"Translation Stop Signals Ratio\" (TSSR) is the ratio between the TSS counts. Four types of TSSR are investigated. The TSSR-1, TSSR-2 and TSSR-3 are each a 3-scalar series corresponding respectively to the average ratio of TAA: TAG: TGA on the first, second, and third reading-frames of all genes in a genome. The Genomic-TSSR is a 9-scalar series representing the ratio of distribution of all TSS on the three reading-frames of all genes in a genome. Results show that bacteria grouped by their similarities based on TSSR-1, TSSR-2, or TSSR-3 values could only partially resolve the phylogeny of the species. However, grouping bacteria based on thier Genomic-TSSR values resulted in clusters of bacteria identical to those bacterial clusters of the reference tree. Unlike the 16SrRNA method, the Genomic-TSSR tree is also able to separate closely related species/strains at high resolution. Species and strains separated by the Genomic-TSSR grouping method are often in good agreement with those classified by other taxonomic methods. Correspondence analysis of individual genes shows that most genes in a bacterial genome share a similar TSSR value. However, within a chromosome, the Genic-TSSR values of genes near the replication origin region (Ori) are more similar to each other than those genes near the terminus region (Ter).</p><p><strong>Conclusion: </strong>The translation stop signals on the three reading-frames of the genes on a bacterial genome are interrelated, possibly due to frequent off-frame recombination facilitated by translational-associated recombination (TSR). However, TSR may not occur randomly in a bacterial chromosome. Genes near the Ori region are often highly expressed and a bacterium always maintains multiple copies of Ori. Frequent collisions between DNA- polymerase and RNA-polymerase would create many DNA strand-breaks on the genes; whereas DNA strand-break induced homologues-recombination is more likely to take place between genes with similar sequence. Thus, localized recombination could explain why the TSSR of genes near the Ori region are more similar to each other. The quantity and quality of these TSS in a genome strongly reflect the natural history of a bacterium. We propose that the Genomic- TSSR can be used as a subjective biomarker to represent the phyletic status of a bacterium.</p>","PeriodicalId":18538,"journal":{"name":"Microbial Informatics and Experimentation","volume":"2 1","pages":"6"},"PeriodicalIF":0.0000,"publicationDate":"2012-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/2042-5783-2-6","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbial Informatics and Experimentation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/2042-5783-2-6","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

Abstract

Background: The efficiencies of the stop codons TAA, TAG, and TGA in protein synthesis termination are not the same. These variations could allow many genes to be regulated. There are many similar nucleotide trimers found on the second and third reading-frames of a gene. They are called premature stop codons (PSC). Like stop codons, the PSC in bacterial genomes are also highly bias in terms of their quantities and qualities on the genes. Phylogenetically related species often share a similar PSC profile. We want to know whether the selective forces that influence the stop codons and the PSC usage biases in a genome are related. We also wish to know how strong these trimers in a genome are related to the natural history of the bacterium. Knowing these relations may provide better knowledge in the phylogeny of bacteria

Results: A 16SrRNA-alignment tree of 19 well-studied α-, β- and γ-Proteobacteria Type species is used as standard reference for bacterial phylogeny. The genomes of sixty-one bacteria, belonging to the α-, β- and γ-Proteobacteria subphyla, are used for this study. The stop codons and PSC are collectively termed "Translation Stop Signals" (TSS). A gene is represented by nine scalars corresponding to the numbers of counts of TAA, TAG, and TGA on each of the three reading-frames of that gene. "Translation Stop Signals Ratio" (TSSR) is the ratio between the TSS counts. Four types of TSSR are investigated. The TSSR-1, TSSR-2 and TSSR-3 are each a 3-scalar series corresponding respectively to the average ratio of TAA: TAG: TGA on the first, second, and third reading-frames of all genes in a genome. The Genomic-TSSR is a 9-scalar series representing the ratio of distribution of all TSS on the three reading-frames of all genes in a genome. Results show that bacteria grouped by their similarities based on TSSR-1, TSSR-2, or TSSR-3 values could only partially resolve the phylogeny of the species. However, grouping bacteria based on thier Genomic-TSSR values resulted in clusters of bacteria identical to those bacterial clusters of the reference tree. Unlike the 16SrRNA method, the Genomic-TSSR tree is also able to separate closely related species/strains at high resolution. Species and strains separated by the Genomic-TSSR grouping method are often in good agreement with those classified by other taxonomic methods. Correspondence analysis of individual genes shows that most genes in a bacterial genome share a similar TSSR value. However, within a chromosome, the Genic-TSSR values of genes near the replication origin region (Ori) are more similar to each other than those genes near the terminus region (Ter).

Conclusion: The translation stop signals on the three reading-frames of the genes on a bacterial genome are interrelated, possibly due to frequent off-frame recombination facilitated by translational-associated recombination (TSR). However, TSR may not occur randomly in a bacterial chromosome. Genes near the Ori region are often highly expressed and a bacterium always maintains multiple copies of Ori. Frequent collisions between DNA- polymerase and RNA-polymerase would create many DNA strand-breaks on the genes; whereas DNA strand-break induced homologues-recombination is more likely to take place between genes with similar sequence. Thus, localized recombination could explain why the TSSR of genes near the Ori region are more similar to each other. The quantity and quality of these TSS in a genome strongly reflect the natural history of a bacterium. We propose that the Genomic- TSSR can be used as a subjective biomarker to represent the phyletic status of a bacterium.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于基因组翻译停止信号的细菌系统发育树构建。
背景:终止密码子TAA、TAG和TGA在蛋白质合成终止中的效率是不一样的。这些变异可以使许多基因受到调控。在一个基因的第二和第三解读框上发现了许多相似的核苷酸三聚体。它们被称为过早终止密码子(PSC)。与终止密码子一样,细菌基因组中的PSC在基因上的数量和质量方面也存在高度的偏差。系统发育相关的物种通常具有相似的PSC特征。我们想知道影响终止密码子和基因组中PSC使用偏差的选择力是否相关。我们还希望知道基因组中的这些三聚体与细菌的自然史有多大关系。结果:用19种α-、β-和γ-变形杆菌的16srrna比对树作为细菌系统发育的标准参考。本研究使用了61种细菌的基因组,这些细菌属于α-、β-和γ-变形菌亚门。停止密码子和PSC统称为“翻译停止信号”(TSS)。一个基因由9个标量表示,对应于该基因的三个读码框上每个TAA、TAG和TGA的计数。“转换停止信号比”(TSSR)是转换停止信号数之间的比率。研究了四种类型的TSSR。TSSR-1、TSSR-2和TSSR-3是一个3标量序列,分别对应于基因组中所有基因的第一、二、三读框上TAA: TAG: TGA的平均比值。genome - tssr是一个9标量序列,表示基因组中所有基因的三个读框上所有TSS的分布比例。结果表明,根据TSSR-1、TSSR-2或TSSR-3的相似性分组的细菌只能部分解决物种的系统发育问题。然而,根据细菌的基因组- tssr值对细菌进行分组,得到的细菌簇与参考树的细菌簇相同。与16SrRNA方法不同,genome - tssr树也能够以高分辨率分离密切相关的物种/菌株。用基因组- tssr分组方法分离的种和品系往往与用其他分类学方法分类的种和品系一致。单个基因的对应分析表明,细菌基因组中的大多数基因具有相似的TSSR值。然而,在一条染色体内,靠近复制起始区(Ori)的基因的遗传- tssr值比靠近末端区(Ter)的基因更相似。结论:细菌基因组的三个读框上的翻译停止信号是相互关联的,可能是由于翻译相关重组(translation -associated recombination, TSR)促进了频繁的框外重组。然而,TSR可能不是随机发生在细菌染色体上。Ori区附近的基因通常高度表达,一个细菌总是保持多个Ori拷贝。DNA聚合酶和rna聚合酶的频繁碰撞会在基因上产生许多DNA链断裂;而DNA链断裂诱导的同源重组更可能发生在序列相似的基因之间。因此,局部重组可以解释为什么Ori区域附近基因的TSSR彼此之间更相似。基因组中这些TSS的数量和质量强烈地反映了细菌的自然历史。我们建议基因组- TSSR可以作为一个主观的生物标记物来代表细菌的种系状态。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A systematic search for discriminating sites in the 16S ribosomal RNA gene. Deep sequencing of evolving pathogen populations: applications, errors, and bioinformatic solutions. Sialic acid utilization by Cronobacter sakazakii. Beginner's guide to comparative bacterial genome analysis using next-generation sequence data. An efficient rRNA removal method for RNA sequencing in GC-rich bacteria.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1