Xiao-Hua Yan, Shen-Chian Pei, Hsi-Ching Yen, Alain Blanchard, Pascal Sirand-Pugnet, Vincent Baby, Gail E Gasparich, Chih-Horng Kuo
{"title":"基于基因含量分析的细菌属划分:以毛霉菌中的支原体-原生支原体支系为例进行研究。","authors":"Xiao-Hua Yan, Shen-Chian Pei, Hsi-Ching Yen, Alain Blanchard, Pascal Sirand-Pugnet, Vincent Baby, Gail E Gasparich, Chih-Horng Kuo","doi":"10.1099/mgen.0.001321","DOIUrl":null,"url":null,"abstract":"<p><p>Genome-based analysis allows for large-scale classification of diverse bacteria and has been widely adopted for delineating species. Unfortunately, for higher taxonomic ranks such as genus, establishing a generally accepted approach based on genome analysis is challenging. While core-genome phylogenies depict the evolutionary relationships among species, determining the correspondence between clades and genera may not be straightforward. For genotypic divergence, the percentage of conserved proteins and genome-wide average amino acid identity are commonly used, but often do not provide a clear threshold for classification. In this work, we investigated the utility of global comparisons and data visualization in identifying clusters of species based on their overall gene content and rationalized that such patterns can be integrated with phylogeny and other information such as phenotypes for improving taxonomy. As a proof of concept, we selected 177 representative genome sequences from the <i>Mycoplasmatales-Entomoplasmatales</i> clade within the class <i>Mollicutes</i> for a case study. We found that the clustering patterns corresponded to the current understanding of these organisms, namely the split into three above-genus groups: Hominis, Pneumoniae and <i>Spiroplasma-Entomoplasmataceae</i>-Mycoides. However, at the genus level, several important issues were found. For example, recent taxonomic revisions that split the Hominis group into three genera and <i>Entomoplasmataceae</i> into five genera are problematic, as those newly described or emended genera lack clear differentiations in gene content from one another. Moreover, several cases of misclassification were identified. These findings demonstrated the utility of this approach and its potential application to other bacteria.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 11","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11567158/pdf/","citationCount":"0","resultStr":"{\"title\":\"Delineating bacterial genera based on gene content analysis: a case study of the <i>Mycoplasmatales-Entomoplasmatales</i> clade within the class <i>Mollicutes</i>.\",\"authors\":\"Xiao-Hua Yan, Shen-Chian Pei, Hsi-Ching Yen, Alain Blanchard, Pascal Sirand-Pugnet, Vincent Baby, Gail E Gasparich, Chih-Horng Kuo\",\"doi\":\"10.1099/mgen.0.001321\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Genome-based analysis allows for large-scale classification of diverse bacteria and has been widely adopted for delineating species. Unfortunately, for higher taxonomic ranks such as genus, establishing a generally accepted approach based on genome analysis is challenging. While core-genome phylogenies depict the evolutionary relationships among species, determining the correspondence between clades and genera may not be straightforward. For genotypic divergence, the percentage of conserved proteins and genome-wide average amino acid identity are commonly used, but often do not provide a clear threshold for classification. In this work, we investigated the utility of global comparisons and data visualization in identifying clusters of species based on their overall gene content and rationalized that such patterns can be integrated with phylogeny and other information such as phenotypes for improving taxonomy. As a proof of concept, we selected 177 representative genome sequences from the <i>Mycoplasmatales-Entomoplasmatales</i> clade within the class <i>Mollicutes</i> for a case study. We found that the clustering patterns corresponded to the current understanding of these organisms, namely the split into three above-genus groups: Hominis, Pneumoniae and <i>Spiroplasma-Entomoplasmataceae</i>-Mycoides. However, at the genus level, several important issues were found. For example, recent taxonomic revisions that split the Hominis group into three genera and <i>Entomoplasmataceae</i> into five genera are problematic, as those newly described or emended genera lack clear differentiations in gene content from one another. Moreover, several cases of misclassification were identified. These findings demonstrated the utility of this approach and its potential application to other bacteria.</p>\",\"PeriodicalId\":18487,\"journal\":{\"name\":\"Microbial Genomics\",\"volume\":\"10 11\",\"pages\":\"\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2024-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11567158/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Microbial Genomics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1099/mgen.0.001321\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbial Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1099/mgen.0.001321","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
摘要
基于基因组的分析可对各种细菌进行大规模分类,并已被广泛用于划分物种。遗憾的是,对于更高的分类等级(如属),建立一种基于基因组分析的公认方法具有挑战性。虽然核心基因组系统发育描述了物种之间的进化关系,但确定支系和属之间的对应关系可能并不简单。在基因型差异方面,保守蛋白质的百分比和全基因组平均氨基酸同一性是常用的方法,但往往不能为分类提供一个明确的阈值。在这项工作中,我们研究了全局比较和数据可视化在根据整体基因含量识别物种群方面的实用性,并合理地认为这种模式可以与系统发育和表型等其他信息相结合,以改进分类学。作为概念验证,我们从真菌门(Mollicutes)中的支原体门(Mycoplasmatales-Entomoplasmatales)支系中选择了 177 个具有代表性的基因组序列进行案例研究。我们发现,聚类模式与目前对这些生物的认识相符,即分成三个以上的属群:Hominis、Pneumoniae 和 Spiroplasma-Entomoplasmataceae-Mycoides 。不过,在属的层面上,也发现了几个重要问题。例如,最近的分类学修订将 Hominis 组分为三个属,将 Entomoplasmataceae 分为五个属,这就存在问题,因为这些新描述或修订的属在基因含量上彼此缺乏明确的区分。此外,还发现了一些分类错误的情况。这些发现证明了这种方法的实用性及其应用于其他细菌的潜力。
Delineating bacterial genera based on gene content analysis: a case study of the Mycoplasmatales-Entomoplasmatales clade within the class Mollicutes.
Genome-based analysis allows for large-scale classification of diverse bacteria and has been widely adopted for delineating species. Unfortunately, for higher taxonomic ranks such as genus, establishing a generally accepted approach based on genome analysis is challenging. While core-genome phylogenies depict the evolutionary relationships among species, determining the correspondence between clades and genera may not be straightforward. For genotypic divergence, the percentage of conserved proteins and genome-wide average amino acid identity are commonly used, but often do not provide a clear threshold for classification. In this work, we investigated the utility of global comparisons and data visualization in identifying clusters of species based on their overall gene content and rationalized that such patterns can be integrated with phylogeny and other information such as phenotypes for improving taxonomy. As a proof of concept, we selected 177 representative genome sequences from the Mycoplasmatales-Entomoplasmatales clade within the class Mollicutes for a case study. We found that the clustering patterns corresponded to the current understanding of these organisms, namely the split into three above-genus groups: Hominis, Pneumoniae and Spiroplasma-Entomoplasmataceae-Mycoides. However, at the genus level, several important issues were found. For example, recent taxonomic revisions that split the Hominis group into three genera and Entomoplasmataceae into five genera are problematic, as those newly described or emended genera lack clear differentiations in gene content from one another. Moreover, several cases of misclassification were identified. These findings demonstrated the utility of this approach and its potential application to other bacteria.
期刊介绍:
Microbial Genomics (MGen) is a fully open access, mandatory open data and peer-reviewed journal publishing high-profile original research on archaea, bacteria, microbial eukaryotes and viruses.