Xiao-Hua Yan, Shen-Chian Pei, Hsi-Ching Yen, Alain Blanchard, Pascal Sirand-Pugnet, Vincent Baby, Gail E Gasparich, Chih-Horng Kuo
{"title":"Delineating bacterial genera based on gene content analysis: a case study of the <i>Mycoplasmatales-Entomoplasmatales</i> clade within the class <i>Mollicutes</i>.","authors":"Xiao-Hua Yan, Shen-Chian Pei, Hsi-Ching Yen, Alain Blanchard, Pascal Sirand-Pugnet, Vincent Baby, Gail E Gasparich, Chih-Horng Kuo","doi":"10.1099/mgen.0.001321","DOIUrl":null,"url":null,"abstract":"<p><p>Genome-based analysis allows for large-scale classification of diverse bacteria and has been widely adopted for delineating species. Unfortunately, for higher taxonomic ranks such as genus, establishing a generally accepted approach based on genome analysis is challenging. While core-genome phylogenies depict the evolutionary relationships among species, determining the correspondence between clades and genera may not be straightforward. For genotypic divergence, the percentage of conserved proteins and genome-wide average amino acid identity are commonly used, but often do not provide a clear threshold for classification. In this work, we investigated the utility of global comparisons and data visualization in identifying clusters of species based on their overall gene content and rationalized that such patterns can be integrated with phylogeny and other information such as phenotypes for improving taxonomy. As a proof of concept, we selected 177 representative genome sequences from the <i>Mycoplasmatales-Entomoplasmatales</i> clade within the class <i>Mollicutes</i> for a case study. We found that the clustering patterns corresponded to the current understanding of these organisms, namely the split into three above-genus groups: Hominis, Pneumoniae and <i>Spiroplasma-Entomoplasmataceae</i>-Mycoides. However, at the genus level, several important issues were found. For example, recent taxonomic revisions that split the Hominis group into three genera and <i>Entomoplasmataceae</i> into five genera are problematic, as those newly described or emended genera lack clear differentiations in gene content from one another. Moreover, several cases of misclassification were identified. These findings demonstrated the utility of this approach and its potential application to other bacteria.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 11","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11567158/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbial Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1099/mgen.0.001321","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Genome-based analysis allows for large-scale classification of diverse bacteria and has been widely adopted for delineating species. Unfortunately, for higher taxonomic ranks such as genus, establishing a generally accepted approach based on genome analysis is challenging. While core-genome phylogenies depict the evolutionary relationships among species, determining the correspondence between clades and genera may not be straightforward. For genotypic divergence, the percentage of conserved proteins and genome-wide average amino acid identity are commonly used, but often do not provide a clear threshold for classification. In this work, we investigated the utility of global comparisons and data visualization in identifying clusters of species based on their overall gene content and rationalized that such patterns can be integrated with phylogeny and other information such as phenotypes for improving taxonomy. As a proof of concept, we selected 177 representative genome sequences from the Mycoplasmatales-Entomoplasmatales clade within the class Mollicutes for a case study. We found that the clustering patterns corresponded to the current understanding of these organisms, namely the split into three above-genus groups: Hominis, Pneumoniae and Spiroplasma-Entomoplasmataceae-Mycoides. However, at the genus level, several important issues were found. For example, recent taxonomic revisions that split the Hominis group into three genera and Entomoplasmataceae into five genera are problematic, as those newly described or emended genera lack clear differentiations in gene content from one another. Moreover, several cases of misclassification were identified. These findings demonstrated the utility of this approach and its potential application to other bacteria.
期刊介绍:
Microbial Genomics (MGen) is a fully open access, mandatory open data and peer-reviewed journal publishing high-profile original research on archaea, bacteria, microbial eukaryotes and viruses.