首页 > 最新文献

Microbial Genomics最新文献

英文 中文
Development and implementation of a core genome multilocus sequence typing scheme for Haemophilus influenzae. 开发和实施流感嗜血杆菌核心基因组多焦点序列分型方案。
IF 4 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-08-01 DOI: 10.1099/mgen.0.001281
Made Ananda Krisna, Keith A Jolley, William Monteith, Alexandra Boubour, Raph L Hamers, Angela B Brueggemann, Odile B Harrison, Martin C J Maiden

Haemophilus influenzae is part of the human nasopharyngeal microbiota and a pathogen causing invasive disease. The extensive genetic diversity observed in H. influenzae necessitates discriminatory analytical approaches to evaluate its population structure. This study developed a core genome multilocus sequence typing (cgMLST) scheme for H. influenzae using pangenome analysis tools and validated the cgMLST scheme using datasets consisting of complete reference genomes (N = 14) and high-quality draft H. influenzae genomes (N = 2297). The draft genome dataset was divided into a development dataset (N = 921) and a validation dataset (N = 1376). The development dataset was used to identify potential core genes, and the validation dataset was used to refine the final core gene list to ensure the reliability of the proposed cgMLST scheme. Functional classifications were made for all the resulting core genes. Phylogenetic analyses were performed using both allelic profiles and nucleotide sequence alignments of the core genome to test congruence, as assessed by Spearman's correlation and ordinary least square linear regression tests. Preliminary analyses using the development dataset identified 1067 core genes, which were refined to 1037 with the validation dataset. More than 70% of core genes were predicted to encode proteins essential for metabolism or genetic information processing. Phylogenetic and statistical analyses indicated that the core genome allelic profile accurately represented phylogenetic relatedness among the isolates (R 2 = 0.945). We used this cgMLST scheme to define a high-resolution population structure for H. influenzae, which enhances the genomic analysis of this clinically relevant human pathogen.

流感嗜血杆菌是人类鼻咽部微生物群的一部分,也是导致侵袭性疾病的病原体。流感嗜血杆菌具有广泛的遗传多样性,因此有必要采用鉴别分析方法来评估其种群结构。本研究利用泛基因组分析工具为流感嗜血杆菌开发了核心基因组多焦点序列分型(cgMLST)方案,并利用由完整参考基因组(N = 14)和高质量流感嗜血杆菌基因组草案(N = 2297)组成的数据集验证了cgMLST方案。基因组草案数据集分为开发数据集(N = 921)和验证数据集(N = 1376)。开发数据集用于确定潜在的核心基因,验证数据集用于完善最终的核心基因列表,以确保拟议的 cgMLST 方案的可靠性。对所有产生的核心基因进行了功能分类。利用等位基因图谱和核心基因组的核苷酸序列比对进行了系统发育分析,通过斯皮尔曼相关性和普通最小二乘法线性回归测试来检验一致性。使用开发数据集进行的初步分析确定了 1067 个核心基因,使用验证数据集将其细化为 1037 个。据预测,70%以上的核心基因编码新陈代谢或遗传信息处理所必需的蛋白质。系统发育和统计分析表明,核心基因组等位基因图谱准确地代表了分离株之间的系统发育亲缘关系(R 2 = 0.945)。我们利用这种 cgMLST 方案确定了流感嗜血杆菌的高分辨率种群结构,从而加强了对这种与临床相关的人类病原体的基因组分析。
{"title":"Development and implementation of a core genome multilocus sequence typing scheme for <i>Haemophilus influenzae</i>.","authors":"Made Ananda Krisna, Keith A Jolley, William Monteith, Alexandra Boubour, Raph L Hamers, Angela B Brueggemann, Odile B Harrison, Martin C J Maiden","doi":"10.1099/mgen.0.001281","DOIUrl":"10.1099/mgen.0.001281","url":null,"abstract":"<p><p><i>Haemophilus influenzae</i> is part of the human nasopharyngeal microbiota and a pathogen causing invasive disease. The extensive genetic diversity observed in <i>H. influenzae</i> necessitates discriminatory analytical approaches to evaluate its population structure. This study developed a core genome multilocus sequence typing (cgMLST) scheme for <i>H. influenzae</i> using pangenome analysis tools and validated the cgMLST scheme using datasets consisting of complete reference genomes (<i>N</i> = 14) and high-quality draft <i>H. influenzae</i> genomes (<i>N</i> = 2297). The draft genome dataset was divided into a development dataset (<i>N</i> = 921) and a validation dataset (<i>N</i> = 1376). The development dataset was used to identify potential core genes, and the validation dataset was used to refine the final core gene list to ensure the reliability of the proposed cgMLST scheme. Functional classifications were made for all the resulting core genes. Phylogenetic analyses were performed using both allelic profiles and nucleotide sequence alignments of the core genome to test congruence, as assessed by Spearman's correlation and ordinary least square linear regression tests. Preliminary analyses using the development dataset identified 1067 core genes, which were refined to 1037 with the validation dataset. More than 70% of core genes were predicted to encode proteins essential for metabolism or genetic information processing. Phylogenetic and statistical analyses indicated that the core genome allelic profile accurately represented phylogenetic relatedness among the isolates (<i>R</i> <sup>2</sup> = 0.945). We used this cgMLST scheme to define a high-resolution population structure for <i>H. influenzae</i>, which enhances the genomic analysis of this clinically relevant human pathogen.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 8","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11315579/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141907045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative genomic analysis identifies potential adaptive variation in Mycoplasma ovipneumoniae. 比较基因组分析确定了卵肺炎支原体的潜在适应性变异。
IF 4 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-08-01 DOI: 10.1099/mgen.0.001279
Kimberly R Andrews, Thomas E Besser, Thibault Stalder, Eva M Top, Katherine N Baker, Matthew W Fagnan, Daniel D New, G Maria Schneider, Alexandra Gal, Rebecca Andrews-Dickert, Samuel S Hunter, Kimberlee B Beckmen, Lauren Christensen, Anne Justice-Allen, Denise Konetchy, Chadwick P Lehman, Kezia Manlove, Hollie Miyasaki, Todd Nordeen, Annette Roug, E Frances Cassirer

Mycoplasma ovipneumoniae is associated with respiratory disease in wild and domestic Caprinae globally, with wide variation in disease outcomes within and between host species. To gain insight into phylogenetic structure and mechanisms of pathogenicity for this bacterial species, we compared M. ovipneumoniae genomes for 99 samples from 6 countries (Australia, Bosnia and Herzegovina, Brazil, China, France and USA) and 4 host species (domestic sheep, domestic goats, bighorn sheep and caribou). Core genome sequences of M. ovipneumoniae assemblies from domestic sheep and goats fell into two well-supported phylogenetic clades that are divergent enough to be considered different bacterial species, consistent with each of these two clades having an evolutionary origin in separate host species. Genome assemblies from bighorn sheep and caribou also fell within these two clades, indicating multiple spillover events, most commonly from domestic sheep. Pangenome analysis indicated a high percentage (91.4 %) of accessory genes (i.e. genes found only in a subset of assemblies) compared to core genes (i.e. genes found in all assemblies), potentially indicating a propensity for this pathogen to adapt to within-host conditions. In addition, many genes related to carbon metabolism, which is a virulence factor for Mycoplasmas, showed evidence for homologous recombination, a potential signature of adaptation. The presence or absence of annotated genes was very similar between sheep and goat clades, with only two annotated genes significantly clade-associated. However, three M. ovipneumoniae genome assemblies from asymptomatic caribou in Alaska formed a highly divergent subclade within the sheep clade that lacked 23 annotated genes compared to other assemblies, and many of these genes had functions related to carbon metabolism. Overall, our results suggest that adaptation of M. ovipneumoniae has involved evolution of carbon metabolism pathways and virulence mechanisms related to those pathways. The genes involved in these pathways, along with other genes identified as potentially involved in virulence in this study, are potential targets for future investigation into a possible genomic basis for the high variation observed in disease outcomes within and between wild and domestic host species.

卵肺炎支原体与全球野生和家养毛冠菊科动物的呼吸道疾病有关,宿主物种内部和宿主物种之间的疾病结果差异很大。为了深入了解这种细菌的系统发育结构和致病机制,我们比较了来自 6 个国家(澳大利亚、波斯尼亚和黑塞哥维那、巴西、中国、法国和美国)和 4 个宿主物种(家养绵羊、家养山羊、北山羊和驯鹿)的 99 个样本的卵肺炎支原体基因组。来自家绵羊和山羊的卵肺炎霉菌的核心基因组序列集合分为两个支持良好的系统发育支系,其差异足以被视为不同的细菌物种,这两个支系分别起源于不同的宿主物种。来自大角羊和驯鹿的基因组也属于这两个支系,这表明存在多个外溢事件,其中最常见的是来自家羊的外溢事件。庞基因组分析表明,与核心基因(即在所有基因组中都能找到的基因)相比,附属基因(即只在部分基因组中发现的基因)的比例较高(91.4%),这可能表明这种病原体有适应宿主内部条件的倾向。此外,许多与碳代谢(支原体的毒力因子)相关的基因都显示出同源重组的迹象,这是适应的一个潜在特征。绵羊支原体和山羊支原体之间存在或不存在注释基因的情况非常相似,只有两个注释基因与支原体明显相关。然而,来自阿拉斯加无症状驯鹿的三个卵肺炎梭菌基因组序列在绵羊支系中形成了一个高度分化的亚支系,与其他序列相比,该亚支系缺少23个注释基因,其中许多基因的功能与碳代谢有关。总之,我们的研究结果表明,卵肺炎霉菌的适应涉及碳代谢途径和与这些途径相关的毒力机制的进化。参与这些途径的基因,以及本研究中发现的可能参与毒力的其他基因,都是未来研究野生宿主和家养宿主物种内部和之间疾病结果差异大的可能基因组基础的潜在目标。
{"title":"Comparative genomic analysis identifies potential adaptive variation in <i>Mycoplasma ovipneumoniae</i>.","authors":"Kimberly R Andrews, Thomas E Besser, Thibault Stalder, Eva M Top, Katherine N Baker, Matthew W Fagnan, Daniel D New, G Maria Schneider, Alexandra Gal, Rebecca Andrews-Dickert, Samuel S Hunter, Kimberlee B Beckmen, Lauren Christensen, Anne Justice-Allen, Denise Konetchy, Chadwick P Lehman, Kezia Manlove, Hollie Miyasaki, Todd Nordeen, Annette Roug, E Frances Cassirer","doi":"10.1099/mgen.0.001279","DOIUrl":"https://doi.org/10.1099/mgen.0.001279","url":null,"abstract":"<p><p><i>Mycoplasma ovipneumoniae</i> is associated with respiratory disease in wild and domestic Caprinae globally, with wide variation in disease outcomes within and between host species. To gain insight into phylogenetic structure and mechanisms of pathogenicity for this bacterial species, we compared <i>M. ovipneumoniae</i> genomes for 99 samples from 6 countries (Australia, Bosnia and Herzegovina, Brazil, China, France and USA) and 4 host species (domestic sheep, domestic goats, bighorn sheep and caribou). Core genome sequences of <i>M. ovipneumoniae</i> assemblies from domestic sheep and goats fell into two well-supported phylogenetic clades that are divergent enough to be considered different bacterial species, consistent with each of these two clades having an evolutionary origin in separate host species. Genome assemblies from bighorn sheep and caribou also fell within these two clades, indicating multiple spillover events, most commonly from domestic sheep. Pangenome analysis indicated a high percentage (91.4 %) of accessory genes (i.e. genes found only in a subset of assemblies) compared to core genes (i.e. genes found in all assemblies), potentially indicating a propensity for this pathogen to adapt to within-host conditions. In addition, many genes related to carbon metabolism, which is a virulence factor for Mycoplasmas, showed evidence for homologous recombination, a potential signature of adaptation. The presence or absence of annotated genes was very similar between sheep and goat clades, with only two annotated genes significantly clade-associated. However, three <i>M. ovipneumoniae</i> genome assemblies from asymptomatic caribou in Alaska formed a highly divergent subclade within the sheep clade that lacked 23 annotated genes compared to other assemblies, and many of these genes had functions related to carbon metabolism. Overall, our results suggest that adaptation of <i>M. ovipneumoniae</i> has involved evolution of carbon metabolism pathways and virulence mechanisms related to those pathways. The genes involved in these pathways, along with other genes identified as potentially involved in virulence in this study, are potential targets for future investigation into a possible genomic basis for the high variation observed in disease outcomes within and between wild and domestic host species.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 8","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11364169/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142109156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Whole-genome sequencing of Western Canadian Borrelia spp. collected from diverse tick and animal hosts reveals short-lived local genotypes interspersed with longer-lived continental genotypes. 从不同的蜱虫和动物宿主身上采集的加拿大西部鲍瑞氏杆菌的全基因组测序显示,短寿命的本地基因型与较长寿命的大陆基因型相互交错。
IF 4 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-08-01 DOI: 10.1099/mgen.0.001276
Jennifer N Russell, Min-Kuang Lee, Miguel I Uyaguari-Diaz, Ashton N Sies, Danae M Suchan, William Hsiao, Erin Fraser, Muhammad G Morshed, Andrew D S Cameron

Changing climates are allowing the geographic expansion of ticks and their animal hosts, increasing the risk of Borrelia-caused zoonoses in Canada. However, little is known about the genomic diversity of Borrelia from the west of the Canadian Rockies and from the tick vectors Ixodes pacificus, Ixodes auritulus and Ixodes angustus. Here, we report the whole-genome shotgun sequences of 51 Borrelia isolates from multiple tick species collected on a range of animal hosts between 1993 and 2016, located primarily in coastal British Columbia. The bacterial isolates represented three different species from the Lyme disease-causing Borrelia burgdorferi sensu lato genospecies complex [Borrelia burgdorferi sensu stricto (n=47), Borrelia americana (n=3) and Borrelia bissettiae (n=1)]. The traditional eight-gene multi-locus sequence typing (MLST) strategy was applied to facilitate comparisons across studies. This identified 13 known Borrelia sequence types (STs), established 6 new STs, and assigned 5 novel types to the nearest sequence types. B. burgdorferi s. s. isolates were further differentiated into ten ospC types, plus one novel ospC with less than 92 % nucleotide identity to all previously defined ospC types. The MLST types resampled over extended time periods belonged to previously described STs that are distributed across North America. The most geographically widespread ST, ST.12, was isolated from all three tick species. Conversely, new B. burgdorferi s. s. STs from Vancouver Island and the Vancouver region were only detected for short periods, revealing a surprising transience in space, time and host tick species, possibly due to displacement by longer-lived genotypes that expanded across North America.This article contains data hosted by Microreact.

气候的变化使得蜱虫及其动物宿主的地理分布不断扩大,增加了加拿大人畜共患包柔氏菌病的风险。然而,人们对加拿大落基山脉西部以及蜱虫载体太平洋伊蚊(Ixodes pacificus)、钝头伊蚊(Ixodes auritulus)和角伊蚊(Ixodes angustus)的鲍氏杆菌基因组多样性知之甚少。在此,我们报告了 1993 年至 2016 年间从多种蜱虫物种中采集的 51 个包柔氏菌分离物的全基因组霰弹枪序列,这些蜱虫主要分布在不列颠哥伦比亚省沿海地区的一系列动物宿主身上。这些细菌分离物代表了莱姆病致病包柔氏菌(Borrelia burgdorferi sensu lato genospecies complex)中的三个不同物种[严格意义上的包柔氏菌(Borrelia burgdorferi sensu stricto,n=47)、美洲包柔氏菌(Borrelia americana,n=3)和鲍氏杆菌(Borrelia bissettiae,n=1)]。采用传统的八基因多焦点序列分型(MLST)策略,以便于对各项研究进行比较。该方法确定了 13 个已知的包柔病毒序列类型(ST),建立了 6 个新的 ST,并将 5 个新类型归入最近的序列类型。B. burgdorferi s. s. 分离物被进一步区分为 10 种 ospC 类型,外加一种与之前定义的所有 ospC 类型的核苷酸同一性低于 92% 的新型 ospC。在较长时期内重新取样的 MLST 类型属于先前描述的 ST,这些 ST 分布在整个北美地区。地理分布最广的 ST(ST.12)从所有三个蜱种中分离出来。相反,来自温哥华岛的新的 B. burgdorferi s.温哥华岛和温哥华地区的新B. burgdorferi s. s.ST仅在短时间内被检测到,揭示了空间、时间和宿主蜱物种的惊人瞬变性,这可能是由于被扩展到北美的长寿命基因型所取代。
{"title":"Whole-genome sequencing of Western Canadian <i>Borrelia</i> spp. collected from diverse tick and animal hosts reveals short-lived local genotypes interspersed with longer-lived continental genotypes.","authors":"Jennifer N Russell, Min-Kuang Lee, Miguel I Uyaguari-Diaz, Ashton N Sies, Danae M Suchan, William Hsiao, Erin Fraser, Muhammad G Morshed, Andrew D S Cameron","doi":"10.1099/mgen.0.001276","DOIUrl":"10.1099/mgen.0.001276","url":null,"abstract":"<p><p>Changing climates are allowing the geographic expansion of ticks and their animal hosts, increasing the risk of <i>Borrelia</i>-caused zoonoses in Canada. However, little is known about the genomic diversity of <i>Borrelia</i> from the west of the Canadian Rockies and from the tick vectors <i>Ixodes pacificus</i>, <i>Ixodes auritulus</i> and <i>Ixodes angustus</i>. Here, we report the whole-genome shotgun sequences of 51 <i>Borrelia</i> isolates from multiple tick species collected on a range of animal hosts between 1993 and 2016, located primarily in coastal British Columbia. The bacterial isolates represented three different species from the Lyme disease-causing <i>Borrelia burgdorferi sensu lato</i> genospecies complex [<i>Borrelia burgdorferi sensu stricto</i> (<i>n</i>=47), <i>Borrelia americana</i> (<i>n</i>=3) and <i>Borrelia bissettiae</i> (<i>n</i>=1)]. The traditional eight-gene multi-locus sequence typing (MLST) strategy was applied to facilitate comparisons across studies. This identified 13 known <i>Borrelia</i> sequence types (STs), established 6 new STs, and assigned 5 novel types to the nearest sequence types. <i>B. burgdorferi</i> s. s. isolates were further differentiated into ten <i>ospC</i> types, plus one novel <i>ospC</i> with less than 92 % nucleotide identity to all previously defined <i>ospC</i> types. The MLST types resampled over extended time periods belonged to previously described STs that are distributed across North America. The most geographically widespread ST, ST.12, was isolated from all three tick species. Conversely, new <i>B. burgdorferi</i> s. s. STs from Vancouver Island and the Vancouver region were only detected for short periods, revealing a surprising transience in space, time and host tick species, possibly due to displacement by longer-lived genotypes that expanded across North America.This article contains data hosted by Microreact.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 8","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11296321/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141875283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of the Pneumococcal Genome Library, a core genome multilocus sequence typing scheme, and a taxonomic life identification number barcoding system to investigate and define pneumococcal population structure. 开发肺炎球菌基因组文库、核心基因组多焦点序列分型方案和分类生命识别号条形码系统,以研究和确定肺炎球菌种群结构。
IF 4 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-08-01 DOI: 10.1099/mgen.0.001280
Melissa J Jansen van Rensburg, Duncan J Berger, Iman Yassine, David Shaw, Andy Fohrmann, James E Bray, Keith A Jolley, Martin C J Maiden, Angela B Brueggemann

Investigating the genomic epidemiology of major bacterial pathogens is integral to understanding transmission, evolution, colonization, disease, antimicrobial resistance and vaccine impact. Furthermore, the recent accumulation of large numbers of whole genome sequences for many bacterial species enhances the development of robust genome-wide typing schemes to define the overall bacterial population structure and lineages within it. Using the previously published data, we developed the Pneumococcal Genome Library (PGL), a curated dataset of 30 976 genomes and contextual data for carriage and disease pneumococci recovered between 1916 and 2018 in 82 countries. We leveraged the size and diversity of the PGL to develop a core genome multilocus sequence typing (cgMLST) scheme comprised of 1222 loci. Finally, using multilevel single-linkage clustering, we stratified pneumococci into hierarchical clusters based on allelic similarity thresholds and defined these with a taxonomic life identification number (LIN) barcoding system. The PGL, cgMLST scheme and LIN barcodes represent a high-quality genomic resource and fine-scale clustering approaches for the analysis of pneumococcal populations, which support the genomic epidemiology and surveillance of this leading global pathogen.

调查主要细菌病原体的基因组流行病学是了解传播、进化、定植、疾病、抗菌药耐药性和疫苗影响所不可或缺的。此外,近来许多细菌物种的全基因组序列的大量积累也促进了强大的全基因组分型计划的发展,以确定细菌的总体种群结构和其中的世系。利用之前公布的数据,我们开发了肺炎球菌基因组库(PGL),这是一个包含 30 976 个基因组和上下文数据的数据集,这些数据是 1916 年至 2018 年期间在 82 个国家回收的携带和疾病肺炎球菌。我们利用 PGL 的规模和多样性开发了一个由 1222 个位点组成的核心基因组多焦点序列分型(cgMLST)方案。最后,我们利用多级单链聚类,根据等位基因相似性阈值将肺炎球菌分层聚类,并通过分类生命识别码(LIN)条形码系统对这些聚类进行定义。PGL、cgMLST 方案和 LIN 条形码代表了用于分析肺炎球菌种群的高质量基因组资源和精细聚类方法,为这一全球主要病原体的基因组流行病学和监测提供了支持。
{"title":"Development of the Pneumococcal Genome Library, a core genome multilocus sequence typing scheme, and a taxonomic life identification number barcoding system to investigate and define pneumococcal population structure.","authors":"Melissa J Jansen van Rensburg, Duncan J Berger, Iman Yassine, David Shaw, Andy Fohrmann, James E Bray, Keith A Jolley, Martin C J Maiden, Angela B Brueggemann","doi":"10.1099/mgen.0.001280","DOIUrl":"10.1099/mgen.0.001280","url":null,"abstract":"<p><p>Investigating the genomic epidemiology of major bacterial pathogens is integral to understanding transmission, evolution, colonization, disease, antimicrobial resistance and vaccine impact. Furthermore, the recent accumulation of large numbers of whole genome sequences for many bacterial species enhances the development of robust genome-wide typing schemes to define the overall bacterial population structure and lineages within it. Using the previously published data, we developed the Pneumococcal Genome Library (PGL), a curated dataset of 30 976 genomes and contextual data for carriage and disease pneumococci recovered between 1916 and 2018 in 82 countries. We leveraged the size and diversity of the PGL to develop a core genome multilocus sequence typing (cgMLST) scheme comprised of 1222 loci. Finally, using multilevel single-linkage clustering, we stratified pneumococci into hierarchical clusters based on allelic similarity thresholds and defined these with a taxonomic life identification number (LIN) barcoding system. The PGL, cgMLST scheme and LIN barcodes represent a high-quality genomic resource and fine-scale clustering approaches for the analysis of pneumococcal populations, which support the genomic epidemiology and surveillance of this leading global pathogen.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 8","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11321556/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141976130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Phylogenetic diversity of putative nickel-containing carbon monoxide dehydrogenase-encoding prokaryotes in the human gut microbiome. 人类肠道微生物群中假定的含镍一氧化碳脱氢酶编码原核生物的系统发育多样性。
IF 4 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-08-01 DOI: 10.1099/mgen.0.001285
Yuka Adachi Katayama, Ryoma Kamikawa, Takashi Yoshida

Although the production of carbon monoxide (CO) within the human body has been detected, only two CO-utilizing prokaryotes (CO utilizers) have been reported in the human gut. Therefore, the phylogenetic diversity of the human gut CO-utilizing prokaryotes remains unclear. Here, we unveiled more than a thousand representative genomes containing genes for putative nickel-containing CO dehydrogenase (pCODH), an essential enzyme for CO utilization. The taxonomy of genomes encoding pCODH was expanded to include 8 phyla, comprising 82 genera and 248 species. In contrast, putative molybdenum-containing CODH genes were not detected in the human gut microbial genomes. pCODH transcripts were detected in 97.3 % (n=110) of public metatranscriptome datasets derived from healthy human faeces, suggesting the ubiquitous presence of prokaryotes bearing transcriptionally active pCODH genes in the human gut. More than half of the pCODH-encoding genomes contain a set of genes for the autotrophic Wood-Ljungdahl pathway (WLP). However, 79 % of these genomes commonly lack a key gene for the WLP, which encodes the enzyme that synthesizes formate from CO2, suggesting that potential human gut CO-utilizing prokaryotes share a degenerated gene set for WLP. In the other half of the pCODH-encoding genomes, seven genes, including putative genes for flavin adenine dinucleotide-dependent NAD(P) oxidoreductase (FNOR), ABC transporter and Fe-hydrogenase, were found adjacent to the pCODH gene. None of the putative genes associated with CO-oxidizing respiratory machinery, such as energy-converting hydrogenase genes, were found in pCODH-encoding genomes. This suggests that the human gut CO utilization is not for CO removal, but potentially for fixation and/or biosynthesis, consistent with the harmless yet continuous production of CO in the human gut. Our findings reveal the diversity and distribution of prokaryotes with pCODH in the human gut microbiome, suggesting their potential contribution to microbial ecosystems in human gut environments.

虽然在人体内发现了一氧化碳(CO)的产生,但在人体肠道中只报道了两种利用 CO 的原核生物(CO 利用者)。因此,人体肠道一氧化碳利用原核生物的系统发育多样性仍不清楚。在这里,我们揭示了一千多个具有代表性的基因组,这些基因组含有推定的含镍一氧化碳脱氢酶(pCODH)基因,这是一种利用一氧化碳的重要酶。编码 pCODH 的基因组的分类范围扩大到 8 个门,包括 82 个属和 248 个种。与此相反,在人类肠道微生物基因组中没有检测到假定的含钼 CODH 基因。在来自健康人类粪便的公共元转录组数据集中,97.3%(n=110)的 pCODH 转录本被检测到,这表明人类肠道中普遍存在携带转录活性 pCODH 基因的原核生物。一半以上的 pCODH 编码基因组包含一组自养型伍德-荣格达尔途径(WLP)的基因。然而,这些基因组中有 79% 通常缺少 WLP 的一个关键基因,该基因编码从二氧化碳合成甲酸的酶,这表明潜在的人类肠道二氧化碳利用原核生物共享一套退化的 WLP 基因。在另一半编码 pCODH 的基因组中,发现有 7 个基因与 pCODH 基因相邻,其中包括黄素腺嘌呤二核苷酸依赖性 NAD(P)氧化还原酶(FNOR)、ABC 转运体和铁氢化酶的推定基因。在 pCODH 编码基因组中,没有发现与 CO 氧化呼吸机制相关的假定基因,如能量转换氢酶基因。这表明,人类肠道对一氧化碳的利用不是用于清除一氧化碳,而是可能用于固定和/或生物合成,这与人类肠道中无害但持续产生一氧化碳的情况是一致的。我们的研究结果揭示了人类肠道微生物组中具有 pCODH 的原核生物的多样性和分布情况,表明它们可能对人类肠道环境中的微生物生态系统做出贡献。
{"title":"Phylogenetic diversity of putative nickel-containing carbon monoxide dehydrogenase-encoding prokaryotes in the human gut microbiome.","authors":"Yuka Adachi Katayama, Ryoma Kamikawa, Takashi Yoshida","doi":"10.1099/mgen.0.001285","DOIUrl":"10.1099/mgen.0.001285","url":null,"abstract":"<p><p>Although the production of carbon monoxide (CO) within the human body has been detected, only two CO-utilizing prokaryotes (CO utilizers) have been reported in the human gut. Therefore, the phylogenetic diversity of the human gut CO-utilizing prokaryotes remains unclear. Here, we unveiled more than a thousand representative genomes containing genes for putative nickel-containing CO dehydrogenase (pCODH), an essential enzyme for CO utilization. The taxonomy of genomes encoding pCODH was expanded to include 8 phyla, comprising 82 genera and 248 species. In contrast, putative molybdenum-containing CODH genes were not detected in the human gut microbial genomes. pCODH transcripts were detected in 97.3 % (<i>n</i>=110) of public metatranscriptome datasets derived from healthy human faeces, suggesting the ubiquitous presence of prokaryotes bearing transcriptionally active pCODH genes in the human gut. More than half of the pCODH-encoding genomes contain a set of genes for the autotrophic Wood-Ljungdahl pathway (WLP). However, 79 % of these genomes commonly lack a key gene for the WLP, which encodes the enzyme that synthesizes formate from CO<sub>2</sub>, suggesting that potential human gut CO-utilizing prokaryotes share a degenerated gene set for WLP. In the other half of the pCODH-encoding genomes, seven genes, including putative genes for flavin adenine dinucleotide-dependent NAD(P) oxidoreductase (FNOR), ABC transporter and Fe-hydrogenase, were found adjacent to the pCODH gene. None of the putative genes associated with CO-oxidizing respiratory machinery, such as energy-converting hydrogenase genes, were found in pCODH-encoding genomes. This suggests that the human gut CO utilization is not for CO removal, but potentially for fixation and/or biosynthesis, consistent with the harmless yet continuous production of CO in the human gut. Our findings reveal the diversity and distribution of prokaryotes with pCODH in the human gut microbiome, suggesting their potential contribution to microbial ecosystems in human gut environments.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 8","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11338639/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142017965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Capturing clinically relevant Campylobacter attributes through direct whole genome sequencing of stool. 通过对粪便进行直接全基因组测序,捕捉与临床相关的弯曲杆菌属性。
IF 4 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-08-01 DOI: 10.1099/mgen.0.001284
Bilal Djeghout, Thanh Le-Viet, Leonardo de Oliveira Martins, George M Savva, Rhiannon Evans, David Baker, Andrew Page, Ngozi Elumogo, John Wain, Nicol Janecko

Campylobacter is the leading bacterial cause of infectious intestinal disease, but the pathogen typically accounts for a very small proportion of the overall stool microbiome in each patient. Diagnosis is even more difficult due to the fastidious nature of Campylobacter in the laboratory setting. This has, in part, driven a change in recent years, from culture-based to rapid PCR-based diagnostic assays which have improved diagnostic detection, whilst creating a knowledge gap in our clinical and epidemiological understanding of Campylobacter genotypes - no isolates to sequence. In this study, direct metagenomic sequencing approaches were used to assess the possibility of replacing genome sequences with metagenome sequences; metagenomic sequencing outputs were used to describe clinically relevant attributes of Campylobacter genotypes. A total of 37 diarrhoeal stool samples with Campylobacter and five samples with an unknown pathogen result were collected and processed with and without filtration, DNA was extracted, and metagenomes were sequenced by short-read sequencing. Culture-based methods were used to validate Campylobacter metagenome-derived genome (MDG) results. Sequence output metrics were assessed for Campylobacter genome quality and accuracy of characterization. Of the 42 samples passing quality checks for analysis, identification of Campylobacter to the genus and species level was dependent on Campylobacter genome read count, coverage and genome completeness. A total of 65% (24/37) of samples were reliably identified to the genus level through Campylobacter MDG, 73% (27/37) by culture and 97% (36/37) by qPCR. The Campylobacter genomes with a genome completeness of over 60% (n=21) were all accurately identified at the species level (100%). Of those, 72% (15/21) were identified to sequence types (STs), and 95% (20/21) accurately identified antimicrobial resistance (AMR) gene determinants. Filtration of stool samples enhanced Campylobacter MDG recovery and genome quality metrics compared to the corresponding unfiltered samples, which improved the identification of STs and AMR profiles. The phylogenetic analysis in this study demonstrated the clustering of the metagenome-derived with culture-derived genomes and revealed the reliability of genomes from direct stool sequencing. Furthermore, Campylobacter genome spiking percentages ranging from 0 to 2% total metagenome abundance in the ONT MinION sequencer, configured to adaptive sequencing, exhibited better assembly quality and accurate identification of STs, particularly in the analysis of metagenomes containing 2 and 1% of Campylobacter jejuni genomes. Direct sequencing of Campylobacter from stool samples provides clinically relevant and epidemiologically important genomic information without the reliance on cultured genomes.

弯曲状杆菌是引起肠道感染性疾病的主要细菌,但这种病原体通常只占每位患者粪便微生物群的很小一部分。由于弯曲菌在实验室环境中的耐受性很差,因此诊断更加困难。这在一定程度上促使近年来的诊断方法发生了变化,从基于培养的诊断方法转变为基于 PCR 的快速诊断方法,从而提高了诊断检测率,但同时也给我们对弯曲杆菌基因型的临床和流行病学了解造成了知识空白--没有分离物可测序。本研究采用直接元基因组测序方法来评估用元基因组测序取代基因组测序的可能性;元基因组测序结果用于描述弯曲杆菌基因型的临床相关属性。共收集了 37 份带有弯曲杆菌的腹泻粪便样本和 5 份病原体结果未知的样本,经过过滤和不过滤处理,提取了 DNA,并通过短线程测序对元基因组进行了测序。采用基于培养的方法验证弯曲杆菌元基因组(MDG)结果。对弯曲杆菌基因组质量和特征描述准确性的序列输出指标进行了评估。在通过质量检查的 42 份分析样本中,弯曲状杆菌的属种鉴定取决于弯曲状杆菌基因组读数数、覆盖率和基因组完整性。通过弯曲状杆菌 MDG,共有 65%(24/37)的样本被可靠地鉴定为属级;通过培养,73%(27/37)的样本被可靠地鉴定为种级,通过 qPCR,97%(36/37)的样本被可靠地鉴定为属级。基因组完整性超过 60% 的弯曲杆菌基因组(n=21)均被准确鉴定为种一级(100%)。其中,72%(15/21)的序列类型(STs)得到了鉴定,95%(20/21)的抗菌药耐药性(AMR)基因决定因子得到了准确鉴定。与未经过滤的相应样本相比,过滤粪便样本提高了弯曲杆菌 MDG 的回收率和基因组质量指标,从而改进了 STs 和 AMR 图谱的鉴定。本研究中的系统发育分析表明了元基因组与培养基基因组的聚类,并揭示了粪便直接测序基因组的可靠性。此外,在ONT MinION测序仪中,弯曲状杆菌基因组的加标百分比范围为元基因组总丰度的0%至2%,配置为自适应测序,可获得更好的组装质量和ST的准确鉴定,尤其是在分析含有2%和1%空肠弯曲杆菌基因组的元基因组时。直接对粪便样本中的弯曲杆菌进行测序,无需依赖培养基因组,就能提供与临床相关且具有流行病学意义的基因组信息。
{"title":"Capturing clinically relevant <i>Campylobacter</i> attributes through direct whole genome sequencing of stool.","authors":"Bilal Djeghout, Thanh Le-Viet, Leonardo de Oliveira Martins, George M Savva, Rhiannon Evans, David Baker, Andrew Page, Ngozi Elumogo, John Wain, Nicol Janecko","doi":"10.1099/mgen.0.001284","DOIUrl":"10.1099/mgen.0.001284","url":null,"abstract":"<p><p><i>Campylobacter</i> is the leading bacterial cause of infectious intestinal disease, but the pathogen typically accounts for a very small proportion of the overall stool microbiome in each patient. Diagnosis is even more difficult due to the fastidious nature of <i>Campylobacter</i> in the laboratory setting. This has, in part, driven a change in recent years, from culture-based to rapid PCR-based diagnostic assays which have improved diagnostic detection, whilst creating a knowledge gap in our clinical and epidemiological understanding of <i>Campylobacter</i> genotypes - no isolates to sequence. In this study, direct metagenomic sequencing approaches were used to assess the possibility of replacing genome sequences with metagenome sequences; metagenomic sequencing outputs were used to describe clinically relevant attributes of <i>Campylobacter</i> genotypes. A total of 37 diarrhoeal stool samples with <i>Campylobacter</i> and five samples with an unknown pathogen result were collected and processed with and without filtration, DNA was extracted, and metagenomes were sequenced by short-read sequencing. Culture-based methods were used to validate <i>Campylobacter</i> metagenome-derived genome (MDG) results. Sequence output metrics were assessed for <i>Campylobacter</i> genome quality and accuracy of characterization. Of the 42 samples passing quality checks for analysis, identification of <i>Campylobacter</i> to the genus and species level was dependent on <i>Campylobacter</i> genome read count, coverage and genome completeness. A total of 65% (24/37) of samples were reliably identified to the genus level through <i>Campylobacter</i> MDG, 73% (27/37) by culture and 97% (36/37) by qPCR. The <i>Campylobacter</i> genomes with a genome completeness of over 60% (<i>n</i>=21) were all accurately identified at the species level (100%). Of those, 72% (15/21) were identified to sequence types (STs), and 95% (20/21) accurately identified antimicrobial resistance (AMR) gene determinants. Filtration of stool samples enhanced <i>Campylobacter</i> MDG recovery and genome quality metrics compared to the corresponding unfiltered samples, which improved the identification of STs and AMR profiles. The phylogenetic analysis in this study demonstrated the clustering of the metagenome-derived with culture-derived genomes and revealed the reliability of genomes from direct stool sequencing. Furthermore, <i>Campylobacter</i> genome spiking percentages ranging from 0 to 2% total metagenome abundance in the ONT MinION sequencer, configured to adaptive sequencing, exhibited better assembly quality and accurate identification of STs, particularly in the analysis of metagenomes containing 2 and 1% of <i>Campylobacter jejuni</i> genomes. Direct sequencing of <i>Campylobacter</i> from stool samples provides clinically relevant and epidemiologically important genomic information without the reliance on cultured genomes.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 8","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11570993/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142109155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Complete genome of the Medicago anthracnose fungus, Colletotrichum destructivum, reveals a mini-chromosome-like region within a core chromosome. 麦迪格炭疽真菌的完整基因组(Colletotrichum destructivum)揭示了核心染色体中的一个类似小染色体的区域。
IF 4 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-08-01 DOI: 10.1099/mgen.0.001283
Nicolas Lapalu, Adeline Simon, Antoine Lu, Peter-Louis Plaumann, Joëlle Amselem, Sandrine Pigné, Annie Auger, Christian Koch, Jean-Félix Dallery, Richard J O'Connell

Colletotrichum destructivum (Cd) is a phytopathogenic fungus causing significant economic losses on forage legume crops (Medicago and Trifolium species) worldwide. To gain insights into the genetic basis of fungal virulence and host specificity, we sequenced the genome of an isolate from Medicago sativa using long-read (PacBio) technology. The resulting genome assembly has a total length of 51.7 Mb and comprises ten core chromosomes and two accessory chromosomes, all of which were sequenced from telomere to telomere. A total of 15, 631 gene models were predicted, including genes encoding potentially pathogenicity-related proteins such as candidate-secreted effectors (484), secondary metabolism key enzymes (110) and carbohydrate-active enzymes (619). Synteny analysis revealed extensive structural rearrangements in the genome of Cd relative to the closely related Brassicaceae pathogen, Colletotrichum higginsianum. In addition, a 1.2 Mb species-specific region was detected within the largest core chromosome of Cd that has all the characteristics of fungal accessory chromosomes (transposon-rich, gene-poor, distinct codon usage), providing evidence for exchange between these two genomic compartments. This region was also unique in having undergone extensive intra-chromosomal segmental duplications. Our findings provide insights into the evolution of accessory regions and possible mechanisms for generating genetic diversity in this asexual fungal pathogen.

破坏性轮枝霉菌(Colletotrichum destructivum,Cd)是一种植物病原真菌,对全世界的饲料豆科作物(Medicago 和 Trifolium 种)造成了巨大的经济损失。为了深入了解真菌毒力和宿主特异性的遗传基础,我们利用长线程(PacBio)技术对来自Medicago sativa的一个分离株进行了基因组测序。测序得到的基因组总长度为 51.7 Mb,由十条核心染色体和两条附属染色体组成,所有染色体都从端粒到端粒进行了测序。共预测出 15 631 个基因模型,包括编码潜在致病性相关蛋白的基因,如候选分泌效应物(484 个)、次级代谢关键酶(110 个)和碳水化合物活性酶(619 个)。合成分析表明,相对于密切相关的十字花科病原体 Colletotrichum higginsianum,Cd 的基因组发生了广泛的结构重排。此外,在 Cd 最大的核心染色体中发现了一个 1.2 Mb 的物种特异性区域,该区域具有真菌附属染色体的所有特征(转座子丰富、基因贫乏、密码子用法不同),为这两个基因组区之间的交换提供了证据。该区域还经历了广泛的染色体内片段复制,这一点也很独特。我们的研究结果为了解附属区的进化以及这种无性真菌病原体产生遗传多样性的可能机制提供了见解。
{"title":"Complete genome of the Medicago anthracnose fungus, <i>Colletotrichum destructivum</i>, reveals a mini-chromosome-like region within a core chromosome.","authors":"Nicolas Lapalu, Adeline Simon, Antoine Lu, Peter-Louis Plaumann, Joëlle Amselem, Sandrine Pigné, Annie Auger, Christian Koch, Jean-Félix Dallery, Richard J O'Connell","doi":"10.1099/mgen.0.001283","DOIUrl":"10.1099/mgen.0.001283","url":null,"abstract":"<p><p><i>Colletotrichum destructivum</i> (<i>Cd</i>) is a phytopathogenic fungus causing significant economic losses on forage legume crops (<i>Medicago</i> and <i>Trifolium</i> species) worldwide. To gain insights into the genetic basis of fungal virulence and host specificity, we sequenced the genome of an isolate from <i>Medicago sativa</i> using long-read (PacBio) technology. The resulting genome assembly has a total length of 51.7 Mb and comprises ten core chromosomes and two accessory chromosomes, all of which were sequenced from telomere to telomere. A total of 15, 631 gene models were predicted, including genes encoding potentially pathogenicity-related proteins such as candidate-secreted effectors (484), secondary metabolism key enzymes (110) and carbohydrate-active enzymes (619). Synteny analysis revealed extensive structural rearrangements in the genome of <i>Cd</i> relative to the closely related Brassicaceae pathogen, <i>Colletotrichum higginsianum</i>. In addition, a 1.2 Mb species-specific region was detected within the largest core chromosome of <i>Cd</i> that has all the characteristics of fungal accessory chromosomes (transposon-rich, gene-poor, distinct codon usage), providing evidence for exchange between these two genomic compartments. This region was also unique in having undergone extensive intra-chromosomal segmental duplications. Our findings provide insights into the evolution of accessory regions and possible mechanisms for generating genetic diversity in this asexual fungal pathogen.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 8","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11338638/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142017963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of gene-by-gene and genome-wide short nucleotide sequence-based approaches to define the global population structure of Streptococcus pneumoniae. 比较基于逐基因和全基因组短核苷酸序列的方法来确定肺炎链球菌的全球种群结构。
IF 4 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-08-01 DOI: 10.1099/mgen.0.001278
Alannah C King, Narender Kumar, Kate C Mellor, Paulina A Hawkins, Lesley McGee, Nicholas J Croucher, Stephen D Bentley, John A Lees, Stephanie W Lo

Defining the population structure of a pathogen is a key part of epidemiology, as genomically related isolates are likely to share key clinical features such as antimicrobial resistance profiles and invasiveness. Multiple different methods are currently used to cluster together closely related genomes, potentially leading to inconsistency between studies. Here, we use a global dataset of 26 306 Streptococcus pneumoniae genomes to compare four clustering methods: gene-by-gene seven-locus MLST, core genome MLST (cgMLST)-based hierarchical clustering (HierCC) assignments, life identification number (LIN) barcoding and k-mer-based PopPUNK clustering (known as GPSCs in this species). We compare the clustering results with phylogenetic and pan-genome analyses to assess their relationship with genome diversity and evolution, as we would expect a good clustering method to form a single monophyletic cluster that has high within-cluster similarity of genomic content. We show that the four methods are generally able to accurately reflect the population structure based on these metrics and that the methods were broadly consistent with each other. We investigated further to study the discrepancies in clusters. The greatest concordance was seen between LIN barcoding and HierCC (adjusted mutual information score=0.950), which was expected given that both methods utilize cgMLST, but have different methods for defining an individual cluster and different core genome schema. However, the existence of differences between the two methods shows that the selection of a core genome schema can introduce inconsistencies between studies. GPSC and HierCC assignments were also highly concordant (AMI=0.946), showing that k-mer-based methods which use the whole genome and do not require the careful selection of a core genome schema are just as effective at representing the population structure. Additionally, where there were differences in clustering between these methods, this could be explained by differences in the accessory genome that were not identified in cgMLST. We conclude that for S. pneumoniae, standardized and stable nomenclature is important as the number of genomes available expands. Furthermore, the research community should transition away from seven-locus MLST, whilst cgMLST, GPSC and LIN assignments should be used more widely. However, to allow for easy comparison between studies and to make previous literature relevant, the reporting of multiple clustering names should be standardized within the research.

确定病原体的种群结构是流行病学的一个关键部分,因为基因组相关的分离株很可能具有共同的关键临床特征,如抗菌药耐药性和侵袭性。目前有多种不同的方法用于聚类密切相关的基因组,这可能会导致研究之间的不一致性。在这里,我们使用一个包含 26 306 个肺炎链球菌基因组的全球数据集来比较四种聚类方法:逐基因七焦点 MLST、基于核心基因组 MLST (cgMLST) 的分层聚类 (HierCC) 分配、生命识别码 (LIN) 条形码和基于 k 聚合体的 PopPUNK 聚类(在该物种中称为 GPSC)。我们将聚类结果与系统进化分析和泛基因组分析进行比较,以评估它们与基因组多样性和进化之间的关系,因为我们期望一种好的聚类方法能形成一个单一的单系群,该群内的基因组内容具有高度的相似性。我们的研究表明,基于这些指标,四种方法一般都能准确反映种群结构,而且方法之间大体一致。我们进一步研究了聚类中的差异。LIN 条形编码与 HierCC 的一致性最高(调整后互信息得分=0.950),这在意料之中,因为这两种方法都使用了 cgMLST,但定义单个聚类的方法不同,核心基因组模式也不同。然而,两种方法之间存在的差异表明,核心基因组模式的选择可能会导致不同研究之间的不一致。GPSC 和 HierCC 分配也高度一致(AMI=0.946),这表明基于 k-mer 的方法使用了全基因组,不需要仔细选择核心基因组模式,在表现群体结构方面同样有效。此外,如果这些方法之间存在聚类差异,这可能是由于 cgMLST 中未识别出的附属基因组的差异造成的。我们的结论是,随着可用基因组数量的增加,标准化和稳定的命名法对肺炎双球菌非常重要。此外,研究界应放弃七焦点 MLST,而更广泛地使用 cgMLST、GPSC 和 LIN 分配。不过,为了便于研究之间的比较,并使以前的文献具有相关性,应在研究中规范多重聚类名称的报告。
{"title":"Comparison of gene-by-gene and genome-wide short nucleotide sequence-based approaches to define the global population structure of <i>Streptococcus pneumoniae</i>.","authors":"Alannah C King, Narender Kumar, Kate C Mellor, Paulina A Hawkins, Lesley McGee, Nicholas J Croucher, Stephen D Bentley, John A Lees, Stephanie W Lo","doi":"10.1099/mgen.0.001278","DOIUrl":"10.1099/mgen.0.001278","url":null,"abstract":"<p><p>Defining the population structure of a pathogen is a key part of epidemiology, as genomically related isolates are likely to share key clinical features such as antimicrobial resistance profiles and invasiveness. Multiple different methods are currently used to cluster together closely related genomes, potentially leading to inconsistency between studies. Here, we use a global dataset of 26 306 <i>Streptococcus pneumoniae</i> genomes to compare four clustering methods: gene-by-gene seven-locus MLST, core genome MLST (cgMLST)-based hierarchical clustering (HierCC) assignments, life identification number (LIN) barcoding and k-mer-based PopPUNK clustering (known as GPSCs in this species). We compare the clustering results with phylogenetic and pan-genome analyses to assess their relationship with genome diversity and evolution, as we would expect a good clustering method to form a single monophyletic cluster that has high within-cluster similarity of genomic content. We show that the four methods are generally able to accurately reflect the population structure based on these metrics and that the methods were broadly consistent with each other. We investigated further to study the discrepancies in clusters. The greatest concordance was seen between LIN barcoding and HierCC (adjusted mutual information score=0.950), which was expected given that both methods utilize cgMLST, but have different methods for defining an individual cluster and different core genome schema. However, the existence of differences between the two methods shows that the selection of a core genome schema can introduce inconsistencies between studies. GPSC and HierCC assignments were also highly concordant (AMI=0.946), showing that k-mer-based methods which use the whole genome and do not require the careful selection of a core genome schema are just as effective at representing the population structure. Additionally, where there were differences in clustering between these methods, this could be explained by differences in the accessory genome that were not identified in cgMLST. We conclude that for <i>S. pneumoniae</i>, standardized and stable nomenclature is important as the number of genomes available expands. Furthermore, the research community should transition away from seven-locus MLST, whilst cgMLST, GPSC and LIN assignments should be used more widely. However, to allow for easy comparison between studies and to make previous literature relevant, the reporting of multiple clustering names should be standardized within the research.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 8","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11353345/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142080741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of novel orthoparamyxoviruses, orthonairoviruses and an orthohepevirus in European white-toothed shrews. 在欧洲白齿鼩体内检测到新型正巴拉米哥病毒、正奈洛韦病毒和一种正河病毒。
IF 4 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-08-01 DOI: 10.1099/mgen.0.001275
Viola C Haring, Benedikt Litz, Jens Jacob, Michael Brecht, Markus Bauswein, Julia Sehl-Ewert, Marta Heroldova, Claudia Wylezich, Donata Hoffmann, Rainer G Ulrich, Martin Beer, Florian Pfaff

While the viromes and immune systems of bats and rodents have been extensively studied, comprehensive data are lacking for insectivores (order Eulipotyphla) despite their wide geographic distribution. Anthropogenic land use and outdoor recreational activities, as well as changes in the range of shrews, may lead to an expansion of the human-shrew interface with the risk of spillover infections, as reported for Borna disease virus 1. We investigated the virome of 45 individuals of 4 white-toothed shrew species present in Europe, using metagenomic RNA sequencing of tissue and intestine pools. Moderate to high abundances of sequences related to the families Paramyxoviridae, Nairoviridae, Hepeviridae and Bornaviridae were detected. Whole genomes were determined for novel orthoparamyxoviruses (n=3), orthonairoviruses (n=2) and an orthohepevirus. The novel paramyxovirus, tentatively named Hasua virus, was phylogenetically related to the zoonotic Langya virus and Mòjiāng virus. The novel orthonairoviruses, along with the potentially zoonotic Erve virus, fall within the shrew-borne Thiafora virus genogroup. The highest viral RNA loads of orthoparamyxoviruses were detected in the kidneys, in well-perfused organs for orthonairoviruses and in the liver and intestine for orthohepevirus, indicating potential transmission routes. Notably, several shrews were found to be coinfected with viruses from different families. Our study highlights the virus diversity present in shrews, not only in biodiversity-rich regions but also in areas influenced by human activity. This study warrants further research to characterize and assess the clinical implications and risk of these viruses and the importance of shrews as reservoirs in European ecosystems.

蝙蝠和啮齿类动物的病毒体和免疫系统已被广泛研究,但食虫目动物(Eulipotyphla)尽管地理分布广泛,却缺乏全面的数据。人类活动造成的土地使用和户外休闲活动以及鼩鼱分布范围的变化可能会导致人鼩接触面的扩大,从而带来外溢感染的风险,正如 Borna 病病毒 1 的报道所指出的那样。我们利用组织和肠道池的元基因组 RNA 测序,研究了欧洲 4 种白齿鼩 45 个个体的病毒组。我们检测到了与副粘病毒科、奈洛病毒科、肝病毒科和博纳病毒科相关的中度到高度丰富的序列。对新型正副粘病毒(3 种)、正奈洛病毒(2 种)和一种正河病毒进行了全基因组测定。新型副粘病毒(暂定名为Hasua病毒)与人畜共患的琅琊病毒和Mòjiāng病毒在系统发育上有亲缘关系。新型正粘病毒与可能人畜共患病的埃尔韦病毒同属鼩鼱传播的Thiafora病毒基因组。在肾脏和灌注良好的器官中检测到的正粘病毒的病毒 RNA 量最高,而在肝脏和肠道中检测到的正热病毒的病毒 RNA 量最高,这表明了潜在的传播途径。值得注意的是,有几只鼩鼱同时感染了不同科的病毒。我们的研究凸显了鼩鼱体内病毒的多样性,这种多样性不仅存在于生物多样性丰富的地区,也存在于受人类活动影响的地区。这项研究值得进一步研究,以确定和评估这些病毒的临床影响和风险,以及鼩鼱作为欧洲生态系统中的病毒库的重要性。
{"title":"Detection of novel orthoparamyxoviruses, orthonairoviruses and an orthohepevirus in European white-toothed shrews.","authors":"Viola C Haring, Benedikt Litz, Jens Jacob, Michael Brecht, Markus Bauswein, Julia Sehl-Ewert, Marta Heroldova, Claudia Wylezich, Donata Hoffmann, Rainer G Ulrich, Martin Beer, Florian Pfaff","doi":"10.1099/mgen.0.001275","DOIUrl":"10.1099/mgen.0.001275","url":null,"abstract":"<p><p>While the viromes and immune systems of bats and rodents have been extensively studied, comprehensive data are lacking for insectivores (order Eulipotyphla) despite their wide geographic distribution. Anthropogenic land use and outdoor recreational activities, as well as changes in the range of shrews, may lead to an expansion of the human-shrew interface with the risk of spillover infections, as reported for Borna disease virus 1. We investigated the virome of 45 individuals of 4 white-toothed shrew species present in Europe, using metagenomic RNA sequencing of tissue and intestine pools. Moderate to high abundances of sequences related to the families <i>Paramyxoviridae</i>, <i>Nairoviridae</i>, <i>Hepeviridae</i> and <i>Bornaviridae</i> were detected. Whole genomes were determined for novel orthoparamyxoviruses (<i>n</i>=3), orthonairoviruses (<i>n</i>=2) and an orthohepevirus. The novel paramyxovirus, tentatively named Hasua virus, was phylogenetically related to the zoonotic Langya virus and Mòjiāng virus. The novel orthonairoviruses, along with the potentially zoonotic Erve virus, fall within the shrew-borne Thiafora virus genogroup. The highest viral RNA loads of orthoparamyxoviruses were detected in the kidneys, in well-perfused organs for orthonairoviruses and in the liver and intestine for orthohepevirus, indicating potential transmission routes. Notably, several shrews were found to be coinfected with viruses from different families. Our study highlights the virus diversity present in shrews, not only in biodiversity-rich regions but also in areas influenced by human activity. This study warrants further research to characterize and assess the clinical implications and risk of these viruses and the importance of shrews as reservoirs in European ecosystems.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 8","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11293873/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141860203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corrigendum: Identification of genes influencing the evolution of Escherichia coli ST372 in dogs and humans. 更正:影响狗和人体内大肠杆菌 ST372 演化的基因鉴定。
IF 5.4 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-08-01 DOI: 10.1099/mgen.0.001286
Paarthiphan Elankumaran, Glenn F Browning, Marc S Marenda, Amanda Kidsley, Marwan Osman, Marisa Haenni, James R Johnson, Darren J Trott, Cameron J Reid, Steven P Djordjevic
{"title":"Corrigendum: Identification of genes influencing the evolution of <i>Escherichia coli</i> ST372 in dogs and humans.","authors":"Paarthiphan Elankumaran, Glenn F Browning, Marc S Marenda, Amanda Kidsley, Marwan Osman, Marisa Haenni, James R Johnson, Darren J Trott, Cameron J Reid, Steven P Djordjevic","doi":"10.1099/mgen.0.001286","DOIUrl":"10.1099/mgen.0.001286","url":null,"abstract":"","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 8","pages":""},"PeriodicalIF":5.4,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561579/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142017964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Microbial Genomics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1