{"title":"揭示基因组组装对细菌分型的影响:一个健康的视角。","authors":"Déborah Merda, Meryl Vila-Nova, Mathilde Bonis, Anne-Laure Boutigny, Thomas Brauge, Marina Cavaiuolo, Amandine Cunty, Antoine Regnier, Maroua Sayeb, Noémie Vingadassalon, Claire Yvon, Virginie Chesnais","doi":"10.1186/s12864-024-10982-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>In the context of pathogen surveillance, it is crucial to ensure interoperability and harmonized data. Several surveillance systems are designed to compare bacteria and identify outbreak clusters based on core genome MultiLocus Sequence Typing (cgMLST). Among the different approaches available to generate bacterial cgMLST, our research used an assembly-based approach (chewBBACA tool).</p><p><strong>Methods: </strong>Simulations of short-read sequencing were conducted for 5 genomes of 27 pathogens of interest in animal, plant, and human health to evaluate the repeatability and reproducibility of cgMLST. Various quality parameters, such as read quality and depth of sequencing were applied, and several read simulations and genome assemblies were repeated using three tools: SPAdes, Unicycler and Shovill. In vitro sequencing were also used to evaluate assembly impact on cgMLST results, for six bacterial species: Bacillus thuringiensis, Listeria monocytogenes, Salmonella enterica, Staphylococcus aureus, Vibrio parahaemolyticus and Xylella fastidiosa.</p><p><strong>Results: </strong>The results highlighted variability in cgMLST, which not only related to the assembly tools, but also induced by the intrinsic composition of the genomes themselves. This variability observed in simulated sequencing was further validated with real data for six of the bacterial pathogens studied.</p><p><strong>Conclusion: </strong>This highlights that the intrinsic genome composition affects assembly and resulting cgMLST profiles, and that variability in bioinformatics tools can induce a bias in cgMLST profiles. In conclusion, we propose that the completeness of cgMLST schemes should be considered when clustering strains.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":"25 1","pages":"1059"},"PeriodicalIF":3.5000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11545336/pdf/","citationCount":"0","resultStr":"{\"title\":\"Unraveling the impact of genome assembly on bacterial typing: a one health perspective.\",\"authors\":\"Déborah Merda, Meryl Vila-Nova, Mathilde Bonis, Anne-Laure Boutigny, Thomas Brauge, Marina Cavaiuolo, Amandine Cunty, Antoine Regnier, Maroua Sayeb, Noémie Vingadassalon, Claire Yvon, Virginie Chesnais\",\"doi\":\"10.1186/s12864-024-10982-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>In the context of pathogen surveillance, it is crucial to ensure interoperability and harmonized data. Several surveillance systems are designed to compare bacteria and identify outbreak clusters based on core genome MultiLocus Sequence Typing (cgMLST). Among the different approaches available to generate bacterial cgMLST, our research used an assembly-based approach (chewBBACA tool).</p><p><strong>Methods: </strong>Simulations of short-read sequencing were conducted for 5 genomes of 27 pathogens of interest in animal, plant, and human health to evaluate the repeatability and reproducibility of cgMLST. Various quality parameters, such as read quality and depth of sequencing were applied, and several read simulations and genome assemblies were repeated using three tools: SPAdes, Unicycler and Shovill. In vitro sequencing were also used to evaluate assembly impact on cgMLST results, for six bacterial species: Bacillus thuringiensis, Listeria monocytogenes, Salmonella enterica, Staphylococcus aureus, Vibrio parahaemolyticus and Xylella fastidiosa.</p><p><strong>Results: </strong>The results highlighted variability in cgMLST, which not only related to the assembly tools, but also induced by the intrinsic composition of the genomes themselves. This variability observed in simulated sequencing was further validated with real data for six of the bacterial pathogens studied.</p><p><strong>Conclusion: </strong>This highlights that the intrinsic genome composition affects assembly and resulting cgMLST profiles, and that variability in bioinformatics tools can induce a bias in cgMLST profiles. In conclusion, we propose that the completeness of cgMLST schemes should be considered when clustering strains.</p>\",\"PeriodicalId\":9030,\"journal\":{\"name\":\"BMC Genomics\",\"volume\":\"25 1\",\"pages\":\"1059\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2024-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11545336/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Genomics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s12864-024-10982-z\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOTECHNOLOGY & APPLIED MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12864-024-10982-z","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
Unraveling the impact of genome assembly on bacterial typing: a one health perspective.
Background: In the context of pathogen surveillance, it is crucial to ensure interoperability and harmonized data. Several surveillance systems are designed to compare bacteria and identify outbreak clusters based on core genome MultiLocus Sequence Typing (cgMLST). Among the different approaches available to generate bacterial cgMLST, our research used an assembly-based approach (chewBBACA tool).
Methods: Simulations of short-read sequencing were conducted for 5 genomes of 27 pathogens of interest in animal, plant, and human health to evaluate the repeatability and reproducibility of cgMLST. Various quality parameters, such as read quality and depth of sequencing were applied, and several read simulations and genome assemblies were repeated using three tools: SPAdes, Unicycler and Shovill. In vitro sequencing were also used to evaluate assembly impact on cgMLST results, for six bacterial species: Bacillus thuringiensis, Listeria monocytogenes, Salmonella enterica, Staphylococcus aureus, Vibrio parahaemolyticus and Xylella fastidiosa.
Results: The results highlighted variability in cgMLST, which not only related to the assembly tools, but also induced by the intrinsic composition of the genomes themselves. This variability observed in simulated sequencing was further validated with real data for six of the bacterial pathogens studied.
Conclusion: This highlights that the intrinsic genome composition affects assembly and resulting cgMLST profiles, and that variability in bioinformatics tools can induce a bias in cgMLST profiles. In conclusion, we propose that the completeness of cgMLST schemes should be considered when clustering strains.
期刊介绍:
BMC Genomics is an open access, peer-reviewed journal that considers articles on all aspects of genome-scale analysis, functional genomics, and proteomics.
BMC Genomics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.