zDB: bacterial comparative genomics made easy.

IF 5 2区 生物学 Q1 MICROBIOLOGY mSystems Pub Date : 2024-07-23 Epub Date: 2024-06-28 DOI:10.1128/msystems.00473-24
Bastian Marquis, Trestan Pillonel, Alessia Carrara, Claire Bertelli
{"title":"zDB: bacterial comparative genomics made easy.","authors":"Bastian Marquis, Trestan Pillonel, Alessia Carrara, Claire Bertelli","doi":"10.1128/msystems.00473-24","DOIUrl":null,"url":null,"abstract":"<p><p>The analysis and comparison of genomes rely on different tools for tasks such as annotation, orthology prediction, and phylogenetic inference. Most tools are specialized for a single task, and additional efforts are necessary to integrate and visualize the results. To fill this gap, we developed zDB, an application integrating a Nextflow analysis pipeline and a Python visualization platform built on the Django framework. The application is available on GitHub (https://github.com/metagenlab/zDB) and from the bioconda channel. Starting from annotated Genbank files, zDB identifies orthologs and infers a phylogeny for each orthogroup. A species phylogeny is also constructed from shared single-copy orthologs. The results can be enriched with Pfam protein domain prediction, Cluster of Orthologs Genes and Kyoto Encyclopedia of Genes and Genomes annotations, and Swissprot homologs. The web application allows searching for specific genes or annotations, running Blast queries, and comparing genomic regions and whole genomes. The metabolic capacities of organisms can be compared at either the module or pathway levels. Finally, users can run queries to examine the conservation of specific genes or annotations across a chosen subset of genomes and display the results as a list of genes, Venn diagram, or heatmaps. Those features make zDB useful for both bioinformaticians and researchers more accustomed to laboratory research.IMPORTANCEGenome comparison and analysis rely on many independent tools, leaving to scientists the burden to integrate and visualize their results for interpretation. To alleviate this burden, we have built zDB, a comparative genomics tool that includes both an analysis pipeline and a visualization platform. The analysis pipeline automates gene annotation, orthology prediction, and phylogenetic inference, while the visualization platform allows scientists to easily explore the results in a web browser. Among other features, the interface allows users to visually compare whole genomes and targeted regions, assess the conservation of genes or metabolic pathways, perform Blast searches, or look for specific annotations. Altogether, this tool will be useful for a broad range of applications in comparative studies between two and hundred genomes. Furthermore, it is designed to allow sharing of data sets easily at a local or international scale, thereby supporting exploratory analyses for non-bioinformaticians on the genome of their favorite organisms.</p>","PeriodicalId":18819,"journal":{"name":"mSystems","volume":null,"pages":null},"PeriodicalIF":5.0000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11264898/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"mSystems","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/msystems.00473-24","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/6/28 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The analysis and comparison of genomes rely on different tools for tasks such as annotation, orthology prediction, and phylogenetic inference. Most tools are specialized for a single task, and additional efforts are necessary to integrate and visualize the results. To fill this gap, we developed zDB, an application integrating a Nextflow analysis pipeline and a Python visualization platform built on the Django framework. The application is available on GitHub (https://github.com/metagenlab/zDB) and from the bioconda channel. Starting from annotated Genbank files, zDB identifies orthologs and infers a phylogeny for each orthogroup. A species phylogeny is also constructed from shared single-copy orthologs. The results can be enriched with Pfam protein domain prediction, Cluster of Orthologs Genes and Kyoto Encyclopedia of Genes and Genomes annotations, and Swissprot homologs. The web application allows searching for specific genes or annotations, running Blast queries, and comparing genomic regions and whole genomes. The metabolic capacities of organisms can be compared at either the module or pathway levels. Finally, users can run queries to examine the conservation of specific genes or annotations across a chosen subset of genomes and display the results as a list of genes, Venn diagram, or heatmaps. Those features make zDB useful for both bioinformaticians and researchers more accustomed to laboratory research.IMPORTANCEGenome comparison and analysis rely on many independent tools, leaving to scientists the burden to integrate and visualize their results for interpretation. To alleviate this burden, we have built zDB, a comparative genomics tool that includes both an analysis pipeline and a visualization platform. The analysis pipeline automates gene annotation, orthology prediction, and phylogenetic inference, while the visualization platform allows scientists to easily explore the results in a web browser. Among other features, the interface allows users to visually compare whole genomes and targeted regions, assess the conservation of genes or metabolic pathways, perform Blast searches, or look for specific annotations. Altogether, this tool will be useful for a broad range of applications in comparative studies between two and hundred genomes. Furthermore, it is designed to allow sharing of data sets easily at a local or international scale, thereby supporting exploratory analyses for non-bioinformaticians on the genome of their favorite organisms.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
zDB:细菌比较基因组学变得简单。
基因组的分析和比较依赖于不同的工具,如注释、选集预测和系统发育推断。大多数工具都是针对单一任务的专用工具,因此需要额外的努力来整合和可视化结果。为了填补这一空白,我们开发了zDB,一个集成了Next-flow分析管道和基于Django框架的Python可视化平台的应用程序。该应用程序可在 GitHub (https://github.com/metagenlab/zDB) 和 bioconda 频道上下载。从注释的 Genbank 文件开始,zDB 可识别直向同源物,并推断出每个直向群的系统发生。还可从共享的单拷贝直向同源物中构建物种系统发育。研究结果可通过 Pfam 蛋白结构域预测、同源物基因簇和京都基因和基因组百科全书注释以及 Swissprot 同源物进行丰富。该网络应用程序允许搜索特定基因或注释,运行 Blast 查询,以及比较基因组区域和整个基因组。生物体的代谢能力可在模块或通路水平上进行比较。最后,用户可以运行查询来检查特定基因或注释在所选基因组子集中的保护情况,并以基因列表、维恩图或热图的形式显示结果。这些功能使 zDB 对生物信息学家和更习惯于实验室研究的研究人员都很有用。重要意义基因组比较和分析依赖于许多独立的工具,这就给科学家们留下了整合和可视化其结果以进行解释的负担。为了减轻这一负担,我们开发了 zDB,这是一款比较基因组学工具,包括分析管道和可视化平台。分析管道可自动进行基因注释、正选预测和系统发育推断,而可视化平台则可让科学家在网络浏览器中轻松探索结果。除其他功能外,该界面还允许用户直观地比较全基因组和目标区域、评估基因或代谢途径的保护情况、执行爆炸搜索或查找特定注释。总之,该工具将在两个或上百个基因组之间的比较研究中发挥广泛的作用。此外,它的设计还允许在本地或国际范围内轻松共享数据集,从而支持非生物信息学家对他们最喜欢的生物的基因组进行探索性分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
mSystems
mSystems Biochemistry, Genetics and Molecular Biology-Biochemistry
CiteScore
10.50
自引率
3.10%
发文量
308
审稿时长
13 weeks
期刊介绍: mSystems™ will publish preeminent work that stems from applying technologies for high-throughput analyses to achieve insights into the metabolic and regulatory systems at the scale of both the single cell and microbial communities. The scope of mSystems™ encompasses all important biological and biochemical findings drawn from analyses of large data sets, as well as new computational approaches for deriving these insights. mSystems™ will welcome submissions from researchers who focus on the microbiome, genomics, metagenomics, transcriptomics, metabolomics, proteomics, glycomics, bioinformatics, and computational microbiology. mSystems™ will provide streamlined decisions, while carrying on ASM''s tradition of rigorous peer review.
期刊最新文献
Announcing the mSystems special collection on microbial dormancy L-tryptophan and copper interactions linked to reduced colibactin genotoxicity in pks+ Escherichia coli Characterization of the carbapenem-resistant Acinetobacter baumannii clinical reference isolate BAL062 (CC2:KL58:OCL1): resistance properties and capsular polysaccharide structure The occurrence of Aerococcus urinaeequi and non-aureus staphylococci in raw milk negatively correlates with Escherichia coli clinical mastitis The predicted secreted proteome of activated sludge microorganisms indicates distinct nutrient niches
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1