Pangenome mining of the Streptomyces genus redefines species’ biosynthetic potential

IF 10.1 | Zone 1 (Biology) | JCR Q1 (Biotechnology & Applied Microbiology) | Genome Biology | Pub Date: 2025-01-14 | DOI: 10.1186/s13059-024-03471-9
Omkar S. Mohite, Tue S. Jørgensen, Thomas J. Booth, Pep Charusanti, Patrick V. Phaneuf, Tilmann Weber, Bernhard O. Palsson
Citation count: 0

Abstract

Streptomyces is a highly diverse genus known for the production of secondary or specialized metabolites with a wide range of applications in the medical and agricultural industries. Several thousand complete or nearly complete Streptomyces genome sequences are now available, affording the opportunity to deeply investigate the biosynthetic potential within these organisms and to advance natural product discovery initiatives. We perform pangenome analysis on 2371 Streptomyces genomes, including approximately 1200 complete assemblies. Employing a data-driven approach based on genome similarities, the Streptomyces genus was classified into 7 primary and 42 secondary Mash-clusters, forming the basis for comprehensive pangenome mining. A refined workflow for grouping biosynthetic gene clusters (BGCs) redefines their diversity across different Mash-clusters. This workflow also reassigns 2729 known BGC families to only 440 families, a reduction caused by inaccuracies in BGC boundary detections. When the genomic location of BGCs is included in the analysis, a conserved genomic structure, or synteny, among BGCs becomes apparent within species and Mash-clusters. This synteny suggests that vertical inheritance is a major factor in the diversification of BGCs. Our analysis of a genomic dataset at a scale of thousands of genomes refines predictions of BGC diversity using Mash-clusters as a basis for pangenome analysis. The observed conservation in the order of BGCs’ genomic locations shows that the BGCs are vertically inherited. The presented workflow and the in-depth analysis pave the way for large-scale pangenome investigations and enhance our understanding of the biosynthetic potential of the Streptomyces genus.
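
The two-level grouping described in the abstract (primary and secondary Mash-clusters) can be pictured with a small sketch. The abstract does not state the clustering algorithm or the distance cut-offs, so everything below is an assumption made for illustration: pairwise genome distances are taken as already computed, genomes are grouped by single linkage at a cut-off, and the cut-offs (0.2 and 0.05), the genome names, and the function names `single_linkage` and `two_level_clusters` are hypothetical.

```python
from itertools import combinations


def single_linkage(names, dist, threshold):
    """Link every pair whose distance is at or below `threshold`.

    Clusters are the connected components of those links.
    Illustrative only: the clustering method used in the paper is not
    given in the abstract.
    """
    parent = {n: n for n in names}

    def find(x):
        # Follow parent links to the cluster representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(names, 2):
        if dist[frozenset((a, b))] <= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for n in names:
        clusters.setdefault(find(n), []).append(n)
    return sorted(sorted(members) for members in clusters.values())


def two_level_clusters(names, dist, primary_cutoff, secondary_cutoff):
    """Map each primary cluster (as a tuple) to the secondary clusters inside it."""
    result = {}
    for primary in single_linkage(names, dist, primary_cutoff):
        result[tuple(primary)] = single_linkage(primary, dist, secondary_cutoff)
    return result


# Hypothetical toy distances between four made-up genomes.
genomes = ["g1", "g2", "g3", "g4"]
toy = {
    ("g1", "g2"): 0.02,
    ("g1", "g3"): 0.15,
    ("g2", "g3"): 0.16,
    ("g1", "g4"): 0.40,
    ("g2", "g4"): 0.41,
    ("g3", "g4"): 0.39,
}
distances = {frozenset(pair): d for pair, d in toy.items()}

hierarchy = two_level_clusters(
    genomes, distances, primary_cutoff=0.2, secondary_cutoff=0.05
)
for primary, secondaries in hierarchy.items():
    print(primary, "->", secondaries)
```

With the toy values, g1, g2, and g3 fall into one primary cluster that splits into two secondary clusters, while g4 forms its own primary cluster. A looser primary cut-off merges more genomes at the top level; a tighter secondary cut-off produces finer subgroups within each primary cluster.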
Source journal: Genome Biology
Subject area: Biochemistry, Genetics and Molecular Biology (Genetics)
CiteScore: 21.00
Self-citation rate: 3.30%
Articles published: 241
Review time: 2 months
About the journal: Genome Biology stands as a premier platform for exceptional research across all domains of biology and biomedicine, explored through a genomic and post-genomic lens. With an impact factor of 12.3 (2022), the journal is the 3rd-ranked research journal in the Genetics and Heredity category and the 2nd-ranked research journal in the Biotechnology and Applied Microbiology category by Thomson Reuters. Genome Biology is also the highest-ranked open-access journal in this category. Its team of in-house Editors works closely with an Editorial Board of international experts to keep the journal at the forefront of scientific advances and community standards. Regular engagement with researchers at conferences and institute visits keeps the journal abreast of the latest developments in the field.