MetaComBin: combining abundances and overlaps for binning metagenomics reads.

IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Frontiers in bioinformatics Pub Date : 2025-03-03 eCollection Date: 2025-01-01 DOI:10.3389/fbinf.2025.1504728
Francesco Tomasella, Cinzia Pizzi
{"title":"MetaComBin: combining abundances and overlaps for binning metagenomics reads.","authors":"Francesco Tomasella, Cinzia Pizzi","doi":"10.3389/fbinf.2025.1504728","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Metagenomics is the discipline that studies heterogeneous microbial samples extracted directly from their natural environment, for example, from soil, water, or the human body. The detection and quantification of species that populate microbial communities have been the subject of many recent studies based on classification and clustering, motivated by being the first step in more complex pipelines (e.g., for functional analysis, de novo assembly, or comparison of metagenomes). Metagenomics has an impact on both environmental studies and precision medicine; thus, it is crucial to improve the quality of species identification through computational tools.</p><p><strong>Methods: </strong>In this paper, we explore the idea of improving the overall quality of metagenomics binning at the read level by proposing a computational framework that sequentially combines two complementary read-binning approaches: one based on species abundance determination and another one relying on read overlap in order to cluster reads together. We called this approach MetaComBin (metagenomics combined binning).</p><p><strong>Results and discussion: </strong>The results of our experiments with the MetaComBin approach showed that the combination of two tools, based on different approaches, can improve the clustering quality in realistic conditions where the number of species is not known beforehand.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1504728"},"PeriodicalIF":2.8000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11912761/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fbinf.2025.1504728","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Metagenomics is the discipline that studies heterogeneous microbial samples extracted directly from their natural environment, for example, from soil, water, or the human body. The detection and quantification of species that populate microbial communities have been the subject of many recent studies based on classification and clustering, motivated by being the first step in more complex pipelines (e.g., for functional analysis, de novo assembly, or comparison of metagenomes). Metagenomics has an impact on both environmental studies and precision medicine; thus, it is crucial to improve the quality of species identification through computational tools.

Methods: In this paper, we explore the idea of improving the overall quality of metagenomics binning at the read level by proposing a computational framework that sequentially combines two complementary read-binning approaches: one based on species abundance determination and another one relying on read overlap in order to cluster reads together. We called this approach MetaComBin (metagenomics combined binning).

Results and discussion: The results of our experiments with the MetaComBin approach showed that the combination of two tools, based on different approaches, can improve the clustering quality in realistic conditions where the number of species is not known beforehand.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
简介元基因组学是一门研究直接从自然环境(如土壤、水或人体)中提取的异质微生物样本的学科。微生物群落中物种的检测和定量是最近许多基于分类和聚类的研究的主题,其动机是作为更复杂管道的第一步(如功能分析、从头组装或元基因组比较)。元基因组学对环境研究和精准医疗都有影响;因此,通过计算工具提高物种鉴定的质量至关重要:在本文中,我们提出了一种计算框架,将两种互补的读数分选方法依次结合起来,以提高元基因组学在读数水平上的整体分选质量,其中一种方法基于物种丰度确定,另一种方法则依靠读数重叠将读数聚类。我们称这种方法为 MetaComBin(元基因组学组合分选):MetaComBin 方法的实验结果表明,在物种数量事先未知的现实条件下,基于不同方法的两种工具的组合可以提高聚类质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.60
自引率
0.00%
发文量
0
期刊最新文献
MetaComBin: combining abundances and overlaps for binning metagenomics reads. Reliable machine learning models in genomic medicine using conformal prediction. Phytochemical baicalin potentially inhibits Bcl-2 and VEGF: an in silico approach. Unlocking the potential of in silico approach in designing antibodies against SARS-CoV-2. Seurat function argument values in scRNA-seq data analysis: potential pitfalls and refinements for biological interpretation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1