{"title":"An Improved Method for Assembling Metagenomic Sequences","authors":"Nguyen Phi Khanh, L. Vinh","doi":"10.1109/ICSSE58758.2023.10227158","DOIUrl":null,"url":null,"abstract":"Metagenomic assembly is a crucial step in analyzing pipelines of microbial communities. It allows to reconstruct contiguous consensus sequences (contigs) from mixed reads of multiple species. Meta-IDBA is one of the de novo assemblers which applies the de Bruijn graph algorithm to assemble metagenomic data without reference genomes. However, errors occurring after the graph creation are not well corrected, and thus result to low accuracy of the approach. This study proposes an improved method that tackles the problem using an effective error removal technique. Experimental results show that the improved algorithm outperforms original Meta-IDBA assembler on simulated and real data.","PeriodicalId":280745,"journal":{"name":"2023 International Conference on System Science and Engineering (ICSSE)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on System Science and Engineering (ICSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSSE58758.2023.10227158","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Metagenomic assembly is a crucial step in analyzing pipelines of microbial communities. It allows to reconstruct contiguous consensus sequences (contigs) from mixed reads of multiple species. Meta-IDBA is one of the de novo assemblers which applies the de Bruijn graph algorithm to assemble metagenomic data without reference genomes. However, errors occurring after the graph creation are not well corrected, and thus result to low accuracy of the approach. This study proposes an improved method that tackles the problem using an effective error removal technique. Experimental results show that the improved algorithm outperforms original Meta-IDBA assembler on simulated and real data.