{"title":"A Novel Method for Reducing Computational Complexity of Whole Genome Sequence Alignment","authors":"Ryuichiro Nakato, O. Gotoh","doi":"10.1142/9781848161092_0013","DOIUrl":null,"url":null,"abstract":"National Institute of Advanced Industrial Science and Technology,Computational Biology Research Center,2-42 Aomi, Koto-ku, Tokyo 135-0064, JapanGenomic sequence alignment is a powerful tool for finding com mon subsequence patterns sharedby the input sequences and identifying evolutionary relationships between the species. However, therunning time and space requirement of genome alignment have often been very extensive. In thisresearch, we propose a novel algorithm called Coarse-Grained AlignmenT (CGAT) algorithm, forreducing computational complexity necessary for cross-species whole genome sequence alignment.The CGAT first divides the input sequences into ”blocks” with a fixed length and aligns these blocksto each other. The generated block-level alignment is then refined at the nucleotide level. This two-stepprocedure can drastically reduce the overall computational time and space necessary for an alignment.In this paper, we show the effectiveness of the proposed algorithm by applying it to whole genomesequences of several bacteria.Keywords: Genome Alignment; Multiple Alignment; Sequence Analysis; Comparative Genomics.","PeriodicalId":74513,"journal":{"name":"Proceedings of the ... Asia-Pacific bioinformatics conference","volume":"26 1","pages":"101-110"},"PeriodicalIF":0.0000,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... Asia-Pacific bioinformatics conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/9781848161092_0013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
National Institute of Advanced Industrial Science and Technology,Computational Biology Research Center,2-42 Aomi, Koto-ku, Tokyo 135-0064, JapanGenomic sequence alignment is a powerful tool for finding com mon subsequence patterns sharedby the input sequences and identifying evolutionary relationships between the species. However, therunning time and space requirement of genome alignment have often been very extensive. In thisresearch, we propose a novel algorithm called Coarse-Grained AlignmenT (CGAT) algorithm, forreducing computational complexity necessary for cross-species whole genome sequence alignment.The CGAT first divides the input sequences into ”blocks” with a fixed length and aligns these blocksto each other. The generated block-level alignment is then refined at the nucleotide level. This two-stepprocedure can drastically reduce the overall computational time and space necessary for an alignment.In this paper, we show the effectiveness of the proposed algorithm by applying it to whole genomesequences of several bacteria.Keywords: Genome Alignment; Multiple Alignment; Sequence Analysis; Comparative Genomics.