El-Sayed Orabi, M. Assal, Mustafa Abdel Azim, Yasser Kamal
{"title":"DNA fingerprint using smith waterman algorithm by grid computing","authors":"El-Sayed Orabi, M. Assal, Mustafa Abdel Azim, Yasser Kamal","doi":"10.1109/INFOS.2014.7036681","DOIUrl":null,"url":null,"abstract":"Deoxyribonucleic acid (DNA) is a molecule that encodes unique genetic instructions used in the development and functioning of all known living organisms and many viruses. This Genetic information is encoded as a sequence of nucleotides (adenine, cytosine, guanine, and thymine) recorded using the letters A, C, G, and T. DNA querying or alignment of these sequences required dynamic programming tools and very complex matrices and some heuristic methods like fast-all protein or nucleotide (FASTA) and Basic Local Alignment Search Tool (BLAST) that use massive force of processing and highly time consuming. We present a parallel solution to reduce the processing time. Smith waterman algorithm, some weighting matrices and a grid of computers are used to And field of similarity between these sequences in large DNA datasets. This grid consists of master computer and unlimited number of agents. The master computer is the user interface for insert the queried sequence and coordinates the processing between the grid agents.","PeriodicalId":394058,"journal":{"name":"2014 9th International Conference on Informatics and Systems","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 9th International Conference on Informatics and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOS.2014.7036681","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Deoxyribonucleic acid (DNA) is a molecule that encodes unique genetic instructions used in the development and functioning of all known living organisms and many viruses. This Genetic information is encoded as a sequence of nucleotides (adenine, cytosine, guanine, and thymine) recorded using the letters A, C, G, and T. DNA querying or alignment of these sequences required dynamic programming tools and very complex matrices and some heuristic methods like fast-all protein or nucleotide (FASTA) and Basic Local Alignment Search Tool (BLAST) that use massive force of processing and highly time consuming. We present a parallel solution to reduce the processing time. Smith waterman algorithm, some weighting matrices and a grid of computers are used to And field of similarity between these sequences in large DNA datasets. This grid consists of master computer and unlimited number of agents. The master computer is the user interface for insert the queried sequence and coordinates the processing between the grid agents.