ANN Based Classification of Unknown Genome Fragments Using Chaos Game Representation
Authors: Vrinda V. Nair, K. Vijayan, D. Gopinath, A. Nair
Published in: 2010 Second International Conference on Machine Learning and Computing
Publication date: 2010-02-09
DOI: 10.1109/ICMLC.2010.56 (https://doi.org/10.1109/ICMLC.2010.56)
Citations: 6
Abstract
Classifying organisms by their genomic sequences is important for studying evolutionary characteristics, identifying previously unknown organisms, examining relationships between organisms, and many other aspects of the study of living things. Chaos game representation (CGR) uniquely represents a DNA sequence and reveals hidden patterns in it. Frequency-CGR (FCGR), derived from CGR, shows the frequency of the sub-sequences present in the DNA sequence. This paper proposes a novel method for classifying organisms that combines FCGR with an artificial neural network (ANN). Eight categories from the taxonomic distribution of eukaryotic organisms are considered, and an ANN is used for classification. Different ANN configurations are tested and good accuracy is obtained. How the fractal nature of DNA helps in classification is also investigated.
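To make the CGR and FCGR ideas in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes the commonly used corner assignment A=(0,0), C=(0,1), G=(1,1), T=(1,0), which the abstract does not specify; the function names `cgr_points` and `fcgr` and the parameter `k` (sub-sequence length) are illustrative. Each CGR point lies halfway between the previous point and the corner of the current base, and the FCGR counts how many points fall in each cell of a 2^k by 2^k grid, which corresponds to counting sub-sequences of length k.

```python
import numpy as np

# Assumed (x, y) corner for each base; a common convention, not taken from the paper.
CORNERS = {"A": (0, 0), "C": (0, 1), "G": (1, 1), "T": (1, 0)}


def cgr_points(seq):
    """Return the CGR trajectory of seq as a list of (x, y) points in the unit square."""
    x, y = 0.5, 0.5  # start at the centre of the square
    points = []
    for base in seq.upper():
        if base in CORNERS:  # symbols other than A, C, G, T are ignored
            cx, cy = CORNERS[base]
            # Move halfway from the current point towards the base's corner.
            x, y = (x + cx) / 2, (y + cy) / 2
            points.append((x, y))
    return points


def fcgr(seq, k):
    """Count CGR points per cell of a 2^k x 2^k grid (rows index y, columns index x)."""
    size = 2 ** k
    grid = np.zeros((size, size), dtype=int)
    col = row = 0
    seen = 0
    for base in seq.upper():
        if base not in CORNERS:
            continue  # ignored symbols are dropped, so their neighbours become adjacent
        cx, cy = CORNERS[base]
        # Integer form of the halving step: equals floor(coordinate * 2^k)
        # once at least k bases have been read, without floating-point rounding.
        col = (col >> 1) | (cx << (k - 1))
        row = (row >> 1) | (cy << (k - 1))
        seen += 1
        if seen >= k:  # the cell depends only on the last k bases
            grid[row, col] += 1
    return grid
```

A flattened grid such as `fcgr(seq, 3).ravel()` gives a fixed-length vector of 64 counts that could serve as input to a classifier; the feature construction and ANN configurations actually used in the paper are not described in the abstract.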