{"title":"CNN for human exons and introns classification","authors":"FE Nasr, A. Oueslati","doi":"10.1109/SSD52085.2021.9429303","DOIUrl":null,"url":null,"abstract":"Modeling properties and functions associated with DNA sequences is a very complex task in genomics. This task is more and more difficult when it concerns the human genome. This genome is composed with coding and non-coding regions which are not yet fully identified. Difficulties seen in this context are particularly related to the fact that 98% of the human genome is made up of non-coding zones. Therefore it seems evident that a powerful predictive model can have a huge advantage in advancing the exploration of the human genome. In this paper, we started from text representation of the DNA sequence to the representation of exons and introns by images to classify them. Here we are introducing a convolutional neural network model to classify human exons and introns. Our model has shown very good results concerning classification learning and testing rate which exceed 90%.","PeriodicalId":6799,"journal":{"name":"2021 18th International Multi-Conference on Systems, Signals & Devices (SSD)","volume":"3527 1","pages":"249-254"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 18th International Multi-Conference on Systems, Signals & Devices (SSD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSD52085.2021.9429303","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Modeling properties and functions associated with DNA sequences is a very complex task in genomics. This task is more and more difficult when it concerns the human genome. This genome is composed with coding and non-coding regions which are not yet fully identified. Difficulties seen in this context are particularly related to the fact that 98% of the human genome is made up of non-coding zones. Therefore it seems evident that a powerful predictive model can have a huge advantage in advancing the exploration of the human genome. In this paper, we started from text representation of the DNA sequence to the representation of exons and introns by images to classify them. Here we are introducing a convolutional neural network model to classify human exons and introns. Our model has shown very good results concerning classification learning and testing rate which exceed 90%.