{"title":"Identification of Splice Junctions Across Species Using BLSTM Model","authors":"Aparajita Dutta, K. Singh, A. Anand","doi":"10.1145/3589437.3589438","DOIUrl":null,"url":null,"abstract":"Deep learning models like convolutional neural networks (CNN) and recurrent neural networks (RNN) have been used to identify splice sites from genome sequences. Most deep learning applications identify splice sites from a single species. Furthermore, the models generally identify and interpret only the canonical splice sites. However, a model capable of identifying both canonical and non-canonical splice sites from multiple species with comparable accuracy is more generalizable and robust. We analyze the performance of a BLSTM model for the first time across various species. We compare this RNN-based model with state-of-the-art splice site prediction models for identifying novel canonical and non-canonical splice sites in homo sapiens, mus musculus, and drosophila melanogaster.","PeriodicalId":119590,"journal":{"name":"Proceedings of the 2022 6th International Conference on Computational Biology and Bioinformatics","volume":"125 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 6th International Conference on Computational Biology and Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3589437.3589438","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Deep learning models like convolutional neural networks (CNN) and recurrent neural networks (RNN) have been used to identify splice sites from genome sequences. Most deep learning applications identify splice sites from a single species. Furthermore, the models generally identify and interpret only the canonical splice sites. However, a model capable of identifying both canonical and non-canonical splice sites from multiple species with comparable accuracy is more generalizable and robust. We analyze the performance of a BLSTM model for the first time across various species. We compare this RNN-based model with state-of-the-art splice site prediction models for identifying novel canonical and non-canonical splice sites in homo sapiens, mus musculus, and drosophila melanogaster.