{"title":"Prediction method for Intronic alternative polyadenylation sites.","authors":"Shanxin Zhang","doi":"10.1109/EMBC.2016.7591464","DOIUrl":null,"url":null,"abstract":"Alternative Polyadenylation (APA) of mRNAs has been proven as a considerable mechanism for post-transcriptional gene regulation. The interplay between Intronic APA and splicing may affect the isoforms of mRNAs. In this paper, we have found four prevalent motifs, i.e. AATAAA, TTTTTTTT, CCAGSCTGG and RGYRYRGTGG surrounding the polyadenylation sites; then we proposed a new computational method to identify the Intronic APA sites in the human genome, which is based on a Support Vector Machine (SVM) with weighted degree string kernel. Compared with other APA events, Intronic APA sites are likely to be with TTTTTTTT pattern. The proposed algorithm can correctly classify 89% of Intronic APA sites from the constitutive polyadenylation sites. In addition, the prediction accuracy of separating the Intronic APA from other types of APA could be up to 88.3%. The prediction results indicate that our computational method is promising for the identification of Intronic APA sites.","PeriodicalId":72689,"journal":{"name":"Conference proceedings : ... Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual Conference","volume":"52 1","pages":"3426-3428"},"PeriodicalIF":0.0000,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference proceedings : ... Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EMBC.2016.7591464","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Alternative Polyadenylation (APA) of mRNAs has been proven as a considerable mechanism for post-transcriptional gene regulation. The interplay between Intronic APA and splicing may affect the isoforms of mRNAs. In this paper, we have found four prevalent motifs, i.e. AATAAA, TTTTTTTT, CCAGSCTGG and RGYRYRGTGG surrounding the polyadenylation sites; then we proposed a new computational method to identify the Intronic APA sites in the human genome, which is based on a Support Vector Machine (SVM) with weighted degree string kernel. Compared with other APA events, Intronic APA sites are likely to be with TTTTTTTT pattern. The proposed algorithm can correctly classify 89% of Intronic APA sites from the constitutive polyadenylation sites. In addition, the prediction accuracy of separating the Intronic APA from other types of APA could be up to 88.3%. The prediction results indicate that our computational method is promising for the identification of Intronic APA sites.