{"title":"Computational Prediction of RNA-Binding Proteins and Binding Sites","authors":"Jingna Si, Jing Cui, Jin Cheng, R. Wu","doi":"10.3390/ijms161125952","DOIUrl":null,"url":null,"abstract":"Proteins and RNA interaction have vital roles in many cellular processes such as protein synthesis, sequence encoding, RNA transfer, and gene regulation at the transcriptional and post-transcriptional levels. Approximately 6%–8% of all proteins are RNA-binding proteins (RBPs). Distinguishing these RBPs or their binding residues is a major aim of structural biology. Previously, a number of experimental methods were developed for the determination of protein–RNA interactions. However, these experimental methods are expensive, time-consuming, and labor-intensive. Alternatively, researchers have developed many computational approaches to predict RBPs and protein–RNA binding sites, by combining various machine learning methods and abundant sequence and/or structural features. There are three kinds of computational approaches, which are prediction from protein sequence, prediction from protein structure, and protein-RNA docking. In this paper, we review all existing studies of predictions of RNA-binding sites and RBPs and complexes, including data sets used in different approaches, sequence and structural features used in several predictors, prediction method classifications, performance comparisons, evaluation methods, and future directions.","PeriodicalId":14156,"journal":{"name":"International Journal of Molecular Sciences","volume":"16 1","pages":"26303 - 26317"},"PeriodicalIF":4.9000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.3390/ijms161125952","citationCount":"59","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Molecular Sciences","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3390/ijms161125952","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 59
Abstract
Proteins and RNA interaction have vital roles in many cellular processes such as protein synthesis, sequence encoding, RNA transfer, and gene regulation at the transcriptional and post-transcriptional levels. Approximately 6%–8% of all proteins are RNA-binding proteins (RBPs). Distinguishing these RBPs or their binding residues is a major aim of structural biology. Previously, a number of experimental methods were developed for the determination of protein–RNA interactions. However, these experimental methods are expensive, time-consuming, and labor-intensive. Alternatively, researchers have developed many computational approaches to predict RBPs and protein–RNA binding sites, by combining various machine learning methods and abundant sequence and/or structural features. There are three kinds of computational approaches, which are prediction from protein sequence, prediction from protein structure, and protein-RNA docking. In this paper, we review all existing studies of predictions of RNA-binding sites and RBPs and complexes, including data sets used in different approaches, sequence and structural features used in several predictors, prediction method classifications, performance comparisons, evaluation methods, and future directions.
期刊介绍:
The International Journal of Molecular Sciences (ISSN 1422-0067) provides an advanced forum for chemistry, molecular physics (chemical physics and physical chemistry) and molecular biology. It publishes research articles, reviews, communications and short notes. Our aim is to encourage scientists to publish their theoretical and experimental results in as much detail as possible. Therefore, there is no restriction on the length of the papers or the number of electronics supplementary files. For articles with computational results, the full experimental details must be provided so that the results can be reproduced. Electronic files regarding the full details of the calculation and experimental procedure, if unable to be published in a normal way, can be deposited as supplementary material (including animated pictures, videos, interactive Excel sheets, software executables and others).