{"title":"Predicting ligand binding residues using multi-positional correlations and kernel canonical correlation analysis","authors":"Alvaro J. González, Li Liao, Cathy H. Wu","doi":"10.1109/BIBM.2010.5706556","DOIUrl":null,"url":null,"abstract":"We present a new computational method for predicting ligand binding sites in protein sequences. The method uses kernelbased canonical correlation analysis and linear regression to identify binding sites in protein sequences as the residues that exhibit strong correlation between the residues' evolutionary characterization at the sites and the structure based functional classification of the proteins in the context of a functional family. We explore the effect of correlations among multiple positions in the sequences and show that their inclusion enhances the prediction accuracy significantly.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2010.5706556","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
We present a new computational method for predicting ligand binding sites in protein sequences. The method uses kernelbased canonical correlation analysis and linear regression to identify binding sites in protein sequences as the residues that exhibit strong correlation between the residues' evolutionary characterization at the sites and the structure based functional classification of the proteins in the context of a functional family. We explore the effect of correlations among multiple positions in the sequences and show that their inclusion enhances the prediction accuracy significantly.