{"title":"Using Different Feature Combination Methods to Study Multisite Protein Sub-cellular Localization Prediction","authors":"Qing Zhao, Dong Wang, Yuehui Chen, Xumi Qu","doi":"10.1109/ICICTA.2015.272","DOIUrl":null,"url":null,"abstract":"Multisite protein sub-cellular localization prediction has become the hot topic relating biological information in recent years. Quite a lot of researchers have researched multisite protein sub-cellular localization for a long time. However, the accuracy still needs to be improved. As one of the researchers, I should explore new methods to improve the prediction accuracy. I choose Gpos-mPLOC data set in this paper. In addition, combining the pseudo amino acid composition, position vector and entropy density three effective feature extraction methods arbitrarily to extract protein features. Then, putting these features into multi-label k nearest neighbor classifier to predict protein sub-cellular location. The experiment proves that different feature combination methods can result in different prediction accuracy through the Jack-knife test and I can choose the best feature combination method to predict multisite protein sub-cellular location.","PeriodicalId":231694,"journal":{"name":"2015 8th International Conference on Intelligent Computation Technology and Automation (ICICTA)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2015-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 8th International Conference on Intelligent Computation Technology and Automation (ICICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICTA.2015.272","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Multisite protein sub-cellular localization prediction has become the hot topic relating biological information in recent years. Quite a lot of researchers have researched multisite protein sub-cellular localization for a long time. However, the accuracy still needs to be improved. As one of the researchers, I should explore new methods to improve the prediction accuracy. I choose Gpos-mPLOC data set in this paper. In addition, combining the pseudo amino acid composition, position vector and entropy density three effective feature extraction methods arbitrarily to extract protein features. Then, putting these features into multi-label k nearest neighbor classifier to predict protein sub-cellular location. The experiment proves that different feature combination methods can result in different prediction accuracy through the Jack-knife test and I can choose the best feature combination method to predict multisite protein sub-cellular location.