New encoding schemes for prediction of protein phosphorylation sites
Zimo Yin, Junyan Tan
2012 IEEE 6th International Conference on Systems Biology (ISB), published 2012-09-27
DOI: 10.1109/ISB.2012.6314113 (https://doi.org/10.1109/ISB.2012.6314113)
Protein phosphorylation is involved in most cellular functions. Because of its importance, many methods have been developed to identify phosphorylation sites. Experimental methods for identifying these sites are both costly and time-consuming, so computational methods are highly desirable. In this paper, three new encoding methods are developed as modifications of the Binary and CTF encodings: BinCTF (Binary-conjoint triad feature), CTF2 (new conjoint triad feature), and BinCTF2 (Binary-new conjoint triad feature). An ensemble support vector machine is then applied to predict phosphorylation sites at serine (S), threonine (T), and tyrosine (Y) residues. The numerical results indicate that, on some performance measures, the new methods outperform previous methods.
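
For orientation, the two baseline encodings that the abstract names (Binary and CTF) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not define BinCTF, CTF2, or BinCTF2, so they are not reproduced here. The seven-class residue grouping is the one commonly used for the conjoint triad feature and is assumed rather than taken from the paper; the function names and the example window are hypothetical.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Assumption: the seven residue classes commonly used for the conjoint
# triad feature. The paper's own grouping is not given in the abstract.
CTF_CLASSES = ["AGV", "ILFP", "YMTS", "HNQW", "RK", "DE", "C"]
CLASS_OF = {aa: idx for idx, group in enumerate(CTF_CLASSES) for aa in group}

# All 7 * 7 * 7 = 343 ordered class triads, each mapped to a vector position.
TRIAD_INDEX = {t: i for i, t in enumerate(product(range(7), repeat=3))}


def binary_encode(window):
    """One-hot encode each residue: 20 values per position."""
    vec = []
    for aa in window:
        vec.extend(1 if aa == ref else 0 for ref in AMINO_ACIDS)
    return vec


def ctf_encode(window):
    """Count each class triad over consecutive residue triples (raw counts)."""
    counts = [0] * len(TRIAD_INDEX)
    classes = [CLASS_OF.get(aa) for aa in window]
    for triad in zip(classes, classes[1:], classes[2:]):
        if None not in triad:  # skip triples containing a non-standard residue
            counts[TRIAD_INDEX[triad]] += 1
    return counts


# Hypothetical 9-residue window centred on a candidate serine.
window = "RKRSSPEPL"
print(len(binary_encode(window)))  # 9 positions * 20 residues
print(len(ctf_encode(window)))     # 343 class triads
```

The binary encoding keeps positional information but grows with the window length, whereas the triad counts have a fixed length of 343 regardless of window size. Either vector could be passed to a support vector machine; how the paper combines them is not specified in the abstract.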