{"title":"基于支持向量机和密码子编码方案的蛋白质二级结构预测","authors":"Masood Zamani, S. C. Kremer","doi":"10.1109/BIBMW.2012.6470326","DOIUrl":null,"url":null,"abstract":"In this study, we evaluate the performance of a protein secondary structure prediction model using a new amino acid \"codon\" encoding inspired by genetic codon mappings. The dimensionality of the binary codon encoding is less than that of an orthogonal encoding which requires less computations. Protein secondary structure prediction is an important step for machine learning techniques ultimately applied for protein 3D structure prediction. In the proposed model, one-stage binary support vector machines are employed, and the efficiency of the codon encoding to that of a commonly used orthogonal encoding are compared without incorporating protein evolutionary and structural information for an unbiased comparison. The performance of the classification model is measured according to Q3 and segment overlap (SOV) scores. The scores are compared with those of the prediction methods using an orthogonal encoding and protein sequence profiles. The experimental results indicate higher prediction accuracy based on Q3 SOV scores when sequence profiles are not used. Also, the relative classification scores of the proposed method are comparable with the methods incorporating protein global and evolutionary information. The experimental result implies the encoding scheme is able to integrate the evolutionary information into the prediction model since the encoding is based on genetic codon mappings which are the building blocks of amino acid formations at the primary level of biological processes. The codon encoding is worthwhile to be investigated using more complex learning architectures with the profiles and structural properties of proteins.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Protein secondary structure prediction using support vector machines and a codon encoding scheme\",\"authors\":\"Masood Zamani, S. C. Kremer\",\"doi\":\"10.1109/BIBMW.2012.6470326\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this study, we evaluate the performance of a protein secondary structure prediction model using a new amino acid \\\"codon\\\" encoding inspired by genetic codon mappings. The dimensionality of the binary codon encoding is less than that of an orthogonal encoding which requires less computations. Protein secondary structure prediction is an important step for machine learning techniques ultimately applied for protein 3D structure prediction. In the proposed model, one-stage binary support vector machines are employed, and the efficiency of the codon encoding to that of a commonly used orthogonal encoding are compared without incorporating protein evolutionary and structural information for an unbiased comparison. The performance of the classification model is measured according to Q3 and segment overlap (SOV) scores. The scores are compared with those of the prediction methods using an orthogonal encoding and protein sequence profiles. The experimental results indicate higher prediction accuracy based on Q3 SOV scores when sequence profiles are not used. Also, the relative classification scores of the proposed method are comparable with the methods incorporating protein global and evolutionary information. The experimental result implies the encoding scheme is able to integrate the evolutionary information into the prediction model since the encoding is based on genetic codon mappings which are the building blocks of amino acid formations at the primary level of biological processes. The codon encoding is worthwhile to be investigated using more complex learning architectures with the profiles and structural properties of proteins.\",\"PeriodicalId\":6392,\"journal\":{\"name\":\"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-10-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBMW.2012.6470326\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBMW.2012.6470326","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Protein secondary structure prediction using support vector machines and a codon encoding scheme
In this study, we evaluate the performance of a protein secondary structure prediction model using a new amino acid "codon" encoding inspired by genetic codon mappings. The dimensionality of the binary codon encoding is less than that of an orthogonal encoding which requires less computations. Protein secondary structure prediction is an important step for machine learning techniques ultimately applied for protein 3D structure prediction. In the proposed model, one-stage binary support vector machines are employed, and the efficiency of the codon encoding to that of a commonly used orthogonal encoding are compared without incorporating protein evolutionary and structural information for an unbiased comparison. The performance of the classification model is measured according to Q3 and segment overlap (SOV) scores. The scores are compared with those of the prediction methods using an orthogonal encoding and protein sequence profiles. The experimental results indicate higher prediction accuracy based on Q3 SOV scores when sequence profiles are not used. Also, the relative classification scores of the proposed method are comparable with the methods incorporating protein global and evolutionary information. The experimental result implies the encoding scheme is able to integrate the evolutionary information into the prediction model since the encoding is based on genetic codon mappings which are the building blocks of amino acid formations at the primary level of biological processes. The codon encoding is worthwhile to be investigated using more complex learning architectures with the profiles and structural properties of proteins.