{"title":"聚类破碎蛋白键角分析","authors":"Justin S. Diamond","doi":"10.17706/ijbbb.2020.10.2.74-83","DOIUrl":null,"url":null,"abstract":"The desire for accurate protein prediction algorithms has been a hallmark of computational biology achievements. Still, better algorithms and methodologies can achieve even greater success with implication across a diverse range of biological and medicinal fields such as protein function inference. Accurate prediction methods rely heavily on sequence similarity, however structure is more evolutionary conserved, i.e. structure is an alternate characteristic for ancestral relationships between proteins. The premise of this work is that similar structural features will be clustered together, which may show a unique amino acid and secondary structure (SS) distribution, which can be, incorporated into HMMs for SS prediction and protein function inference algorithms. With structural-evolutionary relationship in mind, I propose a methodology for ‘structure’ based SS prediction methods using HMM and k-mean and fuzzy k -means fragmented protein clusters. When fragment distributions were incorporated into HMMs, the average accuracy increased by 1 percent but showed an increase in accuracy of up to 13 percent for particular sequences. The HMM results were not so promising, however the clustering of protein structure fragments by C-alphas bond angles shows to be a useful length-independent metric for inferring functional relationships between proteins.","PeriodicalId":13816,"journal":{"name":"International Journal of Bioscience, Biochemistry and Bioinformatics","volume":"10 1","pages":"74-83"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis of Clustering Fragmented Protein Bond Angles\",\"authors\":\"Justin S. Diamond\",\"doi\":\"10.17706/ijbbb.2020.10.2.74-83\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The desire for accurate protein prediction algorithms has been a hallmark of computational biology achievements. Still, better algorithms and methodologies can achieve even greater success with implication across a diverse range of biological and medicinal fields such as protein function inference. Accurate prediction methods rely heavily on sequence similarity, however structure is more evolutionary conserved, i.e. structure is an alternate characteristic for ancestral relationships between proteins. The premise of this work is that similar structural features will be clustered together, which may show a unique amino acid and secondary structure (SS) distribution, which can be, incorporated into HMMs for SS prediction and protein function inference algorithms. With structural-evolutionary relationship in mind, I propose a methodology for ‘structure’ based SS prediction methods using HMM and k-mean and fuzzy k -means fragmented protein clusters. When fragment distributions were incorporated into HMMs, the average accuracy increased by 1 percent but showed an increase in accuracy of up to 13 percent for particular sequences. The HMM results were not so promising, however the clustering of protein structure fragments by C-alphas bond angles shows to be a useful length-independent metric for inferring functional relationships between proteins.\",\"PeriodicalId\":13816,\"journal\":{\"name\":\"International Journal of Bioscience, Biochemistry and Bioinformatics\",\"volume\":\"10 1\",\"pages\":\"74-83\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Bioscience, Biochemistry and Bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.17706/ijbbb.2020.10.2.74-83\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Bioscience, Biochemistry and Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17706/ijbbb.2020.10.2.74-83","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis of Clustering Fragmented Protein Bond Angles
The desire for accurate protein prediction algorithms has been a hallmark of computational biology achievements. Still, better algorithms and methodologies can achieve even greater success with implication across a diverse range of biological and medicinal fields such as protein function inference. Accurate prediction methods rely heavily on sequence similarity, however structure is more evolutionary conserved, i.e. structure is an alternate characteristic for ancestral relationships between proteins. The premise of this work is that similar structural features will be clustered together, which may show a unique amino acid and secondary structure (SS) distribution, which can be, incorporated into HMMs for SS prediction and protein function inference algorithms. With structural-evolutionary relationship in mind, I propose a methodology for ‘structure’ based SS prediction methods using HMM and k-mean and fuzzy k -means fragmented protein clusters. When fragment distributions were incorporated into HMMs, the average accuracy increased by 1 percent but showed an increase in accuracy of up to 13 percent for particular sequences. The HMM results were not so promising, however the clustering of protein structure fragments by C-alphas bond angles shows to be a useful length-independent metric for inferring functional relationships between proteins.