基于贝叶斯学习方法的蛋白质二级结构预测

14th IEEE International Conference on Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings. Pub Date : 2002-11-04 DOI:10.1109/TAI.2002.1180812

Peng Wang, Du Zhang

{"title":"基于贝叶斯学习方法的蛋白质二级结构预测","authors":"Peng Wang, Du Zhang","doi":"10.1109/TAI.2002.1180812","DOIUrl":null,"url":null,"abstract":"This paper describes a Bayesian learning based approach to protein secondary structure prediction. Four secondary structure types are considered, including /spl alpha/-helix, /spl beta/-strand, /spl beta/-turn and coil. A six-letter exchange group is utilized to represent a protein sequence. Training cases are expressed as sequence quaternion. A tool called Predictor is developed in Java that implements the proposed approach. To evaluate the tool, we select, from the protein data bank and based on the principle of one-protein-per-family according to the structure family of SCOP, six hundred and twenty-three known proteins without pair wise sequence homology. Several training/test data splits have been tried. The results show that our proposed approach can produce prediction accuracy comparable to those of the traditional prediction methods. Predictor has user-friendly and easy-to-use GUIs, and is of practical value to the molecular biology researchers.","PeriodicalId":197064,"journal":{"name":"14th IEEE International Conference on Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings.","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Protein secondary structure prediction with Bayesian learning method\",\"authors\":\"Peng Wang, Du Zhang\",\"doi\":\"10.1109/TAI.2002.1180812\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes a Bayesian learning based approach to protein secondary structure prediction. Four secondary structure types are considered, including /spl alpha/-helix, /spl beta/-strand, /spl beta/-turn and coil. A six-letter exchange group is utilized to represent a protein sequence. Training cases are expressed as sequence quaternion. A tool called Predictor is developed in Java that implements the proposed approach. To evaluate the tool, we select, from the protein data bank and based on the principle of one-protein-per-family according to the structure family of SCOP, six hundred and twenty-three known proteins without pair wise sequence homology. Several training/test data splits have been tried. The results show that our proposed approach can produce prediction accuracy comparable to those of the traditional prediction methods. Predictor has user-friendly and easy-to-use GUIs, and is of practical value to the molecular biology researchers.\",\"PeriodicalId\":197064,\"journal\":{\"name\":\"14th IEEE International Conference on Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings.\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-11-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"14th IEEE International Conference on Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TAI.2002.1180812\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"14th IEEE International Conference on Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TAI.2002.1180812","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

本文介绍了一种基于贝叶斯学习的蛋白质二级结构预测方法。考虑了四种二级结构类型，包括/spl alpha/-helix， /spl beta/-strand， /spl beta/-turn和coil。一个六个字母的交换基团被用来表示一个蛋白质序列。训练用例用序列四元数表示。在Java中开发了一个名为Predictor的工具来实现所提出的方法。为了评估该工具，我们从蛋白质数据库中，根据SCOP的结构家族，根据每个家族一个蛋白质的原则，选择了623个已知的没有对序列同源性的蛋白质。已经尝试了几种训练/测试数据分割。结果表明，该方法的预测精度与传统预测方法相当。预测器具有用户友好和易于使用的图形用户界面，对分子生物学研究人员具有实用价值。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Protein secondary structure prediction with Bayesian learning method

This paper describes a Bayesian learning based approach to protein secondary structure prediction. Four secondary structure types are considered, including /spl alpha/-helix, /spl beta/-strand, /spl beta/-turn and coil. A six-letter exchange group is utilized to represent a protein sequence. Training cases are expressed as sequence quaternion. A tool called Predictor is developed in Java that implements the proposed approach. To evaluate the tool, we select, from the protein data bank and based on the principle of one-protein-per-family according to the structure family of SCOP, six hundred and twenty-three known proteins without pair wise sequence homology. Several training/test data splits have been tried. The results show that our proposed approach can produce prediction accuracy comparable to those of the traditional prediction methods. Predictor has user-friendly and easy-to-use GUIs, and is of practical value to the molecular biology researchers.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

14th IEEE International Conference on Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings.

自引率

0.00%

发文量

期刊最新文献

Machine learning for software engineering: case studies in software reuse Active tracking and cloning of facial expressions using spatio-temporal information Fusing cooperative technical-specification knowledge components Ontology construction for information selection An intelligent brokering system to support multi-agent Web-based 4/sup th/-party logistics