{"title":"用图上的函数编码蛋白质结构","authors":"Promita Bose, Xiaxia Yu, R. Harrison","doi":"10.1109/BIBMW.2011.6112396","DOIUrl":null,"url":null,"abstract":"The application of machine learning and datamining to the analysis and prediction of protein structure is a research area with potentially high impact in both computer science and biology. Proteins structures are inherently complicated objects with a mixture of crisp and fuzzy properties. Therefore developing effective representations for them is a research problem in itself, while quantifying and predicting properties and structure is of immediate importance in structural biology. This paper focuses on developing a compact, effective, efficient and accurate representation of protein structure that is compatible with widely used machine learning tools like the SVM. Graphs based on Delaunay triangulation are used to represent the structure, and then functions are constructed from these graphs to develop constant-size representations of protein structure that are tightly bound to the amino acid sequence. The representations preserve sufficient information to be valuable for model vs. experimental structure classification and regression analysis of model quality.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"1 1","pages":"338-344"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":"{\"title\":\"Encoding protein structure with functions on graphs\",\"authors\":\"Promita Bose, Xiaxia Yu, R. Harrison\",\"doi\":\"10.1109/BIBMW.2011.6112396\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The application of machine learning and datamining to the analysis and prediction of protein structure is a research area with potentially high impact in both computer science and biology. Proteins structures are inherently complicated objects with a mixture of crisp and fuzzy properties. Therefore developing effective representations for them is a research problem in itself, while quantifying and predicting properties and structure is of immediate importance in structural biology. This paper focuses on developing a compact, effective, efficient and accurate representation of protein structure that is compatible with widely used machine learning tools like the SVM. Graphs based on Delaunay triangulation are used to represent the structure, and then functions are constructed from these graphs to develop constant-size representations of protein structure that are tightly bound to the amino acid sequence. The representations preserve sufficient information to be valuable for model vs. experimental structure classification and regression analysis of model quality.\",\"PeriodicalId\":6345,\"journal\":{\"name\":\"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)\",\"volume\":\"1 1\",\"pages\":\"338-344\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-11-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"18\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBMW.2011.6112396\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBMW.2011.6112396","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Encoding protein structure with functions on graphs
The application of machine learning and datamining to the analysis and prediction of protein structure is a research area with potentially high impact in both computer science and biology. Proteins structures are inherently complicated objects with a mixture of crisp and fuzzy properties. Therefore developing effective representations for them is a research problem in itself, while quantifying and predicting properties and structure is of immediate importance in structural biology. This paper focuses on developing a compact, effective, efficient and accurate representation of protein structure that is compatible with widely used machine learning tools like the SVM. Graphs based on Delaunay triangulation are used to represent the structure, and then functions are constructed from these graphs to develop constant-size representations of protein structure that are tightly bound to the amino acid sequence. The representations preserve sufficient information to be valuable for model vs. experimental structure classification and regression analysis of model quality.