L. Vardharaj, Sumit Ranjan, Parampreet Singh, Narendran Rajagopalan, C. Mala
A novel approach to protein substructure matching
2011 World Congress on Information and Communication Technologies, 2011-12-01
DOI: 10.1109/WICT.2011.6141303 (https://doi.org/10.1109/WICT.2011.6141303)
The rapidly increasing volume of protein structural data has created a need for algorithms that can quickly predict protein function from structure. Similarity between protein structures can provide evidence of possible functional similarity. In this paper, we attempt to efficiently recognize similar protein structures in a protein database containing thousands of proteins. We give an efficient heuristic algorithm for finding 3D substructures within a 3D protein structure that are similar to a given 3D query substructure; the algorithm can be used to search a database of protein 3D structures. Our approach divides the protein structure into substructures the size of the query structure and compares each substructure with the query using the Procrustes algorithm, which is based on the root mean square distance between the structures. The division involves constructing a bounding box around both the query and the protein structure and dividing the larger box into boxes the size of the smaller one. The algorithm is implemented in parallel using the Message Passing Interface (MPI). Experiments show that our algorithm can find similar 3D substructures in reasonable time. We also present statistics on how the algorithm performs against a sequential algorithm and how its performance varies with the size of the query structure.
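
The bounding-box division described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`bounding_box`, `split_into_query_sized_boxes`, `centered_rmsd`), the representation of atoms as `(x, y, z)` tuples, and the fixed grid anchored at the protein's minimum corner are all assumptions made for illustration. The comparison step here only removes translation by centering both point sets; a full Procrustes comparison would also solve for the optimal rotation, which is omitted.

```python
from math import ceil, sqrt


def bounding_box(points):
    """Return (min_corner, max_corner) of the axis-aligned bounding box."""
    mins = tuple(min(p[i] for p in points) for i in range(3))
    maxs = tuple(max(p[i] for p in points) for i in range(3))
    return mins, maxs


def split_into_query_sized_boxes(protein, query):
    """Group protein points into grid cells with the query box's edge lengths.

    Assumption: the grid is anchored at the protein's minimum corner.
    """
    qmin, qmax = bounding_box(query)
    pmin, pmax = bounding_box(protein)
    # Edge lengths of the query box; guard against a zero-thickness box.
    size = tuple(max(qmax[i] - qmin[i], 1e-9) for i in range(3))
    # Number of cells along each axis needed to cover the protein box.
    counts = tuple(max(1, ceil((pmax[i] - pmin[i]) / size[i])) for i in range(3))
    cells = {}
    for p in protein:
        # Clamp so points on the far face land in the last cell.
        key = tuple(
            min(int((p[i] - pmin[i]) // size[i]), counts[i] - 1) for i in range(3)
        )
        cells.setdefault(key, []).append(p)
    return cells


def centered_rmsd(a, b):
    """RMSD between two equally sized, index-matched point lists after
    moving both centroids to the origin (translation only, no rotation)."""
    if not a or len(a) != len(b):
        raise ValueError("point lists must be non-empty and of equal length")
    n = len(a)
    ca = [sum(p[i] for p in a) / n for i in range(3)]
    cb = [sum(q[i] for q in b) / n for i in range(3)]
    total = sum(
        ((p[i] - ca[i]) - (q[i] - cb[i])) ** 2
        for p, q in zip(a, b)
        for i in range(3)
    )
    return sqrt(total / n)
```

Each cell returned by `split_into_query_sized_boxes` is a candidate substructure that could be scored against the query independently, which is the kind of per-cell work that could be distributed across processes. With a fixed grid like this one, a match that straddles a cell boundary would be split between cells; how the original algorithm handles that case is not stated in the abstract.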