一种蛋白质亚结构匹配的新方法

2011 World Congress on Information and Communication Technologies Pub Date : 2011-12-01 DOI:10.1109/WICT.2011.6141303

L. Vardharaj, Sumit Ranjan, Parampreet Singh, Narendran Rajagopalan, C. Mala

{"title":"一种蛋白质亚结构匹配的新方法","authors":"L. Vardharaj, Sumit Ranjan, Parampreet Singh, Narendran Rajagopalan, C. Mala","doi":"10.1109/WICT.2011.6141303","DOIUrl":null,"url":null,"abstract":"The rapidly increasing volumes of structural data of proteins has led to need of algorithms which can rapidly predict functions for proteins based on structure. Similarity between protein structures can provide evidence of possible functional similarity. In this paper, an attempt is made to efficiently recognize similar protein structures in the protein database contain thousands of proteins. This paper gives an efficient heuristic algorithm for finding protein 3D substructures in a 3D protein structure that are similar to a given query 3D protein substructure. This algorithm can be used for searching a database of protein 3D structures. Our approach is to divide the protein structure into sub-structures of size of query structure and compare each sub structure with the query protein using Procrustes algorithm which is based on the root mean square distance between the structures. The division involves constructing a bounding box over both the query and protein structure and dividing the bigger box into sizes of the smaller box. The above algorithm is implemented in parallel using message passing interface. Experiments show that our algorithm can find similar 3D substructures in reasonable time. This paper also presents various statistics as how our algorithm performs against a sequential algorithm and how the algorithm performs with varying sizes of the query structure.","PeriodicalId":178645,"journal":{"name":"2011 World Congress on Information and Communication Technologies","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel approach to protein substructure matching\",\"authors\":\"L. Vardharaj, Sumit Ranjan, Parampreet Singh, Narendran Rajagopalan, C. Mala\",\"doi\":\"10.1109/WICT.2011.6141303\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The rapidly increasing volumes of structural data of proteins has led to need of algorithms which can rapidly predict functions for proteins based on structure. Similarity between protein structures can provide evidence of possible functional similarity. In this paper, an attempt is made to efficiently recognize similar protein structures in the protein database contain thousands of proteins. This paper gives an efficient heuristic algorithm for finding protein 3D substructures in a 3D protein structure that are similar to a given query 3D protein substructure. This algorithm can be used for searching a database of protein 3D structures. Our approach is to divide the protein structure into sub-structures of size of query structure and compare each sub structure with the query protein using Procrustes algorithm which is based on the root mean square distance between the structures. The division involves constructing a bounding box over both the query and protein structure and dividing the bigger box into sizes of the smaller box. The above algorithm is implemented in parallel using message passing interface. Experiments show that our algorithm can find similar 3D substructures in reasonable time. This paper also presents various statistics as how our algorithm performs against a sequential algorithm and how the algorithm performs with varying sizes of the query structure.\",\"PeriodicalId\":178645,\"journal\":{\"name\":\"2011 World Congress on Information and Communication Technologies\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 World Congress on Information and Communication Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WICT.2011.6141303\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 World Congress on Information and Communication Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WICT.2011.6141303","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

随着蛋白质结构数据量的迅速增加，需要基于结构快速预测蛋白质功能的算法。蛋白质结构之间的相似性可以提供可能的功能相似性的证据。本文试图在包含数千种蛋白质的蛋白质数据库中有效地识别相似的蛋白质结构。本文给出了一种有效的启发式算法，用于在三维蛋白质结构中寻找与给定查询三维蛋白质子结构相似的蛋白质三维子结构。该算法可用于蛋白质三维结构数据库的搜索。我们的方法是将蛋白质结构划分为查询结构大小的子结构，并使用基于结构间均方根距离的Procrustes算法将每个子结构与查询蛋白质进行比较。划分包括在查询和蛋白质结构上构建一个边界框，并将较大的框划分为较小的框的大小。上述算法采用消息传递接口并行实现。实验表明，该算法可以在合理的时间内找到相似的三维子结构。本文还介绍了各种统计数据，例如我们的算法对顺序算法的执行情况，以及算法对不同大小的查询结构的执行情况。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A novel approach to protein substructure matching

The rapidly increasing volumes of structural data of proteins has led to need of algorithms which can rapidly predict functions for proteins based on structure. Similarity between protein structures can provide evidence of possible functional similarity. In this paper, an attempt is made to efficiently recognize similar protein structures in the protein database contain thousands of proteins. This paper gives an efficient heuristic algorithm for finding protein 3D substructures in a 3D protein structure that are similar to a given query 3D protein substructure. This algorithm can be used for searching a database of protein 3D structures. Our approach is to divide the protein structure into sub-structures of size of query structure and compare each sub structure with the query protein using Procrustes algorithm which is based on the root mean square distance between the structures. The division involves constructing a bounding box over both the query and protein structure and dividing the bigger box into sizes of the smaller box. The above algorithm is implemented in parallel using message passing interface. Experiments show that our algorithm can find similar 3D substructures in reasonable time. This paper also presents various statistics as how our algorithm performs against a sequential algorithm and how the algorithm performs with varying sizes of the query structure.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2011 World Congress on Information and Communication Technologies

自引率

0.00%

发文量

期刊最新文献

Cloud based model for senior citizens wellness management Application of genetic algorithm on quality graded networks for intelligent routing Role of ICT in the educational upliftment of women - Indian scenario Code clones in program test sequence identification An impact of ridgelet transform in handwritten recognition: A study on very large dataset of Kannada script