A novel approach to protein substructure matching

L. Vardharaj, Sumit Ranjan, Parampreet Singh, Narendran Rajagopalan, C. Mala
{"title":"A novel approach to protein substructure matching","authors":"L. Vardharaj, Sumit Ranjan, Parampreet Singh, Narendran Rajagopalan, C. Mala","doi":"10.1109/WICT.2011.6141303","DOIUrl":null,"url":null,"abstract":"The rapidly increasing volumes of structural data of proteins has led to need of algorithms which can rapidly predict functions for proteins based on structure. Similarity between protein structures can provide evidence of possible functional similarity. In this paper, an attempt is made to efficiently recognize similar protein structures in the protein database contain thousands of proteins. This paper gives an efficient heuristic algorithm for finding protein 3D substructures in a 3D protein structure that are similar to a given query 3D protein substructure. This algorithm can be used for searching a database of protein 3D structures. Our approach is to divide the protein structure into sub-structures of size of query structure and compare each sub structure with the query protein using Procrustes algorithm which is based on the root mean square distance between the structures. The division involves constructing a bounding box over both the query and protein structure and dividing the bigger box into sizes of the smaller box. The above algorithm is implemented in parallel using message passing interface. Experiments show that our algorithm can find similar 3D substructures in reasonable time. This paper also presents various statistics as how our algorithm performs against a sequential algorithm and how the algorithm performs with varying sizes of the query structure.","PeriodicalId":178645,"journal":{"name":"2011 World Congress on Information and Communication Technologies","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 World Congress on Information and Communication Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WICT.2011.6141303","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The rapidly increasing volumes of structural data of proteins has led to need of algorithms which can rapidly predict functions for proteins based on structure. Similarity between protein structures can provide evidence of possible functional similarity. In this paper, an attempt is made to efficiently recognize similar protein structures in the protein database contain thousands of proteins. This paper gives an efficient heuristic algorithm for finding protein 3D substructures in a 3D protein structure that are similar to a given query 3D protein substructure. This algorithm can be used for searching a database of protein 3D structures. Our approach is to divide the protein structure into sub-structures of size of query structure and compare each sub structure with the query protein using Procrustes algorithm which is based on the root mean square distance between the structures. The division involves constructing a bounding box over both the query and protein structure and dividing the bigger box into sizes of the smaller box. The above algorithm is implemented in parallel using message passing interface. Experiments show that our algorithm can find similar 3D substructures in reasonable time. This paper also presents various statistics as how our algorithm performs against a sequential algorithm and how the algorithm performs with varying sizes of the query structure.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种蛋白质亚结构匹配的新方法
随着蛋白质结构数据量的迅速增加,需要基于结构快速预测蛋白质功能的算法。蛋白质结构之间的相似性可以提供可能的功能相似性的证据。本文试图在包含数千种蛋白质的蛋白质数据库中有效地识别相似的蛋白质结构。本文给出了一种有效的启发式算法,用于在三维蛋白质结构中寻找与给定查询三维蛋白质子结构相似的蛋白质三维子结构。该算法可用于蛋白质三维结构数据库的搜索。我们的方法是将蛋白质结构划分为查询结构大小的子结构,并使用基于结构间均方根距离的Procrustes算法将每个子结构与查询蛋白质进行比较。划分包括在查询和蛋白质结构上构建一个边界框,并将较大的框划分为较小的框的大小。上述算法采用消息传递接口并行实现。实验表明,该算法可以在合理的时间内找到相似的三维子结构。本文还介绍了各种统计数据,例如我们的算法对顺序算法的执行情况,以及算法对不同大小的查询结构的执行情况。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Cloud based model for senior citizens wellness management Application of genetic algorithm on quality graded networks for intelligent routing Role of ICT in the educational upliftment of women - Indian scenario Code clones in program test sequence identification An impact of ridgelet transform in handwritten recognition: A study on very large dataset of Kannada script
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1