A new protein folding algorithm based on hydrophobic compactness: Rigid Unconnected Secondary Structure Iterative Assembly (RUSSIA). I: Methodology.

Denis Znamenskiy, Jacques Chomilier, Khan Le Tuan, Jean-Paul Mornon
{"title":"A new protein folding algorithm based on hydrophobic compactness: Rigid Unconnected Secondary Structure Iterative Assembly (RUSSIA). I: Methodology.","authors":"Denis Znamenskiy,&nbsp;Jacques Chomilier,&nbsp;Khan Le Tuan,&nbsp;Jean-Paul Mornon","doi":"10.1093/protein/gzg140","DOIUrl":null,"url":null,"abstract":"<p><p>We present an algorithm that is able to propose compact models of protein 3D structures, only starting from the prediction of the nature and length of regular secondary structures. Helices are modeled by cylinders and sheets by helicoid surfaces, all strands of a sheet being considered as a single block. It means that relative topology of the strands inside one sheet is a prerequisite. Loops are only considered as constraints, given by the maximal distance between their Calpha extremities according to their sequence length. Unconnected regular secondary structures are reduced to a single point, the center of their hydrophobic faces. These centers are then repeatedly moved in order to obtain a compact hydrophobic core. To prevent secondary structures from interpenetrating, a repulsive term is introduced in the function whose minimization leads to the compact structure. This RUSSIA (Rigid Unconnected Secondary Structure Assembly) algorithm has the advantage of relying on a small number of variables and therefore many initial conformations can be tested. Flexibility is produced in the following way: helices or sheets are allowed to rotate around the direction leading to the center of the model; residues in a sheet can slide along the main direction of the strand where they are embedded. RUSSIA is fast and simple and it produces on a test set several neighbor good models with an r.m.s. to the native structures in the range 1.4-3.7 A. These models can be further treated by statistical potentials used in threading approaches in order to detect the best candidate. The limits of the present method are the following: small proteins with few secondary structures are excluded; multi domain proteins must be split into several compact globular domains from their sequences; sheets of more than five strands and completely buried helices are not treated. In this first paper the algorithm is developed and in Part II, which follows, some applications are presented and the program is evaluated.</p>","PeriodicalId":20902,"journal":{"name":"Protein engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2003-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/protein/gzg140","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Protein engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/protein/gzg140","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

We present an algorithm that is able to propose compact models of protein 3D structures, only starting from the prediction of the nature and length of regular secondary structures. Helices are modeled by cylinders and sheets by helicoid surfaces, all strands of a sheet being considered as a single block. It means that relative topology of the strands inside one sheet is a prerequisite. Loops are only considered as constraints, given by the maximal distance between their Calpha extremities according to their sequence length. Unconnected regular secondary structures are reduced to a single point, the center of their hydrophobic faces. These centers are then repeatedly moved in order to obtain a compact hydrophobic core. To prevent secondary structures from interpenetrating, a repulsive term is introduced in the function whose minimization leads to the compact structure. This RUSSIA (Rigid Unconnected Secondary Structure Assembly) algorithm has the advantage of relying on a small number of variables and therefore many initial conformations can be tested. Flexibility is produced in the following way: helices or sheets are allowed to rotate around the direction leading to the center of the model; residues in a sheet can slide along the main direction of the strand where they are embedded. RUSSIA is fast and simple and it produces on a test set several neighbor good models with an r.m.s. to the native structures in the range 1.4-3.7 A. These models can be further treated by statistical potentials used in threading approaches in order to detect the best candidate. The limits of the present method are the following: small proteins with few secondary structures are excluded; multi domain proteins must be split into several compact globular domains from their sequences; sheets of more than five strands and completely buried helices are not treated. In this first paper the algorithm is developed and in Part II, which follows, some applications are presented and the program is evaluated.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种基于疏水紧密性的蛋白质折叠新算法:刚性非连接二级结构迭代组装(俄罗斯)。我:方法。
我们提出了一种算法,能够提出蛋白质三维结构的紧凑模型,仅从预测规则二级结构的性质和长度开始。螺旋由圆柱体模拟,薄片由螺旋面模拟,薄片的所有链都被认为是一个单一的块。这意味着薄片内链的相对拓扑结构是一个先决条件。环路仅被认为是约束,由它们的序列长度与它们的Calpha末端之间的最大距离给出。不连接的规则二级结构被简化为一个点,即它们疏水面的中心。然后反复移动这些中心,以获得致密的疏水核心。为了防止二级结构相互穿透,在函数中引入了一个排斥项,使其最小化导致结构紧凑。该算法具有依赖于少量变量的优点,因此可以测试许多初始构象。柔性是通过以下方式产生的:螺旋或薄片可以绕着通向模型中心的方向旋转;薄片中的残基可以沿着嵌入它们的链的主要方向滑动。俄罗斯是快速和简单的,它在一个测试集上产生了几个相邻的好模型,对本地结构的均方根在1.4-3.7 a范围内。这些模型可以通过线程方法中使用的统计势进一步处理,以检测最佳候选。本方法的局限性是:排除了具有很少二级结构的小蛋白质;多结构域蛋白必须从序列中分裂成几个紧密的球状结构域;超过五股的薄片和完全掩埋的螺旋不处理。本文首先对该算法进行了开发,第二部分给出了一些应用,并对该算法进行了评价。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Development of Novel Cellular Imaging Tools Using Protein Engineering High‐Throughput Mass Spectrometry Complements Protein Engineering Programming Novel Cancer Therapeutics: Design Principles for Chimeric Antigen Receptors Recent Advances in Cell Surface Display Technologies for Directed Protein Evolution Protein Engineering by Efficient Sequence Space Exploration Through Combination of Directed Evolution and Computational Design Methodologies
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1