Statistical measures on residue-level protein structural properties.

Yuanyuan Huang, Stephen Bonett, Andrzej Kloczkowski, Robert Jernigan, Zhijun Wu
{"title":"Statistical measures on residue-level protein structural properties.","authors":"Yuanyuan Huang,&nbsp;Stephen Bonett,&nbsp;Andrzej Kloczkowski,&nbsp;Robert Jernigan,&nbsp;Zhijun Wu","doi":"10.1007/s10969-011-9104-4","DOIUrl":null,"url":null,"abstract":"<p><p>The atomic-level structural properties of proteins, such as bond lengths, bond angles, and torsion angles, have been well studied and understood based on either chemistry knowledge or statistical analysis. Similar properties on the residue-level, such as the distances between two residues and the angles formed by short sequences of residues, can be equally important for structural analysis and modeling, but these have not been examined and documented on a similar scale. While these properties are difficult to measure experimentally, they can be statistically estimated in meaningful ways based on their distributions in known proteins structures. Residue-level structural properties including various types of residue distances and angles are estimated statistically. A software package is built to provide direct access to the statistical data for the properties including some important correlations not previously investigated. The distributions of residue distances and angles may vary with varying sequences, but in most cases, are concentrated in some high probability ranges, corresponding to their frequent occurrences in either α-helices or β-sheets. Strong correlations among neighboring residue angles, similar to those between neighboring torsion angles at the atomic-level, are revealed based on their statistical measures. Residue-level statistical potentials can be defined using the statistical distributions and correlations of the residue distances and angles. Ramachandran-like plots for strongly correlated residue angles are plotted and analyzed. Their applications to structural evaluation and refinement are demonstrated. With the increase in both number and quality of known protein structures, many structural properties can be derived from sets of protein structures by statistical analysis and data mining, and these can even be used as a supplement to the experimental data for structure determinations. Indeed, the statistical measures on various types of residue distances and angles provide more systematic and quantitative assessments on these properties, which can otherwise be estimated only individually and qualitatively. Their distributions and correlations in known protein structures show their importance for providing insights into how proteins may fold naturally to various residue-level structures.</p>","PeriodicalId":73957,"journal":{"name":"Journal of structural and functional genomics","volume":"12 2","pages":"119-36"},"PeriodicalIF":0.0000,"publicationDate":"2011-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s10969-011-9104-4","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of structural and functional genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10969-011-9104-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2011/3/31 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

The atomic-level structural properties of proteins, such as bond lengths, bond angles, and torsion angles, have been well studied and understood based on either chemistry knowledge or statistical analysis. Similar properties on the residue-level, such as the distances between two residues and the angles formed by short sequences of residues, can be equally important for structural analysis and modeling, but these have not been examined and documented on a similar scale. While these properties are difficult to measure experimentally, they can be statistically estimated in meaningful ways based on their distributions in known proteins structures. Residue-level structural properties including various types of residue distances and angles are estimated statistically. A software package is built to provide direct access to the statistical data for the properties including some important correlations not previously investigated. The distributions of residue distances and angles may vary with varying sequences, but in most cases, are concentrated in some high probability ranges, corresponding to their frequent occurrences in either α-helices or β-sheets. Strong correlations among neighboring residue angles, similar to those between neighboring torsion angles at the atomic-level, are revealed based on their statistical measures. Residue-level statistical potentials can be defined using the statistical distributions and correlations of the residue distances and angles. Ramachandran-like plots for strongly correlated residue angles are plotted and analyzed. Their applications to structural evaluation and refinement are demonstrated. With the increase in both number and quality of known protein structures, many structural properties can be derived from sets of protein structures by statistical analysis and data mining, and these can even be used as a supplement to the experimental data for structure determinations. Indeed, the statistical measures on various types of residue distances and angles provide more systematic and quantitative assessments on these properties, which can otherwise be estimated only individually and qualitatively. Their distributions and correlations in known protein structures show their importance for providing insights into how proteins may fold naturally to various residue-level structures.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
残馀水平蛋白质结构特性的统计方法。
基于化学知识或统计分析,蛋白质的原子级结构特性,如键长、键角和扭转角,已经得到了很好的研究和理解。残基水平上的类似性质,如两个残基之间的距离和残基短序列形成的角度,对于结构分析和建模同样重要,但这些还没有在类似的尺度上进行检查和记录。虽然这些特性很难通过实验测量,但基于它们在已知蛋白质结构中的分布,可以以有意义的方式进行统计估计。残差水平的结构性质包括各种残差距离和残差角度的统计估计。构建了一个软件包来提供对属性的统计数据的直接访问,这些属性包括一些以前没有研究过的重要相关性。残基距离和残基角度的分布随序列的变化而变化,但在大多数情况下,残基距离和残基角度的分布集中在一些高概率范围内,这与它们在α-螺旋或β-片中的频繁出现相对应。基于它们的统计度量,揭示了相邻剩余角之间的强相关性,类似于原子水平上相邻扭转角之间的强相关性。残差水平的统计势可以用残差距离和残差角度的统计分布和相关性来定义。绘制并分析了强相关残馀角的ramachandran样图。说明了它们在结构评价和结构优化中的应用。随着已知蛋白质结构的数量和质量的增加,通过统计分析和数据挖掘可以从蛋白质结构集中获得许多结构特性,这些甚至可以作为结构确定实验数据的补充。事实上,对各种残差距离和残差角度的统计度量提供了对这些性质更系统和定量的评价,否则只能单独和定性地估计。它们在已知蛋白质结构中的分布和相关性显示了它们对于深入了解蛋白质如何自然折叠成各种残基水平结构的重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Structural Genomics: General Applications Classification of ligand molecules in PDB with graph match-based structural superposition HOMCOS: an updated server to search and model complex 3D structures. NLDB: a database for 3D protein-ligand interactions in enzymatic reactions. Toward the next step in G protein-coupled receptor research: a knowledge-driven analysis for the next potential targets in drug discovery
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1