首页 > 最新文献

Computer applications in the biosciences : CABIOS最新文献

英文 中文
PAML: a program package for phylogenetic analysis by maximum likelihood. PAML:用最大似然法进行系统发育分析的程序包。
Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.555
Z Yang
PAML, currently in version 1.2, is a package of programs for phylogenetic analyses of DNA and protein sequences using the method of maximum likelihood (ML). The programs can be used for (i) maximum likelihood estimation of evolutionary parameters such as branch lengths in a phylogenetic tree, the transition/transversion rate ratio, the shape parameter of the gamma distribution for variable evolutionary rates at sites, and rate parameters for different genes; (ii) likelihood ratio test of hypotheses concerning sequence evolution, such as rate constancy and independence among sites and rate constancy among lineages (the molecular clock); (iii) calculation of substitution rates at sites and reconstruction of ancestral nucleotide or amino acid sequences; and (iv) phylogenetic tree reconstruction by maximum likelihood and Bayesian methods. The strength of PAML, in comparison with other phylogenetic packages currently available, is its implementation of a variety of evolutionary models. These include several models of variable evolutionary rates among sites, models for combined analyses of multiple gene sequence data and models for amino acid sequences. Multifurcating trees are supported, as well as trees in which some sequences are ancestral to some others. A heuristic tree search algorithm (star decomposition) is used in the package, but tree making is not a strong point of the current version, although work is under way to implement efficient search algorithms. Major programs in the package, as well as the types of analyses they perform, are listed in Table 1. More details are available in the documentation included in the package, written using Microsoft Word. PAML is distributed free of charge for academic use only. The package, including ANSI C source codes, documentation, example data sets, and control files, can be obtained by anonymous ftp at mw511.biol.berkeley.edu/pub, or from the Indiana molecular biology ftp site at ftp.bio.indiana.edu under the directory Incoming or molbio/evolve . MAC and PowerMac executables are also available, although DOS executables are not prepared yet. Further information about the package is available from the World Wide Web at
{"title":"PAML: a program package for phylogenetic analysis by maximum likelihood.","authors":"Z Yang","doi":"10.1093/bioinformatics/13.5.555","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.555","url":null,"abstract":"PAML, currently in version 1.2, is a package of programs for phylogenetic analyses of DNA and protein sequences using the method of maximum likelihood (ML). The programs can be used for (i) maximum likelihood estimation of evolutionary parameters such as branch lengths in a phylogenetic tree, the transition/transversion rate ratio, the shape parameter of the gamma distribution for variable evolutionary rates at sites, and rate parameters for different genes; (ii) likelihood ratio test of hypotheses concerning sequence evolution, such as rate constancy and independence among sites and rate constancy among lineages (the molecular clock); (iii) calculation of substitution rates at sites and reconstruction of ancestral nucleotide or amino acid sequences; and (iv) phylogenetic tree reconstruction by maximum likelihood and Bayesian methods. The strength of PAML, in comparison with other phylogenetic packages currently available, is its implementation of a variety of evolutionary models. These include several models of variable evolutionary rates among sites, models for combined analyses of multiple gene sequence data and models for amino acid sequences. Multifurcating trees are supported, as well as trees in which some sequences are ancestral to some others. A heuristic tree search algorithm (star decomposition) is used in the package, but tree making is not a strong point of the current version, although work is under way to implement efficient search algorithms. Major programs in the package, as well as the types of analyses they perform, are listed in Table 1. More details are available in the documentation included in the package, written using Microsoft Word. PAML is distributed free of charge for academic use only. The package, including ANSI C source codes, documentation, example data sets, and control files, can be obtained by anonymous ftp at mw511.biol.berkeley.edu/pub, or from the Indiana molecular biology ftp site at ftp.bio.indiana.edu under the directory Incoming or molbio/evolve . MAC and PowerMac executables are also available, although DOS executables are not prepared yet. Further information about the package is available from the World Wide Web at","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.555","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5528
MACS: automatic counting of objects based on shape recognition. MACS:基于形状识别的物体自动计数。
Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.563
J P Rolland, P Bon, D Thomas
MACS is a tool for obtaining basic measurements of cell domains and for automatic counting of particles like colloidal gold probes.
{"title":"MACS: automatic counting of objects based on shape recognition.","authors":"J P Rolland, P Bon, D Thomas","doi":"10.1093/bioinformatics/13.5.563","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.563","url":null,"abstract":"MACS is a tool for obtaining basic measurements of cell domains and for automatic counting of particles like colloidal gold probes.","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.563","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Estimation of equilibrium constants using automated group contribution methods. 用自动群贡献法估计平衡常数。
Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.537
R G Forsythe, P D Karp, M L Mavrovouniotis

Motivation: Group contribution methods are frequently used for estimating physical properties of compounds from their molecular structures. An algorithm for estimating Gibbs energies of formation through group contribution methods has been automated in an object-oriented framework. The algorithm decomposes compound structures according to a basis set of groups. It permits the use of wildcards and is able to distinguish between ring groups and chain groups that use similar search structures. Past methods relied on manual decomposition of compounds into constituent groups.

Results: The software is written in Common LISP and requires < 2 min to estimate Gibbs energies of formation for a database of 780 species of varying size and complexity. The software allows rapid expansion to incorporate different basis sets and to estimate a variety of other physical properties.

动机:基团贡献法经常用于从分子结构估计化合物的物理性质。在面向对象的框架下,实现了一种利用群贡献法估计吉布斯能量的自动化算法。该算法根据一组基集对复合结构进行分解。它允许使用通配符,并且能够区分使用相似搜索结构的环组和链组。过去的方法依赖于人工将化合物分解成组成基团。结果:该软件是用Common LISP编写的,对780个不同大小和复杂程度的物种数据库估计吉布斯形成能需要< 2分钟。该软件允许快速扩展,以纳入不同的基础集,并估计各种其他物理性质。
{"title":"Estimation of equilibrium constants using automated group contribution methods.","authors":"R G Forsythe,&nbsp;P D Karp,&nbsp;M L Mavrovouniotis","doi":"10.1093/bioinformatics/13.5.537","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.537","url":null,"abstract":"<p><strong>Motivation: </strong>Group contribution methods are frequently used for estimating physical properties of compounds from their molecular structures. An algorithm for estimating Gibbs energies of formation through group contribution methods has been automated in an object-oriented framework. The algorithm decomposes compound structures according to a basis set of groups. It permits the use of wildcards and is able to distinguish between ring groups and chain groups that use similar search structures. Past methods relied on manual decomposition of compounds into constituent groups.</p><p><strong>Results: </strong>The software is written in Common LISP and requires < 2 min to estimate Gibbs energies of formation for a database of 780 species of varying size and complexity. The software allows rapid expansion to incorporate different basis sets and to estimate a variety of other physical properties.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.537","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
A platform-independent graphical user interface for SEQSEE and XALIGN. SEQSEE和XALIGN的独立于平台的图形用户界面。
Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.561
D S Wishart, S Fortin, D R Woloschuk, W Wong, T Rosborough, G Van Domselaar, J Schaeffer, D Szafron
SEQSEE (Wishart et cil., 1994a) and XALIGN (Wishart et cil., 1994b) are two text-based, menu-driven programs developed specifically for comprehensive protein sequence analysis. Originally compiled to run on SUN and SGI workstations only, SEQSEE and XALIGN have been distributed to more than 300 laboratories around the world. Both programs have been used in a variety of applications ranging from routine sequence analysis to the identification of previously unknown sequence relationships (Upton et cil., 1992, 1993; Dulhanty and Riordan, 1994). Since releasing these programs, we have received numerous requests asking if they could be ported to additional computer platforms (Macintosh and PC) or if the text-based menus could be replaced with a more friendly graphical user interface (GUI).
{"title":"A platform-independent graphical user interface for SEQSEE and XALIGN.","authors":"D S Wishart,&nbsp;S Fortin,&nbsp;D R Woloschuk,&nbsp;W Wong,&nbsp;T Rosborough,&nbsp;G Van Domselaar,&nbsp;J Schaeffer,&nbsp;D Szafron","doi":"10.1093/bioinformatics/13.5.561","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.561","url":null,"abstract":"SEQSEE (Wishart et cil., 1994a) and XALIGN (Wishart et cil., 1994b) are two text-based, menu-driven programs developed specifically for comprehensive protein sequence analysis. Originally compiled to run on SUN and SGI workstations only, SEQSEE and XALIGN have been distributed to more than 300 laboratories around the world. Both programs have been used in a variety of applications ranging from routine sequence analysis to the identification of previously unknown sequence relationships (Upton et cil., 1992, 1993; Dulhanty and Riordan, 1994). Since releasing these programs, we have received numerous requests asking if they could be ported to additional computer platforms (Macintosh and PC) or if the text-based menus could be replaced with a more friendly graphical user interface (GUI).","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.561","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
DSC: public domain protein secondary structure predication. DSC:公共域蛋白质二级结构预测。
Pub Date : 1997-08-01 DOI: 10.1093/bioinformatics/13.4.473
R D King, M Saqi, R Sayle, M J Sternberg
{"title":"DSC: public domain protein secondary structure predication.","authors":"R D King,&nbsp;M Saqi,&nbsp;R Sayle,&nbsp;M J Sternberg","doi":"10.1093/bioinformatics/13.4.473","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.4.473","url":null,"abstract":"","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.4.473","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20224212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 65
JaMBW 1.1: Java-based Molecular Biologists' Workbench. JaMBW 1.1:基于java的分子生物学家工作台
Pub Date : 1997-08-01 DOI: 10.1093/bioinformatics/13.4.475
L I Toldo
{"title":"JaMBW 1.1: Java-based Molecular Biologists' Workbench.","authors":"L I Toldo","doi":"10.1093/bioinformatics/13.4.475","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.4.475","url":null,"abstract":"","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.4.475","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20224213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Rapid protein fragment search using hash functions based on the Fourier transform. 基于傅里叶变换的哈希函数快速蛋白质片段搜索。
Pub Date : 1997-08-01 DOI: 10.1093/bioinformatics/13.4.357
T Akutsu, K Onizuka, M Ishikawa

Motivation: Since the protein structure database has been growing very rapidly in recent years, the development of efficient methods for searching for similar structures is very important.

Results: This paper presents a novel method for searching for similar fragments of proteins. In this method, a hash vector (a vector of real numbers) is associated with each fixed-length fragment of three-dimensional protein structure. Each vector consists of low-frequency components of the Fourier-like spectrum for the distances between C alpha atoms and the centroid. Then, we can analyze the similarity between fragments by evaluating the difference between hash vectors. The novel aspect of the method is that the following property is proved theoretically: if the root mean square distance between two fragments is small, then the distance between the hash vectors is small. Several variants of this method were compared with a naive method and a previous method using PDB data. The results show that the fastest one among the variants is 18-80 times faster than the naive method, and 3-10 times faster than the previous method.

动机:由于近年来蛋白质结构数据库的增长非常迅速,因此开发有效的方法来搜索相似结构是非常重要的。结果:本文提出了一种寻找相似蛋白片段的新方法。在这种方法中,一个哈希向量(实数向量)与三维蛋白质结构的每个定长片段相关联。每个向量由C原子和质心之间距离的类傅立叶谱的低频分量组成。然后,我们可以通过计算哈希向量之间的差异来分析片段之间的相似性。该方法的新颖之处在于从理论上证明了以下性质:如果两个片段之间的均方根距离较小,则哈希向量之间的距离较小。将该方法的几种变体与原始方法和先前使用PDB数据的方法进行了比较。结果表明,其中速度最快的方法比朴素方法快18 ~ 80倍,比原方法快3 ~ 10倍。
{"title":"Rapid protein fragment search using hash functions based on the Fourier transform.","authors":"T Akutsu,&nbsp;K Onizuka,&nbsp;M Ishikawa","doi":"10.1093/bioinformatics/13.4.357","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.4.357","url":null,"abstract":"<p><strong>Motivation: </strong>Since the protein structure database has been growing very rapidly in recent years, the development of efficient methods for searching for similar structures is very important.</p><p><strong>Results: </strong>This paper presents a novel method for searching for similar fragments of proteins. In this method, a hash vector (a vector of real numbers) is associated with each fixed-length fragment of three-dimensional protein structure. Each vector consists of low-frequency components of the Fourier-like spectrum for the distances between C alpha atoms and the centroid. Then, we can analyze the similarity between fragments by evaluating the difference between hash vectors. The novel aspect of the method is that the following property is proved theoretically: if the root mean square distance between two fragments is small, then the distance between the hash vectors is small. Several variants of this method were compared with a naive method and a previous method using PDB data. The results show that the fastest one among the variants is 18-80 times faster than the naive method, and 3-10 times faster than the previous method.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.4.357","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20225047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Visual BLAST and visual FASTA: graphic workbenches for interactive analysis of full BLAST and FASTA outputs under MICROSOFT WINDOWS 95/NT. Visual BLAST和Visual FASTA:用于在MICROSOFT WINDOWS 95/NT下对完整BLAST和FASTA输出进行交互式分析的图形工作台。
Pub Date : 1997-08-01 DOI: 10.1093/bioinformatics/13.4.407
P Durand, L Canard, J P Mornon

Motivation: When routinely analysing protein sequences, detailed analysis of database search results made with BLAST and FASTA becomes exceedingly time consuming and tedious work, as the resultant file may contain a list of hundreds of potential homologies. The interpretation of these results is usually carried out with a text editor which is not a convenient tool for this analysis. In addition, the format of data within BLAST and FASTA output files makes them difficult to read.

Results: To facilitate and accelerate this analysis, we present for the first time, two easy-to-use programs designed for interactive analysis of full BLAST and FASTA output files containing protein sequence alignments. The programs, Visual BLAST and Visual FASTA, run under Microsoft Windows 95 or NT systems. They are based on the same intuitive graphical user interface (GUI) with extensive viewing, searching, editing, printing and multithreading capabilities. These programs improve the browsing of BLAST/FASTA results by offering a more convenient presentation of these results. They also implement on a computer several analytical tools which automate a manual methodology used for detailed analysis of BLAST and FASTA outputs. These tools include a pairwise sequence alignment viewer, a Hydrophobic Cluster Analysis plot alignment viewer and a tool displaying a graphical map of all database sequences aligned with the query sequence. In addition. Visual Blast includes tools for multiple sequence alignment analysis (with an amino acid patterns search engine), and Visual FASTA provides a GUI to the FASTA program.

动机:在常规分析蛋白质序列时,使用BLAST和FASTA对数据库搜索结果进行详细分析是非常耗时和繁琐的工作,因为结果文件可能包含数百个潜在同源性的列表。这些结果的解释通常是用文本编辑器进行的,这对于这种分析来说不是一个方便的工具。此外,BLAST和FASTA输出文件中的数据格式使它们难以读取。结果:为了促进和加速这一分析,我们首次提出了两个易于使用的程序,用于包含蛋白质序列比对的完整BLAST和FASTA输出文件的交互式分析。这两个程序分别是Visual BLAST和Visual FASTA,在微软Windows 95或NT系统下运行。它们都基于相同的直观图形用户界面(GUI),具有广泛的查看、搜索、编辑、打印和多线程功能。这些程序通过提供更方便的结果呈现来改进BLAST/FASTA结果的浏览。他们还在计算机上实现了几种分析工具,这些工具可以自动执行用于BLAST和FASTA输出详细分析的手动方法。这些工具包括一个成对序列比对查看器,一个疏水聚类分析图比对查看器和一个显示与查询序列对齐的所有数据库序列的图形化地图的工具。此外。Visual Blast包括用于多序列比对分析的工具(带有氨基酸模式搜索引擎),Visual FASTA为FASTA程序提供GUI。
{"title":"Visual BLAST and visual FASTA: graphic workbenches for interactive analysis of full BLAST and FASTA outputs under MICROSOFT WINDOWS 95/NT.","authors":"P Durand,&nbsp;L Canard,&nbsp;J P Mornon","doi":"10.1093/bioinformatics/13.4.407","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.4.407","url":null,"abstract":"<p><strong>Motivation: </strong>When routinely analysing protein sequences, detailed analysis of database search results made with BLAST and FASTA becomes exceedingly time consuming and tedious work, as the resultant file may contain a list of hundreds of potential homologies. The interpretation of these results is usually carried out with a text editor which is not a convenient tool for this analysis. In addition, the format of data within BLAST and FASTA output files makes them difficult to read.</p><p><strong>Results: </strong>To facilitate and accelerate this analysis, we present for the first time, two easy-to-use programs designed for interactive analysis of full BLAST and FASTA output files containing protein sequence alignments. The programs, Visual BLAST and Visual FASTA, run under Microsoft Windows 95 or NT systems. They are based on the same intuitive graphical user interface (GUI) with extensive viewing, searching, editing, printing and multithreading capabilities. These programs improve the browsing of BLAST/FASTA results by offering a more convenient presentation of these results. They also implement on a computer several analytical tools which automate a manual methodology used for detailed analysis of BLAST and FASTA outputs. These tools include a pairwise sequence alignment viewer, a Hydrophobic Cluster Analysis plot alignment viewer and a tool displaying a graphical map of all database sequences aligned with the query sequence. In addition. Visual Blast includes tools for multiple sequence alignment analysis (with an amino acid patterns search engine), and Visual FASTA provides a GUI to the FASTA program.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.4.407","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20225660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Objectively judging the quality of a protein structure from a Ramachandran plot. 从拉马钱德兰图中客观判断蛋白质结构的质量。
Pub Date : 1997-08-01 DOI: 10.1093/bioinformatics/13.4.425
R W Hooft, C Sander, G Vriend

Motivation: Statistical methods that compare observed and expected distributions of experimental observables provide powerful tools for the quality control of protein structures. The distribution of backbone dihedral angles ('Ramachandran plot') has often been used for such quality control, but without a firm statistical foundation.

Results: A new and-simple method is presented for judging the quality of a protein structure based on the distribution of backbone dihedral angles. Inputs to the method are 60 torsion angle distributions extracted from protein structures solved at high resolution; one for each combination of residue type and tri-state secondary structure. Output for a protein is a Ramachandran Z-score, expressing the quality of the Ramachandran plot relative to current state-of-the-art structures.

动机:比较实验观察到的分布和预期分布的统计方法为蛋白质结构的质量控制提供了强有力的工具。主干二面角的分布(“Ramachandran图”)经常被用于这种质量控制,但没有坚实的统计基础。结果:提出了一种基于骨架二面角分布判断蛋白质结构质量的简便方法。该方法的输入是从高分辨率解算的蛋白质结构中提取的60个扭转角分布;残基型和三态二级结构的组合各一个。蛋白质的输出是Ramachandran z分数,表示Ramachandran图相对于当前最先进结构的质量。
{"title":"Objectively judging the quality of a protein structure from a Ramachandran plot.","authors":"R W Hooft,&nbsp;C Sander,&nbsp;G Vriend","doi":"10.1093/bioinformatics/13.4.425","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.4.425","url":null,"abstract":"<p><strong>Motivation: </strong>Statistical methods that compare observed and expected distributions of experimental observables provide powerful tools for the quality control of protein structures. The distribution of backbone dihedral angles ('Ramachandran plot') has often been used for such quality control, but without a firm statistical foundation.</p><p><strong>Results: </strong>A new and-simple method is presented for judging the quality of a protein structure based on the distribution of backbone dihedral angles. Inputs to the method are 60 torsion angle distributions extracted from protein structures solved at high resolution; one for each combination of residue type and tri-state secondary structure. Output for a protein is a Ramachandran Z-score, expressing the quality of the Ramachandran plot relative to current state-of-the-art structures.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.4.425","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20225662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 239
DISTREE: a tool for estimating genetic distances between aligned DNA sequences. DISTREE:用于估计排列DNA序列之间的遗传距离的工具。
Pub Date : 1997-08-01 DOI: 10.1093/bioinformatics/13.4.445
J Schäfer, M Schöniger

Motivation: Substitution rates estimated from aligned DNA data can be used as genetic distances to investigate the phylogenetic relationship of those sequences. For this purpose, a Markov model of nucleotide substitution has to be assumed that describes this process most adequately.

Results: A program is presented that estimates substitution rates and their standard errors for a variety of Markov models. The model introduced by Hasegawa et al. (J. Mol. Evol., 22, 160-174, 1985) is the only one for which distances and standard deviations need to be calculated numerically, since analytical formulae cannot be derived. Each model is implemented in two different variants: (i) assuming rate homogeneity or (ii) starting from Gamma-distributed substitution rates across sequence sites. The estimation of heterogeneous substitution rates is based on a method suggested by Tamura and Nei (Mol. Biol. Evol., 10, 512-526, 1993). All required parameters are estimated from sequence data, hence the user is not asked to supply any additional input. One goal of the program is to support the user when choosing a particular model that describes most adequately the evolution of the given data set. For this purpose, a more detailed analysis of this model fit is provided. Phylogenetic trees reconstructed from the inferred distances using the neighbor-joining algorithm are also available.

动机:从比对的DNA数据中估计的替代率可以用作研究这些序列的系统发育关系的遗传距离。为此,必须假设一个最充分地描述这一过程的核苷酸替代的马尔可夫模型。结果:提出了一个程序,估计替代率及其标准误差的各种马尔可夫模型。Hasegawa et al. (J. Mol. evolution .)引入的模型。(22,160 -174, 1985)是唯一需要用数值方法计算距离和标准偏差的方法,因为无法推导出解析公式。每个模型以两种不同的变体实现:(i)假设速率同质性或(ii)从序列位点上的γ分布替代率开始。非均相取代率的估算基于Tamura和Nei (Mol. Biol)提出的方法。另一个星球。, 10, 512-526, 1993)。所有必需的参数都是从序列数据中估计出来的,因此不要求用户提供任何额外的输入。该程序的一个目标是支持用户选择最充分地描述给定数据集演变的特定模型。为此,提供了对该模型拟合的更详细的分析。利用邻居连接算法从推断的距离重建系统发育树也是可行的。
{"title":"DISTREE: a tool for estimating genetic distances between aligned DNA sequences.","authors":"J Schäfer,&nbsp;M Schöniger","doi":"10.1093/bioinformatics/13.4.445","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.4.445","url":null,"abstract":"<p><strong>Motivation: </strong>Substitution rates estimated from aligned DNA data can be used as genetic distances to investigate the phylogenetic relationship of those sequences. For this purpose, a Markov model of nucleotide substitution has to be assumed that describes this process most adequately.</p><p><strong>Results: </strong>A program is presented that estimates substitution rates and their standard errors for a variety of Markov models. The model introduced by Hasegawa et al. (J. Mol. Evol., 22, 160-174, 1985) is the only one for which distances and standard deviations need to be calculated numerically, since analytical formulae cannot be derived. Each model is implemented in two different variants: (i) assuming rate homogeneity or (ii) starting from Gamma-distributed substitution rates across sequence sites. The estimation of heterogeneous substitution rates is based on a method suggested by Tamura and Nei (Mol. Biol. Evol., 10, 512-526, 1993). All required parameters are estimated from sequence data, hence the user is not asked to supply any additional input. One goal of the program is to support the user when choosing a particular model that describes most adequately the evolution of the given data set. For this purpose, a more detailed analysis of this model fit is provided. Phylogenetic trees reconstructed from the inferred distances using the neighbor-joining algorithm are also available.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.4.445","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20224209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Computer applications in the biosciences : CABIOS
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1