Computer applications in the biosciences : CABIOS最新文献

英文中文

PSeq-Gen: an application for the Monte Carlo simulation of protein sequence evolution along phylogenetic trees. PSeq-Gen:一个应用程序，用于沿着系统发育树的蛋白质序列进化的蒙特卡罗模拟。

Computer applications in the biosciences : CABIOS

Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.559

N C Grassly, J Adachi, A Rambaut

PSeq-Gen will simulate the evolution of protein sequences along evolutionary trees following the procedures previously reported for the DNA sequence simulator Seq-Gen (Sequence-Generator, Rambaut and Grassly, 1997). Statis-tics calculated from these sequences can be used to give expectations under specific null hypotheses of protein evolution. This Monte Carlo simulation approach to testing hypotheses is often termed 'parametric bootstrapping' (see, for example, Efron, 1985; Huelsenbeck et al., 19%; Huelsenbeck and Rannala, 1997), and has many powerful applications, such as testing the molecular clock (Goldman, 1993), detecting recombination (Grassly and Holmes, 1997), and evaluating competing phylogenetic hypotheses (Hillis et al., 19%). Three are mitochondrial each

引用次数: 49

MACS: automatic counting of objects based on shape recognition. MACS:基于形状识别的物体自动计数。

Computer applications in the biosciences : CABIOS

Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.563

J P Rolland, P Bon, D Thomas

MACS is a tool for obtaining basic measurements of cell domains and for automatic counting of particles like colloidal gold probes.

引用次数: 8

PAML: a program package for phylogenetic analysis by maximum likelihood. PAML:用最大似然法进行系统发育分析的程序包。

Computer applications in the biosciences : CABIOS

Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.555

Z Yang

PAML, currently in version 1.2, is a package of programs for phylogenetic analyses of DNA and protein sequences using the method of maximum likelihood (ML). The programs can be used for (i) maximum likelihood estimation of evolutionary parameters such as branch lengths in a phylogenetic tree, the transition/transversion rate ratio, the shape parameter of the gamma distribution for variable evolutionary rates at sites, and rate parameters for different genes; (ii) likelihood ratio test of hypotheses concerning sequence evolution, such as rate constancy and independence among sites and rate constancy among lineages (the molecular clock); (iii) calculation of substitution rates at sites and reconstruction of ancestral nucleotide or amino acid sequences; and (iv) phylogenetic tree reconstruction by maximum likelihood and Bayesian methods. The strength of PAML, in comparison with other phylogenetic packages currently available, is its implementation of a variety of evolutionary models. These include several models of variable evolutionary rates among sites, models for combined analyses of multiple gene sequence data and models for amino acid sequences. Multifurcating trees are supported, as well as trees in which some sequences are ancestral to some others. A heuristic tree search algorithm (star decomposition) is used in the package, but tree making is not a strong point of the current version, although work is under way to implement efficient search algorithms. Major programs in the package, as well as the types of analyses they perform, are listed in Table 1. More details are available in the documentation included in the package, written using Microsoft Word. PAML is distributed free of charge for academic use only. The package, including ANSI C source codes, documentation, example data sets, and control files, can be obtained by anonymous ftp at mw511.biol.berkeley.edu/pub, or from the Indiana molecular biology ftp site at ftp.bio.indiana.edu under the directory Incoming or molbio/evolve . MAC and PowerMac executables are also available, although DOS executables are not prepared yet. Further information about the package is available from the World Wide Web at

{"title":"PAML: a program package for phylogenetic analysis by maximum likelihood.","authors":"Z Yang","doi":"10.1093/bioinformatics/13.5.555","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.555","url":null,"abstract":"PAML, currently in version 1.2, is a package of programs for phylogenetic analyses of DNA and protein sequences using the method of maximum likelihood (ML). The programs can be used for (i) maximum likelihood estimation of evolutionary parameters such as branch lengths in a phylogenetic tree, the transition/transversion rate ratio, the shape parameter of the gamma distribution for variable evolutionary rates at sites, and rate parameters for different genes; (ii) likelihood ratio test of hypotheses concerning sequence evolution, such as rate constancy and independence among sites and rate constancy among lineages (the molecular clock); (iii) calculation of substitution rates at sites and reconstruction of ancestral nucleotide or amino acid sequences; and (iv) phylogenetic tree reconstruction by maximum likelihood and Bayesian methods. The strength of PAML, in comparison with other phylogenetic packages currently available, is its implementation of a variety of evolutionary models. These include several models of variable evolutionary rates among sites, models for combined analyses of multiple gene sequence data and models for amino acid sequences. Multifurcating trees are supported, as well as trees in which some sequences are ancestral to some others. A heuristic tree search algorithm (star decomposition) is used in the package, but tree making is not a strong point of the current version, although work is under way to implement efficient search algorithms. Major programs in the package, as well as the types of analyses they perform, are listed in Table 1. More details are available in the documentation included in the package, written using Microsoft Word. PAML is distributed free of charge for academic use only. The package, including ANSI C source codes, documentation, example data sets, and control files, can be obtained by anonymous ftp at mw511.biol.berkeley.edu/pub, or from the Indiana molecular biology ftp site at ftp.bio.indiana.edu under the directory Incoming or molbio/evolve . MAC and PowerMac executables are also available, although DOS executables are not prepared yet. Further information about the package is available from the World Wide Web at","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":"13 5","pages":"555-6"},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.555","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5528

Estimation of equilibrium constants using automated group contribution methods. 用自动群贡献法估计平衡常数。

Computer applications in the biosciences : CABIOS

Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.537

R G Forsythe, P D Karp, M L Mavrovouniotis

Motivation: Group contribution methods are frequently used for estimating physical properties of compounds from their molecular structures. An algorithm for estimating Gibbs energies of formation through group contribution methods has been automated in an object-oriented framework. The algorithm decomposes compound structures according to a basis set of groups. It permits the use of wildcards and is able to distinguish between ring groups and chain groups that use similar search structures. Past methods relied on manual decomposition of compounds into constituent groups.

Results: The software is written in Common LISP and requires < 2 min to estimate Gibbs energies of formation for a database of 780 species of varying size and complexity. The software allows rapid expansion to incorporate different basis sets and to estimate a variety of other physical properties.

动机:基团贡献法经常用于从分子结构估计化合物的物理性质。在面向对象的框架下，实现了一种利用群贡献法估计吉布斯能量的自动化算法。该算法根据一组基集对复合结构进行分解。它允许使用通配符，并且能够区分使用相似搜索结构的环组和链组。过去的方法依赖于人工将化合物分解成组成基团。结果:该软件是用Common LISP编写的，对780个不同大小和复杂程度的物种数据库估计吉布斯形成能需要< 2分钟。该软件允许快速扩展，以纳入不同的基础集，并估计各种其他物理性质。

引用次数: 14

A platform-independent graphical user interface for SEQSEE and XALIGN. SEQSEE和XALIGN的独立于平台的图形用户界面。

Computer applications in the biosciences : CABIOS

Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.561

D S Wishart, S Fortin, D R Woloschuk, W Wong, T Rosborough, G Van Domselaar, J Schaeffer, D Szafron

SEQSEE (Wishart et cil., 1994a) and XALIGN (Wishart et cil., 1994b) are two text-based, menu-driven programs developed specifically for comprehensive protein sequence analysis. Originally compiled to run on SUN and SGI workstations only, SEQSEE and XALIGN have been distributed to more than 300 laboratories around the world. Both programs have been used in a variety of applications ranging from routine sequence analysis to the identification of previously unknown sequence relationships (Upton et cil., 1992, 1993; Dulhanty and Riordan, 1994). Since releasing these programs, we have received numerous requests asking if they could be ported to additional computer platforms (Macintosh and PC) or if the text-based menus could be replaced with a more friendly graphical user interface (GUI).

引用次数: 8

DSC: public domain protein secondary structure predication. DSC:公共域蛋白质二级结构预测。

Computer applications in the biosciences : CABIOS

Pub Date : 1997-08-01 DOI: 10.1093/bioinformatics/13.4.473

R D King, M Saqi, R Sayle, M J Sternberg

引用次数: 65

JaMBW 1.1: Java-based Molecular Biologists' Workbench. JaMBW 1.1:基于java的分子生物学家工作台

Computer applications in the biosciences : CABIOS

Pub Date : 1997-08-01 DOI: 10.1093/bioinformatics/13.4.475

L I Toldo

引用次数: 12

Rapid protein fragment search using hash functions based on the Fourier transform. 基于傅里叶变换的哈希函数快速蛋白质片段搜索。

Computer applications in the biosciences : CABIOS

Pub Date : 1997-08-01 DOI: 10.1093/bioinformatics/13.4.357

T Akutsu, K Onizuka, M Ishikawa

Motivation: Since the protein structure database has been growing very rapidly in recent years, the development of efficient methods for searching for similar structures is very important.

Results: This paper presents a novel method for searching for similar fragments of proteins. In this method, a hash vector (a vector of real numbers) is associated with each fixed-length fragment of three-dimensional protein structure. Each vector consists of low-frequency components of the Fourier-like spectrum for the distances between C alpha atoms and the centroid. Then, we can analyze the similarity between fragments by evaluating the difference between hash vectors. The novel aspect of the method is that the following property is proved theoretically: if the root mean square distance between two fragments is small, then the distance between the hash vectors is small. Several variants of this method were compared with a naive method and a previous method using PDB data. The results show that the fastest one among the variants is 18-80 times faster than the naive method, and 3-10 times faster than the previous method.

动机:由于近年来蛋白质结构数据库的增长非常迅速，因此开发有效的方法来搜索相似结构是非常重要的。结果:本文提出了一种寻找相似蛋白片段的新方法。在这种方法中，一个哈希向量(实数向量)与三维蛋白质结构的每个定长片段相关联。每个向量由C原子和质心之间距离的类傅立叶谱的低频分量组成。然后，我们可以通过计算哈希向量之间的差异来分析片段之间的相似性。该方法的新颖之处在于从理论上证明了以下性质:如果两个片段之间的均方根距离较小，则哈希向量之间的距离较小。将该方法的几种变体与原始方法和先前使用PDB数据的方法进行了比较。结果表明，其中速度最快的方法比朴素方法快18 ~ 80倍，比原方法快3 ~ 10倍。

{"title":"Rapid protein fragment search using hash functions based on the Fourier transform.","authors":"T Akutsu, K Onizuka, M Ishikawa","doi":"10.1093/bioinformatics/13.4.357","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.4.357","url":null,"abstract":"Motivation: Since the protein structure database has been growing very rapidly in recent years, the development of efficient methods for searching for similar structures is very important.Results: This paper presents a novel method for searching for similar fragments of proteins. In this method, a hash vector (a vector of real numbers) is associated with each fixed-length fragment of three-dimensional protein structure. Each vector consists of low-frequency components of the Fourier-like spectrum for the distances between C alpha atoms and the centroid. Then, we can analyze the similarity between fragments by evaluating the difference between hash vectors. The novel aspect of the method is that the following property is proved theoretically: if the root mean square distance between two fragments is small, then the distance between the hash vectors is small. Several variants of this method were compared with a naive method and a previous method using PDB data. The results show that the fastest one among the variants is 18-80 times faster than the naive method, and 3-10 times faster than the previous method.","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":"13 4","pages":"357-64"},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.4.357","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20225047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Visual BLAST and visual FASTA: graphic workbenches for interactive analysis of full BLAST and FASTA outputs under MICROSOFT WINDOWS 95/NT. Visual BLAST和Visual FASTA:用于在MICROSOFT WINDOWS 95/NT下对完整BLAST和FASTA输出进行交互式分析的图形工作台。

Computer applications in the biosciences : CABIOS

Pub Date : 1997-08-01 DOI: 10.1093/bioinformatics/13.4.407

P Durand, L Canard, J P Mornon

Motivation: When routinely analysing protein sequences, detailed analysis of database search results made with BLAST and FASTA becomes exceedingly time consuming and tedious work, as the resultant file may contain a list of hundreds of potential homologies. The interpretation of these results is usually carried out with a text editor which is not a convenient tool for this analysis. In addition, the format of data within BLAST and FASTA output files makes them difficult to read.

Results: To facilitate and accelerate this analysis, we present for the first time, two easy-to-use programs designed for interactive analysis of full BLAST and FASTA output files containing protein sequence alignments. The programs, Visual BLAST and Visual FASTA, run under Microsoft Windows 95 or NT systems. They are based on the same intuitive graphical user interface (GUI) with extensive viewing, searching, editing, printing and multithreading capabilities. These programs improve the browsing of BLAST/FASTA results by offering a more convenient presentation of these results. They also implement on a computer several analytical tools which automate a manual methodology used for detailed analysis of BLAST and FASTA outputs. These tools include a pairwise sequence alignment viewer, a Hydrophobic Cluster Analysis plot alignment viewer and a tool displaying a graphical map of all database sequences aligned with the query sequence. In addition. Visual Blast includes tools for multiple sequence alignment analysis (with an amino acid patterns search engine), and Visual FASTA provides a GUI to the FASTA program.

动机:在常规分析蛋白质序列时，使用BLAST和FASTA对数据库搜索结果进行详细分析是非常耗时和繁琐的工作，因为结果文件可能包含数百个潜在同源性的列表。这些结果的解释通常是用文本编辑器进行的，这对于这种分析来说不是一个方便的工具。此外，BLAST和FASTA输出文件中的数据格式使它们难以读取。结果:为了促进和加速这一分析，我们首次提出了两个易于使用的程序，用于包含蛋白质序列比对的完整BLAST和FASTA输出文件的交互式分析。这两个程序分别是Visual BLAST和Visual FASTA，在微软Windows 95或NT系统下运行。它们都基于相同的直观图形用户界面(GUI)，具有广泛的查看、搜索、编辑、打印和多线程功能。这些程序通过提供更方便的结果呈现来改进BLAST/FASTA结果的浏览。他们还在计算机上实现了几种分析工具，这些工具可以自动执行用于BLAST和FASTA输出详细分析的手动方法。这些工具包括一个成对序列比对查看器，一个疏水聚类分析图比对查看器和一个显示与查询序列对齐的所有数据库序列的图形化地图的工具。此外。Visual Blast包括用于多序列比对分析的工具(带有氨基酸模式搜索引擎)，Visual FASTA为FASTA程序提供GUI。

{"title":"Visual BLAST and visual FASTA: graphic workbenches for interactive analysis of full BLAST and FASTA outputs under MICROSOFT WINDOWS 95/NT.","authors":"P Durand, L Canard, J P Mornon","doi":"10.1093/bioinformatics/13.4.407","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.4.407","url":null,"abstract":"Motivation: When routinely analysing protein sequences, detailed analysis of database search results made with BLAST and FASTA becomes exceedingly time consuming and tedious work, as the resultant file may contain a list of hundreds of potential homologies. The interpretation of these results is usually carried out with a text editor which is not a convenient tool for this analysis. In addition, the format of data within BLAST and FASTA output files makes them difficult to read.Results: To facilitate and accelerate this analysis, we present for the first time, two easy-to-use programs designed for interactive analysis of full BLAST and FASTA output files containing protein sequence alignments. The programs, Visual BLAST and Visual FASTA, run under Microsoft Windows 95 or NT systems. They are based on the same intuitive graphical user interface (GUI) with extensive viewing, searching, editing, printing and multithreading capabilities. These programs improve the browsing of BLAST/FASTA results by offering a more convenient presentation of these results. They also implement on a computer several analytical tools which automate a manual methodology used for detailed analysis of BLAST and FASTA outputs. These tools include a pairwise sequence alignment viewer, a Hydrophobic Cluster Analysis plot alignment viewer and a tool displaying a graphical map of all database sequences aligned with the query sequence. In addition. Visual Blast includes tools for multiple sequence alignment analysis (with an amino acid patterns search engine), and Visual FASTA provides a GUI to the FASTA program.","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":"13 4","pages":"407-13"},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.4.407","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20225660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 23

Objectively judging the quality of a protein structure from a Ramachandran plot. 从拉马钱德兰图中客观判断蛋白质结构的质量。

Computer applications in the biosciences : CABIOS

Pub Date : 1997-08-01 DOI: 10.1093/bioinformatics/13.4.425

R W Hooft, C Sander, G Vriend

Motivation: Statistical methods that compare observed and expected distributions of experimental observables provide powerful tools for the quality control of protein structures. The distribution of backbone dihedral angles ('Ramachandran plot') has often been used for such quality control, but without a firm statistical foundation.

Results: A new and-simple method is presented for judging the quality of a protein structure based on the distribution of backbone dihedral angles. Inputs to the method are 60 torsion angle distributions extracted from protein structures solved at high resolution; one for each combination of residue type and tri-state secondary structure. Output for a protein is a Ramachandran Z-score, expressing the quality of the Ramachandran plot relative to current state-of-the-art structures.

动机:比较实验观察到的分布和预期分布的统计方法为蛋白质结构的质量控制提供了强有力的工具。主干二面角的分布(“Ramachandran图”)经常被用于这种质量控制，但没有坚实的统计基础。结果:提出了一种基于骨架二面角分布判断蛋白质结构质量的简便方法。该方法的输入是从高分辨率解算的蛋白质结构中提取的60个扭转角分布;残基型和三态二级结构的组合各一个。蛋白质的输出是Ramachandran z分数，表示Ramachandran图相对于当前最先进结构的质量。

引用次数: 239

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Computer applications in the biosciences : CABIOS

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀