Pub Date : 1997-12-01DOI: 10.1093/bioinformatics/13.6.587
P A Doris, A Hayward-Lester, J K Hays
Motivation: We have developed software to assist in the computation of gene expression from data obtained in competitive reverse transcription-polymerase chain reaction (RT-PCR). This report describes the mathematical basis of competitive RT-PCR and discusses the criteria which must be met to permit accurate estimations of gene expression to be obtained using this technique.
Results: The software that has been developed assists in both the assessment of assay performance (specifically in establishing the equality of amplification efficiency of the native and competitor templates) and in the routine analysis of data obtained in quantitation of gene expression by competitive RT-PCR. The software is a 100 kb module which functions as a Microsoft Excel add-in. It is compatible with both Windows and Mac versions of Excel 5 and Excel 7 on the Windows 95 platform, and employs the spreadsheet, statistical and graphing capabilities incorporated into Excel.
Availability: The software can be downloaded from http://www.grad.ttuhsc.edu/archive/. A brief summary in both HTML and Microsoft Word 6 format of the installation and use of the software is also located at this website.
{"title":"Q-RT-PCR: data analysis software for measurement of gene expression by competitive RT-PCR.","authors":"P A Doris, A Hayward-Lester, J K Hays","doi":"10.1093/bioinformatics/13.6.587","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.6.587","url":null,"abstract":"<p><strong>Motivation: </strong>We have developed software to assist in the computation of gene expression from data obtained in competitive reverse transcription-polymerase chain reaction (RT-PCR). This report describes the mathematical basis of competitive RT-PCR and discusses the criteria which must be met to permit accurate estimations of gene expression to be obtained using this technique.</p><p><strong>Results: </strong>The software that has been developed assists in both the assessment of assay performance (specifically in establishing the equality of amplification efficiency of the native and competitor templates) and in the routine analysis of data obtained in quantitation of gene expression by competitive RT-PCR. The software is a 100 kb module which functions as a Microsoft Excel add-in. It is compatible with both Windows and Mac versions of Excel 5 and Excel 7 on the Windows 95 platform, and employs the spreadsheet, statistical and graphing capabilities incorporated into Excel.</p><p><strong>Availability: </strong>The software can be downloaded from http://www.grad.ttuhsc.edu/archive/. A brief summary in both HTML and Microsoft Word 6 format of the installation and use of the software is also located at this website.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.6.587","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20401804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-12-01DOI: 10.1093/bioinformatics/13.6.617
D Fenyö
Disulfide cross-linkages stabilize the structure of many proteins 1993). Therefore, knowledge of the disulfide linkages in a protein provides useful, albeit partial, high-resolution structural information which can be obtained from experiments that are much less time consuming than the X-ray crystallography or NMR spectroscopy experiments for complete structure elucidation.
{"title":"A software tool for the analysis of mass spectrometric disulfide mapping experiments.","authors":"D Fenyö","doi":"10.1093/bioinformatics/13.6.617","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.6.617","url":null,"abstract":"Disulfide cross-linkages stabilize the structure of many proteins 1993). Therefore, knowledge of the disulfide linkages in a protein provides useful, albeit partial, high-resolution structural information which can be obtained from experiments that are much less time consuming than the X-ray crystallography or NMR spectroscopy experiments for complete structure elucidation.","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.6.617","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20401808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-10-01DOI: 10.1093/bioinformatics/13.5.509
I Jonassen
Motivation: We have previously reported an algorithm for discovering patterns conserved in sets of related unaligned protein sequences. The algorithm was implemented in a program called Pratt. Pratt allows the user to define a class of patterns (e.g. the degree of ambiguity allowed and the length and number of gaps), and is then guaranteed to find the conserved patterns in this class scoring highest according to a defined fitness measure. In many cases, this version of Pratt was very efficient, but in other cases it was too time consuming to be applied. Hence, a more efficient algorithm was needed.
Results: In this paper, we describe a new and improved searching strategy that has two main advantages over the old strategy. First, it allows for easier integration with programs for multiple sequence alignment and data base search. Secondly, it makes it possible to use branch-and-bound search, and heuristics, to speed up the search. The new search strategy has been implemented in a new version of the Pratt program.
{"title":"Efficient discovery of conserved patterns using a pattern graph.","authors":"I Jonassen","doi":"10.1093/bioinformatics/13.5.509","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.509","url":null,"abstract":"<p><strong>Motivation: </strong>We have previously reported an algorithm for discovering patterns conserved in sets of related unaligned protein sequences. The algorithm was implemented in a program called Pratt. Pratt allows the user to define a class of patterns (e.g. the degree of ambiguity allowed and the length and number of gaps), and is then guaranteed to find the conserved patterns in this class scoring highest according to a defined fitness measure. In many cases, this version of Pratt was very efficient, but in other cases it was too time consuming to be applied. Hence, a more efficient algorithm was needed.</p><p><strong>Results: </strong>In this paper, we describe a new and improved searching strategy that has two main advantages over the old strategy. First, it allows for easier integration with programs for multiple sequence alignment and data base search. Secondly, it makes it possible to use branch-and-bound search, and heuristics, to speed up the search. The new search strategy has been implemented in a new version of the Pratt program.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.509","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20298912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-10-01DOI: 10.1093/bioinformatics/13.5.545
B Shomer
Motivation: The aims were to: enable users to deposit complex search profiles against the sequence databases; interface to an independent Sequence Retrieval System (SRS) server through the network to perform these searches on a daily basis through the last day's updates of these databases; mail users the reformatted search results, enabling local usage when loaded by a WWW browser.
Results: The deposition of one to many search profiles by the user leads to a daily search of the EMBL and SWISSPROT databases. The search profile is restricted to entries that were deposited during the last 24 h by using the SRS query manager to combine search sets. If the search is successful, the resulting html page is modified from relative URLs to absolute ones, enabling local usage by loading from disk. The results are sent to the user by e-mail.
{"title":"Seqalert--a daily sequence alertness server for the EMBL and SWISSPROT databases.","authors":"B Shomer","doi":"10.1093/bioinformatics/13.5.545","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.545","url":null,"abstract":"<p><strong>Motivation: </strong>The aims were to: enable users to deposit complex search profiles against the sequence databases; interface to an independent Sequence Retrieval System (SRS) server through the network to perform these searches on a daily basis through the last day's updates of these databases; mail users the reformatted search results, enabling local usage when loaded by a WWW browser.</p><p><strong>Results: </strong>The deposition of one to many search profiles by the user leads to a daily search of the EMBL and SWISSPROT databases. The search profile is restricted to entries that were deposited during the last 24 h by using the SRS query manager to combine search sets. If the search is successful, the resulting html page is modified from relative URLs to absolute ones, enabling local usage by loading from disk. The results are sent to the user by e-mail.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.545","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-10-01DOI: 10.1093/bioinformatics/13.5.549
H Williams, J Zobel
Motivation: International sequencing efforts are creating huge nucleotide databases, which are used in searching applications to locate sequences homologous to a query sequence. In such applications, it is desirable that databases are stored compactly, that sequences can be accessed independently of the order in which they were stored, and that data can be rapidly retrieved from secondary storage, since disk costs are often the bottleneck in searching.
Results: We present a purpose-built direct coding scheme for fast retrieval and compression of genomic nucleotide data. The scheme is lossless, readily integrated with sequence search tools, and does not require a model. Direct coding gives good compression and allows faster retrieval than with either uncompressed data or data compressed by other methods, thus yielding significant improvements in search times for high-speed homology search tools.
{"title":"Compression of nucleotide databases for fast searching.","authors":"H Williams, J Zobel","doi":"10.1093/bioinformatics/13.5.549","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.549","url":null,"abstract":"<p><strong>Motivation: </strong>International sequencing efforts are creating huge nucleotide databases, which are used in searching applications to locate sequences homologous to a query sequence. In such applications, it is desirable that databases are stored compactly, that sequences can be accessed independently of the order in which they were stored, and that data can be rapidly retrieved from secondary storage, since disk costs are often the bottleneck in searching.</p><p><strong>Results: </strong>We present a purpose-built direct coding scheme for fast retrieval and compression of genomic nucleotide data. The scheme is lossless, readily integrated with sequence search tools, and does not require a model. Direct coding gives good compression and allows faster retrieval than with either uncompressed data or data compressed by other methods, thus yielding significant improvements in search times for high-speed homology search tools.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.549","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-10-01DOI: 10.1093/bioinformatics/13.5.487
I N Shindyalov, P E Bourne
Motivation: To provide data management tools to maintain and query efficiently experimental and derived protein data with the goal of providing new insights into structure-function relationships. The tools should be portable, extensible, and accessible locally, or via the World Wide Web, providing data that would not otherwise be available.
Results: The initial phase of the work, the data representation and query of all available macromolecular structure data, including real-time access to complex property patterns based on the amino acid sequence, is reported. protein structure data taken from the Protein Data Bank (PDB) are decomposed into native and derived elementary properties, and represented as compact indexed objects minimizing storage requirements and query time for select types of query. In addition, collections of indices representing a particular property are maintained and can be queried for specific property patterns found across the whole database. The approach is proving applicable to a wide variety of data available on specific protein families.
{"title":"Protein data representation and query using optimized data decomposition.","authors":"I N Shindyalov, P E Bourne","doi":"10.1093/bioinformatics/13.5.487","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.487","url":null,"abstract":"<p><strong>Motivation: </strong>To provide data management tools to maintain and query efficiently experimental and derived protein data with the goal of providing new insights into structure-function relationships. The tools should be portable, extensible, and accessible locally, or via the World Wide Web, providing data that would not otherwise be available.</p><p><strong>Results: </strong>The initial phase of the work, the data representation and query of all available macromolecular structure data, including real-time access to complex property patterns based on the amino acid sequence, is reported. protein structure data taken from the Protein Data Bank (PDB) are decomposed into native and derived elementary properties, and represented as compact indexed objects minimizing storage requirements and query time for select types of query. In addition, collections of indices representing a particular property are maintained and can be queried for specific property patterns found across the whole database. The approach is proving applicable to a wide variety of data available on specific protein families.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.487","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20298910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-10-01DOI: 10.1093/bioinformatics/13.5.497
S Goldberg, P Ferrand, N Q Nguyen, J F Boisvieux, S Hazout
Motivation: The approaches usually used for building large genetic maps consist of dividing the marker set into linkage groups and provide local orders that can be tested by multi-point linkage analysis. To deal with the limitations of these approaches, a strategy taking the marker set into account globally is defined.
Results: The paper presents a new approach called 'Bi-Dimensional Scaling Map (BDS-Map) for inferring marker orders and distances in genetic maps based on the use of an additional dimension orthogonal to the map into which markers are projected. Dynamical forces based on a two-point analysis are applied to tend to optimize the marker locations in space. The efficiency of the approach is exemplified on real data (16 and 70 markers on chromosomes 6 and 2, respectively) and simulated data (50 maps of 70 markers).
{"title":"Bi-dimensional scaling map (BDS-Map): an approach for building large genetic maps.","authors":"S Goldberg, P Ferrand, N Q Nguyen, J F Boisvieux, S Hazout","doi":"10.1093/bioinformatics/13.5.497","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.497","url":null,"abstract":"<p><strong>Motivation: </strong>The approaches usually used for building large genetic maps consist of dividing the marker set into linkage groups and provide local orders that can be tested by multi-point linkage analysis. To deal with the limitations of these approaches, a strategy taking the marker set into account globally is defined.</p><p><strong>Results: </strong>The paper presents a new approach called 'Bi-Dimensional Scaling Map (BDS-Map) for inferring marker orders and distances in genetic maps based on the use of an additional dimension orthogonal to the map into which markers are projected. Dynamical forces based on a two-point analysis are applied to tend to optimize the marker locations in space. The efficiency of the approach is exemplified on real data (16 and 70 markers on chromosomes 6 and 2, respectively) and simulated data (50 maps of 70 markers).</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.497","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20298911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-10-01DOI: 10.1093/bioinformatics/13.5.523
C Soderlund, I Longden, R Mott
Motivation: To meet the demands of large-scale sequencing, thousands of clones must be fingerprinted and assembled into contigs. To determine the order of clones, a typical experiment is to digest the clones with one or more restriction enzymes and measure the resulting fragments. The probability of two clones overlapping is based on the similarity of their fragments. A contig contains two or more overlapping clones and a minimal tiling path of clones is selected to be sequenced. Interactive software with algorithmic support is necessary to assemble the clones into contigs quickly.
Results: FPC (fingerprinted contigs) is an interactive program for building contigs from restriction fingerprinted clones. FPC uses an algorithm to cluster clones into contigs based on their probability of coincidence score. For each contig, it builds a consensus band (CB) map which is similar to a restriction map; but it does not try to resolve all the errors. The CB map is used to assign coordinates to the clones based on their alignment to the map and to provide a detailed visualization of the clone overlap. FPC has editing facilities for the user to refine the coordinates and to remove poorly fingerprinted clones. Functions are available for updating an FPC database with new clones. Contigs can easily be merged, split or deleted. Markers can be added to clones and are displayed with the appropriate contig. Sequence-ready clones can be selected and their sequencing status displayed. As such, FPC is an integrated program for the assembly of sequence-ready clones for large-scale sequencing projects.
{"title":"FPC: a system for building contigs from restriction fingerprinted clones.","authors":"C Soderlund, I Longden, R Mott","doi":"10.1093/bioinformatics/13.5.523","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.523","url":null,"abstract":"<p><strong>Motivation: </strong>To meet the demands of large-scale sequencing, thousands of clones must be fingerprinted and assembled into contigs. To determine the order of clones, a typical experiment is to digest the clones with one or more restriction enzymes and measure the resulting fragments. The probability of two clones overlapping is based on the similarity of their fragments. A contig contains two or more overlapping clones and a minimal tiling path of clones is selected to be sequenced. Interactive software with algorithmic support is necessary to assemble the clones into contigs quickly.</p><p><strong>Results: </strong>FPC (fingerprinted contigs) is an interactive program for building contigs from restriction fingerprinted clones. FPC uses an algorithm to cluster clones into contigs based on their probability of coincidence score. For each contig, it builds a consensus band (CB) map which is similar to a restriction map; but it does not try to resolve all the errors. The CB map is used to assign coordinates to the clones based on their alignment to the map and to provide a detailed visualization of the clone overlap. FPC has editing facilities for the user to refine the coordinates and to remove poorly fingerprinted clones. Functions are available for updating an FPC database with new clones. Contigs can easily be merged, split or deleted. Markers can be added to clones and are displayed with the appropriate contig. Sequence-ready clones can be selected and their sequencing status displayed. As such, FPC is an integrated program for the assembly of sequence-ready clones for large-scale sequencing projects.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.523","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20298913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-10-01DOI: 10.1093/bioinformatics/13.5.557
W Gronwald, R F Boyko, B D Sykes
Comparison of NMR chemical shift files can provide the researcher with valuable insights. For example, a comparison of NMR chemical shifts obtained for the same protein at various temperatures can provide insights into the protein's thermal stability (Roder, 1989). Also, shift comparisons of a large protein subjected to various detergents can show whether protein tertiary structure is disrupted (Mutter and Altmann, 1985), or the detergent may have an effect on stabilizing the secondary structure in a small protein (Dyson etai, 1988). Chemical shift changes can verify metal bindings in proteins (Shaw et al., 1990) and it can be insightful to compare the overall effect on chemical shifts from protein mutations (Rajarathnam et al., 1994). Chemical shift comparison of two or more similar proteins can identify regions where the global fold and secondary structure are preserved, and where changes can be expected (Kim et al., 1994). Despite its useful nature, chemical shift comparison can be very tedious and error prone. The GSC program has been developed to make it easy to analyze chemical shifts of selected atoms between chemical shift files. We have used GSC with great success to compare predicted chemical shifts with observed ones to gain insight into the accuracy of our new NMR chemical shift prediction program ORB (Gronwald etai, 1997).Figure 1 shows a main screen snapshot of GSC. The program is started by designating one set of shifts as the 'reference shifts' and either one or two other shift sets as 'comparison shifts'. Once the user selects the atom type (e.g. HN, Ha, Ca, etc.) and the region of the sequences to study, the program draws line graph(s) depicting the difference in shifts between the reference and comparison shifts. The Pearson correlation coefficient (Larson and Marx, 1981) and the average error measure overall chemical shift difference. Here the average error is the sum of the absolute values of all chemical shift differences (in p.p.m.) divided by the number of compared atoms.
{"title":"GSC: a graphical program for NMR chemical shift comparison.","authors":"W Gronwald, R F Boyko, B D Sykes","doi":"10.1093/bioinformatics/13.5.557","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.557","url":null,"abstract":"Comparison of NMR chemical shift files can provide the researcher with valuable insights. For example, a comparison of NMR chemical shifts obtained for the same protein at various temperatures can provide insights into the protein's thermal stability (Roder, 1989). Also, shift comparisons of a large protein subjected to various detergents can show whether protein tertiary structure is disrupted (Mutter and Altmann, 1985), or the detergent may have an effect on stabilizing the secondary structure in a small protein (Dyson etai, 1988). Chemical shift changes can verify metal bindings in proteins (Shaw et al., 1990) and it can be insightful to compare the overall effect on chemical shifts from protein mutations (Rajarathnam et al., 1994). Chemical shift comparison of two or more similar proteins can identify regions where the global fold and secondary structure are preserved, and where changes can be expected (Kim et al., 1994). Despite its useful nature, chemical shift comparison can be very tedious and error prone. The GSC program has been developed to make it easy to analyze chemical shifts of selected atoms between chemical shift files. We have used GSC with great success to compare predicted chemical shifts with observed ones to gain insight into the accuracy of our new NMR chemical shift prediction program ORB (Gronwald etai, 1997).Figure 1 shows a main screen snapshot of GSC. The program is started by designating one set of shifts as the 'reference shifts' and either one or two other shift sets as 'comparison shifts'. Once the user selects the atom type (e.g. HN, Ha, Ca, etc.) and the region of the sequences to study, the program draws line graph(s) depicting the difference in shifts between the reference and comparison shifts. The Pearson correlation coefficient (Larson and Marx, 1981) and the average error measure overall chemical shift difference. Here the average error is the sum of the absolute values of all chemical shift differences (in p.p.m.) divided by the number of compared atoms.","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.557","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-10-01DOI: 10.1093/bioinformatics/13.5.559
N C Grassly, J Adachi, A Rambaut
PSeq-Gen will simulate the evolution of protein sequences along evolutionary trees following the procedures previously reported for the DNA sequence simulator Seq-Gen (Sequence-Generator, Rambaut and Grassly, 1997). Statis-tics calculated from these sequences can be used to give expectations under specific null hypotheses of protein evolution. This Monte Carlo simulation approach to testing hypotheses is often termed 'parametric bootstrapping' (see, for example, Efron, 1985; Huelsenbeck et al., 19%; Huelsenbeck and Rannala, 1997), and has many powerful applications, such as testing the molecular clock (Goldman, 1993), detecting recombination (Grassly and Holmes, 1997), and evaluating competing phylogenetic hypotheses (Hillis et al., 19%). Three are mitochondrial each
{"title":"PSeq-Gen: an application for the Monte Carlo simulation of protein sequence evolution along phylogenetic trees.","authors":"N C Grassly, J Adachi, A Rambaut","doi":"10.1093/bioinformatics/13.5.559","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.559","url":null,"abstract":"PSeq-Gen will simulate the evolution of protein sequences along evolutionary trees following the procedures previously reported for the DNA sequence simulator Seq-Gen (Sequence-Generator, Rambaut and Grassly, 1997). Statis-tics calculated from these sequences can be used to give expectations under specific null hypotheses of protein evolution. This Monte Carlo simulation approach to testing hypotheses is often termed 'parametric bootstrapping' (see, for example, Efron, 1985; Huelsenbeck et al., 19%; Huelsenbeck and Rannala, 1997), and has many powerful applications, such as testing the molecular clock (Goldman, 1993), detecting recombination (Grassly and Holmes, 1997), and evaluating competing phylogenetic hypotheses (Hillis et al., 19%). Three are mitochondrial each","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.559","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}