首页 > 最新文献

Computer applications in the biosciences : CABIOS最新文献

英文 中文
Q-RT-PCR: data analysis software for measurement of gene expression by competitive RT-PCR. Q-RT-PCR:通过竞争性RT-PCR测量基因表达的数据分析软件。
Pub Date : 1997-12-01 DOI: 10.1093/bioinformatics/13.6.587
P A Doris, A Hayward-Lester, J K Hays

Motivation: We have developed software to assist in the computation of gene expression from data obtained in competitive reverse transcription-polymerase chain reaction (RT-PCR). This report describes the mathematical basis of competitive RT-PCR and discusses the criteria which must be met to permit accurate estimations of gene expression to be obtained using this technique.

Results: The software that has been developed assists in both the assessment of assay performance (specifically in establishing the equality of amplification efficiency of the native and competitor templates) and in the routine analysis of data obtained in quantitation of gene expression by competitive RT-PCR. The software is a 100 kb module which functions as a Microsoft Excel add-in. It is compatible with both Windows and Mac versions of Excel 5 and Excel 7 on the Windows 95 platform, and employs the spreadsheet, statistical and graphing capabilities incorporated into Excel.

Availability: The software can be downloaded from http://www.grad.ttuhsc.edu/archive/. A brief summary in both HTML and Microsoft Word 6 format of the installation and use of the software is also located at this website.

动机:我们开发了软件来帮助计算从竞争性逆转录聚合酶链反应(RT-PCR)中获得的数据的基因表达。本报告描述了竞争性RT-PCR的数学基础,并讨论了必须满足的标准,以便使用该技术获得准确的基因表达估计。结果:已开发的软件有助于分析性能的评估(特别是在建立本地和竞争对手模板的扩增效率的平等),并有助于通过竞争RT-PCR定量基因表达获得的数据的常规分析。该软件是一个100 kb的模块,功能为微软Excel插件。它在Windows 95平台上兼容Windows和Mac版本的Excel 5和Excel 7,并采用Excel中包含的电子表格、统计和绘图功能。可用性:软件可从http://www.grad.ttuhsc.edu/archive/下载。本网页亦以HTML及microsoftword6格式简述软件的安装及使用方法。
{"title":"Q-RT-PCR: data analysis software for measurement of gene expression by competitive RT-PCR.","authors":"P A Doris,&nbsp;A Hayward-Lester,&nbsp;J K Hays","doi":"10.1093/bioinformatics/13.6.587","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.6.587","url":null,"abstract":"<p><strong>Motivation: </strong>We have developed software to assist in the computation of gene expression from data obtained in competitive reverse transcription-polymerase chain reaction (RT-PCR). This report describes the mathematical basis of competitive RT-PCR and discusses the criteria which must be met to permit accurate estimations of gene expression to be obtained using this technique.</p><p><strong>Results: </strong>The software that has been developed assists in both the assessment of assay performance (specifically in establishing the equality of amplification efficiency of the native and competitor templates) and in the routine analysis of data obtained in quantitation of gene expression by competitive RT-PCR. The software is a 100 kb module which functions as a Microsoft Excel add-in. It is compatible with both Windows and Mac versions of Excel 5 and Excel 7 on the Windows 95 platform, and employs the spreadsheet, statistical and graphing capabilities incorporated into Excel.</p><p><strong>Availability: </strong>The software can be downloaded from http://www.grad.ttuhsc.edu/archive/. A brief summary in both HTML and Microsoft Word 6 format of the installation and use of the software is also located at this website.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.6.587","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20401804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A software tool for the analysis of mass spectrometric disulfide mapping experiments. 质谱二硫化物作图实验分析软件工具。
Pub Date : 1997-12-01 DOI: 10.1093/bioinformatics/13.6.617
D Fenyö
Disulfide cross-linkages stabilize the structure of many proteins 1993). Therefore, knowledge of the disulfide linkages in a protein provides useful, albeit partial, high-resolution structural information which can be obtained from experiments that are much less time consuming than the X-ray crystallography or NMR spectroscopy experiments for complete structure elucidation.
{"title":"A software tool for the analysis of mass spectrometric disulfide mapping experiments.","authors":"D Fenyö","doi":"10.1093/bioinformatics/13.6.617","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.6.617","url":null,"abstract":"Disulfide cross-linkages stabilize the structure of many proteins 1993). Therefore, knowledge of the disulfide linkages in a protein provides useful, albeit partial, high-resolution structural information which can be obtained from experiments that are much less time consuming than the X-ray crystallography or NMR spectroscopy experiments for complete structure elucidation.","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.6.617","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20401808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Efficient discovery of conserved patterns using a pattern graph. 使用模式图有效地发现保守模式。
Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.509
I Jonassen

Motivation: We have previously reported an algorithm for discovering patterns conserved in sets of related unaligned protein sequences. The algorithm was implemented in a program called Pratt. Pratt allows the user to define a class of patterns (e.g. the degree of ambiguity allowed and the length and number of gaps), and is then guaranteed to find the conserved patterns in this class scoring highest according to a defined fitness measure. In many cases, this version of Pratt was very efficient, but in other cases it was too time consuming to be applied. Hence, a more efficient algorithm was needed.

Results: In this paper, we describe a new and improved searching strategy that has two main advantages over the old strategy. First, it allows for easier integration with programs for multiple sequence alignment and data base search. Secondly, it makes it possible to use branch-and-bound search, and heuristics, to speed up the search. The new search strategy has been implemented in a new version of the Pratt program.

动机:我们之前已经报道了一种算法,用于发现相关未对齐蛋白质序列中保守的模式。这个算法是在一个叫做Pratt的程序中实现的。Pratt允许用户定义一类模式(例如允许的模糊程度和间隔的长度和数量),然后保证根据定义的适应度度量在该类中找到得分最高的保守模式。在许多情况下,这个版本的普拉特是非常有效的,但在其他情况下,它太耗时而无法应用。因此,需要一种更有效的算法。结果:在本文中,我们描述了一种新的和改进的搜索策略,与旧策略相比,它有两个主要优点。首先,它允许更容易地集成多个序列比对和数据库搜索程序。其次,它可以使用分支定界搜索和启发式算法来加快搜索速度。新的搜索策略已在新版本的普拉特程序中实现。
{"title":"Efficient discovery of conserved patterns using a pattern graph.","authors":"I Jonassen","doi":"10.1093/bioinformatics/13.5.509","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.509","url":null,"abstract":"<p><strong>Motivation: </strong>We have previously reported an algorithm for discovering patterns conserved in sets of related unaligned protein sequences. The algorithm was implemented in a program called Pratt. Pratt allows the user to define a class of patterns (e.g. the degree of ambiguity allowed and the length and number of gaps), and is then guaranteed to find the conserved patterns in this class scoring highest according to a defined fitness measure. In many cases, this version of Pratt was very efficient, but in other cases it was too time consuming to be applied. Hence, a more efficient algorithm was needed.</p><p><strong>Results: </strong>In this paper, we describe a new and improved searching strategy that has two main advantages over the old strategy. First, it allows for easier integration with programs for multiple sequence alignment and data base search. Secondly, it makes it possible to use branch-and-bound search, and heuristics, to speed up the search. The new search strategy has been implemented in a new version of the Pratt program.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.509","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20298912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 172
Seqalert--a daily sequence alertness server for the EMBL and SWISSPROT databases. Seqalert—EMBL和SWISSPROT数据库的每日序列警报服务器。
Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.545
B Shomer

Motivation: The aims were to: enable users to deposit complex search profiles against the sequence databases; interface to an independent Sequence Retrieval System (SRS) server through the network to perform these searches on a daily basis through the last day's updates of these databases; mail users the reformatted search results, enabling local usage when loaded by a WWW browser.

Results: The deposition of one to many search profiles by the user leads to a daily search of the EMBL and SWISSPROT databases. The search profile is restricted to entries that were deposited during the last 24 h by using the SRS query manager to combine search sets. If the search is successful, the resulting html page is modified from relative URLs to absolute ones, enabling local usage by loading from disk. The results are sent to the user by e-mail.

动机:目的是:使用户能够将复杂的搜索配置文件存入序列数据库;通过网络与独立的序列检索系统(SRS)服务器连接,通过这些数据库的最后一天更新来执行这些查询;向用户发送重新格式化的搜索结果,以便在WWW浏览器加载时在本地使用。结果:用户对一个或多个搜索配置文件的沉积导致每天对EMBL和SWISSPROT数据库进行搜索。搜索概要文件仅限于使用SRS查询管理器组合搜索集而在过去24小时内存放的条目。如果搜索成功,生成的html页面将从相对url修改为绝对url,从而通过从磁盘加载实现本地使用。结果通过电子邮件发送给用户。
{"title":"Seqalert--a daily sequence alertness server for the EMBL and SWISSPROT databases.","authors":"B Shomer","doi":"10.1093/bioinformatics/13.5.545","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.545","url":null,"abstract":"<p><strong>Motivation: </strong>The aims were to: enable users to deposit complex search profiles against the sequence databases; interface to an independent Sequence Retrieval System (SRS) server through the network to perform these searches on a daily basis through the last day's updates of these databases; mail users the reformatted search results, enabling local usage when loaded by a WWW browser.</p><p><strong>Results: </strong>The deposition of one to many search profiles by the user leads to a daily search of the EMBL and SWISSPROT databases. The search profile is restricted to entries that were deposited during the last 24 h by using the SRS query manager to combine search sets. If the search is successful, the resulting html page is modified from relative URLs to absolute ones, enabling local usage by loading from disk. The results are sent to the user by e-mail.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.545","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Compression of nucleotide databases for fast searching. 压缩核苷酸数据库的快速搜索。
Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.549
H Williams, J Zobel

Motivation: International sequencing efforts are creating huge nucleotide databases, which are used in searching applications to locate sequences homologous to a query sequence. In such applications, it is desirable that databases are stored compactly, that sequences can be accessed independently of the order in which they were stored, and that data can be rapidly retrieved from secondary storage, since disk costs are often the bottleneck in searching.

Results: We present a purpose-built direct coding scheme for fast retrieval and compression of genomic nucleotide data. The scheme is lossless, readily integrated with sequence search tools, and does not require a model. Direct coding gives good compression and allows faster retrieval than with either uncompressed data or data compressed by other methods, thus yielding significant improvements in search times for high-speed homology search tools.

动机:国际上的测序工作正在创建巨大的核苷酸数据库,这些数据库用于搜索应用程序来定位与查询序列同源的序列。在这样的应用程序中,希望数据库存储紧凑,可以独立于存储顺序访问序列,并且可以从二级存储中快速检索数据,因为磁盘成本通常是搜索的瓶颈。结果:我们提出了一个专门构建的直接编码方案,用于快速检索和压缩基因组核苷酸数据。该方案具有无损性,易于与序列搜索工具集成,并且不需要模型。与未压缩数据或使用其他方法压缩的数据相比,直接编码提供了良好的压缩,并且允许更快的检索,从而显著改善了高速同源性搜索工具的搜索时间。
{"title":"Compression of nucleotide databases for fast searching.","authors":"H Williams,&nbsp;J Zobel","doi":"10.1093/bioinformatics/13.5.549","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.549","url":null,"abstract":"<p><strong>Motivation: </strong>International sequencing efforts are creating huge nucleotide databases, which are used in searching applications to locate sequences homologous to a query sequence. In such applications, it is desirable that databases are stored compactly, that sequences can be accessed independently of the order in which they were stored, and that data can be rapidly retrieved from secondary storage, since disk costs are often the bottleneck in searching.</p><p><strong>Results: </strong>We present a purpose-built direct coding scheme for fast retrieval and compression of genomic nucleotide data. The scheme is lossless, readily integrated with sequence search tools, and does not require a model. Direct coding gives good compression and allows faster retrieval than with either uncompressed data or data compressed by other methods, thus yielding significant improvements in search times for high-speed homology search tools.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.549","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Protein data representation and query using optimized data decomposition. 蛋白质数据表示和查询使用优化的数据分解。
Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.487
I N Shindyalov, P E Bourne

Motivation: To provide data management tools to maintain and query efficiently experimental and derived protein data with the goal of providing new insights into structure-function relationships. The tools should be portable, extensible, and accessible locally, or via the World Wide Web, providing data that would not otherwise be available.

Results: The initial phase of the work, the data representation and query of all available macromolecular structure data, including real-time access to complex property patterns based on the amino acid sequence, is reported. protein structure data taken from the Protein Data Bank (PDB) are decomposed into native and derived elementary properties, and represented as compact indexed objects minimizing storage requirements and query time for select types of query. In addition, collections of indices representing a particular property are maintained and can be queried for specific property patterns found across the whole database. The approach is proving applicable to a wide variety of data available on specific protein families.

动机:提供数据管理工具,以有效地维护和查询实验和衍生的蛋白质数据,目的是为结构-功能关系提供新的见解。这些工具应该是可移植的、可扩展的,并且可以在本地访问,或者通过万维网访问,提供否则无法获得的数据。结果:报告了工作的初始阶段,所有可用的大分子结构数据的数据表示和查询,包括基于氨基酸序列的复杂属性模式的实时访问。从蛋白质数据库(PDB)中获取的蛋白质结构数据被分解为原生和派生的基本属性,并表示为紧凑的索引对象,从而最大限度地减少了存储需求和查询时间。此外,还维护了表示特定属性的索引集合,可以在整个数据库中查询特定的属性模式。事实证明,该方法适用于特定蛋白质家族的各种可用数据。
{"title":"Protein data representation and query using optimized data decomposition.","authors":"I N Shindyalov,&nbsp;P E Bourne","doi":"10.1093/bioinformatics/13.5.487","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.487","url":null,"abstract":"<p><strong>Motivation: </strong>To provide data management tools to maintain and query efficiently experimental and derived protein data with the goal of providing new insights into structure-function relationships. The tools should be portable, extensible, and accessible locally, or via the World Wide Web, providing data that would not otherwise be available.</p><p><strong>Results: </strong>The initial phase of the work, the data representation and query of all available macromolecular structure data, including real-time access to complex property patterns based on the amino acid sequence, is reported. protein structure data taken from the Protein Data Bank (PDB) are decomposed into native and derived elementary properties, and represented as compact indexed objects minimizing storage requirements and query time for select types of query. In addition, collections of indices representing a particular property are maintained and can be queried for specific property patterns found across the whole database. The approach is proving applicable to a wide variety of data available on specific protein families.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.487","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20298910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Bi-dimensional scaling map (BDS-Map): an approach for building large genetic maps. 双向标度图(BDS-Map):一种构建大型遗传图谱的方法。
Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.497
S Goldberg, P Ferrand, N Q Nguyen, J F Boisvieux, S Hazout

Motivation: The approaches usually used for building large genetic maps consist of dividing the marker set into linkage groups and provide local orders that can be tested by multi-point linkage analysis. To deal with the limitations of these approaches, a strategy taking the marker set into account globally is defined.

Results: The paper presents a new approach called 'Bi-Dimensional Scaling Map (BDS-Map) for inferring marker orders and distances in genetic maps based on the use of an additional dimension orthogonal to the map into which markers are projected. Dynamical forces based on a two-point analysis are applied to tend to optimize the marker locations in space. The efficiency of the approach is exemplified on real data (16 and 70 markers on chromosomes 6 and 2, respectively) and simulated data (50 maps of 70 markers).

动机:通常用于构建大型遗传图谱的方法包括将标记集划分为连锁组,并提供可通过多点连锁分析进行测试的局部顺序。为了解决这些方法的局限性,定义了一种考虑全局标记集的策略。结果:本文提出了一种新的方法,称为“二维缩放图”(BDS-Map),用于推断遗传图谱中的标记顺序和距离,该方法基于使用与标记投影到的图谱正交的额外维度。利用基于两点分析的动态力来优化标记在空间中的位置。实际数据(6号和2号染色体上分别有16个和70个标记)和模拟数据(70个标记的50个图谱)验证了该方法的有效性。
{"title":"Bi-dimensional scaling map (BDS-Map): an approach for building large genetic maps.","authors":"S Goldberg,&nbsp;P Ferrand,&nbsp;N Q Nguyen,&nbsp;J F Boisvieux,&nbsp;S Hazout","doi":"10.1093/bioinformatics/13.5.497","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.497","url":null,"abstract":"<p><strong>Motivation: </strong>The approaches usually used for building large genetic maps consist of dividing the marker set into linkage groups and provide local orders that can be tested by multi-point linkage analysis. To deal with the limitations of these approaches, a strategy taking the marker set into account globally is defined.</p><p><strong>Results: </strong>The paper presents a new approach called 'Bi-Dimensional Scaling Map (BDS-Map) for inferring marker orders and distances in genetic maps based on the use of an additional dimension orthogonal to the map into which markers are projected. Dynamical forces based on a two-point analysis are applied to tend to optimize the marker locations in space. The efficiency of the approach is exemplified on real data (16 and 70 markers on chromosomes 6 and 2, respectively) and simulated data (50 maps of 70 markers).</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.497","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20298911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FPC: a system for building contigs from restriction fingerprinted clones. FPC:从限制性指纹克隆中构建组件的系统。
Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.523
C Soderlund, I Longden, R Mott

Motivation: To meet the demands of large-scale sequencing, thousands of clones must be fingerprinted and assembled into contigs. To determine the order of clones, a typical experiment is to digest the clones with one or more restriction enzymes and measure the resulting fragments. The probability of two clones overlapping is based on the similarity of their fragments. A contig contains two or more overlapping clones and a minimal tiling path of clones is selected to be sequenced. Interactive software with algorithmic support is necessary to assemble the clones into contigs quickly.

Results: FPC (fingerprinted contigs) is an interactive program for building contigs from restriction fingerprinted clones. FPC uses an algorithm to cluster clones into contigs based on their probability of coincidence score. For each contig, it builds a consensus band (CB) map which is similar to a restriction map; but it does not try to resolve all the errors. The CB map is used to assign coordinates to the clones based on their alignment to the map and to provide a detailed visualization of the clone overlap. FPC has editing facilities for the user to refine the coordinates and to remove poorly fingerprinted clones. Functions are available for updating an FPC database with new clones. Contigs can easily be merged, split or deleted. Markers can be added to clones and are displayed with the appropriate contig. Sequence-ready clones can be selected and their sequencing status displayed. As such, FPC is an integrated program for the assembly of sequence-ready clones for large-scale sequencing projects.

动机:为了满足大规模测序的需求,必须对成千上万的克隆进行指纹识别并组装成contigs。为了确定克隆的顺序,一个典型的实验是用一种或多种限制性内切酶消化克隆并测量产生的片段。两个克隆重叠的概率是基于它们片段的相似性。contig包含两个或多个重叠克隆,并选择克隆的最小平铺路径进行测序。交互式软件与算法的支持是必要的,以迅速组装成组态克隆。结果:FPC (fingerprinting contigs)是一个从限制性指纹克隆中构建contigs的交互式程序。FPC使用一种算法根据它们的符合得分的概率将克隆聚类成组。对于每个配置,构建一个类似于约束映射的共识带(CB)映射;但它并没有试图解决所有的错误。CB地图用于根据克隆与地图的对齐情况为克隆分配坐标,并提供克隆重叠的详细可视化。FPC具有编辑功能,用户可以对坐标进行细化,并删除差的指纹克隆。函数可用于用新的克隆更新FPC数据库。配置可以很容易地合并、分割或删除。可以将标记添加到克隆中,并显示相应的配置。可以选择序列就绪的克隆并显示其测序状态。因此,FPC是用于大规模测序项目的序列准备克隆组装的集成程序。
{"title":"FPC: a system for building contigs from restriction fingerprinted clones.","authors":"C Soderlund,&nbsp;I Longden,&nbsp;R Mott","doi":"10.1093/bioinformatics/13.5.523","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.523","url":null,"abstract":"<p><strong>Motivation: </strong>To meet the demands of large-scale sequencing, thousands of clones must be fingerprinted and assembled into contigs. To determine the order of clones, a typical experiment is to digest the clones with one or more restriction enzymes and measure the resulting fragments. The probability of two clones overlapping is based on the similarity of their fragments. A contig contains two or more overlapping clones and a minimal tiling path of clones is selected to be sequenced. Interactive software with algorithmic support is necessary to assemble the clones into contigs quickly.</p><p><strong>Results: </strong>FPC (fingerprinted contigs) is an interactive program for building contigs from restriction fingerprinted clones. FPC uses an algorithm to cluster clones into contigs based on their probability of coincidence score. For each contig, it builds a consensus band (CB) map which is similar to a restriction map; but it does not try to resolve all the errors. The CB map is used to assign coordinates to the clones based on their alignment to the map and to provide a detailed visualization of the clone overlap. FPC has editing facilities for the user to refine the coordinates and to remove poorly fingerprinted clones. Functions are available for updating an FPC database with new clones. Contigs can easily be merged, split or deleted. Markers can be added to clones and are displayed with the appropriate contig. Sequence-ready clones can be selected and their sequencing status displayed. As such, FPC is an integrated program for the assembly of sequence-ready clones for large-scale sequencing projects.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.523","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20298913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 303
GSC: a graphical program for NMR chemical shift comparison. 核磁共振化学位移比较的图形程序。
Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.557
W Gronwald, R F Boyko, B D Sykes
Comparison of NMR chemical shift files can provide the researcher with valuable insights. For example, a comparison of NMR chemical shifts obtained for the same protein at various temperatures can provide insights into the protein's thermal stability (Roder, 1989). Also, shift comparisons of a large protein subjected to various detergents can show whether protein tertiary structure is disrupted (Mutter and Altmann, 1985), or the detergent may have an effect on stabilizing the secondary structure in a small protein (Dyson etai, 1988). Chemical shift changes can verify metal bindings in proteins (Shaw et al., 1990) and it can be insightful to compare the overall effect on chemical shifts from protein mutations (Rajarathnam et al., 1994). Chemical shift comparison of two or more similar proteins can identify regions where the global fold and secondary structure are preserved, and where changes can be expected (Kim et al., 1994). Despite its useful nature, chemical shift comparison can be very tedious and error prone. The GSC program has been developed to make it easy to analyze chemical shifts of selected atoms between chemical shift files. We have used GSC with great success to compare predicted chemical shifts with observed ones to gain insight into the accuracy of our new NMR chemical shift prediction program ORB (Gronwald etai, 1997).Figure 1 shows a main screen snapshot of GSC. The program is started by designating one set of shifts as the 'reference shifts' and either one or two other shift sets as 'comparison shifts'. Once the user selects the atom type (e.g. HN, Ha, Ca, etc.) and the region of the sequences to study, the program draws line graph(s) depicting the difference in shifts between the reference and comparison shifts. The Pearson correlation coefficient (Larson and Marx, 1981) and the average error measure overall chemical shift difference. Here the average error is the sum of the absolute values of all chemical shift differences (in p.p.m.) divided by the number of compared atoms.
{"title":"GSC: a graphical program for NMR chemical shift comparison.","authors":"W Gronwald,&nbsp;R F Boyko,&nbsp;B D Sykes","doi":"10.1093/bioinformatics/13.5.557","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.557","url":null,"abstract":"Comparison of NMR chemical shift files can provide the researcher with valuable insights. For example, a comparison of NMR chemical shifts obtained for the same protein at various temperatures can provide insights into the protein's thermal stability (Roder, 1989). Also, shift comparisons of a large protein subjected to various detergents can show whether protein tertiary structure is disrupted (Mutter and Altmann, 1985), or the detergent may have an effect on stabilizing the secondary structure in a small protein (Dyson etai, 1988). Chemical shift changes can verify metal bindings in proteins (Shaw et al., 1990) and it can be insightful to compare the overall effect on chemical shifts from protein mutations (Rajarathnam et al., 1994). Chemical shift comparison of two or more similar proteins can identify regions where the global fold and secondary structure are preserved, and where changes can be expected (Kim et al., 1994). Despite its useful nature, chemical shift comparison can be very tedious and error prone. The GSC program has been developed to make it easy to analyze chemical shifts of selected atoms between chemical shift files. We have used GSC with great success to compare predicted chemical shifts with observed ones to gain insight into the accuracy of our new NMR chemical shift prediction program ORB (Gronwald etai, 1997).Figure 1 shows a main screen snapshot of GSC. The program is started by designating one set of shifts as the 'reference shifts' and either one or two other shift sets as 'comparison shifts'. Once the user selects the atom type (e.g. HN, Ha, Ca, etc.) and the region of the sequences to study, the program draws line graph(s) depicting the difference in shifts between the reference and comparison shifts. The Pearson correlation coefficient (Larson and Marx, 1981) and the average error measure overall chemical shift difference. Here the average error is the sum of the absolute values of all chemical shift differences (in p.p.m.) divided by the number of compared atoms.","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.557","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PSeq-Gen: an application for the Monte Carlo simulation of protein sequence evolution along phylogenetic trees. PSeq-Gen:一个应用程序,用于沿着系统发育树的蛋白质序列进化的蒙特卡罗模拟。
Pub Date : 1997-10-01 DOI: 10.1093/bioinformatics/13.5.559
N C Grassly, J Adachi, A Rambaut
PSeq-Gen will simulate the evolution of protein sequences along evolutionary trees following the procedures previously reported for the DNA sequence simulator Seq-Gen (Sequence-Generator, Rambaut and Grassly, 1997). Statis-tics calculated from these sequences can be used to give expectations under specific null hypotheses of protein evolution. This Monte Carlo simulation approach to testing hypotheses is often termed 'parametric bootstrapping' (see, for example, Efron, 1985; Huelsenbeck et al., 19%; Huelsenbeck and Rannala, 1997), and has many powerful applications, such as testing the molecular clock (Goldman, 1993), detecting recombination (Grassly and Holmes, 1997), and evaluating competing phylogenetic hypotheses (Hillis et al., 19%). Three are mitochondrial each
{"title":"PSeq-Gen: an application for the Monte Carlo simulation of protein sequence evolution along phylogenetic trees.","authors":"N C Grassly,&nbsp;J Adachi,&nbsp;A Rambaut","doi":"10.1093/bioinformatics/13.5.559","DOIUrl":"https://doi.org/10.1093/bioinformatics/13.5.559","url":null,"abstract":"PSeq-Gen will simulate the evolution of protein sequences along evolutionary trees following the procedures previously reported for the DNA sequence simulator Seq-Gen (Sequence-Generator, Rambaut and Grassly, 1997). Statis-tics calculated from these sequences can be used to give expectations under specific null hypotheses of protein evolution. This Monte Carlo simulation approach to testing hypotheses is often termed 'parametric bootstrapping' (see, for example, Efron, 1985; Huelsenbeck et al., 19%; Huelsenbeck and Rannala, 1997), and has many powerful applications, such as testing the molecular clock (Goldman, 1993), detecting recombination (Grassly and Holmes, 1997), and evaluating competing phylogenetic hypotheses (Hillis et al., 19%). Three are mitochondrial each","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.5.559","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20296465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 49
期刊
Computer applications in the biosciences : CABIOS
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1