首页 > 最新文献

Current protocols in bioinformatics最新文献

英文 中文
Using pLink to Analyze Cross-Linked Peptides 使用pLink分析交联肽
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-02-15 DOI: 10.1002/0471250953.bi0821s49
Sheng-Bo Fan, Jia-Ming Meng, Shan Lu, Kun Zhang, Hao Yang, Hao Chi, Rui-Xiang Sun, Meng-Qiu Dong, Si-Min He

pLink is a search engine for high-throughput identification of cross-linked peptides from their tandem mass spectra, which is the data-analysis step in chemical cross-linking of proteins coupled with mass spectrometry analysis. pLink has accumulated more than 200 registered users from all over the world since its first release in 2012. After 2 years of continual development, a new version of pLink has been released, which is at least 40 times faster, more versatile, and more user-friendly. Also, the function of the new pLink has been expanded to identifying endogenous protein cross-linking sites such as disulfide bonds and SUMO (Small Ubiquitin-like MOdifier) modification sites. Integrated into the new version are two accessory tools: pLabel, to annotate spectra of cross-linked peptides for visual inspection and publication, and pConfig, to assist users in setting up search parameters. Here, we provide detailed guidance on running a database search for identification of protein cross-links using the 2014 version of pLink. © 2015 by John Wiley & Sons, Inc.

pLink是一个搜索引擎,用于从串联质谱中高通量鉴定交联肽,这是蛋白质化学交联与质谱分析的数据分析步骤。自2012年首次发布以来,pLink已经积累了来自世界各地的200多名注册用户。经过2年的持续开发,pLink的新版本已经发布,它的速度至少快了40倍,功能更强大,用户界面更友好。此外,新pLink的功能已经扩展到识别内源性蛋白质交联位点,如二硫键和SUMO(小泛素样修饰物)修饰位点。新版本中集成了两个辅助工具:pLabel,用于注释交联肽的光谱,用于视觉检查和出版,pConfig,用于帮助用户设置搜索参数。在这里,我们提供了使用2014版pLink运行数据库搜索以鉴定蛋白质交联的详细指导。©2015 by John Wiley &儿子,Inc。
{"title":"Using pLink to Analyze Cross-Linked Peptides","authors":"Sheng-Bo Fan,&nbsp;Jia-Ming Meng,&nbsp;Shan Lu,&nbsp;Kun Zhang,&nbsp;Hao Yang,&nbsp;Hao Chi,&nbsp;Rui-Xiang Sun,&nbsp;Meng-Qiu Dong,&nbsp;Si-Min He","doi":"10.1002/0471250953.bi0821s49","DOIUrl":"10.1002/0471250953.bi0821s49","url":null,"abstract":"<p>pLink is a search engine for high-throughput identification of cross-linked peptides from their tandem mass spectra, which is the data-analysis step in chemical cross-linking of proteins coupled with mass spectrometry analysis. pLink has accumulated more than 200 registered users from all over the world since its first release in 2012. After 2 years of continual development, a new version of pLink has been released, which is at least 40 times faster, more versatile, and more user-friendly. Also, the function of the new pLink has been expanded to identifying endogenous protein cross-linking sites such as disulfide bonds and SUMO (Small Ubiquitin-like MOdifier) modification sites. Integrated into the new version are two accessory tools: pLabel, to annotate spectra of cross-linked peptides for visual inspection and publication, and pConfig, to assist users in setting up search parameters. Here, we provide detailed guidance on running a database search for identification of protein cross-links using the 2014 version of pLink. © 2015 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"49 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/0471250953.bi0821s49","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32997649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Investigating Protein Structure and Evolution with SCOP2 利用SCOP2研究蛋白质结构和进化
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-02-15 DOI: 10.1002/0471250953.bi0126s49
Antonina Andreeva, Dave Howorth, Cyrus Chothia, Eugene Kulesha, Alexey G. Murzin

SCOP2 is a successor to the Structural Classification of Proteins (SCOP) database that organizes proteins of known structure according to their structural and evolutionary relationships. It was designed to provide a more advanced framework for the classification of proteins. The SCOP2 classification is described in terms of a directed acyclic graph in which each node defines a relationship of particular type that is represented by a region of protein structure and sequence. The SCOP2 data are accessible via SCOP2-Browser and SCOP2-Graph. This protocol unit describes different ways to explore and investigate the SCOP2 evolutionary and structural groupings. © 2015 by John Wiley & Sons, Inc.

SCOP2是蛋白质结构分类(SCOP)数据库的后继版本,该数据库根据结构和进化关系对已知结构的蛋白质进行组织。它的设计目的是为蛋白质分类提供一个更高级的框架。SCOP2分类是根据一个有向无环图来描述的,其中每个节点定义了一个特定类型的关系,该关系由一个蛋白质结构和序列区域表示。SCOP2数据可以通过SCOP2- browser和SCOP2- graph访问。本协议单元描述了探索和研究SCOP2进化和结构分组的不同方法。©2015 by John Wiley &儿子,Inc。
{"title":"Investigating Protein Structure and Evolution with SCOP2","authors":"Antonina Andreeva,&nbsp;Dave Howorth,&nbsp;Cyrus Chothia,&nbsp;Eugene Kulesha,&nbsp;Alexey G. Murzin","doi":"10.1002/0471250953.bi0126s49","DOIUrl":"10.1002/0471250953.bi0126s49","url":null,"abstract":"<p>SCOP2 is a successor to the Structural Classification of Proteins (SCOP) database that organizes proteins of known structure according to their structural and evolutionary relationships. It was designed to provide a more advanced framework for the classification of proteins. The SCOP2 classification is described in terms of a directed acyclic graph in which each node defines a relationship of particular type that is represented by a region of protein structure and sequence. The SCOP2 data are accessible via SCOP2-Browser and SCOP2-Graph. This protocol unit describes different ways to explore and investigate the SCOP2 evolutionary and structural groupings. © 2015 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"49 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/0471250953.bi0126s49","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32992293","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Expression Data Analysis with Reactome Reactome表达数据分析
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-02-15 DOI: 10.1002/0471250953.bi0820s49
Steve Jupe, Antonio Fabregat, Henning Hermjakob

The Reactome database of curated biological pathways provides a tool for visualizing user-supplied expression data as an overlay on pathway diagrams, thereby affording an effective means to examine expression of the constituents of the pathway and determine whether all that are necessary are present. Several experiments can be visualized in succession, to determine whether expression changes with experimental conditions, a useful feature for examining a time-course, dose-response, or disease progression. © 2015 by John Wiley & Sons, Inc.

Reactome数据库提供了一种工具,可以将用户提供的表达数据可视化,作为通路图的覆盖层,从而提供了一种有效的方法来检查通路成分的表达,并确定是否所有必要的成分都存在。几个实验可以连续可视化,以确定表达是否随实验条件而变化,这是检查时间过程、剂量反应或疾病进展的有用特征。©2015 by John Wiley &儿子,Inc。
{"title":"Expression Data Analysis with Reactome","authors":"Steve Jupe,&nbsp;Antonio Fabregat,&nbsp;Henning Hermjakob","doi":"10.1002/0471250953.bi0820s49","DOIUrl":"10.1002/0471250953.bi0820s49","url":null,"abstract":"<p>The Reactome database of curated biological pathways provides a tool for visualizing user-supplied expression data as an overlay on pathway diagrams, thereby affording an effective means to examine expression of the constituents of the pathway and determine whether all that are necessary are present. Several experiments can be visualized in succession, to determine whether expression changes with experimental conditions, a useful feature for examining a time-course, dose-response, or disease progression. © 2015 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"49 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/0471250953.bi0820s49","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32997648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Using REDItools to Detect RNA Editing Events in NGS Datasets 使用redittools检测NGS数据集中的RNA编辑事件
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-02-15 DOI: 10.1002/0471250953.bi1212s49
Ernesto Picardi, Anna Maria D'Erchia, Antonio Montalvo, Graziano Pesole

RNA editing is a post-transcriptional/co-transcriptional molecular phenomenon whereby a genetic message is modified from the corresponding DNA template by means of substitutions, insertions, and/or deletions. It occurs in a variety of organisms and different cellular locations through evolutionally and biochemically unrelated proteins. RNA editing has a plethora of biological effects including the modulation of alternative splicing and fine-tuning of gene expression. RNA editing events by base substitutions can be detected on a genomic scale by NGS technologies through the REDItools package, an ad hoc suite of Python scripts to study RNA editing using RNA-Seq and DNA-Seq data or RNA-Seq data alone. REDItools implement effective filters to minimize biases due to sequencing errors, mapping errors, and SNPs. The package is freely available at Google Code repository (http://code.google.com/p/reditools/) and released under the MIT license. In the present unit we show three basic protocols corresponding to three main REDItools scripts. © 2015 by John Wiley & Sons, Inc.

RNA编辑是一种转录后/共转录的分子现象,通过替换、插入和/或删除的方式,从相应的DNA模板修改遗传信息。它通过进化和生物化学不相关的蛋白质发生在各种生物体和不同的细胞位置。RNA编辑具有多种生物学效应,包括选择性剪接的调节和基因表达的微调。通过REDItools包,NGS技术可以在基因组尺度上检测碱基替换的RNA编辑事件,REDItools包是一个专门的Python脚本套件,用于使用RNA- seq和DNA-Seq数据或单独使用RNA- seq数据研究RNA编辑。REDItools实现了有效的过滤器,以最大限度地减少由于测序错误、映射错误和snp造成的偏差。该软件包可在Google Code存储库(http://code.google.com/p/reditools/)免费获得,并根据MIT许可发布。在本单元中,我们将展示对应于三个主要REDItools脚本的三个基本协议。©2015 by John Wiley &儿子,Inc。
{"title":"Using REDItools to Detect RNA Editing Events in NGS Datasets","authors":"Ernesto Picardi,&nbsp;Anna Maria D'Erchia,&nbsp;Antonio Montalvo,&nbsp;Graziano Pesole","doi":"10.1002/0471250953.bi1212s49","DOIUrl":"10.1002/0471250953.bi1212s49","url":null,"abstract":"<p>RNA editing is a post-transcriptional/co-transcriptional molecular phenomenon whereby a genetic message is modified from the corresponding DNA template by means of substitutions, insertions, and/or deletions. It occurs in a variety of organisms and different cellular locations through evolutionally and biochemically unrelated proteins. RNA editing has a plethora of biological effects including the modulation of alternative splicing and fine-tuning of gene expression. RNA editing events by base substitutions can be detected on a genomic scale by NGS technologies through the REDItools package, an ad hoc suite of Python scripts to study RNA editing using RNA-Seq and DNA-Seq data or RNA-Seq data alone. REDItools implement effective filters to minimize biases due to sequencing errors, mapping errors, and SNPs. The package is freely available at Google Code repository (http://code.google.com/p/reditools/) and released under the MIT license. In the present unit we show three basic protocols corresponding to three main REDItools scripts. © 2015 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"49 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/0471250953.bi1212s49","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32992294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Scoring Large-Scale Affinity Purification Mass Spectrometry Datasets with MiST 用MiST评分大规模亲和纯化质谱数据集
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-02-15 DOI: 10.1002/0471250953.bi0819s49
Erik Verschueren, John Von Dollen, Peter Cimermancic, Natali Gulbahce, Andrej Sali, Nevan J. Krogan

High-throughput Affinity Purification Mass Spectrometry (AP-MS) experiments can identify a large number of protein interactions, but only a fraction of these interactions are biologically relevant. Here, we describe a comprehensive computational strategy to process raw AP-MS data, perform quality controls, and prioritize biologically relevant bait-prey pairs in a set of replicated AP-MS experiments with Mass spectrometry interaction STatistics (MiST). The MiST score is a linear combination of prey quantity (abundance), abundance invariability across repeated experiments (reproducibility), and prey uniqueness relative to other baits (specificity). We describe how to run the full MiST analysis pipeline in an R environment and discuss a number of configurable options that allow the lay user to convert any large-scale AP-MS data into an interpretable, biologically relevant protein-protein interaction network. © 2015 by John Wiley & Sons, Inc.

高通量亲和纯化质谱(AP-MS)实验可以鉴定大量蛋白质相互作用,但这些相互作用中只有一小部分具有生物学相关性。在这里,我们描述了一种综合的计算策略来处理原始AP-MS数据,执行质量控制,并在一组使用质谱相互作用统计(MiST)的重复AP-MS实验中优先考虑生物学相关的诱饵-猎物对。MiST分数是猎物数量(丰度)、重复实验中丰度不变性(再现性)和猎物相对于其他诱饵的独特性(特异性)的线性组合。我们描述了如何在R环境中运行完整的MiST分析管道,并讨论了一些可配置的选项,这些选项允许外行用户将任何大规模AP-MS数据转换为可解释的、生物相关的蛋白质-蛋白质相互作用网络。©2015 by John Wiley &儿子,Inc。
{"title":"Scoring Large-Scale Affinity Purification Mass Spectrometry Datasets with MiST","authors":"Erik Verschueren,&nbsp;John Von Dollen,&nbsp;Peter Cimermancic,&nbsp;Natali Gulbahce,&nbsp;Andrej Sali,&nbsp;Nevan J. Krogan","doi":"10.1002/0471250953.bi0819s49","DOIUrl":"10.1002/0471250953.bi0819s49","url":null,"abstract":"<p>High-throughput Affinity Purification Mass Spectrometry (AP-MS) experiments can identify a large number of protein interactions, but only a fraction of these interactions are biologically relevant. Here, we describe a comprehensive computational strategy to process raw AP-MS data, perform quality controls, and prioritize biologically relevant bait-prey pairs in a set of replicated AP-MS experiments with Mass spectrometry interaction STatistics (MiST). The MiST score is a linear combination of prey quantity (abundance), abundance invariability across repeated experiments (reproducibility), and prey uniqueness relative to other baits (specificity). We describe how to run the full MiST analysis pipeline in an R environment and discuss a number of configurable options that allow the lay user to convert any large-scale AP-MS data into an interpretable, biologically relevant protein-protein interaction network. © 2015 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"49 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/0471250953.bi0819s49","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32997647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 57
Data Analysis Pipeline for RNA-seq Experiments: From Differential Expression to Cryptic Splicing RNA-seq实验的数据分析管道:从差异表达到隐剪接
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-02-13 DOI: 10.1002/cpbi.33
Hari Krishna Yalamanchili, Ying-Wooi Wan, Zhandong Liu

RNA sequencing (RNA-seq) is a high-throughput technology that provides unique insights into the transcriptome. It has a wide variety of applications in quantifying genes/isoforms and in detecting non-coding RNA, alternative splicing, and splice junctions. It is extremely important to comprehend the entire transcriptome for a thorough understanding of the cellular system. Several RNA-seq analysis pipelines have been proposed to date. However, no single analysis pipeline can capture dynamics of the entire transcriptome. Here, we compile and present a robust and commonly used analytical pipeline covering the entire spectrum of transcriptome analysis, including quality checks, alignment of reads, differential gene/transcript expression analysis, discovery of cryptic splicing events, and visualization. Challenges, critical parameters, and possible downstream functional analysis pipelines associated with each step are highlighted and discussed. This unit provides a comprehensive understanding of state-of-the-art RNA-seq analysis pipeline and a greater understanding of the transcriptome. © 2017 by John Wiley & Sons, Inc.

RNA测序(RNA-seq)是一种高通量技术,可提供对转录组的独特见解。它在定量基因/同种异构体和检测非编码RNA、选择性剪接和剪接连接方面具有广泛的应用。了解整个转录组对于彻底了解细胞系统是非常重要的。迄今为止,已经提出了几种RNA-seq分析管道。然而,没有单一的分析管道可以捕获整个转录组的动态。在这里,我们编译并展示了一个强大的、常用的分析管道,涵盖了转录组分析的整个范围,包括质量检查、reads比对、差异基因/转录物表达分析、隐剪接事件的发现和可视化。强调并讨论了与每个步骤相关的挑战、关键参数和可能的下游功能分析管道。本单元提供了对最先进的RNA-seq分析管道的全面了解和对转录组的更深入了解。©2017 by John Wiley &儿子,Inc。
{"title":"Data Analysis Pipeline for RNA-seq Experiments: From Differential Expression to Cryptic Splicing","authors":"Hari Krishna Yalamanchili,&nbsp;Ying-Wooi Wan,&nbsp;Zhandong Liu","doi":"10.1002/cpbi.33","DOIUrl":"10.1002/cpbi.33","url":null,"abstract":"<p>RNA sequencing (RNA-seq) is a high-throughput technology that provides unique insights into the transcriptome. It has a wide variety of applications in quantifying genes/isoforms and in detecting non-coding RNA, alternative splicing, and splice junctions. It is extremely important to comprehend the entire transcriptome for a thorough understanding of the cellular system. Several RNA-seq analysis pipelines have been proposed to date. However, no single analysis pipeline can capture dynamics of the entire transcriptome. Here, we compile and present a robust and commonly used analytical pipeline covering the entire spectrum of transcriptome analysis, including quality checks, alignment of reads, differential gene/transcript expression analysis, discovery of cryptic splicing events, and visualization. Challenges, critical parameters, and possible downstream functional analysis pipelines associated with each step are highlighted and discussed. This unit provides a comprehensive understanding of state-of-the-art RNA-seq analysis pipeline and a greater understanding of the transcriptome. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"59 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.33","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35402458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 35
Using the Contextual Hub Analysis Tool (CHAT) in Cytoscape to Identify Contextually Relevant Network Hubs 在Cytoscape中使用上下文枢纽分析工具(CHAT)来识别上下文相关的网络枢纽
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-02-13 DOI: 10.1002/cpbi.35
Tanja Muetze, David J. Lynn

Highly connected nodes in biological networks are called network hubs. Hubs are topologically important to the structure of the network and have been shown to be preferentially associated with a range of phenotypes of interest. The relative importance of a hub node, however, can change depending on the biological context. Here, we provide a step-by-step protocol for using the Contextual Hub Analysis Tool (CHAT), an application within Cytoscape 3, which enables users to easily construct and visualize a network of interactions from a gene or protein list of interest, integrate contextual information, such as gene or protein expression data, and identify hub nodes that are more highly connected to contextual nodes than expected by chance. © 2017 by John Wiley & Sons, Inc.

生物网络中高度连接的节点被称为网络集线器。枢纽在拓扑结构上对网络结构很重要,并且已被证明优先与一系列感兴趣的表型相关。然而,枢纽节点的相对重要性可以根据生物环境而改变。在这里,我们提供了一个使用上下文枢纽分析工具(CHAT)的分步协议,这是Cytoscape 3中的一个应用程序,它使用户能够轻松地从感兴趣的基因或蛋白质列表中构建和可视化交互网络,整合上下文信息,如基因或蛋白质表达数据,并识别与上下文节点的连接程度比预期的要高的枢纽节点。©2017 by John Wiley &儿子,Inc。
{"title":"Using the Contextual Hub Analysis Tool (CHAT) in Cytoscape to Identify Contextually Relevant Network Hubs","authors":"Tanja Muetze,&nbsp;David J. Lynn","doi":"10.1002/cpbi.35","DOIUrl":"10.1002/cpbi.35","url":null,"abstract":"<p>Highly connected nodes in biological networks are called network hubs. Hubs are topologically important to the structure of the network and have been shown to be preferentially associated with a range of phenotypes of interest. The relative importance of a hub node, however, can change depending on the biological context. Here, we provide a step-by-step protocol for using the Contextual Hub Analysis Tool (CHAT), an application within Cytoscape 3, which enables users to easily construct and visualize a network of interactions from a gene or protein list of interest, integrate contextual information, such as gene or protein expression data, and identify hub nodes that are more highly connected to contextual nodes than expected by chance. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"59 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.35","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35402461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Using SQL Databases for Sequence Similarity Searching and Analysis 基于SQL数据库的序列相似性搜索与分析
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-02-13 DOI: 10.1002/cpbi.32
William R. Pearson, Aaron J. Mackey

Relational databases can integrate diverse types of information and manage large sets of similarity search results, greatly simplifying genome-scale analyses. By focusing on taxonomic subsets of sequences, relational databases can reduce the size and redundancy of sequence libraries and improve the statistical significance of homologs. In addition, by loading similarity search results into a relational database, it becomes possible to explore and summarize the relationships between all of the proteins in an organism and those in other biological kingdoms. This unit describes how to use relational databases to improve the efficiency of sequence similarity searching and demonstrates various large-scale genomic analyses of homology-related data. It also describes the installation and use of a simple protein sequence database, seqdb_demo, which is used as a basis for the other protocols. The unit also introduces search_demo, a database that stores sequence similarity search results. The search_demo database is then used to explore the evolutionary relationships between E. coli proteins and proteins in other organisms in a large-scale comparative genomic analysis. © 2017 by John Wiley & Sons, Inc.

关系数据库可以集成不同类型的信息并管理大量相似搜索结果,极大地简化了基因组规模的分析。通过关注序列的分类子集,关系数据库可以减少序列库的大小和冗余,提高同源物的统计意义。此外,通过将相似性搜索结果加载到关系数据库中,可以探索和总结生物体中所有蛋白质与其他生物领域中的蛋白质之间的关系。本单元描述了如何使用关系数据库来提高序列相似性搜索的效率,并演示了同源性相关数据的各种大规模基因组分析。它还描述了一个简单的蛋白质序列数据库seqdb_demo的安装和使用,该数据库用作其他协议的基础。本单元还介绍了search_demo,这是一个存储序列相似性搜索结果的数据库。然后使用search_demo数据库在大规模比较基因组分析中探索大肠杆菌蛋白与其他生物体蛋白之间的进化关系。©2017 by John Wiley &儿子,Inc。
{"title":"Using SQL Databases for Sequence Similarity Searching and Analysis","authors":"William R. Pearson,&nbsp;Aaron J. Mackey","doi":"10.1002/cpbi.32","DOIUrl":"10.1002/cpbi.32","url":null,"abstract":"<p>Relational databases can integrate diverse types of information and manage large sets of similarity search results, greatly simplifying genome-scale analyses. By focusing on taxonomic subsets of sequences, relational databases can reduce the size and redundancy of sequence libraries and improve the statistical significance of homologs. In addition, by loading similarity search results into a relational database, it becomes possible to explore and summarize the relationships between all of the proteins in an organism and those in other biological kingdoms. This unit describes how to use relational databases to improve the efficiency of sequence similarity searching and demonstrates various large-scale genomic analyses of homology-related data. It also describes the installation and use of a simple protein sequence database, <span>seqdb_demo</span>, which is used as a basis for the other protocols. The unit also introduces <span>search_demo</span>, a database that stores sequence similarity search results. The <span>search_demo</span> database is then used to explore the evolutionary relationships between <i>E. coli</i> proteins and proteins in other organisms in a large-scale comparative genomic analysis. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"59 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.32","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35402459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Finding Homologs in Amino Acid Sequences Using Network BLAST Searches 利用网络BLAST搜索在氨基酸序列中寻找同源物
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-02-13 DOI: 10.1002/cpbi.34
Istvan Ladunga

BLAST, the Basic Local Alignment Search Tool, is used more frequently than any other biosequence database search program. We show how to run searches on the Web, and demonstrate how to increase performance by fine-tuning arguments for a specific research project. We offer guidance for interpreting results, statistical significance and biological relevance issues, and suggest complementary analyses. This unit covers both protein-to-protein (blastp) searches and translated searches (blastx, tblastn, tfastx). blastx conceptually translates the query sequence and tblastn translates all nucleotide sequences in a database, while tblastx translates both the query and the database sequences into amino acid sequences. © 2017 by John Wiley & Sons, Inc.

BLAST,基本本地比对搜索工具,比任何其他生物序列数据库搜索程序使用更频繁。我们将展示如何在Web上运行搜索,并演示如何通过微调特定研究项目的参数来提高性能。我们为解释结果、统计显著性和生物学相关性问题提供指导,并建议进行补充分析。本单元涵盖蛋白质对蛋白质(blastp)搜索和翻译搜索(blastx, tblastn, tfastx)。Blastx从概念上翻译查询序列,tblastn翻译数据库中的所有核苷酸序列,而tblastx将查询序列和数据库序列翻译成氨基酸序列。©2017 by John Wiley &儿子,Inc。
{"title":"Finding Homologs in Amino Acid Sequences Using Network BLAST Searches","authors":"Istvan Ladunga","doi":"10.1002/cpbi.34","DOIUrl":"10.1002/cpbi.34","url":null,"abstract":"<p>BLAST, the Basic Local Alignment Search Tool, is used more frequently than any other biosequence database search program. We show how to run searches on the Web, and demonstrate how to increase performance by fine-tuning arguments for a specific research project. We offer guidance for interpreting results, statistical significance and biological relevance issues, and suggest complementary analyses. This unit covers both protein-to-protein (<i>blastp</i>) searches and translated searches (<i>blastx, tblastn, tfastx)</i>. <i>blastx</i> conceptually translates the query sequence and <i>tblastn</i> translates all nucleotide sequences in a database, while <i>tblastx</i> translates both the query and the database sequences into amino acid sequences. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"59 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.34","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35507620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Using ProteomeScout: A Resource of Post-Translational Modifications, Their Experiments, and the Proteins That They Annotate 使用ProteomeScout:翻译后修饰的资源,他们的实验,和他们注释的蛋白质
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-02-13 DOI: 10.1002/cpbi.31
Arshag D. Mooradian, Jason M. Held, Kristen M. Naegle

Post-translational modifications (PTMs) of protein amino acids are ubiquitous and important to protein function, localization, degradation, and more. In recent years, there has been an explosion in the discovery of PTMs as a result of improvements in PTM measurement techniques, including quantitative measurements of PTMs across multiple conditions. ProteomeScout is a repository for such discovery and quantitative experiments and provides tools for visualizing PTMs within proteins, including where they are relative to other PTMS, domains, mutations, and structure. ProteomeScout additionally provides analysis tools for identifying statistically significant relationships in experimental datasets. This unit describes four basic protocols for working with the ProteomeScout Web interface or programmatically with the database download. © 2017 by John Wiley & Sons, Inc.

蛋白质氨基酸的翻译后修饰(PTMs)普遍存在,对蛋白质的功能、定位、降解等都很重要。近年来,由于PTM测量技术的改进,包括在多种条件下对PTM的定量测量,PTM的发现出现了爆炸式增长。ProteomeScout是此类发现和定量实验的存储库,并提供了可视化蛋白质内PTMs的工具,包括它们相对于其他PTMs,结构域,突变和结构的位置。ProteomeScout还提供了分析工具,用于识别实验数据集中的统计显著关系。本单元描述了使用ProteomeScout Web界面或以编程方式使用数据库下载的四种基本协议。©2017 by John Wiley &儿子,Inc。
{"title":"Using ProteomeScout: A Resource of Post-Translational Modifications, Their Experiments, and the Proteins That They Annotate","authors":"Arshag D. Mooradian,&nbsp;Jason M. Held,&nbsp;Kristen M. Naegle","doi":"10.1002/cpbi.31","DOIUrl":"10.1002/cpbi.31","url":null,"abstract":"<p>Post-translational modifications (PTMs) of protein amino acids are ubiquitous and important to protein function, localization, degradation, and more. In recent years, there has been an explosion in the discovery of PTMs as a result of improvements in PTM measurement techniques, including quantitative measurements of PTMs across multiple conditions. ProteomeScout is a repository for such discovery and quantitative experiments and provides tools for visualizing PTMs within proteins, including where they are relative to other PTMS, domains, mutations, and structure. ProteomeScout additionally provides analysis tools for identifying statistically significant relationships in experimental datasets. This unit describes four basic protocols for working with the ProteomeScout Web interface or programmatically with the database download. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"59 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.31","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35402460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
Current protocols in bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1