首页 > 最新文献

Current protocols in bioinformatics最新文献

英文 中文
Protein Sequence Analysis Using the MPI Bioinformatics Toolkit 使用MPI生物信息学工具包进行蛋白质序列分析
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-12-14 DOI: 10.1002/cpbi.108
Felix Gabler, Seung-Zin Nam, Sebastian Till, Milot Mirdita, Martin Steinegger, Johannes Söding, Andrei N. Lupas, Vikram Alva

The MPI Bioinformatics Toolkit (https://toolkit.tuebingen.mpg.de) provides interactive access to a wide range of the best-performing bioinformatics tools and databases, including the state-of-the-art protein sequence comparison methods HHblits and HHpred. The Toolkit currently includes 35 external and in-house tools, covering functionalities such as sequence similarity searching, prediction of sequence features, and sequence classification. Due to this breadth of functionality, the tight interconnection of its constituent tools, and its ease of use, the Toolkit has become an important resource for biomedical research and for teaching protein sequence analysis to students in the life sciences. In this article, we provide detailed information on utilizing the three most widely accessed tools within the Toolkit: HHpred for the detection of homologs, HHpred in conjunction with MODELLER for structure prediction and homology modeling, and CLANS for the visualization of relationships in large sequence datasets. © 2020 The Authors.

Basic Protocol 1: Sequence similarity searching using HHpred

Alternate Protocol: Pairwise sequence comparison using HHpred

Support Protocol: Building a custom multiple sequence alignment using PSI-BLAST and forwarding it as input to HHpred

Basic Protocol 2: Calculation of homology models using HHpred and MODELLER

Basic Protocol 3: Cluster analysis using CLANS

MPI生物信息学工具包(https://toolkit.tuebingen.mpg.de)提供了对各种最佳生物信息学工具和数据库的交互式访问,包括最先进的蛋白质序列比较方法HHblits和HHpred。该工具包目前包括35个外部和内部工具,涵盖了序列相似性搜索、序列特征预测和序列分类等功能。由于这种广泛的功能,其组成工具的紧密互连,以及其易用性,该工具包已成为生物医学研究和向生命科学学生教授蛋白质序列分析的重要资源。在本文中,我们详细介绍了如何利用该工具包中最广泛使用的三个工具:用于同源物检测的HHpred,用于结构预测和同源性建模的HHpred与modeler结合使用,以及用于大型序列数据集中关系可视化的CLANS。©2020作者。基本协议1:使用HHpredAlternate协议进行序列相似性搜索;使用HHpredSupport协议进行两两序列比对;使用PSI-BLAST构建自定义多序列比对并转发给HHpredBasic协议2:使用HHpred和MODELLERBasic协议计算同源模型3:使用CLANS进行聚类分析
{"title":"Protein Sequence Analysis Using the MPI Bioinformatics Toolkit","authors":"Felix Gabler,&nbsp;Seung-Zin Nam,&nbsp;Sebastian Till,&nbsp;Milot Mirdita,&nbsp;Martin Steinegger,&nbsp;Johannes Söding,&nbsp;Andrei N. Lupas,&nbsp;Vikram Alva","doi":"10.1002/cpbi.108","DOIUrl":"10.1002/cpbi.108","url":null,"abstract":"<p>The MPI Bioinformatics Toolkit (https://toolkit.tuebingen.mpg.de) provides interactive access to a wide range of the best-performing bioinformatics tools and databases, including the state-of-the-art protein sequence comparison methods HHblits and HHpred. The Toolkit currently includes 35 external and in-house tools, covering functionalities such as sequence similarity searching, prediction of sequence features, and sequence classification. Due to this breadth of functionality, the tight interconnection of its constituent tools, and its ease of use, the Toolkit has become an important resource for biomedical research and for teaching protein sequence analysis to students in the life sciences. In this article, we provide detailed information on utilizing the three most widely accessed tools within the Toolkit: HHpred for the detection of homologs, HHpred in conjunction with MODELLER for structure prediction and homology modeling, and CLANS for the visualization of relationships in large sequence datasets. © 2020 The Authors.</p><p><b>Basic Protocol 1</b>: Sequence similarity searching using HHpred</p><p><b>Alternate Protocol</b>: Pairwise sequence comparison using HHpred</p><p><b>Support Protocol</b>: Building a custom multiple sequence alignment using PSI-BLAST and forwarding it as input to HHpred</p><p><b>Basic Protocol 2</b>: Calculation of homology models using HHpred and MODELLER</p><p><b>Basic Protocol 3</b>: Cluster analysis using CLANS</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"72 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.108","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38719394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 352
Exploring Manually Curated Annotations of Intrinsically Disordered Proteins with DisProt 探索人工策划的注释内在无序的蛋白质与DisProt
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-10-05 DOI: 10.1002/cpbi.107
Federica Quaglia, András Hatos, Damiano Piovesan, Silvio C. E. Tosatto

DisProt is the major repository of manually curated data for intrinsically disordered proteins collected from the literature. Although lacking a stable tertiary structure under physiological conditions, intrinsically disordered proteins carry out a plethora of biological functions, some of them directly arising from their flexible nature. A growing number of scientific studies have been published during the last few decades in an effort to shed light on their unstructured state, their binding modes, and their functions. DisProt makes use of a team of expert biocurators to provide up-to-date annotations of intrinsically disordered proteins from the literature, making them available to the scientific community. Here we present a comprehensive description on how to use DisProt in different contexts and provide a detailed explanation of how to explore and interpret manually curated annotations of intrinsically disordered proteins. We describe how to search DisProt annotations, using both the web interface and the API for programmatic access. Finally, we explain how to visualize and interpret a DisProt entry, p53, a widely studied protein characterized by the presence of unstructured N-terminal and C-terminal regions. © 2020 Wiley Periodicals LLC.

Basic Protocol 1: Performing a search in DisProt

Support Protocol 1: Downloading options

Support Protocol 2: Programmatic access with DisProt REST API

Basic Protocol 2: Visualizing and interpreting DisProt entries: the p53 use case

Basic Protocol 3: Providing feedback and submitting new intrinsic disorder−related data

DisProt是从文献中收集的内在无序蛋白质的人工整理数据的主要存储库。尽管在生理条件下缺乏稳定的三级结构,但内在无序的蛋白质执行了大量的生物功能,其中一些功能直接源于它们的柔性性质。在过去的几十年里,越来越多的科学研究已经发表,以阐明它们的非结构化状态,它们的结合模式和它们的功能。DisProt利用一组专家生物馆长从文献中提供最新的内在无序蛋白质注释,使它们可供科学界使用。在这里,我们对如何在不同的环境中使用DisProt进行了全面的描述,并提供了如何探索和解释内在无序蛋白质的手动编辑注释的详细解释。我们描述了如何使用web界面和API进行编程访问来搜索DisProt注释。最后,我们解释了如何可视化和解释一个DisProt入口,p53,一个被广泛研究的蛋白质,其特征是存在非结构化的n端和c端区域。©2020 Wiley期刊有限责任公司基本协议1:在disprot中执行搜索支持协议1:下载选项支持协议2:使用DisProt REST api进行编程访问基本协议2:可视化和解释DisProt条目:p53用例基本协议3:提供反馈并提交新的内在无序相关数据
{"title":"Exploring Manually Curated Annotations of Intrinsically Disordered Proteins with DisProt","authors":"Federica Quaglia,&nbsp;András Hatos,&nbsp;Damiano Piovesan,&nbsp;Silvio C. E. Tosatto","doi":"10.1002/cpbi.107","DOIUrl":"10.1002/cpbi.107","url":null,"abstract":"<p>DisProt is the major repository of manually curated data for intrinsically disordered proteins collected from the literature. Although lacking a stable tertiary structure under physiological conditions, intrinsically disordered proteins carry out a plethora of biological functions, some of them directly arising from their flexible nature. A growing number of scientific studies have been published during the last few decades in an effort to shed light on their unstructured state, their binding modes, and their functions. DisProt makes use of a team of expert biocurators to provide up-to-date annotations of intrinsically disordered proteins from the literature, making them available to the scientific community. Here we present a comprehensive description on how to use DisProt in different contexts and provide a detailed explanation of how to explore and interpret manually curated annotations of intrinsically disordered proteins. We describe how to search DisProt annotations, using both the web interface and the API for programmatic access. Finally, we explain how to visualize and interpret a DisProt entry, p53, a widely studied protein characterized by the presence of unstructured N-terminal and C-terminal regions. © 2020 Wiley Periodicals LLC.</p><p><b>Basic Protocol 1</b>: Performing a search in DisProt</p><p><b>Support Protocol 1</b>: Downloading options</p><p><b>Support Protocol 2</b>: Programmatic access with DisProt REST API</p><p><b>Basic Protocol 2</b>: Visualizing and interpreting DisProt entries: the p53 use case</p><p><b>Basic Protocol 3</b>: Providing feedback and submitting new intrinsic disorder−related data</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"72 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.107","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38551117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Network Building with the Cytoscape BioGateway App Explained in Five Use Cases 用五个用例解释使用Cytoscape BioGateway应用程序构建网络
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-09-28 DOI: 10.1002/cpbi.106
Rafael Riudavets Puig, Stian Holmås, Vladimir Mironov, Martin Kuiper

The BioGateway App is a plugin for the Cytoscape network editor, allowing users to interactively build biological networks by querying the Biogateway Resource Description Framework (RDF) triple store. BioGateway contains information from several curated resources including UniProtKB, IntAct, Gene Ontology Annotations, various datasets containing transcription-factor regulatory relations to specific target genes, and more. The BioGateway App facilitates the step-by-step creation of complex SPARQL queries through an intuitive Graphical User Interface, allowing users to build and explore biological interaction networks to assess, among other things, gene regulatory relationships, gene ontology annotations, and protein-protein interactions. As the BioGateway information content is most abundant for human proteins and genes, this article describes the utility of the tool through a series of use cases on these human data, starting from the most basic levels and then detailing applications that address some of the rich complexity of the integrated data. Network refinement and display can be further optimized via the selection and filtering possibilities that the Cytoscape framework provides. The use cases also provide examples to explore network information in other species, as they become supported by BioGateway. © 2020 The Authors.

Basic Protocol 1: Introducing a node from the canvas

Basic Protocol 2: Introducing a node from the query builder

Basic Protocol 3: Exploring molecular relationships between diseases

Basic Protocol 4: Find proteins with protein kinase activity involved in a disease and explore the context around them

Basic Protocol 5: Exploring the potential downstream effects after targeted inhibition of proteins

Support Protocol: Installation of the BioGateway plugin through the Cytoscape App Manager and from source

bioggateway App是Cytoscape网络编辑器的插件,允许用户通过查询bioggateway资源描述框架(RDF)三重存储来交互式地构建生物网络。bigateway包含来自几个策划资源的信息,包括UniProtKB,完好无损,基因本体注释,各种包含转录因子调控关系的数据集,以特定的目标基因,等等。BioGateway应用程序通过直观的图形用户界面促进了复杂SPARQL查询的逐步创建,允许用户构建和探索生物相互作用网络,以评估基因调控关系,基因本体注释和蛋白质-蛋白质相互作用等。由于bigateway的信息内容最丰富的是人类蛋白质和基因,本文通过一系列关于这些人类数据的用例来描述该工具的实用性,从最基本的级别开始,然后详细介绍解决集成数据的一些丰富复杂性的应用程序。通过Cytoscape框架提供的选择和过滤功能,可以进一步优化网络的细化和显示。这些用例还为探索其他物种的网络信息提供了示例,因为它们得到了BioGateway的支持。©2020作者。基本协议1:从画布中引入节点基本协议2:从查询生成器中引入节点基本协议3:探索疾病之间的分子关系基本协议4:寻找与疾病相关的蛋白激酶活性蛋白并探索其周围的环境基本协议5:探索靶向抑制蛋白后的潜在下游效应支持协议:通过Cytoscape应用程序管理器安装BioGateway插件
{"title":"Network Building with the Cytoscape BioGateway App Explained in Five Use Cases","authors":"Rafael Riudavets Puig,&nbsp;Stian Holmås,&nbsp;Vladimir Mironov,&nbsp;Martin Kuiper","doi":"10.1002/cpbi.106","DOIUrl":"10.1002/cpbi.106","url":null,"abstract":"<p>The BioGateway App is a plugin for the Cytoscape network editor, allowing users to interactively build biological networks by querying the Biogateway Resource Description Framework (RDF) triple store. BioGateway contains information from several curated resources including UniProtKB, IntAct, Gene Ontology Annotations, various datasets containing transcription-factor regulatory relations to specific target genes, and more. The BioGateway App facilitates the step-by-step creation of complex SPARQL queries through an intuitive Graphical User Interface, allowing users to build and explore biological interaction networks to assess, among other things, gene regulatory relationships, gene ontology annotations, and protein-protein interactions. As the BioGateway information content is most abundant for human proteins and genes, this article describes the utility of the tool through a series of use cases on these human data, starting from the most basic levels and then detailing applications that address some of the rich complexity of the integrated data. Network refinement and display can be further optimized via the selection and filtering possibilities that the Cytoscape framework provides. The use cases also provide examples to explore network information in other species, as they become supported by BioGateway. © 2020 The Authors.</p><p><b>Basic Protocol 1</b>: Introducing a node from the canvas</p><p><b>Basic Protocol 2</b>: Introducing a node from the query builder</p><p><b>Basic Protocol 3</b>: Exploring molecular relationships between diseases</p><p><b>Basic Protocol 4</b>: Find proteins with protein kinase activity involved in a disease and explore the context around them</p><p><b>Basic Protocol 5</b>: Exploring the potential downstream effects after targeted inhibition of proteins</p><p><b>Support Protocol</b>: Installation of the BioGateway plugin through the Cytoscape App Manager and from source</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"72 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.106","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38431054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Expanding the Perseus Software for Omics Data Analysis With Custom Plugins 扩展珀尔修斯软件组学数据分析与自定义插件
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-09-15 DOI: 10.1002/cpbi.105
Sung-Huan Yu, Daniela Ferretti, Julia P. Schessner, Jan Daniel Rudolph, Georg H. H. Borner, Jürgen Cox

The Perseus software provides a comprehensive framework for the statistical analysis of large-scale quantitative proteomics data, also in combination with other omics dimensions. Rapid developments in proteomics technology and the ever-growing diversity of biological studies increasingly require the flexibility to incorporate computational methods designed by the user. Here, we present the new functionality of Perseus to integrate self-made plugins written in C#, R, or Python. The user-written codes will be fully integrated into the Perseus data analysis workflow as custom activities. This also makes language-specific R and Python libraries from CRAN (cran.r-project.org), Bioconductor (bioconductor.org), PyPI (pypi.org), and Anaconda (anaconda.org) accessible in Perseus. The different available approaches are explained in detail in this article. To facilitate the distribution of user-developed plugins among users, we have created a plugin repository for community sharing and filled it with the examples provided in this article and a collection of already existing and more extensive plugins. © 2020 The Authors.

Basic Protocol 1: Basic steps for R plugins

Support Protocol 1: R plugins with additional arguments

Basic Protocol 2: Basic steps for python plugins

Support Protocol 2: Python plugins with additional arguments

Basic Protocol 3: Basic steps and construction of C# plugins

Basic Protocol 4: Basic steps of construction and connection for R plugins with C# interface

Support Protocol 4: Advanced example of R Plugin with C# interface: UMAP

Basic Protocol 5: Basic steps of construction and connection for python plugins with C# interface

Support Protocol 5: Advanced example of python plugin with C# interface: UMAP

Support Protocol 6: A basic workflow for the analysis of label-free quantification proteomics data using perseus

Perseus软件为大规模定量蛋白质组学数据的统计分析提供了一个全面的框架,也与其他组学维度相结合。蛋白质组学技术的快速发展和生物研究的日益多样化越来越需要灵活地结合用户设计的计算方法。在这里,我们展示了Perseus的新功能,可以集成用c#、R或Python编写的自制插件。用户编写的代码将作为自定义活动完全集成到Perseus数据分析工作流中。这也使得来自CRAN (cran.r-project.org)、Bioconductor (biocondutor.org)、PyPI (pypi.org)和Anaconda (anaconda.org)的特定语言的R和Python库在Perseus中可以访问。本文将详细解释不同的可用方法。为了便于在用户之间分发用户开发的插件,我们创建了一个用于社区共享的插件存储库,并在其中填充了本文中提供的示例以及已经存在的更广泛的插件集合。©2020作者。基本协议1:R插件的基本步骤支持协议1:带附加参数的R插件基本协议2:python插件的基本步骤支持协议2:带附加参数的python插件基本协议3:c#插件的基本步骤和构造基本协议4:带c#接口的R插件的构造和连接的基本步骤支持协议4:带c#接口的R插件的高级示例:umap基本协议5:支持协议5:使用c#接口的python插件的高级示例:UMAPSupport协议6:使用perseus分析无标签定量蛋白质组学数据的基本工作流程
{"title":"Expanding the Perseus Software for Omics Data Analysis With Custom Plugins","authors":"Sung-Huan Yu,&nbsp;Daniela Ferretti,&nbsp;Julia P. Schessner,&nbsp;Jan Daniel Rudolph,&nbsp;Georg H. H. Borner,&nbsp;Jürgen Cox","doi":"10.1002/cpbi.105","DOIUrl":"10.1002/cpbi.105","url":null,"abstract":"<p>The Perseus software provides a comprehensive framework for the statistical analysis of large-scale quantitative proteomics data, also in combination with other omics dimensions. Rapid developments in proteomics technology and the ever-growing diversity of biological studies increasingly require the flexibility to incorporate computational methods designed by the user. Here, we present the new functionality of Perseus to integrate self-made plugins written in C#, R, or Python. The user-written codes will be fully integrated into the Perseus data analysis workflow as custom activities. This also makes language-specific R and Python libraries from CRAN (cran.r-project.org), Bioconductor (bioconductor.org), PyPI (pypi.org), and Anaconda (anaconda.org) accessible in Perseus. The different available approaches are explained in detail in this article. To facilitate the distribution of user-developed plugins among users, we have created a plugin repository for community sharing and filled it with the examples provided in this article and a collection of already existing and more extensive plugins. © 2020 The Authors.</p><p><b>Basic Protocol 1</b>: Basic steps for R plugins</p><p><b>Support Protocol 1</b>: R plugins with additional arguments</p><p><b>Basic Protocol 2</b>: Basic steps for python plugins</p><p><b>Support Protocol 2</b>: Python plugins with additional arguments</p><p><b>Basic Protocol 3</b>: Basic steps and construction of C# plugins</p><p><b>Basic Protocol 4</b>: Basic steps of construction and connection for R plugins with C# interface</p><p><b>Support Protocol 4</b>: Advanced example of R Plugin with C# interface: UMAP</p><p><b>Basic Protocol 5</b>: Basic steps of construction and connection for python plugins with C# interface</p><p><b>Support Protocol 5</b>: Advanced example of python plugin with C# interface: UMAP</p><p><b>Support Protocol 6</b>: A basic workflow for the analysis of label-free quantification proteomics data using perseus</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"71 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.105","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38383053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Exploring Non-Coding RNAs in RNAcentral 探索rnaccentral中的非编码rna
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-08-26 DOI: 10.1002/cpbi.104
Blake A. Sweeney, Arina A. Tagmazian, Carlos E. Ribas, Robert D. Finn, Alex Bateman, Anton I. Petrov

Non-coding RNAs are essential for all life and carry out a wide range of functions. Information about these molecules is distributed across dozens of specialized resources. RNAcentral is a database of non-coding RNA sequences that provides a unified access point to non-coding RNA annotations from >40 member databases and helps provide insight into the function of these RNAs. This article describes different ways of accessing the data, including searching the website and retrieving the data programmatically over web APIs and a public database. We also demonstrate an example Galaxy workflow for using RNAcentral for RNA-seq differential expression analysis. RNAcentral is available at https://rnacentral.org. © 2020 The Authors.

Basic Protocol 1: Viewing RNAcentral sequence reports

Basic Protocol 2: Using RNAcentral text search to explore ncRNA sequences

Basic Protocol 3: Using RNAcentral sequence search

Basic Protocol 4: Using RNAcentral FTP archive

Support Protocol 1: Using web APIs for programmatic data access

Support Protocol 2: Using public Postgres database to export large datasets

Support Protocol 3: Analyze non-coding RNA in RNA-seq datasets using RNAcentral and Galaxy

非编码rna对所有生命都是必不可少的,并具有广泛的功能。关于这些分子的信息分布在几十个专门的资源中。RNAcentral是一个非编码RNA序列数据库,它为来自40个成员数据库的非编码RNA注释提供了一个统一的访问点,并有助于深入了解这些RNA的功能。本文描述了访问数据的不同方法,包括通过web api和公共数据库以编程方式搜索网站和检索数据。我们还演示了一个使用RNAcentral进行RNA-seq差异表达分析的示例Galaxy工作流。RNAcentral可在https://rnacentral.org上获得。©2020作者。基本协议1:查看RNAcentral序列报告基本协议2:使用RNAcentral文本搜索探索ncRNA序列基本协议3:使用RNAcentral文本搜索基本协议4:使用RNAcentral FTP归档支持协议1:使用web api进行程序化数据访问支持协议2:使用公共Postgres数据库导出大型数据集支持协议3:使用RNAcentral和Galaxy分析RNA-seq数据集中的非编码RNA
{"title":"Exploring Non-Coding RNAs in RNAcentral","authors":"Blake A. Sweeney,&nbsp;Arina A. Tagmazian,&nbsp;Carlos E. Ribas,&nbsp;Robert D. Finn,&nbsp;Alex Bateman,&nbsp;Anton I. Petrov","doi":"10.1002/cpbi.104","DOIUrl":"10.1002/cpbi.104","url":null,"abstract":"<p>Non-coding RNAs are essential for all life and carry out a wide range of functions. Information about these molecules is distributed across dozens of specialized resources. RNAcentral is a database of non-coding RNA sequences that provides a unified access point to non-coding RNA annotations from &gt;40 member databases and helps provide insight into the function of these RNAs. This article describes different ways of accessing the data, including searching the website and retrieving the data programmatically over web APIs and a public database. We also demonstrate an example Galaxy workflow for using RNAcentral for RNA-seq differential expression analysis. RNAcentral is available at https://rnacentral.org. © 2020 The Authors.</p><p><b>Basic Protocol 1</b>: Viewing RNAcentral sequence reports</p><p><b>Basic Protocol 2</b>: Using RNAcentral text search to explore ncRNA sequences</p><p><b>Basic Protocol 3</b>: Using RNAcentral sequence search</p><p><b>Basic Protocol 4</b>: Using RNAcentral FTP archive</p><p><b>Support Protocol 1</b>: Using web APIs for programmatic data access</p><p><b>Support Protocol 2</b>: Using public Postgres database to export large datasets</p><p><b>Support Protocol 3</b>: Analyze non-coding RNA in RNA-seq datasets using RNAcentral and Galaxy</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"71 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.104","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38405986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
How to Illuminate the Dark Proteome Using the Multi-omic OpenProt Resource 如何利用多基因组开放资源揭示黑暗蛋白质组
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-08-11 DOI: 10.1002/cpbi.103
Marie A. Brunet, Amina M. Lekehal, Xavier Roucou

Ten of thousands of open reading frames (ORFs) are hidden within genomes. These alternative ORFs, or small ORFs, have eluded annotations because they are either small or within unsuspected locations. They are found in untranslated regions or overlap a known coding sequence in messenger RNA and anywhere in a “non-coding” RNA. Serendipitous discoveries have highlighted these ORFs’ importance in biological functions and pathways. With their discovery came the need for deeper ORF annotation and large-scale mining of public repositories to gather supporting experimental evidence. OpenProt, accessible at https://openprot.org/, is the first proteogenomic resource enforcing a polycistronic model of annotation across an exhaustive transcriptome for 10 species. Moreover, OpenProt reports experimental evidence cumulated across a re-analysis of 114 mass spectrometry and 87 ribosome profiling datasets. The multi-omics OpenProt resource also includes the identification of predicted functional domains and evaluation of conservation for all predicted ORFs. The OpenProt web server provides two query interfaces and one genome browser. The query interfaces allow for exploration of the coding potential of genes or transcripts of interest as well as custom downloads of all information contained in OpenProt. © 2020 The Authors.

Basic Protocol 1: Using the Search interface

Basic Protocol 2: Using the Downloads interface

基因组中隐藏着成千上万个开放阅读框(orf)。这些备选orf或小型orf都避开了注释,因为它们要么很小,要么位于未知位置。它们存在于信使RNA的非翻译区或与已知编码序列重叠的地方,以及“非编码”RNA的任何地方。偶然的发现突出了这些orf在生物学功能和途径中的重要性。随着他们的发现,需要更深入的ORF注释和对公共存储库的大规模挖掘来收集支持性的实验证据。OpenProt,可访问https://openprot.org/,是第一个蛋白质基因组资源,在10个物种的详尽转录组中执行多顺反子注释模型。此外,OpenProt报告了通过114个质谱分析和87个核糖体分析数据集重新分析积累的实验证据。多组学OpenProt资源还包括预测功能域的鉴定和所有预测orf的保守性评估。OpenProt web服务器提供了两个查询接口和一个基因组浏览器。查询接口允许探索基因的编码潜力或感兴趣的转录本,以及自定义下载OpenProt中包含的所有信息。©2020作者。基本协议1:使用Search接口基本协议2:使用Downloads接口
{"title":"How to Illuminate the Dark Proteome Using the Multi-omic OpenProt Resource","authors":"Marie A. Brunet,&nbsp;Amina M. Lekehal,&nbsp;Xavier Roucou","doi":"10.1002/cpbi.103","DOIUrl":"10.1002/cpbi.103","url":null,"abstract":"<p>Ten of thousands of open reading frames (ORFs) are hidden within genomes. These alternative ORFs, or small ORFs, have eluded annotations because they are either small or within unsuspected locations. They are found in untranslated regions or overlap a known coding sequence in messenger RNA and anywhere in a “non-coding” RNA. Serendipitous discoveries have highlighted these ORFs’ importance in biological functions and pathways. With their discovery came the need for deeper ORF annotation and large-scale mining of public repositories to gather supporting experimental evidence. OpenProt, accessible at https://openprot.org/, is the first proteogenomic resource enforcing a polycistronic model of annotation across an exhaustive transcriptome for 10 species. Moreover, OpenProt reports experimental evidence cumulated across a re-analysis of 114 mass spectrometry and 87 ribosome profiling datasets. The multi-omics OpenProt resource also includes the identification of predicted functional domains and evaluation of conservation for all predicted ORFs. The OpenProt web server provides two query interfaces and one genome browser. The query interfaces allow for exploration of the coding potential of genes or transcripts of interest as well as custom downloads of all information contained in OpenProt. © 2020 The Authors.</p><p><b>Basic Protocol 1</b>: Using the Search interface</p><p><b>Basic Protocol 2</b>: Using the Downloads interface</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"71 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.103","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38260181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Using SPAdes De Novo Assembler 使用黑桃从头组装
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-06-19 DOI: 10.1002/cpbi.102
Andrey Prjibelski, Dmitry Antipov, Dmitry Meleshko, Alla Lapidus, Anton Korobeynikov

SPAdes—St. Petersburg genome Assembler—was originally developed for de novo assembly of genome sequencing data produced for cultivated microbial isolates and for single-cell genomic DNA sequencing. With time, the functionality of SPAdes was extended to enable assembly of IonTorrent data, as well as hybrid assembly from short and long reads (PacBio and Oxford Nanopore). In this article we present protocols for five different assembly pipelines that comprise the SPAdes package and that are used for assembly of metagenomes and transcriptomes as well as assembly of putative plasmids and biosynthetic gene clusters from whole-genome sequencing and metagenomic datasets. In addition, we present guidelines for understanding results with use cases for each pipeline, and several additional support protocols that help in using SPAdes properly. © 2020 Wiley Periodicals LLC.

Basic Protocol 1: Assembling isolate bacterial datasets

Basic Protocol 2: Assembling metagenomic datasets

Basic Protocol 3: Assembling sets of putative plasmids

Basic Protocol 4: Assembling transcriptomes

Basic Protocol 5: Assembling putative biosynthetic gene clusters

Support Protocol 1: Installing SPAdes

Support Protocol 2: Providing input via command line

Support Protocol 3: Providing input data via YAML format

Support Protocol 4: Restarting previous run

Support Protocol 5: Determining strand-specificity of RNA-seq data

SPAdes-St。彼得斯堡基因组组装器-最初开发的基因组测序数据的从头组装产生的培养微生物分离和单细胞基因组DNA测序。随着时间的推移,SPAdes的功能扩展到能够组装IonTorrent数据,以及从短读取和长读取(PacBio和Oxford Nanopore)混合组装。在本文中,我们介绍了五种不同的组装管道的协议,这些管道包括SPAdes包,用于组装宏基因组和转录组,以及组装来自全基因组测序和宏基因组数据集的推定质粒和生物合成基因簇。此外,我们还提供了一些指导方针,用于理解每个管道的用例结果,以及一些帮助正确使用SPAdes的附加支持协议。©2020 Wiley期刊有限公司基本协议1:组装分离细菌数据集基本协议2:组装宏基因组数据集基本协议3:组装推定质粒集基本协议4:组装转录组基本协议5:组装推定的生物合成基因集群支持协议1:安装spades支持协议2:通过命令行提供输入支持协议3:通过YAML格式提供输入数据支持协议4:重新启动以前的runSupport协议5:确定RNA-seq数据的链特异性
{"title":"Using SPAdes De Novo Assembler","authors":"Andrey Prjibelski,&nbsp;Dmitry Antipov,&nbsp;Dmitry Meleshko,&nbsp;Alla Lapidus,&nbsp;Anton Korobeynikov","doi":"10.1002/cpbi.102","DOIUrl":"10.1002/cpbi.102","url":null,"abstract":"<p>SPAdes—St. Petersburg genome Assembler—was originally developed for de novo assembly of genome sequencing data produced for cultivated microbial isolates and for single-cell genomic DNA sequencing. With time, the functionality of SPAdes was extended to enable assembly of IonTorrent data, as well as hybrid assembly from short and long reads (PacBio and Oxford Nanopore). In this article we present protocols for five different assembly pipelines that comprise the SPAdes package and that are used for assembly of metagenomes and transcriptomes as well as assembly of putative plasmids and biosynthetic gene clusters from whole-genome sequencing and metagenomic datasets. In addition, we present guidelines for understanding results with use cases for each pipeline, and several additional support protocols that help in using SPAdes properly. © 2020 Wiley Periodicals LLC.</p><p><b>Basic Protocol 1</b>: Assembling isolate bacterial datasets</p><p><b>Basic Protocol 2</b>: Assembling metagenomic datasets</p><p><b>Basic Protocol 3</b>: Assembling sets of putative plasmids</p><p><b>Basic Protocol 4</b>: Assembling transcriptomes</p><p><b>Basic Protocol 5</b>: Assembling putative biosynthetic gene clusters</p><p><b>Support Protocol 1</b>: Installing SPAdes</p><p><b>Support Protocol 2</b>: Providing input via command line</p><p><b>Support Protocol 3</b>: Providing input data via YAML format</p><p><b>Support Protocol 4</b>: Restarting previous run</p><p><b>Support Protocol 5</b>: Determining strand-specificity of RNA-seq data</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"70 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.102","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38062359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 804
iSwathX 2.0 for Processing DDA Spectral Libraries for DIA Data Analysis iSwathX 2.0用于DIA数据分析的DDA谱库处理
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-06-01 DOI: 10.1002/cpbi.101
Zainab Noor, Abidali Mohamedali, Shoba Ranganathan

The iSwathX web application processes and normalizes mass spectrometry−based proteomics spectral libraries generated in the data-dependent acquisition (DDA) approach. These libraries are stored in various proteomics repositories such as PeptideAtlas and NIST, or are user-generated and provide reference data for data-independent acquisition (DIA) targeted data extraction and analysis. iSwathX 2.0 can efficiently normalize DDA data from different instruments, gathered at different instances, and make it compatible with specific DIA experiments. Novel functions for parallel processing of DDA libraries and DIA report files, along with various data visualizations, are available in iSwathX 2.0. The step-by-step protocols provided here describe how the libraries are uploaded, processed, visualized, and downloaded using various modules of the application. They also provide detailed guidelines on the use of DIA report files for data analysis and visualization. © 2020 Wiley Periodicals LLC.

Basic Protocol 1: Processing, combining, and visualizing two DDA libraries

Basic Protocol 2: Parallel processing and combination of multiple DDA libraries

Basic Protocol 3: Downstream processing, comparison, and visualization of DIA report files

iSwathX web应用程序处理并规范化了数据依赖采集(DDA)方法中生成的基于质谱的蛋白质组学谱库。这些库存储在各种蛋白质组学存储库中,如PeptideAtlas和NIST,或者是用户生成的,并为数据独立采集(DIA)目标数据提取和分析提供参考数据。iSwathX 2.0可以有效地规范化来自不同仪器、不同实例采集的DDA数据,并使其与特定的DIA实验兼容。iSwathX 2.0中提供了用于并行处理DDA库和DIA报告文件以及各种数据可视化的新功能。这里提供的分步协议描述了如何使用应用程序的各个模块上传、处理、可视化和下载库。它们还提供了使用DIA报告文件进行数据分析和可视化的详细指南。©2020 Wiley Periodicals llc .基本协议1:两个DDA库的处理、组合和可视化基本协议2:多个DDA库的并行处理和组合基本协议3:DIA报告文件的下游处理、比较和可视化
{"title":"iSwathX 2.0 for Processing DDA Spectral Libraries for DIA Data Analysis","authors":"Zainab Noor,&nbsp;Abidali Mohamedali,&nbsp;Shoba Ranganathan","doi":"10.1002/cpbi.101","DOIUrl":"10.1002/cpbi.101","url":null,"abstract":"<p>The iSwathX web application processes and normalizes mass spectrometry−based proteomics spectral libraries generated in the data-dependent acquisition (DDA) approach. These libraries are stored in various proteomics repositories such as PeptideAtlas and NIST, or are user-generated and provide reference data for data-independent acquisition (DIA) targeted data extraction and analysis. iSwathX 2.0 can efficiently normalize DDA data from different instruments, gathered at different instances, and make it compatible with specific DIA experiments. Novel functions for parallel processing of DDA libraries and DIA report files, along with various data visualizations, are available in iSwathX 2.0. The step-by-step protocols provided here describe how the libraries are uploaded, processed, visualized, and downloaded using various modules of the application. They also provide detailed guidelines on the use of DIA report files for data analysis and visualization. © 2020 Wiley Periodicals LLC.</p><p><b>Basic Protocol 1</b>: Processing, combining, and visualizing two DDA libraries</p><p><b>Basic Protocol 2</b>: Parallel processing and combination of multiple DDA libraries</p><p><b>Basic Protocol 3</b>: Downstream processing, comparison, and visualization of DIA report files</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"70 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.101","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37993966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
QIIME 2 Enables Comprehensive End-to-End Analysis of Diverse Microbiome Data and Comparative Studies with Publicly Available Data QIIME 2能够对不同微生物组数据进行全面的端到端分析,并与公开数据进行比较研究。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-04-28 DOI: 10.1002/cpbi.100
Mehrbod Estaki, Lingjing Jiang, Nicholas A. Bokulich, Daniel McDonald, Antonio González, Tomasz Kosciolek, Cameron Martino, Qiyun Zhu, Amanda Birmingham, Yoshiki Vázquez-Baeza, Matthew R. Dillon, Evan Bolyen, J. Gregory Caporaso, Rob Knight

QIIME 2 is a completely re-engineered microbiome bioinformatics platform based on the popular QIIME platform, which it has replaced. QIIME 2 facilitates comprehensive and fully reproducible microbiome data science, improving accessibility to diverse users by adding multiple user interfaces. QIIME 2 can be combined with Qiita, an open-source web-based platform, to re-use available data for meta-analysis. The following basic protocol describes how to install QIIME 2 on a single computer and analyze microbiome sequence data, from processing of raw DNA sequence reads through generating publishable interactive figures. These interactive figures allow readers of a study to interact with data with the same ease as its authors, advancing microbiome science transparency and reproducibility. We also show how plug-ins developed by the community to add analysis capabilities can be installed and used with QIIME 2, enhancing various aspects of microbiome analyses—e.g., improving taxonomic classification accuracy. Finally, we illustrate how users can perform meta-analyses combining different datasets using readily available public data through Qiita. In this tutorial, we analyze a subset of the Early Childhood Antibiotics and the Microbiome (ECAM) study, which tracked the microbiome composition and development of 43 infants in the United States from birth to 2 years of age, identifying microbiome associations with antibiotic exposure, delivery mode, and diet. For more information about QIIME 2, see https://qiime2.org. To troubleshoot or ask questions about QIIME 2 and microbiome analysis, join the active community at https://forum.qiime2.org. © 2020 The Authors.

Basic Protocol: Using QIIME 2 with microbiome data

Support Protocol: Further microbiome analyses

QIIME 2是一个完全重新设计的微生物组生物信息学平台,基于流行的QIIME平台,它已经取代了QIIME。QIIME 2促进了全面和完全可复制的微生物组数据科学,通过添加多个用户界面提高了对不同用户的可访问性。QIIME 2可以与Qiita(一个基于web的开源平台)结合,重新使用可用数据进行元分析。下面的基本协议描述了如何在单个计算机上安装QIIME 2并分析微生物组序列数据,从处理原始DNA序列读取到生成可发布的交互式数据。这些交互式数据使研究的读者可以像作者一样轻松地与数据交互,从而提高微生物组科学的透明度和可重复性。我们还展示了社区开发的插件如何添加分析功能,并与QIIME 2一起安装和使用,从而增强微生物组分析的各个方面。,提高分类学分类精度。最后,我们说明了用户如何通过Qiita使用现成的公共数据组合不同的数据集进行元分析。在本教程中,我们分析了早期儿童抗生素和微生物组(ECAM)研究的一个子集,该研究跟踪了43名美国婴儿从出生到2岁的微生物组组成和发育,确定了微生物组与抗生素暴露、分娩方式和饮食的关系。有关QIIME 2的更多信息,请参见https://qiime2.org。要解决问题或询问有关QIIME 2和微生物组分析的问题,请加入活跃的社区https://forum.qiime2.org。©2020作者。基本协议:使用QIIME 2与微生物组数据支持协议:进一步的微生物组分析
{"title":"QIIME 2 Enables Comprehensive End-to-End Analysis of Diverse Microbiome Data and Comparative Studies with Publicly Available Data","authors":"Mehrbod Estaki,&nbsp;Lingjing Jiang,&nbsp;Nicholas A. Bokulich,&nbsp;Daniel McDonald,&nbsp;Antonio González,&nbsp;Tomasz Kosciolek,&nbsp;Cameron Martino,&nbsp;Qiyun Zhu,&nbsp;Amanda Birmingham,&nbsp;Yoshiki Vázquez-Baeza,&nbsp;Matthew R. Dillon,&nbsp;Evan Bolyen,&nbsp;J. Gregory Caporaso,&nbsp;Rob Knight","doi":"10.1002/cpbi.100","DOIUrl":"10.1002/cpbi.100","url":null,"abstract":"<p>QIIME 2 is a completely re-engineered microbiome bioinformatics platform based on the popular QIIME platform, which it has replaced. QIIME 2 facilitates comprehensive and fully reproducible microbiome data science, improving accessibility to diverse users by adding multiple user interfaces. QIIME 2 can be combined with Qiita, an open-source web-based platform, to re-use available data for meta-analysis. The following basic protocol describes how to install QIIME 2 on a single computer and analyze microbiome sequence data, from processing of raw DNA sequence reads through generating publishable interactive figures. These interactive figures allow readers of a study to interact with data with the same ease as its authors, advancing microbiome science transparency and reproducibility. We also show how plug-ins developed by the community to add analysis capabilities can be installed and used with QIIME 2, enhancing various aspects of microbiome analyses—e.g., improving taxonomic classification accuracy. Finally, we illustrate how users can perform meta-analyses combining different datasets using readily available public data through Qiita. In this tutorial, we analyze a subset of the Early Childhood Antibiotics and the Microbiome (ECAM) study, which tracked the microbiome composition and development of 43 infants in the United States from birth to 2 years of age, identifying microbiome associations with antibiotic exposure, delivery mode, and diet. For more information about QIIME 2, see https://qiime2.org. To troubleshoot or ask questions about QIIME 2 and microbiome analysis, join the active community at https://forum.qiime2.org. © 2020 The Authors.</p><p><b>Basic Protocol</b>: Using QIIME 2 with microbiome data</p><p><b>Support Protocol</b>: Further microbiome analyses</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"70 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.100","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37879861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 165
Analyzing Protein Disorder with IUPred2A 用IUPred2A分析蛋白质紊乱
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-04-01 DOI: 10.1002/cpbi.99
Gábor Erdős, Zsuzsanna Dosztányi

IUPred2A is a combined prediction tool designed to discover intrinsically disordered or conditionally disordered proteins and protein regions. Intrinsically disordered regions exist without a well-defined three-dimensional structure in isolation but carry out important biological functions. Over the years, various prediction methods have been developed to characterize disordered regions. The existence of disordered segments can also be dependent on different factors such as binding partners or environmental traits like pH or redox potential, and recognizing such regions represents additional computational challenges. In this article, we present detailed instructions on how to use IUPred2A, one of the most widely used tools for the prediction of disordered regions/proteins or conditionally disordered segments, and provide examples of how the predictions can be interpreted in different contexts. © 2020 The Authors.

Basic Protocol 1: Analyzing disorder propensity with IUPred2A online

Basic Protocol 2: Analyzing disordered binding regions using ANCHOR2

Support Protocol 1: Interpretation of the results

Basic Protocol 3: Analyzing redox-sensitive disordered regions

Support Protocol 2: Download options

Support Protocol 3: REST API for programmatic purposes

Basic Protocol 4: Using IUPred2A locally

IUPred2A是一种组合预测工具,旨在发现内在无序或条件无序的蛋白质和蛋白质区域。本质上无序的区域是孤立存在的,没有明确的三维结构,但却具有重要的生物学功能。多年来,人们开发了各种预测方法来表征无序区域。无序片段的存在也可能取决于不同的因素,如结合伙伴或环境特征,如pH或氧化还原电位,识别这些区域代表了额外的计算挑战。在本文中,我们详细介绍了如何使用IUPred2A,这是预测无序区域/蛋白质或条条状无序片段的最广泛使用的工具之一,并提供了如何在不同背景下解释预测的示例。©2020作者。基本协议1:使用IUPred2A在线分析无序倾向基本协议2:使用anchor分析无序结合区域支持协议1:结果解释基本协议3:分析氧化还原敏感无序区域支持协议2:下载选项支持协议3:用于编程目的的REST API基本协议4:在本地使用IUPred2A
{"title":"Analyzing Protein Disorder with IUPred2A","authors":"Gábor Erdős,&nbsp;Zsuzsanna Dosztányi","doi":"10.1002/cpbi.99","DOIUrl":"10.1002/cpbi.99","url":null,"abstract":"<p>IUPred2A is a combined prediction tool designed to discover intrinsically disordered or conditionally disordered proteins and protein regions. Intrinsically disordered regions exist without a well-defined three-dimensional structure in isolation but carry out important biological functions. Over the years, various prediction methods have been developed to characterize disordered regions. The existence of disordered segments can also be dependent on different factors such as binding partners or environmental traits like pH or redox potential, and recognizing such regions represents additional computational challenges. In this article, we present detailed instructions on how to use IUPred2A, one of the most widely used tools for the prediction of disordered regions/proteins or conditionally disordered segments, and provide examples of how the predictions can be interpreted in different contexts. © 2020 The Authors.</p><p><b>Basic Protocol 1</b>: Analyzing disorder propensity with IUPred2A online</p><p><b>Basic Protocol 2</b>: Analyzing disordered binding regions using ANCHOR2</p><p><b>Support Protocol 1</b>: Interpretation of the results</p><p><b>Basic Protocol 3</b>: Analyzing redox-sensitive disordered regions</p><p><b>Support Protocol 2</b>: Download options</p><p><b>Support Protocol 3</b>: REST API for programmatic purposes</p><p><b>Basic Protocol 4</b>: Using IUPred2A locally</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"70 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.99","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37792090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 201
期刊
Current protocols in bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1