首页 > 最新文献

Current protocols in bioinformatics最新文献

英文 中文
MetaBridge: An Integrative Multi-Omics Tool for Metabolite-Enzyme Mapping metbridge:代谢酶图谱的综合多组学工具
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-03-21 DOI: 10.1002/cpbi.98
Travis Blimkie, Amy Huei-Yi Lee, Robert E.W. Hancock

MetaBridge is a web-based tool designed to facilitate the integration of metabolomics with other “omics” data types such as transcriptomics and proteomics. It uses data from the MetaCyc metabolic pathway database and the Kyoto Encyclopedia of Genes and Genomes (KEGG) to map metabolite compounds to directly interacting upstream or downstream enzymes in enzymatic reactions and metabolic pathways. The resulting list of enzymes can then be integrated with transcriptomics or proteomics data via protein-protein interaction networks to perform integrative multi-omics analyses. MetaBridge was developed to be intuitive and easy to use, requiring little to no prior computational experience. The protocols described here detail all steps involved in the use of MetaBridge, from preparing input data and performing metabolite mapping to utilizing the results to build a protein-protein interaction network. © 2020 by John Wiley & Sons, Inc.

Basic Protocol 1: Mapping metabolite data using MetaCyc identifiers

Basic Protocol 2: Mapping metabolite data using KEGG identifiers

Support Protocol 1: Converting compound names to HMDB IDs

Support Protocol 2: Submitting mapped genes produced by MetaBridge for protein-protein interaction (PPI) network construction

MetaBridge是一个基于网络的工具,旨在促进代谢组学与其他“组学”数据类型(如转录组学和蛋白质组学)的整合。它使用MetaCyc代谢途径数据库和京都基因与基因组百科全书(KEGG)的数据来绘制代谢物化合物在酶反应和代谢途径中直接相互作用的上游或下游酶。由此产生的酶列表可以通过蛋白质-蛋白质相互作用网络与转录组学或蛋白质组学数据集成,以执行综合多组学分析。MetaBridge是为了直观和易于使用而开发的,几乎不需要事先的计算经验。本文描述的协议详细介绍了使用MetaBridge所涉及的所有步骤,从准备输入数据和执行代谢物映射到利用结果构建蛋白质-蛋白质相互作用网络。©2020 by John Wiley &基本协议1:使用MetaCyc标识符绘制代谢物数据基本协议2:使用KEGG标识符绘制代谢物数据支持协议1:将化合物名称转换为HMDB id支持协议2:提交MetaBridge生成的用于蛋白质-蛋白质相互作用(PPI)网络构建的映射基因
{"title":"MetaBridge: An Integrative Multi-Omics Tool for Metabolite-Enzyme Mapping","authors":"Travis Blimkie,&nbsp;Amy Huei-Yi Lee,&nbsp;Robert E.W. Hancock","doi":"10.1002/cpbi.98","DOIUrl":"10.1002/cpbi.98","url":null,"abstract":"<p>MetaBridge is a web-based tool designed to facilitate the integration of metabolomics with other “omics” data types such as transcriptomics and proteomics. It uses data from the MetaCyc metabolic pathway database and the Kyoto Encyclopedia of Genes and Genomes (KEGG) to map metabolite compounds to directly interacting upstream or downstream enzymes in enzymatic reactions and metabolic pathways. The resulting list of enzymes can then be integrated with transcriptomics or proteomics data via protein-protein interaction networks to perform integrative multi-omics analyses. MetaBridge was developed to be intuitive and easy to use, requiring little to no prior computational experience. The protocols described here detail all steps involved in the use of MetaBridge, from preparing input data and performing metabolite mapping to utilizing the results to build a protein-protein interaction network. © 2020 by John Wiley &amp; Sons, Inc.</p><p><b>Basic Protocol 1</b>: Mapping metabolite data using MetaCyc identifiers</p><p><b>Basic Protocol 2</b>: Mapping metabolite data using KEGG identifiers</p><p><b>Support Protocol 1</b>: Converting compound names to HMDB IDs</p><p><b>Support Protocol 2</b>: Submitting mapped genes produced by MetaBridge for protein-protein interaction (PPI) network construction</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"70 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.98","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37760312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Visualizing Human Protein-Protein Interactions and Subcellular Localizations on Cell Images Through CellMap 可视化人类蛋白质-蛋白质相互作用和亚细胞定位在细胞图像上通过细胞地图
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-03-09 DOI: 10.1002/cpbi.97
Christian Dallago, Tatyana Goldberg, Miguel Angel Andrade-Navarro, Gregorio Alanis-Lobato, Burkhard Rost

Visualizing protein data remains a challenging and stimulating task. Useful and intuitive visualization tools may help advance biomolecular and medical research; unintuitive tools may bar important breakthroughs. This protocol describes two use cases for the CellMap (http://cellmap.protein.properties) web tool. The tool allows researchers to visualize human protein-protein interaction data constrained by protein subcellular localizations. In the simplest form, proteins are visualized on cell images that also show protein-protein interactions (PPIs) through lines (edges) connecting the proteins across the compartments. At a glance, this simultaneously highlights spatial constraints that proteins are subject to in their physical environment and visualizes PPIs against these localizations. Visualizing two realities helps in decluttering the protein interaction visualization from “hairball” phenomena that arise when single proteins or groups thereof interact with hundreds of partners. © 2019 The Authors.

Basic Protocol 1: Visualizing proteins and their interactions on cell images

Basic Protocol 2: Displaying all interaction partners for a protein

可视化蛋白质数据仍然是一项具有挑战性和刺激性的任务。有用和直观的可视化工具可能有助于推进生物分子和医学研究;不直观的工具可能阻碍重大突破。本协议描述了CellMap (http://cellmap.protein.properties) web工具的两个用例。该工具允许研究人员可视化受蛋白质亚细胞定位限制的人类蛋白质-蛋白质相互作用数据。在最简单的形式中,蛋白质在细胞图像上可视化,也通过连接隔室蛋白质的线(边缘)显示蛋白质-蛋白质相互作用(PPIs)。乍一看,这同时突出了蛋白质在其物理环境中受到的空间限制,并显示了针对这些定位的ppi。可视化两种现实有助于从“毛球”现象中理清蛋白质相互作用的可视化,当单个蛋白质或其群体与数百个伙伴相互作用时,就会出现这种现象。©2019作者。基本方案1:在细胞图像上可视化蛋白质及其相互作用基本方案2:显示蛋白质的所有相互作用伙伴
{"title":"Visualizing Human Protein-Protein Interactions and Subcellular Localizations on Cell Images Through CellMap","authors":"Christian Dallago,&nbsp;Tatyana Goldberg,&nbsp;Miguel Angel Andrade-Navarro,&nbsp;Gregorio Alanis-Lobato,&nbsp;Burkhard Rost","doi":"10.1002/cpbi.97","DOIUrl":"10.1002/cpbi.97","url":null,"abstract":"<p>Visualizing protein data remains a challenging and stimulating task. Useful and intuitive visualization tools may help advance biomolecular and medical research; unintuitive tools may bar important breakthroughs. This protocol describes two use cases for the CellMap (http://cellmap.protein.properties) web tool. The tool allows researchers to visualize human protein-protein interaction data constrained by protein subcellular localizations. In the simplest form, proteins are visualized on cell images that also show protein-protein interactions (PPIs) through lines (edges) connecting the proteins across the compartments. At a glance, this simultaneously highlights spatial constraints that proteins are subject to in their physical environment and visualizes PPIs against these localizations. Visualizing two realities helps in decluttering the protein interaction visualization from “hairball” phenomena that arise when single proteins or groups thereof interact with hundreds of partners. © 2019 The Authors.</p><p><b>Basic Protocol 1</b>: Visualizing proteins and their interactions on cell images</p><p><b>Basic Protocol 2</b>: Displaying all interaction partners for a protein</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"69 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.97","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37718929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Using ggtree to Visualize Data on Tree-Like Structures 使用ggtree在树状结构上可视化数据
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-03-05 DOI: 10.1002/cpbi.96
Guangchuang Yu

Ggtree is an R/Bioconductor package for visualizing tree-like structures and associated data. After 5 years of continual development, ggtree has been evolved as a package suite that contains treeio for tree data input and output, tidytree for tree data manipulation, and ggtree for tree data visualization. Ggtree was originally designed to work with phylogenetic trees, and has been expanded to support other tree-like structures, which extends the application of ggtree to present tree data in other disciplines. This article contains five basic protocols describing how to visualize trees using the grammar of graphics syntax, how to visualize hierarchical clustering results with associated data, how to estimate bootstrap values and visualize the values on the tree, how to estimate continuous and discrete ancestral traits and visualize ancestral states on the tree, and how to visualize a multiple sequence alignment with a phylogenetic tree. The ggtree package is freely available at https://www.bioconductor.org/packages/ggtree. © 2020 by John Wiley & Sons, Inc.

Basic Protocol 1: Using grammar of graphics for visualizing trees

Basic Protocol 2: Visualizing hierarchical clustering using ggtree

Basic Protocol 3: Visualizing bootstrap values as symbolic points

Basic Protocol 4: Visualizing ancestral status

Basic Protocol 5: Visualizing a multiple sequence alignment with a phylogenetic tree

Ggtree是一个R/Bioconductor包,用于可视化树状结构和相关数据。经过5年的不断发展,ggtree已经发展成为一个包套件,其中包含用于树数据输入和输出的treeio,用于树数据操作的tidytree和用于树数据可视化的ggtree。Ggtree最初设计用于处理系统发育树,并已扩展到支持其他树状结构,这扩展了Ggtree的应用,可以在其他学科中呈现树状数据。本文包含五个基本协议,描述了如何使用图形语法可视化树,如何使用关联数据可视化分层聚类结果,如何估计自举值并可视化树上的值,如何估计连续和离散祖先特征并可视化树上的祖先状态,以及如何使用系统发育树可视化多序列对齐。ggtree包可以在https://www.bioconductor.org/packages/ggtree上免费获得。©2020 by John Wiley &基本协议1:使用图形语法可视化树基本协议2:使用ggtree可视化分层聚类基本协议3:将引导值可视化为符号点基本协议4:将祖先状态可视化基本协议5:通过系统发育树可视化多个序列对齐
{"title":"Using ggtree to Visualize Data on Tree-Like Structures","authors":"Guangchuang Yu","doi":"10.1002/cpbi.96","DOIUrl":"10.1002/cpbi.96","url":null,"abstract":"<p>Ggtree is an R/Bioconductor package for visualizing tree-like structures and associated data. After 5 years of continual development, ggtree has been evolved as a package suite that contains treeio for tree data input and output, tidytree for tree data manipulation, and ggtree for tree data visualization. Ggtree was originally designed to work with phylogenetic trees, and has been expanded to support other tree-like structures, which extends the application of ggtree to present tree data in other disciplines. This article contains five basic protocols describing how to visualize trees using the grammar of graphics syntax, how to visualize hierarchical clustering results with associated data, how to estimate bootstrap values and visualize the values on the tree, how to estimate continuous and discrete ancestral traits and visualize ancestral states on the tree, and how to visualize a multiple sequence alignment with a phylogenetic tree. The ggtree package is freely available at https://www.bioconductor.org/packages/ggtree. © 2020 by John Wiley &amp; Sons, Inc.</p><p><b>Basic Protocol 1</b>: Using grammar of graphics for visualizing trees</p><p><b>Basic Protocol 2</b>: Visualizing hierarchical clustering using ggtree</p><p><b>Basic Protocol 3</b>: Visualizing bootstrap values as symbolic points</p><p><b>Basic Protocol 4</b>: Visualizing ancestral status</p><p><b>Basic Protocol 5</b>: Visualizing a multiple sequence alignment with a phylogenetic tree</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"69 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.96","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37730487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 713
Mothulity Facilitates 16S/ITS Amplicon Diversity Analysis Mothulity便于16S/ITS扩增子多样性分析
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-02-24 DOI: 10.1002/cpbi.94
D. Izak, A. Gromadka, S. Kaczanowski

We present Mothulity—a novel interface for Mothur, a well-established tool for 16S/ITS biodiversity analysis. Although Mothur is a well-documented and virtually complete software suite, its proper execution might be challenging for first-time users, and editing the Mothur batch scripts is time consuming even for experienced users. Mothur produces little to no graphical output, leaving the generation of plots to the user. Mothulity minimizes the chance of human error through a minimalistic yet powerful interface, with most of the analysis parameters predefined or adjusted automatically. Time spent on running the analysis is drastically reduced, since Mothulity produces an HTML report with publication-quality figures. Finally, Mothulity can be conveniently used with the SLURM workload manager, and is thereby suitable for a range of computing facilities. © 2020 by John Wiley & Sons, Inc.

Basic Protocol 1: Standard operational procedure (SOP)

Basic Protocol 2: Generating report on pre-processed data

我们提出了mothulity -一个新的接口,用于Mothur,一个完善的16S/ITS生物多样性分析工具。尽管motherur是一个文档完备且几乎完整的软件套件,但它的正确执行对于第一次使用的用户来说可能是一个挑战,并且编辑motherur批处理脚本即使对于有经验的用户来说也是非常耗时的。motherur几乎不产生任何图形输出,将绘图的生成留给了用户。Mothulity通过简约而强大的界面将人为错误的机会降至最低,大多数分析参数都是预定义的或自动调整的。运行分析所花费的时间大大减少了,因为Mothulity生成的HTML报告具有出版质量的数据。最后,可以方便地将Mothulity与SLURM工作负载管理器一起使用,因此适用于各种计算设施。©2020 by John Wiley &基本协议1:标准操作程序(SOP)基本协议2:对预处理数据生成报告
{"title":"Mothulity Facilitates 16S/ITS Amplicon Diversity Analysis","authors":"D. Izak,&nbsp;A. Gromadka,&nbsp;S. Kaczanowski","doi":"10.1002/cpbi.94","DOIUrl":"10.1002/cpbi.94","url":null,"abstract":"<p>We present Mothulity—a novel interface for Mothur, a well-established tool for 16S/ITS biodiversity analysis. Although Mothur is a well-documented and virtually complete software suite, its proper execution might be challenging for first-time users, and editing the Mothur batch scripts is time consuming even for experienced users. Mothur produces little to no graphical output, leaving the generation of plots to the user. Mothulity minimizes the chance of human error through a minimalistic yet powerful interface, with most of the analysis parameters predefined or adjusted automatically. Time spent on running the analysis is drastically reduced, since Mothulity produces an HTML report with publication-quality figures. Finally, Mothulity can be conveniently used with the SLURM workload manager, and is thereby suitable for a range of computing facilities. © 2020 by John Wiley &amp; Sons, Inc.</p><p><b>Basic Protocol 1</b>: Standard operational procedure (SOP)</p><p><b>Basic Protocol 2</b>: Generating report on pre-processed data</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"69 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.94","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37670994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Constructing and Analyzing Computational Models of Cell Signaling with BioModelAnalyzer 用BioModelAnalyzer构建和分析细胞信号传导的计算模型
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-02-20 DOI: 10.1002/cpbi.95
Benjamin A. Hall, Jasmin Fisher

BioModelAnalyzer (BMA) is an open-source graphical tool for the development of executable models of protein and gene networks within cells. Based upon the Qualitative Networks formalism, the user can rapidly construct large networks, either manually or by connecting motifs selected from a built-in library. After the appropriate functions for each variable are defined, the user has access to three analysis engines to test the model. In addition to standard simulation tools, BMA includes an interface to the stability-testing algorithm and to a graphical Linear Temporal Logic (LTL) editor and analysis tool. Alongside this, we have developed a novel ChatBot to aid users constructing LTL queries and to explain the interface and run through tutorials. Here we present worked examples of model construction and testing via the interface. As an initial example, we discuss fate decisions in Dictyostelium discoidum and cAMP signaling. We go on to describe the workflow leading to the construction of a published model of the germline of C. elegans. Finally, we demonstrate how to construct simple models from the built-in network motif library. © 2020 by John Wiley & Sons, Inc.

Basic Protocol 1: Modeling the signaling network of Dictyostelium discoidum

Basic Protocol 2: Modeling the germline progression of Caenorhabditis elegans

Basic Protocol 3: Constructing a model of the cell cycle using motifs

BioModelAnalyzer (BMA)是一个开源的图形工具,用于开发细胞内蛋白质和基因网络的可执行模型。基于定性网络形式,用户可以快速构建大型网络,无论是手动还是通过连接从内置库中选择的主题。在为每个变量定义了适当的函数之后,用户可以访问三个分析引擎来测试模型。除了标准仿真工具外,BMA还包括稳定性测试算法和图形线性时序逻辑(LTL)编辑器和分析工具的接口。除此之外,我们还开发了一个新颖的ChatBot来帮助用户构建LTL查询,并解释界面和运行教程。在这里,我们给出了通过接口进行模型构建和测试的工作示例。作为最初的例子,我们讨论了盘齿龙的命运决定和cAMP信号。我们继续描述工作流程导致建设一个已发表的秀丽隐杆线虫种系模型。最后,我们演示了如何从内置的网络motif库中构建简单的模型。©2020 by John Wiley &基本方案1:对盘状盘基骨菌的信号网络进行建模;基本方案2:对秀丽隐杆线虫的种系进展进行建模;基本方案3:利用基序构建细胞周期模型
{"title":"Constructing and Analyzing Computational Models of Cell Signaling with BioModelAnalyzer","authors":"Benjamin A. Hall,&nbsp;Jasmin Fisher","doi":"10.1002/cpbi.95","DOIUrl":"10.1002/cpbi.95","url":null,"abstract":"<p>BioModelAnalyzer (BMA) is an open-source graphical tool for the development of executable models of protein and gene networks within cells. Based upon the <i>Qualitative Networks</i> formalism, the user can rapidly construct large networks, either manually or by connecting motifs selected from a built-in library. After the appropriate functions for each variable are defined, the user has access to three analysis engines to test the model. In addition to standard simulation tools, BMA includes an interface to the stability-testing algorithm and to a graphical Linear Temporal Logic (LTL) editor and analysis tool. Alongside this, we have developed a novel ChatBot to aid users constructing LTL queries and to explain the interface and run through tutorials. Here we present worked examples of model construction and testing via the interface. As an initial example, we discuss fate decisions in <i>Dictyostelium discoidum</i> and cAMP signaling. We go on to describe the workflow leading to the construction of a published model of the germline of <i>C. elegans</i>. Finally, we demonstrate how to construct simple models from the built-in network motif library. © 2020 by John Wiley &amp; Sons, Inc.</p><p><b>Basic Protocol 1</b>: Modeling the signaling network of <i>Dictyostelium discoidum</i></p><p><b>Basic Protocol 2</b>: Modeling the germline progression of <i>Caenorhabditis elegans</i></p><p><b>Basic Protocol 3</b>: Constructing a model of the cell cycle using motifs</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"69 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.95","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37661228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Using the MINT Database to Search Protein Interactions 使用MINT数据库搜索蛋白质相互作用
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-01-16 DOI: 10.1002/cpbi.93
Alberto Calderone, Marta Iannuccelli, Daniele Peluso, Luana Licata

The Molecular INTeractions Database (MINT) is a public database designed to store information about protein interactions. Protein interactions are extracted from scientific literature and annotated in the database by expert curators. Currently (October 2019), MINT contains information on more than 26,000 proteins and more than 131,600 interactions in over 30 model organisms. This article provides protocols for searching MINT over the Internet, using the new MINT Web Page. © 2020 by John Wiley & Sons, Inc.

Basic Protocol 1: Searching MINT over the internet

Alternate Protocol: MINT visualizer

Basic Protocol 2: Submitting interaction data

分子相互作用数据库(MINT)是一个公共数据库,旨在存储有关蛋白质相互作用的信息。蛋白质相互作用从科学文献中提取,并由专家管理员在数据库中注释。目前(2019年10月),MINT包含30多种模式生物中26,000多种蛋白质和131,600多种相互作用的信息。本文提供了使用新的MINT网页在Internet上搜索MINT的协议。©2020 by John Wiley &基本协议1:在互联网上搜索MINT替代协议:MINT可视化基本协议2:提交交互数据
{"title":"Using the MINT Database to Search Protein Interactions","authors":"Alberto Calderone,&nbsp;Marta Iannuccelli,&nbsp;Daniele Peluso,&nbsp;Luana Licata","doi":"10.1002/cpbi.93","DOIUrl":"10.1002/cpbi.93","url":null,"abstract":"<p>The Molecular INTeractions Database (MINT) is a public database designed to store information about protein interactions. Protein interactions are extracted from scientific literature and annotated in the database by expert curators. Currently (October 2019), MINT contains information on more than 26,000 proteins and more than 131,600 interactions in over 30 model organisms. This article provides protocols for searching MINT over the Internet, using the new MINT Web Page. © 2020 by John Wiley &amp; Sons, Inc.</p><p><b>Basic Protocol 1</b>: Searching MINT over the internet</p><p><b>Alternate Protocol</b>: MINT visualizer</p><p><b>Basic Protocol 2</b>: Submitting interaction data</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"69 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.93","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37549998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
How to Illuminate the Druggable Genome Using Pharos 如何利用Pharos照亮可用药基因组
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2020-01-03 DOI: 10.1002/cpbi.92
Timothy Sheils, Stephen L. Mathias, Vishal B. Siramshetty, Giovanni Bocci, Cristian G. Bologa, Jeremy J. Yang, Anna Waller, Noel Southall, Dac-Trung Nguyen, Tudor I. Oprea

Pharos is an integrated web-based informatics platform for the analysis of data aggregated by the Illuminating the Druggable Genome (IDG) Knowledge Management Center, an NIH Common Fund initiative. The current version of Pharos (as of October 2019) spans 20,244 proteins in the human proteome, 19,880 disease and phenotype associations, and 226,829 ChEMBL compounds. This resource not only collates and analyzes data from over 60 high-quality resources to generate these types, but also uses text indexing to find less apparent connections between targets, and has recently begun to collaborate with institutions that generate data and resources. Proteins are ranked according to a knowledge-based classification system, which can help researchers to identify less studied “dark” targets that could be potentially further illuminated. This is an important process for both drug discovery and target validation, as more knowledge can accelerate target identification, and previously understudied proteins can serve as novel targets in drug discovery. Two basic protocols illustrate the levels of detail available for targets and several methods of finding targets of interest. An Alternate Protocol illustrates the difference in available knowledge between less and more studied targets. © 2020 by John Wiley & Sons, Inc.

Basic Protocol 1: Search for a target and view details

Alternate Protocol: Search for dark target and view details

Basic Protocol 2: Filter a target list to get refined results

Pharos是一个基于网络的综合信息平台,用于分析由NIH共同基金倡议的照亮可药物基因组(IDG)知识管理中心汇总的数据。当前版本的Pharos(截至2019年10月)涵盖了人类蛋白质组中的20,244种蛋白质,19880种疾病和表型关联,以及226,829种ChEMBL化合物。该资源不仅整理和分析来自60多个高质量资源的数据来生成这些类型,而且还使用文本索引来查找目标之间不太明显的联系,并且最近开始与生成数据和资源的机构合作。蛋白质根据基于知识的分类系统进行排序,这可以帮助研究人员识别研究较少的“黑暗”目标,这些目标可能会进一步被照亮。这对于药物发现和靶点验证都是一个重要的过程,因为更多的知识可以加速靶点的识别,并且以前未被充分研究的蛋白质可以作为药物发现的新靶点。两个基本协议说明了可用于目标的详细程度和寻找感兴趣目标的几种方法。备选方案说明了较少和较多研究目标之间现有知识的差异。©2020 by John Wiley &基本协议1:搜索目标并查看详细信息备用协议:搜索暗目标并查看详细信息基本协议2:过滤目标列表以获得精炼结果
{"title":"How to Illuminate the Druggable Genome Using Pharos","authors":"Timothy Sheils,&nbsp;Stephen L. Mathias,&nbsp;Vishal B. Siramshetty,&nbsp;Giovanni Bocci,&nbsp;Cristian G. Bologa,&nbsp;Jeremy J. Yang,&nbsp;Anna Waller,&nbsp;Noel Southall,&nbsp;Dac-Trung Nguyen,&nbsp;Tudor I. Oprea","doi":"10.1002/cpbi.92","DOIUrl":"10.1002/cpbi.92","url":null,"abstract":"<p>Pharos is an integrated web-based informatics platform for the analysis of data aggregated by the Illuminating the Druggable Genome (IDG) Knowledge Management Center, an NIH Common Fund initiative. The current version of Pharos (as of October 2019) spans 20,244 proteins in the human proteome, 19,880 disease and phenotype associations, and 226,829 ChEMBL compounds. This resource not only collates and analyzes data from over 60 high-quality resources to generate these types, but also uses text indexing to find less apparent connections between targets, and has recently begun to collaborate with institutions that generate data and resources. Proteins are ranked according to a knowledge-based classification system, which can help researchers to identify less studied “dark” targets that could be potentially further illuminated. This is an important process for both drug discovery and target validation, as more knowledge can accelerate target identification, and previously understudied proteins can serve as novel targets in drug discovery. Two basic protocols illustrate the levels of detail available for targets and several methods of finding targets of interest. An Alternate Protocol illustrates the difference in available knowledge between less and more studied targets. © 2020 by John Wiley &amp; Sons, Inc.</p><p><b>Basic Protocol 1</b>: Search for a target and view details</p><p><b>Alternate Protocol</b>: Search for dark target and view details</p><p><b>Basic Protocol 2</b>: Filter a target list to get refined results</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"69 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.92","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37509395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
The MathIOmica Toolbox: General Analysis Utilities for Dynamic Omics Datasets MathIOmica工具箱:动态组学数据集的通用分析工具
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-12-18 DOI: 10.1002/cpbi.91
George I. Mias, Minzhang Zheng

MathIOmica is a package for bioinformatics, written in the Wolfram language, that provides multiple utilities to facilitate the analysis of longitudinal data generated from omics experiments, including transcriptomics, proteomics, and metabolomics data, as well as any generalized time series. MathIOmica uses Mathematica's notebook interface, wherein users can import longitudinal datasets, carry out quality control and normalization, generate time series, and classify temporal trends. MathIOmica provides spectral methods based on periodograms and autocorrelations for automatically detecting classes of temporal behavior and allowing the user to visualize collective temporal behavior, and also assess biological significance through Gene Ontology and pathway enrichment analyses. MathIOmica's time-series classification methods address common issues including missing data and uneven sampling in measurements. As such, the software is ideally suited for the analysis of experimental data from individualized profiling of subjects, can facilitate analysis of data from the emerging field of individualized health monitoring, and can detect temporal trends that may be associated with adverse health events. In this article, we import a transcriptomics (RNA-sequencing) dataset collected over multiple timepoints and generate time series for each transcript represented in the data. We classify the time series to identify classes of significant temporal trends (using autocorrelations). We assess statistical significance cutoffs in the classification by generating null distributions using randomly resampled time series. We then visualize the significant trends in heatmaps and assess biological significance using enrichment analyses. Finally, we visualize pathway results for statistically significant pathways of interest. © 2019 by John Wiley & Sons, Inc.

Basic Protocol: Time series analysis of transcriptomics expression dataset

MathIOmica是一个用Wolfram语言编写的生物信息学软件包,它提供了多种实用程序来促进组学实验产生的纵向数据的分析,包括转录组学、蛋白质组学和代谢组学数据,以及任何广义时间序列。MathIOmica使用Mathematica的笔记本界面,用户可以导入纵向数据集,进行质量控制和规范化,生成时间序列,对时间趋势进行分类。MathIOmica提供了基于周期图和自相关性的光谱方法,用于自动检测时间行为的类别,并允许用户可视化集体时间行为,并且还通过基因本体和途径富集分析来评估生物学意义。MathIOmica的时间序列分类方法解决了常见的问题,包括数据缺失和测量中的采样不均匀。因此,该软件非常适合分析来自受试者个性化分析的实验数据,可以促进对来自新兴的个性化健康监测领域的数据的分析,并且可以检测可能与不良健康事件相关的时间趋势。在本文中,我们导入在多个时间点收集的转录组学(rna测序)数据集,并为数据中表示的每个转录生成时间序列。我们对时间序列进行分类,以确定重要时间趋势的类别(使用自相关性)。我们通过使用随机重新采样的时间序列生成零分布来评估分类中的统计显著性截止点。然后,我们将热图中的重要趋势可视化,并使用富集分析评估生物学意义。最后,我们可视化的途径结果统计显著感兴趣的途径。©2019 by John Wiley &基本方案:转录组学表达数据集的时间序列分析
{"title":"The MathIOmica Toolbox: General Analysis Utilities for Dynamic Omics Datasets","authors":"George I. Mias,&nbsp;Minzhang Zheng","doi":"10.1002/cpbi.91","DOIUrl":"10.1002/cpbi.91","url":null,"abstract":"<p>MathIOmica is a package for bioinformatics, written in the Wolfram language, that provides multiple utilities to facilitate the analysis of longitudinal data generated from omics experiments, including transcriptomics, proteomics, and metabolomics data, as well as any generalized time series. MathIOmica uses Mathematica's notebook interface, wherein users can import longitudinal datasets, carry out quality control and normalization, generate time series, and classify temporal trends. MathIOmica provides spectral methods based on periodograms and autocorrelations for automatically detecting classes of temporal behavior and allowing the user to visualize collective temporal behavior, and also assess biological significance through Gene Ontology and pathway enrichment analyses. MathIOmica's time-series classification methods address common issues including missing data and uneven sampling in measurements. As such, the software is ideally suited for the analysis of experimental data from individualized profiling of subjects, can facilitate analysis of data from the emerging field of individualized health monitoring, and can detect temporal trends that may be associated with adverse health events. In this article, we import a transcriptomics (RNA-sequencing) dataset collected over multiple timepoints and generate time series for each transcript represented in the data. We classify the time series to identify classes of significant temporal trends (using autocorrelations). We assess statistical significance cutoffs in the classification by generating null distributions using randomly resampled time series. We then visualize the significant trends in heatmaps and assess biological significance using enrichment analyses. Finally, we visualize pathway results for statistically significant pathways of interest. © 2019 by John Wiley &amp; Sons, Inc.</p><p><b>Basic Protocol</b>: Time series analysis of transcriptomics expression dataset</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"69 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.91","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37469965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
NCBI's Conserved Domain Database and Tools for Protein Domain Analysis NCBI的保守结构域数据库和蛋白质结构域分析工具
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-12-18 DOI: 10.1002/cpbi.90
Mingzhang Yang, Myra K. Derbyshire, Roxanne A. Yamashita, Aron Marchler-Bauer

The Conserved Domain Database (CDD) is a freely available resource for the annotation of sequences with the locations of conserved protein domain footprints, as well as functional sites and motifs inferred from these footprints. It includes protein domain and protein family models curated in house by CDD staff, as well as imported from a variety of other sources. The latest CDD release (v3.17, April 2019) contains more than 57,000 domain models, of which almost 15,000 were curated by CDD staff. The CDD curation effort increases coverage and provides finer-grained classifications of common and widely distributed protein domain families, for which a wealth of functional and structural data have become available. The CDD maintains both live search capabilities and an archive of pre-computed domain annotations for a selected subset of sequences tracked by the NCBI's Entrez protein database. These can be retrieved or computed for a single sequence using CD-Search or in bulk using Batch CD-Search, or computed via standalone RPS-BLAST plus the rpsbproc software package. The CDD can be accessed via https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml. The three protocols listed here describe how to perform a CD-Search (Basic Protocol 1), a Batch CD-Search (Basic Protocol 2), and a Standalone RPS-BLAST and rpsbproc (Basic Protocol 3). © 2019 The Authors.

Basic Protocol 1: CD-search

Basic Protocol 2: Batch CD-search

Basic Protocol 3: Standalone RPS-BLAST and rpsbproc

保守结构域数据库(CDD)是一个免费的资源,用于标注具有保守蛋白结构域足迹位置的序列,以及从这些足迹推断出的功能位点和基序。它包括由CDD工作人员在内部策划的蛋白质结构域和蛋白质家族模型,以及从各种其他来源导入的模型。最新的CDD版本(v3.17, 2019年4月)包含超过57,000个域模型,其中近15,000个由CDD工作人员策划。CDD管理工作增加了覆盖范围,并提供了对常见和广泛分布的蛋白质结构域家族的更细粒度的分类,为此提供了丰富的功能和结构数据。CDD维护实时搜索功能和NCBI的Entrez蛋白质数据库跟踪的选定序列子集的预计算域注释存档。这些数据可以使用CD-Search检索或计算单个序列,也可以使用Batch CD-Search批量检索或计算,或者通过独立的RPS-BLAST加上rpsbproc软件包进行计算。CDD可以通过https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml访问。这里列出的三个协议描述了如何执行cd搜索(基本协议1),批量cd搜索(基本协议2)以及独立RPS-BLAST和rpsbproc(基本协议3)。©2019作者。基本协议1:cd -search基本协议2:批量cd -search基本协议3:独立的RPS-BLAST和rpsbproc
{"title":"NCBI's Conserved Domain Database and Tools for Protein Domain Analysis","authors":"Mingzhang Yang,&nbsp;Myra K. Derbyshire,&nbsp;Roxanne A. Yamashita,&nbsp;Aron Marchler-Bauer","doi":"10.1002/cpbi.90","DOIUrl":"10.1002/cpbi.90","url":null,"abstract":"<p>The Conserved Domain Database (CDD) is a freely available resource for the annotation of sequences with the locations of conserved protein domain footprints, as well as functional sites and motifs inferred from these footprints. It includes protein domain and protein family models curated in house by CDD staff, as well as imported from a variety of other sources. The latest CDD release (v3.17, April 2019) contains more than 57,000 domain models, of which almost 15,000 were curated by CDD staff. The CDD curation effort increases coverage and provides finer-grained classifications of common and widely distributed protein domain families, for which a wealth of functional and structural data have become available. The CDD maintains both live search capabilities and an archive of pre-computed domain annotations for a selected subset of sequences tracked by the NCBI's Entrez protein database. These can be retrieved or computed for a single sequence using CD-Search or in bulk using Batch CD-Search, or computed via standalone RPS-BLAST plus the rpsbproc software package. The CDD can be accessed via https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml. The three protocols listed here describe how to perform a CD-Search (Basic Protocol 1), a Batch CD-Search (Basic Protocol 2), and a Standalone RPS-BLAST and rpsbproc (Basic Protocol 3). © 2019 The Authors.</p><p><b>Basic Protocol 1</b>: CD-search</p><p><b>Basic Protocol 2</b>: Batch CD-search</p><p><b>Basic Protocol 3</b>: Standalone RPS-BLAST and rpsbproc</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"69 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.90","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37469483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 107
Issue Information TOC 发布信息TOC
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-12-17 DOI: 10.1002/cpbi.65

Cover: In Chong et al. (http://doi.org/10.1002/cpbi.86), the image shows the MetaboAnalyst 4.0 flowchart. There are two main components: a data processing component to deal with different data inputs, and a data analysis component containing individual modules which can be categorized into “Exploratory Statistical Analysis,” “Functional Enrichment Analysis,” and “Data Integration and Systems Biology.” PROT X refers to the relevant protocol for that component/module.

封面:在Chong et al. (http://doi.org/10.1002/cpbi.86)中,图像显示了MetaboAnalyst 4.0流程图。有两个主要组件:处理不同数据输入的数据处理组件和包含单个模块的数据分析组件,这些模块可分为“探索性统计分析”、“功能富集分析”和“数据集成和系统生物学”。protx指的是该组件/模块的相关协议。
{"title":"Issue Information TOC","authors":"","doi":"10.1002/cpbi.65","DOIUrl":"10.1002/cpbi.65","url":null,"abstract":"<p><b>Cover</b>: In Chong et al. (http://doi.org/10.1002/cpbi.86), the image shows the MetaboAnalyst 4.0 flowchart. There are two main components: a data processing component to deal with different data inputs, and a data analysis component containing individual modules which can be categorized into “Exploratory Statistical Analysis,” “Functional Enrichment Analysis,” and “Data Integration and Systems Biology.” PROT X refers to the relevant protocol for that component/module.\u0000\u0000 <figure>\u0000 <div><picture>\u0000 <source></source></picture><p></p>\u0000 </div>\u0000 </figure></p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"68 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.65","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41539933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Current protocols in bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1