首页 > 最新文献

Current protocols in bioinformatics最新文献

英文 中文
Using the PRIDE Database and ProteomeXchange for Submitting and Accessing Public Proteomics Datasets 使用PRIDE数据库和ProteomeXchange提交和访问公共蛋白质组学数据集
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-02-13 DOI: 10.1002/cpbi.30
Andrew F. Jarnuczak, Juan Antonio Vizcaíno

The ProteomeXchange (PX) Consortium is the unifying framework for world-leading mass spectrometry (MS)–based proteomics repositories. Current members include the PRIDE database (U.K.), PeptideAtlas/PASSEL, and MassIVE (U.S.A.), and jPOST (Japan). The Consortium standardizes submission and dissemination of public proteomics data worldwide. This is achieved through implementing common data submission guidelines and enforcing metadata requirements by each of the members. Furthermore, the members use a common identifier space. Each dataset receives a unique (PXD) accession number and is publicly accessible as soon as the associated scientific publications are released. The two basic protocols provide a step-by-step guide on how to submit data to the PRIDE database, and describe how to access the PX portal (called ProteomeCentral), which can be used to search datasets available in any of the PX members. © 2017 by John Wiley & Sons, Inc.

ProteomeXchange (PX)联盟是世界领先的基于质谱(MS)的蛋白质组学资源库的统一框架。目前的成员包括PRIDE数据库(英国)、PeptideAtlas/PASSEL、MassIVE(美国)和jPOST(日本)。该联盟标准化提交和传播全球公共蛋白质组学数据。这是通过实现公共数据提交指南和每个成员强制执行元数据需求来实现的。此外,成员使用一个公共标识符空间。每个数据集都有一个唯一的(PXD)登录号,一旦相关的科学出版物发布,就可以公开访问。这两个基本协议提供了如何向PRIDE数据库提交数据的分步指南,并描述了如何访问PX门户(称为ProteomeCentral),该门户可用于搜索任何PX成员中可用的数据集。©2017 by John Wiley &儿子,Inc。
{"title":"Using the PRIDE Database and ProteomeXchange for Submitting and Accessing Public Proteomics Datasets","authors":"Andrew F. Jarnuczak,&nbsp;Juan Antonio Vizcaíno","doi":"10.1002/cpbi.30","DOIUrl":"10.1002/cpbi.30","url":null,"abstract":"<p>The ProteomeXchange (PX) Consortium is the unifying framework for world-leading mass spectrometry (MS)–based proteomics repositories. Current members include the PRIDE database (U.K.), PeptideAtlas/PASSEL, and MassIVE (U.S.A.), and jPOST (Japan). The Consortium standardizes submission and dissemination of public proteomics data worldwide. This is achieved through implementing common data submission guidelines and enforcing metadata requirements by each of the members. Furthermore, the members use a common identifier space. Each dataset receives a unique (PXD) accession number and is publicly accessible as soon as the associated scientific publications are released. The two basic protocols provide a step-by-step guide on how to submit data to the PRIDE database, and describe how to access the PX portal (called ProteomeCentral), which can be used to search datasets available in any of the PX members. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"59 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.30","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35402463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
Prediction of Protein-Protein Interactions 蛋白质相互作用的预测
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2017-12-08 DOI: 10.1002/cpbi.38
Max Kotlyar, Andrea E.M. Rossos, Igor Jurisica

The authors provide an overview of physical protein-protein interaction prediction, covering the main strategies for predicting interactions, approaches for assessing predictions, and online resources for accessing predictions. This unit focuses on the main advancements in each of these areas over the last decade. The methods and resources that are presented here are not an exhaustive set, but characterize the current state of the field—highlighting key challenges and achievements. © 2017 by John Wiley & Sons, Inc.

作者提供了物理蛋白质-蛋白质相互作用预测的概述,包括预测相互作用的主要策略,评估预测的方法,以及访问预测的在线资源。本单元侧重于过去十年中这些领域的主要进展。这里介绍的方法和资源并不是详尽的集合,但描述了该领域的现状,突出了主要的挑战和成就。©2017 by John Wiley &儿子,Inc。
{"title":"Prediction of Protein-Protein Interactions","authors":"Max Kotlyar,&nbsp;Andrea E.M. Rossos,&nbsp;Igor Jurisica","doi":"10.1002/cpbi.38","DOIUrl":"10.1002/cpbi.38","url":null,"abstract":"<p>The authors provide an overview of physical protein-protein interaction prediction, covering the main strategies for predicting interactions, approaches for assessing predictions, and online resources for accessing predictions. This unit focuses on the main advancements in each of these areas over the last decade. The methods and resources that are presented here are not an exhaustive set, but characterize the current state of the field—highlighting key challenges and achievements. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"60 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.38","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35629626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
The HMMER Web Server for Protein Sequence Similarity Search 蛋白质序列相似性搜索的HMMER Web服务器
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2017-12-08 DOI: 10.1002/cpbi.40
Ananth Prakash, Matt Jeffryes, Alex Bateman, Robert D. Finn

Protein sequence similarity search is one of the most commonly used bioinformatics methods for identifying evolutionarily related proteins. In general, sequences that are evolutionarily related share some degree of similarity, and sequence-search algorithms use this principle to identify homologs. The requirement for a fast and sensitive sequence search method led to the development of the HMMER software, which in the latest version (v3.1) uses a combination of sophisticated acceleration heuristics and mathematical and computational optimizations to enable the use of profile hidden Markov models (HMMs) for sequence analysis. The HMMER Web server provides a common platform by linking the HMMER algorithms to databases, thereby enabling the search for homologs, as well as providing sequence and functional annotation by linking external databases. This unit describes three basic protocols and two alternate protocols that explain how to use the HMMER Web server using various input formats and user defined parameters. © 2017 by John Wiley & Sons, Inc.

蛋白质序列相似性搜索是鉴定进化相关蛋白最常用的生物信息学方法之一。一般来说,在进化上相关的序列具有一定程度的相似性,序列搜索算法使用这一原理来识别同源物。对快速和敏感的序列搜索方法的需求导致了HMMER软件的开发,该软件在最新版本(v3.1)中结合了复杂的加速启发式和数学和计算优化,从而能够使用剖面隐马尔可夫模型(hmm)进行序列分析。HMMER Web服务器通过将HMMER算法链接到数据库提供了一个公共平台,从而能够搜索同源物,并通过链接外部数据库提供序列和功能注释。本单元描述了三个基本协议和两个替代协议,这些协议解释了如何使用各种输入格式和用户定义的参数来使用HMMER Web服务器。©2017 by John Wiley &儿子,Inc。
{"title":"The HMMER Web Server for Protein Sequence Similarity Search","authors":"Ananth Prakash,&nbsp;Matt Jeffryes,&nbsp;Alex Bateman,&nbsp;Robert D. Finn","doi":"10.1002/cpbi.40","DOIUrl":"10.1002/cpbi.40","url":null,"abstract":"<p>Protein sequence similarity search is one of the most commonly used bioinformatics methods for identifying evolutionarily related proteins. In general, sequences that are evolutionarily related share some degree of similarity, and sequence-search algorithms use this principle to identify homologs. The requirement for a fast and sensitive sequence search method led to the development of the HMMER software, which in the latest version (v3.1) uses a combination of sophisticated acceleration heuristics and mathematical and computational optimizations to enable the use of profile hidden Markov models (HMMs) for sequence analysis. The HMMER Web server provides a common platform by linking the HMMER algorithms to databases, thereby enabling the search for homologs, as well as providing sequence and functional annotation by linking external databases. This unit describes three basic protocols and two alternate protocols that explain how to use the HMMER Web server using various input formats and user defined parameters. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"60 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.40","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35629628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 103
Using the Arabidopsis Information Resource (TAIR) to Find Information About Arabidopsis Genes 利用拟南芥信息资源(TAIR)查找拟南芥基因信息
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2017-12-08 DOI: 10.1002/cpbi.36
Leonore Reiser, Shabari Subramaniam, Donghui Li, Eva Huala

The Arabidopsis Information Resource (TAIR; http://arabidopsis.org) is a comprehensive Web resource of Arabidopsis biology for plant scientists. TAIR curates and integrates information about genes, proteins, gene function, orthologs, gene expression, mutant phenotypes, biological materials such as clones and seed stocks, genetic markers, genetic and physical maps, genome organization, images of mutant plants, protein sub-cellular localizations, publications, and the research community. The various data types are extensively interconnected and can be accessed through a variety of Web-based search and display tools. This unit primarily focuses on some basic methods for searching, browsing, visualizing, and analyzing information about Arabidopsis genes and genome. Additionally, we describe how members of the community can share data using TAIR's Online Annotation Submission Tool (TOAST), in order to make their published research more accessible and visible. © 2017 by John Wiley & Sons, Inc.

拟南芥信息资源;http://arabidopsis.org)是一个面向植物科学家的拟南芥生物学综合资源网站。TAIR管理和整合有关基因、蛋白质、基因功能、同源物、基因表达、突变表型、生物材料(如克隆和种子库存)、遗传标记、遗传和物理图谱、基因组组织、突变植物图像、蛋白质亚细胞定位、出版物和研究社区的信息。各种数据类型广泛地相互连接,并且可以通过各种基于web的搜索和显示工具进行访问。本单元主要介绍拟南芥基因和基因组信息的搜索、浏览、可视化和分析的一些基本方法。此外,我们描述了社区成员如何使用TAIR的在线注释提交工具(TOAST)共享数据,以使他们发表的研究更容易访问和可见。©2017 by John Wiley &儿子,Inc。
{"title":"Using the Arabidopsis Information Resource (TAIR) to Find Information About Arabidopsis Genes","authors":"Leonore Reiser,&nbsp;Shabari Subramaniam,&nbsp;Donghui Li,&nbsp;Eva Huala","doi":"10.1002/cpbi.36","DOIUrl":"10.1002/cpbi.36","url":null,"abstract":"<p>The <i>Arabidopsis</i> Information Resource (TAIR; http://arabidopsis.org) is a comprehensive Web resource of <i>Arabidopsis</i> biology for plant scientists. TAIR curates and integrates information about genes, proteins, gene function, orthologs, gene expression, mutant phenotypes, biological materials such as clones and seed stocks, genetic markers, genetic and physical maps, genome organization, images of mutant plants, protein sub-cellular localizations, publications, and the research community. The various data types are extensively interconnected and can be accessed through a variety of Web-based search and display tools. This unit primarily focuses on some basic methods for searching, browsing, visualizing, and analyzing information about <i>Arabidopsis</i> genes and genome. Additionally, we describe how members of the community can share data using TAIR's Online Annotation Submission Tool (TOAST), in order to make their published research more accessible and visible. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"60 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.36","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35629629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 54
Using the Seven Bridges Cancer Genomics Cloud to Access and Analyze Petabytes of Cancer Data 使用七桥癌症基因组云访问和分析数pb的癌症数据
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2017-12-08 DOI: 10.1002/cpbi.39
Raunaq Malhotra, Isheeta Seth, Erik Lehnert, Jing Zhao, Gaurav Kaushik, Elizabeth H. Williams, Anurag Sethi, Brandi N. Davis-Dusenbery

Next-generation sequencing has produced petabytes of data, but accessing and analyzing these data remain challenging. Traditionally, researchers investigating public datasets like The Cancer Genome Atlas (TCGA) would download the data to a high-performance cluster, which could take several weeks even with a highly optimized network connection. The National Cancer Institute (NCI) initiated the Cancer Genomics Cloud Pilots program to provide researchers with the resources to process data with cloud computational resources. We present protocols using one of these Cloud Pilots, the Seven Bridges Cancer Genomics Cloud (CGC), to find and query public datasets, bring your own data to the CGC, analyze data using standard or custom workflows, and benchmark tools for accuracy with interactive analysis features. These protocols demonstrate that the CGC is a data-analysis ecosystem that fully empowers researchers with a variety of areas of expertise and interests to collaborate in the analysis of petabytes of data. © 2017 by John Wiley & Sons, Inc.

下一代测序已经产生了数拍字节的数据,但访问和分析这些数据仍然具有挑战性。传统上,研究人员调查像癌症基因组图谱(TCGA)这样的公共数据集,需要将数据下载到高性能集群,即使在高度优化的网络连接下,这也需要几周的时间。美国国家癌症研究所(NCI)启动了癌症基因组学云试点项目,为研究人员提供使用云计算资源处理数据的资源。我们提出了使用这些云试点之一的协议,七桥癌症基因组云(CGC),查找和查询公共数据集,将您自己的数据带到CGC,使用标准或自定义工作流程分析数据,以及具有交互式分析功能的准确性基准工具。这些协议表明,CGC是一个数据分析生态系统,充分授权具有各种专业知识和兴趣领域的研究人员在pb级数据分析中进行合作。©2017 by John Wiley &儿子,Inc。
{"title":"Using the Seven Bridges Cancer Genomics Cloud to Access and Analyze Petabytes of Cancer Data","authors":"Raunaq Malhotra,&nbsp;Isheeta Seth,&nbsp;Erik Lehnert,&nbsp;Jing Zhao,&nbsp;Gaurav Kaushik,&nbsp;Elizabeth H. Williams,&nbsp;Anurag Sethi,&nbsp;Brandi N. Davis-Dusenbery","doi":"10.1002/cpbi.39","DOIUrl":"10.1002/cpbi.39","url":null,"abstract":"<p>Next-generation sequencing has produced petabytes of data, but accessing and analyzing these data remain challenging. Traditionally, researchers investigating public datasets like The Cancer Genome Atlas (TCGA) would download the data to a high-performance cluster, which could take several weeks even with a highly optimized network connection. The National Cancer Institute (NCI) initiated the Cancer Genomics Cloud Pilots program to provide researchers with the resources to process data with cloud computational resources. We present protocols using one of these Cloud Pilots, the Seven Bridges Cancer Genomics Cloud (CGC), to find and query public datasets, bring your own data to the CGC, analyze data using standard or custom workflows, and benchmark tools for accuracy with interactive analysis features. These protocols demonstrate that the CGC is a data-analysis ecosystem that fully empowers researchers with a variety of areas of expertise and interests to collaborate in the analysis of petabytes of data. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"60 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.39","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35629630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Protein 3D Structure and Electron Microscopy Map Retrieval Using 3D-SURFER2.0 and EM-SURFER 使用3D- surfer2.0和EM-SURFER的蛋白质三维结构和电子显微镜图谱检索
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2017-12-08 DOI: 10.1002/cpbi.37
Xusi Han, Qing Wei, Daisuke Kihara

With the rapid growth in the number of solved protein structures stored in the Protein Data Bank (PDB) and the Electron Microscopy Data Bank (EMDB), it is essential to develop tools to perform real-time structure similarity searches against the entire structure database. Since conventional structure alignment methods need to sample different orientations of proteins in the three-dimensional space, they are time consuming and unsuitable for rapid, real-time database searches. To this end, we have developed 3D-SURFER and EM-SURFER, which utilize 3D Zernike descriptors (3DZD) to conduct high-throughput protein structure comparison, visualization, and analysis. Taking an atomic structure or an electron microscopy map of a protein or a protein complex as input, the 3DZD of a query protein is computed and compared with the 3DZD of all other proteins in PDB or EMDB. In addition, local geometrical characteristics of a query protein can be analyzed using VisGrid and LIGSITECSC in 3D-SURFER. This article describes how to use 3D-SURFER and EM-SURFER to carry out protein surface shape similarity searches, local geometric feature analysis, and interpretation of the search results. © 2017 by John Wiley & Sons, Inc.

随着蛋白质数据库(PDB)和电子显微镜数据库(EMDB)中已解决的蛋白质结构数量的快速增长,开发针对整个结构数据库进行实时结构相似性搜索的工具是必要的。由于传统的结构比对方法需要在三维空间中对蛋白质的不同取向进行采样,因此它们耗时且不适合快速、实时的数据库搜索。为此,我们开发了3D- surfer和EM-SURFER,利用3D泽尼克描述符(3DZD)进行高通量蛋白质结构比较、可视化和分析。以蛋白质或蛋白质复合物的原子结构或电子显微镜图为输入,计算查询蛋白质的3DZD,并与PDB或EMDB中所有其他蛋白质的3DZD进行比较。此外,还可以利用3D-SURFER中的VisGrid和LIGSITECSC分析查询蛋白的局部几何特征。本文介绍了如何使用3D-SURFER和EM-SURFER进行蛋白质表面形状相似性搜索,局部几何特征分析,以及搜索结果的解释。©2017 by John Wiley &儿子,Inc。
{"title":"Protein 3D Structure and Electron Microscopy Map Retrieval Using 3D-SURFER2.0 and EM-SURFER","authors":"Xusi Han,&nbsp;Qing Wei,&nbsp;Daisuke Kihara","doi":"10.1002/cpbi.37","DOIUrl":"10.1002/cpbi.37","url":null,"abstract":"<p>With the rapid growth in the number of solved protein structures stored in the Protein Data Bank (PDB) and the Electron Microscopy Data Bank (EMDB), it is essential to develop tools to perform real-time structure similarity searches against the entire structure database. Since conventional structure alignment methods need to sample different orientations of proteins in the three-dimensional space, they are time consuming and unsuitable for rapid, real-time database searches. To this end, we have developed 3D-SURFER and EM-SURFER, which utilize 3D Zernike descriptors (3DZD) to conduct high-throughput protein structure comparison, visualization, and analysis. Taking an atomic structure or an electron microscopy map of a protein or a protein complex as input, the 3DZD of a query protein is computed and compared with the 3DZD of all other proteins in PDB or EMDB. In addition, local geometrical characteristics of a query protein can be analyzed using VisGrid and LIGSITE<sup>CSC</sup> in 3D-SURFER. This article describes how to use 3D-SURFER and EM-SURFER to carry out protein surface shape similarity searches, local geometric feature analysis, and interpretation of the search results. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"60 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.37","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35629627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Using the NONCODE Database Resource 使用NONCODE数据库资源
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2017-06-27 DOI: 10.1002/cpbi.25
Li Xiyuan, Bu Dechao, Sun Liang, Wu Yang, Fang Shuangsang, Li Hui, Luo Haitao, Luo Chunlong, Fang Wenzheng, Chen Runsheng, Zhao Yi

NONCODE is a comprehensive database that aims to present the most complete collection and annotation of non-coding RNAs, especially long non-coding RNAs (lncRNA genes), and thus NONCODE is essential to modern biological and medical research. Scientists are producing a flood of new data from which new lncRNA genes and lncRNA-disease relationships are continually being identified. NONCODE assimilates such information from a wide variety of sources including published articles, RNA-seq data, micro-array data and databases on genetic variation (dbSNP) and genome-wide associations (GWAS). NONCODE organizes all this information and makes it freely available to the public via the Internet. The NONCODE protocol provides step-by-step instructions on how to browse and search lncRNA information such as sequence, expression, and disease relationships, how to use the tools for functional prediction, species conservation assays, blast analysis, identifier conversion, and, finally, how to submit sequences to identify lncRNA genes. As of Dec 2016, NONCODE has cataloged 487,851 lncRNA genes sequenced from 16 species. © 2017 by John Wiley & Sons, Inc.

NONCODE是一个综合性数据库,旨在提供最完整的非编码rna,特别是长链非编码rna (lncRNA基因)的收集和注释,因此NONCODE对现代生物学和医学研究至关重要。科学家们正在产生大量的新数据,从中不断发现新的lncRNA基因和lncRNA与疾病的关系。NONCODE从各种各样的来源吸收这些信息,包括发表的文章、RNA-seq数据、微阵列数据和遗传变异(dbSNP)和全基因组关联(GWAS)数据库。NONCODE组织所有这些信息,并通过互联网向公众免费提供。NONCODE协议提供了关于如何浏览和搜索lncRNA信息的分步说明,如序列,表达和疾病关系,如何使用功能预测工具,物种保护分析,blast分析,标识符转换,以及最后如何提交序列以鉴定lncRNA基因。截至2016年12月,NONCODE已经对来自16个物种的487,851个lncRNA基因进行了测序。©2017 by John Wiley &儿子,Inc。
{"title":"Using the NONCODE Database Resource","authors":"Li Xiyuan,&nbsp;Bu Dechao,&nbsp;Sun Liang,&nbsp;Wu Yang,&nbsp;Fang Shuangsang,&nbsp;Li Hui,&nbsp;Luo Haitao,&nbsp;Luo Chunlong,&nbsp;Fang Wenzheng,&nbsp;Chen Runsheng,&nbsp;Zhao Yi","doi":"10.1002/cpbi.25","DOIUrl":"10.1002/cpbi.25","url":null,"abstract":"<p>NONCODE is a comprehensive database that aims to present the most complete collection and annotation of non-coding RNAs, especially long non-coding RNAs (lncRNA genes), and thus NONCODE is essential to modern biological and medical research. Scientists are producing a flood of new data from which new lncRNA genes and lncRNA-disease relationships are continually being identified. NONCODE assimilates such information from a wide variety of sources including published articles, RNA-seq data, micro-array data and databases on genetic variation (dbSNP) and genome-wide associations (GWAS). NONCODE organizes all this information and makes it freely available to the public via the Internet. The NONCODE protocol provides step-by-step instructions on how to browse and search lncRNA information such as sequence, expression, and disease relationships, how to use the tools for functional prediction, species conservation assays, blast analysis, identifier conversion, and, finally, how to submit sequences to identify lncRNA genes. As of Dec 2016, NONCODE has cataloged 487,851 lncRNA genes sequenced from 16 species. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"58 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.25","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35123102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Finding Similar Nucleotide Sequences Using Network BLAST Searches 使用网络BLAST搜索寻找相似的核苷酸序列
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2017-06-27 DOI: 10.1002/cpbi.29
Istvan Ladunga

The Basic Local Alignment Search Tool (BLAST) is the first tool in the annotation of nucleotide or amino acid sequences. BLAST is a flagship of bioinformatics due to its performance and user-friendliness. Beginners and intermediate users will learn how to design and submit blastn and Megablast searches on the Web pages at the National Center for Biotechnology Information. We map nucleic acid sequences to genomes, find identical or similar mRNAs, expressed sequence tag, and noncoding RNA sequences, and run Megablast searches, which are much faster than blastn. Understanding results is assisted by taxonomy reports, genomic views, and multiple alignments. We interpret expected frequency thresholds, biological significance, and statistical significance. Weak hits provide no evidence, but indicate hints for further analyses. We find genes that may code for homologous proteins by translated BLAST. We reduce false positives by filtering out low-complexity regions. Parsed BLAST results can be integrated into analysis pipelines. Links in the output connect to Entrez and PubMed, as well as structural, sequence, interaction, and expression databases. This facilitates integration with a wide spectrum of biological knowledge. © 2017 by John Wiley & Sons, Inc.

BLAST (Basic Local Alignment Search Tool)是第一个用于核苷酸或氨基酸序列注释的工具。由于其性能和用户友好性,BLAST是生物信息学的旗舰产品。初学者和中级用户将学习如何在国家生物技术信息中心的网页上设计和提交blastn和Megablast搜索。我们将核酸序列映射到基因组,找到相同或相似的mrna,表达序列标签和非编码RNA序列,并运行Megablast搜索,这比blastn快得多。分类报告、基因组视图和多重比对有助于理解结果。我们解释了预期频率阈值、生物显著性和统计显著性。弱撞击不能提供证据,但为进一步分析提供了线索。我们发现了可能通过翻译BLAST编码同源蛋白的基因。我们通过过滤掉低复杂度区域来减少误报。解析后的BLAST结果可以集成到分析管道中。输出中的链接连接到Entrez和PubMed,以及结构、序列、交互和表达数据库。这有助于与广泛的生物学知识的整合。©2017 by John Wiley &儿子,Inc。
{"title":"Finding Similar Nucleotide Sequences Using Network BLAST Searches","authors":"Istvan Ladunga","doi":"10.1002/cpbi.29","DOIUrl":"10.1002/cpbi.29","url":null,"abstract":"<p>The Basic Local Alignment Search Tool (BLAST) is the first tool in the annotation of nucleotide or amino acid sequences. BLAST is a flagship of bioinformatics due to its performance and user-friendliness. Beginners and intermediate users will learn how to design and submit <i>blastn</i> and <i>Megablast</i> searches on the Web pages at the National Center for Biotechnology Information. We map nucleic acid sequences to genomes, find identical or similar mRNAs, expressed sequence tag, and noncoding RNA sequences, and run <i>Megablast</i> searches, which are much faster than <i>blastn</i>. Understanding results is assisted by taxonomy reports, genomic views, and multiple alignments. We interpret expected frequency thresholds, biological significance, and statistical significance. Weak hits provide no evidence, but indicate hints for further analyses. We find genes that may code for homologous proteins by translated BLAST. We reduce false positives by filtering out low-complexity regions. Parsed BLAST results can be integrated into analysis pipelines. Links in the output connect to Entrez and PubMed, as well as structural, sequence, interaction, and expression databases. This facilitates integration with a wide spectrum of biological knowledge. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"58 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.29","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35123103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
SIGNOR: A Database of Causal Relationships Between Biological Entities—A Short Guide to Searching and Browsing SIGNOR:一个生物实体之间因果关系的数据库——一个搜索和浏览的简短指南
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2017-06-27 DOI: 10.1002/cpbi.28
Prisca Lo Surdo, Alberto Calderone, Gianni Cesareni, Livia Perfetto

SIGNOR (http://signor.uniroma2.it), the SIGnaling Network Open Resource, is a database designed to store experimentally validated causal interactions, i.e., interactions where a source entity has a regulatory effect (up-regulation, down-regulation, etc.) on a second target entity. SIGNOR acts both as a source of signaling information and a support for data analysis, modeling, and prediction. A user-friendly interface features the ability to search entries for any given protein or group of proteins and to display their interactions graphically in a network view. At the time of writing, SIGNOR stores approximately 16,000 manually curated interactions connecting more than 4,000 biological entities (proteins, chemicals, protein complexes, etc.) that play a role in signal transduction. SIGNOR also offers a collection of 37 signaling pathways. SIGNOR can be queried by three search tools: “single-entity” search, “multiple-entity” search, and “pathway” search. This manuscript describes two basic protocols detailing how to navigate and search the SIGNOR database and how to download the annotated dataset for local use. Finally, the support protocol reviews the utilities of the graphic visualizer. © 2017 by John Wiley & Sons, Inc.

SIGNOR (http://signor.uniroma2.it),信令网络开放资源,是一个数据库,旨在存储经过实验验证的因果相互作用,即源实体对第二个目标实体具有调节作用(上调,下调等)的相互作用。SIGNOR既是信号信息的来源,也是数据分析、建模和预测的支持。用户友好界面的特点是能够搜索任何给定蛋白质或蛋白质组的条目,并在网络视图中以图形方式显示它们的相互作用。在撰写本文时,SIGNOR存储了大约16,000个人工管理的相互作用,连接了在信号转导中起作用的4,000多个生物实体(蛋白质,化学物质,蛋白质复合物等)。SIGNOR还提供了37种信号通路的集合。SIGNOR可以通过三种搜索工具进行查询:“单实体”搜索、“多实体”搜索和“路径”搜索。本文描述了两个基本协议,详细说明了如何导航和搜索SIGNOR数据库,以及如何下载带注释的数据集以供本地使用。最后,支持协议回顾了图形可视化工具的实用程序。©2017 by John Wiley &儿子,Inc。
{"title":"SIGNOR: A Database of Causal Relationships Between Biological Entities—A Short Guide to Searching and Browsing","authors":"Prisca Lo Surdo,&nbsp;Alberto Calderone,&nbsp;Gianni Cesareni,&nbsp;Livia Perfetto","doi":"10.1002/cpbi.28","DOIUrl":"10.1002/cpbi.28","url":null,"abstract":"<p>SIGNOR (http://signor.uniroma2.it), the SIGnaling Network Open Resource, is a database designed to store experimentally validated causal interactions, i.e., interactions where a source entity has a regulatory effect (up-regulation, down-regulation, etc.) on a second target entity. SIGNOR acts both as a source of signaling information and a support for data analysis, modeling, and prediction. A user-friendly interface features the ability to search entries for any given protein or group of proteins and to display their interactions graphically in a network view. At the time of writing, SIGNOR stores approximately 16,000 manually curated interactions connecting more than 4,000 biological entities (proteins, chemicals, protein complexes, etc.) that play a role in signal transduction. SIGNOR also offers a collection of 37 signaling pathways. SIGNOR can be queried by three search tools: “single-entity” search, “multiple-entity” search, and “pathway” search. This manuscript describes two basic protocols detailing how to navigate and search the SIGNOR database and how to download the annotated dataset for local use. Finally, the support protocol reviews the utilities of the graphic visualizer. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"58 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.28","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35123104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Identifying Significantly Impacted Pathways and Putative Mechanisms with iPathwayGuide 使用iPathwayGuide识别显著受影响的通路和推测的机制
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2017-06-27 DOI: 10.1002/cpbi.24
Sidra Ahsan, Sorin Drăghici

iPathwayGuide is a gene expression analysis tool that provides biological context and inferences from data generated by high-throughput sequencing. iPathwayGuide utilizes a systems biology approach to identify significantly impacted signaling pathways, Gene Ontology terms, disease processes, predicted microRNAs, and putative mechanisms based on the given differential expression signature. By using a novel analytical approach called Impact Analysis, iPathwayGuide considers the role, position, and relationships of each gene within a pathway, which results in a significant reduction in false positives, as well as a better ability to identify the truly impacted pathways and putative mechanisms that can explain all measured gene expression changes. It is a Web-based, user-friendly, interactive tool that does not require prior training in bioinformatics. The protocols in this unit describe how to use iPathwayGuide to analyze a single contrast between two phenotypes (any number of samples), and provide guidance on how to interpret the results obtained from iPathwayGuide. Even though iPathwayGuide has powerful meta-analysis capabilities, these are not covered in this unit. © 2017 by John Wiley & Sons, Inc.

iPathwayGuide是一种基因表达分析工具,可从高通量测序产生的数据中提供生物学背景和推断。iPathwayGuide利用系统生物学方法识别显著受影响的信号通路、基因本体术语、疾病过程、预测的microrna和基于给定差异表达特征的推测机制。通过使用一种称为影响分析的新型分析方法,iPathwayGuide考虑了途径中每个基因的作用、位置和关系,从而大大减少了假阳性,以及更好地识别真正受影响的途径和可以解释所有测量基因表达变化的假定机制的能力。它是一个基于网络的、用户友好的交互式工具,不需要事先接受生物信息学方面的培训。本单元的协议描述了如何使用iPathwayGuide来分析两种表型(任意数量的样本)之间的单个对比,并提供了如何解释从iPathwayGuide获得的结果的指导。尽管iPathwayGuide具有强大的元分析功能,但本单元并未涉及这些功能。©2017 by John Wiley &儿子,Inc。
{"title":"Identifying Significantly Impacted Pathways and Putative Mechanisms with iPathwayGuide","authors":"Sidra Ahsan,&nbsp;Sorin Drăghici","doi":"10.1002/cpbi.24","DOIUrl":"10.1002/cpbi.24","url":null,"abstract":"<p>iPathwayGuide is a gene expression analysis tool that provides biological context and inferences from data generated by high-throughput sequencing. iPathwayGuide utilizes a systems biology approach to identify significantly impacted signaling pathways, Gene Ontology terms, disease processes, predicted microRNAs, and putative mechanisms based on the given differential expression signature. By using a novel analytical approach called Impact Analysis, iPathwayGuide considers the role, position, and relationships of each gene within a pathway, which results in a significant reduction in false positives, as well as a better ability to identify the truly impacted pathways and putative mechanisms that can explain all measured gene expression changes. It is a Web-based, user-friendly, interactive tool that does not require prior training in bioinformatics. The protocols in this unit describe how to use iPathwayGuide to analyze a single contrast between two phenotypes (any number of samples), and provide guidance on how to interpret the results obtained from iPathwayGuide. Even though iPathwayGuide has powerful meta-analysis capabilities, these are not covered in this unit. © 2017 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"57 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.24","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35124380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 61
期刊
Current protocols in bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1