首页 > 最新文献

Current protocols in bioinformatics最新文献

英文 中文
The ENCODE Portal as an Epigenomics Resource ENCODE门户作为表观基因组学资源
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-11-21 DOI: 10.1002/cpbi.89
Jennifer Jou, Idan Gabdank, Yunhai Luo, Khine Lin, Paul Sud, Zachary Myers, Jason A. Hilton, Meenakshi S. Kagda, Bonita Lam, Emma O'Neill, Philip Adenekan, Keenan Graham, Ulugbek K. Baymuradov, Stuart R. Miyasato, J. Seth Strattan, Otto Jolanki, Jin-Wook Lee, Casey Litton, Forrest Y. Tanaka, Benjamin C. Hitz, J. Michael Cherry

The Encyclopedia of DNA Elements (ENCODE) web portal hosts genomic data generated by the ENCODE Consortium, Genomics of Gene Regulation, The NIH Roadmap Epigenomics Consortium, and the modENCODE and modERN projects. The goal of the ENCODE project is to build a comprehensive map of the functional elements of the human and mouse genomes. Currently, the portal database stores over 500 TB of raw and processed data from over 15,000 experiments spanning assays that measure gene expression, DNA accessibility, DNA and RNA binding, DNA methylation, and 3D chromatin structure across numerous cell lines, tissue types, and differentiation states with selected genetic and molecular perturbations. The ENCODE portal provides unrestricted access to the aforementioned data and relevant metadata as a service to the scientific community. The metadata model captures the details of the experiments, raw and processed data files, and processing pipelines in human and machine-readable form and enables the user to search for specific data either using a web browser or programmatically via REST API. Furthermore, ENCODE data can be freely visualized or downloaded for additional analyses. © 2019 The Authors.

Basic Protocol: Query the portal

Support Protocol 1: Batch downloading

Support Protocol 2: Using the cart to download files

Support Protocol 3: Visualize data

Alternate Protocol: Query building and programmatic access

DNA元素百科全书(ENCODE)门户网站承载了ENCODE联盟、基因调控基因组学、NIH路线图表观基因组学联盟以及modENCODE和modERN项目生成的基因组数据。ENCODE计划的目标是建立人类和小鼠基因组功能元素的综合图谱。目前,门户数据库存储了超过500 TB的原始和处理数据,这些数据来自超过15,000个实验,涵盖了测量基因表达、DNA可及性、DNA和RNA结合、DNA甲基化和3D染色质结构的分析,这些分析跨越了许多细胞系、组织类型和具有选定遗传和分子扰动的分化状态。ENCODE门户提供对上述数据和相关元数据的无限制访问,作为对科学界的服务。元数据模型以人类和机器可读的形式捕获实验的细节、原始和处理过的数据文件以及处理管道,并使用户能够使用web浏览器或通过REST API以编程方式搜索特定数据。此外,ENCODE数据可以自由地可视化或下载以进行额外的分析。©2019作者。基本协议:查询portal支持协议1:批量下载支持协议2:使用购物车下载文件支持协议3:可视化数据备用协议:查询构建和程序化访问
{"title":"The ENCODE Portal as an Epigenomics Resource","authors":"Jennifer Jou,&nbsp;Idan Gabdank,&nbsp;Yunhai Luo,&nbsp;Khine Lin,&nbsp;Paul Sud,&nbsp;Zachary Myers,&nbsp;Jason A. Hilton,&nbsp;Meenakshi S. Kagda,&nbsp;Bonita Lam,&nbsp;Emma O'Neill,&nbsp;Philip Adenekan,&nbsp;Keenan Graham,&nbsp;Ulugbek K. Baymuradov,&nbsp;Stuart R. Miyasato,&nbsp;J. Seth Strattan,&nbsp;Otto Jolanki,&nbsp;Jin-Wook Lee,&nbsp;Casey Litton,&nbsp;Forrest Y. Tanaka,&nbsp;Benjamin C. Hitz,&nbsp;J. Michael Cherry","doi":"10.1002/cpbi.89","DOIUrl":"10.1002/cpbi.89","url":null,"abstract":"<p>The Encyclopedia of DNA Elements (ENCODE) web portal hosts genomic data generated by the ENCODE Consortium, Genomics of Gene Regulation, The NIH Roadmap Epigenomics Consortium, and the modENCODE and modERN projects. The goal of the ENCODE project is to build a comprehensive map of the functional elements of the human and mouse genomes. Currently, the portal database stores over 500 TB of raw and processed data from over 15,000 experiments spanning assays that measure gene expression, DNA accessibility, DNA and RNA binding, DNA methylation, and 3D chromatin structure across numerous cell lines, tissue types, and differentiation states with selected genetic and molecular perturbations. The ENCODE portal provides unrestricted access to the aforementioned data and relevant metadata as a service to the scientific community. The metadata model captures the details of the experiments, raw and processed data files, and processing pipelines in human and machine-readable form and enables the user to search for specific data either using a web browser or programmatically via REST API. Furthermore, ENCODE data can be freely visualized or downloaded for additional analyses. © 2019 The Authors.</p><p><b>Basic Protocol</b>: Query the portal</p><p><b>Support Protocol 1</b>: Batch downloading</p><p><b>Support Protocol 2</b>: Using the cart to download files</p><p><b>Support Protocol 3</b>: Visualize data</p><p><b>Alternate Protocol</b>: Query building and programmatic access</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"68 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.89","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42016937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Using INTERSPIA to Explore the Dynamics of Protein-Protein Interactions Among Multiple Species 利用INTERSPIA探索多物种之间蛋白质-蛋白质相互作用的动力学
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-11-21 DOI: 10.1002/cpbi.88
Daehong Kwon, Daehwan Lee, Juyeon Kim, Jongin Lee, Mikang Sim, Jaebum Kim

INTER-Species Protein Interaction Analysis (INTERSPIA) is a web application for identifying diverse patterns of protein-protein interactions (PPIs) in different species. Given a set of proteins of interest to the user, INTERSPIA first discovers additional proteins that are functionally associated with the input proteins as well as different or common patterns of PPIs among the proteins in multiple species through a server-side pipeline. Second, it visualizes the dynamics of PPIs in multiple species via an easy-to-use web interface. This article contains a basic protocol describing how to visualize diverse patterns of PPIs of input proteins in multiple species, and how to use them for functional analysis in the web interface. INTERSPIA is freely available at http://bioinfo.konkuk.ac.kr/INTERSPIA/. © 2019 by John Wiley & Sons, Inc.

Basic Protocol: Running INTERSPIA using a list of input proteins

物种间蛋白质相互作用分析(INTERSPIA)是一个用于识别不同物种中蛋白质相互作用(PPIs)的不同模式的web应用程序。给定用户感兴趣的一组蛋白质,INTERSPIA首先通过服务器端管道发现与输入蛋白质在功能上相关的其他蛋白质,以及多个物种中蛋白质中不同或共同的ppi模式。其次,它通过一个易于使用的网络界面可视化多个物种的ppi动态。这篇文章包含了一个基本的协议,描述了如何在多个物种中可视化不同的输入蛋白的ppi模式,以及如何在web界面中使用它们进行功能分析。INTERSPIA可在http://bioinfo.konkuk.ac.kr/INTERSPIA/免费获得。©2019 by John Wiley &基本方案:使用输入蛋白列表运行INTERSPIA
{"title":"Using INTERSPIA to Explore the Dynamics of Protein-Protein Interactions Among Multiple Species","authors":"Daehong Kwon,&nbsp;Daehwan Lee,&nbsp;Juyeon Kim,&nbsp;Jongin Lee,&nbsp;Mikang Sim,&nbsp;Jaebum Kim","doi":"10.1002/cpbi.88","DOIUrl":"10.1002/cpbi.88","url":null,"abstract":"<p>INTER-Species Protein Interaction Analysis (INTERSPIA) is a web application for identifying diverse patterns of protein-protein interactions (PPIs) in different species. Given a set of proteins of interest to the user, INTERSPIA first discovers additional proteins that are functionally associated with the input proteins as well as different or common patterns of PPIs among the proteins in multiple species through a server-side pipeline. Second, it visualizes the dynamics of PPIs in multiple species via an easy-to-use web interface. This article contains a basic protocol describing how to visualize diverse patterns of PPIs of input proteins in multiple species, and how to use them for functional analysis in the web interface. INTERSPIA is freely available at http://bioinfo.konkuk.ac.kr/INTERSPIA/. © 2019 by John Wiley &amp; Sons, Inc.</p><p><b>Basic Protocol</b>: Running INTERSPIA using a list of input proteins</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"68 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.88","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45311660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Population Genetic Inference With MIGRATE 与迁移的群体遗传推断
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-10-24 DOI: 10.1002/cpbi.87
Peter Beerli, Somayeh Mashayekhi, Marjan Sadeghi, Marzieh Khodaei, Kyle Shaw

Many evolutionary biologists collect genetic data from natural populations and then need to investigate the relationship among these populations to compare different biogeographic hypotheses. MIGRATE, a useful tool for exploring relationships between populations and comparing hypotheses, has existed since 1998. Throughout the years, it has steadily improved in both the quality of algorithms used and in the efficiency of carrying out those calculations, thus allowing for a larger number of loci to be evaluated. This efficiency has been enhanced, as MIGRATE has been developed to perform many of its calculations concurrently when running on a computer cluster. The program is based on the coalescence theory and uses Bayesian inference to estimate posterior probability densities of all the parameters of a user-specified population model. Complex models, which include migration and colonization parameters, can be specified. These models can be evaluated using marginal likelihoods, thus allowing a user to compare the merits of different hypotheses. The three presented protocols will help novice users to develop sophisticated analysis techniques useful for their research projects. © 2019 The Authors.

Basic Protocol 1: First steps with MIGRATE

Basic Protocol 2: Population model specification

Basic Protocol 3: Prior distribution specification

Basic Protocol 4: Model selection

Support Protocol 1: Installing the program MIGRATE

Support Protocol 2: Installation of parallel MIGRATE

许多进化生物学家从自然种群中收集遗传数据,然后需要研究这些种群之间的关系,以比较不同的生物地理学假设。MIGRATE是一个探索种群间关系和比较假设的有用工具,自1998年以来一直存在。多年来,它在所用算法的质量和执行这些计算的效率方面都稳步提高,从而允许对更多的位点进行评估。这种效率已经得到了提高,因为MIGRATE已经被开发为在计算机集群上运行时并发地执行许多计算。该程序基于聚结理论,并使用贝叶斯推理来估计用户指定的人口模型中所有参数的后验概率密度。复杂的模型,包括迁移和殖民参数,可以指定。这些模型可以使用边际似然来评估,从而允许用户比较不同假设的优点。提出的三个协议将帮助新手用户开发对他们的研究项目有用的复杂分析技术。©2019作者。基本协议1:第一步与migrate基本协议2:人口模型规范基本协议3:先验分布规范基本协议4:模型选择支持协议1:安装程序migratesport协议2:安装并行迁移
{"title":"Population Genetic Inference With MIGRATE","authors":"Peter Beerli,&nbsp;Somayeh Mashayekhi,&nbsp;Marjan Sadeghi,&nbsp;Marzieh Khodaei,&nbsp;Kyle Shaw","doi":"10.1002/cpbi.87","DOIUrl":"10.1002/cpbi.87","url":null,"abstract":"<p>Many evolutionary biologists collect genetic data from natural populations and then need to investigate the relationship among these populations to compare different biogeographic hypotheses. MIGRATE, a useful tool for exploring relationships between populations and comparing hypotheses, has existed since 1998. Throughout the years, it has steadily improved in both the quality of algorithms used and in the efficiency of carrying out those calculations, thus allowing for a larger number of loci to be evaluated. This efficiency has been enhanced, as MIGRATE has been developed to perform many of its calculations concurrently when running on a computer cluster. The program is based on the coalescence theory and uses Bayesian inference to estimate posterior probability densities of all the parameters of a user-specified population model. Complex models, which include migration and colonization parameters, can be specified. These models can be evaluated using marginal likelihoods, thus allowing a user to compare the merits of different hypotheses. The three presented protocols will help novice users to develop sophisticated analysis techniques useful for their research projects. © 2019 The Authors.</p><p><b>Basic Protocol 1</b>: First steps with MIGRATE</p><p><b>Basic Protocol 2</b>: Population model specification</p><p><b>Basic Protocol 3</b>: Prior distribution specification</p><p><b>Basic Protocol 4</b>: Model selection</p><p><b>Support Protocol 1</b>: Installing the program MIGRATE</p><p><b>Support Protocol 2</b>: Installation of parallel MIGRATE</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"68 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.87","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47483598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 43
Using MetaboAnalyst 4.0 for Comprehensive and Integrative Metabolomics Data Analysis 使用MetaboAnalyst 4.0进行综合代谢组学数据分析
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-09-20 DOI: 10.1002/cpbi.86
Jasmine Chong, David S. Wishart, Jianguo Xia
MetaboAnalyst (https://www.metaboanalyst.ca) is an easy‐to‐use web‐based tool suite for comprehensive metabolomic data analysis, interpretation, and integration with other omics data. Since its first release in 2009, MetaboAnalyst has evolved significantly to meet the ever‐expanding bioinformatics demands from the rapidly growing metabolomics community. In addition to providing a variety of data processing and normalization procedures, MetaboAnalyst supports a wide array of functions for statistical, functional, as well as data visualization tasks. Some of the most widely used approaches include PCA (principal component analysis), PLS‐DA (partial least squares discriminant analysis), clustering analysis and visualization, MSEA (metabolite set enrichment analysis), MetPA (metabolic pathway analysis), biomarker selection via ROC (receiver operating characteristic) curve analysis, as well as time series and power analysis. The current version of MetaboAnalyst (4.0) features a complete overhaul of the user interface and significantly expanded underlying knowledge bases (compound database, pathway libraries, and metabolite sets). Three new modules have been added to support pathway activity prediction directly from mass peaks, biomarker meta‐analysis, and network‐based multi‐omics data integration. To enable more transparent and reproducible analysis of metabolomic data, we have released a companion R package (MetaboAnalystR) to complement the web‐based application. This article provides an overview of the main functional modules and the general workflow of MetaboAnalyst 4.0, followed by 12 detailed protocols: © 2019 by John Wiley & Sons, Inc.
MetaboAnalyst (https://www.metaboanalyst.ca)是一个易于使用的基于web的工具套件,用于全面的代谢组学数据分析,解释和与其他组学数据的集成。自2009年首次发布以来,MetaboAnalyst已经显著发展,以满足快速增长的代谢组学社区不断扩大的生物信息学需求。除了提供各种数据处理和规范化过程之外,MetaboAnalyst还支持用于统计、函数和数据可视化任务的各种功能。一些最广泛使用的方法包括PCA(主成分分析),PLS-DA(偏最小二乘判别分析),聚类分析和可视化,MSEA(代谢物集富集分析),MetPA(代谢途径分析),通过ROC(受试者工作特征)曲线分析进行生物标志物选择,以及时间序列和功率分析。当前版本的MetaboAnalyst(4.0)对用户界面进行了全面修改,并显著扩展了基础知识库(化合物数据库、途径库和代谢物集)。增加了三个新模块,以支持直接从质量峰,生物标志物荟萃分析和基于网络的多组学数据集成的途径活性预测。为了使代谢组学数据的分析更加透明和可重复,我们发布了一个配套的R包(MetaboAnalystR)来补充基于web的应用程序。本文概述了MetaboAnalyst 4.0的主要功能模块和一般工作流程,其次是12个详细协议:©2019 by John Wiley &基本方案1:数据上传、处理和归一化基本方案2:显著变量的识别基本方案3:多变量探索性数据分析基本方案4:代谢组学的功能解释数据库方案5:基于受试者工作特征(ROC)曲线的生物标志物分析基本方案6:时间序列和双因素数据分析基本方案7:样本量估计和功率分析基本方案8:联合通路分析基本协议9:MS峰到通路活性基本协议10:生物标志物元分析基本协议11:基于知识的多组学网络探索基本协议12:代谢分析介绍
{"title":"Using MetaboAnalyst 4.0 for Comprehensive and Integrative Metabolomics Data Analysis","authors":"Jasmine Chong,&nbsp;David S. Wishart,&nbsp;Jianguo Xia","doi":"10.1002/cpbi.86","DOIUrl":"10.1002/cpbi.86","url":null,"abstract":"MetaboAnalyst (https://www.metaboanalyst.ca) is an easy‐to‐use web‐based tool suite for comprehensive metabolomic data analysis, interpretation, and integration with other omics data. Since its first release in 2009, MetaboAnalyst has evolved significantly to meet the ever‐expanding bioinformatics demands from the rapidly growing metabolomics community. In addition to providing a variety of data processing and normalization procedures, MetaboAnalyst supports a wide array of functions for statistical, functional, as well as data visualization tasks. Some of the most widely used approaches include PCA (principal component analysis), PLS‐DA (partial least squares discriminant analysis), clustering analysis and visualization, MSEA (metabolite set enrichment analysis), MetPA (metabolic pathway analysis), biomarker selection via ROC (receiver operating characteristic) curve analysis, as well as time series and power analysis. The current version of MetaboAnalyst (4.0) features a complete overhaul of the user interface and significantly expanded underlying knowledge bases (compound database, pathway libraries, and metabolite sets). Three new modules have been added to support pathway activity prediction directly from mass peaks, biomarker meta‐analysis, and network‐based multi‐omics data integration. To enable more transparent and reproducible analysis of metabolomic data, we have released a companion R package (MetaboAnalystR) to complement the web‐based application. This article provides an overview of the main functional modules and the general workflow of MetaboAnalyst 4.0, followed by 12 detailed protocols: © 2019 by John Wiley & Sons, Inc.","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"68 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.86","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43320523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1489
Issue Information TOC 发布信息TOC
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-09-16 DOI: 10.1002/cpbi.64

Cover: In Prandi and Demichelis (https://doi.org/10.1002/cpbi.81), the image shows a cartoon of the computation of beta and allelic fraction of informative SNPs. (A) Example of the allelic fraction (AF) and beta (β) values computed for five genomic positions (p1 to pm) corresponding to five informative SNPs. Positions p1 to pn are within a hemizygously deleted genomic segment, A, whereas genomic positions pn + 1 to pm lie within a wild-type genomic segment, B. (B to D) Examples of a normal cell and two different tumor cells. Tumor cells 1 and 2 differ in the status of genomic segment B. Histograms below the cell cartoons report the expected distribution of the AF of SNPs in genomic segments A and B together with the associated beta values. (E and F) Examples of two different tumor samples. Tumor sample 1 includes one normal cell and nine tumor cells with deleted genomic segment A and wild-type genomic segment B. Tumor sample 2 differs from tumor sample 1 in the presence of six tumor cells with a hemizygous deletion of genomic segment B. Expected distribution of the AF of informative SNPs together with estimated beta are depicted below each tumor sample cartoon.

封面:在Prandi和Demichelis (https://doi.org/10.1002/cpbi.81)中,图像显示了计算信息snp的β和等位基因部分的卡通。(A) 5个基因组位置(p1至pm)对应5个信息性snp的等位基因分数(AF)和β (β)值示例。位置p1至pn位于半合子缺失的基因组片段a中,而基因组位置pn + 1至pm位于野生型基因组片段B中。(B至D)正常细胞和两个不同肿瘤细胞的例子。肿瘤细胞1和2在基因组段B的状态不同。细胞图下面的直方图报告了基因组段A和B中snp的AF的预期分布以及相关的β值。(E和F)两种不同肿瘤样本的例子。肿瘤样本1包括1个正常细胞和9个缺失基因组片段A和野生型基因组片段b的肿瘤细胞。肿瘤样本2与肿瘤样本1的不同之处在于存在6个基因组片段b半合子缺失的肿瘤细胞。
{"title":"Issue Information TOC","authors":"","doi":"10.1002/cpbi.64","DOIUrl":"10.1002/cpbi.64","url":null,"abstract":"<p><b>Cover</b>: In Prandi and Demichelis (https://doi.org/10.1002/cpbi.81), the image shows a cartoon of the computation of beta and allelic fraction of informative SNPs. (<b>A</b>) Example of the allelic fraction (AF) and beta (β) values computed for five genomic positions (p<sub>1</sub> to p<sub><i>m</i></sub>) corresponding to five informative SNPs. Positions p<sub>1</sub> to p<sub><i>n</i></sub> are within a hemizygously deleted genomic segment, A, whereas genomic positions p<sub><i>n</i> + 1</sub> to p<sub><i>m</i></sub> lie within a wild-type genomic segment, B. (<b>B</b> to <b>D</b>) Examples of a normal cell and two different tumor cells. Tumor cells 1 and 2 differ in the status of genomic segment B. Histograms below the cell cartoons report the expected distribution of the AF of SNPs in genomic segments A and B together with the associated beta values. (<b>E</b> and <b>F</b>) Examples of two different tumor samples. Tumor sample 1 includes one normal cell and nine tumor cells with deleted genomic segment A and wild-type genomic segment B. Tumor sample 2 differs from tumor sample 1 in the presence of six tumor cells with a hemizygous deletion of genomic segment B. Expected distribution of the AF of informative SNPs together with estimated beta are depicted below each tumor sample cartoon.\t\u0000\u0000 <figure>\u0000 <div><picture>\u0000 <source></source></picture><p></p>\u0000 </div>\u0000 </figure></p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"67 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.64","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47235711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using MARRVEL v1.2 for Bioinformatics Analysis of Human Genes and Variant Pathogenicity 利用marvel v1.2进行人类基因和变异致病性的生物信息学分析
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-07-19 DOI: 10.1002/cpbi.85
Julia Wang, Dongxue Mao, Fatima Fazal, Seon-Young Kim, Shinya Yamamoto, Hugo Bellen, Zhandong Liu

One of the greatest challenges in the bioinformatic analysis of human sequencing data is identifying which variants are pathogenic. Numerous databases and tools have been generated to address this difficulty. However, these many useful data and tools are broadly dispersed, requiring users to search for their variants of interest through human genetic databases, variant function prediction tools, and model organism databases. To solve this problem, we collected data and observed workflows of human geneticists, clinicians, and model organism researchers to carefully select and display valuable information that facilitates the evaluation of whether a variant is likely to be pathogenic. This program, Model organism Aggregated Resources for Rare Variant ExpLoration (MARRVEL) v1.2, allows users to collect relevant data from 27 public sources for further efficient bioinformatic analysis of the pathogenicity of human variants. © 2019 by John Wiley & Sons, Inc.

在人类测序数据的生物信息学分析中,最大的挑战之一是确定哪些变异是致病的。已经产生了许多数据库和工具来解决这一困难。然而,这些有用的数据和工具是广泛分散的,需要用户通过人类遗传数据库、变异功能预测工具和模式生物数据库来搜索他们感兴趣的变体。为了解决这个问题,我们收集了数据,并观察了人类遗传学家、临床医生和模式生物研究人员的工作流程,以仔细选择和显示有价值的信息,从而有助于评估一种变异是否可能具有致病性。模型生物罕见变异探索聚合资源(marvel) v1.2程序允许用户从27个公共来源收集相关数据,以进一步有效地对人类变异的致病性进行生物信息学分析。©2019 by John Wiley &儿子,Inc。
{"title":"Using MARRVEL v1.2 for Bioinformatics Analysis of Human Genes and Variant Pathogenicity","authors":"Julia Wang,&nbsp;Dongxue Mao,&nbsp;Fatima Fazal,&nbsp;Seon-Young Kim,&nbsp;Shinya Yamamoto,&nbsp;Hugo Bellen,&nbsp;Zhandong Liu","doi":"10.1002/cpbi.85","DOIUrl":"10.1002/cpbi.85","url":null,"abstract":"<p>One of the greatest challenges in the bioinformatic analysis of human sequencing data is identifying which variants are pathogenic. Numerous databases and tools have been generated to address this difficulty. However, these many useful data and tools are broadly dispersed, requiring users to search for their variants of interest through human genetic databases, variant function prediction tools, and model organism databases. To solve this problem, we collected data and observed workflows of human geneticists, clinicians, and model organism researchers to carefully select and display valuable information that facilitates the evaluation of whether a variant is likely to be pathogenic. This program, Model organism Aggregated Resources for Rare Variant ExpLoration (MARRVEL) v1.2, allows users to collect relevant data from 27 public sources for further efficient bioinformatic analysis of the pathogenicity of human variants. © 2019 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"67 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.85","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44081559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Predicting Sequence Features, Function, and Structure of Proteins Using MESSA 利用MESSA预测蛋白质的序列特征、功能和结构
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-07-15 DOI: 10.1002/cpbi.84
Archana S. Bhat, Nick V. Grishin

MEta-Server for protein Sequence Analysis (MESSA) is a tool that facilitates widespread protein sequence analysis by gathering structural (local sequence properties and three-dimensional structure) and functional (annotations from SWISS-PROT, Gene Ontology terms, and enzyme classification) predictions for a query protein of interest. MESSA uses multiple well-established tools to offer consensus-based predictions on important aspects of protein sequence analysis. Being freely available for noncommercial users and with a user-friendly interface, MESSA serves as an umbrella platform that overcomes the absence of a comprehensive tool for predictive protein analysis. This article reveals how to access MESSA via the Web and shows how to input a protein sequence to analyze using the MESSA web server. It also includes a detailed explanation of the output from MESSA to aid in better interpretation of results. © 2019 by John Wiley & Sons, Inc.

MEta-Server for protein Sequence Analysis (MESSA)是一种工具,通过收集感兴趣的查询蛋白质的结构(局部序列特性和三维结构)和功能(来自SWISS-PROT的注释,基因本体术语和酶分类)预测来促进广泛的蛋白质序列分析。MESSA使用多种完善的工具,在蛋白质序列分析的重要方面提供基于共识的预测。对于非商业用户来说,MESSA是免费的,并且具有用户友好的界面,它作为一个伞形平台,克服了预测蛋白质分析的综合工具的缺乏。本文揭示了如何通过Web访问MESSA,并展示了如何使用MESSA Web服务器输入要分析的蛋白质序列。它还包括对MESSA输出的详细解释,以帮助更好地解释结果。©2019 by John Wiley &儿子,Inc。
{"title":"Predicting Sequence Features, Function, and Structure of Proteins Using MESSA","authors":"Archana S. Bhat,&nbsp;Nick V. Grishin","doi":"10.1002/cpbi.84","DOIUrl":"10.1002/cpbi.84","url":null,"abstract":"<p>MEta-Server for protein Sequence Analysis (MESSA) is a tool that facilitates widespread protein sequence analysis by gathering structural (local sequence properties and three-dimensional structure) and functional (annotations from SWISS-PROT, Gene Ontology terms, and enzyme classification) predictions for a query protein of interest. MESSA uses multiple well-established tools to offer consensus-based predictions on important aspects of protein sequence analysis. Being freely available for noncommercial users and with a user-friendly interface, MESSA serves as an umbrella platform that overcomes the absence of a comprehensive tool for predictive protein analysis. This article reveals how to access MESSA via the Web and shows how to input a protein sequence to analyze using the MESSA web server. It also includes a detailed explanation of the output from MESSA to aid in better interpretation of results. © 2019 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"67 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.84","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41194260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Ploidy- and Purity-Adjusted Allele-Specific DNA Analysis Using CLONETv2 使用CLONETv2进行倍性和纯度调整的等位基因特异性DNA分析
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-06-21 DOI: 10.1002/cpbi.81
Davide Prandi, Francesca Demichelis

High-throughput DNA sequencing technology provides base-level and statistically rich information about the genomic content of a sample. In the contexts of cancer research and precision oncology, thousands of genomes from paired tumor and matched normal samples are profiled and processed to determine somatic copy-number changes and single-nucleotide variations. Higher-order informative analyses, in the form of allele-specific copy-number assessments or subclonality quantification, require reliable estimates of tumor DNA ploidy and tumor cellularity. CLONETv2 provides a complete set of functions to process matched normal and tumor pairs using patient-specific genotype data, is independent of low-level tools (e.g., aligner, segmentation algorithm, mutation caller) and offers high-level functions to compute allele-specific copy number from segmented data and to identify subclonal population in the input sample. CLONETv2 is applicable to whole-genome, whole-exome and targeted sequencing data generated either from tissue or from liquid biopsy samples. © 2019 The Authors.

高通量DNA测序技术提供了样本基因组内容的基础水平和统计丰富的信息。在癌症研究和精确肿瘤学的背景下,来自配对肿瘤和匹配正常样本的数千个基因组被分析和处理,以确定体细胞拷贝数变化和单核苷酸变化。高阶信息分析,以等位基因特异性拷贝数评估或亚克隆性量化的形式,需要对肿瘤DNA倍性和肿瘤细胞性进行可靠的估计。CLONETv2提供了一套完整的功能,使用患者特异性基因型数据处理匹配的正常和肿瘤配对,独立于低级工具(例如,对准器、分割算法、突变调用者),并提供高级功能,从分段数据中计算等位基因特异性拷贝数,并识别输入样本中的亚克隆群体。CLONETv2适用于从组织或液体活检样本中产生的全基因组、全外显子组和靶向测序数据。©2019作者。
{"title":"Ploidy- and Purity-Adjusted Allele-Specific DNA Analysis Using CLONETv2","authors":"Davide Prandi,&nbsp;Francesca Demichelis","doi":"10.1002/cpbi.81","DOIUrl":"10.1002/cpbi.81","url":null,"abstract":"<p>High-throughput DNA sequencing technology provides base-level and statistically rich information about the genomic content of a sample. In the contexts of cancer research and precision oncology, thousands of genomes from paired tumor and matched normal samples are profiled and processed to determine somatic copy-number changes and single-nucleotide variations. Higher-order informative analyses, in the form of allele-specific copy-number assessments or subclonality quantification, require reliable estimates of tumor DNA ploidy and tumor cellularity. CLONETv2 provides a complete set of functions to process matched normal and tumor pairs using patient-specific genotype data, is independent of low-level tools (e.g., aligner, segmentation algorithm, mutation caller) and offers high-level functions to compute allele-specific copy number from segmented data and to identify subclonal population in the input sample. CLONETv2 is applicable to whole-genome, whole-exome and targeted sequencing data generated either from tissue or from liquid biopsy samples. © 2019 The Authors.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"67 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.81","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44187064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Community Curation and Expert Curation of Human Long Noncoding RNAs with LncRNAWiki and LncBook 基于LncRNAWiki和LncBook的人类长链非编码rna的社区管理和专家管理
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-06-20 DOI: 10.1002/cpbi.82
Lina Ma, Jiabao Cao, Lin Liu, Zhao Li, Huma Shireen, Nashaiman Pervaiz, Fatima Batool, Rabail Z. Raza, Dong Zou, Yiming Bao, Amir A. Abbasi, Zhang Zhang

In recent years, the number of human long noncoding RNAs (lncRNAs) that have been identified has increased exponentially. However, these lncRNAs are poorly annotated compared to protein-coding genes, posing great challenges for a better understanding of their functional significance and elucidating their complex functioning molecular mechanisms. Here we employ both community and expert curation to yield a comprehensive collection of human lncRNAs and their annotations. Specifically, LncRNAWiki (http://lncrna.big.ac.cn/index.php/Main_Page) uses a wiki-based community curation model, thus showing great promise in dealing with the flood of biological knowledge, while LncBook (http://bigd.big.ac.cn/lncbook) is an expert curation–based database that provides a complement to LncRNAWiki. LncBook features a comprehensive collection of human lncRNAs and a systematic curation of lncRNAs by multi-omics data integration, functional annotation, and disease association. These protocols provide step-by-step instructions on how to browse and search a specific lncRNA and how to obtain a range of related information including expression, methylation, variation, function, and disease association. © 2019 by John Wiley & Sons, Inc.

近年来,已鉴定的人类长链非编码rna (lncRNAs)数量呈指数级增长。然而,与蛋白质编码基因相比,这些lncrna的注释较差,这为更好地理解其功能意义和阐明其复杂的功能分子机制带来了巨大的挑战。在这里,我们采用社区和专家策展来产生人类lncrna及其注释的综合集合。具体来说,LncRNAWiki (http://lncrna.big.ac.cn/index.php/Main_Page)使用了基于维基的社区管理模式,因此在处理生物知识的洪水方面显示出很大的希望,而lnbook (http://bigd.big.ac.cn/lncbook)是一个基于专家管理的数据库,为LncRNAWiki提供了补充。lnbook的特点是通过多组学数据集成、功能注释和疾病关联,全面收集人类lncrna,并对lncrna进行系统管理。这些协议提供了如何浏览和搜索特定lncRNA以及如何获得一系列相关信息的分步说明,包括表达,甲基化,变异,功能和疾病关联。©2019 by John Wiley &儿子,Inc。
{"title":"Community Curation and Expert Curation of Human Long Noncoding RNAs with LncRNAWiki and LncBook","authors":"Lina Ma,&nbsp;Jiabao Cao,&nbsp;Lin Liu,&nbsp;Zhao Li,&nbsp;Huma Shireen,&nbsp;Nashaiman Pervaiz,&nbsp;Fatima Batool,&nbsp;Rabail Z. Raza,&nbsp;Dong Zou,&nbsp;Yiming Bao,&nbsp;Amir A. Abbasi,&nbsp;Zhang Zhang","doi":"10.1002/cpbi.82","DOIUrl":"10.1002/cpbi.82","url":null,"abstract":"<p>In recent years, the number of human long noncoding RNAs (lncRNAs) that have been identified has increased exponentially. However, these lncRNAs are poorly annotated compared to protein-coding genes, posing great challenges for a better understanding of their functional significance and elucidating their complex functioning molecular mechanisms. Here we employ both community and expert curation to yield a comprehensive collection of human lncRNAs and their annotations. Specifically, LncRNAWiki (http://lncrna.big.ac.cn/index.php/Main_Page) uses a wiki-based community curation model, thus showing great promise in dealing with the flood of biological knowledge, while LncBook (http://bigd.big.ac.cn/lncbook) is an expert curation–based database that provides a complement to LncRNAWiki. LncBook features a comprehensive collection of human lncRNAs and a systematic curation of lncRNAs by multi-omics data integration, functional annotation, and disease association. These protocols provide step-by-step instructions on how to browse and search a specific lncRNA and how to obtain a range of related information including expression, methylation, variation, function, and disease association. © 2019 by John Wiley &amp; Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"67 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.82","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47832300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Using Mothur to Determine Bacterial Community Composition and Structure in 16S Ribosomal RNA Datasets 利用母亲测定16S核糖体RNA数据集中的细菌群落组成和结构
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2019-06-20 DOI: 10.1002/cpbi.83
Sruthi Chappidi, Erika C. Villa, Brandi L. Cantarel

The 16S ribosomal RNA (rRNA) gene is one of the scaffolding molecules of the prokaryotic ribosome. Because this gene is slow to evolve and has very well conserved regions, this gene is used to reconstruct phylogenies in prokaryotes. Universal primers can be used to amplify the gene in prokaryotes including bacteria and archaea. To determine the microbial composition in microbial communities using high-throughput short-read sequencing techniques, primers are designed to span two or three of the nine variable regions of the gene. Mothur, developed in 2009, is a suite of tools to study the composition and structure of bacterial communities. This package is freely available from the developers (https://www.mothur.org). This protocol will show how to (1) perform preprocessing of sequences to remove errors, (2) perform operational taxonomic unit (OTU) analysis to determine alpha and beta diversity, and (3) determine the taxonomic profile of OTUs and the environmental sample. © 2019 The Authors.

16S核糖体RNA (rRNA)基因是原核核糖体的支架分子之一。由于该基因进化缓慢且具有非常好的保守区域,因此该基因用于重建原核生物的系统发育。通用引物可用于在原核生物中扩增该基因,包括细菌和古细菌。为了使用高通量短读测序技术确定微生物群落中的微生物组成,引物被设计为跨越基因九个可变区域中的两个或三个。2009年开发的motherur是一套研究细菌群落组成和结构的工具。该软件包可从开发人员处免费获得(https://www.mothur.org)。该方案将展示如何(1)进行序列预处理以去除错误,(2)进行操作分类单元(OTU)分析以确定α和β多样性,以及(3)确定OTU和环境样本的分类特征。©2019作者。
{"title":"Using Mothur to Determine Bacterial Community Composition and Structure in 16S Ribosomal RNA Datasets","authors":"Sruthi Chappidi,&nbsp;Erika C. Villa,&nbsp;Brandi L. Cantarel","doi":"10.1002/cpbi.83","DOIUrl":"10.1002/cpbi.83","url":null,"abstract":"<p>The 16S ribosomal RNA (rRNA) gene is one of the scaffolding molecules of the prokaryotic ribosome. Because this gene is slow to evolve and has very well conserved regions, this gene is used to reconstruct phylogenies in prokaryotes. Universal primers can be used to amplify the gene in prokaryotes including bacteria and archaea. To determine the microbial composition in microbial communities using high-throughput short-read sequencing techniques, primers are designed to span two or three of the nine variable regions of the gene. Mothur, developed in 2009, is a suite of tools to study the composition and structure of bacterial communities. This package is freely available from the developers (https://www.mothur.org). This protocol will show how to (1) perform preprocessing of sequences to remove errors, (2) perform operational taxonomic unit (OTU) analysis to determine alpha and beta diversity, and (3) determine the taxonomic profile of OTUs and the environmental sample. © 2019 The Authors.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"67 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.83","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41685273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
期刊
Current protocols in bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1