首页 > 最新文献

Bioinformatics advances最新文献

英文 中文
ISMB/ECCB 2023 organization benefited from the strengths of the French bioinformatics community. ISMB/ECCB 2023 组织得益于法国生物信息学界的优势。
Pub Date : 2024-05-03 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae040
Anna-Sophie Fiston-Lavier, Sandra Dérozier, Guy Perrière, Marie-France Sagot
{"title":"ISMB/ECCB 2023 organization benefited from the strengths of the French bioinformatics community.","authors":"Anna-Sophie Fiston-Lavier, Sandra Dérozier, Guy Perrière, Marie-France Sagot","doi":"10.1093/bioadv/vbae040","DOIUrl":"https://doi.org/10.1093/bioadv/vbae040","url":null,"abstract":"","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11076915/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140892264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
scX: a user-friendly tool for scRNAseq exploration. scX:用于探索 scRNAseq 的用户友好型工具。
IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-05-02 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae062
Tomás V Waichman, M L Vercesi, Ariel A Berardino, Maximiliano S Beckel, Damiana Giacomini, Natalí B Rasetto, Magalí Herrero, Daniela J Di Bella, Paola Arlotta, Alejandro F Schinder, Ariel Chernomoretz

Motivation: Single-cell RNA sequencing (scRNAseq) has transformed our ability to explore biological systems. Nevertheless, proficient expertise is essential for handling and interpreting the data.

Results: In this article, we present scX, an R package built on the Shiny framework that streamlines the analysis, exploration, and visualization of single-cell experiments. With an interactive graphic interface, implemented as a web application, scX provides easy access to key scRNAseq analyses, including marker identification, gene expression profiling, and differential gene expression analysis. Additionally, scX seamlessly integrates with commonly used single-cell Seurat and SingleCellExperiment R objects, resulting in efficient processing and visualization of varied datasets. Overall, scX serves as a valuable and user-friendly tool for effortless exploration and sharing of single-cell data, simplifying some of the complexities inherent in scRNAseq analysis.

Availability and implementation: Source code can be downloaded from https://github.com/chernolabs/scX. A docker image is available from dockerhub as chernolabs/scx.

动机单细胞 RNA 测序(scRNAseq)改变了我们探索生物系统的能力。然而,熟练的专业知识对于处理和解释数据至关重要:在本文中,我们介绍了基于 Shiny 框架的 R 软件包 scX,它能简化单细胞实验的分析、探索和可视化。scX 采用交互式图形界面,以网络应用程序的形式实现,可轻松访问关键的 scRNAseq 分析,包括标记物鉴定、基因表达谱分析和差异基因表达分析。此外,scX 还能与常用的单细胞 Seurat 和 SingleCellExperiment R 对象无缝集成,从而实现各种数据集的高效处理和可视化。总之,scX 是一种有价值的用户友好型工具,可用于轻松探索和共享单细胞数据,简化了 scRNAseq 分析中固有的一些复杂性:源代码可从 https://github.com/chernolabs/scX 下载。可从 dockerhub 获取 docker 映像,即 chernolabs/scx。
{"title":"scX: a user-friendly tool for scRNAseq exploration.","authors":"Tomás V Waichman, M L Vercesi, Ariel A Berardino, Maximiliano S Beckel, Damiana Giacomini, Natalí B Rasetto, Magalí Herrero, Daniela J Di Bella, Paola Arlotta, Alejandro F Schinder, Ariel Chernomoretz","doi":"10.1093/bioadv/vbae062","DOIUrl":"10.1093/bioadv/vbae062","url":null,"abstract":"<p><strong>Motivation: </strong>Single-cell RNA sequencing (scRNAseq) has transformed our ability to explore biological systems. Nevertheless, proficient expertise is essential for handling and interpreting the data.</p><p><strong>Results: </strong>In this article, we present scX, an R package built on the Shiny framework that streamlines the analysis, exploration, and visualization of single-cell experiments. With an interactive graphic interface, implemented as a web application, scX provides easy access to key scRNAseq analyses, including marker identification, gene expression profiling, and differential gene expression analysis. Additionally, scX seamlessly integrates with commonly used single-cell Seurat and SingleCellExperiment R objects, resulting in efficient processing and visualization of varied datasets. Overall, scX serves as a valuable and user-friendly tool for effortless exploration and sharing of single-cell data, simplifying some of the complexities inherent in scRNAseq analysis.</p><p><strong>Availability and implementation: </strong>Source code can be downloaded from https://github.com/chernolabs/scX. A docker image is available from dockerhub as chernolabs/scx.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11109472/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141082442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Perspectives on tracking data reuse across biodata resources 跟踪生物数据资源中数据再利用情况的视角
Pub Date : 2024-04-25 DOI: 10.1093/bioadv/vbae057
Karen Ross, Frederic B Bastian, Matt Buys, Charles E Cook, Peter D'Eustachio, Melissa Harrison, H. Hermjakob, Donghui Li, Phillip Lord, Darren A Natale, Bjoern Peters, Paul W. Sternberg, Andrew I Su, Matthew Thakur, Paul D Thomas, Alex Bateman
Data reuse is a common and vital practice in molecular biology and enables the knowledge gathered over recent decades to drive discovery and innovation in the life sciences. Much of this knowledge has been collated into molecular biology databases, such as UniProtKB, and these resources derive enormous value from sharing data among themselves. However, quantifying and documenting this kind of data reuse remains a challenge. The paper reports on a one-day virtual workshop hosted by the UniProt Consortium in March 2023, attended by representatives from biodata resources, experts in data management, and NIH program managers. Workshop discussions focused on strategies for tracking data reuse, best practices for reusing data, and the challenges associated with data reuse and tracking. Surveys and discussions showed that data reuse is widespread, but critical information for reproducibility is sometimes lacking. Challenges include costs of tracking data reuse, tensions between tracking data and open sharing, restrictive licenses, and difficulties in tracking commercial data use. Recommendations that emerged from the discussion include: development of standardized formats for documenting data reuse, education about the obstacles posed by restrictive licenses, and continued recognition by funding agencies that data management is a critical activity that requires dedicated resources. Supplementary data are available at Bioinformatics Advances online.
数据再利用是分子生物学中常见的重要实践,它使近几十年来收集的知识能够推动生命科学的发现和创新。这些知识中的许多已被整理到分子生物学数据库(如 UniProtKB)中,这些资源从它们之间的数据共享中获得了巨大的价值。然而,量化和记录这种数据再利用仍然是一项挑战。 本文报告了 UniProt 联合会于 2023 年 3 月举办的为期一天的虚拟研讨会的情况,来自生物数据资源的代表、数据管理专家和美国国立卫生研究院(NIH)的项目经理参加了此次研讨会。研讨会重点讨论了数据再利用的跟踪策略、数据再利用的最佳实践以及与数据再利用和跟踪相关的挑战。调查和讨论结果表明,数据再利用非常普遍,但有时缺乏可重复性的关键信息。挑战包括跟踪数据再利用的成本、跟踪数据与开放共享之间的矛盾、限制性许可以及跟踪商业数据使用的困难。讨论中提出的建议包括:开发记录数据再利用的标准化格式,开展有关限制性许可所造成障碍的教育,以及资助机构继续认识到数据管理是一项需要专门资源的重要活动。 补充数据可在 Bioinformatics Advances 在线查阅。
{"title":"Perspectives on tracking data reuse across biodata resources","authors":"Karen Ross, Frederic B Bastian, Matt Buys, Charles E Cook, Peter D'Eustachio, Melissa Harrison, H. Hermjakob, Donghui Li, Phillip Lord, Darren A Natale, Bjoern Peters, Paul W. Sternberg, Andrew I Su, Matthew Thakur, Paul D Thomas, Alex Bateman","doi":"10.1093/bioadv/vbae057","DOIUrl":"https://doi.org/10.1093/bioadv/vbae057","url":null,"abstract":"\u0000 \u0000 \u0000 Data reuse is a common and vital practice in molecular biology and enables the knowledge gathered over recent decades to drive discovery and innovation in the life sciences. Much of this knowledge has been collated into molecular biology databases, such as UniProtKB, and these resources derive enormous value from sharing data among themselves. However, quantifying and documenting this kind of data reuse remains a challenge.\u0000 \u0000 \u0000 \u0000 The paper reports on a one-day virtual workshop hosted by the UniProt Consortium in March 2023, attended by representatives from biodata resources, experts in data management, and NIH program managers. Workshop discussions focused on strategies for tracking data reuse, best practices for reusing data, and the challenges associated with data reuse and tracking. Surveys and discussions showed that data reuse is widespread, but critical information for reproducibility is sometimes lacking. Challenges include costs of tracking data reuse, tensions between tracking data and open sharing, restrictive licenses, and difficulties in tracking commercial data use. Recommendations that emerged from the discussion include: development of standardized formats for documenting data reuse, education about the obstacles posed by restrictive licenses, and continued recognition by funding agencies that data management is a critical activity that requires dedicated resources.\u0000 \u0000 \u0000 \u0000 Supplementary data are available at Bioinformatics Advances online.\u0000","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140656991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MerCat2: a versatile k-mer counter and diversity estimator for database-independent property analysis obtained from omics data. MerCat2:一种通用的 k-mer计数器和多样性估算器,用于从 omics 数据中获得与数据库无关的属性分析。
Pub Date : 2024-04-24 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae061
Jose L Figueroa, Andrew Redinbo, Ajay Panyala, Sean Colby, Maren L Friesen, Lisa Tiemann, Richard Allen White

Motivation: MerCat2 ("Mer-Catenate2") is a versatile, parallel, scalable and modular property software package for robustly analyzing features in omics data. Using massively parallel sequencing raw reads, assembled contigs, and protein sequences from any platform as input, MerCat2 performs k-mer counting of any length k, resulting in feature abundance counts tables, quality control reports, protein feature metrics, and graphical representation (i.e. principal component analysis (PCA)).

Results: MerCat2 allows for direct analysis of data properties in a database-independent manner that initializes all data, which other profilers and assembly-based methods cannot perform. MerCat2 represents an integrated tool to illuminate omics data within a sample for rapid cross-examination and comparisons.

Availability and implementation: MerCat2 is written in Python and distributed under a BSD-3 license. The source code of MerCat2 is freely available at https://github.com/raw-lab/mercat2. MerCat2 is compatible with Python 3 on Mac OS X and Linux. MerCat2 can also be easily installed using bioconda: mamba create -n mercat2 -c conda-forge -c bioconda mercat2.

动机MerCat2("Mer-Catenate2")是一个多功能、并行、可扩展和模块化的属性软件包,用于对omics数据中的特征进行稳健分析。MerCat2 使用来自任何平台的大规模并行测序原始读数、组装 contigs 和蛋白质序列作为输入,执行任意长度 k 的 k-mer 计数,生成特征丰度计数表、质量控制报告、蛋白质特征度量和图形表示(即主成分分析 (PCA)):结果:MerCat2 允许以独立于数据库的方式直接分析数据属性,并对所有数据进行初始化,这是其他剖析器和基于组装的方法无法做到的。MerCat2 是一种综合工具,可用于快速交叉检验和比较样本中的组学数据:MerCat2 由 Python 编写,采用 BSD-3 许可发布。MerCat2 的源代码可在 https://github.com/raw-lab/mercat2 免费获取。MerCat2 与 Mac OS X 和 Linux 上的 Python 3 兼容。使用 bioconda 也能轻松安装 MerCat2:mamba create -n mercat2 -c conda-forge -c bioconda mercat2。
{"title":"MerCat2: a versatile <i>k</i>-mer counter and diversity estimator for database-independent property analysis obtained from omics data.","authors":"Jose L Figueroa, Andrew Redinbo, Ajay Panyala, Sean Colby, Maren L Friesen, Lisa Tiemann, Richard Allen White","doi":"10.1093/bioadv/vbae061","DOIUrl":"10.1093/bioadv/vbae061","url":null,"abstract":"<p><strong>Motivation: </strong>MerCat2 (\"Mer-Catenate2\") is a versatile, parallel, scalable and modular property software package for robustly analyzing features in omics data. Using massively parallel sequencing raw reads, assembled contigs, and protein sequences from any platform as input, MerCat2 performs <i>k</i>-mer counting of any length <i>k</i>, resulting in feature abundance counts tables, quality control reports, protein feature metrics, and graphical representation (i.e. principal component analysis (PCA)).</p><p><strong>Results: </strong>MerCat2 allows for direct analysis of data properties in a database-independent manner that initializes all data, which other profilers and assembly-based methods cannot perform. MerCat2 represents an integrated tool to illuminate omics data within a sample for rapid cross-examination and comparisons.</p><p><strong>Availability and implementation: </strong>MerCat2 is written in Python and distributed under a BSD-3 license. The source code of MerCat2 is freely available at https://github.com/raw-lab/mercat2. MerCat2 is compatible with Python 3 on Mac OS X and Linux. MerCat2 can also be easily installed using bioconda: mamba create -n mercat2 -c conda-forge -c bioconda mercat2.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11090762/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140923738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep IDA: A Deep Learning Approach for Integrative Discriminant Analysis of Multi-omics Data with Feature Ranking- An Application to COVID-19 深度 IDA:利用特征排序对多组学数据进行综合判别分析的深度学习方法--在 COVID-19 中的应用
Pub Date : 2024-04-24 DOI: 10.1093/bioadv/vbae060
Jiuzhou Wang, S. Safo
Many diseases are complex heterogeneous conditions that affect multiple organs in the body and depend on the interplay between several factors that include molecular and environmental factors, requiring a holistic approach to better understand disease pathobiology. Most existing methods for integrating data from multiple sources and classifying individuals into one of multiple classes or disease groups have mainly focused on linear relationships despite the complexity of these relationships. On the other hand, methods for nonlinear association and classification studies are limited in their ability to identify variables to aid in our understanding of the complexity of the disease or can be applied to only two data types. We propose Deep IDA (Integrative Discriminant Analysis), a deep learning method to learn complex nonlinear transformations of two or more views such that resulting projections have maximum association and maximum separation. Further, we propose a feature ranking approach based on ensemble learning for interpretatble results. We test Deep IDA on both simulated data and two large real-world datasets, including RNA sequencing, metabolomics, and proteomics data pertaining to COVID-19 severity. We identified signatures that better discriminated COVID-19 patient groups, and related to neurological conditions, cancer, and metabolic diseases, corroborating current research findings and heightening the need to study the post sequelae effects of COVID-19 to devise effective treatments and to improve patient care. Our algorithms are implemented in PyTorch and available at: https://github.com/JiuzhouW/DeepIDA. Supplementary materials are available at Bioinformatics Advances online
许多疾病都是影响体内多个器官的复杂异质病症,取决于包括分子和环境因素在内的多种因素之间的相互作用,因此需要采用整体方法来更好地了解疾病的病理生物学。尽管这些关系错综复杂,但大多数现有方法都主要关注线性关系,用于整合来自多个来源的数据,并将个体划分为多个类别或疾病组别之一。另一方面,用于非线性关联和分类研究的方法在识别变量以帮助我们理解疾病的复杂性方面能力有限,或者只能应用于两种数据类型。 我们提出了深度 IDA(整合判别分析),这是一种深度学习方法,用于学习两个或多个视图的复杂非线性变换,从而使产生的投影具有最大关联性和最大分离性。此外,我们还提出了一种基于集合学习的特征排序方法,以获得可解释的结果。我们在模拟数据和两个大型真实数据集(包括与 COVID-19 严重程度相关的 RNA 测序、代谢组学和蛋白质组学数据)上测试了 Deep IDA。我们发现了能更好地区分 COVID-19 患者群体的特征,这些特征与神经系统疾病、癌症和代谢性疾病相关,证实了当前的研究成果,并提高了研究 COVID-19 后遗症影响的必要性,从而设计出有效的治疗方法并改善患者护理。 我们的算法是在 PyTorch 中实现的,可在以下网址获取:https://github.com/JiuzhouW/DeepIDA。 补充材料可在 Bioinformatics Advances 在线查阅。
{"title":"Deep IDA: A Deep Learning Approach for Integrative Discriminant Analysis of Multi-omics Data with Feature Ranking- An Application to COVID-19","authors":"Jiuzhou Wang, S. Safo","doi":"10.1093/bioadv/vbae060","DOIUrl":"https://doi.org/10.1093/bioadv/vbae060","url":null,"abstract":"\u0000 \u0000 \u0000 Many diseases are complex heterogeneous conditions that affect multiple organs in the body and depend on the interplay between several factors that include molecular and environmental factors, requiring a holistic approach to better understand disease pathobiology. Most existing methods for integrating data from multiple sources and classifying individuals into one of multiple classes or disease groups have mainly focused on linear relationships despite the complexity of these relationships. On the other hand, methods for nonlinear association and classification studies are limited in their ability to identify variables to aid in our understanding of the complexity of the disease or can be applied to only two data types.\u0000 \u0000 \u0000 \u0000 We propose Deep IDA (Integrative Discriminant Analysis), a deep learning method to learn complex nonlinear transformations of two or more views such that resulting projections have maximum association and maximum separation. Further, we propose a feature ranking approach based on ensemble learning for interpretatble results. We test Deep IDA on both simulated data and two large real-world datasets, including RNA sequencing, metabolomics, and proteomics data pertaining to COVID-19 severity. We identified signatures that better discriminated COVID-19 patient groups, and related to neurological conditions, cancer, and metabolic diseases, corroborating current research findings and heightening the need to study the post sequelae effects of COVID-19 to devise effective treatments and to improve patient care.\u0000 \u0000 \u0000 \u0000 Our algorithms are implemented in PyTorch and available at: https://github.com/JiuzhouW/DeepIDA.\u0000 \u0000 \u0000 \u0000 Supplementary materials are available at Bioinformatics Advances online\u0000","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140659998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Syntactic sugars: crafting a regular expression framework for glycan structures. 语法糖:为聚糖结构设计正则表达式框架。
Pub Date : 2024-04-19 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae059
Alexander R Bennett, Daniel Bojar

Motivation: Structural analysis of glycans poses significant challenges in glycobiology due to their complex sequences. Research questions such as analyzing the sequence content of the α1-6 branch in N-glycans, are biologically meaningful yet can be hard to automate.

Results: Here, we introduce a regular expression system, designed for glycans, feature-complete, and closely aligned with regular expression formatting. We use this to annotate glycan motifs of arbitrary complexity, perform differential expression analysis on designated sequence stretches, or elucidate branch-specific binding specificities of lectins in an automated manner. We are confident that glycan regular expressions will empower computational analyses of these sequences.

Availability and implementation: Our regular expression framework for glycans is implemented in Python and is incorporated into the open-source glycowork package (version 1.1+). Code and documentation are available at https://github.com/BojarLab/glycowork/blob/master/glycowork/motif/regex.py.

动机:由于聚糖序列复杂,对其进行结构分析是糖生物学领域的重大挑战。诸如分析 N-聚糖中 α1-6 分支的序列内容等研究问题具有生物学意义,但却很难实现自动化:在这里,我们介绍了一种正则表达式系统,该系统专为聚糖设计,特征完整,并与正则表达式格式紧密结合。我们用它来注释任意复杂程度的聚糖图案,对指定的序列片段进行差异表达分析,或以自动化方式阐明凝集素的分支特异性结合。我们相信,聚糖正则表达式将增强这些序列的计算分析能力:我们的聚糖正则表达式框架是用 Python 实现的,并纳入了开源的 glycowork 软件包(版本 1.1+)。代码和文档可从 https://github.com/BojarLab/glycowork/blob/master/glycowork/motif/regex.py 获取。
{"title":"Syntactic sugars: crafting a regular expression framework for glycan structures.","authors":"Alexander R Bennett, Daniel Bojar","doi":"10.1093/bioadv/vbae059","DOIUrl":"https://doi.org/10.1093/bioadv/vbae059","url":null,"abstract":"<p><strong>Motivation: </strong>Structural analysis of glycans poses significant challenges in glycobiology due to their complex sequences. Research questions such as analyzing the sequence content of the α1-6 branch in <i>N</i>-glycans, are biologically meaningful yet can be hard to automate.</p><p><strong>Results: </strong>Here, we introduce a regular expression system, designed for glycans, feature-complete, and closely aligned with regular expression formatting. We use this to annotate glycan motifs of arbitrary complexity, perform differential expression analysis on designated sequence stretches, or elucidate branch-specific binding specificities of lectins in an automated manner. We are confident that glycan regular expressions will empower computational analyses of these sequences.</p><p><strong>Availability and implementation: </strong>Our regular expression framework for glycans is implemented in Python and is incorporated into the open-source glycowork package (version 1.1+). Code and documentation are available at https://github.com/BojarLab/glycowork/blob/master/glycowork/motif/regex.py.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11069104/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140873530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GLIMMERS: glioma molecular markers exploration using long-read sequencing. GLIMMERS:利用长线程测序探索胶质瘤分子标记物。
Pub Date : 2024-04-15 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae058
Wichayapat Thongrattana, Tantip Arigul, Bhoom Suktitipat, Manop Pithukpakorn, Sith Sathornsumetee, Thidathip Wongsurawat, Piroon Jenjaroenpun

Summary: The revised WHO guidelines for classifying and grading brain tumors include several copy number variation (CNV) markers. The turnaround time for detecting CNVs and alterations throughout the entire genome is drastically reduced with the customized read incremental approach on the nanopore platform. However, this approach is challenging for non-bioinformaticians due to the need to use multiple software tools, extract CNV markers and interpret results, which creates barriers due to the time and specialized resources that are necessary. To address this problem and help clinicians classify and grade brain tumors, we developed GLIMMERS: glioma molecular markers exploration using long-read sequencing, an open-access tool that automatically analyzes nanopore-based CNV data and generates simplified reports.

Availability and implementation: GLIMMERS is available at https://gitlab.com/silol_public/glimmers under the terms of the MIT license.

摘要:世卫组织修订的脑肿瘤分类和分级指南包括多个拷贝数变异(CNV)标记。利用纳米孔平台上的定制增量读取方法,检测整个基因组中的 CNV 和改变的周转时间大大缩短。然而,由于需要使用多种软件工具、提取 CNV 标记和解释结果,这种方法对非生物信息学家来说具有挑战性,这就造成了必要的时间和专业资源方面的障碍。为了解决这个问题并帮助临床医生对脑肿瘤进行分类和分级,我们开发了GLIMMERS:利用长读数测序探索胶质瘤分子标记物,这是一种开放获取的工具,可自动分析基于纳米孔的CNV数据并生成简化报告:GLIMMERS可在https://gitlab.com/silol_public/glimmers,采用MIT许可条款。
{"title":"GLIMMERS: glioma molecular markers exploration using long-read sequencing.","authors":"Wichayapat Thongrattana, Tantip Arigul, Bhoom Suktitipat, Manop Pithukpakorn, Sith Sathornsumetee, Thidathip Wongsurawat, Piroon Jenjaroenpun","doi":"10.1093/bioadv/vbae058","DOIUrl":"10.1093/bioadv/vbae058","url":null,"abstract":"<p><strong>Summary: </strong>The revised WHO guidelines for classifying and grading brain tumors include several copy number variation (CNV) markers. The turnaround time for detecting CNVs and alterations throughout the entire genome is drastically reduced with the customized read incremental approach on the nanopore platform. However, this approach is challenging for non-bioinformaticians due to the need to use multiple software tools, extract CNV markers and interpret results, which creates barriers due to the time and specialized resources that are necessary. To address this problem and help clinicians classify and grade brain tumors, we developed GLIMMERS: glioma molecular markers exploration using long-read sequencing, an open-access tool that automatically analyzes nanopore-based CNV data and generates simplified reports.</p><p><strong>Availability and implementation: </strong>GLIMMERS is available at https://gitlab.com/silol_public/glimmers under the terms of the MIT license.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11087932/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140913557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FlexStat: Combinatory differentially expressed protein extraction FlexStat:组合式差异表达蛋白质提取
Pub Date : 2024-04-11 DOI: 10.1093/bioadv/vbae056
Senuri De Silva, Asfa Alli-Shaik, J. Gunaratne
Mass spectrometry-based system proteomics allows identification of dysregulated protein hubs and associated disease-related features. Obtaining differentially expressed proteins (DEPs) is the most important step of downstream bioinformatics analysis. However, the extraction of statistically significant DEPs from datasets with multiple experimental conditions or disease types through currently available tools remains a laborious task. More often such an analysis requires considerable bioinformatics expertise, making it inaccessible to researchers with limited computational analytics experience. To uncover the differences among the many conditions within the data in a user-friendly manner, here we introduce FlexStat, a web-based interface that extracts DEPs through combinatory analysis. This tool accepts a protein expression matrix as input and systematically generates DEP results for every conceivable combination of various experimental conditions or disease types. FlexStat includes a suite of robust statistical tools for data preprocessing, in addition to DEP extraction, and publication-ready visualization, which are built on established R scientific libraries in an automated manner. This analytics suite was validated in diverse public proteomic datasets to showcase its high performance of rapid and simultaneous pairwise comparisons of comprehensive data sets. FlexStat is implemented in R and is freely available at https://jglab.shinyapps.io/flexstatv1-pipeline-only/. The source code is accessible at https://github.com/kts-desilva/FlexStat/tree/main. Supplementary data are available at Bioinformatics Advances online.
基于质谱技术的系统蛋白质组学可以识别调控失调的蛋白质中心和相关的疾病特征。获取差异表达蛋白(DEPs)是下游生物信息学分析最重要的一步。然而,通过现有工具从具有多种实验条件或疾病类型的数据集中提取具有统计学意义的差异表达蛋白仍然是一项艰巨的任务。这种分析往往需要大量的生物信息学专业知识,这使得计算分析经验有限的研究人员无法胜任。 为了以用户友好的方式揭示数据中多种条件之间的差异,我们在此介绍 FlexStat,这是一种基于网络的界面,可通过组合分析提取 DEPs。该工具接受蛋白质表达矩阵作为输入,并为各种实验条件或疾病类型的每一种可想象的组合系统地生成 DEP 结果。FlexStat 包括一套强大的统计工具,用于数据预处理、DEP 提取和可发布的可视化,这些工具都是以自动化方式建立在成熟的 R 科学库上。该分析套件已在各种公共蛋白质组数据集中进行了验证,以展示其对综合数据集进行快速、同步配对比较的高性能。 FlexStat 使用 R 语言实现,可在 https://jglab.shinyapps.io/flexstatv1-pipeline-only/ 免费获取。源代码可从 https://github.com/kts-desilva/FlexStat/tree/main 获取。 补充数据可在 Bioinformatics Advances 在线查阅。
{"title":"FlexStat: Combinatory differentially expressed protein extraction","authors":"Senuri De Silva, Asfa Alli-Shaik, J. Gunaratne","doi":"10.1093/bioadv/vbae056","DOIUrl":"https://doi.org/10.1093/bioadv/vbae056","url":null,"abstract":"\u0000 \u0000 \u0000 Mass spectrometry-based system proteomics allows identification of dysregulated protein hubs and associated disease-related features. Obtaining differentially expressed proteins (DEPs) is the most important step of downstream bioinformatics analysis. However, the extraction of statistically significant DEPs from datasets with multiple experimental conditions or disease types through currently available tools remains a laborious task. More often such an analysis requires considerable bioinformatics expertise, making it inaccessible to researchers with limited computational analytics experience.\u0000 \u0000 \u0000 \u0000 To uncover the differences among the many conditions within the data in a user-friendly manner, here we introduce FlexStat, a web-based interface that extracts DEPs through combinatory analysis. This tool accepts a protein expression matrix as input and systematically generates DEP results for every conceivable combination of various experimental conditions or disease types. FlexStat includes a suite of robust statistical tools for data preprocessing, in addition to DEP extraction, and publication-ready visualization, which are built on established R scientific libraries in an automated manner. This analytics suite was validated in diverse public proteomic datasets to showcase its high performance of rapid and simultaneous pairwise comparisons of comprehensive data sets.\u0000 \u0000 \u0000 \u0000 FlexStat is implemented in R and is freely available at https://jglab.shinyapps.io/flexstatv1-pipeline-only/. The source code is accessible at https://github.com/kts-desilva/FlexStat/tree/main.\u0000 \u0000 \u0000 \u0000 Supplementary data are available at Bioinformatics Advances online.\u0000","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140713192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OpenAnnotateApi: Python and R packages to efficiently annotate and analyze chromatin accessibility of genomic regions OpenAnnotateApi:用于高效注释和分析基因组区域染色质可及性的 Python 和 R 软件包
Pub Date : 2024-04-10 DOI: 10.1093/bioadv/vbae055
Zijing Gao, Rui Jiang, Shengquan Chen
Abstract Summary Chromatin accessibility serves as a critical measurement of physical contact between nuclear macromolecules and DNA sequence, providing valuable insights into the comprehensive landscape of regulatory mechanisms, thus we previously developed the OpenAnnotate web server. However, as an increasing number of epigenomic analysis software tools emerged, web-based annotation often faced limitations and inconveniences when integrated into these software pipelines. To address these issues, we here develop two software packages named OpenAnnotatePy and OpenAnnotateR. In addition to web-based functionalities, these packages encompass supplementary features, including the capability for simultaneous annotation across multiple cell types, advanced searching of systems, tissues and cell types, and converting the result to the data structure of mainstream tools. Moreover, we applied the packages to various scenarios, including cell type revealing, regulatory element prediction, and integration into mainstream single-cell ATAC-seq analysis pipelines including EpiScanpy, Signac, and ArchR. We anticipate that OpenAnnotateApi will significantly facilitate the deciphering of gene regulatory mechanisms, and offer crucial assistance in the field of epigenomic studies. Availability and implementation OpenAnnotateApi for R is available at https://github.com/ZjGaothu/OpenAnnotateR and for Python is available at https://github.com/ZjGaothu/OpenAnnotatePy.
摘要 染色质可及性是衡量核大分子与 DNA 序列之间物理接触的关键指标,为全面了解调控机制提供了宝贵的信息,因此我们之前开发了 OpenAnnotate 网络服务器。然而,随着表观基因组分析软件工具的不断涌现,基于网络的注释在集成到这些软件管道时往往面临限制和不便。为了解决这些问题,我们在此开发了两个软件包,分别名为 OpenAnnotatePy 和 OpenAnnotateR。 除了基于网络的功能外,这两个软件包还包含一些辅助功能,包括跨多种细胞类型同时注释,系统、组织和细胞类型的高级搜索,以及将结果转换为主流工具的数据结构。此外,我们还将这些软件包应用于各种情况,包括细胞类型揭示、调控元件预测以及集成到主流单细胞 ATAC-seq 分析管道(包括 EpiScanpy、Signac 和 ArchR)中。我们预计 OpenAnnotateApi 将极大地促进基因调控机制的破译,并为表观基因组研究领域提供重要帮助。可用性和实现 OpenAnnotateApi 的 R 语言版本可在 https://github.com/ZjGaothu/OpenAnnotateR 上获取,Python 语言版本可在 https://github.com/ZjGaothu/OpenAnnotatePy 上获取。
{"title":"OpenAnnotateApi: Python and R packages to efficiently annotate and analyze chromatin accessibility of genomic regions","authors":"Zijing Gao, Rui Jiang, Shengquan Chen","doi":"10.1093/bioadv/vbae055","DOIUrl":"https://doi.org/10.1093/bioadv/vbae055","url":null,"abstract":"Abstract Summary Chromatin accessibility serves as a critical measurement of physical contact between nuclear macromolecules and DNA sequence, providing valuable insights into the comprehensive landscape of regulatory mechanisms, thus we previously developed the OpenAnnotate web server. However, as an increasing number of epigenomic analysis software tools emerged, web-based annotation often faced limitations and inconveniences when integrated into these software pipelines. To address these issues, we here develop two software packages named OpenAnnotatePy and OpenAnnotateR. In addition to web-based functionalities, these packages encompass supplementary features, including the capability for simultaneous annotation across multiple cell types, advanced searching of systems, tissues and cell types, and converting the result to the data structure of mainstream tools. Moreover, we applied the packages to various scenarios, including cell type revealing, regulatory element prediction, and integration into mainstream single-cell ATAC-seq analysis pipelines including EpiScanpy, Signac, and ArchR. We anticipate that OpenAnnotateApi will significantly facilitate the deciphering of gene regulatory mechanisms, and offer crucial assistance in the field of epigenomic studies. Availability and implementation OpenAnnotateApi for R is available at https://github.com/ZjGaothu/OpenAnnotateR and for Python is available at https://github.com/ZjGaothu/OpenAnnotatePy.","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140720228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: wTSA-CRAFT: an open-access web server for rapid analysis of thermal shift assay experiments. 更正:wTSA-CRAFT:用于快速分析热移实验的开放式网络服务器。
Pub Date : 2024-04-09 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae050

[This corrects the article DOI: 10.1093/bioadv/vbad136.].

[此处更正了文章 DOI:10.1093/bioadv/vbad136]。
{"title":"Correction to: w<i>TSA-CRAFT</i>: an open-access web server for rapid analysis of thermal shift assay experiments.","authors":"","doi":"10.1093/bioadv/vbae050","DOIUrl":"https://doi.org/10.1093/bioadv/vbae050","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.1093/bioadv/vbad136.].</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11004552/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140869477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Bioinformatics advances
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1