首页 > 最新文献

Bioinformatics (Oxford, England)最新文献

英文 中文
Causal relationships between diseases mined from the literature improve the use of polygenic risk scores. 从文献中挖掘出的疾病之间的因果关系改进了多基因风险评分的使用。
Pub Date : 2024-11-01 DOI: 10.1093/bioinformatics/btae639
Sumyyah Toonsi, Iris Ivy Gauran, Hernando Ombao, Paul N Schofield, Robert Hoehndorf

Motivation: Identifying causal relations between diseases allows for the study of shared pathways, biological mechanisms, and inter-disease risks. Such causal relations can facilitate the identification of potential disease precursors and candidates for drug re-purposing. However, computational methods often lack access to these causal relations. Few approaches have been developed to automatically extract causal relationships between diseases from unstructured text, but they are often only focused on a small number of diseases, lack validation of the extracted causal relations, or do not make their data available.

Results: We automatically mined statements asserting a causal relation between diseases from the scientific literature by leveraging lexical patterns. Following automated mining of causal relations, we mapped the diseases to the International Classification of Diseases (ICD) identifiers to allow the direct application to clinical data. We provide quantitative and qualitative measures to evaluate the mined causal relations and compare to UK Biobank diagnosis data as a completely independent data source. The validated causal associations were used to create a directed acyclic graph that can be used by causal inference frameworks. We demonstrate the utility of our causal network by performing causal inference using the do-calculus, using relations within the graph to construct and improve polygenic risk scores, and disentangle the pleiotropic effects of variants.

Availability and implementation: The data are available through https://github.com/bio-ontology-research-group/causal-relations-between-diseases.

动机确定疾病之间的因果关系有助于研究共同的途径、生物机制和疾病间的风险。这种因果关系有助于识别潜在的疾病前兆和候选药物的再利用。然而,计算方法往往无法获取这些因果关系。从非结构化文本中自动提取疾病间因果关系的方法很少,但这些方法往往只关注少数疾病,缺乏对所提取因果关系的验证,或者不提供数据:结果:我们利用词汇模式自动挖掘科学文献中断言疾病之间存在因果关系的语句。在自动挖掘因果关系后,我们将疾病映射到国际疾病分类(ICD)标识符,以便直接应用于临床数据。我们提供了定量和定性措施来评估挖掘出的因果关系,并与作为完全独立数据源的英国生物库(UKB)诊断数据进行比较。经过验证的因果关联被用于创建有向无环图,该图可用于因果推理框架。我们使用 do-calculus 进行因果推理,利用图中的关系构建和改进多基因风险评分,并分离变异的多向效应,从而证明了我们的因果网络的实用性:数据可通过 https://github.com/bio-ontology-research-group/causal-relations-between-diseases.Supplementary 信息获取:补充数据可在 Bioinformatics online 上获取。
{"title":"Causal relationships between diseases mined from the literature improve the use of polygenic risk scores.","authors":"Sumyyah Toonsi, Iris Ivy Gauran, Hernando Ombao, Paul N Schofield, Robert Hoehndorf","doi":"10.1093/bioinformatics/btae639","DOIUrl":"10.1093/bioinformatics/btae639","url":null,"abstract":"<p><strong>Motivation: </strong>Identifying causal relations between diseases allows for the study of shared pathways, biological mechanisms, and inter-disease risks. Such causal relations can facilitate the identification of potential disease precursors and candidates for drug re-purposing. However, computational methods often lack access to these causal relations. Few approaches have been developed to automatically extract causal relationships between diseases from unstructured text, but they are often only focused on a small number of diseases, lack validation of the extracted causal relations, or do not make their data available.</p><p><strong>Results: </strong>We automatically mined statements asserting a causal relation between diseases from the scientific literature by leveraging lexical patterns. Following automated mining of causal relations, we mapped the diseases to the International Classification of Diseases (ICD) identifiers to allow the direct application to clinical data. We provide quantitative and qualitative measures to evaluate the mined causal relations and compare to UK Biobank diagnosis data as a completely independent data source. The validated causal associations were used to create a directed acyclic graph that can be used by causal inference frameworks. We demonstrate the utility of our causal network by performing causal inference using the do-calculus, using relations within the graph to construct and improve polygenic risk scores, and disentangle the pleiotropic effects of variants.</p><p><strong>Availability and implementation: </strong>The data are available through https://github.com/bio-ontology-research-group/causal-relations-between-diseases.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Collapsible tree: interactive web app to present collapsible hierarchies. 可折叠树:交互式网络应用程序,用于呈现可折叠的层次结构。
Pub Date : 2024-11-01 DOI: 10.1093/bioinformatics/btae645
Yuan Gao, Rob Patro, Peng Jiang

Motivation: A crucial component of intuitive data visualization is presenting a hierarchical tree structure with interactive functions. For example, single-cell transcriptomics studies may generate gene expression values with developmental trajectories or cell lineage structures. Two common visualization methods, t-Distributed Stochastic Neighbor Embedding (t-SNE) and Uniform Manifold Approximation and Projection (UMAP), require two separate figures to depict the distribution of cell types and gene expression data, with low-dimension projections that may not capture the hierarchical structures among cells.

Results: Here, we present a JavaScript framework and an interactive web app named Collapsible Tree, which presents values jointly with interactive, expandable, and collapsible lineage structures. For example, the Collapsible Tree presents cellular states and gene expression from single-cell transcriptomics within a single hierarchical plot, enabling comparisons of gene expression across lineages and subtle patterns between sub-lineages. Our framework can facilitate the exploration of complicated value patterns that are not evident in UMAP or t-SNE plots.

Availability and implementation: The Collapsible Tree web interface is available at https://collapsibletree.data2in.net. The JavaScript library source code is available at https://github.com/data2intelligence/collapsible_tree.

动机直观数据可视化的一个重要组成部分是呈现具有交互功能的分层树结构。例如,单细胞转录组学研究可能会产生具有发育轨迹或细胞系结构的基因表达值。t-SNE和UMAP这两种常见的可视化方法需要两个独立的图来描述细胞类型和基因表达数据的分布,而低维度的投影可能无法捕捉细胞间的层次结构:在此,我们介绍了一个 JavaScript 框架和一个名为 "可折叠树 "的交互式网络应用程序,它能以交互式、可扩展和可折叠的系谱结构联合呈现数值。例如,"可折叠树 "将单细胞转录组学中的细胞状态和基因表达呈现在单个层次图中,从而可以比较各系间的基因表达以及子系间的微妙模式。我们的框架有助于探索在 UMAP 或 t-SNE 图中不明显的复杂值模式:Collapsible Tree 网络界面可在 https://collapsibletree.data2in.net 上获取。JavaScript 库源代码可在 https://github.com/data2intelligence/collapsible_tree 上获取。
{"title":"Collapsible tree: interactive web app to present collapsible hierarchies.","authors":"Yuan Gao, Rob Patro, Peng Jiang","doi":"10.1093/bioinformatics/btae645","DOIUrl":"10.1093/bioinformatics/btae645","url":null,"abstract":"<p><strong>Motivation: </strong>A crucial component of intuitive data visualization is presenting a hierarchical tree structure with interactive functions. For example, single-cell transcriptomics studies may generate gene expression values with developmental trajectories or cell lineage structures. Two common visualization methods, t-Distributed Stochastic Neighbor Embedding (t-SNE) and Uniform Manifold Approximation and Projection (UMAP), require two separate figures to depict the distribution of cell types and gene expression data, with low-dimension projections that may not capture the hierarchical structures among cells.</p><p><strong>Results: </strong>Here, we present a JavaScript framework and an interactive web app named Collapsible Tree, which presents values jointly with interactive, expandable, and collapsible lineage structures. For example, the Collapsible Tree presents cellular states and gene expression from single-cell transcriptomics within a single hierarchical plot, enabling comparisons of gene expression across lineages and subtle patterns between sub-lineages. Our framework can facilitate the exploration of complicated value patterns that are not evident in UMAP or t-SNE plots.</p><p><strong>Availability and implementation: </strong>The Collapsible Tree web interface is available at https://collapsibletree.data2in.net. The JavaScript library source code is available at https://github.com/data2intelligence/collapsible_tree.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11543613/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
InterLabelGO+: unraveling label correlations in protein function prediction. InterLabelGO+:揭示蛋白质功能预测中的标签相关性
Pub Date : 2024-11-01 DOI: 10.1093/bioinformatics/btae655
Quancheng Liu, Chengxin Zhang, Lydia Freddolino

Motivation: Accurate protein function prediction is crucial for understanding biological processes and advancing biomedical research. However, the rapid growth of protein sequences far outpaces the experimental characterization of their functions, necessitating the development of automated computational methods.

Results: We present InterLabelGO+, a hybrid approach that integrates a deep learning-based method with an alignment-based method for improved protein function prediction. InterLabelGO+ incorporates a novel loss function that addresses label dependency and imbalance and further enhances performance through dynamic weighting of the alignment-based component. A preliminary version of InterLabelGO+ achieved a strong performance in the CAFA5 challenge, ranking sixth out of 1625 participating teams. Comprehensive evaluations on large-scale protein function prediction tasks demonstrate InterLabelGO+'s ability to accurately predict Gene Ontology terms across various functional categories and evaluation metrics.

Availability and implementation: The source code and datasets for InterLabelGO+ are freely available on GitHub at https://github.com/QuanEvans/InterLabelGO. A web-server is available at https://seq2fun.dcmb.med.umich.edu/InterLabelGO/. The software is implemented in Python and PyTorch, and is supported on Linux and macOS.

动机准确预测蛋白质功能对于了解生物过程和推动生物医学研究至关重要。然而,蛋白质序列的快速增长远远超过了对其功能的实验表征,因此有必要开发自动计算方法:我们提出的 InterLabelGO+ 是一种混合方法,它整合了基于深度学习的方法和基于比对的方法,用于改进蛋白质功能预测。InterLabelGO+ 采用了一种新颖的损失函数来解决标签依赖性和不平衡性问题,并通过对基于配准的部分进行动态加权来进一步提高性能。InterLabelGO+ 的初步版本在 CAFA5 挑战赛中表现出色,在 1625 个参赛团队中排名第六。对大规模蛋白质功能预测任务的综合评估表明,InterLabelGO+ 能够准确预测不同功能类别和评估指标的基因本体术语:InterLabelGO+ 的源代码和数据集可在 GitHub 上免费获取,网址为 https://github.com/QuanEvans/InterLabelGO。该软件使用 Python 和 PyTorch 实现,支持 Linux 和 macOS:补充图、表和数据可在 Bioinformatics online 上获取。
{"title":"InterLabelGO+: unraveling label correlations in protein function prediction.","authors":"Quancheng Liu, Chengxin Zhang, Lydia Freddolino","doi":"10.1093/bioinformatics/btae655","DOIUrl":"10.1093/bioinformatics/btae655","url":null,"abstract":"<p><strong>Motivation: </strong>Accurate protein function prediction is crucial for understanding biological processes and advancing biomedical research. However, the rapid growth of protein sequences far outpaces the experimental characterization of their functions, necessitating the development of automated computational methods.</p><p><strong>Results: </strong>We present InterLabelGO+, a hybrid approach that integrates a deep learning-based method with an alignment-based method for improved protein function prediction. InterLabelGO+ incorporates a novel loss function that addresses label dependency and imbalance and further enhances performance through dynamic weighting of the alignment-based component. A preliminary version of InterLabelGO+ achieved a strong performance in the CAFA5 challenge, ranking sixth out of 1625 participating teams. Comprehensive evaluations on large-scale protein function prediction tasks demonstrate InterLabelGO+'s ability to accurately predict Gene Ontology terms across various functional categories and evaluation metrics.</p><p><strong>Availability and implementation: </strong>The source code and datasets for InterLabelGO+ are freely available on GitHub at https://github.com/QuanEvans/InterLabelGO. A web-server is available at https://seq2fun.dcmb.med.umich.edu/InterLabelGO/. The software is implemented in Python and PyTorch, and is supported on Linux and macOS.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568131/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142585161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Biologically-informed killer cell immunoglobulin-like receptor gene annotation tool. 基于生物学信息的杀伤细胞免疫球蛋白样受体(KIR)基因注释工具。
Pub Date : 2024-11-01 DOI: 10.1093/bioinformatics/btae622
Michael K B Ford, Ananth Hari, Qinghui Zhou, Ibrahim Numanagić, S Cenk Sahinalp

Summary: Natural killer (NK) cells are essential components of the innate immune system, with their activity significantly regulated by Killer cell Immunoglobulin-like Receptors (KIRs). The diversity and structural complexity of KIR genes present significant challenges for accurate genotyping, essential for understanding NK cell functions and their implications in health and disease. Traditional genotyping methods struggle with the variable nature of KIR genes, leading to inaccuracies that can impede immunogenetic research. These challenges extend to high-quality phased assemblies, which have been recently popularized by the Human Pangenome Consortium. This article introduces BAKIR (Biologically informed Annotator for KIR locus), a tailored computational tool designed to overcome the challenges of KIR genotyping and annotation on high-quality, phased genome assemblies. BAKIR aims to enhance the accuracy of KIR gene annotations by structuring its annotation pipeline around identifying key functional mutations, thereby improving the identification and subsequent relevance of gene and allele calls. It uses a multi-stage mapping, alignment, and variant calling process to ensure high-precision gene and allele identification, while also maintaining high recall for sequences that are significantly mutated or truncated relative to the known allele database. BAKIR has been evaluated on a subset of the HPRC assemblies, where BAKIR was able to improve many of the associated annotations and call novel variants. BAKIR is freely available on GitHub, offering ease of access and use through multiple installation methods, including pip, conda, and singularity container, and is equipped with a user-friendly command-line interface, thereby promoting its adoption in the scientific community.

Availability and implementation: BAKIR is available at github.com/algo-cancer/bakir.

摘要:自然杀伤(NK)细胞是先天性免疫系统的重要组成部分,其活性受杀伤细胞免疫球蛋白样受体(KIR)的重要调节。KIR 基因的多样性和结构复杂性给准确的基因分型带来了巨大挑战,而准确的基因分型对于了解 NK 细胞的功能及其对健康和疾病的影响至关重要。传统的基因分型方法难以应对 KIR 基因的多变性,从而导致不准确性,阻碍了免疫遗传学的研究。这些挑战延伸到了高质量的分阶段组装,最近人类泛基因组联盟(Human Pangenome Consortium)推广了这种组装方法。本文介绍了 BAKIR(Biologically-informed Annotator for KIR locus),这是一种量身定制的计算工具,旨在克服在高质量分阶段基因组组装上进行 KIR 基因分型和注释所面临的挑战。BAKIR 的目标是通过围绕识别关键功能突变来构建其注释管道,从而提高 KIR 基因注释的准确性,从而改善基因和等位基因调用的识别和后续相关性。它采用多阶段映射、比对和变异调用过程,确保高精度的基因和等位基因鉴定,同时还能对相对于已知等位基因数据库有明显突变或截断的序列保持较高的召回率。BAKIR 已在 HPRC 集合的一个子集上进行了评估,BAKIR 能够改进许多相关注释并调用新的变异。BAKIR 可在 GitHub 上免费获取,通过多种安装方法(包括 pip、conda 和 singularity container)轻松访问和使用,并配备了用户友好的命令行界面,从而促进了其在科学界的应用:BAKIR 可在 github.com/algo-cancer/bakir 上获取:补充数据可在 Bioinformatics online 上获取。
{"title":"Biologically-informed killer cell immunoglobulin-like receptor gene annotation tool.","authors":"Michael K B Ford, Ananth Hari, Qinghui Zhou, Ibrahim Numanagić, S Cenk Sahinalp","doi":"10.1093/bioinformatics/btae622","DOIUrl":"10.1093/bioinformatics/btae622","url":null,"abstract":"<p><strong>Summary: </strong>Natural killer (NK) cells are essential components of the innate immune system, with their activity significantly regulated by Killer cell Immunoglobulin-like Receptors (KIRs). The diversity and structural complexity of KIR genes present significant challenges for accurate genotyping, essential for understanding NK cell functions and their implications in health and disease. Traditional genotyping methods struggle with the variable nature of KIR genes, leading to inaccuracies that can impede immunogenetic research. These challenges extend to high-quality phased assemblies, which have been recently popularized by the Human Pangenome Consortium. This article introduces BAKIR (Biologically informed Annotator for KIR locus), a tailored computational tool designed to overcome the challenges of KIR genotyping and annotation on high-quality, phased genome assemblies. BAKIR aims to enhance the accuracy of KIR gene annotations by structuring its annotation pipeline around identifying key functional mutations, thereby improving the identification and subsequent relevance of gene and allele calls. It uses a multi-stage mapping, alignment, and variant calling process to ensure high-precision gene and allele identification, while also maintaining high recall for sequences that are significantly mutated or truncated relative to the known allele database. BAKIR has been evaluated on a subset of the HPRC assemblies, where BAKIR was able to improve many of the associated annotations and call novel variants. BAKIR is freely available on GitHub, offering ease of access and use through multiple installation methods, including pip, conda, and singularity container, and is equipped with a user-friendly command-line interface, thereby promoting its adoption in the scientific community.</p><p><strong>Availability and implementation: </strong>BAKIR is available at github.com/algo-cancer/bakir.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11549020/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142482929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep coupled registration and segmentation of multimodal whole-brain images. 多模态全脑图像的深度耦合配准和分割。
Pub Date : 2024-11-01 DOI: 10.1093/bioinformatics/btae606
Tingting Han, Jun Wu, Pengpeng Sheng, Yuanyuan Li, ZaiYang Tao, Lei Qu

Motivation: Recent brain mapping efforts are producing large-scale whole-brain images using different imaging modalities. Accurate alignment and delineation of anatomical structures in these images are essential for numerous studies. These requirements are typically modeled as two distinct tasks: registration and segmentation. However, prevailing methods, fail to fully explore and utilize the inherent correlation and complementarity between the two tasks. Furthermore, variations in brain anatomy, brightness, and texture pose another formidable challenge in designing multi-modal similarity metrics. A high-throughput approach capable of overcoming the bottleneck of multi-modal similarity metric design, while effective leveraging the highly correlated and complementary nature of two tasks is highly desirable.

Results: We introduce a deep learning framework for joint registration and segmentation of multi-modal brain images. Under this framework, registration and segmentation tasks are deeply coupled and collaborated at two hierarchical layers. In the inner layer, we establish a strong feature-level coupling between the two tasks by learning a unified common latent feature representation. In the outer layer, we introduce a mutually supervised dual-branch network to decouple latent features and facilitate task-level collaboration between registration and segmentation. Since the latent features we designed are also modality-independent, the bottleneck of designing multi-modal similarity metric is essentially addressed. Another merit offered by this framework is the interpretability of latent features, which allows intuitive manipulation of feature learning, thereby further enhancing network training efficiency and the performance of both tasks. Extensive experiments conducted on both multi-modal and mono-modal datasets of mouse and human brains demonstrate the superiority of our method.

Availability and implementation: The code is available at https://github.com/tingtingup/DCRS.

动机最近的脑图绘制工作正在利用不同的成像模式制作大规模的全脑图像。这些图像中解剖结构的精确配准和划分对许多研究至关重要。这些要求通常被模拟为两个不同的任务:配准和分割。然而,目前流行的方法未能充分探索和利用这两项任务之间固有的相关性和互补性。此外,大脑解剖结构、亮度和纹理的变化也给设计多模态相似度指标带来了巨大挑战。我们非常需要一种能够克服多模态相似性度量设计瓶颈的高通量方法,同时有效利用两个任务的高度相关性和互补性:我们为多模态大脑图像的联合配准和分割引入了一个深度学习框架。在这一框架下,配准和分割任务在两个层次上深度耦合和协作。在内层,我们通过学习统一的通用潜在特征表示,在两个任务之间建立了强大的特征级耦合。在外层,我们引入了一个相互监督的双分支网络,以解耦潜在特征,促进配准和分割之间的任务级协作。由于我们设计的潜在特征也与模式无关,因此从根本上解决了设计多模式相似性度量的瓶颈问题。该框架的另一个优点是潜在特征的可解释性,可以直观地操作特征学习,从而进一步提高网络训练效率和两个任务的性能。在小鼠和人类大脑的多模态和单模态数据集上进行的大量实验证明了我们方法的优越性:代码见 https://github.com/tingtingup/DCRS.Supplementary 信息:补充数据可在 Bioinformaticsonline 上获取。
{"title":"Deep coupled registration and segmentation of multimodal whole-brain images.","authors":"Tingting Han, Jun Wu, Pengpeng Sheng, Yuanyuan Li, ZaiYang Tao, Lei Qu","doi":"10.1093/bioinformatics/btae606","DOIUrl":"10.1093/bioinformatics/btae606","url":null,"abstract":"<p><strong>Motivation: </strong>Recent brain mapping efforts are producing large-scale whole-brain images using different imaging modalities. Accurate alignment and delineation of anatomical structures in these images are essential for numerous studies. These requirements are typically modeled as two distinct tasks: registration and segmentation. However, prevailing methods, fail to fully explore and utilize the inherent correlation and complementarity between the two tasks. Furthermore, variations in brain anatomy, brightness, and texture pose another formidable challenge in designing multi-modal similarity metrics. A high-throughput approach capable of overcoming the bottleneck of multi-modal similarity metric design, while effective leveraging the highly correlated and complementary nature of two tasks is highly desirable.</p><p><strong>Results: </strong>We introduce a deep learning framework for joint registration and segmentation of multi-modal brain images. Under this framework, registration and segmentation tasks are deeply coupled and collaborated at two hierarchical layers. In the inner layer, we establish a strong feature-level coupling between the two tasks by learning a unified common latent feature representation. In the outer layer, we introduce a mutually supervised dual-branch network to decouple latent features and facilitate task-level collaboration between registration and segmentation. Since the latent features we designed are also modality-independent, the bottleneck of designing multi-modal similarity metric is essentially addressed. Another merit offered by this framework is the interpretability of latent features, which allows intuitive manipulation of feature learning, thereby further enhancing network training efficiency and the performance of both tasks. Extensive experiments conducted on both multi-modal and mono-modal datasets of mouse and human brains demonstrate the superiority of our method.</p><p><strong>Availability and implementation: </strong>The code is available at https://github.com/tingtingup/DCRS.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11543610/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142482933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GCI: a continuity inspector for complete genome assembly. GCI:用于完整基因组组装的连续性检查器。
Pub Date : 2024-11-01 DOI: 10.1093/bioinformatics/btae633
Quanyu Chen, Chentao Yang, Guojie Zhang, Dongya Wu

Motivation: Recent advances in long-read sequencing technologies have significantly facilitated the production of high-quality genome assembly. The telomere-to-telomere (T2T) gapless assembly has become the new golden standard of genome assembly efforts. Several recent efforts have claimed to produce T2T-level reference genomes. However, a universal standard is still missing to qualify a genome assembly to be at T2T standard. Traditional genome assembly assessment metrics (N50 and its derivatives) have no capacity in differentiating between nearly T2T assembly and the truly T2T assembly in continuity either globally or locally. Additionally, these metrics are independent of raw reads, making them inflated easily by artificial operations. Therefore, a gaplessness evaluation tool at single-nucleotide resolution to reflect true completeness is urgently needed in the era of complete genomes.

Results: Here, we present a tool called Genome Continuity Inspector (GCI), designed to assess genome assembly continuity at single-base resolution, and evaluate how close an assembly is to the T2T level. GCI utilizes multiple aligners to map long reads from various sequencing platforms back to the assembly. By incorporating curated mapping coverage of high-confidence read alignments, GCI identifies potential assembly issues. Meanwhile, it provides GCI scores that quantify overall assembly continuity on the whole genome or chromosome scales.

Availability and implementation: The open-source GCI code is freely available on Github (https://github.com/yeeus/GCI) under the MIT license.

动机长读数测序技术的最新进展极大地促进了高质量基因组组装的产生。端粒到端粒(T2T)无间隙组装已成为基因组组装工作的新黄金标准。最近有几项工作声称能产生 T2T 水平的参考基因组。然而,目前仍缺乏一个通用标准来确定基因组组装是否达到 T2T 标准。传统的基因组组装评估指标(N50 及其衍生物)无法区分接近 T2T 组装和真正 T2T 组装的连续性,无论是在全球还是在本地。此外,这些指标与原始读数无关,很容易被人为操作夸大。因此,在全基因组时代,迫切需要一种单核苷酸分辨率的无间隙性评估工具来反映真正的完整性:在此,我们提出了一种名为基因组连续性检查器(GCI)的工具,旨在以单碱基分辨率评估基因组组装的连续性,并评估组装与 T2T 水平的接近程度。GCI 利用多个对齐器将来自不同测序平台的长读数映射回装配。通过结合高置信度读数对齐的策定映射覆盖率,GCI 可以识别潜在的组装问题。同时,它还提供 GCI 分数,量化全基因组或染色体范围内的整体组装连续性:开源 GCI 代码可在 Github (https://github.com/yeeus/GCI) 上免费获取,采用 MIT 许可。补充信息:补充数据可在 Bioinformatics online 上获取。
{"title":"GCI: a continuity inspector for complete genome assembly.","authors":"Quanyu Chen, Chentao Yang, Guojie Zhang, Dongya Wu","doi":"10.1093/bioinformatics/btae633","DOIUrl":"10.1093/bioinformatics/btae633","url":null,"abstract":"<p><strong>Motivation: </strong>Recent advances in long-read sequencing technologies have significantly facilitated the production of high-quality genome assembly. The telomere-to-telomere (T2T) gapless assembly has become the new golden standard of genome assembly efforts. Several recent efforts have claimed to produce T2T-level reference genomes. However, a universal standard is still missing to qualify a genome assembly to be at T2T standard. Traditional genome assembly assessment metrics (N50 and its derivatives) have no capacity in differentiating between nearly T2T assembly and the truly T2T assembly in continuity either globally or locally. Additionally, these metrics are independent of raw reads, making them inflated easily by artificial operations. Therefore, a gaplessness evaluation tool at single-nucleotide resolution to reflect true completeness is urgently needed in the era of complete genomes.</p><p><strong>Results: </strong>Here, we present a tool called Genome Continuity Inspector (GCI), designed to assess genome assembly continuity at single-base resolution, and evaluate how close an assembly is to the T2T level. GCI utilizes multiple aligners to map long reads from various sequencing platforms back to the assembly. By incorporating curated mapping coverage of high-confidence read alignments, GCI identifies potential assembly issues. Meanwhile, it provides GCI scores that quantify overall assembly continuity on the whole genome or chromosome scales.</p><p><strong>Availability and implementation: </strong>The open-source GCI code is freely available on Github (https://github.com/yeeus/GCI) under the MIT license.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11550331/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142482937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MR Corge: sensitivity analysis of Mendelian randomization based on the core gene hypothesis for polygenic exposures. MR Corge:基于核心基因假说的孟德尔随机化对多基因暴露的敏感性分析。
Pub Date : 2024-11-01 DOI: 10.1093/bioinformatics/btae666
Wenmin Zhang, Chen-Yang Su, Satoshi Yoshiji, Tianyuan Lu

Summary: Mendelian randomization is being utilized to assess causal effects of polygenic exposures, where many genetic instruments are subject to horizontal pleiotropy. Existing methods for detecting and correcting for horizontal pleiotropy have important assumptions that may not be fulfilled. Built upon the core gene hypothesis, we developed MR Corge for performing sensitivity analysis of Mendelian randomization. MR Corge identifies a small number of putative core instruments that are more likely to affect genes with a direct biological role in an exposure and obtains causal effect estimates based on these instruments, thereby reducing the risk of horizontal pleiotropy. Using positive and negative controls, we demonstrated that MR Corge estimates aligned with established biomedical knowledge and the results of randomized controlled trials. MR Corge may be widely applied to investigate polygenic exposure-outcome relationships.

Availability and implementation: An open-sourced R package is available at https://github.com/zhwm/MRCorge.

摘要:目前正在利用孟德尔随机法来评估多基因暴露的因果效应,其中许多基因工具都受到横向多效性的影响。现有的检测和校正水平多效性的方法有一些重要的假设,这些假设可能无法实现。基于核心基因假设,我们开发了 MR Corge,用于对孟德尔随机化进行敏感性分析。MR Corge 可识别出少数几个更有可能影响在暴露中具有直接生物学作用的基因的假定核心工具,并根据这些工具获得因果效应估计值,从而降低水平多效性的风险。通过使用阳性和阴性对照,我们证明了 MR Corge 的估计值与已有的生物医学知识和随机对照试验的结果一致。MR Corge可广泛应用于研究多基因暴露-结果关系:开源 R 软件包可在 https://github.com/zhwm/MRCorge.Supplementary 上获取:补充数据可在 Bioinformatics online 上获取。
{"title":"MR Corge: sensitivity analysis of Mendelian randomization based on the core gene hypothesis for polygenic exposures.","authors":"Wenmin Zhang, Chen-Yang Su, Satoshi Yoshiji, Tianyuan Lu","doi":"10.1093/bioinformatics/btae666","DOIUrl":"10.1093/bioinformatics/btae666","url":null,"abstract":"<p><strong>Summary: </strong>Mendelian randomization is being utilized to assess causal effects of polygenic exposures, where many genetic instruments are subject to horizontal pleiotropy. Existing methods for detecting and correcting for horizontal pleiotropy have important assumptions that may not be fulfilled. Built upon the core gene hypothesis, we developed MR Corge for performing sensitivity analysis of Mendelian randomization. MR Corge identifies a small number of putative core instruments that are more likely to affect genes with a direct biological role in an exposure and obtains causal effect estimates based on these instruments, thereby reducing the risk of horizontal pleiotropy. Using positive and negative controls, we demonstrated that MR Corge estimates aligned with established biomedical knowledge and the results of randomized controlled trials. MR Corge may be widely applied to investigate polygenic exposure-outcome relationships.</p><p><strong>Availability and implementation: </strong>An open-sourced R package is available at https://github.com/zhwm/MRCorge.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11578597/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142607298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pf-HaploAtlas: an interactive web app for spatiotemporal analysis of Plasmodium falciparum genes. Pf-HaploAtlas:用于恶性疟原虫基因时空分析的交互式网络应用程序。
Pub Date : 2024-11-01 DOI: 10.1093/bioinformatics/btae673
Chiyun Lee, Eyyüb S Ünlü, Nina F D White, Jacob Almagro-Garcia, Cristina Ariani, Richard D Pearson

Motivation: Monitoring the genomic evolution of Plasmodium falciparum-the most widespread and deadliest of the human-infecting malaria species-is critical for making decisions in response to changes in drug resistance, diagnostic test failures, and vaccine effectiveness. The MalariaGEN data resources are the world's largest whole genome sequencing databases for Plasmodium parasites. The size and complexity of such data is a barrier to many potential end users in both public health and academic research. A user-friendly method for accessing and exploring data on the genetic variation of P. falciparum would greatly enable efforts in studying and controlling malaria.

Results: We developed Pf-HaploAtlas, a web application enabling exploratory data analysis of genomic variation without requiring advanced technical expertise. The app provides analysis-ready data catalogues and visualizations of amino acid haplotypes for all 5102 core P. falciparum genes. Pf-HaploAtlas facilitates comprehensive spatial and temporal exploration of genes and variants of interest by using data from 16 203 samples, from 33 countries, and spread between the years 1984 and 2018. The scope of Pf-HaploAtlas will expand with each new MalariaGEN Plasmodium data release.

Availability and implementation: Pf-HaploAtlas is available online for public use at https://apps.malariagen.net/pf-haploatlas, which allows users to download the underlying amino acid haplotype data for further analyses, and its source code is freely available on GitHub under the MIT licence at https://github.com/malariagen/pf-haploatlas.

动因:恶性疟原虫是人类感染疟疾种类中分布最广、最致命的一种,监测恶性疟原虫的基因组进化对于针对耐药性、诊断检测失败和疫苗有效性的变化做出决策至关重要。MalariaGEN 数据资源是世界上最大的疟原虫全基因组测序数据库。这些数据的规模和复杂性阻碍了公共卫生和学术研究领域的许多潜在最终用户。如果能有一种用户友好型方法来访问和探索恶性疟原虫基因变异数据,将极大地促进疟疾研究和控制工作:结果:我们开发了 Pf-HaploAtlas,这是一款网络应用程序,无需高级技术知识即可对基因组变异进行探索性数据分析。该应用程序为恶性疟原虫的所有 5102 个核心基因提供了分析就绪的数据目录和氨基酸单倍型可视化。Pf-HaploAtlas 利用来自 33 个国家的 16203 个样本的数据,从 1984 年到 2018 年对相关基因和变异进行了全面的时空探索。Pf-HaploAtlas 的范围将随着 MalariaGEN 疟原虫数据的发布而扩大:Pf-HaploAtlas 可在 https://apps.malariagen.net/pf-haploatlas 上供公众在线使用,用户可以下载底层氨基酸单倍型数据以进行进一步分析,其源代码可在 GitHub 上以 MIT 许可免费获取,网址为 https://github.com/malariagen/pf-haploatlas。
{"title":"Pf-HaploAtlas: an interactive web app for spatiotemporal analysis of Plasmodium falciparum genes.","authors":"Chiyun Lee, Eyyüb S Ünlü, Nina F D White, Jacob Almagro-Garcia, Cristina Ariani, Richard D Pearson","doi":"10.1093/bioinformatics/btae673","DOIUrl":"10.1093/bioinformatics/btae673","url":null,"abstract":"<p><strong>Motivation: </strong>Monitoring the genomic evolution of Plasmodium falciparum-the most widespread and deadliest of the human-infecting malaria species-is critical for making decisions in response to changes in drug resistance, diagnostic test failures, and vaccine effectiveness. The MalariaGEN data resources are the world's largest whole genome sequencing databases for Plasmodium parasites. The size and complexity of such data is a barrier to many potential end users in both public health and academic research. A user-friendly method for accessing and exploring data on the genetic variation of P. falciparum would greatly enable efforts in studying and controlling malaria.</p><p><strong>Results: </strong>We developed Pf-HaploAtlas, a web application enabling exploratory data analysis of genomic variation without requiring advanced technical expertise. The app provides analysis-ready data catalogues and visualizations of amino acid haplotypes for all 5102 core P. falciparum genes. Pf-HaploAtlas facilitates comprehensive spatial and temporal exploration of genes and variants of interest by using data from 16 203 samples, from 33 countries, and spread between the years 1984 and 2018. The scope of Pf-HaploAtlas will expand with each new MalariaGEN Plasmodium data release.</p><p><strong>Availability and implementation: </strong>Pf-HaploAtlas is available online for public use at https://apps.malariagen.net/pf-haploatlas, which allows users to download the underlying amino acid haplotype data for further analyses, and its source code is freely available on GitHub under the MIT licence at https://github.com/malariagen/pf-haploatlas.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11588202/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142683864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
k-nonical space: sketching with reverse complements. k 名义空间:用反向补数绘制草图
Pub Date : 2024-11-01 DOI: 10.1093/bioinformatics/btae629
Guillaume Marçais, C S Elder, Carl Kingsford

Motivation: Sequences equivalent to their reverse complements (i.e. double-stranded DNA) have no analogue in text analysis and non-biological string algorithms. Despite this striking difference, algorithms designed for computational biology (e.g. sketching algorithms) are designed and tested in the same way as classical string algorithms. Then, as a post-processing step, these algorithms are adapted to work with genomic sequences by folding a k-mer and its reverse complement into a single sequence: The canonical representation (k-nonical space).

Results: The effect of using the canonical representation with sketching methods is understudied and not understood. As a first step, we use context-free sketching methods to illustrate the potentially detrimental effects of using canonical k-mers with string algorithms not designed to accommodate for them. In particular, we show that large stretches of the genome ("sketching deserts") are undersampled or entirely skipped by context-free sketching methods, effectively making these genomic regions invisible to subsequent algorithms using these sketches. We provide empirical data showing these effects and develop a theoretical framework explaining the appearance of sketching deserts. Finally, we propose two schemes to accommodate for these effects: (i) a new procedure that adapts existing sketching methods to k-nonical space and (ii) an optimization procedure to directly design new sketching methods for k-nonical space.

Availability and implementation: The code used in this analysis is available under a permissive license at https://github.com/Kingsford-Group/mdsscope.

动机与反向互补序列(即双链 DNA)等价的序列在文本分析和非生物字符串算法中并不存在。尽管存在这种显著差异,但为计算生物学设计的算法(如草图算法)的设计和测试方法与经典字符串算法相同。然后,作为后处理步骤,通过将 k-mer 及其反向补码折叠成单一序列,使这些算法适用于基因组序列:结果:对草图绘制方法使用规范表示法的效果研究不足,也不了解。作为第一步,我们使用无上下文草图方法来说明使用非标准 k-mers 的字符串算法可能产生的不利影响。特别是,我们展示了基因组的大片段("草图沙漠")被无上下文草图方法采样不足或完全跳过,从而有效地使使用这些草图的后续算法看不到这些基因组区域。我们提供了显示这些影响的经验数据,并建立了解释草图沙漠出现的理论框架。最后,我们提出了两种方案来适应这些效应:(1)一种新的程序,将现有的草图绘制方法适应于 k-nonical 空间;(2)一种优化程序,直接为 k-nonical 空间设计新的草图绘制方法:本分析中使用的代码可在 https://github.com/Kingsford-Group/mdsscope.Supplementary 信息网站的许可下获取:补充数据可在牛津生物信息学网站获取。
{"title":"k-nonical space: sketching with reverse complements.","authors":"Guillaume Marçais, C S Elder, Carl Kingsford","doi":"10.1093/bioinformatics/btae629","DOIUrl":"10.1093/bioinformatics/btae629","url":null,"abstract":"<p><strong>Motivation: </strong>Sequences equivalent to their reverse complements (i.e. double-stranded DNA) have no analogue in text analysis and non-biological string algorithms. Despite this striking difference, algorithms designed for computational biology (e.g. sketching algorithms) are designed and tested in the same way as classical string algorithms. Then, as a post-processing step, these algorithms are adapted to work with genomic sequences by folding a k-mer and its reverse complement into a single sequence: The canonical representation (k-nonical space).</p><p><strong>Results: </strong>The effect of using the canonical representation with sketching methods is understudied and not understood. As a first step, we use context-free sketching methods to illustrate the potentially detrimental effects of using canonical k-mers with string algorithms not designed to accommodate for them. In particular, we show that large stretches of the genome (\"sketching deserts\") are undersampled or entirely skipped by context-free sketching methods, effectively making these genomic regions invisible to subsequent algorithms using these sketches. We provide empirical data showing these effects and develop a theoretical framework explaining the appearance of sketching deserts. Finally, we propose two schemes to accommodate for these effects: (i) a new procedure that adapts existing sketching methods to k-nonical space and (ii) an optimization procedure to directly design new sketching methods for k-nonical space.</p><p><strong>Availability and implementation: </strong>The code used in this analysis is available under a permissive license at https://github.com/Kingsford-Group/mdsscope.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11549021/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142482938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PhenoMultiOmics: an enzymatic reaction inferred multi-omics network visualization web server. PhenoMultiOmics:酶反应推断多组学网络可视化网络服务器。
Pub Date : 2024-11-01 DOI: 10.1093/bioinformatics/btae623
Yuying Shi, Botao Xu, Zhe Wang, Qitao Chen, Jie Chai, Cheng Wang

Motivation: Enzymatic reaction play a pivotal role in regulating cellular processes with a high degree of specificity to biological functions. When enzymatic reactions are disrupted by gene, protein, or metabolite dysfunctions in diseases, it becomes crucial to visualize the resulting perturbed enzymatic reaction-induced multi-omics network. Multi-omics network visualization aids in gaining a comprehensive understanding of the functionality and regulatory mechanisms within biological systems.

Results: In this study, we designed PhenoMultiOmics, an enzymatic reaction-based multi-omics web server designed to explore the scope of the multi-omics network across various cancer types. We first curated the PhenoMultiOmics database, which enables the retrieval of cancer-gene-protein-metabolite relationships based on the enzymatic reactions. We then developed the MultiOmics network visualization module to depict the interplay between genes, proteins, and metabolites in response to specific cancer-related enzymatic reactions. The biomarker discovery module facilitates functional analysis through differential omic feature expression and pathway enrichment analysis. PhenoMultiOmics has been applied to analyze the transcriptomics data of gastric cancer and the metabolomics data of lung cancer, providing mechanistic insights into interrupted enzymatic reactions and the associated multi-omics network.

Availability and implementation: PhenoMultiOmics is freely accessed at https://phenomultiomics.shinyapps.io/cancer/ with a user-friendly and interactive web interface.

动机酶促反应在调节细胞过程中发挥着关键作用,对生物功能具有高度特异性。当疾病中的基因、蛋白质或代谢物功能失调导致酶促反应紊乱时,将酶促反应引起的多组学网络可视化就变得至关重要。多组学网络可视化有助于全面了解生物系统的功能和调控机制:在这项研究中,我们设计了一个基于酶反应的多组学网络服务器 PhenoMultiOmics,旨在探索各种癌症类型的多组学网络范围。我们首先建立了 PhenoMultiOmics 数据库(PMODB),该数据库可根据酶促反应检索癌症基因-蛋白质-代谢物之间的关系。然后,我们开发了 MultiOmics 网络可视化模块,以描述基因、蛋白质和代谢物之间在特定癌症相关酶促反应中的相互作用。生物标记物发现模块通过差异 omic 特征表达和通路富集分析促进功能分析。PhenoMultiOmics 已被用于分析胃癌的转录组学数据和肺癌的代谢组学数据,为中断的酶反应和相关的多组学网络提供了深入的见解:PhenoMultiOmics 可在 https://phenomultiomics.shinyapps.io/cancer/ 免费访问,并提供用户友好的交互式网络界面:补充信息:补充数据可在 Bioinformatics online 上获取。
{"title":"PhenoMultiOmics: an enzymatic reaction inferred multi-omics network visualization web server.","authors":"Yuying Shi, Botao Xu, Zhe Wang, Qitao Chen, Jie Chai, Cheng Wang","doi":"10.1093/bioinformatics/btae623","DOIUrl":"10.1093/bioinformatics/btae623","url":null,"abstract":"<p><strong>Motivation: </strong>Enzymatic reaction play a pivotal role in regulating cellular processes with a high degree of specificity to biological functions. When enzymatic reactions are disrupted by gene, protein, or metabolite dysfunctions in diseases, it becomes crucial to visualize the resulting perturbed enzymatic reaction-induced multi-omics network. Multi-omics network visualization aids in gaining a comprehensive understanding of the functionality and regulatory mechanisms within biological systems.</p><p><strong>Results: </strong>In this study, we designed PhenoMultiOmics, an enzymatic reaction-based multi-omics web server designed to explore the scope of the multi-omics network across various cancer types. We first curated the PhenoMultiOmics database, which enables the retrieval of cancer-gene-protein-metabolite relationships based on the enzymatic reactions. We then developed the MultiOmics network visualization module to depict the interplay between genes, proteins, and metabolites in response to specific cancer-related enzymatic reactions. The biomarker discovery module facilitates functional analysis through differential omic feature expression and pathway enrichment analysis. PhenoMultiOmics has been applied to analyze the transcriptomics data of gastric cancer and the metabolomics data of lung cancer, providing mechanistic insights into interrupted enzymatic reactions and the associated multi-omics network.</p><p><strong>Availability and implementation: </strong>PhenoMultiOmics is freely accessed at https://phenomultiomics.shinyapps.io/cancer/ with a user-friendly and interactive web interface.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11549024/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142482944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Bioinformatics (Oxford, England)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1