首页 > 最新文献

NPJ Systems Biology and Applications最新文献

英文 中文
EpiScan: accurate high-throughput mapping of antibody-specific epitopes using sequence information EpiScan:利用序列信息精确绘制抗体特异性表位的高通量图谱
IF 4 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-09-09 DOI: 10.1038/s41540-024-00432-7
Chuan Wang, Jiangyuan Wang, Wenjun Song, Guanzheng Luo, Taijiao Jiang

The identification of antibody-specific epitopes on virus proteins is crucial for vaccine development and drug design. Nonetheless, traditional wet-lab approaches for the identification of epitopes are both costly and labor-intensive, underscoring the need for the development of efficient and cost-effective computational tools. Here, EpiScan, an attention-based deep learning framework for predicting antibody-specific epitopes, is presented. EpiScan adopts a multi-input and single-output strategy by designing independent blocks for different parts of antibodies, including variable heavy chain (VH), variable light chain (VL), complementary determining regions (CDRs), and framework regions (FRs). The block predictions are weighted and integrated for the prediction of potential epitopes. Using multiple experimental data samples, we show that EpiScan, which only uses antibody sequence information, can accurately map epitopes on specific antigen structures. The antibody-specific epitopes on the receptor binding domain (RBD) of SARS coronavirus 2 (SARS-CoV-2) were located by EpiScan, and the potentially valuable vaccine epitope was identified. EpiScan can expedite the epitope mapping process for high-throughput antibody sequencing data, supporting vaccine design and drug development. Availability: For the convenience of related wet-experimental researchers, the source code and web server of EpiScan are publicly available at https://github.com/gzBiomedical/EpiScan.

鉴定病毒蛋白质上的抗体特异性表位对疫苗开发和药物设计至关重要。然而,用于鉴定表位的传统湿实验室方法既昂贵又耗费人力,这凸显了开发高效、经济的计算工具的必要性。这里介绍的 EpiScan 是一种基于注意力的深度学习框架,用于预测抗体特异性表位。EpiScan 采用多输入、单输出策略,为抗体的不同部分设计独立的区块,包括可变重链(VH)、可变轻链(VL)、互补决定区(CDR)和框架区(FR)。这些区块预测结果经过加权和整合,可用于预测潜在的表位。通过使用多个实验数据样本,我们证明了只使用抗体序列信息的 EpiScan 能够准确地绘制出特定抗原结构上的表位图。EpiScan 定位了 SARS 冠状病毒 2(SARS-CoV-2)受体结合域(RBD)上的抗体特异性表位,并确定了潜在的有价值疫苗表位。EpiScan 可以加快高通量抗体测序数据的表位图绘制过程,为疫苗设计和药物开发提供支持。可用性:为方便相关湿法实验研究人员使用,EpiScan 的源代码和网络服务器可在 https://github.com/gzBiomedical/EpiScan 上公开获取。
{"title":"EpiScan: accurate high-throughput mapping of antibody-specific epitopes using sequence information","authors":"Chuan Wang, Jiangyuan Wang, Wenjun Song, Guanzheng Luo, Taijiao Jiang","doi":"10.1038/s41540-024-00432-7","DOIUrl":"https://doi.org/10.1038/s41540-024-00432-7","url":null,"abstract":"<p>The identification of antibody-specific epitopes on virus proteins is crucial for vaccine development and drug design. Nonetheless, traditional wet-lab approaches for the identification of epitopes are both costly and labor-intensive, underscoring the need for the development of efficient and cost-effective computational tools. Here, EpiScan, an attention-based deep learning framework for predicting antibody-specific epitopes, is presented. EpiScan adopts a multi-input and single-output strategy by designing independent blocks for different parts of antibodies, including variable heavy chain (V<sub>H</sub>), variable light chain (V<sub>L</sub>), complementary determining regions (CDRs), and framework regions (FRs). The block predictions are weighted and integrated for the prediction of potential epitopes. Using multiple experimental data samples, we show that EpiScan, which only uses antibody sequence information, can accurately map epitopes on specific antigen structures. The antibody-specific epitopes on the receptor binding domain (RBD) of SARS coronavirus 2 (SARS-CoV-2) were located by EpiScan, and the potentially valuable vaccine epitope was identified. EpiScan can expedite the epitope mapping process for high-throughput antibody sequencing data, supporting vaccine design and drug development. Availability: For the convenience of related wet-experimental researchers, the source code and web server of EpiScan are publicly available at https://github.com/gzBiomedical/EpiScan.</p>","PeriodicalId":19345,"journal":{"name":"NPJ Systems Biology and Applications","volume":"249 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142206521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Codon usage and expression-based features significantly improve prediction of CRISPR efficiency. 基于密码子用法和表达的特征大大提高了对 CRISPR 效率的预测。
IF 3.5 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-09-03 DOI: 10.1038/s41540-024-00431-8
Shaked Bergman, Tamir Tuller

CRISPR is a precise and effective genome editing technology; but despite several advancements during the last decade, our ability to computationally design gRNAs remains limited. Most predictive models have relatively low predictive power and utilize only the sequence of the target site as input. Here we suggest a new category of features, which incorporate the target site genomic position and the presence of genes close to it. We calculate four features based on gene expression and codon usage bias indices. We show, on CRISPR datasets taken from 3 different cell types, that such features perform comparably with 425 state-of-the-art predictive features, ranking in the top 2-12% of features. We trained new predictive models, showing that adding expression features to them significantly improves their r2 by up to 0.04 (relative increase of 39%), achieving average correlations of up to 0.38 on their validation sets; and that these features are deemed important by different feature importance metrics. We believe that incorporating the target site's position, in addition to its sequence, in features such as we have generated here will improve our ability to predict, design and understand CRISPR experiments going forward.

CRISPR 是一种精确而有效的基因组编辑技术;但尽管在过去十年中取得了一些进展,我们计算设计 gRNA 的能力仍然有限。大多数预测模型的预测能力相对较低,而且只利用目标位点的序列作为输入。在这里,我们提出了一类新的特征,它结合了目标位点的基因组位置和邻近基因的存在。我们根据基因表达和密码子使用偏差指数计算了四个特征。我们在取自 3 种不同细胞类型的 CRISPR 数据集上表明,这些特征的表现与 425 种最先进的预测特征不相上下,位居前 2-12% 的特征之列。我们训练了新的预测模型,结果表明,在模型中加入表达特征可显著提高模型的 r2,最高可达 0.04(相对提高 39%),在验证集上的平均相关性最高可达 0.38;而且这些特征被不同的特征重要性指标视为重要特征。我们相信,将目标位点的位置和序列纳入我们在此生成的特征中,将提高我们预测、设计和理解未来 CRISPR 实验的能力。
{"title":"Codon usage and expression-based features significantly improve prediction of CRISPR efficiency.","authors":"Shaked Bergman, Tamir Tuller","doi":"10.1038/s41540-024-00431-8","DOIUrl":"10.1038/s41540-024-00431-8","url":null,"abstract":"<p><p>CRISPR is a precise and effective genome editing technology; but despite several advancements during the last decade, our ability to computationally design gRNAs remains limited. Most predictive models have relatively low predictive power and utilize only the sequence of the target site as input. Here we suggest a new category of features, which incorporate the target site genomic position and the presence of genes close to it. We calculate four features based on gene expression and codon usage bias indices. We show, on CRISPR datasets taken from 3 different cell types, that such features perform comparably with 425 state-of-the-art predictive features, ranking in the top 2-12% of features. We trained new predictive models, showing that adding expression features to them significantly improves their r<sup>2</sup> by up to 0.04 (relative increase of 39%), achieving average correlations of up to 0.38 on their validation sets; and that these features are deemed important by different feature importance metrics. We believe that incorporating the target site's position, in addition to its sequence, in features such as we have generated here will improve our ability to predict, design and understand CRISPR experiments going forward.</p>","PeriodicalId":19345,"journal":{"name":"NPJ Systems Biology and Applications","volume":"10 1","pages":"100"},"PeriodicalIF":3.5,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11372048/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142126288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Boolean model explains phenotypic plasticity changes underlying hepatic cancer stem cells emergence. 布尔模型解释了肝癌干细胞出现的表型可塑性变化。
IF 3.5 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-09-02 DOI: 10.1038/s41540-024-00422-9
Alexis Hernández-Magaña, Antonio Bensussen, Juan Carlos Martínez-García, Elena R Álvarez-Buylla

In several carcinomas, including hepatocellular carcinoma, it has been demonstrated that cancer stem cells (CSCs) have enhanced invasiveness and therapy resistance compared to differentiated cancer cells. Mathematical-computational tools could be valuable for integrating experimental results and understanding the phenotypic plasticity mechanisms for CSCs emergence. Based on the literature review, we constructed a Boolean model that recovers eight stable states (attractors) corresponding to the gene expression profile of hepatocytes and mesenchymal cells in senescent, quiescent, proliferative, and stem-like states. The epigenetic landscape associated with the regulatory network was analyzed. We observed that the loss of p53, p16, RB, or the constitutive activation of β-catenin and YAP1 increases the robustness of the proliferative stem-like phenotypes. Additionally, we found that p53 inactivation facilitates the transition of proliferative hepatocytes into stem-like mesenchymal phenotype. Thus, phenotypic plasticity may be altered, and stem-like phenotypes related to CSCs may be easier to attain following the mutation acquisition.

在包括肝细胞癌在内的多种癌症中,癌症干细胞(CSCs)与分化癌细胞相比,具有更强的侵袭性和耐药性。数学计算工具对于整合实验结果和理解 CSCs 出现的表型可塑性机制很有价值。在文献综述的基础上,我们构建了一个布尔模型,恢复了肝细胞和间充质细胞在衰老、静止、增殖和干样状态下基因表达谱的八个稳定状态(吸引子)。我们分析了与调控网络相关的表观遗传景观。我们观察到,p53、p16、RB 的缺失或β-catenin 和 YAP1 的组成性激活会增加增殖干样表型的稳健性。此外,我们还发现,p53 失活促进了增殖性肝细胞向干样间质表型的转变。因此,表型的可塑性可能会发生改变,突变后可能更容易获得与CSCs相关的干样表型。
{"title":"A Boolean model explains phenotypic plasticity changes underlying hepatic cancer stem cells emergence.","authors":"Alexis Hernández-Magaña, Antonio Bensussen, Juan Carlos Martínez-García, Elena R Álvarez-Buylla","doi":"10.1038/s41540-024-00422-9","DOIUrl":"10.1038/s41540-024-00422-9","url":null,"abstract":"<p><p>In several carcinomas, including hepatocellular carcinoma, it has been demonstrated that cancer stem cells (CSCs) have enhanced invasiveness and therapy resistance compared to differentiated cancer cells. Mathematical-computational tools could be valuable for integrating experimental results and understanding the phenotypic plasticity mechanisms for CSCs emergence. Based on the literature review, we constructed a Boolean model that recovers eight stable states (attractors) corresponding to the gene expression profile of hepatocytes and mesenchymal cells in senescent, quiescent, proliferative, and stem-like states. The epigenetic landscape associated with the regulatory network was analyzed. We observed that the loss of p53, p16, RB, or the constitutive activation of β-catenin and YAP1 increases the robustness of the proliferative stem-like phenotypes. Additionally, we found that p53 inactivation facilitates the transition of proliferative hepatocytes into stem-like mesenchymal phenotype. Thus, phenotypic plasticity may be altered, and stem-like phenotypes related to CSCs may be easier to attain following the mutation acquisition.</p>","PeriodicalId":19345,"journal":{"name":"NPJ Systems Biology and Applications","volume":"10 1","pages":"99"},"PeriodicalIF":3.5,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11369243/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142120344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Network topology and interaction logic determine states it supports. 网络拓扑和交互逻辑决定了它所支持的状态。
IF 3.5 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-28 DOI: 10.1038/s41540-024-00423-8
Tomáš Gedeon

In this review paper we summarize a recent progress on the problem of describing range of dynamics supported by a network. We show that there is natural connection between network models consisting of collections of multivalued monotone boolean functions and ordinary differential equations models. We show how to construct such collections and use them to answer questions about prevalence of cellular phenotypes that correspond to equilibria of network models.

在这篇综述论文中,我们总结了描述网络支持的动态范围问题的最新进展。我们表明,由多值单调布尔函数集合组成的网络模型与常微分方程模型之间存在天然联系。我们展示了如何构建这样的集合,并用它们来回答与网络模型均衡点相对应的细胞表型的流行率问题。
{"title":"Network topology and interaction logic determine states it supports.","authors":"Tomáš Gedeon","doi":"10.1038/s41540-024-00423-8","DOIUrl":"https://doi.org/10.1038/s41540-024-00423-8","url":null,"abstract":"<p><p>In this review paper we summarize a recent progress on the problem of describing range of dynamics supported by a network. We show that there is natural connection between network models consisting of collections of multivalued monotone boolean functions and ordinary differential equations models. We show how to construct such collections and use them to answer questions about prevalence of cellular phenotypes that correspond to equilibria of network models.</p>","PeriodicalId":19345,"journal":{"name":"NPJ Systems Biology and Applications","volume":"10 1","pages":"98"},"PeriodicalIF":3.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11358538/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142093646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recovering biomolecular network dynamics from single-cell omics data requires three time points. 从单细胞奥米克斯数据中恢复生物分子网络动态需要三个时间点。
IF 3.5 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-27 DOI: 10.1038/s41540-024-00424-7
Shu Wang, Muhammad Ali Al-Radhawi, Douglas A Lauffenburger, Eduardo D Sontag

Single-cell omics technologies can measure millions of cells for up to thousands of biomolecular features, enabling data-driven studies of complex biological networks. However, these high-throughput experimental techniques often cannot track individual cells over time, thus complicating the understanding of dynamics such as time trajectories of cell states. These "dynamical phenotypes" are key to understanding biological phenomena such as differentiation fates. We show by mathematical analysis that, in spite of high dimensionality and lack of individual cell traces, three time-points of single-cell omics data are theoretically necessary and sufficient to uniquely determine the network interaction matrix and associated dynamics. Moreover, we show through numerical simulations that an interaction matrix can be accurately determined with three or more time-points even in the presence of sampling and measurement noise typical of single-cell omics. Our results can guide the design of single-cell omics time-course experiments, and provide a tool for data-driven phase-space analysis.

单细胞组学技术可以测量数百万个细胞的数千种生物分子特征,从而对复杂的生物网络进行数据驱动研究。然而,这些高通量实验技术往往无法跟踪单个细胞的时间变化,从而使了解细胞状态的时间轨迹等动态变化变得更加复杂。这些 "动态表型 "是理解分化命运等生物现象的关键。我们通过数学分析证明,尽管维度很高且缺乏单个细胞的轨迹,但理论上单细胞全息数据的三个时间点对于唯一确定网络交互矩阵和相关动态是必要且充分的。此外,我们还通过数值模拟表明,即使存在单细胞全息数据典型的采样和测量噪声,也能通过三个或更多时间点准确确定相互作用矩阵。我们的研究结果可以指导单细胞组学时程实验的设计,并为数据驱动的相空间分析提供工具。
{"title":"Recovering biomolecular network dynamics from single-cell omics data requires three time points.","authors":"Shu Wang, Muhammad Ali Al-Radhawi, Douglas A Lauffenburger, Eduardo D Sontag","doi":"10.1038/s41540-024-00424-7","DOIUrl":"10.1038/s41540-024-00424-7","url":null,"abstract":"<p><p>Single-cell omics technologies can measure millions of cells for up to thousands of biomolecular features, enabling data-driven studies of complex biological networks. However, these high-throughput experimental techniques often cannot track individual cells over time, thus complicating the understanding of dynamics such as time trajectories of cell states. These \"dynamical phenotypes\" are key to understanding biological phenomena such as differentiation fates. We show by mathematical analysis that, in spite of high dimensionality and lack of individual cell traces, three time-points of single-cell omics data are theoretically necessary and sufficient to uniquely determine the network interaction matrix and associated dynamics. Moreover, we show through numerical simulations that an interaction matrix can be accurately determined with three or more time-points even in the presence of sampling and measurement noise typical of single-cell omics. Our results can guide the design of single-cell omics time-course experiments, and provide a tool for data-driven phase-space analysis.</p>","PeriodicalId":19345,"journal":{"name":"NPJ Systems Biology and Applications","volume":"10 1","pages":"97"},"PeriodicalIF":3.5,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11350189/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142081061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Constraint-based modelling predicts metabolic signatures of low and high-grade serous ovarian cancer. 基于约束的建模可预测低度和高度浆液性卵巢癌的代谢特征。
IF 3.5 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-24 DOI: 10.1038/s41540-024-00418-5
Kate E Meeson, Jean-Marc Schwartz

Ovarian cancer is an aggressive, heterogeneous disease, burdened with late diagnosis and resistance to chemotherapy. Clinical features of ovarian cancer could be explained by investigating its metabolism, and how the regulation of specific pathways links to individual phenotypes. Ovarian cancer is of particular interest for metabolic research due to its heterogeneous nature, with five distinct subtypes having been identified, each of which may display a unique metabolic signature. To elucidate metabolic differences, constraint-based modelling (CBM) represents a powerful technology, inviting the integration of 'omics' data, such as transcriptomics. However, many CBM methods have not prioritised accurate growth rate predictions, and there are very few ovarian cancer genome-scale studies. Here, a novel method for CBM has been developed, employing the genome-scale model Human1 and flux balance analysis, enabling the integration of in vitro growth rates, transcriptomics data and media conditions to predict the metabolic behaviour of cells. Using low- and high-grade ovarian cancer, subtype-specific metabolic differences have been predicted, which have been supported by publicly available CRISPR-Cas9 data from the Cancer Cell Line Encyclopaedia and an extensive literature review. Metabolic drivers of aggressive, invasive phenotypes, as well as pathways responsible for increased chemoresistance in low-grade cell lines have been suggested. Experimental gene dependency data has been used to validate areas of the pentose phosphate pathway as essential for low-grade cellular growth, highlighting potential vulnerabilities for this ovarian cancer subtype.

卵巢癌是一种侵袭性、异质性疾病,具有诊断晚和对化疗耐药的特点。卵巢癌的临床特征可以通过研究其代谢以及特定通路的调控与个体表型之间的联系来解释。卵巢癌的异质性使其成为代谢研究的热点,目前已发现五种不同的亚型,每种亚型都可能显示出独特的代谢特征。为了阐明代谢差异,基于约束的建模(CBM)是一项强大的技术,它可以整合转录组学等 "全息 "数据。然而,许多 CBM 方法都没有优先考虑准确的生长率预测,而且卵巢癌基因组规模的研究也很少。在此,我们开发了一种新的 CBM 方法,利用基因组尺度模型 Human1 和通量平衡分析,整合体外生长速率、转录组学数据和培养基条件,预测细胞的代谢行为。利用低分化卵巢癌和高分化卵巢癌,预测了亚型特异性代谢差异,这些差异得到了癌症细胞系百科全书中公开可用的 CRISPR-Cas9 数据和大量文献综述的支持。我们提出了侵袭性、侵袭性表型的代谢驱动因素,以及导致低分化细胞系化疗耐药性增强的途径。实验基因依赖性数据被用来验证磷酸戊糖通路的某些区域对低分化细胞的生长至关重要,突出了这种卵巢癌亚型的潜在脆弱性。
{"title":"Constraint-based modelling predicts metabolic signatures of low and high-grade serous ovarian cancer.","authors":"Kate E Meeson, Jean-Marc Schwartz","doi":"10.1038/s41540-024-00418-5","DOIUrl":"10.1038/s41540-024-00418-5","url":null,"abstract":"<p><p>Ovarian cancer is an aggressive, heterogeneous disease, burdened with late diagnosis and resistance to chemotherapy. Clinical features of ovarian cancer could be explained by investigating its metabolism, and how the regulation of specific pathways links to individual phenotypes. Ovarian cancer is of particular interest for metabolic research due to its heterogeneous nature, with five distinct subtypes having been identified, each of which may display a unique metabolic signature. To elucidate metabolic differences, constraint-based modelling (CBM) represents a powerful technology, inviting the integration of 'omics' data, such as transcriptomics. However, many CBM methods have not prioritised accurate growth rate predictions, and there are very few ovarian cancer genome-scale studies. Here, a novel method for CBM has been developed, employing the genome-scale model Human1 and flux balance analysis, enabling the integration of in vitro growth rates, transcriptomics data and media conditions to predict the metabolic behaviour of cells. Using low- and high-grade ovarian cancer, subtype-specific metabolic differences have been predicted, which have been supported by publicly available CRISPR-Cas9 data from the Cancer Cell Line Encyclopaedia and an extensive literature review. Metabolic drivers of aggressive, invasive phenotypes, as well as pathways responsible for increased chemoresistance in low-grade cell lines have been suggested. Experimental gene dependency data has been used to validate areas of the pentose phosphate pathway as essential for low-grade cellular growth, highlighting potential vulnerabilities for this ovarian cancer subtype.</p>","PeriodicalId":19345,"journal":{"name":"NPJ Systems Biology and Applications","volume":"10 1","pages":"96"},"PeriodicalIF":3.5,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11344801/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142056191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SignalingProfiler 2.0 a network-based approach to bridge multi-omics data to phenotypic hallmarks. SignalingProfiler 2.0 是一种基于网络的方法,可将多组学数据与表型特征联系起来。
IF 3.5 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-23 DOI: 10.1038/s41540-024-00417-6
Veronica Venafra, Francesca Sacco, Livia Perfetto

Unraveling how cellular signaling is remodeled upon perturbation is crucial for understanding disease mechanisms and identifying potential drug targets. In this pursuit, computational tools generating mechanistic hypotheses from multi-omics data have invaluable potential. Here, we present a newly implemented version (2.0) of SignalingProfiler, a multi-step pipeline to draw mechanistic hypotheses on the signaling events impacting cellular phenotypes. SignalingProfiler 2.0 derives context-specific signaling networks by integrating proteogenomic data with the prior knowledge-causal network. This is a freely accessible and flexible tool that incorporates statistical, footprint-based, and graph algorithms to accelerate the integration and interpretation of multi-omics data. Through a benchmarking process on three proof-of-concept studies, we demonstrate the tool's ability to generate hierarchical mechanistic networks recapitulating novel and known perturbed signaling and phenotypic outcomes, in both human and mice contexts. In summary, SignalingProfiler 2.0 addresses the emergent need to derive biologically relevant information from complex multi-omics data by extracting interpretable networks.

揭示细胞信号在受到干扰时是如何重塑的,对于了解疾病机制和确定潜在的药物靶点至关重要。在这一过程中,从多组学数据中生成机理假设的计算工具具有不可估量的潜力。在这里,我们介绍了 SignalingProfiler 的最新实施版本(2.0),它是一个多步骤管道,用于得出影响细胞表型的信号事件的机理假设。SignalingProfiler 2.0通过整合蛋白质基因组数据与先验知识-因果网络,推导出特定背景的信号网络。这是一款可免费使用的灵活工具,它结合了统计、基于足迹和图的算法,可加快多组学数据的整合和解释。通过对三项概念验证研究进行基准测试,我们证明了该工具有能力在人类和小鼠环境中生成层次分明的机理网络,重现新的和已知的扰动信号转导和表型结果。总之,SignalingProfiler 2.0 通过提取可解释的网络,满足了从复杂的多组学数据中获取生物相关信息的新需求。
{"title":"SignalingProfiler 2.0 a network-based approach to bridge multi-omics data to phenotypic hallmarks.","authors":"Veronica Venafra, Francesca Sacco, Livia Perfetto","doi":"10.1038/s41540-024-00417-6","DOIUrl":"10.1038/s41540-024-00417-6","url":null,"abstract":"<p><p>Unraveling how cellular signaling is remodeled upon perturbation is crucial for understanding disease mechanisms and identifying potential drug targets. In this pursuit, computational tools generating mechanistic hypotheses from multi-omics data have invaluable potential. Here, we present a newly implemented version (2.0) of SignalingProfiler, a multi-step pipeline to draw mechanistic hypotheses on the signaling events impacting cellular phenotypes. SignalingProfiler 2.0 derives context-specific signaling networks by integrating proteogenomic data with the prior knowledge-causal network. This is a freely accessible and flexible tool that incorporates statistical, footprint-based, and graph algorithms to accelerate the integration and interpretation of multi-omics data. Through a benchmarking process on three proof-of-concept studies, we demonstrate the tool's ability to generate hierarchical mechanistic networks recapitulating novel and known perturbed signaling and phenotypic outcomes, in both human and mice contexts. In summary, SignalingProfiler 2.0 addresses the emergent need to derive biologically relevant information from complex multi-omics data by extracting interpretable networks.</p>","PeriodicalId":19345,"journal":{"name":"NPJ Systems Biology and Applications","volume":"10 1","pages":"95"},"PeriodicalIF":3.5,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11343843/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142046959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrated-omics analysis with explainable deep networks on pathobiology of infant bronchiolitis. 利用可解释深度网络对婴儿支气管炎病理生物学进行综合组学分析
IF 3.5 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-22 DOI: 10.1038/s41540-024-00420-x
Tadao Ooka, Naoto Usuyama, Ryohei Shibata, Michihito Kyo, Jonathan M Mansbach, Zhaozhong Zhu, Carlos A Camargo, Kohei Hasegawa

Bronchiolitis is the leading cause of infant hospitalization. However, the molecular networks driving bronchiolitis pathobiology remain unknown. Integrative molecular networks, including the transcriptome and metabolome, can identify functional and regulatory pathways contributing to disease severity. Here, we integrated nasopharyngeal transcriptome and metabolome data of 397 infants hospitalized with bronchiolitis in a 17-center prospective cohort study. Using an explainable deep network model, we identified an omics-cluster comprising 401 transcripts and 38 metabolites that distinguishes bronchiolitis severity (test-set AUC, 0.828). This omics-cluster derived a molecular network, where innate immunity-related metabolites (e.g., ceramides) centralized and were characterized by toll-like receptor (TLR) and NF-κB signaling pathways (both FDR < 0.001). The network analyses identified eight modules and 50 existing drug candidates for repurposing, including prostaglandin I2 analogs (e.g., iloprost), which promote anti-inflammatory effects through TLR signaling. Our approach facilitates not only the identification of molecular networks underlying infant bronchiolitis but the development of pioneering treatment strategies.

支气管炎是婴儿住院治疗的主要原因。然而,驱动支气管炎病理生物学的分子网络仍然未知。包括转录组和代谢组在内的整合分子网络可以确定导致疾病严重程度的功能和调控途径。在此,我们整合了一项 17 个中心的前瞻性队列研究中 397 名因支气管炎住院的婴儿的鼻咽转录组和代谢组数据。利用可解释的深度网络模型,我们发现了一个由 401 个转录本和 38 个代谢物组成的全局集群(omics-cluster),它能区分支气管炎的严重程度(测试集 AUC 为 0.828)。这个全元素集群衍生出一个分子网络,在这个网络中,先天免疫相关代谢物(如神经酰胺)处于中心位置,并以收费样受体(TLR)和 NF-κB 信号通路(均为 FDR 2 类似物(如伊洛前列素),通过 TLR 信号促进抗炎作用)为特征。我们的方法不仅有助于确定婴儿支气管炎的分子网络,还有助于开发开创性的治疗策略。
{"title":"Integrated-omics analysis with explainable deep networks on pathobiology of infant bronchiolitis.","authors":"Tadao Ooka, Naoto Usuyama, Ryohei Shibata, Michihito Kyo, Jonathan M Mansbach, Zhaozhong Zhu, Carlos A Camargo, Kohei Hasegawa","doi":"10.1038/s41540-024-00420-x","DOIUrl":"10.1038/s41540-024-00420-x","url":null,"abstract":"<p><p>Bronchiolitis is the leading cause of infant hospitalization. However, the molecular networks driving bronchiolitis pathobiology remain unknown. Integrative molecular networks, including the transcriptome and metabolome, can identify functional and regulatory pathways contributing to disease severity. Here, we integrated nasopharyngeal transcriptome and metabolome data of 397 infants hospitalized with bronchiolitis in a 17-center prospective cohort study. Using an explainable deep network model, we identified an omics-cluster comprising 401 transcripts and 38 metabolites that distinguishes bronchiolitis severity (test-set AUC, 0.828). This omics-cluster derived a molecular network, where innate immunity-related metabolites (e.g., ceramides) centralized and were characterized by toll-like receptor (TLR) and NF-κB signaling pathways (both FDR < 0.001). The network analyses identified eight modules and 50 existing drug candidates for repurposing, including prostaglandin I<sub>2</sub> analogs (e.g., iloprost), which promote anti-inflammatory effects through TLR signaling. Our approach facilitates not only the identification of molecular networks underlying infant bronchiolitis but the development of pioneering treatment strategies.</p>","PeriodicalId":19345,"journal":{"name":"NPJ Systems Biology and Applications","volume":"10 1","pages":"93"},"PeriodicalIF":3.5,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11341550/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142036502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing structural uncertainty of biochemical regulatory networks in metabolic pathways under varying data quality. 评估不同数据质量下代谢途径中生化调控网络结构的不确定性。
IF 3.5 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-22 DOI: 10.1038/s41540-024-00412-x
Yue Han, Mark P Styczynski

Ordinary differential equation (ODE) models are powerful tools for studying the dynamics of metabolic pathways. However, key challenges lie in constructing ODE models for metabolic pathways, specifically in our limited knowledge about which metabolite levels control which reaction rates. Identification of these regulatory networks is further complicated by the limited availability of relevant data. Here, we assess the conditions under which it is feasible to accurately identify regulatory networks in metabolic pathways by computationally fitting candidate network models with biochemical systems theory (BST) kinetics to data of varying quality. We use network motifs commonly found in metabolic pathways as a simplified testbed. Key features correlated with the level of difficulty in identifying the correct regulatory network were identified, highlighting the impact of sampling rate, data noise, and data incompleteness on structural uncertainty. We found that for a simple branched network motif with an equal number of metabolites and fluxes, identification of the correct regulatory network can be largely achieved and is robust to missing one of the metabolite profiles. However, with a bi-substrate bi-product reaction or more fluxes than metabolites in the network motif, the identification becomes more challenging. Stronger regulatory interactions and higher metabolite concentrations were found to be correlated with less structural uncertainty. These results could aid efforts to predict whether the true metabolic regulatory network can be computationally identified for a given stoichiometric network topology and dataset quality, thus helping to identify optimal measures to mitigate such identifiability issues in kinetic model development.

常微分方程(ODE)模型是研究代谢途径动态的有力工具。然而,为代谢途径构建 ODE 模型的关键挑战在于我们对哪些代谢物水平控制哪些反应速率的了解有限。由于相关数据有限,这些调控网络的识别变得更加复杂。在这里,我们通过计算将候选网络模型与生化系统理论(BST)动力学拟合到不同质量的数据中,来评估在什么条件下可以准确识别代谢途径中的调控网络。我们使用代谢途径中常见的网络图案作为简化的试验平台。我们确定了与识别正确调控网络的难度相关的关键特征,突出了采样率、数据噪声和数据不完整性对结构不确定性的影响。我们发现,对于代谢物和通量数量相等的简单分支网络图案,基本上可以识别出正确的调控网络,而且对缺少其中一个代谢物图谱的情况也很稳健。然而,当网络图案中存在双底物双产物反应或通量多于代谢物时,识别工作就变得更具挑战性。研究发现,较强的调控相互作用和较高的代谢物浓度与较小的结构不确定性相关。这些结果有助于预测在给定的化学计量网络拓扑结构和数据集质量下,是否能通过计算识别出真正的代谢调控网络,从而帮助确定最佳措施,以减轻动力学模型开发中的可识别性问题。
{"title":"Assessing structural uncertainty of biochemical regulatory networks in metabolic pathways under varying data quality.","authors":"Yue Han, Mark P Styczynski","doi":"10.1038/s41540-024-00412-x","DOIUrl":"10.1038/s41540-024-00412-x","url":null,"abstract":"<p><p>Ordinary differential equation (ODE) models are powerful tools for studying the dynamics of metabolic pathways. However, key challenges lie in constructing ODE models for metabolic pathways, specifically in our limited knowledge about which metabolite levels control which reaction rates. Identification of these regulatory networks is further complicated by the limited availability of relevant data. Here, we assess the conditions under which it is feasible to accurately identify regulatory networks in metabolic pathways by computationally fitting candidate network models with biochemical systems theory (BST) kinetics to data of varying quality. We use network motifs commonly found in metabolic pathways as a simplified testbed. Key features correlated with the level of difficulty in identifying the correct regulatory network were identified, highlighting the impact of sampling rate, data noise, and data incompleteness on structural uncertainty. We found that for a simple branched network motif with an equal number of metabolites and fluxes, identification of the correct regulatory network can be largely achieved and is robust to missing one of the metabolite profiles. However, with a bi-substrate bi-product reaction or more fluxes than metabolites in the network motif, the identification becomes more challenging. Stronger regulatory interactions and higher metabolite concentrations were found to be correlated with less structural uncertainty. These results could aid efforts to predict whether the true metabolic regulatory network can be computationally identified for a given stoichiometric network topology and dataset quality, thus helping to identify optimal measures to mitigate such identifiability issues in kinetic model development.</p>","PeriodicalId":19345,"journal":{"name":"NPJ Systems Biology and Applications","volume":"10 1","pages":"94"},"PeriodicalIF":3.5,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11341918/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142036501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using DeepSignalingFlow to mine signaling flows interpreting mechanism of synergy of cocktails. 使用 DeepSignalingFlow 挖掘信号流,解释鸡尾酒的协同作用机制。
IF 3.5 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-21 DOI: 10.1038/s41540-024-00421-w
Heming Zhang, Yixin Chen, Philip Payne, Fuhai Li

Complex signaling pathways are believed to be responsible for drug resistance. Drug combinations perturbing multiple signaling targets have the potential to reduce drug resistance. The large-scale multi-omic datasets and experimental drug combination synergistic score data are valuable resources to study mechanisms of synergy (MoS) to guide the development of precision drug combinations. However, signaling patterns of MoS are complex and remain unclear, and thus it is challenging to identify synergistic drug combinations in clinical. Herein, we proposed a novel integrative and interpretable graph AI model, DeepSignalingFlow, to uncover the MoS by integrating and mining multi-omic data. The major innovation is that we uncover MoS by modeling the signaling flow from multi-omic features of essential disease proteins to the drug targets, which has not been introduced by the existing models. The model performance was assessed utilizing four distinct drug combination synergy evaluation datasets, i.e., NCI ALMANAC, O'Neil, DrugComb, and DrugCombDB. The comparison results showed that the proposed model outperformed existing graph AI models in terms of synergy score prediction, and can interpret MoS using the core signaling flows. The code is publicly accessible via Github: https://github.com/FuhaiLiAiLab/DeepSignalingFlow.

复杂的信号通路被认为是导致耐药性的原因。干扰多个信号靶点的药物组合有可能减少耐药性。大规模多组学数据集和实验性药物组合协同得分数据是研究协同机制(MoS)的宝贵资源,可用于指导精准药物组合的开发。然而,MoS 的信号转导模式十分复杂,目前仍不清楚,因此在临床上识别协同药物组合具有挑战性。在此,我们提出了一种新颖的整合性和可解释性图人工智能模型--DeepSignalingFlow,通过整合和挖掘多组学数据来揭示MoS。该模型的主要创新之处在于,我们通过模拟从基本疾病蛋白的多组学特征到药物靶点的信号流来揭示MoS,而现有模型并未引入这种信号流。我们利用 NCI ALMANAC、O'Neil、DrugComb 和 DrugCombDB 这四个不同的药物组合协同作用评估数据集对该模型的性能进行了评估。比较结果表明,所提出的模型在协同作用得分预测方面优于现有的图人工智能模型,并能利用核心信号流解释 MoS。代码可通过 Github 公开访问:https://github.com/FuhaiLiAiLab/DeepSignalingFlow。
{"title":"Using DeepSignalingFlow to mine signaling flows interpreting mechanism of synergy of cocktails.","authors":"Heming Zhang, Yixin Chen, Philip Payne, Fuhai Li","doi":"10.1038/s41540-024-00421-w","DOIUrl":"10.1038/s41540-024-00421-w","url":null,"abstract":"<p><p>Complex signaling pathways are believed to be responsible for drug resistance. Drug combinations perturbing multiple signaling targets have the potential to reduce drug resistance. The large-scale multi-omic datasets and experimental drug combination synergistic score data are valuable resources to study mechanisms of synergy (MoS) to guide the development of precision drug combinations. However, signaling patterns of MoS are complex and remain unclear, and thus it is challenging to identify synergistic drug combinations in clinical. Herein, we proposed a novel integrative and interpretable graph AI model, DeepSignalingFlow, to uncover the MoS by integrating and mining multi-omic data. The major innovation is that we uncover MoS by modeling the signaling flow from multi-omic features of essential disease proteins to the drug targets, which has not been introduced by the existing models. The model performance was assessed utilizing four distinct drug combination synergy evaluation datasets, i.e., NCI ALMANAC, O'Neil, DrugComb, and DrugCombDB. The comparison results showed that the proposed model outperformed existing graph AI models in terms of synergy score prediction, and can interpret MoS using the core signaling flows. The code is publicly accessible via Github: https://github.com/FuhaiLiAiLab/DeepSignalingFlow.</p>","PeriodicalId":19345,"journal":{"name":"NPJ Systems Biology and Applications","volume":"10 1","pages":"92"},"PeriodicalIF":3.5,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11339460/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142018176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
NPJ Systems Biology and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1