bioRxiv - Systems Biology最新文献

Decoding Cytokine Networks in Ulcerative Colitis to Identify Pathogenic Mechanisms and Therapeutic Targets 解码溃疡性结肠炎的细胞因子网络，确定致病机制和治疗靶点

bioRxiv - Systems Biology

Pub Date : 2024-09-16 DOI: 10.1101/2024.09.12.612623

Marton L. Olbei, Isabelle Hautefort, John P Thomas, Luca L. Csabai, Balazs Bogar, Hajir Ibraheim, Aamir Saifuddin, Dezso Modos, Nick Powell, Tamas Korcsmaros

Ulcerative colitis (UC) is a chronic inflammatory disorder of the gastrointestinal tract characterised by dysregulated cytokine signalling. Despite the advent of advanced therapies targeting cytokine signalling, treatment outcomes for UC patients remain suboptimal. Hence, there is a pressing need to better understand the complexity of cytokine regulation in UC by comprehensively mapping the interconnected cytokine signalling networks that are perturbed in UC patients. To address this, we undertook systems immunology modelling of single-cell transcriptomics data from colonic biopsies of treatment-naive and treatment-exposed UC patients to build complex cytokine signalling networks underpinned by putative cytokine-cytokine interactions. The generated cytokine networks effectively captured known physiologically relevant cytokine-cytokine interactions which we recapitulated in vitro in UC patient-derived colonic epithelial organoids. These networks revealed new aspects of UC pathogenesis, including a cytokine subnetwork that is unique to treatment-naive UC patients, the identification of highly rewired cytokines across UC disease states (IL22, TL1A, IL23A, and OSM), JAK paralogue-specific cytokine-cytokine interactions, and the positioning of TL1A as an important upstream regulator of TNF and IL23A as well as an attractive therapeutic target. Overall, these findings open up several avenues for guiding future cytokine-targeting therapeutic approaches in UC, and the presented methodology can be readily applied to gain similar insights into other immune-mediated inflammatory diseases (IMIDs).

溃疡性结肠炎（UC）是一种以细胞因子信号失调为特征的慢性胃肠道炎症性疾病。尽管出现了针对细胞因子信号的先进疗法，但 UC 患者的治疗效果仍不理想。因此，迫切需要通过全面绘制在 UC 患者中受到干扰的相互关联的细胞因子信号网络来更好地了解 UC 中细胞因子调控的复杂性。为了解决这个问题，我们对未接受治疗和接受治疗的 UC 患者结肠活检的单细胞转录组学数据进行了系统免疫学建模，以构建由假定的细胞因子-细胞因子相互作用支撑的复杂细胞因子信号网络。生成的细胞因子网络有效地捕捉到了已知的生理相关细胞因子-细胞因子相互作用，我们在体外重现了 UC 患者来源的结肠上皮细胞器官组织。这些网络揭示了 UC 发病机制的新方面，包括治疗无效的 UC 患者特有的细胞因子亚网络、UC 疾病状态中高度重联的细胞因子（IL22、TL1A、IL23A 和 OSM）的鉴定、JAK 对位基因特异性细胞因子-细胞因子相互作用，以及将 TL1A 定位为 TNF 和 IL23A 的重要上游调节因子和有吸引力的治疗靶点。总之，这些发现为指导未来针对 UC 的细胞因子治疗方法开辟了几条途径，而且所介绍的方法也可随时应用于对其他免疫介导的炎症性疾病（IMIDs）的类似研究。

{"title":"Decoding Cytokine Networks in Ulcerative Colitis to Identify Pathogenic Mechanisms and Therapeutic Targets","authors":"Marton L. Olbei, Isabelle Hautefort, John P Thomas, Luca L. Csabai, Balazs Bogar, Hajir Ibraheim, Aamir Saifuddin, Dezso Modos, Nick Powell, Tamas Korcsmaros","doi":"10.1101/2024.09.12.612623","DOIUrl":"https://doi.org/10.1101/2024.09.12.612623","url":null,"abstract":"Ulcerative colitis (UC) is a chronic inflammatory disorder of the gastrointestinal tract characterised by dysregulated cytokine signalling. Despite the advent of advanced therapies targeting cytokine signalling, treatment outcomes for UC patients remain suboptimal. Hence, there is a pressing need to better understand the complexity of cytokine regulation in UC by comprehensively mapping the interconnected cytokine signalling networks that are perturbed in UC patients. To address this, we undertook systems immunology modelling of single-cell transcriptomics data from colonic biopsies of treatment-naive and treatment-exposed UC patients to build complex cytokine signalling networks underpinned by putative cytokine-cytokine interactions. The generated cytokine networks effectively captured known physiologically relevant cytokine-cytokine interactions which we recapitulated in vitro in UC patient-derived colonic epithelial organoids. These networks revealed new aspects of UC pathogenesis, including a cytokine subnetwork that is unique to treatment-naive UC patients, the identification of highly rewired cytokines across UC disease states (IL22, TL1A, IL23A, and OSM), JAK paralogue-specific cytokine-cytokine interactions, and the positioning of TL1A as an important upstream regulator of TNF and IL23A as well as an attractive therapeutic target. Overall, these findings open up several avenues for guiding future cytokine-targeting therapeutic approaches in UC, and the presented methodology can be readily applied to gain similar insights into other immune-mediated inflammatory diseases (IMIDs).","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"202 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

High-content microscopy and machine learning characterize a cell morphology signature of NF1 genotype in Schwann cells 高含量显微镜和机器学习描述了许旺细胞中 NF1 基因型的细胞形态特征

bioRxiv - Systems Biology

Pub Date : 2024-09-16 DOI: 10.1101/2024.09.11.612546

Jenna Tomkinson, Cameron Mattson, Michelle Mattson-Hoss, Herb Sarnoff, Stephanie J Bouley, James A Walker, Gregory P Way

Neurofibromatosis type 1 (NF1) is a multi-system, autosomal dominant genetic disorder driven by the systemic loss of the NF1 protein neurofibromin. Loss of neurofibromin in Schwann cells is particularly detrimental, as the acquisition of a second-hit (e.g., complete loss of NF1) can lead to the development of plexiform neurofibroma tumors. Plexiform neurofibromas are painful, disfiguring tumors with an approximately 1 in 5 chance of sarcoma transition. Selumetinib is currently the only medicine approved by the U.S. Food and Drug Administration (FDA) for the treatment of plexiform neurofibromas in a subset of patients. This motivates the need to develop new therapies, either derived to treat NF1 haploinsufficiency or complete loss of NF1 function. To identify new therapies, we need to understand the impact neurofibromin has on Schwann cells. Here, we aimed to characterize differences in high-content microscopy imaging in neurofibromin-deficient Schwann cells. We applied a fluorescence microscopy assay (called Cell Painting) to two isogenic Schwann cell lines, one of wildtype genotype (NF1+/+) and one of NF1 null genotype (NF1-/-). We modified the canonical Cell Painting assay to mark four organelles/subcellular compartments: nuclei, endoplasmic reticulum, mitochondria, and F-actin. We utilized CellProfiler pipelines to perform quality control, illumination correction, segmentation, and cell morphology feature extraction. We segmented 22,585 NF1 wildtype and null cells, utilized 907 significant cell morphology features representing various organelle shapes and intensity patterns, and trained a logistic regression machine learning model to predict the NF1 genotype of single Schwann cells. The machine learning model had high performance, with training and testing data yielding a balanced accuracy of 0.85 and 0.80, respectively. All of our data processing and analyses are freely available on GitHub. We look to improve upon this preliminary model in the future by applying it to large-scale drug screens of NF1 deficient cells to identify candidate drugs that return NF1 patient Schwann cells to phenocopy NF1 wildtype and healthier phenotype.

神经纤维瘤病 1 型（NF1）是一种多系统、常染色体显性遗传疾病，由 NF1 蛋白神经纤维瘤蛋白的系统性缺失引起。神经纤维瘤蛋白在许旺细胞中的缺失尤其有害，因为获得二击（如 NF1 完全缺失）可导致丛状神经纤维瘤的发生。丛状神经纤维瘤是一种疼痛、毁容性肿瘤，约有五分之一的几率转变为肉瘤。目前，Selumetinib 是美国食品和药物管理局（FDA）批准用于治疗部分患者丛状神经纤维瘤的唯一药物。这促使我们需要开发新的疗法，以治疗NF1单倍体缺乏症或NF1功能完全丧失。为了确定新疗法，我们需要了解神经纤维瘤蛋白对许旺细胞的影响。在这里，我们旨在描述神经纤维瘤蛋白缺陷型许旺细胞中高内容显微成像的差异。我们在两个同源的许旺细胞系中应用了一种荧光显微镜检测方法（称为 "细胞绘画"），一个是野生型基因型（NF1+/+），另一个是 NF1 基因型（NF1-/-）。我们修改了经典的细胞绘制检测方法，以标记四个细胞器/亚细胞区室：细胞核、内质网、线粒体和 F-肌动蛋白。我们利用 CellProfiler 管道进行质量控制、光照校正、分割和细胞形态特征提取。我们分割了 22,585 个 NF1 野生型和无效型细胞，利用代表各种细胞器形状和强度模式的 907 个重要细胞形态特征，并训练了一个逻辑回归机器学习模型来预测单个许旺细胞的 NF1 基因型。机器学习模型的性能很高，训练和测试数据的平衡准确率分别为 0.85 和 0.80。我们所有的数据处理和分析都可在 GitHub 上免费获取。我们希望在未来对这一初步模型进行改进，将其应用于 NF1 缺陷细胞的大规模药物筛选，以确定能使 NF1 患者许旺细胞恢复到 NF1 野生型表型和更健康表型的候选药物。

{"title":"High-content microscopy and machine learning characterize a cell morphology signature of NF1 genotype in Schwann cells","authors":"Jenna Tomkinson, Cameron Mattson, Michelle Mattson-Hoss, Herb Sarnoff, Stephanie J Bouley, James A Walker, Gregory P Way","doi":"10.1101/2024.09.11.612546","DOIUrl":"https://doi.org/10.1101/2024.09.11.612546","url":null,"abstract":"Neurofibromatosis type 1 (NF1) is a multi-system, autosomal dominant genetic disorder driven by the systemic loss of the NF1 protein neurofibromin. Loss of neurofibromin in Schwann cells is particularly detrimental, as the acquisition of a second-hit (e.g., complete loss of NF1) can lead to the development of plexiform neurofibroma tumors. Plexiform neurofibromas are painful, disfiguring tumors with an approximately 1 in 5 chance of sarcoma transition. Selumetinib is currently the only medicine approved by the U.S. Food and Drug Administration (FDA) for the treatment of plexiform neurofibromas in a subset of patients. This motivates the need to develop new therapies, either derived to treat NF1 haploinsufficiency or complete loss of NF1 function. To identify new therapies, we need to understand the impact neurofibromin has on Schwann cells. Here, we aimed to characterize differences in high-content microscopy imaging in neurofibromin-deficient Schwann cells. We applied a fluorescence microscopy assay (called Cell Painting) to two isogenic Schwann cell lines, one of wildtype genotype (NF1+/+) and one of NF1 null genotype (NF1-/-). We modified the canonical Cell Painting assay to mark four organelles/subcellular compartments: nuclei, endoplasmic reticulum, mitochondria, and F-actin. We utilized CellProfiler pipelines to perform quality control, illumination correction, segmentation, and cell morphology feature extraction. We segmented 22,585 NF1 wildtype and null cells, utilized 907 significant cell morphology features representing various organelle shapes and intensity patterns, and trained a logistic regression machine learning model to predict the NF1 genotype of single Schwann cells. The machine learning model had high performance, with training and testing data yielding a balanced accuracy of 0.85 and 0.80, respectively. All of our data processing and analyses are freely available on GitHub. We look to improve upon this preliminary model in the future by applying it to large-scale drug screens of NF1 deficient cells to identify candidate drugs that return NF1 patient Schwann cells to phenocopy NF1 wildtype and healthier phenotype.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Tissue-specific metabolomic signatures for a doublesex model of reduced sexual dimorphism 性二型减少的双性模型的组织特异性代谢组特征

bioRxiv - Systems Biology

Pub Date : 2024-09-16 DOI: 10.1101/2024.09.11.612537

Rene Coig, Benjamin R Harrison, Richard S Johnson, Michael J MacCoss, Daniel EL Promislow

Sex has a major effect on the metabolome. However, we do not yet understand the degree to which these quantitative sex differences in metabolism are associated with anatomical dimorphism and modulated by sex-specific tissues. In the fruit fly, Drosophila melanogaster, knocking out the doublesex (dsx) gene gives rise to adults with intermediate sex characteristics. Here we sought to determine the degree to which this key node in sexual development leads to sex differences in the fly metabolome. We measured 91 metabolites across head, thorax and abdomen in Drosophila, comparing the differences between distinctly sex-dimorphic flies with those of reduced sexual dimorphism: dsx null flies. Notably, in the reduced dimorphism flies, we observed a sex difference in only 1 of 91 metabolites, kynurenate, whereas 51% of metabolites (46/91) were significantly different between wildtype XX and XY flies in at least one tissue, suggesting that dsx plays a major role in sex differences in fly metabolism. Kynurenate was consistently higher in XX flies in both the presence and absence of functioning dsx. We observed tissue-specific consequences of knocking out dsx. Metabolites affected by sex were significantly enriched in branched chain amino acid metabolism and the mTOR pathway. This highlights the importance of considering variation in genes that cause anatomical sexual dimorphism when analyzing sex differences in metabolic profiles and interpreting their biological significance.

性别对代谢组有重大影响。然而，我们还不了解代谢中的这些定量性别差异在多大程度上与解剖学上的二态性有关，以及在多大程度上受性别特异性组织的调节。在果蝇黑腹果蝇中，敲除双性（dsx）基因会产生具有中间性特征的成虫。在这里，我们试图确定这一性发育的关键节点在多大程度上导致了果蝇代谢组的性别差异。我们测量了果蝇头部、胸部和腹部的 91 种代谢物，比较了明显的性别二形性果蝇与性别二形性降低的果蝇之间的差异：dsx 基因无效果蝇。值得注意的是，在性二态性降低的果蝇中，我们只观察到 91 种代谢物中的 1 种（犬尿酸盐）存在性别差异，而野生型 XX 和 XY 果蝇之间至少有 51% 的代谢物（46/91）在一种组织中存在显著差异，这表明dsx 在果蝇代谢的性别差异中起着重要作用。在存在和不存在功能dsx的情况下，XX蝇的犬尿酸盐一直较高。我们观察到敲除dsx对特定组织的影响。受性别影响的代谢物在支链氨基酸代谢和 mTOR 途径中明显富集。这凸显了在分析代谢特征的性别差异和解释其生物学意义时，考虑导致解剖学性二态性的基因变异的重要性。

{"title":"Tissue-specific metabolomic signatures for a doublesex model of reduced sexual dimorphism","authors":"Rene Coig, Benjamin R Harrison, Richard S Johnson, Michael J MacCoss, Daniel EL Promislow","doi":"10.1101/2024.09.11.612537","DOIUrl":"https://doi.org/10.1101/2024.09.11.612537","url":null,"abstract":"Sex has a major effect on the metabolome. However, we do not yet understand the degree to which these quantitative sex differences in metabolism are associated with anatomical dimorphism and modulated by sex-specific tissues. In the fruit fly, Drosophila melanogaster, knocking out the doublesex (dsx) gene gives rise to adults with intermediate sex characteristics. Here we sought to determine the degree to which this key node in sexual development leads to sex differences in the fly metabolome. We measured 91 metabolites across head, thorax and abdomen in Drosophila, comparing the differences between distinctly sex-dimorphic flies with those of reduced sexual dimorphism: dsx null flies. Notably, in the reduced dimorphism flies, we observed a sex difference in only 1 of 91 metabolites, kynurenate, whereas 51% of metabolites (46/91) were significantly different between wildtype XX and XY flies in at least one tissue, suggesting that dsx plays a major role in sex differences in fly metabolism. Kynurenate was consistently higher in XX flies in both the presence and absence of functioning dsx. We observed tissue-specific consequences of knocking out dsx. Metabolites affected by sex were significantly enriched in branched chain amino acid metabolism and the mTOR pathway. This highlights the importance of considering variation in genes that cause anatomical sexual dimorphism when analyzing sex differences in metabolic profiles and interpreting their biological significance.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"31 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Environment-mediated interactions cause an externalized and collective memory in microbes 环境介导的相互作用导致微生物的外化和集体记忆

bioRxiv - Systems Biology

Pub Date : 2024-09-14 DOI: 10.1101/2024.09.09.612037

Shubham Gajrani, Xiaozhou Ye, Christoph Ratzke

Microbes usually live in complex communities interacting with many other microbial species. These interactions determine who can persist in a community and how the overall community forms and functions. Microbes often exert interactions by chemically changing the environment, like taking up nutrients or producing toxins. These environmental changes can persist over time. We show here that such lasting environmental changes can cause a memory effect where current growth conditions alter interaction outcomes in the future. Importantly, this memory is only stored in the environment and not inside the bacterial cells. Only the collective effort of many bacteria can build up this memory, making it an emergent property of bacterial populations. This population memory can also impact the assembly of more complex communities and lead to different final communities depending on the system's past. Overall, we show that to understand interaction outcomes fully, we not only have to consider the interacting species and abiotic conditions but also the system's history.

微生物通常生活在复杂的群落中，与许多其他微生物物种相互作用。这些相互作用决定了谁能在群落中生存，以及整个群落是如何形成和发挥作用的。微生物通常通过化学方式改变环境，如吸收营养或产生毒素，来实现相互作用。这些环境变化会长期存在。我们在此证明，这种持久的环境变化会产生记忆效应，即当前的生长条件会改变未来的相互作用结果。重要的是，这种记忆只储存在环境中，而不是细菌细胞内。只有许多细菌的集体努力才能建立起这种记忆，从而使其成为细菌种群的一种突发特性。这种种群记忆还能影响更复杂群落的组装，并根据系统的过去导致不同的最终群落。总之，我们的研究表明，要充分理解相互作用的结果，我们不仅要考虑相互作用的物种和非生物条件，还要考虑系统的历史。

引用次数: 0

Parasitic and Commensal interactions among Mimiviruses, Sputnik-like virophages, and Transpovirons: A theoretical and dynamical systems approach. 米米病毒、类人造卫星病毒和跨病毒之间的寄生和共生相互作用：理论和动力系统方法

bioRxiv - Systems Biology

Pub Date : 2024-09-14 DOI: 10.1101/2024.09.13.610890

EMMANUEL ORTEGA ATEHORTUA, Juan Camilo Arboleda, Nicole Rivera, Gloria Machado, Boris Anghelo Rodriguez

Giant viruses have been in the scope of virologists since 2003 when they were isolated from Acanthamoeba spp. Giant viruses, in turn, get infected by another virus named virophage and a third biological entity that corresponds to a transpoviron which can be found in the capsids of giant and virophage viruses. So far, transpovirons seem to behave as commensal entities while some virophages exhibit commensal behavior under laboratory conditions. To study the system's behavior, we used a theoretical approximation and developed an ordinary differential equation model. The dynamical analysis showed that the system exhibits an oscillatory robust behavior leading to a hyperparasitic Lotka-Volterra dynamic. But the biological mechanism that underlines the transpoviron persistence over time remains unclear and its status as a commensal entity needs further assessment. Also, the ecological interaction that leads to the overall coexistence of the three viral entities needs to be further studied.

巨细胞病毒反过来又会受到另一种名为噬菌体的病毒和第三种生物实体的感染，后者相当于巨细胞病毒和噬菌体病毒外壳中的转环病毒。到目前为止，跨病毒似乎表现为共生实体，而一些噬菌体则在实验室条件下表现出共生行为。为了研究该系统的行为，我们采用了一种理论近似方法，并建立了一个常微分方程模型。动力学分析表明，该系统表现出一种振荡的稳健行为，导致一种超寄生的 Lotka-Volterra 动力。但是，跨环境长期存在的生物机制尚不清楚，其作为共生实体的地位也需要进一步评估。此外，导致三种病毒实体共存的生态相互作用也有待进一步研究。

引用次数: 0

Sequential design of single-cell experiments to identify discrete stochastic models for gene expression. 单细胞实验的序列设计，以确定基因表达的离散随机模型。

bioRxiv - Systems Biology

Pub Date : 2024-09-14 DOI: 10.1101/2024.09.12.612709

Joshua Cook, Eric Ron, Dmitri Svetlov, Luis Aguiulera, Brian Munsky

Control of gene regulation requires quantitatively accurate predictions of heterogeneous cellular responses. When inferred from single-cell experiments, discrete stochastic models can enable such predictions, but such experiments are highly adjustable, allowing for almost infinitely many potential designs (e.g., at different induction levels, for different measurement times, or considering different observed biological species). Not all experiments are equally informative, experiments are time-consuming or expensive to perform, and research begins with limited prior information with which to construct models. To address these concerns, we developed a sequential experiment design strategy that starts with simple preliminary experiments and then integrates chemical master equations to compute the likelihood of single-cell data, a Bayesian inference procedure to sample posterior parameter distributions, and a finite state projection based Fisher information matrix to estimate the expected information for different designs for subsequent experiments. Using simulated then real single-cell data, we determined practical working principles to reduce the overall number of experiments needed to achieve predictive, quantitative understanding of single-cell responses.

基因调控需要对异质性细胞反应进行精确的定量预测。当从单细胞实验中推断时，离散随机模型可实现此类预测，但此类实验的可调性很高，允许几乎无限多的潜在设计（例如，不同的诱导水平、不同的测量时间，或考虑不同的观察生物物种）。并非所有的实验都具有同样的信息量，实验耗时或成本高昂，而且研究开始时可用于构建模型的先验信息有限。为了解决这些问题，我们开发了一种顺序实验设计策略，从简单的初步实验开始，然后整合化学主方程来计算单细胞数据的可能性、贝叶斯推断程序来对后验参数分布进行采样，以及基于有限状态投影的费舍尔信息矩阵来估计后续实验不同设计的预期信息。利用模拟和真实的单细胞数据，我们确定了切实可行的工作原则，以减少实现对单细胞反应的预测性定量理解所需的实验总数。

{"title":"Sequential design of single-cell experiments to identify discrete stochastic models for gene expression.","authors":"Joshua Cook, Eric Ron, Dmitri Svetlov, Luis Aguiulera, Brian Munsky","doi":"10.1101/2024.09.12.612709","DOIUrl":"https://doi.org/10.1101/2024.09.12.612709","url":null,"abstract":"Control of gene regulation requires quantitatively accurate predictions of heterogeneous cellular responses. When inferred from single-cell experiments, discrete stochastic models can enable such predictions, but such experiments are highly adjustable, allowing for almost infinitely many potential designs (e.g., at different induction levels, for different measurement times, or considering different observed biological species). Not all experiments are equally informative, experiments are time-consuming or expensive to perform, and research begins with limited prior information with which to construct models. To address these concerns, we developed a sequential experiment design strategy that starts with simple preliminary experiments and then integrates chemical master equations to compute the likelihood of single-cell data, a Bayesian inference procedure to sample posterior parameter distributions, and a finite state projection based Fisher information matrix to estimate the expected information for different designs for subsequent experiments. Using simulated then real single-cell data, we determined practical working principles to reduce the overall number of experiments needed to achieve predictive, quantitative understanding of single-cell responses.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"16 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Computing Stackelberg Equilibrium for Cancer Treatment 计算癌症治疗的 Stackelberg 平衡

bioRxiv - Systems Biology

Pub Date : 2024-09-13 DOI: 10.1101/2024.09.09.612059

Sam Ganzfried

Recent work by Kleshnina et al. has presented a Stackelberg evolutionary game model in which the Stackelberg equilibrium strategy for the leading player corresponds to the optimal cancer treatment. We present an approach that is able to quickly and accurately solve the model presented in that work.

Kleshnina 等人最近的研究提出了一个斯塔克尔伯格演化博弈模型，其中领先玩家的斯塔克尔伯格均衡策略与最佳癌症治疗方法相对应。我们提出的方法能够快速、准确地求解该模型。

引用次数: 0

ElixirSeeker: A Machine Learning Framework Utilizing Attention-Driven Fusion of Molecular Fingerprints for the Discovery of Anti-Aging Compounds ElixirSeeker：利用注意力驱动的分子指纹融合发现抗衰老化合物的机器学习框架

bioRxiv - Systems Biology

Pub Date : 2024-09-13 DOI: 10.1101/2024.09.08.611839

Yan Pan, Hongxia Cai, Fang Ye, Wentao Xu, Zhihang Huang, Jingyuan Zhu, Yiwen Gong, Yutong Li, Anastasia Ngozi Ezemaduka, Shan Gao, Shunqi Liu, Guojun Li, Hao Li, Jing Yang, Junyu Ning, Bo Xian

Despite the growing interest in anti-aging drug development, high cost and low success rate pose a significant challenge. We present ElixirSeeker, a new machine-learning framework designed to help speed up the discovery of potential anti-aging compounds by utilizing the attention-driven fusion of molecular fingerprints. Our approach integrates molecular fingerprints generated by different algorithms and utilizes XGBoost to select optimal fingerprint lengths. Subsequently, we assign weights to the molecular fingerprints and employ Kernel Principal Component Analysis (KPCA) to reduce dimensionality, integrating different attention-driven methods. We trained the algorithm using DrugAge database. Our comprehensive analyses demonstrate that 64-bit Attention-ElixirFP maintains high predictive accuracy and F1 score while minimizing computational cost. Using ElixirSeeker to screen external compound databases, we identified a number of promising candidate anti-aging drugs. We tested top 6 hits and found that 4 of these compounds extend the lifespan of Caenorhabditis elegans, including Polyphyllin Ⅵ, Medrysone, Thymoquinone and Medrysone. This study illustrates that attention-driven fusion of fingerprints maximizes the learning of molecular activity features, providing a novel approach for high-throughput machine learning discovery of anti-aging molecules.

尽管人们对抗衰老药物开发的兴趣与日俱增，但高成本和低成功率构成了巨大挑战。我们介绍了 ElixirSeeker，这是一个新的机器学习框架，旨在利用注意力驱动的分子指纹融合，帮助加快潜在抗衰老化合物的发现。我们的方法整合了由不同算法生成的分子指纹，并利用 XGBoost 选择最佳指纹长度。随后，我们为分子指纹分配权重，并采用核主成分分析法（KPCA）来降低维度，从而整合不同的注意力驱动方法。我们使用 DrugAge 数据库对算法进行了训练。我们的综合分析表明，64 位 Attention-ElixirFP 保持了较高的预测准确率和 F1 分数，同时将计算成本降至最低。利用 ElixirSeeker 筛选外部化合物数据库，我们发现了许多有前景的候选抗衰老药物。我们测试了排名前 6 位的化合物，发现其中 4 种化合物能延长秀丽隐杆线虫的寿命，包括 Polyphyllin Ⅵ、Medrysone、Thymoquinone 和 Medrysone。这项研究表明，注意力驱动的指纹融合能最大限度地学习分子活性特征，为高通量机器学习发现抗衰老分子提供了一种新方法。

{"title":"ElixirSeeker: A Machine Learning Framework Utilizing Attention-Driven Fusion of Molecular Fingerprints for the Discovery of Anti-Aging Compounds","authors":"Yan Pan, Hongxia Cai, Fang Ye, Wentao Xu, Zhihang Huang, Jingyuan Zhu, Yiwen Gong, Yutong Li, Anastasia Ngozi Ezemaduka, Shan Gao, Shunqi Liu, Guojun Li, Hao Li, Jing Yang, Junyu Ning, Bo Xian","doi":"10.1101/2024.09.08.611839","DOIUrl":"https://doi.org/10.1101/2024.09.08.611839","url":null,"abstract":"Despite the growing interest in anti-aging drug development, high cost and low success rate pose a significant challenge. We present ElixirSeeker, a new machine-learning framework designed to help speed up the discovery of potential anti-aging compounds by utilizing the attention-driven fusion of molecular fingerprints. Our approach integrates molecular fingerprints generated by different algorithms and utilizes XGBoost to select optimal fingerprint lengths. Subsequently, we assign weights to the molecular fingerprints and employ Kernel Principal Component Analysis (KPCA) to reduce dimensionality, integrating different attention-driven methods. We trained the algorithm using DrugAge database. Our comprehensive analyses demonstrate that 64-bit Attention-ElixirFP maintains high predictive accuracy and F1 score while minimizing computational cost. Using ElixirSeeker to screen external compound databases, we identified a number of promising candidate anti-aging drugs. We tested top 6 hits and found that 4 of these compounds extend the lifespan of Caenorhabditis elegans, including Polyphyllin Ⅵ, Medrysone, Thymoquinone and Medrysone. This study illustrates that attention-driven fusion of fingerprints maximizes the learning of molecular activity features, providing a novel approach for high-throughput machine learning discovery of anti-aging molecules.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A scalable approach to absolute quantitation in metabolomics 代谢组学绝对定量的可扩展方法

bioRxiv - Systems Biology

Pub Date : 2024-09-13 DOI: 10.1101/2024.09.09.609906

Luke S Ferro, Alan Y. L. Wong, Jack Howland, Ana S. H. Costa, Jefferson G. Pruyne, Devesh Shah, Joshua D. Lauterbach, Steven B. Hooper, Mimoun Cadosch Delmar, Jack Geremia, Timothy Kassis, Naama Kanarek, Jennifer M. Campbell

Mass spectrometry-based metabolomics allows for the quantitation of metabolite levels in diverse biological samples. The traditional method of converting peak areas to absolute concentrations involves the use of matched heavy isotopologues. However, this approach is laborious and limited to a small number of metabolites. We addressed these limitations by developing PyxisTM, a machine learning-based technology which converts raw mass spectrometry data to absolute concentration measurements without the need for per-analyte standards. Here, we demonstrate Pyxis performance by quantifying metabolome concentration dynamics in murine blood plasma. Pyxis performed equivalently to traditional quantitation workflows used by research institutions, with a fraction of the time needed for analysis. We show that absolute quantitation by Pyxis can be expanded to include concentrations for additional metabolites, without the need to acquire new data. Furthermore, Pyxis allows for absolute quantitation as part of an untargeted metabolomics workflow. By removing the bottleneck of per-analyte standards, Pyxis allows for absolute quantitation in metabolomics that is scalable to large numbers of metabolites. The ability of Pyxis to make concentration-based measurements across the metabolome has the potential to deepen our understanding of diverse metabolic perturbations.

基于质谱的代谢组学可对各种生物样本中的代谢物水平进行定量分析。将峰面积转换为绝对浓度的传统方法包括使用匹配的重同位素。然而，这种方法费时费力，而且仅限于少量代谢物。我们通过开发 PyxisTM 解决了这些局限性，这是一种基于机器学习的技术，可将原始质谱数据转换为绝对浓度测量值，而无需每种分析物的标准。在这里，我们通过量化小鼠血浆中代谢组的浓度动态来展示 Pyxis 的性能。Pyxis 的性能与研究机构使用的传统定量工作流程相当，而分析所需的时间只是其一小部分。我们的研究表明，Pyxis 的绝对定量分析可以扩展到其他代谢物的浓度，而无需获取新的数据。此外，Pyxis 还能将绝对定量作为非靶向代谢组学工作流程的一部分。通过消除每个分析物标准的瓶颈，Pyxis 可以在代谢组学中进行绝对定量，并可扩展到大量代谢物。Pyxis 能够对整个代谢组进行基于浓度的测量，有望加深我们对各种代谢扰动的理解。

{"title":"A scalable approach to absolute quantitation in metabolomics","authors":"Luke S Ferro, Alan Y. L. Wong, Jack Howland, Ana S. H. Costa, Jefferson G. Pruyne, Devesh Shah, Joshua D. Lauterbach, Steven B. Hooper, Mimoun Cadosch Delmar, Jack Geremia, Timothy Kassis, Naama Kanarek, Jennifer M. Campbell","doi":"10.1101/2024.09.09.609906","DOIUrl":"https://doi.org/10.1101/2024.09.09.609906","url":null,"abstract":"Mass spectrometry-based metabolomics allows for the quantitation of metabolite levels in diverse biological samples. The traditional method of converting peak areas to absolute concentrations involves the use of matched heavy isotopologues. However, this approach is laborious and limited to a small number of metabolites. We addressed these limitations by developing PyxisTM, a machine learning-based technology which converts raw mass spectrometry data to absolute concentration measurements without the need for per-analyte standards. Here, we demonstrate Pyxis performance by quantifying metabolome concentration dynamics in murine blood plasma. Pyxis performed equivalently to traditional quantitation workflows used by research institutions, with a fraction of the time needed for analysis. We show that absolute quantitation by Pyxis can be expanded to include concentrations for additional metabolites, without the need to acquire new data. Furthermore, Pyxis allows for absolute quantitation as part of an untargeted metabolomics workflow. By removing the bottleneck of per-analyte standards, Pyxis allows for absolute quantitation in metabolomics that is scalable to large numbers of metabolites. The ability of Pyxis to make concentration-based measurements across the metabolome has the potential to deepen our understanding of diverse metabolic perturbations.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"43 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Leveraging large language models for metabolic engineering design 利用大型语言模型进行代谢工程设计

bioRxiv - Systems Biology

Pub Date : 2024-09-13 DOI: 10.1101/2024.09.09.612023

Xiongwen Li, Zhu Liang, Zhetao Guo, Ziyi Liu, Ke Wu, Jiahao Luo, Yuesheng Zhang, Lizheng Liu, Manda Sun, Yuanyuan Huang, Hongting Tang, Yu Chen, Tao Yu, Jens Nielsen, Feiran Li

Establishing efficient cell factories involves a continuous process of trial and error due to the intricate nature of metabolism. This complexity makes predicting effective engineering targets a challenging task. Therefore, it is vital to learn from the accumulated successes of previous designs for advancing future cell factory development. In this study, we developed a method based on large language models (LLMs) to extract metabolic engineering strategies from research articles on a large scale. We created a database containing over 29006 metabolic engineering entries, 1210 products and 751 organisms. Using this extracted data, we trained a hybrid model combining deep learning and mechanistic approaches to predict engineering targets. Our model outperformed traditional metabolic engineering target prediction algorithms, excelled in predicting the effects of gene modifications, and generalized well to out-of-distribution products and multiple gene combinations. Our study provides a valuable dataset, a chatbot, and an engineering target prediction model for the metabolic engineering field and exemplifies an efficient method for leveraging existing knowledge for future predictions.

由于新陈代谢的复杂性，建立高效的细胞工厂需要不断地尝试和犯错。这种复杂性使得预测有效的工程目标成为一项具有挑战性的任务。因此，汲取以往设计的成功经验对于推动未来细胞工厂的发展至关重要。在本研究中，我们开发了一种基于大型语言模型（LLM）的方法，从研究文章中大规模提取代谢工程策略。我们创建了一个数据库，其中包含超过 29006 个代谢工程条目、1210 种产品和 751 种生物。利用这些提取的数据，我们训练了一个结合深度学习和机理方法的混合模型来预测工程目标。我们的模型优于传统的代谢工程靶标预测算法，在预测基因修饰的影响方面表现出色，并能很好地泛化到分布外产品和多基因组合。我们的研究为代谢工程领域提供了一个宝贵的数据集、一个聊天机器人和一个工程目标预测模型，并示范了一种利用现有知识进行未来预测的高效方法。

{"title":"Leveraging large language models for metabolic engineering design","authors":"Xiongwen Li, Zhu Liang, Zhetao Guo, Ziyi Liu, Ke Wu, Jiahao Luo, Yuesheng Zhang, Lizheng Liu, Manda Sun, Yuanyuan Huang, Hongting Tang, Yu Chen, Tao Yu, Jens Nielsen, Feiran Li","doi":"10.1101/2024.09.09.612023","DOIUrl":"https://doi.org/10.1101/2024.09.09.612023","url":null,"abstract":"Establishing efficient cell factories involves a continuous process of trial and error due to the intricate nature of metabolism. This complexity makes predicting effective engineering targets a challenging task. Therefore, it is vital to learn from the accumulated successes of previous designs for advancing future cell factory development. In this study, we developed a method based on large language models (LLMs) to extract metabolic engineering strategies from research articles on a large scale. We created a database containing over 29006 metabolic engineering entries, 1210 products and 751 organisms. Using this extracted data, we trained a hybrid model combining deep learning and mechanistic approaches to predict engineering targets. Our model outperformed traditional metabolic engineering target prediction algorithms, excelled in predicting the effects of gene modifications, and generalized well to out-of-distribution products and multiple gene combinations. Our study provides a valuable dataset, a chatbot, and an engineering target prediction model for the metabolic engineering field and exemplifies an efficient method for leveraging existing knowledge for future predictions.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"17 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0