{"title":"Exploring the spatial effects influencing the EGFR/ERK pathway dynamics with machine learning surrogate models","authors":"Juan A. Garcia, Anass Bouchnita","doi":"10.1016/j.biosystems.2024.105360","DOIUrl":null,"url":null,"abstract":"<div><div>The fate of cells is regulated by biochemical reactions taking place inside of them, known as intracellular pathways. Cells display a variety of characteristics related to their shape, structure and contained fluid, which influences the diffusion of proteins and their interactions. To gain insights into the spatial effects shaping intracellular regulation, we apply machine learning (ML) to explore a previously developed spatial model of the epidermal growth factor receptor (EGFR) signaling. The model describes the reactions between molecular species inside of cells following the transient activation of EGF receptors. To train our ML models, we conduct 10,000 numerical simulations in parallel where we calculate the cumulative activation of molecules and transcription factors under various conditions such as different diffusion speeds, inactivation rates, and cell structures. We take advantage of the low computational cost of ML algorithms to investigate the effects of cell and nucleus sizes, the diffusion speed of proteins, and the inactivation rate of the Ras molecules on the activation strength of transcription factors. Our results suggest that the predictions by both neural networks and random forests yielded minimal mean square error (MSEs), while linear generalized models displayed a significantly larger MSE. The exploration of the surrogate models has shown that smaller cell and nucleus radii as well, larger diffusion coefficients, and reduced inactivation rates increase the activation of transcription factors. These results are confirmed by numerical simulations. Our ML algorithms can be readily incorporated within multiscale models of tumor growth to embed the spatial effects regulating intracellular pathways, enabling the use of complex cell models within multiscale models while reducing the computational cost.</div></div>","PeriodicalId":50730,"journal":{"name":"Biosystems","volume":"247 ","pages":"Article 105360"},"PeriodicalIF":2.0000,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biosystems","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0303264724002454","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The fate of cells is regulated by biochemical reactions taking place inside of them, known as intracellular pathways. Cells display a variety of characteristics related to their shape, structure and contained fluid, which influences the diffusion of proteins and their interactions. To gain insights into the spatial effects shaping intracellular regulation, we apply machine learning (ML) to explore a previously developed spatial model of the epidermal growth factor receptor (EGFR) signaling. The model describes the reactions between molecular species inside of cells following the transient activation of EGF receptors. To train our ML models, we conduct 10,000 numerical simulations in parallel where we calculate the cumulative activation of molecules and transcription factors under various conditions such as different diffusion speeds, inactivation rates, and cell structures. We take advantage of the low computational cost of ML algorithms to investigate the effects of cell and nucleus sizes, the diffusion speed of proteins, and the inactivation rate of the Ras molecules on the activation strength of transcription factors. Our results suggest that the predictions by both neural networks and random forests yielded minimal mean square error (MSEs), while linear generalized models displayed a significantly larger MSE. The exploration of the surrogate models has shown that smaller cell and nucleus radii as well, larger diffusion coefficients, and reduced inactivation rates increase the activation of transcription factors. These results are confirmed by numerical simulations. Our ML algorithms can be readily incorporated within multiscale models of tumor growth to embed the spatial effects regulating intracellular pathways, enabling the use of complex cell models within multiscale models while reducing the computational cost.
细胞的命运受其内部发生的生化反应(即细胞内途径)调控。细胞显示出与其形状、结构和所含液体有关的各种特征,这些特征影响着蛋白质的扩散及其相互作用。为了深入了解影响细胞内调控的空间效应,我们应用机器学习(ML)来探索之前开发的表皮生长因子受体(EGFR)信号传导空间模型。该模型描述了表皮生长因子受体瞬时激活后细胞内分子物种之间的反应。为了训练我们的 ML 模型,我们并行进行了 10,000 次数值模拟,计算在不同扩散速度、失活率和细胞结构等条件下分子和转录因子的累积激活。我们利用 ML 算法计算成本低的优势,研究了细胞和细胞核大小、蛋白质扩散速度和 Ras 分子失活率对转录因子激活强度的影响。我们的研究结果表明,神经网络和随机森林的预测均方误差(MSE)最小,而线性广义模型的 MSE 明显较大。对代用模型的探索表明,较小的细胞和细胞核半径、较大的扩散系数和较低的失活率都会增加转录因子的活化。数值模拟证实了这些结果。我们的 ML 算法可以很容易地融入肿瘤生长的多尺度模型中,以嵌入调节细胞内通路的空间效应,从而在多尺度模型中使用复杂的细胞模型,同时降低计算成本。
期刊介绍:
BioSystems encourages experimental, computational, and theoretical articles that link biology, evolutionary thinking, and the information processing sciences. The link areas form a circle that encompasses the fundamental nature of biological information processing, computational modeling of complex biological systems, evolutionary models of computation, the application of biological principles to the design of novel computing systems, and the use of biomolecular materials to synthesize artificial systems that capture essential principles of natural biological information processing.