首页 > 最新文献

EURASIP journal on bioinformatics & systems biology最新文献

英文 中文
From microscopy data to in silico environments for in vivo-oriented simulations. 从显微镜数据到体内定向模拟的计算机环境。
Pub Date : 2012-06-26 DOI: 10.1186/1687-4153-2012-7
Noriko Hiroi1, Michael Klann, Keisuke Iba, Pablo de Heras Ciechomski, Shuji Yamashita, Akito Tabira, Takahiro Okuhara, Takeshi Kubojima, Yasunori Okada, Kotaro Oka, Robin Mange, Michael Unger, Akira Funahashi, Heinz Koeppl

Abstract: : In our previous study, we introduced a combination methodology of Fluorescence Correlation Spectroscopy (FCS) and Transmission Electron Microscopy (TEM), which is powerful to investigate the effect of intracellular environment to biochemical reaction processes. Now, we developed a reconstruction method of realistic simulation spaces based on our TEM images. Interactive raytracing visualization of this space allows the perception of the overall 3D structure, which is not directly accessible from 2D TEM images. Simulation results show that the diffusion in such generated structures strongly depends on image post-processing. Frayed structures corresponding to noisy images hinder the diffusion much stronger than smooth surfaces from denoised images. This means that the correct identification of noise or structure is significant to reconstruct appropriate reaction environment in silico in order to estimate realistic behaviors of reactants in vivo. Static structures lead to anomalous diffusion due to the partial confinement. In contrast, mobile crowding agents do not lead to anomalous diffusion at moderate crowding levels. By varying the mobility of these non-reactive obstacles (NRO), we estimated the relationship between NRO diffusion coefficient (Dnro) and the anomaly in the tracer diffusion (α). For Dnro=21.96 to 44.49 μm2/s, the simulation results match the anomaly obtained from FCS measurements. This range of the diffusion coefficient from simulations is compatible with the range of the diffusion coefficient of structural proteins in the cytoplasm. In addition, we investigated the relationship between the radius of NRO and anomalous diffusion coefficient of tracers by the comparison between different simulations. The radius of NRO has to be 58 nm when the polymer moves with the same diffusion speed as a reactant, which is close to the radius of functional protein complexes in a cell.

摘要:在之前的研究中,我们介绍了一种荧光相关光谱(FCS)和透射电子显微镜(TEM)相结合的方法,该方法可以有效地研究细胞内环境对生化反应过程的影响。现在,我们开发了一种基于透射电镜图像的真实模拟空间重建方法。该空间的交互式光线追踪可视化允许感知整体3D结构,这是无法从2D TEM图像直接访问的。仿真结果表明,这种生成结构的扩散很大程度上依赖于图像后处理。噪声图像对应的磨损结构比去噪图像对应的光滑表面对扩散的阻碍要大得多。这意味着正确识别噪声或结构对于重建适当的硅反应环境,以估计反应物在体内的真实行为具有重要意义。静态结构由于局部约束导致了异常扩散。相反,在中等拥挤水平下,移动拥挤因子不会导致异常扩散。通过改变这些非反应性障碍物(NRO)的迁移率,我们估计了NRO扩散系数(Dnro)与示踪剂扩散异常(α)之间的关系。在Dnro=21.96 ~ 44.49 μm2/s范围内,模拟结果与FCS测量结果吻合。模拟得到的扩散系数范围与细胞质中结构蛋白的扩散系数范围是一致的。此外,通过不同模拟结果的对比,探讨了NRO半径与示踪剂异常扩散系数之间的关系。当聚合物以与反应物相同的扩散速度移动时,NRO半径必须为58 nm,这与细胞中功能蛋白复合物的半径接近。
{"title":"From microscopy data to in silico environments for in vivo-oriented simulations.","authors":"Noriko Hiroi1,&nbsp;Michael Klann,&nbsp;Keisuke Iba,&nbsp;Pablo de Heras Ciechomski,&nbsp;Shuji Yamashita,&nbsp;Akito Tabira,&nbsp;Takahiro Okuhara,&nbsp;Takeshi Kubojima,&nbsp;Yasunori Okada,&nbsp;Kotaro Oka,&nbsp;Robin Mange,&nbsp;Michael Unger,&nbsp;Akira Funahashi,&nbsp;Heinz Koeppl","doi":"10.1186/1687-4153-2012-7","DOIUrl":"https://doi.org/10.1186/1687-4153-2012-7","url":null,"abstract":"<p><strong>Abstract: </strong>: In our previous study, we introduced a combination methodology of Fluorescence Correlation Spectroscopy (FCS) and Transmission Electron Microscopy (TEM), which is powerful to investigate the effect of intracellular environment to biochemical reaction processes. Now, we developed a reconstruction method of realistic simulation spaces based on our TEM images. Interactive raytracing visualization of this space allows the perception of the overall 3D structure, which is not directly accessible from 2D TEM images. Simulation results show that the diffusion in such generated structures strongly depends on image post-processing. Frayed structures corresponding to noisy images hinder the diffusion much stronger than smooth surfaces from denoised images. This means that the correct identification of noise or structure is significant to reconstruct appropriate reaction environment in silico in order to estimate realistic behaviors of reactants in vivo. Static structures lead to anomalous diffusion due to the partial confinement. In contrast, mobile crowding agents do not lead to anomalous diffusion at moderate crowding levels. By varying the mobility of these non-reactive obstacles (NRO), we estimated the relationship between NRO diffusion coefficient (Dnro) and the anomaly in the tracer diffusion (α). For Dnro=21.96 to 44.49 μm2/s, the simulation results match the anomaly obtained from FCS measurements. This range of the diffusion coefficient from simulations is compatible with the range of the diffusion coefficient of structural proteins in the cytoplasm. In addition, we investigated the relationship between the radius of NRO and anomalous diffusion coefficient of tracers by the comparison between different simulations. The radius of NRO has to be 58 nm when the polymer moves with the same diffusion speed as a reactant, which is close to the radius of functional protein complexes in a cell.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":"2012 ","pages":"7"},"PeriodicalIF":0.0,"publicationDate":"2012-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1687-4153-2012-7","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30719475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Phase computations and phase models for discrete molecular oscillators. 离散分子振荡器的相位计算和相位模型。
Pub Date : 2012-06-11 DOI: 10.1186/1687-4153-2012-6
Onder Suvak, Alper Demir

Background: Biochemical oscillators perform crucial functions in cells, e.g., they set up circadian clocks. The dynamical behavior of oscillators is best described and analyzed in terms of the scalar quantity, phase. A rigorous and useful definition for phase is based on the so-called isochrons of oscillators. Phase computation techniques for continuous oscillators that are based on isochrons have been used for characterizing the behavior of various types of oscillators under the influence of perturbations such as noise.

Results: In this article, we extend the applicability of these phase computation methods to biochemical oscillators as discrete molecular systems, upon the information obtained from a continuous-state approximation of such oscillators. In particular, we describe techniques for computing the instantaneous phase of discrete, molecular oscillators for stochastic simulation algorithm generated sample paths. We comment on the accuracies and derive certain measures for assessing the feasibilities of the proposed phase computation methods. Phase computation experiments on the sample paths of well-known biological oscillators validate our analyses.

Conclusions: The impact of noise that arises from the discrete and random nature of the mechanisms that make up molecular oscillators can be characterized based on the phase computation techniques proposed in this article. The concept of isochrons is the natural choice upon which the phase notion of oscillators can be founded. The isochron-theoretic phase computation methods that we propose can be applied to discrete molecular oscillators of any dimension, provided that the oscillatory behavior observed in discrete-state does not vanish in a continuous-state approximation. Analysis of the full versatility of phase noise phenomena in molecular oscillators will be possible if a proper phase model theory is developed, without resorting to such approximations.

背景:生物化学振荡器在细胞中起着至关重要的作用,例如,它们设置生物钟。振子的动力学行为最好用标量相位来描述和分析。相位的一个严格而有用的定义是基于所谓的振子等时线。基于等时线的连续振子相位计算技术已被用于表征各种类型的振子在扰动(如噪声)影响下的行为。结果:在本文中,我们将这些相位计算方法的适用性扩展到作为离散分子系统的生化振荡器,基于从这些振荡器的连续状态近似获得的信息。特别是,我们描述了计算离散分子振荡器的瞬时相位的技术,用于随机模拟算法生成的样本路径。我们对所提出的相位计算方法的准确性进行了评价,并提出了评估其可行性的一些措施。在已知生物振荡器样本路径上的相位计算实验验证了我们的分析。结论:基于本文提出的相位计算技术,可以表征由构成分子振荡器的机制的离散和随机性质引起的噪声的影响。等时线的概念是建立振子相位概念的自然选择。我们提出的等时理论相位计算方法可以应用于任何维度的离散分子振子,只要在离散状态下观察到的振荡行为不会在连续状态近似中消失。如果开发出一种适当的相位模型理论,而不使用这种近似,就有可能分析分子振荡器中相位噪声现象的全部通用性。
{"title":"Phase computations and phase models for discrete molecular oscillators.","authors":"Onder Suvak,&nbsp;Alper Demir","doi":"10.1186/1687-4153-2012-6","DOIUrl":"https://doi.org/10.1186/1687-4153-2012-6","url":null,"abstract":"<p><strong>Background: </strong>Biochemical oscillators perform crucial functions in cells, e.g., they set up circadian clocks. The dynamical behavior of oscillators is best described and analyzed in terms of the scalar quantity, phase. A rigorous and useful definition for phase is based on the so-called isochrons of oscillators. Phase computation techniques for continuous oscillators that are based on isochrons have been used for characterizing the behavior of various types of oscillators under the influence of perturbations such as noise.</p><p><strong>Results: </strong>In this article, we extend the applicability of these phase computation methods to biochemical oscillators as discrete molecular systems, upon the information obtained from a continuous-state approximation of such oscillators. In particular, we describe techniques for computing the instantaneous phase of discrete, molecular oscillators for stochastic simulation algorithm generated sample paths. We comment on the accuracies and derive certain measures for assessing the feasibilities of the proposed phase computation methods. Phase computation experiments on the sample paths of well-known biological oscillators validate our analyses.</p><p><strong>Conclusions: </strong>The impact of noise that arises from the discrete and random nature of the mechanisms that make up molecular oscillators can be characterized based on the phase computation techniques proposed in this article. The concept of isochrons is the natural choice upon which the phase notion of oscillators can be founded. The isochron-theoretic phase computation methods that we propose can be applied to discrete molecular oscillators of any dimension, provided that the oscillatory behavior observed in discrete-state does not vanish in a continuous-state approximation. Analysis of the full versatility of phase noise phenomena in molecular oscillators will be possible if a proper phase model theory is developed, without resorting to such approximations.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":"2012 1","pages":"6"},"PeriodicalIF":0.0,"publicationDate":"2012-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1687-4153-2012-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30681747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Modeling stochasticity and variability in gene regulatory networks. 基因调控网络的随机性和可变性建模。
Pub Date : 2012-06-06 DOI: 10.1186/1687-4153-2012-5
David Murrugarra, Alan Veliz-Cuba, Boris Aguilar, Seda Arat, Reinhard Laubenbacher

Modeling stochasticity in gene regulatory networks is an important and complex problem in molecular systems biology. To elucidate intrinsic noise, several modeling strategies such as the Gillespie algorithm have been used successfully. This article contributes an approach as an alternative to these classical settings. Within the discrete paradigm, where genes, proteins, and other molecular components of gene regulatory networks are modeled as discrete variables and are assigned as logical rules describing their regulation through interactions with other components. Stochasticity is modeled at the biological function level under the assumption that even if the expression levels of the input nodes of an update rule guarantee activation or degradation there is a probability that the process will not occur due to stochastic effects. This approach allows a finer analysis of discrete models and provides a natural setup for cell population simulations to study cell-to-cell variability. We applied our methods to two of the most studied regulatory networks, the outcome of lambda phage infection of bacteria and the p53-mdm2 complex.

基因调控网络的随机性建模是分子系统生物学中一个重要而复杂的问题。为了阐明固有噪声,一些建模策略如Gillespie算法已被成功地应用。本文提供了一种替代这些经典设置的方法。在离散范式中,基因、蛋白质和基因调控网络的其他分子成分被建模为离散变量,并被分配为逻辑规则,通过与其他成分的相互作用来描述它们的调控。假设即使更新规则的输入节点的表达水平保证激活或退化,在生物功能水平上对随机性进行建模,但由于随机效应,该过程可能不会发生。这种方法允许对离散模型进行更精细的分析,并为细胞群模拟提供自然设置,以研究细胞间的可变性。我们将我们的方法应用于两个研究最多的调控网络,细菌的lambda噬菌体感染的结果和p53-mdm2复合物。
{"title":"Modeling stochasticity and variability in gene regulatory networks.","authors":"David Murrugarra,&nbsp;Alan Veliz-Cuba,&nbsp;Boris Aguilar,&nbsp;Seda Arat,&nbsp;Reinhard Laubenbacher","doi":"10.1186/1687-4153-2012-5","DOIUrl":"https://doi.org/10.1186/1687-4153-2012-5","url":null,"abstract":"<p><p> Modeling stochasticity in gene regulatory networks is an important and complex problem in molecular systems biology. To elucidate intrinsic noise, several modeling strategies such as the Gillespie algorithm have been used successfully. This article contributes an approach as an alternative to these classical settings. Within the discrete paradigm, where genes, proteins, and other molecular components of gene regulatory networks are modeled as discrete variables and are assigned as logical rules describing their regulation through interactions with other components. Stochasticity is modeled at the biological function level under the assumption that even if the expression levels of the input nodes of an update rule guarantee activation or degradation there is a probability that the process will not occur due to stochastic effects. This approach allows a finer analysis of discrete models and provides a natural setup for cell population simulations to study cell-to-cell variability. We applied our methods to two of the most studied regulatory networks, the outcome of lambda phage infection of bacteria and the p53-mdm2 complex.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":"2012 1","pages":"5"},"PeriodicalIF":0.0,"publicationDate":"2012-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1687-4153-2012-5","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30670850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 82
A visual analytics approach for models of heterogeneous cell populations. 异质细胞群模型的可视化分析方法。
Pub Date : 2012-05-31 DOI: 10.1186/1687-4153-2012-4
Jan Hasenauer, Julian Heinrich, Malgorzata Doszczak, Peter Scheurich, Daniel Weiskopf, Frank Allgöwer

In recent years, cell population models have become increasingly common. In contrast to classic single cell models, population models allow for the study of cell-to-cell variability, a crucial phenomenon in most populations of primary cells, cancer cells, and stem cells. Unfortunately, tools for in-depth analysis of population models are still missing. This problem originates from the complexity of population models. Particularly important are methods to determine the source of heterogeneity (e.g., genetics or epigenetic differences) and to select potential (bio-)markers. We propose an analysis based on visual analytics to tackle this problem. Our approach combines parallel-coordinates plots, used for a visual assessment of the high-dimensional dependencies, and nonlinear support vector machines, for the quantification of effects. The method can be employed to study qualitative and quantitative differences among cells. To illustrate the different components, we perform a case study using the proapoptotic signal transduction pathway involved in cellular apoptosis.

近年来,细胞群体模型已经变得越来越普遍。与经典的单细胞模型相比,群体模型允许研究细胞间的变异性,这是大多数原代细胞、癌细胞和干细胞群体中的关键现象。不幸的是,对人口模型进行深入分析的工具仍然缺失。这个问题源于人口模型的复杂性。特别重要的是确定异质性来源(例如,遗传学或表观遗传学差异)和选择潜在(生物)标记的方法。我们提出了一种基于视觉分析的分析方法来解决这个问题。我们的方法结合了平行坐标图,用于高维依赖性的视觉评估,以及非线性支持向量机,用于效果的量化。该方法可用于研究细胞间的定性和定量差异。为了说明不同的组成部分,我们进行了一个案例研究,使用促凋亡信号转导途径参与细胞凋亡。
{"title":"A visual analytics approach for models of heterogeneous cell populations.","authors":"Jan Hasenauer,&nbsp;Julian Heinrich,&nbsp;Malgorzata Doszczak,&nbsp;Peter Scheurich,&nbsp;Daniel Weiskopf,&nbsp;Frank Allgöwer","doi":"10.1186/1687-4153-2012-4","DOIUrl":"https://doi.org/10.1186/1687-4153-2012-4","url":null,"abstract":"<p><p> In recent years, cell population models have become increasingly common. In contrast to classic single cell models, population models allow for the study of cell-to-cell variability, a crucial phenomenon in most populations of primary cells, cancer cells, and stem cells. Unfortunately, tools for in-depth analysis of population models are still missing. This problem originates from the complexity of population models. Particularly important are methods to determine the source of heterogeneity (e.g., genetics or epigenetic differences) and to select potential (bio-)markers. We propose an analysis based on visual analytics to tackle this problem. Our approach combines parallel-coordinates plots, used for a visual assessment of the high-dimensional dependencies, and nonlinear support vector machines, for the quantification of effects. The method can be employed to study qualitative and quantitative differences among cells. To illustrate the different components, we perform a case study using the proapoptotic signal transduction pathway involved in cellular apoptosis.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":"2012 1","pages":"4"},"PeriodicalIF":0.0,"publicationDate":"2012-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1687-4153-2012-4","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30657447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
A novel cost function to estimate parameters of oscillatory biochemical systems. 一种新的估计振荡生化系统参数的代价函数。
Pub Date : 2012-05-16 DOI: 10.1186/1687-4153-2012-3
Seyedbehzad Nabavi, Cranos M Williams

Oscillatory pathways are among the most important classes of biochemical systems with examples ranging from circadian rhythms and cell cycle maintenance. Mathematical modeling of these highly interconnected biochemical networks is needed to meet numerous objectives such as investigating, predicting and controlling the dynamics of these systems. Identifying the kinetic rate parameters is essential for fully modeling these and other biological processes. These kinetic parameters, however, are not usually available from measurements and most of them have to be estimated by parameter fitting techniques. One of the issues with estimating kinetic parameters in oscillatory systems is the irregularities in the least square (LS) cost function surface used to estimate these parameters, which is caused by the periodicity of the measurements. These irregularities result in numerous local minima, which limit the performance of even some of the most robust global optimization algorithms. We proposed a parameter estimation framework to address these issues that integrates temporal information with periodic information embedded in the measurements used to estimate these parameters. This periodic information is used to build a proposed cost function with better surface properties leading to fewer local minima and better performance of global optimization algorithms. We verified for three oscillatory biochemical systems that our proposed cost function results in an increased ability to estimate accurate kinetic parameters as compared to the traditional LS cost function. We combine this cost function with an improved noise removal approach that leverages periodic characteristics embedded in the measurements to effectively reduce noise. The results provide strong evidence on the efficacy of this noise removal approach over the previous commonly used wavelet hard-thresholding noise removal methods. This proposed optimization framework results in more accurate kinetic parameters that will eventually lead to biochemical models that are more precise, predictable, and controllable.

振荡通路是生物化学系统中最重要的一类,其例子包括昼夜节律和细胞周期维持。这些高度互联的生化网络的数学建模需要满足许多目标,如调查,预测和控制这些系统的动态。确定动力学速率参数对于充分模拟这些和其他生物过程至关重要。然而,这些动力学参数通常不能从测量中得到,它们中的大多数必须通过参数拟合技术来估计。估计振荡系统动力学参数的问题之一是用于估计这些参数的最小二乘(LS)代价函数表面的不规则性,这是由测量的周期性引起的。这些不规则性导致了大量的局部最小值,从而限制了一些最健壮的全局优化算法的性能。我们提出了一个参数估计框架来解决这些问题,该框架将时间信息与嵌入在用于估计这些参数的测量中的周期性信息集成在一起。利用这些周期性信息构建具有更好表面特性的成本函数,从而减少局部极小值,提高全局优化算法的性能。我们对三个振荡生化系统进行了验证,与传统的LS成本函数相比,我们提出的成本函数可以提高估计准确动力学参数的能力。我们将此成本函数与改进的噪声去除方法相结合,该方法利用嵌入在测量中的周期性特征来有效地降低噪声。实验结果有力地证明了这种去噪方法比以往常用的小波硬阈值去噪方法的有效性。提出的优化框架将产生更精确的动力学参数,最终导致更精确、可预测和可控的生化模型。
{"title":"A novel cost function to estimate parameters of oscillatory biochemical systems.","authors":"Seyedbehzad Nabavi,&nbsp;Cranos M Williams","doi":"10.1186/1687-4153-2012-3","DOIUrl":"https://doi.org/10.1186/1687-4153-2012-3","url":null,"abstract":"<p><p> Oscillatory pathways are among the most important classes of biochemical systems with examples ranging from circadian rhythms and cell cycle maintenance. Mathematical modeling of these highly interconnected biochemical networks is needed to meet numerous objectives such as investigating, predicting and controlling the dynamics of these systems. Identifying the kinetic rate parameters is essential for fully modeling these and other biological processes. These kinetic parameters, however, are not usually available from measurements and most of them have to be estimated by parameter fitting techniques. One of the issues with estimating kinetic parameters in oscillatory systems is the irregularities in the least square (LS) cost function surface used to estimate these parameters, which is caused by the periodicity of the measurements. These irregularities result in numerous local minima, which limit the performance of even some of the most robust global optimization algorithms. We proposed a parameter estimation framework to address these issues that integrates temporal information with periodic information embedded in the measurements used to estimate these parameters. This periodic information is used to build a proposed cost function with better surface properties leading to fewer local minima and better performance of global optimization algorithms. We verified for three oscillatory biochemical systems that our proposed cost function results in an increased ability to estimate accurate kinetic parameters as compared to the traditional LS cost function. We combine this cost function with an improved noise removal approach that leverages periodic characteristics embedded in the measurements to effectively reduce noise. The results provide strong evidence on the efficacy of this noise removal approach over the previous commonly used wavelet hard-thresholding noise removal methods. This proposed optimization framework results in more accurate kinetic parameters that will eventually lead to biochemical models that are more precise, predictable, and controllable.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":"2012 1","pages":"3"},"PeriodicalIF":0.0,"publicationDate":"2012-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1687-4153-2012-3","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30619698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Determination of minimal transcriptional signatures of compounds for target prediction. 测定化合物的最小转录特征用于靶标预测。
Pub Date : 2012-05-10 DOI: 10.1186/1687-4153-2012-2
Florian Nigsch, Janna Hutz, Ben Cornett, Douglas W Selinger, Gregory McAllister, Somnath Bandyopadhyay, Joseph Loureiro, Jeremy L Jenkins

The identification of molecular target and mechanism of action of compounds is a key hurdle in drug discovery. Multiplexed techniques for bead-based expression profiling allow the measurement of transcriptional signatures of compound-treated cells in high-throughput mode. Such profiles can be used to gain insight into compounds' mode of action and the protein targets they are modulating. Through the proxy of target prediction from such gene signatures we explored important aspects of the use of transcriptional profiles to capture biological variability of perturbed cellular assays. We found that signatures derived from expression data and signatures derived from biological interaction networks performed equally well, and we showed that gene signatures can be optimised using a genetic algorithm. Gene signatures of approximately 128 genes seemed to be most generic, capturing a maximum of the perturbation inflicted on cells through compound treatment. Moreover, we found evidence for oxidative phosphorylation to be one of the most general ways to capture compound perturbation.

化合物的分子靶点和作用机制的确定是药物开发的关键环节。基于头部表达谱的多路复用技术允许在高通量模式下测量化合物处理细胞的转录特征。这样的轮廓可以用来深入了解化合物的作用模式和它们调节的蛋白质目标。通过这些基因特征的靶预测代理,我们探索了使用转录谱来捕获受干扰细胞测定的生物学变异性的重要方面。我们发现来自表达数据的签名和来自生物相互作用网络的签名表现同样好,并且我们表明基因签名可以使用遗传算法进行优化。大约128个基因的基因特征似乎是最通用的,捕获了通过复合处理对细胞施加的最大扰动。此外,我们发现氧化磷酸化是捕获化合物扰动的最一般方法之一的证据。
{"title":"Determination of minimal transcriptional signatures of compounds for target prediction.","authors":"Florian Nigsch,&nbsp;Janna Hutz,&nbsp;Ben Cornett,&nbsp;Douglas W Selinger,&nbsp;Gregory McAllister,&nbsp;Somnath Bandyopadhyay,&nbsp;Joseph Loureiro,&nbsp;Jeremy L Jenkins","doi":"10.1186/1687-4153-2012-2","DOIUrl":"https://doi.org/10.1186/1687-4153-2012-2","url":null,"abstract":"<p><p> The identification of molecular target and mechanism of action of compounds is a key hurdle in drug discovery. Multiplexed techniques for bead-based expression profiling allow the measurement of transcriptional signatures of compound-treated cells in high-throughput mode. Such profiles can be used to gain insight into compounds' mode of action and the protein targets they are modulating. Through the proxy of target prediction from such gene signatures we explored important aspects of the use of transcriptional profiles to capture biological variability of perturbed cellular assays. We found that signatures derived from expression data and signatures derived from biological interaction networks performed equally well, and we showed that gene signatures can be optimised using a genetic algorithm. Gene signatures of approximately 128 genes seemed to be most generic, capturing a maximum of the perturbation inflicted on cells through compound treatment. Moreover, we found evidence for oxidative phosphorylation to be one of the most general ways to capture compound perturbation.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":"2012 1","pages":"2"},"PeriodicalIF":0.0,"publicationDate":"2012-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1687-4153-2012-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30607587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A top-down approach to classify enzyme functional classes and sub-classes using random forest. 一种自上而下的酶功能类和亚类随机森林分类方法。
Pub Date : 2012-02-29 DOI: 10.1186/1687-4153-2012-1
Chetan Kumar, Alok Choudhary

Advancements in sequencing technologies have witnessed an exponential rise in the number of newly found enzymes. Enzymes are proteins that catalyze bio-chemical reactions and play an important role in metabolic pathways. Commonly, function of such enzymes is determined by experiments that can be time consuming and costly. Hence, a need for a computing method is felt that can distinguish protein enzyme sequences from those of non-enzymes and reliably predict the function of the former. To address this problem, approaches that cluster enzymes based on their sequence and structural similarity have been presented. But, these approaches are known to fail for proteins that perform the same function and are dissimilar in their sequence and structure. In this article, we present a supervised machine learning model to predict the function class and sub-class of enzymes based on a set of 73 sequence-derived features. The functional classes are as defined by International Union of Biochemistry and Molecular Biology. Using an efficient data mining algorithm called random forest, we construct a top-down three layer model where the top layer classifies a query protein sequence as an enzyme or non-enzyme, the second layer predicts the main function class and bottom layer further predicts the sub-function class. The model reported overall classification accuracy of 94.87% for the first level, 87.7% for the second, and 84.25% for the bottom level. Our results compare very well with existing methods, and in many cases report better performance. Using feature selection methods, we have shown the biological relevance of a few of the top rank attributes.

测序技术的进步见证了新发现酶数量的指数级增长。酶是催化生物化学反应的蛋白质,在代谢途径中起重要作用。通常,这些酶的功能是通过实验来确定的,这可能是耗时和昂贵的。因此,需要一种能够区分蛋白质酶序列和非酶序列并可靠地预测前者功能的计算方法。为了解决这个问题,已经提出了基于序列和结构相似性的聚类酶的方法。但是,已知这些方法对于执行相同功能且序列和结构不同的蛋白质是失败的。在本文中,我们提出了一个基于73个序列衍生特征集的监督机器学习模型来预测酶的功能类和亚类。功能类由国际生物化学与分子生物学联合会定义。采用一种高效的随机森林数据挖掘算法,构建了自顶向下的三层模型,其中顶层将查询蛋白序列分类为酶或非酶,第二层预测主功能类,底层进一步预测子功能类。该模型第一层次的总体分类准确率为94.87%,第二层次为87.7%,最后层次为84.25%。我们的结果与现有的方法比较非常好,并且在许多情况下报告了更好的性能。使用特征选择方法,我们展示了一些顶级属性的生物学相关性。
{"title":"A top-down approach to classify enzyme functional classes and sub-classes using random forest.","authors":"Chetan Kumar,&nbsp;Alok Choudhary","doi":"10.1186/1687-4153-2012-1","DOIUrl":"https://doi.org/10.1186/1687-4153-2012-1","url":null,"abstract":"<p><p> Advancements in sequencing technologies have witnessed an exponential rise in the number of newly found enzymes. Enzymes are proteins that catalyze bio-chemical reactions and play an important role in metabolic pathways. Commonly, function of such enzymes is determined by experiments that can be time consuming and costly. Hence, a need for a computing method is felt that can distinguish protein enzyme sequences from those of non-enzymes and reliably predict the function of the former. To address this problem, approaches that cluster enzymes based on their sequence and structural similarity have been presented. But, these approaches are known to fail for proteins that perform the same function and are dissimilar in their sequence and structure. In this article, we present a supervised machine learning model to predict the function class and sub-class of enzymes based on a set of 73 sequence-derived features. The functional classes are as defined by International Union of Biochemistry and Molecular Biology. Using an efficient data mining algorithm called random forest, we construct a top-down three layer model where the top layer classifies a query protein sequence as an enzyme or non-enzyme, the second layer predicts the main function class and bottom layer further predicts the sub-function class. The model reported overall classification accuracy of 94.87% for the first level, 87.7% for the second, and 84.25% for the bottom level. Our results compare very well with existing methods, and in many cases report better performance. Using feature selection methods, we have shown the biological relevance of a few of the top rank attributes.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":"2012 1","pages":"1"},"PeriodicalIF":0.0,"publicationDate":"2012-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1687-4153-2012-1","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30494874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 47
Detecting controlling nodes of boolean regulatory networks. 布尔调节网络控制节点检测。
Pub Date : 2011-10-11 DOI: 10.1186/1687-4153-2011-6
Steffen Schober, David Kracht, Reinhard Heckel, Martin Bossert

Boolean models of regulatory networks are assumed to be tolerant to perturbations. That qualitatively implies that each function can only depend on a few nodes. Biologically motivated constraints further show that functions found in Boolean regulatory networks belong to certain classes of functions, for example, the unate functions. It turns out that these classes have specific properties in the Fourier domain. That motivates us to study the problem of detecting controlling nodes in classes of Boolean networks using spectral techniques. We consider networks with unbalanced functions and functions of an average sensitivity less than 23k, where k is the number of controlling variables for a function. Further, we consider the class of 1-low networks which include unate networks, linear threshold networks, and networks with nested canalyzing functions. We show that the application of spectral learning algorithms leads to both better time and sample complexity for the detection of controlling nodes compared with algorithms based on exhaustive search. For a particular algorithm, we state analytical upper bounds on the number of samples needed to find the controlling nodes of the Boolean functions. Further, improved algorithms for detecting controlling nodes in large-scale unate networks are given and numerically studied.

假设调节网络的布尔模型对扰动具有容忍度。这在定性上意味着每个函数只能依赖于几个节点。生物动机约束进一步表明,在布尔调节网络中发现的功能属于某些类型的功能,例如,单函数。这些类在傅里叶域中有特定的性质。这促使我们研究利用谱技术在布尔网络中检测控制节点的问题。我们考虑具有不平衡函数和平均灵敏度小于23k的函数的网络,其中k是函数的控制变量数。此外,我们考虑了一类1-low网络,其中包括unate网络,线性阈值网络和嵌套分析函数网络。我们表明,与基于穷举搜索的算法相比,谱学习算法的应用在控制节点检测方面具有更好的时间和样本复杂度。对于一个特定的算法,我们声明了找到布尔函数的控制节点所需的样本数量的解析上界。在此基础上,提出了一种用于大规模单一网络控制节点检测的改进算法,并进行了数值研究。
{"title":"Detecting controlling nodes of boolean regulatory networks.","authors":"Steffen Schober,&nbsp;David Kracht,&nbsp;Reinhard Heckel,&nbsp;Martin Bossert","doi":"10.1186/1687-4153-2011-6","DOIUrl":"https://doi.org/10.1186/1687-4153-2011-6","url":null,"abstract":"<p><p> Boolean models of regulatory networks are assumed to be tolerant to perturbations. That qualitatively implies that each function can only depend on a few nodes. Biologically motivated constraints further show that functions found in Boolean regulatory networks belong to certain classes of functions, for example, the unate functions. It turns out that these classes have specific properties in the Fourier domain. That motivates us to study the problem of detecting controlling nodes in classes of Boolean networks using spectral techniques. We consider networks with unbalanced functions and functions of an average sensitivity less than 23k, where k is the number of controlling variables for a function. Further, we consider the class of 1-low networks which include unate networks, linear threshold networks, and networks with nested canalyzing functions. We show that the application of spectral learning algorithms leads to both better time and sample complexity for the detection of controlling nodes compared with algorithms based on exhaustive search. For a particular algorithm, we state analytical upper bounds on the number of samples needed to find the controlling nodes of the Boolean functions. Further, improved algorithms for detecting controlling nodes in large-scale unate networks are given and numerically studied.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":"2011 ","pages":"6"},"PeriodicalIF":0.0,"publicationDate":"2011-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1687-4153-2011-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30199263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Unscented Kalman filter with parameter identifiability analysis for the estimation of multiple parameters in kinetic models. 基于参数可辨识性分析的Unscented卡尔曼滤波在动力学模型多参数估计中的应用。
Pub Date : 2011-10-11 DOI: 10.1186/1687-4153-2011-7
Syed Murtuza Baker, C Hart Poskar, Björn H Junker

In systems biology, experimentally measured parameters are not always available, necessitating the use of computationally based parameter estimation. In order to rely on estimated parameters, it is critical to first determine which parameters can be estimated for a given model and measurement set. This is done with parameter identifiability analysis. A kinetic model of the sucrose accumulation in the sugar cane culm tissue developed by Rohwer et al. was taken as a test case model. What differentiates this approach is the integration of an orthogonal-based local identifiability method into the unscented Kalman filter (UKF), rather than using the more common observability-based method which has inherent limitations. It also introduces a variable step size based on the system uncertainty of the UKF during the sensitivity calculation. This method identified 10 out of 12 parameters as identifiable. These ten parameters were estimated using the UKF, which was run 97 times. Throughout the repetitions the UKF proved to be more consistent than the estimation algorithms used for comparison.

在系统生物学中,实验测量的参数并不总是可用的,因此需要使用基于计算的参数估计。为了依赖估计的参数,首先确定对于给定的模型和测量集可以估计哪些参数是至关重要的。这是通过参数可识别性分析完成的。采用Rohwer等人建立的甘蔗茎组织中蔗糖积累的动力学模型作为试验用例模型。这种方法的不同之处在于将基于正交的局部可识别性方法集成到无气味卡尔曼滤波器(UKF)中,而不是使用更常见的基于可观察性的方法,这种方法具有固有的局限性。在灵敏度计算中引入了基于UKF系统不确定性的可变步长。该方法从12个参数中识别出10个可识别参数。使用UKF对这十个参数进行了估计,UKF运行了97次。在整个重复过程中,UKF被证明比用于比较的估计算法更加一致。
{"title":"Unscented Kalman filter with parameter identifiability analysis for the estimation of multiple parameters in kinetic models.","authors":"Syed Murtuza Baker,&nbsp;C Hart Poskar,&nbsp;Björn H Junker","doi":"10.1186/1687-4153-2011-7","DOIUrl":"https://doi.org/10.1186/1687-4153-2011-7","url":null,"abstract":"<p><p> In systems biology, experimentally measured parameters are not always available, necessitating the use of computationally based parameter estimation. In order to rely on estimated parameters, it is critical to first determine which parameters can be estimated for a given model and measurement set. This is done with parameter identifiability analysis. A kinetic model of the sucrose accumulation in the sugar cane culm tissue developed by Rohwer et al. was taken as a test case model. What differentiates this approach is the integration of an orthogonal-based local identifiability method into the unscented Kalman filter (UKF), rather than using the more common observability-based method which has inherent limitations. It also introduces a variable step size based on the system uncertainty of the UKF during the sensitivity calculation. This method identified 10 out of 12 parameters as identifiable. These ten parameters were estimated using the UKF, which was run 97 times. Throughout the repetitions the UKF proved to be more consistent than the estimation algorithms used for comparison.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":" ","pages":"7"},"PeriodicalIF":0.0,"publicationDate":"2011-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1687-4153-2011-7","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30200695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Measuring and analyzing tissue specificity of human genes and protein complexes. 测量和分析人类基因和蛋白质复合物的组织特异性。
Pub Date : 2011-08-04 DOI: 10.1186/1687-4153-2011-5
Dorothea Emig, Tim Kacprowski, Mario Albrecht

Proteins and their interactions are essential for the survival of each human cell. Knowledge of their tissue occurrence is important for understanding biological processes. Therefore, we analyzed microarray and high-throughput RNA-sequencing data to identify tissue-specific and universally expressed genes. Gene expression data were used to investigate the presence of proteins, protein interactions and protein complexes in different tissues. Our comparison shows that the detection of tissue-specific genes and proteins strongly depends on the applied measurement technique. We found that microarrays are less sensitive for low expressed genes than high-throughput sequencing. Functional analyses based on microarray data are thus biased towards high expressed genes. This also means that previous biological findings based on microarrays might have to be re-examined using high-throughput sequencing results.

蛋白质及其相互作用对每个人类细胞的生存至关重要。了解它们的组织发生对于理解生物过程是很重要的。因此,我们分析了微阵列和高通量rna测序数据,以鉴定组织特异性和普遍表达的基因。基因表达数据用于研究蛋白质、蛋白质相互作用和蛋白质复合物在不同组织中的存在。我们的比较表明,组织特异性基因和蛋白质的检测在很大程度上取决于应用的测量技术。我们发现微阵列对低表达基因的敏感性低于高通量测序。因此,基于微阵列数据的功能分析偏向于高表达基因。这也意味着以前基于微阵列的生物学发现可能必须使用高通量测序结果重新检查。
{"title":"Measuring and analyzing tissue specificity of human genes and protein complexes.","authors":"Dorothea Emig,&nbsp;Tim Kacprowski,&nbsp;Mario Albrecht","doi":"10.1186/1687-4153-2011-5","DOIUrl":"https://doi.org/10.1186/1687-4153-2011-5","url":null,"abstract":"<p><p> Proteins and their interactions are essential for the survival of each human cell. Knowledge of their tissue occurrence is important for understanding biological processes. Therefore, we analyzed microarray and high-throughput RNA-sequencing data to identify tissue-specific and universally expressed genes. Gene expression data were used to investigate the presence of proteins, protein interactions and protein complexes in different tissues. Our comparison shows that the detection of tissue-specific genes and proteins strongly depends on the applied measurement technique. We found that microarrays are less sensitive for low expressed genes than high-throughput sequencing. Functional analyses based on microarray data are thus biased towards high expressed genes. This also means that previous biological findings based on microarrays might have to be re-examined using high-throughput sequencing results.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":" ","pages":"5"},"PeriodicalIF":0.0,"publicationDate":"2011-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1687-4153-2011-5","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30185097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
期刊
EURASIP journal on bioinformatics & systems biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1