首页 > 最新文献

EURASIP journal on bioinformatics & systems biology最新文献

英文 中文
Stability from structure: metabolic networks are unlike other biological networks. 结构稳定性:代谢网络不同于其他生物网络。
Pub Date : 2009-01-01 DOI: 10.1155/2009/630695
P van Nes, D Bellomo, M J T Reinders, D de Ridder

In recent work, attempts have been made to link the structure of biochemical networks to their complex dynamics. It was shown that structurally stable network motifs are enriched in such networks. In this work, we investigate to what extent these findings apply to metabolic networks. To this end, we extend a previously proposed method by changing the null model for determining motif enrichment, by using interaction types directly obtained from structural interaction matrices, by generating a distribution of partial derivatives of reaction rates and by simulating enzymatic regulation on metabolic networks. Our findings suggest that the conclusions drawn in previous work cannot be extended to metabolic networks, that is, structurally stable network motifs are not enriched in metabolic networks.

在最近的工作中,已经尝试将生化网络的结构与其复杂的动力学联系起来。结果表明,结构稳定的网络基序在这些网络中丰富。在这项工作中,我们调查了这些发现在多大程度上适用于代谢网络。为此,我们扩展了先前提出的方法,通过改变确定基序富集的零模型,通过使用直接从结构相互作用矩阵中获得的相互作用类型,通过生成反应速率的偏导数分布以及通过模拟酶对代谢网络的调节。我们的研究结果表明,以前的工作结论不能推广到代谢网络,即结构稳定的网络基序在代谢网络中并不丰富。
{"title":"Stability from structure: metabolic networks are unlike other biological networks.","authors":"P van Nes,&nbsp;D Bellomo,&nbsp;M J T Reinders,&nbsp;D de Ridder","doi":"10.1155/2009/630695","DOIUrl":"https://doi.org/10.1155/2009/630695","url":null,"abstract":"<p><p>In recent work, attempts have been made to link the structure of biochemical networks to their complex dynamics. It was shown that structurally stable network motifs are enriched in such networks. In this work, we investigate to what extent these findings apply to metabolic networks. To this end, we extend a previously proposed method by changing the null model for determining motif enrichment, by using interaction types directly obtained from structural interaction matrices, by generating a distribution of partial derivatives of reaction rates and by simulating enzymatic regulation on metabolic networks. Our findings suggest that the conclusions drawn in previous work cannot be extended to metabolic networks, that is, structurally stable network motifs are not enriched in metabolic networks.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":" ","pages":"630695"},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2009/630695","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9786273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Using a state-space model and location analysis to infer time-delayed regulatory networks. 使用状态空间模型和位置分析来推断时滞调节网络。
Pub Date : 2009-01-01 Epub Date: 2009-10-15 DOI: 10.1155/2009/484601
Chushin Koh, Fang-Xiang Wu, Gopalan Selvaraj, Anthony J Kusalik

Computational gene regulation models provide a means for scientists to draw biological inferences from time-course gene expression data. Based on the state-space approach, we developed a new modeling tool for inferring gene regulatory networks, called time-delayed Gene Regulatory Networks (tdGRNs). tdGRN takes time-delayed regulatory relationships into consideration when developing the model. In addition, a priori biological knowledge from genome-wide location analysis is incorporated into the structure of the gene regulatory network. tdGRN is evaluated on both an artificial dataset and a published gene expression data set. It not only determines regulatory relationships that are known to exist but also uncovers potential new ones. The results indicate that the proposed tool is effective in inferring gene regulatory relationships with time delay. tdGRN is complementary to existing methods for inferring gene regulatory networks. The novel part of the proposed tool is that it is able to infer time-delayed regulatory relationships.

计算基因调控模型为科学家提供了一种从基因表达数据中得出生物学推论的方法。基于状态空间方法,我们开发了一种新的用于推断基因调控网络的建模工具,称为时滞基因调控网络(tdGRNs)。tdGRN在开发模型时考虑了时滞调节关系。此外,来自全基因组定位分析的先验生物学知识被纳入基因调控网络的结构中。tdGRN在人工数据集和已发表的基因表达数据集上进行了评估。它不仅决定了已知存在的调节关系,而且还揭示了潜在的新关系。结果表明,该工具可以有效地推断基因调控与时间延迟的关系。tdGRN是对现有推断基因调控网络方法的补充。提出的工具的新颖之处在于它能够推断时间延迟的调节关系。
{"title":"Using a state-space model and location analysis to infer time-delayed regulatory networks.","authors":"Chushin Koh,&nbsp;Fang-Xiang Wu,&nbsp;Gopalan Selvaraj,&nbsp;Anthony J Kusalik","doi":"10.1155/2009/484601","DOIUrl":"https://doi.org/10.1155/2009/484601","url":null,"abstract":"<p><p>Computational gene regulation models provide a means for scientists to draw biological inferences from time-course gene expression data. Based on the state-space approach, we developed a new modeling tool for inferring gene regulatory networks, called time-delayed Gene Regulatory Networks (tdGRNs). tdGRN takes time-delayed regulatory relationships into consideration when developing the model. In addition, a priori biological knowledge from genome-wide location analysis is incorporated into the structure of the gene regulatory network. tdGRN is evaluated on both an artificial dataset and a published gene expression data set. It not only determines regulatory relationships that are known to exist but also uncovers potential new ones. The results indicate that the proposed tool is effective in inferring gene regulatory relationships with time delay. tdGRN is complementary to existing methods for inferring gene regulatory networks. The novel part of the proposed tool is that it is able to infer time-delayed regulatory relationships.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":" ","pages":"484601"},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2009/484601","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"28072916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Adaptive dynamics of regulatory networks: size matters. 调节网络的自适应动态:大小问题。
Pub Date : 2009-01-01 Epub Date: 2009-03-12 DOI: 10.1155/2009/618502
Dirk Repsilber, Thomas Martinetz, Mats Björklund

To accomplish adaptability, all living organisms are constructed of regulatory networks on different levels which are capable to differentially respond to a variety of environmental inputs. Structure of regulatory networks determines their phenotypical plasticity, that is, the degree of detail and appropriateness of regulatory replies to environmental or developmental challenges. This regulatory network structure is encoded within the genotype. Our conceptual simulation study investigates how network structure constrains the evolution of networks and their adaptive abilities. The focus is on the structural parameter network size. We show that small regulatory networks adapt fast, but not as good as larger networks in the longer perspective. Selection leads to an optimal network size dependent on heterogeneity of the environment and time pressure of adaptation. Optimal mutation rates are higher for smaller networks. We put special emphasis on discussing our simulation results on the background of functional observations from experimental and evolutionary biology.

为了实现适应性,所有生物都是由不同层次的调节网络构成的,这些调节网络能够对各种环境输入做出不同的反应。调节网络的结构决定了它们的表型可塑性,也就是说,调节对环境或发展挑战的反应的详细程度和适当性。这种调控网络结构被编码在基因型中。我们的概念模拟研究探讨了网络结构如何约束网络的进化及其适应能力。重点研究了网络大小的结构参数。我们表明,小型监管网络适应速度很快,但从长远来看,不如大型网络适应得好。选择导致最优网络大小取决于环境的异质性和适应的时间压力。对于较小的网络,最佳突变率更高。我们特别强调在实验和进化生物学的功能观察背景下讨论我们的模拟结果。
{"title":"Adaptive dynamics of regulatory networks: size matters.","authors":"Dirk Repsilber,&nbsp;Thomas Martinetz,&nbsp;Mats Björklund","doi":"10.1155/2009/618502","DOIUrl":"https://doi.org/10.1155/2009/618502","url":null,"abstract":"<p><p>To accomplish adaptability, all living organisms are constructed of regulatory networks on different levels which are capable to differentially respond to a variety of environmental inputs. Structure of regulatory networks determines their phenotypical plasticity, that is, the degree of detail and appropriateness of regulatory replies to environmental or developmental challenges. This regulatory network structure is encoded within the genotype. Our conceptual simulation study investigates how network structure constrains the evolution of networks and their adaptive abilities. The focus is on the structural parameter network size. We show that small regulatory networks adapt fast, but not as good as larger networks in the longer perspective. Selection leads to an optimal network size dependent on heterogeneity of the environment and time pressure of adaptation. Optimal mutation rates are higher for smaller networks. We put special emphasis on discussing our simulation results on the background of functional observations from experimental and evolutionary biology.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":" ","pages":"618502"},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2009/618502","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"28082821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the impact of entropy estimation on transcriptional regulatory network inference based on mutual information. 熵估计对基于互信息的转录调控网络推断的影响。
Pub Date : 2009-01-01 DOI: 10.1155/2009/308959
Catharina Olsen, Patrick E Meyer, Gianluca Bontempi

The reverse engineering of transcription regulatory networks from expression data is gaining large interest in the bioinformatics community. An important family of inference techniques is represented by algorithms based on information theoretic measures which rely on the computation of pairwise mutual information. This paper aims to study the impact of the entropy estimator on the quality of the inferred networks. This is done by means of a comprehensive study which takes into consideration three state-of-the-art mutual information algorithms: ARACNE, CLR, and MRNET. Two different setups are considered in this work. The first one considers a set of 12 synthetically generated datasets to compare 8 different entropy estimators and three network inference algorithms. The two methods emerging as the most accurate ones from the first set of experiments are the MRNET method combined with the newly applied Spearman correlation and the CLR method combined with the Pearson correlation. The validation of these two techniques is then carried out on a set of 10 public domain microarray datasets measuring the transcriptional regulatory activity in the yeast organism.

从表达数据的转录调控网络的逆向工程在生物信息学社区中获得了很大的兴趣。基于信息论测度的算法是一类重要的推理技术,它依赖于两两互信息的计算。本文旨在研究熵估计量对推断网络质量的影响。这是通过一项综合研究来完成的,该研究考虑了三种最先进的互信息算法:ARACNE, CLR和MRNET。在这项工作中考虑了两种不同的设置。第一种方法考虑了一组12个综合生成的数据集,比较了8种不同的熵估计器和3种网络推理算法。第一组实验中出现的最准确的两种方法是结合新应用的Spearman相关的MRNET方法和结合Pearson相关的CLR方法。这两种技术的验证随后在一组10个公共领域微阵列数据集上进行,测量酵母生物体内的转录调控活性。
{"title":"On the impact of entropy estimation on transcriptional regulatory network inference based on mutual information.","authors":"Catharina Olsen,&nbsp;Patrick E Meyer,&nbsp;Gianluca Bontempi","doi":"10.1155/2009/308959","DOIUrl":"https://doi.org/10.1155/2009/308959","url":null,"abstract":"<p><p>The reverse engineering of transcription regulatory networks from expression data is gaining large interest in the bioinformatics community. An important family of inference techniques is represented by algorithms based on information theoretic measures which rely on the computation of pairwise mutual information. This paper aims to study the impact of the entropy estimator on the quality of the inferred networks. This is done by means of a comprehensive study which takes into consideration three state-of-the-art mutual information algorithms: ARACNE, CLR, and MRNET. Two different setups are considered in this work. The first one considers a set of 12 synthetically generated datasets to compare 8 different entropy estimators and three network inference algorithms. The two methods emerging as the most accurate ones from the first set of experiments are the MRNET method combined with the newly applied Spearman correlation and the CLR method combined with the Pearson correlation. The validation of these two techniques is then carried out on a set of 10 public domain microarray datasets measuring the transcriptional regulatory activity in the yeast organism.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":" ","pages":"308959"},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2009/308959","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9425662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 73
Compressive sensing DNA microarrays. 压缩传感DNA微阵列。
Pub Date : 2009-01-01 Epub Date: 2009-01-13 DOI: 10.1155/2009/162824
Wei Dai, Mona A Sheikh, Olgica Milenkovic, Richard G Baraniuk

Compressive sensing microarrays (CSMs) are DNA-based sensors that operate using group testing and compressive sensing (CS) principles. In contrast to conventional DNA microarrays, in which each genetic sensor is designed to respond to a single target, in a CSM, each sensor responds to a set of targets. We study the problem of designing CSMs that simultaneously account for both the constraints from CS theory and the biochemistry of probe-target DNA hybridization. An appropriate cross-hybridization model is proposed for CSMs, and several methods are developed for probe design and CS signal recovery based on the new model. Lab experiments suggest that in order to achieve accurate hybridization profiling, consensus probe sequences are required to have sequence homology of at least 80% with all targets to be detected. Furthermore, out-of-equilibrium datasets are usually as accurate as those obtained from equilibrium conditions. Consequently, one can use CSMs in applications in which only short hybridization times are allowed.

压缩感知微阵列(csm)是基于dna的传感器,使用组测试和压缩感知(CS)原理进行操作。在传统的DNA微阵列中,每个基因传感器被设计为对单个目标做出反应,而在CSM中,每个传感器对一组目标做出反应。我们研究了同时考虑CS理论和探针-靶DNA杂交的生物化学约束的csm设计问题。提出了合适的交叉杂交模型,并在此基础上提出了探头设计和信号恢复的几种方法。实验室实验表明,为了实现准确的杂交分析,一致的探针序列需要与所有待检测目标具有至少80%的序列同源性。此外,非平衡数据集通常与从平衡条件下获得的数据集一样准确。因此,可以在只允许短杂交时间的应用中使用csm。
{"title":"Compressive sensing DNA microarrays.","authors":"Wei Dai,&nbsp;Mona A Sheikh,&nbsp;Olgica Milenkovic,&nbsp;Richard G Baraniuk","doi":"10.1155/2009/162824","DOIUrl":"https://doi.org/10.1155/2009/162824","url":null,"abstract":"<p><p>Compressive sensing microarrays (CSMs) are DNA-based sensors that operate using group testing and compressive sensing (CS) principles. In contrast to conventional DNA microarrays, in which each genetic sensor is designed to respond to a single target, in a CSM, each sensor responds to a set of targets. We study the problem of designing CSMs that simultaneously account for both the constraints from CS theory and the biochemistry of probe-target DNA hybridization. An appropriate cross-hybridization model is proposed for CSMs, and several methods are developed for probe design and CS signal recovery based on the new model. Lab experiments suggest that in order to achieve accurate hybridization profiling, consensus probe sequences are required to have sequence homology of at least 80% with all targets to be detected. Furthermore, out-of-equilibrium datasets are usually as accurate as those obtained from equilibrium conditions. Consequently, one can use CSMs in applications in which only short hybridization times are allowed.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":" ","pages":"162824"},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2009/162824","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"27937313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 79
Towards systems biology of heterosis: a hypothesis about molecular network structure applied for the Arabidopsis metabolome. 走向杂种优势的系统生物学:拟南芥代谢组的分子网络结构假说。
Pub Date : 2009-01-01 DOI: 10.1155/2009/147157
Sandra Andorf, Tanja Gärtner, Matthias Steinfath, Hanna Witucka-Wall, Thomas Altmann, Dirk Repsilber

We propose a network structure-based model for heterosis, and investigate it relying on metabolite profiles from Arabidopsis. A simple feed-forward two-layer network model (the Steinbuch matrix) is used in our conceptual approach. It allows for directly relating structural network properties with biological function. Interpreting heterosis as increased adaptability, our model predicts that the biological networks involved show increasing connectivity of regulatory interactions. A detailed analysis of metabolite profile data reveals that the increasing-connectivity prediction is true for graphical Gaussian models in our data from early development. This mirrors properties of observed heterotic Arabidopsis phenotypes. Furthermore, the model predicts a limit for increasing hybrid vigor with increasing heterozygosity--a known phenomenon in the literature.

我们提出了一个基于网络结构的杂种优势模型,并根据拟南芥的代谢物谱对其进行了研究。在我们的概念方法中使用了一个简单的前馈两层网络模型(Steinbuch矩阵)。它允许将结构网络特性与生物功能直接联系起来。将杂种优势解释为适应性的增强,我们的模型预测,涉及的生物网络显示出越来越多的调节相互作用的连通性。对代谢物剖面数据的详细分析表明,在我们早期发展的数据中,图形高斯模型的连通性预测是正确的。这反映了观察到的拟南芥杂种表型的特性。此外,该模型还预测了杂合度增加对杂种活力增加的限制——这是文献中已知的现象。
{"title":"Towards systems biology of heterosis: a hypothesis about molecular network structure applied for the Arabidopsis metabolome.","authors":"Sandra Andorf,&nbsp;Tanja Gärtner,&nbsp;Matthias Steinfath,&nbsp;Hanna Witucka-Wall,&nbsp;Thomas Altmann,&nbsp;Dirk Repsilber","doi":"10.1155/2009/147157","DOIUrl":"https://doi.org/10.1155/2009/147157","url":null,"abstract":"<p><p>We propose a network structure-based model for heterosis, and investigate it relying on metabolite profiles from Arabidopsis. A simple feed-forward two-layer network model (the Steinbuch matrix) is used in our conceptual approach. It allows for directly relating structural network properties with biological function. Interpreting heterosis as increased adaptability, our model predicts that the biological networks involved show increasing connectivity of regulatory interactions. A detailed analysis of metabolite profile data reveals that the increasing-connectivity prediction is true for graphical Gaussian models in our data from early development. This mirrors properties of observed heterotic Arabidopsis phenotypes. Furthermore, the model predicts a limit for increasing hybrid vigor with increasing heterozygosity--a known phenomenon in the literature.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":" ","pages":"147157"},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2009/147157","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9801476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Modelling transcriptional regulation with a mixture of factor analyzers and variational Bayesian expectation maximization. 模拟转录调节与因子分析和变分贝叶斯期望最大化的混合物。
Pub Date : 2009-01-01 DOI: 10.1155/2009/601068
Kuang Lin, Dirk Husmeier

Understanding the mechanisms of gene transcriptional regulation through analysis of high-throughput postgenomic data is one of the central problems of computational systems biology. Various approaches have been proposed, but most of them fail to address at least one of the following objectives: (1) allow for the fact that transcription factors are potentially subject to posttranscriptional regulation; (2) allow for the fact that transcription factors cooperate as a functional complex in regulating gene expression, and (3) provide a model and a learning algorithm with manageable computational complexity. The objective of the present study is to propose and test a method that addresses these three issues. The model we employ is a mixture of factor analyzers, in which the latent variables correspond to different transcription factors, grouped into complexes or modules. We pursue inference in a Bayesian framework, using the Variational Bayesian Expectation Maximization (VBEM) algorithm for approximate inference of the posterior distributions of the model parameters, and estimation of a lower bound on the marginal likelihood for model selection. We have evaluated the performance of the proposed method on three criteria: activity profile reconstruction, gene clustering, and network inference.

通过分析高通量基因组后数据来理解基因转录调控的机制是计算系统生物学的核心问题之一。已经提出了各种方法,但大多数方法都未能解决以下目标中的至少一个:(1)考虑到转录因子可能受到转录后调控的事实;(2)考虑到转录因子作为一个功能复合体在调节基因表达方面的合作,以及(3)提供一个具有可控计算复杂性的模型和学习算法。本研究的目的是提出并测试一种解决这三个问题的方法。我们采用的模型是因子分析的混合物,其中潜在变量对应于不同的转录因子,分组成复合体或模块。我们在贝叶斯框架中进行推理,使用变分贝叶斯期望最大化(VBEM)算法对模型参数的后验分布进行近似推理,并估计模型选择的边际似然的下界。我们在三个标准上评估了所提出方法的性能:活动谱重建、基因聚类和网络推断。
{"title":"Modelling transcriptional regulation with a mixture of factor analyzers and variational Bayesian expectation maximization.","authors":"Kuang Lin,&nbsp;Dirk Husmeier","doi":"10.1155/2009/601068","DOIUrl":"https://doi.org/10.1155/2009/601068","url":null,"abstract":"<p><p>Understanding the mechanisms of gene transcriptional regulation through analysis of high-throughput postgenomic data is one of the central problems of computational systems biology. Various approaches have been proposed, but most of them fail to address at least one of the following objectives: (1) allow for the fact that transcription factors are potentially subject to posttranscriptional regulation; (2) allow for the fact that transcription factors cooperate as a functional complex in regulating gene expression, and (3) provide a model and a learning algorithm with manageable computational complexity. The objective of the present study is to propose and test a method that addresses these three issues. The model we employ is a mixture of factor analyzers, in which the latent variables correspond to different transcription factors, grouped into complexes or modules. We pursue inference in a Bayesian framework, using the Variational Bayesian Expectation Maximization (VBEM) algorithm for approximate inference of the posterior distributions of the model parameters, and estimation of a lower bound on the marginal likelihood for model selection. We have evaluated the performance of the proposed method on three criteria: activity profile reconstruction, gene clustering, and network inference.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":" ","pages":"601068"},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2009/601068","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9433271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Is bagging effective in the classification of small-sample genomic and proteomic data? bagging在小样本基因组和蛋白质组学数据分类中有效吗?
Pub Date : 2009-01-01 DOI: 10.1155/2009/158368
T T Vu, U M Braga-Neto

There has been considerable interest recently in the application of bagging in the classification of both gene-expression data and protein-abundance mass spectrometry data. The approach is often justified by the improvement it produces on the performance of unstable, overfitting classification rules under small-sample situations. However, the question of real practical interest is whether the ensemble scheme will improve performance of those classifiers sufficiently to beat the performance of single stable, nonoverfitting classifiers, in the case of small-sample genomic and proteomic data sets. To investigate that question, we conducted a detailed empirical study, using publicly-available data sets from published genomic and proteomic studies. We observed that, under t-test and RELIEF filter-based feature selection, bagging generally does a good job of improving the performance of unstable, overfitting classifiers, such as CART decision trees and neural networks, but that improvement was not sufficient to beat the performance of single stable, nonoverfitting classifiers, such as diagonal and plain linear discriminant analysis, or 3-nearest neighbors. Furthermore, as expected, the ensemble method did not improve the performance of these classifiers significantly. Representative experimental results are presented and discussed in this work.

最近在基因表达数据和蛋白质丰度质谱数据的分类中应用bagging技术引起了相当大的兴趣。在小样本情况下,这种方法对不稳定的、过拟合的分类规则的性能的改进通常是合理的。然而,真正实际的问题是,在小样本基因组和蛋白质组学数据集的情况下,集成方案是否能够充分提高这些分类器的性能,以击败单一稳定的、非过拟合的分类器的性能。为了调查这个问题,我们进行了一项详细的实证研究,使用了来自已发表的基因组和蛋白质组学研究的公开数据集。我们观察到,在t检验和基于RELIEF滤波器的特征选择下,套袋通常可以很好地提高不稳定、过拟合分类器(如CART决策树和神经网络)的性能,但这种改进不足以击败单一稳定、非过拟合分类器(如对角和普通线性判别分析)或3近邻分类器的性能。此外,正如预期的那样,集成方法并没有显著提高这些分类器的性能。本文给出了具有代表性的实验结果,并对其进行了讨论。
{"title":"Is bagging effective in the classification of small-sample genomic and proteomic data?","authors":"T T Vu,&nbsp;U M Braga-Neto","doi":"10.1155/2009/158368","DOIUrl":"https://doi.org/10.1155/2009/158368","url":null,"abstract":"<p><p>There has been considerable interest recently in the application of bagging in the classification of both gene-expression data and protein-abundance mass spectrometry data. The approach is often justified by the improvement it produces on the performance of unstable, overfitting classification rules under small-sample situations. However, the question of real practical interest is whether the ensemble scheme will improve performance of those classifiers sufficiently to beat the performance of single stable, nonoverfitting classifiers, in the case of small-sample genomic and proteomic data sets. To investigate that question, we conducted a detailed empirical study, using publicly-available data sets from published genomic and proteomic studies. We observed that, under t-test and RELIEF filter-based feature selection, bagging generally does a good job of improving the performance of unstable, overfitting classifiers, such as CART decision trees and neural networks, but that improvement was not sufficient to beat the performance of single stable, nonoverfitting classifiers, such as diagonal and plain linear discriminant analysis, or 3-nearest neighbors. Furthermore, as expected, the ensemble method did not improve the performance of these classifiers significantly. Representative experimental results are presented and discussed in this work.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":" ","pages":"158368"},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2009/158368","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9786272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Transition dependency: a gene-gene interaction measure for times series microarray data. 过渡依赖:时间序列微阵列数据的基因-基因相互作用测量。
Pub Date : 2009-01-01 Epub Date: 2009-02-05 DOI: 10.1155/2009/535869
Xin Gao, Daniel Q Pu, Peter X-K Song

Gene-Gene dependency plays a very important role in system biology as it pertains to the crucial understanding of different biological mechanisms. Time-course microarray data provides a new platform useful to reveal the dynamic mechanism of gene-gene dependencies. Existing interaction measures are mostly based on association measures, such as Pearson or Spearman correlations. However, it is well known that such interaction measures can only capture linear or monotonic dependency relationships but not for nonlinear combinatorial dependency relationships. With the invocation of hidden Markov models, we propose a new measure of pairwise dependency based on transition probabilities. The new dynamic interaction measure checks whether or not the joint transition kernel of the bivariate state variables is the product of two marginal transition kernels. This new measure enables us not only to evaluate the strength, but also to infer the details of gene dependencies. It reveals nonlinear combinatorial dependency structure in two aspects: between two genes and across adjacent time points. We conduct a bootstrap-based chi(2) test for presence/absence of the dependency between every pair of genes. Simulation studies and real biological data analysis demonstrate the application of the proposed method. The software package is available under request.

基因-基因依赖性在系统生物学中起着非常重要的作用,因为它关系到对不同生物机制的重要理解。时间过程微阵列数据为揭示基因-基因依赖的动态机制提供了一个新的平台。现有的交互度量大多基于关联度量,如Pearson或Spearman相关性。然而,众所周知,这种相互作用度量只能捕获线性或单调依赖关系,而不能捕获非线性组合依赖关系。利用隐马尔可夫模型,提出了一种基于转移概率的两两依赖度量方法。新的动态相互作用测度检查二元状态变量的联合转移核是否为两个边缘转移核的乘积。这种新方法不仅使我们能够评估强度,而且还可以推断基因依赖性的细节。它揭示了两个基因之间和相邻时间点之间的非线性组合依赖结构。我们对每对基因之间是否存在依赖关系进行了基于自举的chi(2)检验。仿真研究和实际生物数据分析验证了该方法的应用。该软件包可根据要求提供。
{"title":"Transition dependency: a gene-gene interaction measure for times series microarray data.","authors":"Xin Gao,&nbsp;Daniel Q Pu,&nbsp;Peter X-K Song","doi":"10.1155/2009/535869","DOIUrl":"https://doi.org/10.1155/2009/535869","url":null,"abstract":"<p><p>Gene-Gene dependency plays a very important role in system biology as it pertains to the crucial understanding of different biological mechanisms. Time-course microarray data provides a new platform useful to reveal the dynamic mechanism of gene-gene dependencies. Existing interaction measures are mostly based on association measures, such as Pearson or Spearman correlations. However, it is well known that such interaction measures can only capture linear or monotonic dependency relationships but not for nonlinear combinatorial dependency relationships. With the invocation of hidden Markov models, we propose a new measure of pairwise dependency based on transition probabilities. The new dynamic interaction measure checks whether or not the joint transition kernel of the bivariate state variables is the product of two marginal transition kernels. This new measure enables us not only to evaluate the strength, but also to infer the details of gene dependencies. It reveals nonlinear combinatorial dependency structure in two aspects: between two genes and across adjacent time points. We conduct a bootstrap-based chi(2) test for presence/absence of the dependency between every pair of genes. Simulation studies and real biological data analysis demonstrate the application of the proposed method. The software package is available under request.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":" ","pages":"535869"},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2009/535869","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"27992495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Intervention in context-sensitive probabilistic Boolean networks revisited. 重新审视上下文敏感概率布尔网络中的干预。
Pub Date : 2009-01-01 Epub Date: 2009-04-15 DOI: 10.1155/2009/360864
Babak Faryabi, Golnaz Vahedi, Jean-Francois Chamberland, Aniruddha Datta, Edward R Dougherty

An approximate representation for the state space of a context-sensitive probabilistic Boolean network has previously been proposed and utilized to devise therapeutic intervention strategies. Whereas the full state of a context-sensitive probabilistic Boolean network is specified by an ordered pair composed of a network context and a gene-activity profile, this approximate representation collapses the state space onto the gene-activity profiles alone. This reduction yields an approximate transition probability matrix, absent of context, for the Markov chain associated with the context-sensitive probabilistic Boolean network. As with many approximation methods, a price must be paid for using a reduced model representation, namely, some loss of optimality relative to using the full state space. This paper examines the effects on intervention performance caused by the reduction with respect to various values of the model parameters. This task is performed using a new derivation for the transition probability matrix of the context-sensitive probabilistic Boolean network. This expression of transition probability distributions is in concert with the original definition of context-sensitive probabilistic Boolean network. The performance of optimal and approximate therapeutic strategies is compared for both synthetic networks and a real case study. It is observed that the approximate representation describes the dynamics of the context-sensitive probabilistic Boolean network through the instantaneously random probabilistic Boolean network with similar parameters.

上下文敏感的概率布尔网络的状态空间的近似表示已经被提出并用于设计治疗干预策略。而上下文敏感的概率布尔网络的完整状态是由网络上下文和基因活动概况组成的有序对指定的,这种近似表示将状态空间单独折叠到基因活动概况上。这种约简产生了与上下文敏感的概率布尔网络相关的马尔可夫链的近似转移概率矩阵,没有上下文。与许多近似方法一样,使用简化的模型表示必须付出代价,即,相对于使用完整状态空间,会损失一些最优性。本文考察了相对于模型参数的不同值的减少对干预效果的影响。该任务是使用上下文敏感概率布尔网络的转移概率矩阵的新推导来执行的。这种转移概率分布的表达式与上下文敏感概率布尔网络的原始定义是一致的。在合成网络和实际案例研究中比较了最优治疗策略和近似治疗策略的性能。观察到,近似表示通过具有相似参数的瞬时随机概率布尔网络来描述上下文敏感概率布尔网络的动态。
{"title":"Intervention in context-sensitive probabilistic Boolean networks revisited.","authors":"Babak Faryabi,&nbsp;Golnaz Vahedi,&nbsp;Jean-Francois Chamberland,&nbsp;Aniruddha Datta,&nbsp;Edward R Dougherty","doi":"10.1155/2009/360864","DOIUrl":"https://doi.org/10.1155/2009/360864","url":null,"abstract":"<p><p>An approximate representation for the state space of a context-sensitive probabilistic Boolean network has previously been proposed and utilized to devise therapeutic intervention strategies. Whereas the full state of a context-sensitive probabilistic Boolean network is specified by an ordered pair composed of a network context and a gene-activity profile, this approximate representation collapses the state space onto the gene-activity profiles alone. This reduction yields an approximate transition probability matrix, absent of context, for the Markov chain associated with the context-sensitive probabilistic Boolean network. As with many approximation methods, a price must be paid for using a reduced model representation, namely, some loss of optimality relative to using the full state space. This paper examines the effects on intervention performance caused by the reduction with respect to various values of the model parameters. This task is performed using a new derivation for the transition probability matrix of the context-sensitive probabilistic Boolean network. This expression of transition probability distributions is in concert with the original definition of context-sensitive probabilistic Boolean network. The performance of optimal and approximate therapeutic strategies is compared for both synthetic networks and a real case study. It is observed that the approximate representation describes the dynamics of the context-sensitive probabilistic Boolean network through the instantaneously random probabilistic Boolean network with similar parameters.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":" ","pages":"360864"},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2009/360864","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"28141504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 38
期刊
EURASIP journal on bioinformatics & systems biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1