首页 > 最新文献

2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)最新文献

英文 中文
Regularization of sequence data for machine learning 用于机器学习的序列数据正则化
Bryan Bai, S. C. Kremer
We examine the problem of classifying biological sequences, and in particular the challenge of generalizing results to novel input data. We observe that the high-dimensionality of sequence data representations results in an extremely sparsely populated input space. This motivates a need for regularization (a form of inductive bias), in order to achieve generalization. We discuss regularization in the context of regular neural networks, deep belief networks and support vector machines, and provide experimental results for these architectures. Our results support the importance of using an effective regularization method and identify which methods work well on a real-world dataset.
我们研究了分类生物序列的问题,特别是将结果推广到新输入数据的挑战。我们观察到,序列数据表示的高维导致了一个极其稀疏的输入空间。这激发了对正则化(归纳偏差的一种形式)的需求,以实现泛化。我们在规则神经网络、深度信念网络和支持向量机的背景下讨论了正则化,并提供了这些架构的实验结果。我们的结果支持使用有效的正则化方法的重要性,并确定哪些方法在真实数据集上工作得很好。
{"title":"Regularization of sequence data for machine learning","authors":"Bryan Bai, S. C. Kremer","doi":"10.1109/BIBMW.2011.6112350","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112350","url":null,"abstract":"We examine the problem of classifying biological sequences, and in particular the challenge of generalizing results to novel input data. We observe that the high-dimensionality of sequence data representations results in an extremely sparsely populated input space. This motivates a need for regularization (a form of inductive bias), in order to achieve generalization. We discuss regularization in the context of regular neural networks, deep belief networks and support vector machines, and provide experimental results for these architectures. Our results support the importance of using an effective regularization method and identify which methods work well on a real-world dataset.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"13 1","pages":"19-25"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87631016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Active Protein Interaction Network and Its Application on Protein Complex Detection 活性蛋白相互作用网络及其在蛋白复合物检测中的应用
Jianxin Wang, Xiaoqing Peng, Min Li, Yong Luo, Yi Pan
In recent years, more and more attentions are focused on modelling and analyzing dynamic network. Some researchers attempted to extract dynamic network by combining the dynamic information from gene expression data or sub cellular localization data with protein network. However, the dynamics of proteins' presence does not guarantee the dynamics of interactions, since the presence of a protein does not indicate the protein's activity. The activity of a protein is closely connected with its function. Thus only the dynamics of proteins activity ensure the dynamics of interaction. The gene expression of a cellular process or cycle carries more information than only the dynamics of proteins' presence. We assume that a protein is active when its expression values are near its maximum expression value, since the expression quantity will decrease after it has performed its function that leads a feedback for controlling the expression quantity. In this paper, we proposed a method to identify active time points for each protein in a cellular process or cycle by using a 3-sigma principle to compute an active threshold for each gene according to the characteristics of its expression curve. Combined the activity information and protein interaction network, we can construct an active protein interaction network (APPI). To demonstrate the efficiency of APPI network model, we applied it on complex detection. Compared with single threshold time series networks, APPI network achieves a better performance on protein complex prediction.
近年来,动态网络的建模和分析越来越受到人们的关注。一些研究者试图将基因表达数据或亚细胞定位数据中的动态信息与蛋白质网络相结合来提取动态网络。然而,蛋白质存在的动态并不能保证相互作用的动态,因为蛋白质的存在并不表明蛋白质的活性。蛋白质的活性与其功能密切相关。因此,只有蛋白质活性的动态才能保证相互作用的动态。细胞过程或周期的基因表达携带的信息比仅仅是蛋白质存在的动态更多。我们假设一个蛋白在其表达值接近其最大表达值时是有活性的,因为它在完成其功能后,表达量会减少,导致控制表达量的反馈。在本文中,我们提出了一种识别细胞过程或周期中每个蛋白质的活性时间点的方法,该方法使用3-sigma原理根据每个基因的表达曲线特征计算每个基因的活性阈值。将活性信息与蛋白质相互作用网络相结合,构建活性蛋白质相互作用网络(APPI)。为了验证APPI网络模型的有效性,我们将其应用于复杂检测。与单阈值时间序列网络相比,APPI网络在蛋白质复合体预测上取得了更好的性能。
{"title":"Active Protein Interaction Network and Its Application on Protein Complex Detection","authors":"Jianxin Wang, Xiaoqing Peng, Min Li, Yong Luo, Yi Pan","doi":"10.1109/BIBM.2011.45","DOIUrl":"https://doi.org/10.1109/BIBM.2011.45","url":null,"abstract":"In recent years, more and more attentions are focused on modelling and analyzing dynamic network. Some researchers attempted to extract dynamic network by combining the dynamic information from gene expression data or sub cellular localization data with protein network. However, the dynamics of proteins' presence does not guarantee the dynamics of interactions, since the presence of a protein does not indicate the protein's activity. The activity of a protein is closely connected with its function. Thus only the dynamics of proteins activity ensure the dynamics of interaction. The gene expression of a cellular process or cycle carries more information than only the dynamics of proteins' presence. We assume that a protein is active when its expression values are near its maximum expression value, since the expression quantity will decrease after it has performed its function that leads a feedback for controlling the expression quantity. In this paper, we proposed a method to identify active time points for each protein in a cellular process or cycle by using a 3-sigma principle to compute an active threshold for each gene according to the characteristics of its expression curve. Combined the activity information and protein interaction network, we can construct an active protein interaction network (APPI). To demonstrate the efficiency of APPI network model, we applied it on complex detection. Compared with single threshold time series networks, APPI network achieves a better performance on protein complex prediction.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"12 1","pages":"37-42"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90168797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Using semantic similarity to detect features in yeast protein complexes 利用语义相似度检测酵母蛋白复合物的特征
P. Guzzi, Marianna Milano, P. Veltri, M. Cannataro
Biological data stored in databases can be associated with information (knowledge) such as experiments, properties and functions, response to drugs etc. Such a knowledge is often stored in biological ontologies. Gene Ontology is one of the main resource of biological knowledge providing both a categorization of terms and a source of annotation for genes and proteins. This enables the use of ontology-based methodologies for the analysis of proteins and their functions. One methodology is based on semantic based similarity measures. Recently there is a growing interest in the use of semantic based methodologies to the analysis of protein interaction data such as the prediction of protein complexes based on semantic similarity measures. Despite this interest, there is the need for an assessment of semantic measures as well as a deep study on the impact of the chosen measure in the obtained results. This paper treats the problem of using semantic similarity measure to analyse protein complexes and to improve protein complexes prediction frameworks. Tests have been performed in yeast protein complexes. Results indicate that there exists a bias among measures as well as an higher value of semantic similarity within proteins that participate in the same complex, proving both a possible use of semantic similarity protein complexes prediction and a suggestion in the measure.
存储在数据库中的生物数据可以与实验、特性和功能、对药物的反应等信息(知识)相关联。这样的知识通常存储在生物本体中。基因本体论是生物学知识的主要来源之一,为基因和蛋白质提供术语分类和注释来源。这使得使用基于本体的方法来分析蛋白质及其功能成为可能。一种方法是基于基于语义的相似度度量。最近,人们对使用基于语义的方法来分析蛋白质相互作用数据越来越感兴趣,例如基于语义相似度量的蛋白质复合物的预测。尽管有这种兴趣,仍然需要对语义度量进行评估,以及对所选度量在获得结果中的影响进行深入研究。本文研究了利用语义相似度来分析蛋白质复合体的问题,并改进了蛋白质复合体的预测框架。已经在酵母蛋白复合物中进行了试验。结果表明,测量之间存在偏差,并且参与同一复合体的蛋白质之间的语义相似值较高,这证明了语义相似蛋白复合体预测的可能用途和测量中的建议。
{"title":"Using semantic similarity to detect features in yeast protein complexes","authors":"P. Guzzi, Marianna Milano, P. Veltri, M. Cannataro","doi":"10.1109/BIBMW.2011.6112419","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112419","url":null,"abstract":"Biological data stored in databases can be associated with information (knowledge) such as experiments, properties and functions, response to drugs etc. Such a knowledge is often stored in biological ontologies. Gene Ontology is one of the main resource of biological knowledge providing both a categorization of terms and a source of annotation for genes and proteins. This enables the use of ontology-based methodologies for the analysis of proteins and their functions. One methodology is based on semantic based similarity measures. Recently there is a growing interest in the use of semantic based methodologies to the analysis of protein interaction data such as the prediction of protein complexes based on semantic similarity measures. Despite this interest, there is the need for an assessment of semantic measures as well as a deep study on the impact of the chosen measure in the obtained results. This paper treats the problem of using semantic similarity measure to analyse protein complexes and to improve protein complexes prediction frameworks. Tests have been performed in yeast protein complexes. Results indicate that there exists a bias among measures as well as an higher value of semantic similarity within proteins that participate in the same complex, proving both a possible use of semantic similarity protein complexes prediction and a suggestion in the measure.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"119 5 1","pages":"495-502"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88751159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comparative study of text classification approaches for personalized retrieval in PubMed PubMed中个性化检索文本分类方法的比较研究
Sachintha Pitigala, Cen Li, S. Seo
Retrieval of the information relevant to one's need from PubMed is becoming increasingly challenging due to its large volume and rapid growth. The traditional information search techniques based on keyword matching are insufficient for large databases such as PubMed. A personalized article retrieval system that is tailored to individual researcher's specific interests and selects only highly relevant articles can be a helpful tool in the field of Bioinformatics. The text classification methods developed in the text mining community have shown good results in differentiating relevant articles from the irrelevant ones. This study compares two text classification methods, Naïve Bayes and Support Vector Machines, in order to study the effectiveness of the two methods on classifying full text articles in the case when only a small set of training data is available. The comparison results show that the Naïve Bayes method is a better choice than Support Vector Machines in building a personalized article retrieval system which can learn (train) from a small set of full text articles.
由于PubMed的庞大容量和快速增长,从PubMed中检索与个人需求相关的信息变得越来越具有挑战性。传统的基于关键词匹配的信息搜索技术对于PubMed这样的大型数据库来说是不够的。一个个性化的文章检索系统,是量身定制的个人研究人员的具体兴趣,只选择高度相关的文章,可以是一个有用的工具,在生物信息学领域。文本挖掘社区开发的文本分类方法在区分相关文章和不相关文章方面取得了良好的效果。本研究比较了Naïve贝叶斯和支持向量机两种文本分类方法,以研究两种方法在训练数据较少的情况下对全文文章进行分类的有效性。对比结果表明,Naïve贝叶斯方法比支持向量机更适合于构建一个能够从少量全文文章中学习(训练)的个性化文章检索系统。
{"title":"A comparative study of text classification approaches for personalized retrieval in PubMed","authors":"Sachintha Pitigala, Cen Li, S. Seo","doi":"10.1109/BIBMW.2011.6112503","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112503","url":null,"abstract":"Retrieval of the information relevant to one's need from PubMed is becoming increasingly challenging due to its large volume and rapid growth. The traditional information search techniques based on keyword matching are insufficient for large databases such as PubMed. A personalized article retrieval system that is tailored to individual researcher's specific interests and selects only highly relevant articles can be a helpful tool in the field of Bioinformatics. The text classification methods developed in the text mining community have shown good results in differentiating relevant articles from the irrelevant ones. This study compares two text classification methods, Naïve Bayes and Support Vector Machines, in order to study the effectiveness of the two methods on classifying full text articles in the case when only a small set of training data is available. The comparison results show that the Naïve Bayes method is a better choice than Support Vector Machines in building a personalized article retrieval system which can learn (train) from a small set of full text articles.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"67 1","pages":"919-921"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89067841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Additive and multiplicative genome-wide association models identify genes associated with growth 加性和倍增性全基因组关联模型鉴定与生长相关的基因
Cynthia Zavala, N. Serao, M. Villamil, G. Caetano-Anollés, S. Rodriguez-Zas
Standard genome-wide association studies evaluate the association between single nucleotide polymorphisms (SNPs or Genotype G) and phenotype (e.g. growth) conditional on non-SNP covariates including environmental factors (E, e.g. diet) or population stratification, on an additive fashion. For traits known to be the result of genotype-by-environment interactions (G×E), like growth, a multiplicative model could potentially uncover additional SNPs that influence growth on a context-dependent (e.g. diet or breed) fashion. The objective of this study was to assess and compare the performance of context-independent (additive, G+E) and context-dependent (multiplicative, G+E+G×E) models to identify polymorphisms and corresponding genes associated with growth that are context-independent and context-dependent. In addition to single-SNP analysis, a multi-SNP haplotype-based analysis that can increase the precision of the estimates was evaluated for the additive model.
标准全基因组关联研究评估单核苷酸多态性(snp或基因型G)和表型(如生长)之间的关联,条件是非snp协变量,包括环境因素(E,如饮食)或群体分层,以加性方式。对于已知是基因型与环境相互作用的结果的性状(G×E),如生长,乘法模型可能会发现影响生长的其他snp,这些snp依赖于环境(例如饮食或品种)。本研究的目的是评估和比较环境无关(加性,G+E)和环境依赖(乘法,G+E+G×E)模型的性能,以确定与生长相关的环境无关和环境依赖的多态性和相应基因。除了单snp分析外,还对基于多snp单倍型的分析进行了评估,该分析可以提高加性模型的估计精度。
{"title":"Additive and multiplicative genome-wide association models identify genes associated with growth","authors":"Cynthia Zavala, N. Serao, M. Villamil, G. Caetano-Anollés, S. Rodriguez-Zas","doi":"10.1109/BIBMW.2011.6112527","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112527","url":null,"abstract":"Standard genome-wide association studies evaluate the association between single nucleotide polymorphisms (SNPs or Genotype G) and phenotype (e.g. growth) conditional on non-SNP covariates including environmental factors (E, e.g. diet) or population stratification, on an additive fashion. For traits known to be the result of genotype-by-environment interactions (G×E), like growth, a multiplicative model could potentially uncover additional SNPs that influence growth on a context-dependent (e.g. diet or breed) fashion. The objective of this study was to assess and compare the performance of context-independent (additive, G+E) and context-dependent (multiplicative, G+E+G×E) models to identify polymorphisms and corresponding genes associated with growth that are context-independent and context-dependent. In addition to single-SNP analysis, a multi-SNP haplotype-based analysis that can increase the precision of the estimates was evaluated for the additive model.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"24 1","pages":"975-977"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85978997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved RNA-Seq Partitions in Linear Models for Isoform Quantification 改进的RNA-Seq划分线性模型用于异构体定量
Brian E. Howard, P. Veronese, S. Heber
Here, we present an extension of our is form quantification method that accommodates paired end RNA Sequencing data. We explore several alternate methods of partitioning read count data in order to better exploit the available fragment size distribution, and to reduce the variance in the resulting estimates. In many cases, this significantly improves the accuracy of our approach.
在这里,我们提出了一种扩展我们的形式量化方法,以适应成对末端RNA测序数据。我们探索了几种分区读计数数据的替代方法,以便更好地利用可用的片段大小分布,并减少结果估计的方差。在许多情况下,这大大提高了我们的方法的准确性。
{"title":"Improved RNA-Seq Partitions in Linear Models for Isoform Quantification","authors":"Brian E. Howard, P. Veronese, S. Heber","doi":"10.1109/BIBM.2011.102","DOIUrl":"https://doi.org/10.1109/BIBM.2011.102","url":null,"abstract":"Here, we present an extension of our is form quantification method that accommodates paired end RNA Sequencing data. We explore several alternate methods of partitioning read count data in order to better exploit the available fragment size distribution, and to reduce the variance in the resulting estimates. In many cases, this significantly improves the accuracy of our approach.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"24 1","pages":"151-154"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85037371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Artificial Neural Network Based Approach for Identification of Native Protein Structures Using an Extended Forcefield 基于人工神经网络的扩展力场蛋白质结构识别方法
T. M. Fawcett, S. Irausquin, Mikhail Simin, H. Valafar
Current protein force fields like the ones seen in CHARMM or Xplor-NIH have many terms that include bonded and non-bonded terms. Yet the force fields do not take into account the use of hydrogen bonds which are important for secondary structure creation and stabilization of proteins. SCOPE is an open-source program that generates proteins from rotamer space. It then creates a force field that uses only non-bonded and hydrogen bond energy terms to create a profile for a given protein. The profiles can then be used in an artificial neural network to create a linear model which is funneled to the true protein conformation.
目前在CHARMM或explore - nih中看到的蛋白质力场有许多术语,包括键合和非键合术语。然而,这些力场并没有考虑到氢键的使用,而氢键对于蛋白质的二级结构的形成和稳定是很重要的。SCOPE是一个从旋转体空间生成蛋白质的开源程序。然后,它创建一个力场,仅使用非键和氢键能量项来创建给定蛋白质的轮廓。然后,这些轮廓可以用于人工神经网络,以创建一个线性模型,该模型汇集到真实的蛋白质构象中。
{"title":"An Artificial Neural Network Based Approach for Identification of Native Protein Structures Using an Extended Forcefield","authors":"T. M. Fawcett, S. Irausquin, Mikhail Simin, H. Valafar","doi":"10.1109/BIBM.2011.53","DOIUrl":"https://doi.org/10.1109/BIBM.2011.53","url":null,"abstract":"Current protein force fields like the ones seen in CHARMM or Xplor-NIH have many terms that include bonded and non-bonded terms. Yet the force fields do not take into account the use of hydrogen bonds which are important for secondary structure creation and stabilization of proteins. SCOPE is an open-source program that generates proteins from rotamer space. It then creates a force field that uses only non-bonded and hydrogen bond energy terms to create a profile for a given protein. The profiles can then be used in an artificial neural network to create a linear model which is funneled to the true protein conformation.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"15 1","pages":"500-505"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89766963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Multiobjective optizition shuffled frog-leaping biclustering 多目标优化洗牌蛙跃双聚类
Junwan Liu, Xiaohua Hu, Zhoujun Li, Yiming Chen
Biclustering of DNA microarray data that can mine significant patterns to help in understanding gene regulation and interactions. This is a classical multi-objective optimization problem (MOP). Recently, many researchers have developed stochastic search methods that mimic the efficient behavior of species such as ants, bees, birds and frogs, as a means to seek faster and more robust solutions to complex optimization problems. The particle swarm optimization(PSO) is a heuristics-based optimization approach simulating the movements of a bird flock finding food. The shuffled frog leaping algorithm (SFLA) is a population-based cooperative search metaphor combining the benefits of the local search of PSO and the global shuffled of information of the complex evolution technique. This paper introduces SFL algorithm to solve biclustering of microarray data, and proposes a novel multi-objective shuffled frog leaping biclustering(MOSFLB) algorithm to mine coherent patterns from microarray data. Experimental results on two real datasets show that our approach can effectively find significant biclusters of high quality.
DNA微阵列数据的双聚类,可以挖掘重要的模式,以帮助理解基因调控和相互作用。这是一个经典的多目标优化问题。近年来,许多研究人员开发了模拟蚂蚁、蜜蜂、鸟类和青蛙等物种的有效行为的随机搜索方法,作为一种寻求复杂优化问题更快、更鲁棒解的手段。粒子群优化算法(PSO)是一种基于启发式算法的模拟鸟群觅食运动的优化方法。shuffle frog leapalgorithm (SFLA)是一种基于种群的协同搜索算法,结合了粒子群算法的局部搜索和复杂进化技术的全局信息重组的优点。引入SFL算法解决微阵列数据的双聚类问题,提出了一种新的多目标shuffle frog跳跃双聚类(MOSFLB)算法从微阵列数据中挖掘相干模式。在两个真实数据集上的实验结果表明,我们的方法可以有效地发现高质量的显著双聚类。
{"title":"Multiobjective optizition shuffled frog-leaping biclustering","authors":"Junwan Liu, Xiaohua Hu, Zhoujun Li, Yiming Chen","doi":"10.1109/BIBMW.2011.6112368","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112368","url":null,"abstract":"Biclustering of DNA microarray data that can mine significant patterns to help in understanding gene regulation and interactions. This is a classical multi-objective optimization problem (MOP). Recently, many researchers have developed stochastic search methods that mimic the efficient behavior of species such as ants, bees, birds and frogs, as a means to seek faster and more robust solutions to complex optimization problems. The particle swarm optimization(PSO) is a heuristics-based optimization approach simulating the movements of a bird flock finding food. The shuffled frog leaping algorithm (SFLA) is a population-based cooperative search metaphor combining the benefits of the local search of PSO and the global shuffled of information of the complex evolution technique. This paper introduces SFL algorithm to solve biclustering of microarray data, and proposes a novel multi-objective shuffled frog leaping biclustering(MOSFLB) algorithm to mine coherent patterns from microarray data. Experimental results on two real datasets show that our approach can effectively find significant biclusters of high quality.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"46 1","pages":"151-156"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84430425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
The Traditional Chinese Medicine syndromes of psoriasis in Chinese patients: Contribution of demographic and clinical variables 中国银屑病患者的中医证候:人口统计学和临床变量的贡献
Zehui He, Chuanjian Lu, A. Ou
This article was to examine the specific contribution of demographic, medical and psychological variables to the Traditional Chinese Medicine (TCM) Syndromes of psoriasis. A cross-sectional survey of psoriasis patients was conducted at 7 hospitals of TCM of different regions. In all, 671 psoriasis patients underwent a clinical assessment including differentiation of TCM syndromes and psoriasis severity (assessed by Psoriasis Area and Severity Index, PASI). Patients also completed questions on demographic data and a quality of life scale (Dermatology Life Quality Index, DLQI). The three main TCM syndromes were included: 354 patients with Wind-heat (52.8%), 161 with Blood-stasis (24.0%), and 156 with Blood-dryness (23.2%). They distributed differently in subgroups of patients with different gender, age, chronic disease, duration of psoriasis, PASI, and DLQI score. The TCM syndromes were close related to demographic and clinical conditions of patients. TCM clinical treatment should consider both characteristics of syndrome and demographic variables of psoriasis.
本文旨在探讨人口统计学、医学和心理变量对银屑病中医证候的具体贡献。对不同地区7家中医院的银屑病患者进行了横断面调查。对671例银屑病患者进行临床评估,包括中医证候辨证和银屑病严重程度(用银屑病面积和严重程度指数PASI评估)。患者还完成了人口统计数据和生活质量量表(皮肤病生活质量指数,DLQI)的问题。3个主要中医证型包括:风热型354例(52.8%),血瘀型161例(24.0%),血燥型156例(23.2%)。在不同性别、年龄、慢性疾病、银屑病病程、PASI和DLQI评分的患者亚组中分布不同。中医证候与患者的人口学特征和临床条件密切相关。中医临床治疗应兼顾银屑病的证候特点和人口学变量。
{"title":"The Traditional Chinese Medicine syndromes of psoriasis in Chinese patients: Contribution of demographic and clinical variables","authors":"Zehui He, Chuanjian Lu, A. Ou","doi":"10.1109/BIBMW.2011.6112468","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112468","url":null,"abstract":"This article was to examine the specific contribution of demographic, medical and psychological variables to the Traditional Chinese Medicine (TCM) Syndromes of psoriasis. A cross-sectional survey of psoriasis patients was conducted at 7 hospitals of TCM of different regions. In all, 671 psoriasis patients underwent a clinical assessment including differentiation of TCM syndromes and psoriasis severity (assessed by Psoriasis Area and Severity Index, PASI). Patients also completed questions on demographic data and a quality of life scale (Dermatology Life Quality Index, DLQI). The three main TCM syndromes were included: 354 patients with Wind-heat (52.8%), 161 with Blood-stasis (24.0%), and 156 with Blood-dryness (23.2%). They distributed differently in subgroups of patients with different gender, age, chronic disease, duration of psoriasis, PASI, and DLQI score. The TCM syndromes were close related to demographic and clinical conditions of patients. TCM clinical treatment should consider both characteristics of syndrome and demographic variables of psoriasis.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"20 1","pages":"765-768"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87906436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Condition-Dependent Dynamical PPI Networks from Conflict-Sensitive Phosphorylation Dynamics 从冲突敏感磷酸化动力学中学习条件依赖的动态PPI网络
Qiong Cheng, M. Ogihara, Vineet K Gupta
An important issue in protein-protein interaction network studies is the identification of interaction dynamics. Two factors contribute to the dynamics. One, not all proteins may be expressed in a given cell, and two, competition may exist among multiple proteins for a particular protein domain. Taking into account these two factors, we propose a novel approach to predict protein-protein interaction network dynamics by learning from conflict-sensitive phosphorylation dynamics. We built a training model from conflict-sensitive phosphorylation dynamics. In this model, each node is not an individual protein but a protein-protein pair and is labeled with terms representing conditions in which the interaction should be observed. We mapped the protein pairs in a vector space, built hyper-edges over the interaction nodes, and developed rank-like SVM with Laplacian regularizers for PPI network dynamics prediction. We also employed the standard F1 measure for evaluating the effectiveness of classification results.
蛋白质-蛋白质相互作用网络研究中的一个重要问题是相互作用动力学的识别。有两个因素促成了这种动态。首先,不是所有的蛋白质都可以在一个给定的细胞中表达,其次,多个蛋白质之间可能存在竞争,以争夺一个特定的蛋白质结构域。考虑到这两个因素,我们提出了一种通过学习冲突敏感磷酸化动力学来预测蛋白质-蛋白质相互作用网络动力学的新方法。我们根据冲突敏感性磷酸化动力学建立了一个训练模型。在这个模型中,每个节点不是一个单独的蛋白质,而是一个蛋白质-蛋白质对,并用表示相互作用应该被观察到的条件的术语来标记。我们将蛋白质对映射到向量空间中,在交互节点上构建超边缘,并使用拉普拉斯正则化器开发了类秩支持向量机,用于PPI网络动态预测。我们还采用标准的F1测度来评价分类结果的有效性。
{"title":"Learning Condition-Dependent Dynamical PPI Networks from Conflict-Sensitive Phosphorylation Dynamics","authors":"Qiong Cheng, M. Ogihara, Vineet K Gupta","doi":"10.1109/BIBM.2011.127","DOIUrl":"https://doi.org/10.1109/BIBM.2011.127","url":null,"abstract":"An important issue in protein-protein interaction network studies is the identification of interaction dynamics. Two factors contribute to the dynamics. One, not all proteins may be expressed in a given cell, and two, competition may exist among multiple proteins for a particular protein domain. Taking into account these two factors, we propose a novel approach to predict protein-protein interaction network dynamics by learning from conflict-sensitive phosphorylation dynamics. We built a training model from conflict-sensitive phosphorylation dynamics. In this model, each node is not an individual protein but a protein-protein pair and is labeled with terms representing conditions in which the interaction should be observed. We mapped the protein pairs in a vector space, built hyper-edges over the interaction nodes, and developed rank-like SVM with Laplacian regularizers for PPI network dynamics prediction. We also employed the standard F1 measure for evaluating the effectiveness of classification results.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"48 1","pages":"309-312"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87013565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1