首页 > 最新文献

2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)最新文献

英文 中文
seGOsa: Software environment for gene ontology-driven similarity assessment 用于基因本体驱动的相似性评估的软件环境
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706624
Huiru Zheng, F. Azuaje, Haiying Wang
In recent years there has been a growing trend towards the adoption of ontologies to support comprehensive, large-scale functional genomics research. This paper introduces seGOsa, a user-friendly cross-platform system to support large-scale assessment of Gene Ontology (GO)-driven similarity among gene products. Using information-theoretic approaches, the system exploits both topological features of the GO (i.e., between-term relationships in the hierarchy) and statistical features of the model organism databases annotated to the GO (i.e., term frequency) to assess functional similarity among gene products. Based on the assumption that the more information two terms share in common, the more similar they are, three GO-driven similarity measures (Resnik's, Lin's and Jiang's metrics) have been implemented to measure between-term similarity within each of the GO hierarchies. Meanwhile, seGOsa offers two approaches (simple and highest average similarity) to assessing the similarity between gene products based on the aggregation of between-term similarities. The program is freely available for non-profit use on request from the authors.
近年来,采用本体论来支持全面、大规模的功能基因组学研究的趋势越来越明显。本文介绍了一个支持大规模评估基因本体(Gene Ontology, GO)驱动的基因产物相似性的用户友好的跨平台系统seGOsa。该系统利用信息论方法,利用GO的拓扑特征(即层次结构中的术语间关系)和注释到GO的模式生物数据库的统计特征(即术语频率)来评估基因产物之间的功能相似性。基于两个术语共享的信息越多,它们就越相似的假设,已经实现了三个GO驱动的相似性度量(Resnik的,Lin的和Jiang的度量)来度量每个GO层次中的术语之间的相似性。同时,seGOsa提供了两种方法(简单相似度和最高平均相似度)来评估基因产物之间基于术语间相似度聚集的相似度。该程序是免费提供给非营利性的要求,从作者。
{"title":"seGOsa: Software environment for gene ontology-driven similarity assessment","authors":"Huiru Zheng, F. Azuaje, Haiying Wang","doi":"10.1109/BIBM.2010.5706624","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706624","url":null,"abstract":"In recent years there has been a growing trend towards the adoption of ontologies to support comprehensive, large-scale functional genomics research. This paper introduces seGOsa, a user-friendly cross-platform system to support large-scale assessment of Gene Ontology (GO)-driven similarity among gene products. Using information-theoretic approaches, the system exploits both topological features of the GO (i.e., between-term relationships in the hierarchy) and statistical features of the model organism databases annotated to the GO (i.e., term frequency) to assess functional similarity among gene products. Based on the assumption that the more information two terms share in common, the more similar they are, three GO-driven similarity measures (Resnik's, Lin's and Jiang's metrics) have been implemented to measure between-term similarity within each of the GO hierarchies. Meanwhile, seGOsa offers two approaches (simple and highest average similarity) to assessing the similarity between gene products based on the aggregation of between-term similarities. The program is freely available for non-profit use on request from the authors.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129794000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Iterative correction of suppressed peptide profiles from FTMS measurements 从FTMS测量抑制肽谱的迭代校正
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706567
Xuepo Ma, T. Hestilow, Jian Cui, Jianqiu Zhang
Analysis of peptide profiles from a Liquid Chromatography Fourier Transform Mass Spectrometry (LC/FTMS) measurement reveals a non-linear distortion in intensity. Investigation of the measured CVi /C12 ratios comparing with theoretical ones shows that the non-linearity can be attributed to low intensity signal suppression of low abundance peptide peaks. We find that the suppression is homogenous for different isotopes of identical peptides but non-homogenous for different peptides. We developed an iterative correction algorithm that corrects the intensity distortions for peptides with relatively high abundance. This algorithm can be applied in a wide range of applications using FTMS.
从液相色谱傅立叶变换质谱(LC/FTMS)测量肽谱分析揭示了强度的非线性失真。实测的CVi /C12比值与理论值比较表明,非线性可归因于低丰度肽峰的低强度信号抑制。我们发现对相同多肽的不同同位素的抑制是均匀的,而对不同多肽的抑制是不均匀的。我们开发了一种迭代校正算法,用于校正相对较高丰度的肽的强度扭曲。该算法可广泛应用于FTMS的应用中。
{"title":"Iterative correction of suppressed peptide profiles from FTMS measurements","authors":"Xuepo Ma, T. Hestilow, Jian Cui, Jianqiu Zhang","doi":"10.1109/BIBM.2010.5706567","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706567","url":null,"abstract":"Analysis of peptide profiles from a Liquid Chromatography Fourier Transform Mass Spectrometry (LC/FTMS) measurement reveals a non-linear distortion in intensity. Investigation of the measured CVi /C12 ratios comparing with theoretical ones shows that the non-linearity can be attributed to low intensity signal suppression of low abundance peptide peaks. We find that the suppression is homogenous for different isotopes of identical peptides but non-homogenous for different peptides. We developed an iterative correction algorithm that corrects the intensity distortions for peptides with relatively high abundance. This algorithm can be applied in a wide range of applications using FTMS.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128348902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MetaPhyler: Taxonomic profiling for metagenomic sequences MetaPhyler:宏基因组序列的分类分析
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706544
Bo Liu, Theodore Gibbons, M. Ghodsi, Mihai Pop
A major goal of metagenomics is to characterize the microbial diversity of an environment. The most popular approach relies on 16S rRNA sequencing, however this approach can generate biased estimates due to differences in the copy number of the 16S rRNA gene between even closely related organisms, and due to PCR artifacts. The taxonomic composition can also be determined from whole-metagenome sequencing data by matching individual sequences against a database of reference genes. One major limitation of prior methods used for this purpose is the use of a universal classification threshold for all genes at all taxonomic levels. We propose that better classification results can be obtained by tuning the taxonomic classifier to each matching length, reference gene, and taxonomic level. We present a novel taxonomic profiler MetaPhyler, which uses marker genes as a taxonomic reference. Results on simulated datasets demonstrate that MetaPhyler outperforms other tools commonly used in this context (CARMA, Megan and PhymmBL). We also present interesting results obtained by applying MetaPhyler to a real metagenomic dataset.
宏基因组学的一个主要目标是描述环境中的微生物多样性。最流行的方法依赖于16S rRNA测序,然而这种方法可能会产生有偏差的估计,因为即使是密切相关的生物体之间16S rRNA基因的拷贝数存在差异,并且由于PCR伪影。分类组成也可以通过将单个序列与参考基因数据库相匹配,从全宏基因组测序数据中确定。用于此目的的先前方法的一个主要限制是在所有分类水平上对所有基因使用通用分类阈值。我们提出,通过调整分类分类器的匹配长度、内参基因和分类水平,可以获得更好的分类结果。我们提出了一个新的分类分析器MetaPhyler,它使用标记基因作为分类参考。在模拟数据集上的结果表明,MetaPhyler优于这种情况下常用的其他工具(CARMA、Megan和PhymmBL)。我们还介绍了通过将MetaPhyler应用于真实的宏基因组数据集获得的有趣结果。
{"title":"MetaPhyler: Taxonomic profiling for metagenomic sequences","authors":"Bo Liu, Theodore Gibbons, M. Ghodsi, Mihai Pop","doi":"10.1109/BIBM.2010.5706544","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706544","url":null,"abstract":"A major goal of metagenomics is to characterize the microbial diversity of an environment. The most popular approach relies on 16S rRNA sequencing, however this approach can generate biased estimates due to differences in the copy number of the 16S rRNA gene between even closely related organisms, and due to PCR artifacts. The taxonomic composition can also be determined from whole-metagenome sequencing data by matching individual sequences against a database of reference genes. One major limitation of prior methods used for this purpose is the use of a universal classification threshold for all genes at all taxonomic levels. We propose that better classification results can be obtained by tuning the taxonomic classifier to each matching length, reference gene, and taxonomic level. We present a novel taxonomic profiler MetaPhyler, which uses marker genes as a taxonomic reference. Results on simulated datasets demonstrate that MetaPhyler outperforms other tools commonly used in this context (CARMA, Megan and PhymmBL). We also present interesting results obtained by applying MetaPhyler to a real metagenomic dataset.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125439969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 68
Structure identification and parameter estimation of biological s-systems 生物s系统的结构辨识与参数估计
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706586
Li-Zhi Liu, Fang-Xiang Wu, Li-Li Han, W. Zhang
Reconstruction of a biological system from its experimental time series data is a challenging task in systems biology. The S-system which consists of a group of nonlinear ordinary differential equations is an effective model to characterize molecular biological systems and analyze the system dynamics. However, inference of S-systems without the knowledge of system structure is not a trivial task due to its nonlinearity and complexity. In this paper, a pruning separable parameter estimation algorithm is proposed for inferring S-systems. This novel algorithm combines the separable parameter estimation method and a pruning strategy, which includes adding an ℓ1 regularization term to the objective function and pruning the solution with a threshold value. The performance of the pruning strategy in the proposed algorithm is evaluated from two aspects: the parameter estimation error and structure identification accuracy. The proposed algorithm is applied to two S-systems with simulated data. The results show that the proposed algorithm has much lower estimation error and much higher identification accuracy than the existing method.
从实验时间序列数据重建生物系统是系统生物学中一项具有挑战性的任务。由一组非线性常微分方程组成的s系统是表征分子生物系统和分析系统动力学的有效模型。然而,由于s系统的非线性和复杂性,在不了解系统结构的情况下进行s系统的推理并不是一件容易的事情。本文提出了一种用于推断s系统的剪枝可分离参数估计算法。该算法将可分离参数估计方法与剪枝策略相结合,在目标函数中加入一个1正则化项,并用一个阈值对解进行剪枝。从参数估计误差和结构识别精度两个方面对所提算法中剪枝策略的性能进行了评价。将该算法应用于两个具有模拟数据的s系统。结果表明,与现有方法相比,该算法具有更小的估计误差和更高的识别精度。
{"title":"Structure identification and parameter estimation of biological s-systems","authors":"Li-Zhi Liu, Fang-Xiang Wu, Li-Li Han, W. Zhang","doi":"10.1109/BIBM.2010.5706586","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706586","url":null,"abstract":"Reconstruction of a biological system from its experimental time series data is a challenging task in systems biology. The S-system which consists of a group of nonlinear ordinary differential equations is an effective model to characterize molecular biological systems and analyze the system dynamics. However, inference of S-systems without the knowledge of system structure is not a trivial task due to its nonlinearity and complexity. In this paper, a pruning separable parameter estimation algorithm is proposed for inferring S-systems. This novel algorithm combines the separable parameter estimation method and a pruning strategy, which includes adding an ℓ1 regularization term to the objective function and pruning the solution with a threshold value. The performance of the pruning strategy in the proposed algorithm is evaluated from two aspects: the parameter estimation error and structure identification accuracy. The proposed algorithm is applied to two S-systems with simulated data. The results show that the proposed algorithm has much lower estimation error and much higher identification accuracy than the existing method.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"157 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126645595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Spatially constrained fuzzy hyper-prototype clustering with application to brain tissue segmentation 空间约束模糊超原型聚类在脑组织分割中的应用
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706598
Jin Liu, T. Pham, W. Wen, P. Sachdev
Motivated by fuzzy clustering incorporating spatial information, we present a spatially constrained fuzzy hyper-prototype clustering algorithm in this paper. This approach uses hyperplanes as cluster centers and adds a spatial regularizer into the fuzzy objective function. Formulation of the new fuzzy objective function is presented; and its iterative numerical solution, which minimizes the objective function, derived. We applied the proposed algorithm for the segmentation of brain MRI data. Experimental results have demonstrated that the proposed clustering method outperforms other fuzzy clustering models.
基于空间信息的模糊聚类,提出了一种空间约束的模糊超原型聚类算法。该方法采用超平面作为聚类中心,并在模糊目标函数中加入空间正则化器。给出了新的模糊目标函数的表达式;并推导出目标函数极小化的迭代数值解。我们将该算法应用于脑MRI数据的分割。实验结果表明,本文提出的聚类方法优于其他模糊聚类模型。
{"title":"Spatially constrained fuzzy hyper-prototype clustering with application to brain tissue segmentation","authors":"Jin Liu, T. Pham, W. Wen, P. Sachdev","doi":"10.1109/BIBM.2010.5706598","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706598","url":null,"abstract":"Motivated by fuzzy clustering incorporating spatial information, we present a spatially constrained fuzzy hyper-prototype clustering algorithm in this paper. This approach uses hyperplanes as cluster centers and adds a spatial regularizer into the fuzzy objective function. Formulation of the new fuzzy objective function is presented; and its iterative numerical solution, which minimizes the objective function, derived. We applied the proposed algorithm for the segmentation of brain MRI data. Experimental results have demonstrated that the proposed clustering method outperforms other fuzzy clustering models.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123170755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Semi-supervised learning protein complexes from protein interaction networks 从蛋白质相互作用网络中半监督学习蛋白质复合物
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706571
Lei Shi, A. Zhang
New technological advances in large-scale proteinprotein interaction (PPI) detection provide researchers a valuable source for elucidating the bimolecular mechanism in the cell. In this paper, we investigate the problem of protein complex detection from noisy protein interaction data, i.e., finding the subsets of proteins that are closely coupled via protein interactions. Many people try to solve the problem by finding dense subgraphs in protein interaction networks with unsupervised methods. Here, we stand from a different point of view, redefining the properties and features for protein complexes and designing a “semi-supervised” method to analyze the problem. First we select some representative topological features and biological features to represent the protein complexes and then utilize the training data to build a multi-layer neural network model and finally detect hidden protein complexes in the protein-protein interaction networks with the obtained model. Experiments show the desirable properties of our proposed algorithm and the effectiveness of our approach.
大规模蛋白相互作用(PPI)检测的新技术进展为研究人员阐明细胞内的双分子机制提供了有价值的来源。在本文中,我们研究了从嘈杂的蛋白质相互作用数据中检测蛋白质复合物的问题,即找到通过蛋白质相互作用紧密耦合的蛋白质子集。许多人试图用无监督的方法在蛋白质相互作用网络中寻找密集子图来解决这个问题。在这里,我们站在不同的角度,重新定义了蛋白质复合物的性质和特征,并设计了一种“半监督”的方法来分析问题。首先选取一些具有代表性的拓扑特征和生物学特征来表示蛋白质复合物,然后利用训练数据构建多层神经网络模型,最后利用得到的模型检测蛋白质-蛋白质相互作用网络中隐藏的蛋白质复合物。实验结果表明,该算法具有良好的性能和有效性。
{"title":"Semi-supervised learning protein complexes from protein interaction networks","authors":"Lei Shi, A. Zhang","doi":"10.1109/BIBM.2010.5706571","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706571","url":null,"abstract":"New technological advances in large-scale proteinprotein interaction (PPI) detection provide researchers a valuable source for elucidating the bimolecular mechanism in the cell. In this paper, we investigate the problem of protein complex detection from noisy protein interaction data, i.e., finding the subsets of proteins that are closely coupled via protein interactions. Many people try to solve the problem by finding dense subgraphs in protein interaction networks with unsupervised methods. Here, we stand from a different point of view, redefining the properties and features for protein complexes and designing a “semi-supervised” method to analyze the problem. First we select some representative topological features and biological features to represent the protein complexes and then utilize the training data to build a multi-layer neural network model and finally detect hidden protein complexes in the protein-protein interaction networks with the obtained model. Experiments show the desirable properties of our proposed algorithm and the effectiveness of our approach.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125152311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Enhanced reference guided assembly 增强型参考导向装配
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706540
F. Cattonaro, A. Policriti, F. Vezzi
Next Generation Sequencing has totally changed genomics: we are able to produce huge amounts of data at an incredible low cost if compared to Sanger sequencing. Despite this some old problems have become even more difficult, denovo assembly being on top of this list. In this paper we propose a novel method that aims at improving de-novo assembly in presence of a closely related reference. The idea is to combine de-novo assembly and reference guided assembly in order to obtain an enhanced assembly.
下一代测序完全改变了基因组学:与桑格测序相比,我们能够以极低的成本产生大量数据。尽管如此,一些老问题已经变得更加困难,在这个列表的顶部组装。在本文中,我们提出了一种新的方法,旨在改善de-novo组装存在密切相关的参考。这个想法是结合de-novo组装和参考指导组装,以获得一个增强的组装。
{"title":"Enhanced reference guided assembly","authors":"F. Cattonaro, A. Policriti, F. Vezzi","doi":"10.1109/BIBM.2010.5706540","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706540","url":null,"abstract":"Next Generation Sequencing has totally changed genomics: we are able to produce huge amounts of data at an incredible low cost if compared to Sanger sequencing. Despite this some old problems have become even more difficult, denovo assembly being on top of this list. In this paper we propose a novel method that aims at improving de-novo assembly in presence of a closely related reference. The idea is to combine de-novo assembly and reference guided assembly in order to obtain an enhanced assembly.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"235 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122884772","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
SoyMetDB: The soybean metabolome database SoyMetDB:大豆代谢组数据库
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706563
T. Joshi, Q. Yao, D. F. Levi, L. Brechenmacher, B. Valliyodan, G. Stacey, H. Nguyen, Dong Xu
SoyMetDB is a metabolomic database for soybean, developed to target the growing needs of the soybean community. The goal is to provide a one-stop web resource for integrating, mining and visualizing soybean metabolomic data, including identification and expression of various metabolites across different experiments and time courses. It incorporates GC-MS and LC-MS based metabolite-profiling data dynamically linked to metabolite information from other public metabolomic databases, including HMDB and Knapsack. SoyMetDB includes Arabidopsis metabolomic data for cross-species comparisons and can retrieve information including the expression patterns of various experiments for complete or partial metabolite name queries. It also incorporates a pathway viewer tool integrating the data from various experimental conditions and presenting them on the pathways to highlight the expressed metabolite, and identifies the most highly represented pathways for multiple metabolite queries. SoyMetDB can be accessed at http://soymetdb.org.
SoyMetDB是一个大豆代谢组学数据库,旨在满足大豆群体日益增长的需求。目标是为整合、挖掘和可视化大豆代谢组学数据提供一站式网络资源,包括不同实验和时间过程中各种代谢物的鉴定和表达。它结合了基于GC-MS和LC-MS的代谢物分析数据,动态链接到其他公共代谢组学数据库(包括HMDB和backpack)的代谢物信息。SoyMetDB包括拟南芥代谢组学数据,用于跨物种比较,并可以检索各种实验的表达模式信息,用于完整或部分代谢物名称查询。它还集成了一个途径查看器工具,集成了来自各种实验条件的数据,并将它们呈现在途径上,以突出显示表达的代谢物,并为多个代谢物查询识别最具代表性的途径。SoyMetDB可以通过http://soymetdb.org访问。
{"title":"SoyMetDB: The soybean metabolome database","authors":"T. Joshi, Q. Yao, D. F. Levi, L. Brechenmacher, B. Valliyodan, G. Stacey, H. Nguyen, Dong Xu","doi":"10.1109/BIBM.2010.5706563","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706563","url":null,"abstract":"SoyMetDB is a metabolomic database for soybean, developed to target the growing needs of the soybean community. The goal is to provide a one-stop web resource for integrating, mining and visualizing soybean metabolomic data, including identification and expression of various metabolites across different experiments and time courses. It incorporates GC-MS and LC-MS based metabolite-profiling data dynamically linked to metabolite information from other public metabolomic databases, including HMDB and Knapsack. SoyMetDB includes Arabidopsis metabolomic data for cross-species comparisons and can retrieve information including the expression patterns of various experiments for complete or partial metabolite name queries. It also incorporates a pathway viewer tool integrating the data from various experimental conditions and presenting them on the pathways to highlight the expressed metabolite, and identifies the most highly represented pathways for multiple metabolite queries. SoyMetDB can be accessed at http://soymetdb.org.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"119 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131292626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Selecting informative genes by Lasso and Dantzig selector for linear classifiers 用Lasso和Dantzig选择器选择线性分类器中的信息基因
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706651
Songfeng Zheng, Weixiang Liu
Automatically selecting a subset of genes with strong discriminative power is a very important step in classification problems based on gene expression data. Lasso and Dantzig selector are known to have automatic variable selection ability in linear regression analysis. This paper employs Lasso and Dantzig selector to select most informative genes for representing the class label as a linear function of gene expression data. The selected genes are further used to fit linear classifiers for cancer classification. On 3 publicly available cancer datasets, the experimental results show that in general, Lasso is more capable than Dantzig selector in selecting informative genes for classification.
在基于基因表达数据的分类问题中,自动选择具有强判别能力的基因子集是非常重要的一步。Lasso和Dantzig选择器在线性回归分析中具有自动变量选择能力。本文采用Lasso和Dantzig选择器选择信息量最大的基因,将类标签表示为基因表达数据的线性函数。选择的基因进一步用于拟合线性分类器进行癌症分类。在3个公开的癌症数据集上,实验结果表明,总的来说,Lasso比Dantzig选择器更能选择信息基因进行分类。
{"title":"Selecting informative genes by Lasso and Dantzig selector for linear classifiers","authors":"Songfeng Zheng, Weixiang Liu","doi":"10.1109/BIBM.2010.5706651","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706651","url":null,"abstract":"Automatically selecting a subset of genes with strong discriminative power is a very important step in classification problems based on gene expression data. Lasso and Dantzig selector are known to have automatic variable selection ability in linear regression analysis. This paper employs Lasso and Dantzig selector to select most informative genes for representing the class label as a linear function of gene expression data. The selected genes are further used to fit linear classifiers for cancer classification. On 3 publicly available cancer datasets, the experimental results show that in general, Lasso is more capable than Dantzig selector in selecting informative genes for classification.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134366733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Protein-protein interaction prediction via Collective Matrix Factorization 基于集合矩阵分解的蛋白质相互作用预测
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706537
Qian Xu, E. Xiang, Qiang Yang
Protein-protein interactions (PPI) play an important role in cellular processes and metabolic processes within a cell. An important task is to determine the existence of interactions among proteins. Unfortunately, existing biological experimental techniques are expensive, time-consuming and labor-intensive. The network structures of many such networks are sparse, incomplete and noisy, containing many false positive and false negatives. Thus, state-of-the-art methods for link prediction in these networks often cannot give satisfactory prediction results, especially when some networks are extremely sparse. Noticing that we typically have more than one PPI network available, we naturally wonder whether it is possible to 'transfer' the linkage knowledge from some existing, relatively dense networks to a sparse network, to improve the prediction performance. Noticing that a network structure can be modeled using a matrix model, in this paper, we introduce the well-known Collective Matrix Factorization (CMF) technique to 'transfer' usable linkage knowledge from relatively dense interaction network to a sparse target network. Our approach is to establish the correspondence between a source and a target network via network similarities. We test this method on two real protein-protein interaction networks, Helicobacter pylori (as a target network) and Human (as a source network). Our experimental results show that our method can achieve higher and more robust performance as compared to some baseline methods.
蛋白质-蛋白质相互作用(PPI)在细胞过程和细胞内代谢过程中起着重要作用。一项重要的任务是确定蛋白质之间是否存在相互作用。不幸的是,现有的生物实验技术是昂贵的,耗时和劳动密集型的。许多此类网络的网络结构是稀疏的、不完整的和有噪声的,包含许多假阳性和假阴性。因此,在这些网络中,最先进的链路预测方法往往不能给出令人满意的预测结果,特别是当一些网络非常稀疏时。注意到我们通常有多个可用的PPI网络,我们很自然地想知道是否有可能将链接知识从一些现有的、相对密集的网络“转移”到一个稀疏的网络,以提高预测性能。注意到网络结构可以使用矩阵模型建模,在本文中,我们引入了著名的集体矩阵分解(CMF)技术,将可用的链接知识从相对密集的交互网络“转移”到稀疏的目标网络。我们的方法是通过网络相似性在源网络和目标网络之间建立对应关系。我们在两个真实的蛋白质-蛋白质相互作用网络上测试了这种方法,幽门螺杆菌(作为目标网络)和人类(作为源网络)。实验结果表明,与一些基线方法相比,该方法具有更高的鲁棒性。
{"title":"Protein-protein interaction prediction via Collective Matrix Factorization","authors":"Qian Xu, E. Xiang, Qiang Yang","doi":"10.1109/BIBM.2010.5706537","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706537","url":null,"abstract":"Protein-protein interactions (PPI) play an important role in cellular processes and metabolic processes within a cell. An important task is to determine the existence of interactions among proteins. Unfortunately, existing biological experimental techniques are expensive, time-consuming and labor-intensive. The network structures of many such networks are sparse, incomplete and noisy, containing many false positive and false negatives. Thus, state-of-the-art methods for link prediction in these networks often cannot give satisfactory prediction results, especially when some networks are extremely sparse. Noticing that we typically have more than one PPI network available, we naturally wonder whether it is possible to 'transfer' the linkage knowledge from some existing, relatively dense networks to a sparse network, to improve the prediction performance. Noticing that a network structure can be modeled using a matrix model, in this paper, we introduce the well-known Collective Matrix Factorization (CMF) technique to 'transfer' usable linkage knowledge from relatively dense interaction network to a sparse target network. Our approach is to establish the correspondence between a source and a target network via network similarities. We test this method on two real protein-protein interaction networks, Helicobacter pylori (as a target network) and Human (as a source network). Our experimental results show that our method can achieve higher and more robust performance as compared to some baseline methods.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117090235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
期刊
2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1