首页 > 最新文献

Proceedings. IEEE Computational Systems Bioinformatics Conference最新文献

英文 中文
Bioinformatic insights from metagenomics through visualization. 通过可视化从宏基因组学获得生物信息学见解。
Pub Date : 2005-01-01 DOI: 10.1109/csb.2005.19
Susan L Havre, Bobbie-Jo Webb-Robertson, Anuj Shah, Christian Posse, Banu Gopalan, Fred J Brockman

Cutting-edge biological and bioinformatics research seeks a systems perspective through the analysis of multiple types of high-throughput and other experimental data for the same sample. Systems-level analysis requires the integration and fusion of such data, typically through advanced statistics and mathematics. Visualization is a complementary computational approach that supports integration and analysis of complex data or its derivatives. We present a bioinformatics visualization prototype, Juxter, which depicts categorical information derived from or assigned to these diverse data for the purpose of comparing patterns across categorizations. The visualization allows users to easily discern correlated and anomalous patterns in the data. These patterns, which might not be detected automatically by algorithms, may reveal valuable information leading to insight and discovery. We describe the visualization and interaction capabilities and demonstrate its utility in a new field, metagenomics, which combines molecular biology and genetics to identify and characterize genetic material from multi-species microbial samples.

尖端的生物学和生物信息学研究通过对同一样本的多种类型的高通量和其他实验数据的分析,寻求一个系统的视角。系统级分析通常需要通过高级统计和数学对这些数据进行整合和融合。可视化是一种辅助的计算方法,支持对复杂数据或其衍生物进行集成和分析。我们提出了一个生物信息学可视化原型,并特,它描述了从这些不同的数据衍生或分配的分类信息,以比较不同分类的模式。可视化使用户可以轻松地识别数据中的相关模式和异常模式。这些模式可能不会被算法自动检测到,但可能会揭示有价值的信息,从而导致洞察力和发现。我们描述了可视化和交互能力,并展示了它在一个新领域的应用,宏基因组学,它结合了分子生物学和遗传学来鉴定和表征多物种微生物样本的遗传物质。
{"title":"Bioinformatic insights from metagenomics through visualization.","authors":"Susan L Havre,&nbsp;Bobbie-Jo Webb-Robertson,&nbsp;Anuj Shah,&nbsp;Christian Posse,&nbsp;Banu Gopalan,&nbsp;Fred J Brockman","doi":"10.1109/csb.2005.19","DOIUrl":"https://doi.org/10.1109/csb.2005.19","url":null,"abstract":"<p><p>Cutting-edge biological and bioinformatics research seeks a systems perspective through the analysis of multiple types of high-throughput and other experimental data for the same sample. Systems-level analysis requires the integration and fusion of such data, typically through advanced statistics and mathematics. Visualization is a complementary computational approach that supports integration and analysis of complex data or its derivatives. We present a bioinformatics visualization prototype, Juxter, which depicts categorical information derived from or assigned to these diverse data for the purpose of comparing patterns across categorizations. The visualization allows users to easily discern correlated and anomalous patterns in the data. These patterns, which might not be detected automatically by algorithms, may reveal valuable information leading to insight and discovery. We describe the visualization and interaction capabilities and demonstrate its utility in a new field, metagenomics, which combines molecular biology and genetics to identify and characterize genetic material from multi-species microbial samples.</p>","PeriodicalId":87417,"journal":{"name":"Proceedings. IEEE Computational Systems Bioinformatics Conference","volume":" ","pages":"341-50"},"PeriodicalIF":0.0,"publicationDate":"2005-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/csb.2005.19","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25830779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Proceedings of 2005 IEEE Computational Systems Bioinformatics Conference. August 8-11, 2005. Stanford, California, USA. 2005 IEEE计算系统生物信息学会议论文集。2005年8月8-11日。斯坦福,加州,美国。
{"title":"Proceedings of 2005 IEEE Computational Systems Bioinformatics Conference. August 8-11, 2005. Stanford, California, USA.","authors":"","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":87417,"journal":{"name":"Proceedings. IEEE Computational Systems Bioinformatics Conference","volume":" ","pages":"3-402"},"PeriodicalIF":0.0,"publicationDate":"2005-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"26697901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tree decomposition based fast search of RNA structures including pseudoknots in genomes. 基于树分解的基因组中包含假结的RNA结构快速搜索。
Pub Date : 2005-01-01 DOI: 10.1109/csb.2005.52
Yinglei Song, Chunmei Liu, Russell Malmberg, Fangfang Pan, Liming Cai

Searching genomes for RNA secondary structure with computational methods has become an important approach to the annotation of non-coding RNAs. However, due to the lack of efficient algorithms for accurate RNA structure-sequence alignment, computer programs capable of fast and effectively searching genomes for RNA secondary structures have not been available. In this paper, a novel RNA structure profiling model is introduced based on the notion of a conformational graph to specify the consensus structure of an RNA family. Tree decomposition yields a small tree width t for such conformation graphs (e.g., t = 2 for stem loops and only a slight increase for pseudo-knots). Within this modelling framework, the optimal alignment of a sequence to the structure model corresponds to finding a maximum valued isomorphic subgraph and consequently can be accomplished through dynamic programming on the tree decomposition of the conformational graph in time O(k(t)N(2)), where k is a small parameter; and N is the size of the projiled RNA structure. Experiments show that the application of the alignment algorithm to search in genomes yields the same search accuracy as methods based on a Covariance model with a significant reduction in computation time. In particular; very accurate searches of tmRNAs in bacteria genomes and of telomerase RNAs in yeast genomes can be accomplished in days, as opposed to months required by other methods. The tree decomposition based searching tool is free upon request and can be downloaded at our site h t t p ://w.uga.edu/RNA-informatics/software/index.php.

利用计算方法在基因组中搜索RNA二级结构已成为非编码RNA标注的重要途径。然而,由于缺乏精确的RNA结构序列比对的有效算法,能够快速有效地搜索基因组RNA二级结构的计算机程序尚未可用。本文基于构象图的概念,提出了一种新的RNA结构分析模型,用于确定RNA家族的一致结构。对于这样的构象图,树分解产生一个小的树宽度t(例如,对于茎环,t = 2,对于伪结,只略微增加)。在该建模框架中,序列与结构模型的最优对齐对应于寻找最大值同构子图,因此可以通过在时间O(k(t)N(2))上对构象图进行树分解的动态规划来完成,其中k是一个小参数;N是被分解RNA结构的大小。实验表明,该算法与基于协方差模型的方法具有相同的搜索精度,且计算时间显著减少。特别是;细菌基因组中的tmrna和酵母基因组中的端粒酶rna的精确搜索可以在几天内完成,而其他方法则需要几个月。基于树分解的搜索工具是免费的,可以在我们的网站下载:https://w.uga.edu/RNA-informatics/software/index.php。
{"title":"Tree decomposition based fast search of RNA structures including pseudoknots in genomes.","authors":"Yinglei Song,&nbsp;Chunmei Liu,&nbsp;Russell Malmberg,&nbsp;Fangfang Pan,&nbsp;Liming Cai","doi":"10.1109/csb.2005.52","DOIUrl":"https://doi.org/10.1109/csb.2005.52","url":null,"abstract":"<p><p>Searching genomes for RNA secondary structure with computational methods has become an important approach to the annotation of non-coding RNAs. However, due to the lack of efficient algorithms for accurate RNA structure-sequence alignment, computer programs capable of fast and effectively searching genomes for RNA secondary structures have not been available. In this paper, a novel RNA structure profiling model is introduced based on the notion of a conformational graph to specify the consensus structure of an RNA family. Tree decomposition yields a small tree width t for such conformation graphs (e.g., t = 2 for stem loops and only a slight increase for pseudo-knots). Within this modelling framework, the optimal alignment of a sequence to the structure model corresponds to finding a maximum valued isomorphic subgraph and consequently can be accomplished through dynamic programming on the tree decomposition of the conformational graph in time O(k(t)N(2)), where k is a small parameter; and N is the size of the projiled RNA structure. Experiments show that the application of the alignment algorithm to search in genomes yields the same search accuracy as methods based on a Covariance model with a significant reduction in computation time. In particular; very accurate searches of tmRNAs in bacteria genomes and of telomerase RNAs in yeast genomes can be accomplished in days, as opposed to months required by other methods. The tree decomposition based searching tool is free upon request and can be downloaded at our site h t t p ://w.uga.edu/RNA-informatics/software/index.php.</p>","PeriodicalId":87417,"journal":{"name":"Proceedings. IEEE Computational Systems Bioinformatics Conference","volume":" ","pages":"223-34"},"PeriodicalIF":0.0,"publicationDate":"2005-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/csb.2005.52","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25829319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 46
Clustering genes using gene expression and text literature data. 利用基因表达和文本文献数据聚类基因。
Pub Date : 2005-01-01 DOI: 10.1109/csb.2005.23
Chengyong Yang, Erliang Zeng, Tao Li, Giri Narasimhan

Clustering of gene expression data is a standard technique used to identify closely related genes. In this paper, we develop a new clustering algorithm, MSC (Multi-Source Clustering), to perform exploratory analysis using two or more diverse sources of data. In particular, we investigate the problem of improving the clustering by integrating information obtained from gene expression data with knowledge extracted from biomedical text literature. In each iteration of algorithm MSC, an EM-type procedure is employed to bootstrap the model obtained from one data source by starting with the cluster assignments obtained in the previous iteration using the other data sources. Upon convergence, the two individual models are used to construct the final cluster assignment. We compare the results of algorithm MSC for two data sources with the results obtained when the clustering is applied on the two sources of data separately. We also compare it with that obtained using the feature level integration method that performs the clustering after simply concatenating the features obtained from the two data sources. We show that the z-scores of the clustering results from MSC are better than that from the other methods. To evaluate our clusters better, function enrichment results are presented using terms from the Gene Ontology database. Finally, by investigating the success of motif detection programs that use the clusters, we show that our approach integrating gene expression data and text data reveals clusters that are biologically more meaningful than those identified using gene expression data alone.

基因表达数据聚类是一种用于鉴定密切相关基因的标准技术。在本文中,我们开发了一种新的聚类算法,MSC(多源聚类),使用两个或多个不同的数据源进行探索性分析。特别地,我们研究了通过整合从基因表达数据中获得的信息和从生物医学文本文献中提取的知识来提高聚类的问题。在MSC算法的每次迭代中,采用em型过程从前一次迭代中使用其他数据源获得的聚类分配开始,从一个数据源获得模型。收敛后,使用两个单独的模型来构造最终的聚类分配。将MSC算法对两个数据源的聚类结果与分别对两个数据源进行聚类的结果进行了比较。我们还将其与使用特征级集成方法获得的结果进行了比较,该方法在简单地将两个数据源获得的特征连接起来后进行聚类。我们发现,MSC聚类结果的z分数优于其他方法。为了更好地评估我们的聚类,功能富集结果使用基因本体数据库中的术语来呈现。最后,通过研究使用聚类的基序检测程序的成功,我们表明,我们的方法整合基因表达数据和文本数据揭示的聚类比单独使用基因表达数据识别的聚类在生物学上更有意义。
{"title":"Clustering genes using gene expression and text literature data.","authors":"Chengyong Yang,&nbsp;Erliang Zeng,&nbsp;Tao Li,&nbsp;Giri Narasimhan","doi":"10.1109/csb.2005.23","DOIUrl":"https://doi.org/10.1109/csb.2005.23","url":null,"abstract":"<p><p>Clustering of gene expression data is a standard technique used to identify closely related genes. In this paper, we develop a new clustering algorithm, MSC (Multi-Source Clustering), to perform exploratory analysis using two or more diverse sources of data. In particular, we investigate the problem of improving the clustering by integrating information obtained from gene expression data with knowledge extracted from biomedical text literature. In each iteration of algorithm MSC, an EM-type procedure is employed to bootstrap the model obtained from one data source by starting with the cluster assignments obtained in the previous iteration using the other data sources. Upon convergence, the two individual models are used to construct the final cluster assignment. We compare the results of algorithm MSC for two data sources with the results obtained when the clustering is applied on the two sources of data separately. We also compare it with that obtained using the feature level integration method that performs the clustering after simply concatenating the features obtained from the two data sources. We show that the z-scores of the clustering results from MSC are better than that from the other methods. To evaluate our clusters better, function enrichment results are presented using terms from the Gene Ontology database. Finally, by investigating the success of motif detection programs that use the clusters, we show that our approach integrating gene expression data and text data reveals clusters that are biologically more meaningful than those identified using gene expression data alone.</p>","PeriodicalId":87417,"journal":{"name":"Proceedings. IEEE Computational Systems Bioinformatics Conference","volume":" ","pages":"329-40"},"PeriodicalIF":0.0,"publicationDate":"2005-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/csb.2005.23","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25830778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Automated validation of polymerase chain reactions using amplicon melting curves. 利用扩增子熔化曲线自动验证聚合酶链反应。
Pub Date : 2005-01-01 DOI: 10.1109/csb.2005.17
Tobias P Mann, Richard Humbert, John A Stamatoyannopolous, William Stafford Noble

PCR, the polymerase chain reaction, is a fundamental tool of molecular biology. Quantitative PCR is the gold-standard methodology for determination of DNA copy numbers, quantitating transcription, and numerous other applications. A major barrier to large-scale application of PCR for quantitative genomic analyses is the current requirement for manual validation of individual PCR reactions to ensure generation of a single product. This typically requires visual inspection either of gel electrophoreses or temperature dissociation ("melting") curves of individual PCR reactions - a time-consuming and costly process. Here we describe a robust computational solution to this fundamental problem. Using a training set of 10,080 reactions comprising multiple quantitative PCR reactions from each of 1,728 unique human genomic amplicons, we developed a support vector machine classifier capable of discriminating single-product PCR reactions with better than 99% accuracy. This approach has broad utility, and eliminates a major bottleneck to widespread application of PCR for high-throughput genomic applications.

聚合酶链反应(PCR)是分子生物学的基本工具。定量PCR是测定DNA拷贝数,定量转录和许多其他应用的金标准方法。大规模应用PCR进行定量基因组分析的一个主要障碍是目前需要手动验证单个PCR反应,以确保产生单一产物。这通常需要对单个PCR反应的凝胶电泳或温度解离(“融化”)曲线进行目视检查,这是一个耗时且昂贵的过程。在这里,我们描述了这个基本问题的一个健壮的计算解决方案。使用包含10080个反应的训练集,包括来自1728个独特的人类基因组扩增子的多个定量PCR反应,我们开发了一个支持向量机分类器,能够区分单产物PCR反应,准确率超过99%。这种方法具有广泛的实用性,消除了PCR在高通量基因组应用中广泛应用的主要瓶颈。
{"title":"Automated validation of polymerase chain reactions using amplicon melting curves.","authors":"Tobias P Mann,&nbsp;Richard Humbert,&nbsp;John A Stamatoyannopolous,&nbsp;William Stafford Noble","doi":"10.1109/csb.2005.17","DOIUrl":"https://doi.org/10.1109/csb.2005.17","url":null,"abstract":"<p><p>PCR, the polymerase chain reaction, is a fundamental tool of molecular biology. Quantitative PCR is the gold-standard methodology for determination of DNA copy numbers, quantitating transcription, and numerous other applications. A major barrier to large-scale application of PCR for quantitative genomic analyses is the current requirement for manual validation of individual PCR reactions to ensure generation of a single product. This typically requires visual inspection either of gel electrophoreses or temperature dissociation (\"melting\") curves of individual PCR reactions - a time-consuming and costly process. Here we describe a robust computational solution to this fundamental problem. Using a training set of 10,080 reactions comprising multiple quantitative PCR reactions from each of 1,728 unique human genomic amplicons, we developed a support vector machine classifier capable of discriminating single-product PCR reactions with better than 99% accuracy. This approach has broad utility, and eliminates a major bottleneck to widespread application of PCR for high-throughput genomic applications.</p>","PeriodicalId":87417,"journal":{"name":"Proceedings. IEEE Computational Systems Bioinformatics Conference","volume":" ","pages":"377-85"},"PeriodicalIF":0.0,"publicationDate":"2005-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/csb.2005.17","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25830783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Two-Step Approach for Clustering Proteins based on Protein Interaction Profile. 基于蛋白质相互作用谱的两步聚类方法。
Pub Date : 2005-01-01 DOI: 10.1109/BIBE.2005.10
Pengjun Pei, Aidong Zhang

High-throughput methods for detecting protein-protein interactions (PPI) have given researchers an initial global picture of protein interactions on a genomic scale. The huge data sets generated by such experiments pose new challenges in data analysis. Though clustering methods have been successfully applied in many areas in bioinformatics, many clustering algorithms cannot be readily applied on protein interaction data sets. One main problem is that the similarity between two proteins cannot be easily defined. This paper proposes a probabilistic model to define the similarity based on conditional probabilities. We then propose a two-step method for estimating the similarity between two proteins based on protein interaction profile. In the first step, the model is trained with proteins with known annotation. Based on this model, similarities are calculated in the second step. Experiments show that our method improves performance.

检测蛋白质-蛋白质相互作用(PPI)的高通量方法为研究人员提供了基因组尺度上蛋白质相互作用的初步全局图像。这些实验产生的庞大数据集对数据分析提出了新的挑战。虽然聚类方法已经成功地应用于生物信息学的许多领域,但许多聚类算法并不能很容易地应用于蛋白质相互作用数据集。一个主要问题是两种蛋白质之间的相似性不能轻易定义。本文提出了一个基于条件概率的概率模型来定义相似度。然后,我们提出了一种基于蛋白质相互作用谱估计两种蛋白质之间相似性的两步方法。第一步,使用已知标注的蛋白质对模型进行训练。基于该模型,第二步计算相似度。实验表明,该方法提高了性能。
{"title":"A Two-Step Approach for Clustering Proteins based on Protein Interaction Profile.","authors":"Pengjun Pei,&nbsp;Aidong Zhang","doi":"10.1109/BIBE.2005.10","DOIUrl":"https://doi.org/10.1109/BIBE.2005.10","url":null,"abstract":"<p><p>High-throughput methods for detecting protein-protein interactions (PPI) have given researchers an initial global picture of protein interactions on a genomic scale. The huge data sets generated by such experiments pose new challenges in data analysis. Though clustering methods have been successfully applied in many areas in bioinformatics, many clustering algorithms cannot be readily applied on protein interaction data sets. One main problem is that the similarity between two proteins cannot be easily defined. This paper proposes a probabilistic model to define the similarity based on conditional probabilities. We then propose a two-step method for estimating the similarity between two proteins based on protein interaction profile. In the first step, the model is trained with proteins with known annotation. Based on this model, similarities are calculated in the second step. Experiments show that our method improves performance.</p>","PeriodicalId":87417,"journal":{"name":"Proceedings. IEEE Computational Systems Bioinformatics Conference","volume":"2005 1544467","pages":"201-209"},"PeriodicalIF":0.0,"publicationDate":"2005-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/BIBE.2005.10","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"27898789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Deformable modeling for improved calculation of molecular velocities from single-particle tracking. 通过单粒子跟踪改进分子速度计算的可变形模型。
Pub Date : 2005-01-01 DOI: 10.1109/csb.2005.28
Peter M Kasson, Mark M Davis, Axel T Brunger

Single-particle tracking provides a powerful technique for measuring dynamic cellular processes on the level of individual molecules. Much recent work has been devoted to using single particle tracking to measure long-range movement of particles on the cell surface, including methods for automated localization and tracking of particles [1-3]. However, most particle tracking studies to date ignore cell surface curvature and dynamic cellular deformation, factors frequently present in physiologically relevant situations. In this report, we perform quantitative evaluation of single-particle tracking on curved and deforming cell surfaces. We also introduce a new hybrid method that uses non-rigid cellular modeling for improved computation of single-particle tracking trajectories on the surfaces of cells undergoing deformation. This method combines single-molecule and bulk fluorescence measurements in an automated manner to enable more accurate and robust characterization of dynamic cell physiology and regulation.

单粒子跟踪为在单个分子水平上测量动态细胞过程提供了一种强大的技术。最近的许多工作都致力于使用单粒子跟踪来测量细胞表面上粒子的远程运动,包括自动定位和跟踪粒子的方法[1-3]。然而,迄今为止,大多数粒子跟踪研究忽略了细胞表面曲率和动态细胞变形,这些因素经常出现在生理相关的情况下。在本报告中,我们对弯曲和变形的细胞表面上的单粒子跟踪进行了定量评估。我们还介绍了一种新的混合方法,该方法使用非刚性细胞建模来改进变形细胞表面上单粒子跟踪轨迹的计算。该方法结合了单分子和整体荧光测量自动化的方式,使动态细胞生理和调节的更准确和稳健的表征。
{"title":"Deformable modeling for improved calculation of molecular velocities from single-particle tracking.","authors":"Peter M Kasson,&nbsp;Mark M Davis,&nbsp;Axel T Brunger","doi":"10.1109/csb.2005.28","DOIUrl":"https://doi.org/10.1109/csb.2005.28","url":null,"abstract":"<p><p>Single-particle tracking provides a powerful technique for measuring dynamic cellular processes on the level of individual molecules. Much recent work has been devoted to using single particle tracking to measure long-range movement of particles on the cell surface, including methods for automated localization and tracking of particles [1-3]. However, most particle tracking studies to date ignore cell surface curvature and dynamic cellular deformation, factors frequently present in physiologically relevant situations. In this report, we perform quantitative evaluation of single-particle tracking on curved and deforming cell surfaces. We also introduce a new hybrid method that uses non-rigid cellular modeling for improved computation of single-particle tracking trajectories on the surfaces of cells undergoing deformation. This method combines single-molecule and bulk fluorescence measurements in an automated manner to enable more accurate and robust characterization of dynamic cell physiology and regulation.</p>","PeriodicalId":87417,"journal":{"name":"Proceedings. IEEE Computational Systems Bioinformatics Conference","volume":" ","pages":"208-11"},"PeriodicalIF":0.0,"publicationDate":"2005-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/csb.2005.28","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25829947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Reconstructing phylogenetic networks using maximum parsimony. 利用最大简约法重构系统发育网络。
Pub Date : 2005-01-01 DOI: 10.1109/csb.2005.47
Luay Nakhleh, Guohua Jin, Fengmei Zhao, John Mellor-Crummey

Phylogenies - the evolutionary histories of groups of organisms - are one of the most widely used tools throughout the life sciences, as well as objects of research within systematics, evolutionary biology, epidemiology, etc. Almost every tool devised to date to reconstruct phylogenies produces trees; yet it is widely understood and accepted that trees oversimplify the evolutionary histories of many groups of organims, most prominently bacteria (because of horizontal gene transfer) and plants (because of hybrid speciation). Various methods and criteria have been introduced for phylogenetic tree reconstruction. Parsimony is one of the most widely used and studied criteria, and various accurate and efficient heuristics for reconstructing trees based on parsimony have been devised. Jotun Hein suggested a straightforward extension of the parsimony criterion to phylogenetic networks. In this paper we formalize this concept, and provide the first experimental study of the quality of parsimony as a criterion for constructing and evaluating phylogenetic networks. Our results show that, when extended to phylogenetic networks, the parsimony criterion produces promising results. In a great majority of the cases in our experiments, the parsimony criterion accurately predicts the numbers and placements of non-tree events.

系统发生学——生物体群体的进化史——是整个生命科学中使用最广泛的工具之一,也是系统学、进化生物学、流行病学等领域的研究对象。迄今为止,几乎所有用来重建系统发育的工具都能生成树;然而,人们普遍理解和接受的是,树木过度简化了许多生物群体的进化历史,最突出的是细菌(因为水平基因转移)和植物(因为杂交物种形成)。系统发育树重建的方法和标准多种多样。简约性是应用最广泛和研究最广泛的准则之一,人们设计了各种基于简约性的精确高效的树重构启发式方法。Jotun Hein建议将简约标准直接扩展到系统发育网络。在本文中,我们形式化了这一概念,并首次提供了将简约性质量作为构建和评价系统发育网络的标准的实验研究。我们的结果表明,当扩展到系统发育网络时,简约准则产生了有希望的结果。在我们的实验中的绝大多数情况下,简约标准准确地预测了非树事件的数量和位置。
{"title":"Reconstructing phylogenetic networks using maximum parsimony.","authors":"Luay Nakhleh,&nbsp;Guohua Jin,&nbsp;Fengmei Zhao,&nbsp;John Mellor-Crummey","doi":"10.1109/csb.2005.47","DOIUrl":"https://doi.org/10.1109/csb.2005.47","url":null,"abstract":"<p><p>Phylogenies - the evolutionary histories of groups of organisms - are one of the most widely used tools throughout the life sciences, as well as objects of research within systematics, evolutionary biology, epidemiology, etc. Almost every tool devised to date to reconstruct phylogenies produces trees; yet it is widely understood and accepted that trees oversimplify the evolutionary histories of many groups of organims, most prominently bacteria (because of horizontal gene transfer) and plants (because of hybrid speciation). Various methods and criteria have been introduced for phylogenetic tree reconstruction. Parsimony is one of the most widely used and studied criteria, and various accurate and efficient heuristics for reconstructing trees based on parsimony have been devised. Jotun Hein suggested a straightforward extension of the parsimony criterion to phylogenetic networks. In this paper we formalize this concept, and provide the first experimental study of the quality of parsimony as a criterion for constructing and evaluating phylogenetic networks. Our results show that, when extended to phylogenetic networks, the parsimony criterion produces promising results. In a great majority of the cases in our experiments, the parsimony criterion accurately predicts the numbers and placements of non-tree events.</p>","PeriodicalId":87417,"journal":{"name":"Proceedings. IEEE Computational Systems Bioinformatics Conference","volume":" ","pages":"93-102"},"PeriodicalIF":0.0,"publicationDate":"2005-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/csb.2005.47","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25829994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 66
A robust meta-classification strategy for cancer diagnosis from gene expression data. 基于基因表达数据的癌症诊断的稳健元分类策略。
Pub Date : 2005-01-01 DOI: 10.1109/csb.2005.7
Gabriela Alexe, Gyan Bhanot, Babu Venkataraghavan, Ramakrishna Ramaswamy, Jorge Lepre, Arnold J Levine, Gustavo Stolovitzky

One of the major challenges in cancer diagnosis from microarray data is to develop robust classification models which are independent of the analysis techniques used and can combine data from different laboratories. We propose a meta-classification scheme which uses a robust multivariate gene selection procedure and integrates the results of several machine learning tools trained on raw and pattern data. We validate our method by applying it to distinguish diffuse large B-cell lymphoma (DLBCL) from follicular lymphoma (FL) on two independent datasets: the HuGeneFL Affmetrixy dataset of Shipp et al. (www. genome.wi.mit.du/MPR /lymphoma) and the Hu95Av2 Affymetrix dataset (DallaFavera's laboratory, Columbia University). Our meta-classification technique achieves higher predictive accuracies than each of the individual classifiers trained on the same dataset and is robust against various data perturbations. We also find that combinations of p53 responsive genes (e.g., p53, PLK1 and CDK2) are highly predictive of the phenotype.

从微阵列数据进行癌症诊断的主要挑战之一是开发健壮的分类模型,该模型独立于所使用的分析技术,并且可以结合来自不同实验室的数据。我们提出了一种元分类方案,该方案使用稳健的多变量基因选择程序,并集成了几种机器学习工具在原始数据和模式数据上训练的结果。我们通过应用该方法在两个独立的数据集(Shipp等人的HuGeneFL Affmetrixy数据集)上区分弥漫性大b细胞淋巴瘤(DLBCL)和滤泡性淋巴瘤(FL)来验证我们的方法。genome.wi.mit。du/MPR /淋巴瘤)和Hu95Av2 Affymetrix数据集(DallaFavera实验室,哥伦比亚大学)。我们的元分类技术实现了比在同一数据集上训练的每个单独分类器更高的预测精度,并且对各种数据扰动具有鲁棒性。我们还发现p53应答基因(如p53、PLK1和CDK2)的组合可以高度预测表型。
{"title":"A robust meta-classification strategy for cancer diagnosis from gene expression data.","authors":"Gabriela Alexe,&nbsp;Gyan Bhanot,&nbsp;Babu Venkataraghavan,&nbsp;Ramakrishna Ramaswamy,&nbsp;Jorge Lepre,&nbsp;Arnold J Levine,&nbsp;Gustavo Stolovitzky","doi":"10.1109/csb.2005.7","DOIUrl":"https://doi.org/10.1109/csb.2005.7","url":null,"abstract":"<p><p>One of the major challenges in cancer diagnosis from microarray data is to develop robust classification models which are independent of the analysis techniques used and can combine data from different laboratories. We propose a meta-classification scheme which uses a robust multivariate gene selection procedure and integrates the results of several machine learning tools trained on raw and pattern data. We validate our method by applying it to distinguish diffuse large B-cell lymphoma (DLBCL) from follicular lymphoma (FL) on two independent datasets: the HuGeneFL Affmetrixy dataset of Shipp et al. (www. genome.wi.mit.du/MPR /lymphoma) and the Hu95Av2 Affymetrix dataset (DallaFavera's laboratory, Columbia University). Our meta-classification technique achieves higher predictive accuracies than each of the individual classifiers trained on the same dataset and is robust against various data perturbations. We also find that combinations of p53 responsive genes (e.g., p53, PLK1 and CDK2) are highly predictive of the phenotype.</p>","PeriodicalId":87417,"journal":{"name":"Proceedings. IEEE Computational Systems Bioinformatics Conference","volume":" ","pages":"322-5"},"PeriodicalIF":0.0,"publicationDate":"2005-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/csb.2005.7","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25830777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A tree-decomposition approach to protein structure prediction. 蛋白质结构预测的树分解方法。
Pub Date : 2005-01-01 DOI: 10.1109/csb.2005.9
Jinbo Xu, Feng Jiao, Bonnie Berger

This paper proposes a tree decomposition of protein structures, which can be used to efficiently solve two key subproblems of protein structure prediction: protein threading for backbone prediction and protein side-chain prediction. To develop a unified tree-decomposition based approach to these two subproblems, we model them as a geometric neighborhood graph labeling problem. Theoretically, we can have a low-degree polynomial time algorithm to decompose a geometric neighborhood graph G = (V, E) into components with size O(|V|((2/3))log|V|). The computational complexity of the tree-decomposition based graph labeling algorithms is O(|V|Delta(tw+1)) where Delta is the average number of possible labels for each vertex and tw( = O(|V|((2/3))log|V|)) the tree width of G. Empirically, tw is very small and the tree-decomposition method can solve these two problems very efficiently. This paper also compares the computational efficiency of the tree-decomposition approach with the linear programming approach to these two problems and identifies the condition under which the tree-decomposition approach is more efficient than the linear programming approach. Experimental result indicates that the tree-decomposition approach is more efficient most of the time.

本文提出了一种蛋白质结构的树分解方法,该方法可以有效地解决蛋白质结构预测的两个关键子问题:用于主链预测的蛋白质线程和蛋白质侧链预测的蛋白质线程。为了开发一种统一的基于树分解的方法来解决这两个子问题,我们将它们建模为一个几何邻域图标记问题。理论上,我们可以用低次多项式时间算法将几何邻域图G = (V, E)分解为大小为O(|V|(2/3))log|V|)的分量。基于树分解的图标记算法的计算复杂度为O(|V|Delta(tw+1)),其中Delta为每个顶点可能标记的平均数目,tw(= O(|V|(2/3))log|V|))为g的树宽度。经验表明,tw很小,树分解方法可以非常有效地解决这两个问题。本文还比较了树分解方法与线性规划方法对这两个问题的计算效率,并确定了树分解方法比线性规划方法效率更高的条件。实验结果表明,树分解方法在大多数情况下都是有效的。
{"title":"A tree-decomposition approach to protein structure prediction.","authors":"Jinbo Xu,&nbsp;Feng Jiao,&nbsp;Bonnie Berger","doi":"10.1109/csb.2005.9","DOIUrl":"https://doi.org/10.1109/csb.2005.9","url":null,"abstract":"<p><p>This paper proposes a tree decomposition of protein structures, which can be used to efficiently solve two key subproblems of protein structure prediction: protein threading for backbone prediction and protein side-chain prediction. To develop a unified tree-decomposition based approach to these two subproblems, we model them as a geometric neighborhood graph labeling problem. Theoretically, we can have a low-degree polynomial time algorithm to decompose a geometric neighborhood graph G = (V, E) into components with size O(|V|((2/3))log|V|). The computational complexity of the tree-decomposition based graph labeling algorithms is O(|V|Delta(tw+1)) where Delta is the average number of possible labels for each vertex and tw( = O(|V|((2/3))log|V|)) the tree width of G. Empirically, tw is very small and the tree-decomposition method can solve these two problems very efficiently. This paper also compares the computational efficiency of the tree-decomposition approach with the linear programming approach to these two problems and identifies the condition under which the tree-decomposition approach is more efficient than the linear programming approach. Experimental result indicates that the tree-decomposition approach is more efficient most of the time.</p>","PeriodicalId":87417,"journal":{"name":"Proceedings. IEEE Computational Systems Bioinformatics Conference","volume":" ","pages":"247-56"},"PeriodicalIF":0.0,"publicationDate":"2005-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/csb.2005.9","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25829321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 59
期刊
Proceedings. IEEE Computational Systems Bioinformatics Conference
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1