The Fourth Georgia Tech-University of Georgia International Conference in Bioinformatics: in silico Biology, Biological Networks, from Genomics to Epidemiology (November 13-16, 2003, Atlanta, Georgia, USA)

Journal of bioinformatics Pub Date : 2003-01-01 DOI:10.1093/bioinformatics/btg365

{"title":"The Fourth Georgia Tech-University of Georgia International Conference in Bioinformatics: in silico Biology, Biological Networks, from Genomics to Epidemiology (November 13-16, 2003, Atlanta, Georgia, USA)","authors":"","doi":"10.1093/bioinformatics/btg365","DOIUrl":null,"url":null,"abstract":"Three previous bioinformatics meetings organized by Georgia Tech were held in Atlanta in 1997, 1999 and 2001 have attracted outstanding researchers from 14 countries around the globe and established this conference as a major academic forum for intensive and open exchange of new ideas. This year we again have invited many rising junior scientists to deliver plenary lectures on their discoveries made in silico, with many of these accomplishments driven by the explosion of genomic and proteomic data. The focus of the 2003 conference, in comparison with the previous ones, has shifted even further to the most complicated aspects of genomics and proteomics manifested by the impressive advances in creating consistent models of networks of cellular processes (http://opal. biology.gatech.edu/GeneMark/conference/). Other changes include the University of Georgia joining Georgia Tech as a co-organizer of the meeting and a move of the conference site to the brand new Georgia Tech Conference Center & Hotel opened in the Midtown area extension of Georgia Tech campus near the Atlanta Arts Center and the city’s historic area. Several papers presented at the conference are included in this current Special Issue of Bioinformatics, with their publication serving a parallel avenue of presentation for conference participants. Seven papers out of seventeen submitted manuscripts successfully passed peer review and were accepted for publication. The paper by Rocco and Critchlow describes a method for scanning the internet to identify web sites with particular functionalities and developing wrappers for automatic queries of those sites. They tested it on sites that perform BLAST searches and showed reasonable success at automatic classification and query submission. An important aspect of the work is the ability of the system to learn the appropriate queries. This is an important and very challenging problem and this paper offers interesting algorithmic ideas. Qian, Lin, Luscombe, Yu and Gerstein address the important and challenging computational problem of defining relationships between transcription factors (TFs) and their target genes. They apply support vector machines to microarray expression data to generate novel predictions from known interactions. Interestingly, their approach does not use DNA sequence information. Existing databases of experimentally confirmed TF-target pairs only contain positive examples, so the authors are careful to construct a balanced set of both positive and negative training data. The resulting database ofin silico predictions of TF-target pairs is a valuable complement to the information that can be derived from chromatin-immunoprecipitation microarray (‘ChIP on a chip’) experiments. Lukashin, Lukashev and Fuchs attack the problems of inference of topology of gene regulation networks that manifests itself in capricious patterns of gene expression in microarray experiments. They verify that genetic networks have many properties, such as connectivity degree distribution, error and attack tolerance and network redundancy, not unlike those of protein–protein interaction networks established a few years earlier. Krause, von Mering and Bork describe a careful large-scale omputational analysis of yeast protein interactome, aiming at identifying yet undiscovered protein complexes and evaluating reliability of known pieces of information. They define a new similarity measure for grouping protein-interaction data and end up with a minimally redundant collection of protein complexes produced by an unsupervised clustering of raw experimental data. The paper by Han and Ju describes a program for rapidly displaying graphs with thousands of edges and nodes, as is common for protein-interaction networks. The method allows a variety of viewing options, such as focusing on specific subgraphs and collapsing parts of the graph to reduce its size. The algorithm is efficient enough to make it practical on the very large graph problems required for whole proteome analysis. They also provide tools for comparing two or more graphs to highlight their similarities and differences. Joel Bader describes an efficient, greedy algorithm that automatically extracts biologically relevant networks from protein–protein interaction data, starting from selected seed proteins (hence the interesting name of the algorithm, SEEDY). The interaction data are notoriously noisy and Bader attempts to use confidence criteria developed in his previous work to enhance the reliability of the produced networks. There is also an interesting attempt to analyze simultaneously protein–protein and genetic interaction data. Gomez, Noble and Rzhetsky address the formidable problem of predicting protein–protein interactions on genome scale by developing an attraction–repulsion learning model. Simply put, evolutionarily conserved features of interacting protein pairs contribute to attraction, whereas the features of pairs that do not interact contribute to repulsion. The authors show that the model works well with known interactions and that including repulsion (not only attraction)","PeriodicalId":90576,"journal":{"name":"Journal of bioinformatics","volume":"04 1","pages":"1867-1868"},"PeriodicalIF":0.0000,"publicationDate":"2003-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btg365","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Three previous bioinformatics meetings organized by Georgia Tech were held in Atlanta in 1997, 1999 and 2001 have attracted outstanding researchers from 14 countries around the globe and established this conference as a major academic forum for intensive and open exchange of new ideas. This year we again have invited many rising junior scientists to deliver plenary lectures on their discoveries made in silico, with many of these accomplishments driven by the explosion of genomic and proteomic data. The focus of the 2003 conference, in comparison with the previous ones, has shifted even further to the most complicated aspects of genomics and proteomics manifested by the impressive advances in creating consistent models of networks of cellular processes (http://opal. biology.gatech.edu/GeneMark/conference/). Other changes include the University of Georgia joining Georgia Tech as a co-organizer of the meeting and a move of the conference site to the brand new Georgia Tech Conference Center & Hotel opened in the Midtown area extension of Georgia Tech campus near the Atlanta Arts Center and the city’s historic area. Several papers presented at the conference are included in this current Special Issue of Bioinformatics, with their publication serving a parallel avenue of presentation for conference participants. Seven papers out of seventeen submitted manuscripts successfully passed peer review and were accepted for publication. The paper by Rocco and Critchlow describes a method for scanning the internet to identify web sites with particular functionalities and developing wrappers for automatic queries of those sites. They tested it on sites that perform BLAST searches and showed reasonable success at automatic classification and query submission. An important aspect of the work is the ability of the system to learn the appropriate queries. This is an important and very challenging problem and this paper offers interesting algorithmic ideas. Qian, Lin, Luscombe, Yu and Gerstein address the important and challenging computational problem of defining relationships between transcription factors (TFs) and their target genes. They apply support vector machines to microarray expression data to generate novel predictions from known interactions. Interestingly, their approach does not use DNA sequence information. Existing databases of experimentally confirmed TF-target pairs only contain positive examples, so the authors are careful to construct a balanced set of both positive and negative training data. The resulting database ofin silico predictions of TF-target pairs is a valuable complement to the information that can be derived from chromatin-immunoprecipitation microarray (‘ChIP on a chip’) experiments. Lukashin, Lukashev and Fuchs attack the problems of inference of topology of gene regulation networks that manifests itself in capricious patterns of gene expression in microarray experiments. They verify that genetic networks have many properties, such as connectivity degree distribution, error and attack tolerance and network redundancy, not unlike those of protein–protein interaction networks established a few years earlier. Krause, von Mering and Bork describe a careful large-scale omputational analysis of yeast protein interactome, aiming at identifying yet undiscovered protein complexes and evaluating reliability of known pieces of information. They define a new similarity measure for grouping protein-interaction data and end up with a minimally redundant collection of protein complexes produced by an unsupervised clustering of raw experimental data. The paper by Han and Ju describes a program for rapidly displaying graphs with thousands of edges and nodes, as is common for protein-interaction networks. The method allows a variety of viewing options, such as focusing on specific subgraphs and collapsing parts of the graph to reduce its size. The algorithm is efficient enough to make it practical on the very large graph problems required for whole proteome analysis. They also provide tools for comparing two or more graphs to highlight their similarities and differences. Joel Bader describes an efficient, greedy algorithm that automatically extracts biologically relevant networks from protein–protein interaction data, starting from selected seed proteins (hence the interesting name of the algorithm, SEEDY). The interaction data are notoriously noisy and Bader attempts to use confidence criteria developed in his previous work to enhance the reliability of the produced networks. There is also an interesting attempt to analyze simultaneously protein–protein and genetic interaction data. Gomez, Noble and Rzhetsky address the formidable problem of predicting protein–protein interactions on genome scale by developing an attraction–repulsion learning model. Simply put, evolutionarily conserved features of interacting protein pairs contribute to attraction, whereas the features of pairs that do not interact contribute to repulsion. The authors show that the model works well with known interactions and that including repulsion (not only attraction)

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

第四届佐治亚理工大学生物信息学国际会议:从基因组学到流行病学的硅生物学、生物网络(2003年11月13-16日，美国佐治亚州亚特兰大)

佐治亚理工学院于1997年、1999年和2001年在亚特兰大举办了三届生物信息学会议，吸引了来自全球14个国家的杰出研究人员，并将此会议建立为一个集中和开放交流新思想的重要学术论坛。今年，我们再次邀请了许多崭露头角的年轻科学家来做全体讲座，介绍他们在计算机上的发现，其中许多成就是由基因组和蛋白质组学数据的爆炸式增长推动的。与之前的会议相比，2003年会议的焦点已经进一步转移到基因组学和蛋白质组学最复杂的方面，这体现在创建细胞过程网络的一致模型方面取得了令人印象深刻的进展(http://opal)。biology.gatech.edu/GeneMark/conference/)。其他变化包括佐治亚大学加入佐治亚理工学院作为会议的共同组织者，并将会议地点转移到全新的佐治亚理工学院会议中心和酒店，该会议中心和酒店位于亚特兰大艺术中心和城市历史区域附近的佐治亚理工学院校园的中城扩展区。在会议上发表的几篇论文被包括在本期《生物信息学》特刊中，它们的出版为会议参与者提供了一个平行的展示途径。17篇投稿论文中有7篇成功通过同行评议，并被接受发表。Rocco和Critchlow的论文描述了一种方法，用于扫描互联网以识别具有特定功能的网站，并开发用于自动查询这些网站的包装器。他们在执行BLAST搜索的网站上进行了测试，并在自动分类和查询提交方面取得了一定的成功。这项工作的一个重要方面是系统学习适当查询的能力。这是一个非常重要且具有挑战性的问题，本文提供了有趣的算法思想。Qian, Lin, Luscombe, Yu和Gerstein解决了定义转录因子(tf)与其靶基因之间关系的重要且具有挑战性的计算问题。他们将支持向量机应用于微阵列表达数据，从已知的相互作用中产生新的预测。有趣的是，他们的方法不使用DNA序列信息。现有的实验证实的tf -目标对数据库只包含正例，因此作者谨慎地构建一个正负训练数据的平衡集。由此产生的tf靶对的计算机预测数据库是对染色质免疫沉淀微阵列(“芯片上的芯片”)实验所得信息的有价值的补充。Lukashin, Lukashev和Fuchs攻击了基因调控网络拓扑的推理问题，这些问题在微阵列实验中反复无常的基因表达模式中表现出来。他们验证了遗传网络具有许多特性，如连接度分布、容错和攻击以及网络冗余，这与几年前建立的蛋白质-蛋白质相互作用网络没有什么不同。Krause, von Mering和Bork描述了对酵母蛋白相互作用组的仔细的大规模计算分析，旨在识别尚未发现的蛋白质复合物并评估已知信息片段的可靠性。他们为分组蛋白质相互作用数据定义了一种新的相似性度量，并最终通过对原始实验数据的无监督聚类产生最小冗余的蛋白质复合物集合。Han和Ju的论文描述了一个快速显示具有数千条边和节点的图形的程序，这在蛋白质相互作用网络中很常见。该方法允许多种查看选项，例如关注特定子图和折叠图的部分以减小其大小。该算法是有效的，足以使它适用于整个蛋白质组分析所需的非常大的图形问题。它们还提供了比较两个或多个图形的工具，以突出它们的相似点和不同点。Joel Bader描述了一种高效的贪婪算法，该算法从选定的种子蛋白开始，从蛋白质相互作用数据中自动提取生物相关网络(因此该算法有了有趣的名字，SEEDY)。交互数据是出了名的嘈杂，Bader试图使用在他之前的工作中开发的置信度标准来提高生成的网络的可靠性。还有一个有趣的尝试是同时分析蛋白质-蛋白质和基因相互作用的数据。Gomez、Noble和Rzhetsky通过建立一个吸引-排斥学习模型，解决了在基因组尺度上预测蛋白质-蛋白质相互作用的棘手问题。简单地说，相互作用的蛋白质对的进化保守特征有助于吸引，而不相互作用的蛋白质对的特征有助于排斥。作者表明，该模型适用于已知的相互作用，包括排斥(不仅是吸引力)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of bioinformatics

自引率

0.00%

发文量