首页 > 最新文献

Computers & chemistry最新文献

英文 中文
Developing Bioinformatics Computer Skills 发展生物信息学计算机技能
Pub Date : 2002-07-01 DOI: 10.1016/S0097-8485(02)00028-1
Jean-Loup Risler
{"title":"Developing Bioinformatics Computer Skills","authors":"Jean-Loup Risler","doi":"10.1016/S0097-8485(02)00028-1","DOIUrl":"10.1016/S0097-8485(02)00028-1","url":null,"abstract":"","PeriodicalId":79331,"journal":{"name":"Computers & chemistry","volume":"26 5","pages":"Pages 553-555"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S0097-8485(02)00028-1","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76433108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Grand metaphors of biology in the genome era 基因组时代生物学的伟大隐喻
Pub Date : 2002-07-01 DOI: 10.1016/S0097-8485(02)00024-4
Andrzej K Konopka
{"title":"Grand metaphors of biology in the genome era","authors":"Andrzej K Konopka","doi":"10.1016/S0097-8485(02)00024-4","DOIUrl":"10.1016/S0097-8485(02)00024-4","url":null,"abstract":"","PeriodicalId":79331,"journal":{"name":"Computers & chemistry","volume":"26 5","pages":"Pages 397-401"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S0097-8485(02)00024-4","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90275132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
Genome compartimentation by a Hybrid Chromosome Model (HχM). Application to Saccharomyces cerevisae subtelomeres 杂交染色体模型的基因组比较(HχM)。酿酒酵母亚端粒的应用
Pub Date : 2002-07-01 DOI: 10.1016/S0097-8485(02)00006-2
Alexandre G. de Brevern , France Loirat , Anne Badel-Chagnon , Cécile André , Pierre Vincens , Serge Hazout

The aim of this paper is to present a new approach, called ‘Hybrid Chromosome Model’ (HχM), which allows both the extraction of regions of similarity between two sequences, and the compartimentation of a set of DNA sequences. The principle of the method consists in compacting a set of sequences (split into fragments of fixed length) into a ‘hybrid chromosome’, which results from the stacking of the whole sequence fragments. We have illustrated our approach on the 32 subtelomeres of Saccharomyces cerevisae. The compartimentation of these chromosome extremities into common regions of similarity has been carried out. The approach HχM is a fast and efficient tool for mapping entire genomes and for extracting ancient duplications within or between genomes.

本文的目的是提出一种新的方法,称为“杂交染色体模型”(HχM),它既可以提取两个序列之间的相似性区域,也可以对一组DNA序列进行比较。该方法的原理是将一组序列(分成固定长度的片段)压缩成一个“杂交染色体”,这是由整个序列片段的堆叠产生的。我们已经在酿酒酵母的32个亚端粒上说明了我们的方法。这些染色体末端的比较成共同的相似区域已经进行。HχM方法是一种快速有效的绘制全基因组和提取基因组内部或基因组之间古代重复序列的工具。
{"title":"Genome compartimentation by a Hybrid Chromosome Model (HχM). Application to Saccharomyces cerevisae subtelomeres","authors":"Alexandre G. de Brevern ,&nbsp;France Loirat ,&nbsp;Anne Badel-Chagnon ,&nbsp;Cécile André ,&nbsp;Pierre Vincens ,&nbsp;Serge Hazout","doi":"10.1016/S0097-8485(02)00006-2","DOIUrl":"10.1016/S0097-8485(02)00006-2","url":null,"abstract":"<div><p>The aim of this paper is to present a new approach, called ‘Hybrid Chromosome Model’ (HχM), which allows both the extraction of regions of similarity between two sequences, and the compartimentation of a set of DNA sequences. The principle of the method consists in compacting a set of sequences (split into fragments of fixed length) into a ‘hybrid chromosome’, which results from the stacking of the whole sequence fragments. We have illustrated our approach on the 32 subtelomeres of <em>Saccharomyces cerevisae</em>. The compartimentation of these chromosome extremities into common regions of similarity has been carried out. The approach HχM is a fast and efficient tool for mapping entire genomes and for extracting ancient duplications within or between genomes.</p></div>","PeriodicalId":79331,"journal":{"name":"Computers & chemistry","volume":"26 5","pages":"Pages 437-445"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S0097-8485(02)00006-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82419633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins 生物信息学:基因和蛋白质分析的实用指南
Pub Date : 2002-07-01 DOI: 10.1016/S0097-8485(02)00027-X
Jean-Loup Risler
{"title":"Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins","authors":"Jean-Loup Risler","doi":"10.1016/S0097-8485(02)00027-X","DOIUrl":"10.1016/S0097-8485(02)00027-X","url":null,"abstract":"","PeriodicalId":79331,"journal":{"name":"Computers & chemistry","volume":"26 5","pages":"Pages 549-551"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S0097-8485(02)00027-X","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84741377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
This Is Biology: The Science of the Living World 这是生物学:生命世界的科学
Pub Date : 2002-07-01 DOI: 10.1016/S0097-8485(02)00025-6
Andrzej K Konopka
{"title":"This Is Biology: The Science of the Living World","authors":"Andrzej K Konopka","doi":"10.1016/S0097-8485(02)00025-6","DOIUrl":"10.1016/S0097-8485(02)00025-6","url":null,"abstract":"","PeriodicalId":79331,"journal":{"name":"Computers & chemistry","volume":"26 5","pages":"Pages 543-545"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S0097-8485(02)00025-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83438036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Applications of recursive segmentation to the analysis of DNA sequences 递归分割在DNA序列分析中的应用
Pub Date : 2002-07-01 DOI: 10.1016/S0097-8485(02)00010-4
Wentian Li , Pedro Bernaola-Galván , Fatameh Haghighi , Ivo Grosse

Recursive segmentation is a procedure that partitions a DNA sequence into domains with a homogeneous composition of the four nucleotides A, C, G and T. This procedure can also be applied to any sequence converted from a DNA sequence, such as to a binary strong(G+C)/weak(A+T) sequence, to a binary sequence indicating the presence or absence of the dinucleotide CpG, or to a sequence indicating both the base and the codon position information. We apply various conversion schemes in order to address the following five DNA sequence analysis problems: isochore mapping, CpG island detection, locating the origin and terminus of replication in bacterial genomes, finding complex repeats in telomere sequences, and delineating coding and noncoding regions. We find that the recursive segmentation procedure can successfully detect isochore borders, CpG islands, and the origin and terminus of replication, but it needs improvement for detecting complex repeats as well as borders between coding and noncoding regions.

递归分割是一种将DNA序列划分为具有四种核苷酸a、C、G和T组成的均匀结构域的过程。该过程也可以应用于从DNA序列转换的任何序列,例如转换为二元强(G+C)/弱(a +T)序列,转换为指示二核苷酸CpG存在或不存在的二元序列,或转换为指示碱基和密码子位置信息的序列。为了解决以下五个DNA序列分析问题,我们应用了各种转换方案:等质粒定位,CpG岛检测,定位细菌基因组复制的起点和终点,发现端粒序列中的复杂重复序列,以及描述编码区和非编码区。我们发现递归分割方法可以成功地检测等差边界、CpG岛和复制的起点和终点,但在检测复杂重复以及编码区和非编码区之间的边界方面需要改进。
{"title":"Applications of recursive segmentation to the analysis of DNA sequences","authors":"Wentian Li ,&nbsp;Pedro Bernaola-Galván ,&nbsp;Fatameh Haghighi ,&nbsp;Ivo Grosse","doi":"10.1016/S0097-8485(02)00010-4","DOIUrl":"10.1016/S0097-8485(02)00010-4","url":null,"abstract":"<div><p>Recursive segmentation is a procedure that partitions a DNA sequence into domains with a homogeneous composition of the four nucleotides A, C, G and T. This procedure can also be applied to any sequence converted from a DNA sequence, such as to a binary strong(G+C)/weak(A+T) sequence, to a binary sequence indicating the presence or absence of the dinucleotide CpG, or to a sequence indicating both the base and the codon position information. We apply various conversion schemes in order to address the following five DNA sequence analysis problems: isochore mapping, CpG island detection, locating the origin and terminus of replication in bacterial genomes, finding complex repeats in telomere sequences, and delineating coding and noncoding regions. We find that the recursive segmentation procedure can successfully detect isochore borders, CpG islands, and the origin and terminus of replication, but it needs improvement for detecting complex repeats as well as borders between coding and noncoding regions.</p></div>","PeriodicalId":79331,"journal":{"name":"Computers & chemistry","volume":"26 5","pages":"Pages 491-510"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S0097-8485(02)00010-4","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79853764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 105
Pairwise sequence alignment using a PROSITE pattern-derived similarity score 使用PROSITE模式衍生的相似性评分的两两序列比对
Pub Date : 2002-07-01 DOI: 10.1016/S0097-8485(02)00005-0
J.-P. Comet , J. Henry

Existing methods for alignments are based on edition costs computed additionally position by position, according to a fixed substitution matrix: a substitution always has the same weight regardless of the position. Nevertheless the biologist favours a similarity according to his knowledge of the structure or the function of the sequences considered. In the particular case of proteins, we present a method consisting in integrating other information, such as patterns of the PROSITE databank, in the classical dynamic programming algorithm. The method consists in making an alignment by dynamic programming taking a decision not only letter by letter as in the Smith & Waterman algorithm but also by giving a reward when aligning patterns.

现有的对齐方法是根据固定的替换矩阵,每个位置额外计算的编辑成本:无论位置如何,替换总是具有相同的权重。然而,生物学家根据他对所考虑的序列的结构或功能的了解倾向于相似性。在蛋白质的特殊情况下,我们提出了一种方法,包括整合其他信息,如PROSITE数据库的模式,在经典的动态规划算法。该方法包括通过动态规划进行对齐,而不是像Smith &沃特曼算法还通过在对齐模式时给予奖励。
{"title":"Pairwise sequence alignment using a PROSITE pattern-derived similarity score","authors":"J.-P. Comet ,&nbsp;J. Henry","doi":"10.1016/S0097-8485(02)00005-0","DOIUrl":"10.1016/S0097-8485(02)00005-0","url":null,"abstract":"<div><p>Existing methods for alignments are based on edition costs computed additionally position by position, according to a fixed substitution matrix: a substitution always has the same weight regardless of the position. Nevertheless the biologist favours a similarity according to his knowledge of the structure or the function of the sequences considered. In the particular case of proteins, we present a method consisting in integrating other information, such as patterns of the PROSITE databank, in the classical dynamic programming algorithm. The method consists in making an alignment by dynamic programming taking a decision not only letter by letter as in the Smith &amp; Waterman algorithm but also by giving a reward when aligning patterns.</p></div>","PeriodicalId":79331,"journal":{"name":"Computers & chemistry","volume":"26 5","pages":"Pages 421-436"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S0097-8485(02)00005-0","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83782343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Modeling comparative mapping using objects and associations 使用对象和关联建模比较映射
Pub Date : 2002-07-01 DOI: 10.1016/S0097-8485(02)00004-9
G Bronner , B Spataro , M Page , C Gautier , F Rechenmann

Spatial information on genome organization is essential for both gene prediction and annotation among species and a better understanding of genomes functioning and evolution. We propose in this article an object-association model to formalize comparative genomic mapping. This model is being implemented in the GeMCore knowledge base, for which some original capabilities are described. GeMCore associated to the GeMME graphical interface for molecular evolution was used to spatially characterize the minor shift phenomenon between human and mouse.

基因组组织的空间信息对于物种间的基因预测和注释以及更好地理解基因组的功能和进化至关重要。我们在这篇文章中提出了一个对象关联模型来形式化比较基因组图谱。该模型正在GeMCore知识库中实现,其中描述了一些原始功能。GeMCore与GeMME分子进化图形界面相关联,用于在空间上表征人和小鼠之间的微小移动现象。
{"title":"Modeling comparative mapping using objects and associations","authors":"G Bronner ,&nbsp;B Spataro ,&nbsp;M Page ,&nbsp;C Gautier ,&nbsp;F Rechenmann","doi":"10.1016/S0097-8485(02)00004-9","DOIUrl":"10.1016/S0097-8485(02)00004-9","url":null,"abstract":"<div><p>Spatial information on genome organization is essential for both gene prediction and annotation among species and a better understanding of genomes functioning and evolution. We propose in this article an object-association model to formalize comparative genomic mapping. This model is being implemented in the GeMCore knowledge base, for which some original capabilities are described. GeMCore associated to the GeMME graphical interface for molecular evolution was used to spatially characterize the minor shift phenomenon between human and mouse.</p></div>","PeriodicalId":79331,"journal":{"name":"Computers & chemistry","volume":"26 5","pages":"Pages 413-420"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S0097-8485(02)00004-9","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78169809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Local weighting schemes for protein multiple sequence alignment 蛋白质多序列比对的局部加权方法
Pub Date : 2002-07-01 DOI: 10.1016/S0097-8485(02)00008-6
Jaap Heringa

This paper describes three weighting schemes for improving the accuracy of progressive multiple sequence alignment methods: (1) global profile pre-processing, to capture for each sequence information about other sequences in a profile before the actual multiple alignment takes place; (2) local pre-processing; which incorporates a new protocol to only use non-overlapping local sequence regions to construct the pre-processed profiles; and (3) local–global alignment, a weighting scheme based on the double dynamic programming (DDP) technique to softly bias global alignment to local sequence motifs. The first two schemes allow the compilation of residue-specific multiple alignment reliability indices, which can be used in an iterative fashion. The schemes have been implemented with associated iterative modes in the PRALINE multiple sequence alignment method, and have been evaluated using the BAliBASE benchmark alignment database. These tests indicate that PRALINE is a toolbox able to build alignments with very high quality. We found that local profile pre-processing raises the alignment quality by 5.5% compared to PRALINE alignments generated under default conditions. Iteration enhances the quality by a further percentage point. The implications of multiple alignment scoring functions and iteration in relation to alignment quality and benchmarking are discussed.

为了提高渐进式多序列比对方法的精度,本文提出了三种加权方案:(1)全局剖面预处理,在实际进行多序列比对之前,为每个序列捕获剖面中其他序列的信息;(2)局部预处理;该算法引入了一种新的协议,只使用不重叠的局部序列区域来构建预处理后的轮廓;(3)局部-全局对齐,这是一种基于双动态规划(DDP)技术的加权方案,可使全局对齐对局部序列基元进行软偏置。前两种方案允许编制残差特定的多重对准可靠性指标,这些指标可以以迭代方式使用。在PRALINE多序列比对方法中,采用相关迭代模式对方案进行了实现,并利用BAliBASE基准比对数据库对方案进行了评价。这些测试表明PRALINE是一个工具箱,能够以非常高的质量构建校准。我们发现,与默认条件下生成的PRALINE对齐相比,本地配置文件预处理将对齐质量提高了5.5%。迭代将质量进一步提高了一个百分点。讨论了多对齐评分函数和迭代对对齐质量和基准的影响。
{"title":"Local weighting schemes for protein multiple sequence alignment","authors":"Jaap Heringa","doi":"10.1016/S0097-8485(02)00008-6","DOIUrl":"10.1016/S0097-8485(02)00008-6","url":null,"abstract":"<div><p>This paper describes three weighting schemes for improving the accuracy of progressive multiple sequence alignment methods: (1) global profile pre-processing, to capture for each sequence information about other sequences in a profile before the actual multiple alignment takes place; (2) local pre-processing; which incorporates a new protocol to only use non-overlapping local sequence regions to construct the pre-processed profiles; and (3) local–global alignment, a weighting scheme based on the double dynamic programming (DDP) technique to softly bias global alignment to local sequence motifs. The first two schemes allow the compilation of residue-specific multiple alignment reliability indices, which can be used in an iterative fashion. The schemes have been implemented with associated iterative modes in the PRALINE multiple sequence alignment method, and have been evaluated using the BAliBASE benchmark alignment database. These tests indicate that PRALINE is a toolbox able to build alignments with very high quality. We found that local profile pre-processing raises the alignment quality by 5.5% compared to PRALINE alignments generated under default conditions. Iteration enhances the quality by a further percentage point. The implications of multiple alignment scoring functions and iteration in relation to alignment quality and benchmarking are discussed.</p></div>","PeriodicalId":79331,"journal":{"name":"Computers & chemistry","volume":"26 5","pages":"Pages 459-477"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S0097-8485(02)00008-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73167550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 49
Deciphering Arabidopsis thaliana gene neighborhoods through bibliographic co-citations 通过文献共引解读拟南芥基因邻域
Pub Date : 2002-07-01 DOI: 10.1016/S0097-8485(02)00011-6
A. Louis , H. Chiapello , C. Fabry , E. Ollivier , A. Hénaut

In the framework of genome annotation, scientific literature is obviously the major source of biological knowledge. The aim of the work described in this paper is to exploit this source of data for the model plant Arabidopsis thaliana. The first step has consisted in constituting a relevant bibliographic references dataset for plant genomic research. Genes co-citations have then been systematically annotated in this reference dataset, starting from the simple idea that if genes are cited in the same publication, they must probably share some related functional properties. In order to deal with the synonymous gene name problem; a gene name reference list has been constituted starting from A. thaliana SwissProt entries. This list was used to build clusters of co-cited genes by a single linkage procedure such that any gene in a given cluster possesses at least one co-cited partner in the same cluster. Analysis of the clusters demonstrate the biological consistency of this approach, with only very few fortuitous links. As an example, a cluster including genes related to flowering time is more deeply described in the paper. Finally, a graphical representation of each cluster was performed, which provides a convenient way to retrieve the genes (the nodes of the graphs) and the references in which they were co-cited (the edges of the graphs). All the results can be accessed at the URL http://chlora.Igi.infobiogen.fr:1234/bib_arath/.

在基因组注释的框架中,科学文献显然是生物学知识的主要来源。本文所描述的工作的目的是利用这一数据来源的模式植物拟南芥。第一步是为植物基因组研究建立一个相关的参考书目数据集。从一个简单的想法开始,如果基因在同一出版物中被引用,它们可能具有一些相关的功能特性,然后在这个参考数据集中系统地注释了基因共引。为了处理同义基因名称问题;从拟南芥SwissProt条目开始,构建了一个基因名称参考表。该列表用于通过单一链接程序构建共被引基因簇,使得给定簇中的任何基因在同一簇中至少具有一个共被引伙伴。对聚类的分析证明了这种方法的生物学一致性,只有很少的偶然联系。作为一个例子,本文更深入地描述了一个包含与开花时间有关的基因簇。最后,对每个聚类进行图形化表示,这提供了一种方便的方法来检索基因(图的节点)和它们被共同引用的参考文献(图的边缘)。所有的结果都可以通过URL http://chlora.Igi.infobiogen.fr:1234/bib_arath/访问。
{"title":"Deciphering Arabidopsis thaliana gene neighborhoods through bibliographic co-citations","authors":"A. Louis ,&nbsp;H. Chiapello ,&nbsp;C. Fabry ,&nbsp;E. Ollivier ,&nbsp;A. Hénaut","doi":"10.1016/S0097-8485(02)00011-6","DOIUrl":"10.1016/S0097-8485(02)00011-6","url":null,"abstract":"<div><p>In the framework of genome annotation, scientific literature is obviously the major source of biological knowledge. The aim of the work described in this paper is to exploit this source of data for the model plant <em>Arabidopsis thaliana</em>. The first step has consisted in constituting a relevant bibliographic references dataset for plant genomic research. Genes co-citations have then been systematically annotated in this reference dataset, starting from the simple idea that if genes are cited in the same publication, they must probably share some related functional properties. In order to deal with the synonymous gene name problem; a gene name reference list has been constituted starting from <em>A. thaliana</em> SwissProt entries. This list was used to build clusters of co-cited genes by a single linkage procedure such that any gene in a given cluster possesses at least one co-cited partner in the same cluster. Analysis of the clusters demonstrate the biological consistency of this approach, with only very few fortuitous links. As an example, a cluster including genes related to flowering time is more deeply described in the paper. Finally, a graphical representation of each cluster was performed, which provides a convenient way to retrieve the genes (the nodes of the graphs) and the references in which they were co-cited (the edges of the graphs). All the results can be accessed at the URL <span>http://chlora.Igi.infobiogen.fr:1234/bib_arath/</span><svg><path></path></svg>.</p></div>","PeriodicalId":79331,"journal":{"name":"Computers & chemistry","volume":"26 5","pages":"Pages 511-519"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S0097-8485(02)00011-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84143076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Computers & chemistry
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1