{"title":"利用全基因组序列构建病毒关系树","authors":"Jing-doo Wang, Yi-Chun Wang","doi":"10.1109/BIBE.2018.00010","DOIUrl":null,"url":null,"abstract":"This paper aims to construct two relationship trees of all viruses using two types of genomic sequences, DNA(\"deoxyribonucleic acid\") and CDS (\"coding sequence\"), respectively, via a previously developed approach BBRD (BLAST-Based Relative Distance). The BBRD approach is capable to construct the relationship trees of different genomic sequences without identifying common conserved regions among these sequences for comparison in advance. The experimental resources of viruses, with complete genome sequences, were downloaded from NCBI(National Center for Biotechnology Information) at 2018/3/1, and there are 7,535 viruses with whole DNA sequences and 7,434 viruses with at least one CDS sequences. Experimental results show that the relationship tree constructed via DNA sequences seems to be more consistent with the taxonomy of viruses in ICTV (International Committee on Taxonomy of Viruses) than that constructed via CDS sequences. Furthermore, observing the neighbors of one unknown virus within the relationship trees can provide hints to determine or guess its taxonomic information for the biologist or virologist. This study may inspect the fitness of the structures (skeletons) of one existing taxonomy, e.g. ICTV, by observing the relationship tree and providing the parts of subtree without consistence.","PeriodicalId":127507,"journal":{"name":"2018 IEEE 18th International Conference on Bioinformatics and Bioengineering (BIBE)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Constructing the Relationship Tree of All Viruses via Whole Genomic Sequences\",\"authors\":\"Jing-doo Wang, Yi-Chun Wang\",\"doi\":\"10.1109/BIBE.2018.00010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper aims to construct two relationship trees of all viruses using two types of genomic sequences, DNA(\\\"deoxyribonucleic acid\\\") and CDS (\\\"coding sequence\\\"), respectively, via a previously developed approach BBRD (BLAST-Based Relative Distance). The BBRD approach is capable to construct the relationship trees of different genomic sequences without identifying common conserved regions among these sequences for comparison in advance. The experimental resources of viruses, with complete genome sequences, were downloaded from NCBI(National Center for Biotechnology Information) at 2018/3/1, and there are 7,535 viruses with whole DNA sequences and 7,434 viruses with at least one CDS sequences. Experimental results show that the relationship tree constructed via DNA sequences seems to be more consistent with the taxonomy of viruses in ICTV (International Committee on Taxonomy of Viruses) than that constructed via CDS sequences. Furthermore, observing the neighbors of one unknown virus within the relationship trees can provide hints to determine or guess its taxonomic information for the biologist or virologist. This study may inspect the fitness of the structures (skeletons) of one existing taxonomy, e.g. ICTV, by observing the relationship tree and providing the parts of subtree without consistence.\",\"PeriodicalId\":127507,\"journal\":{\"name\":\"2018 IEEE 18th International Conference on Bioinformatics and Bioengineering (BIBE)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 18th International Conference on Bioinformatics and Bioengineering (BIBE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBE.2018.00010\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 18th International Conference on Bioinformatics and Bioengineering (BIBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2018.00010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本文旨在利用DNA(“脱氧核糖核酸”)和CDS(“编码序列”)这两种类型的基因组序列,通过先前开发的基于blast的相对距离(BBRD)方法,分别构建所有病毒的两个关系树。BBRD方法能够构建不同基因组序列的关系树,而无需事先确定这些序列之间的共同保守区域进行比较。2018年3月1日,从NCBI(National Center for Biotechnology Information)下载病毒全基因组序列实验资源,拥有完整DNA序列的病毒有7535种,拥有至少一个CDS序列的病毒有7434种。实验结果表明,通过DNA序列构建的关系树似乎比通过CDS序列构建的关系树更符合ICTV (International Committee on taxonomy of viruses)的病毒分类。此外,观察一个未知病毒在关系树中的邻居可以为生物学家或病毒学家提供确定或猜测其分类信息的提示。本研究可能通过观察关系树和提供子树中不一致的部分来检验现有分类(如ICTV)的结构(骨架)的适应度。
Constructing the Relationship Tree of All Viruses via Whole Genomic Sequences
This paper aims to construct two relationship trees of all viruses using two types of genomic sequences, DNA("deoxyribonucleic acid") and CDS ("coding sequence"), respectively, via a previously developed approach BBRD (BLAST-Based Relative Distance). The BBRD approach is capable to construct the relationship trees of different genomic sequences without identifying common conserved regions among these sequences for comparison in advance. The experimental resources of viruses, with complete genome sequences, were downloaded from NCBI(National Center for Biotechnology Information) at 2018/3/1, and there are 7,535 viruses with whole DNA sequences and 7,434 viruses with at least one CDS sequences. Experimental results show that the relationship tree constructed via DNA sequences seems to be more consistent with the taxonomy of viruses in ICTV (International Committee on Taxonomy of Viruses) than that constructed via CDS sequences. Furthermore, observing the neighbors of one unknown virus within the relationship trees can provide hints to determine or guess its taxonomic information for the biologist or virologist. This study may inspect the fitness of the structures (skeletons) of one existing taxonomy, e.g. ICTV, by observing the relationship tree and providing the parts of subtree without consistence.