OrthoDB and BUSCO update: annotation of orthologs with wider sampling of genomes

Nucleic Acids Research. Pub Date: 2024-11-13. DOI: 10.1093/nar/gkae987. IF 16.6; CAS Zone 2 (Biology); JCR Q1 (Biochemistry & Molecular Biology)
Fredrik Tegenfeldt, Dmitry Kuznetsov, Mosè Manni, Matthew Berkeley, Evgeny M Zdobnov, Evgenia V Kriventseva

Abstract

OrthoDB (https://www.orthodb.org) offers evolutionary and functional annotations of orthologous genes in the widest sampling of eukaryotes, prokaryotes, and viruses, extending experimental gene function knowledge to newly sequenced genomes. We collect gene annotations, delineate hierarchical gene orthology, and annotate the orthologous groups (OGs) with functional and evolutionary traits. OrthoDB is the leading resource for species diversity, striving to sample the most diverse and well-researched organisms with the highest quality genomic data. This update expands to include 5827 eukaryotic genomes. We have also added coding DNA sequences (CDSs) and gene loci coordinates. OrthoDB can be browsed, downloaded, or accessed through a REST API, SPARQL/RDF, and now also via API packages for Python and R Bioconductor. OrthoLoger (https://orthologer.ezlab.org), the tool used for inferring orthologs in OrthoDB, is now available as a Conda package and through BioContainers. ODB-mapper, a component of OrthoLoger, streamlines annotation of genes from newly sequenced genomes with OrthoDB evolutionary and functional descriptors. The benchmarking sets of universal single-copy orthologs (BUSCO), derived from OrthoDB, have correspondingly received a major update. The BUSCO tool (https://busco.ezlab.org) has become a standard in genomics, uniquely capable of assessing both eukaryotic and prokaryotic species. It is applicable to gene sets, transcriptomes, genome assemblies, and metagenomic bins.
Source journal

Nucleic Acids Research (Biology: Biochemistry & Molecular Biology)
CiteScore: 27.10
Self-citation rate: 4.70%
Articles published: 1057
Review time: 2 months

About the journal: Nucleic Acids Research (NAR) is a scientific journal that publishes research on various aspects of nucleic acids and proteins involved in nucleic acid metabolism and interactions. It covers areas such as chemistry and synthetic biology, computational biology, gene regulation, chromatin and epigenetics, genome integrity, repair and replication, genomics, molecular biology, nucleic acid enzymes, RNA, and structural biology. The journal also includes a Survey and Summary section for brief reviews. Additionally, each year, the first issue is dedicated to biological databases, and an issue in July focuses on web-based software resources for the biological community. Nucleic Acids Research is indexed by several services including Abstracts on Hygiene and Communicable Diseases, Animal Breeding Abstracts, Agricultural Engineering Abstracts, Agbiotech News and Information, BIOSIS Previews, CAB Abstracts, and EMBASE.