On the collection and integration of SARS-CoV-2 genome data

IF 3.5 | Q1 (Public, Environmental & Occupational Health) | Biosafety and Health | Pub Date: 2023-08-01 | DOI: 10.1016/j.bsheal.2023.07.004
Lina Ma, Wei Zhao, Tianhao Huang, Enhui Jin, Gangao Wu, Wenming Zhao, Yiming Bao
Citations: 2

Abstract

Genome data of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) are essential for virus diagnosis, vaccine development, and variant surveillance. To archive and integrate worldwide SARS-CoV-2 genome data, a series of resources has been constructed, serving as fundamental infrastructure for SARS-CoV-2 research, pandemic prevention and control, and coronavirus disease 2019 (COVID-19) therapy. Here we present an overview of extant SARS-CoV-2 resources devoted to genome data deposition and integration. We review deposition resources in terms of data accessibility, metadata standardization, and data curation and annotation, and integrative resources in terms of data source, de-redundancy processing, data curation and quality assessment, and variant annotation. Moreover, we address issues that impede SARS-CoV-2 genome data integration, including low complexity, inconsistent or absent isolate names, sequence inconsistency, asynchronous updates of genome data, and mismatched metadata. Finally, we provide insights into data standardization consensus and data submission guidelines to promote SARS-CoV-2 genome data sharing and integration.
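
The integration issues listed in the abstract (inconsistent or absent isolate names, sequence inconsistency) can be illustrated with a minimal Python sketch of a de-redundancy step. It is not the method of any resource reviewed in the paper. The record fields (`isolate`, `sequence`), the function names, and the normalization rule (dropping a leading `hCoV-19/` or `SARS-CoV-2/` prefix and ignoring case and whitespace) are all assumptions made for illustration.

```python
import hashlib
import re
from collections import defaultdict


def normalize_isolate(name):
    """Hypothetical normalization rule: strip a leading 'hCoV-19/' or
    'SARS-CoV-2/' prefix, remove whitespace, and uppercase the rest.
    Returns None when the isolate name is absent."""
    if not name or not name.strip():
        return None
    cleaned = re.sub(r"^(hcov-19|sars-cov-2)/", "", name.strip(), flags=re.IGNORECASE)
    return re.sub(r"\s+", "", cleaned).upper()


def sequence_digest(seq):
    """Case-insensitive SHA-256 fingerprint of a nucleotide sequence."""
    return hashlib.sha256(seq.upper().encode("ascii")).hexdigest()


def merge_records(records):
    """Group records by normalized isolate name.

    Returns (merged, conflicts, unnamed):
      merged    - one representative record per isolate whose copies agree
      conflicts - normalized names whose copies carry different sequences
      unnamed   - records that cannot be matched because the name is absent
    """
    groups = defaultdict(list)
    unnamed = []
    for rec in records:
        key = normalize_isolate(rec.get("isolate"))
        if key is None:
            unnamed.append(rec)
        else:
            groups[key].append(rec)

    merged, conflicts = [], []
    for key, recs in groups.items():
        digests = {sequence_digest(r["sequence"]) for r in recs}
        if len(digests) == 1:
            merged.append(recs[0])   # identical copies: keep one
        else:
            conflicts.append(key)    # same isolate, different sequences
    return merged, conflicts, unnamed
```

In this sketch, records that share a name but differ in sequence are reported rather than silently merged, since the abstract names sequence inconsistency as an issue in its own right. Records without an isolate name are set aside because name-based matching cannot place them.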

Source journal: Biosafety and Health (Medicine-Infectious Diseases)
CiteScore: 7.60
Self-citation rate: 0.00%
Articles published: 116
Review time: 66 days