Changqiao You , Shuai Jiang , Yunyun Ding , Shunxing Ye , Xiaoxiao Zou , Hongming Zhang , Zeqi Li , Fenglin Chen , Yongliang Li , Xingyi Ge , Xinhong Guo
{"title":"基于完整基因组序列遗传测试的可公开获取的 RNA 条形码片段,用于从 HCoV 和 SARSr-CoV-2 系中识别 SARS-CoV-2","authors":"Changqiao You , Shuai Jiang , Yunyun Ding , Shunxing Ye , Xiaoxiao Zou , Hongming Zhang , Zeqi Li , Fenglin Chen , Yongliang Li , Xingyi Ge , Xinhong Guo","doi":"10.1016/j.virs.2024.01.006","DOIUrl":null,"url":null,"abstract":"<div><p>Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the pathogen responsible for coronavirus disease 2019 (COVID-19), continues to evolve, giving rise to more variants and global reinfections. Previous research has demonstrated that barcode segments can effectively and cost-efficiently identify specific species within closely related populations. In this study, we designed and tested RNA barcode segments based on genetic evolutionary relationships to facilitate the efficient and accurate identification of SARS-CoV-2 from extensive virus samples, including human coronaviruses (HCoVs) and SARSr-CoV-2 lineages. Nucleotide sequences sourced from NCBI and GISAID were meticulously selected and curated to construct training sets, encompassing 1733 complete genome sequences of HCoVs and SARSr-CoV-2 lineages. Through genetic-level species testing, we validated the accuracy and reliability of the barcode segments for identifying SARS-CoV-2. Subsequently, 75 main and subordinate species-specific barcode segments for SARS-CoV-2, located in <em>ORF1ab</em>, <em>S</em>, <em>E</em>, <em>ORF7a</em>, and <em>N</em> coding sequences, were intercepted and screened based on single-nucleotide polymorphism sites and weighted scores. Post-testing, these segments exhibited high recall rates (nearly 100%), specificity (almost 30% at the nucleotide level), and precision (100%) performance on identification. They were eventually visualized using one and two-dimensional combined barcodes and deposited in an online database (<span>http://virusbarcodedatabase.top/</span><svg><path></path></svg>). The successful integration of barcoding technology in SARS-CoV-2 identification provides valuable insights for future studies involving complete genome sequence polymorphism analysis. Moreover, this cost-effective and efficient identification approach also provides valuable reference for future research endeavors related to virus surveillance.</p></div>","PeriodicalId":23654,"journal":{"name":"Virologica Sinica","volume":null,"pages":null},"PeriodicalIF":5.5000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1995820X24000063/pdfft?md5=d17814d2aee23bb4d075d608e8b938aa&pid=1-s2.0-S1995820X24000063-main.pdf","citationCount":"0","resultStr":"{\"title\":\"RNA barcode segments for SARS-CoV-2 identification from HCoVs and SARSr-CoV-2 lineages\",\"authors\":\"Changqiao You , Shuai Jiang , Yunyun Ding , Shunxing Ye , Xiaoxiao Zou , Hongming Zhang , Zeqi Li , Fenglin Chen , Yongliang Li , Xingyi Ge , Xinhong Guo\",\"doi\":\"10.1016/j.virs.2024.01.006\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the pathogen responsible for coronavirus disease 2019 (COVID-19), continues to evolve, giving rise to more variants and global reinfections. Previous research has demonstrated that barcode segments can effectively and cost-efficiently identify specific species within closely related populations. In this study, we designed and tested RNA barcode segments based on genetic evolutionary relationships to facilitate the efficient and accurate identification of SARS-CoV-2 from extensive virus samples, including human coronaviruses (HCoVs) and SARSr-CoV-2 lineages. Nucleotide sequences sourced from NCBI and GISAID were meticulously selected and curated to construct training sets, encompassing 1733 complete genome sequences of HCoVs and SARSr-CoV-2 lineages. Through genetic-level species testing, we validated the accuracy and reliability of the barcode segments for identifying SARS-CoV-2. Subsequently, 75 main and subordinate species-specific barcode segments for SARS-CoV-2, located in <em>ORF1ab</em>, <em>S</em>, <em>E</em>, <em>ORF7a</em>, and <em>N</em> coding sequences, were intercepted and screened based on single-nucleotide polymorphism sites and weighted scores. Post-testing, these segments exhibited high recall rates (nearly 100%), specificity (almost 30% at the nucleotide level), and precision (100%) performance on identification. They were eventually visualized using one and two-dimensional combined barcodes and deposited in an online database (<span>http://virusbarcodedatabase.top/</span><svg><path></path></svg>). The successful integration of barcoding technology in SARS-CoV-2 identification provides valuable insights for future studies involving complete genome sequence polymorphism analysis. Moreover, this cost-effective and efficient identification approach also provides valuable reference for future research endeavors related to virus surveillance.</p></div>\",\"PeriodicalId\":23654,\"journal\":{\"name\":\"Virologica Sinica\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2024-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S1995820X24000063/pdfft?md5=d17814d2aee23bb4d075d608e8b938aa&pid=1-s2.0-S1995820X24000063-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Virologica Sinica\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1995820X24000063\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virologica Sinica","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1995820X24000063","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
RNA barcode segments for SARS-CoV-2 identification from HCoVs and SARSr-CoV-2 lineages
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the pathogen responsible for coronavirus disease 2019 (COVID-19), continues to evolve, giving rise to more variants and global reinfections. Previous research has demonstrated that barcode segments can effectively and cost-efficiently identify specific species within closely related populations. In this study, we designed and tested RNA barcode segments based on genetic evolutionary relationships to facilitate the efficient and accurate identification of SARS-CoV-2 from extensive virus samples, including human coronaviruses (HCoVs) and SARSr-CoV-2 lineages. Nucleotide sequences sourced from NCBI and GISAID were meticulously selected and curated to construct training sets, encompassing 1733 complete genome sequences of HCoVs and SARSr-CoV-2 lineages. Through genetic-level species testing, we validated the accuracy and reliability of the barcode segments for identifying SARS-CoV-2. Subsequently, 75 main and subordinate species-specific barcode segments for SARS-CoV-2, located in ORF1ab, S, E, ORF7a, and N coding sequences, were intercepted and screened based on single-nucleotide polymorphism sites and weighted scores. Post-testing, these segments exhibited high recall rates (nearly 100%), specificity (almost 30% at the nucleotide level), and precision (100%) performance on identification. They were eventually visualized using one and two-dimensional combined barcodes and deposited in an online database (http://virusbarcodedatabase.top/). The successful integration of barcoding technology in SARS-CoV-2 identification provides valuable insights for future studies involving complete genome sequence polymorphism analysis. Moreover, this cost-effective and efficient identification approach also provides valuable reference for future research endeavors related to virus surveillance.
Virologica SinicaBiochemistry, Genetics and Molecular Biology-Molecular Medicine
CiteScore
7.70
自引率
1.80%
发文量
3149
期刊介绍:
Virologica Sinica is an international journal which aims at presenting the cutting-edge research on viruses all over the world. The journal publishes peer-reviewed original research articles, reviews, and letters to the editor, to encompass the latest developments in all branches of virology, including research on animal, plant and microbe viruses. The journal welcomes articles on virus discovery and characterization, viral epidemiology, viral pathogenesis, virus-host interaction, vaccine development, antiviral agents and therapies, and virus related bio-techniques. Virologica Sinica, the official journal of Chinese Society for Microbiology, will serve as a platform for the communication and exchange of academic information and ideas in an international context.
Electronic ISSN: 1995-820X; Print ISSN: 1674-0769