Pub Date : 2023-12-29eCollection Date: 2024-01-01DOI: 10.1093/ve/vead077
Florence Débarre
While the exact context of the emergence of SARS-CoV-2 remains uncertain, data accumulated since 2020 have provided an increasingly more precise picture of Wuhan's Huanan Seafood Wholesale Market, to which the earliest clusters of human cases of Covid-19 were linked. After the market closed on January 1st 2020, teams from the Chinese Center for Disease Control and Prevention collected environmental samples, and sequenced them. Metagenomic sequencing data from these samples were shared in early 2023. These data confirmed that non-human animals susceptible to SARS-CoV-2 were present in the market before it closed, but also that these animals were located in the side of the market with most human cases, and in a corner with comparatively more SARS-CoV-2-positive environmental samples. The environmental samples were however collected after abundant human-to-human transmission had taken place in the market, precluding any identification of a non-human animal host. Jesse Bloom recently investigated associations between SARS-CoV-2 and non-human animals, concluding that the data failed to indicate whether non-human animals were infected by SARS-CoV-2, despite this being an already acknowledged limitation of the data. Here I explain why a correlation analysis could not confidently conclude which hosts(s) may have shed SARS-CoV-2 in the market, and I rebut the suggestion that such analyses had been encouraged. I show that Bloom's investigation ignores the temporal and spatial structure of the data, which led to incorrect interpretations. Finally, I show that criteria put forward by Bloom to identify the host(s) that shed environmental SARS-CoV-2 would also exclude humans. Progress on the topic of SARS-CoV-2's origin requires a clear distinction between scientific studies and news articles (mis)interpreting them.
{"title":"What we can and cannot learn from SARS-CoV-2 and animals in metagenomic samples from the Huanan market.","authors":"Florence Débarre","doi":"10.1093/ve/vead077","DOIUrl":"10.1093/ve/vead077","url":null,"abstract":"<p><p>While the exact context of the emergence of SARS-CoV-2 remains uncertain, data accumulated since 2020 have provided an increasingly more precise picture of Wuhan's Huanan Seafood Wholesale Market, to which the earliest clusters of human cases of Covid-19 were linked. After the market closed on January 1st 2020, teams from the Chinese Center for Disease Control and Prevention collected environmental samples, and sequenced them. Metagenomic sequencing data from these samples were shared in early 2023. These data confirmed that non-human animals susceptible to SARS-CoV-2 were present in the market before it closed, but also that these animals were located in the side of the market with most human cases, and in a corner with comparatively more SARS-CoV-2-positive environmental samples. The environmental samples were however collected after abundant human-to-human transmission had taken place in the market, precluding any identification of a non-human animal host. Jesse Bloom recently investigated associations between SARS-CoV-2 and non-human animals, concluding that the data failed to indicate whether non-human animals were infected by SARS-CoV-2, despite this being an already acknowledged limitation of the data. Here I explain why a correlation analysis could not confidently conclude which hosts(s) may have shed SARS-CoV-2 in the market, and I rebut the suggestion that such analyses had been encouraged. I show that Bloom's investigation ignores the temporal and spatial structure of the data, which led to incorrect interpretations. Finally, I show that criteria put forward by Bloom to identify the host(s) that shed environmental SARS-CoV-2 would also exclude humans. Progress on the topic of SARS-CoV-2's origin requires a clear distinction between scientific studies and news articles (mis)interpreting them.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"10 1","pages":"vead077"},"PeriodicalIF":5.5,"publicationDate":"2023-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10868546/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139742790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-12-28eCollection Date: 2024-01-01DOI: 10.1093/ve/vead075
Gila Lustig, Yashica Ganga, Hylton E Rodel, Houriiyah Tegally, Afrah Khairallah, Laurelle Jackson, Sandile Cele, Khadija Khan, Zesuliwe Jule, Kajal Reedoy, Farina Karim, Mallory Bernstein, Thumbi Ndung'u, Mahomed-Yunus S Moosa, Derseree Archary, Tulio de Oliveira, Richard Lessells, Richard A Neher, Salim S Abdool Karim, Alex Sigal
One mechanism of variant formation may be evolution during long-term infection in immunosuppressed people. To understand the viral phenotypes evolved during such infection, we tested SARS-CoV-2 viruses evolved from an ancestral B.1 lineage infection lasting over 190 days post-diagnosis in an advanced HIV disease immunosuppressed individual. Sequence and phylogenetic analysis showed two evolving sub-lineages, with the second sub-lineage replacing the first sub-lineage in a seeming evolutionary sweep. Each sub-lineage independently evolved escape from neutralizing antibodies. The most evolved virus for the first sub-lineage (isolated day 34) and the second sub-lineage (isolated day 190) showed similar escape from ancestral SARS-CoV-2 and Delta-variant infection elicited neutralizing immunity despite having no spike mutations in common relative to the B.1 lineage. The day 190 isolate also evolved higher cell-cell fusion and faster viral replication and caused more cell death relative to virus isolated soon after diagnosis, though cell death was similar to day 34 first sub-lineage virus. These data show that SARS-CoV-2 strains in prolonged infection in a single individual can follow independent evolutionary trajectories which lead to neutralization escape and other changes in viral properties.
{"title":"SARS-CoV-2 infection in immunosuppression evolves sub-lineages which independently accumulate neutralization escape mutations.","authors":"Gila Lustig, Yashica Ganga, Hylton E Rodel, Houriiyah Tegally, Afrah Khairallah, Laurelle Jackson, Sandile Cele, Khadija Khan, Zesuliwe Jule, Kajal Reedoy, Farina Karim, Mallory Bernstein, Thumbi Ndung'u, Mahomed-Yunus S Moosa, Derseree Archary, Tulio de Oliveira, Richard Lessells, Richard A Neher, Salim S Abdool Karim, Alex Sigal","doi":"10.1093/ve/vead075","DOIUrl":"10.1093/ve/vead075","url":null,"abstract":"<p><p>One mechanism of variant formation may be evolution during long-term infection in immunosuppressed people. To understand the viral phenotypes evolved during such infection, we tested SARS-CoV-2 viruses evolved from an ancestral B.1 lineage infection lasting over 190 days post-diagnosis in an advanced HIV disease immunosuppressed individual. Sequence and phylogenetic analysis showed two evolving sub-lineages, with the second sub-lineage replacing the first sub-lineage in a seeming evolutionary sweep. Each sub-lineage independently evolved escape from neutralizing antibodies. The most evolved virus for the first sub-lineage (isolated day 34) and the second sub-lineage (isolated day 190) showed similar escape from ancestral SARS-CoV-2 and Delta-variant infection elicited neutralizing immunity despite having no spike mutations in common relative to the B.1 lineage. The day 190 isolate also evolved higher cell-cell fusion and faster viral replication and caused more cell death relative to virus isolated soon after diagnosis, though cell death was similar to day 34 first sub-lineage virus. These data show that SARS-CoV-2 strains in prolonged infection in a single individual can follow independent evolutionary trajectories which lead to neutralization escape and other changes in viral properties.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"10 1","pages":"vead075"},"PeriodicalIF":5.3,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10868398/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139742789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-12-28eCollection Date: 2024-01-01DOI: 10.1093/ve/vead082
Bradley Schwab, John Yin
Viruses persist in nature owing to their extreme genetic heterogeneity and large population sizes, which enable them to evade host immune defenses, escape antiviral drugs, and adapt to new hosts. The persistence of viruses is challenging to study because mutations affect multiple virus genes, interactions among genes in their impacts on virus growth are seldom known, and measures of viral fitness are yet to be standardized. To address these challenges, we employed a data-driven computational model of cell infection by a virus. The infection model accounted for the kinetics of viral gene expression, functional gene-gene interactions, genome replication, and allocation of host cellular resources to produce progeny of vesicular stomatitis virus, a prototype RNA virus. We used this model to computationally probe how interactions among genes carrying up to eleven deleterious mutations affect different measures of virus fitness: single-cycle growth yields and multicycle rates of infection spread. Individual mutations were implemented by perturbing biophysical parameters associated with individual gene functions of the wild-type model. Our analysis revealed synergistic epistasis among deleterious mutations in their effects on virus yield; so adverse effects of single deleterious mutations were amplified by interaction. For the same mutations, multicycle infection spread indicated weak or negligible epistasis, where single mutations act alone in their effects on infection spread. These results were robust to simulation in high- and low-host resource environments. Our work highlights how different types and magnitudes of epistasis can arise for genetically identical virus variants, depending on the fitness measure. More broadly, gene-gene interactions can differently affect how viruses grow and spread.
{"title":"Computational multigene interactions in virus growth and infection spread.","authors":"Bradley Schwab, John Yin","doi":"10.1093/ve/vead082","DOIUrl":"10.1093/ve/vead082","url":null,"abstract":"<p><p>Viruses persist in nature owing to their extreme genetic heterogeneity and large population sizes, which enable them to evade host immune defenses, escape antiviral drugs, and adapt to new hosts. The persistence of viruses is challenging to study because mutations affect multiple virus genes, interactions among genes in their impacts on virus growth are seldom known, and measures of viral fitness are yet to be standardized. To address these challenges, we employed a data-driven computational model of cell infection by a virus. The infection model accounted for the kinetics of viral gene expression, functional gene-gene interactions, genome replication, and allocation of host cellular resources to produce progeny of vesicular stomatitis virus, a prototype RNA virus. We used this model to computationally probe how interactions among genes carrying up to eleven deleterious mutations affect different measures of virus fitness: single-cycle growth yields and multicycle rates of infection spread. Individual mutations were implemented by perturbing biophysical parameters associated with individual gene functions of the wild-type model. Our analysis revealed synergistic epistasis among deleterious mutations in their effects on virus yield; so adverse effects of single deleterious mutations were amplified by interaction. For the same mutations, multicycle infection spread indicated weak or negligible epistasis, where single mutations act alone in their effects on infection spread. These results were robust to simulation in high- and low-host resource environments. Our work highlights how different types and magnitudes of epistasis can arise for genetically identical virus variants, depending on the fitness measure. More broadly, gene-gene interactions can differently affect how viruses grow and spread.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"10 1","pages":"vead082"},"PeriodicalIF":5.3,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10868543/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139742787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-12-28eCollection Date: 2024-01-01DOI: 10.1093/ve/vead086
Vasanthi Avadhanula, Daniel Paiva Agustinho, Vipin Kumar Menon, Roy F Chemaly, Dimpy P Shah, Xiang Qin, Anil Surathu, Harshavardhan Doddapaneni, Donna M Muzny, Ginger A Metcalf, Sara Javornik Cregeen, Richard A Gibbs, Joseph F Petrosino, Fritz J Sedlazeck, Pedro A Piedra
Respiratory syncytial virus (RSV) infection in immunocompromised individuals often leads to prolonged illness, progression to severe lower respiratory tract infection, and even death. How the host immune environment of the hematopoietic stem cell transplant (HCT) adults can affect viral genetic variation during an acute infection is not understood well. In the present study, we performed whole genome sequencing of RSV/A or RSV/B from samples collected longitudinally from HCT adults with normal (<14 days) and delayed (≥14 days) RSV clearance who were enrolled in a ribavirin trial. We determined the inter-host and intra-host genetic variation of RSV and the effect of mutations on putative glycosylation sites. The inter-host variation of RSV is centered in the attachment (G) and fusion (F) glycoprotein genes followed by polymerase (L) and matrix (M) genes. Interestingly, the overall genetic variation was constant between normal and delayed clearance groups for both RSV/A and RSV/B. Intra-host variation primarily occurred in the G gene followed by non-structural protein (NS1) and L genes; however, gain or loss of stop codons and frameshift mutations appeared only in the G gene and only in the delayed viral clearance group. Potential gain or loss of O-linked glycosylation sites in the G gene occurred both in RSV/A and RSV/B isolates. For RSV F gene, loss of N-linked glycosylation site occurred in three RSV/B isolates within an antigenic epitope. Both oral and aerosolized ribavirin did not cause any mutations in the L gene. In summary, prolonged viral shedding and immune deficiency resulted in RSV variation, especially in structural mutations in the G gene, possibly associated with immune evasion. Therefore, sequencing and monitoring of RSV isolates from immunocompromised patients are crucial as they can create escape mutants that can impact the effectiveness of upcoming vaccines and treatments.
{"title":"Inter and intra-host diversity of RSV in hematopoietic stem cell transplant adults with normal and delayed viral clearance.","authors":"Vasanthi Avadhanula, Daniel Paiva Agustinho, Vipin Kumar Menon, Roy F Chemaly, Dimpy P Shah, Xiang Qin, Anil Surathu, Harshavardhan Doddapaneni, Donna M Muzny, Ginger A Metcalf, Sara Javornik Cregeen, Richard A Gibbs, Joseph F Petrosino, Fritz J Sedlazeck, Pedro A Piedra","doi":"10.1093/ve/vead086","DOIUrl":"10.1093/ve/vead086","url":null,"abstract":"<p><p>Respiratory syncytial virus (RSV) infection in immunocompromised individuals often leads to prolonged illness, progression to severe lower respiratory tract infection, and even death. How the host immune environment of the hematopoietic stem cell transplant (HCT) adults can affect viral genetic variation during an acute infection is not understood well. In the present study, we performed whole genome sequencing of RSV/A or RSV/B from samples collected longitudinally from HCT adults with normal (<14 days) and delayed (≥14 days) RSV clearance who were enrolled in a ribavirin trial. We determined the inter-host and intra-host genetic variation of RSV and the effect of mutations on putative glycosylation sites. The inter-host variation of RSV is centered in the attachment (G) and fusion (F) glycoprotein genes followed by polymerase (L) and matrix (M) genes. Interestingly, the overall genetic variation was constant between normal and delayed clearance groups for both RSV/A and RSV/B. Intra-host variation primarily occurred in the G gene followed by non-structural protein (NS1) and L genes; however, gain or loss of stop codons and frameshift mutations appeared only in the G gene and only in the delayed viral clearance group. Potential gain or loss of O-linked glycosylation sites in the G gene occurred both in RSV/A and RSV/B isolates. For RSV F gene, loss of N-linked glycosylation site occurred in three RSV/B isolates within an antigenic epitope. Both oral and aerosolized ribavirin did not cause any mutations in the L gene. In summary, prolonged viral shedding and immune deficiency resulted in RSV variation, especially in structural mutations in the G gene, possibly associated with immune evasion. Therefore, sequencing and monitoring of RSV isolates from immunocompromised patients are crucial as they can create escape mutants that can impact the effectiveness of upcoming vaccines and treatments.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"10 1","pages":"vead086"},"PeriodicalIF":5.5,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10868550/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139742788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-12-28eCollection Date: 2024-01-01DOI: 10.1093/ve/vead088
Nadja Brait, Thomas Hackl, Côme Morel, Antoni Exbrayat, Serafin Gutierrez, Sebastian Lequime
Large-scale metagenomic and -transcriptomic studies have revolutionized our understanding of viral diversity and abundance. In contrast, endogenous viral elements (EVEs), remnants of viral sequences integrated into host genomes, have received limited attention in the context of virus discovery, especially in RNA-Seq data. EVEs resemble their original viruses, a challenge that makes distinguishing between active infections and integrated remnants difficult, affecting virus classification and biases downstream analyses. Here, we systematically assess the effects of EVEs on a prototypical virus discovery pipeline, evaluate their impact on data integrity and classification accuracy, and provide some recommendations for better practices. We examined EVEs and exogenous viral sequences linked to Orthomyxoviridae, a diverse family of negative-sense segmented RNA viruses, in 13 genomic and 538 transcriptomic datasets of Culicinae mosquitoes. Our analysis revealed a substantial number of viral sequences in transcriptomic datasets. However, a significant portion appeared not to be exogenous viruses but transcripts derived from EVEs. Distinguishing between transcribed EVEs and exogenous virus sequences was especially difficult in samples with low viral abundance. For example, three transcribed EVEs showed full-length segments, devoid of frameshift and nonsense mutations, exhibiting sufficient mean read depths that qualify them as exogenous virus hits. Mapping reads on a host genome containing EVEs before assembly somewhat alleviated the EVE burden, but it led to a drastic reduction of viral hits and reduced quality of assemblies, especially in regions of the viral genome relatively similar to EVEs. Our study highlights that our knowledge of the genetic diversity of viruses can be altered by the underestimated presence of EVEs in transcriptomic datasets, leading to false positives and altered or missing sequence information. Thus, recognizing and addressing the influence of EVEs in virus discovery pipelines will be key in enhancing our ability to capture the full spectrum of viral diversity.
大规模的元基因组和转录组研究彻底改变了我们对病毒多样性和丰度的认识。相比之下,内源性病毒元件(EVEs),即整合到宿主基因组中的病毒序列残余,在病毒发现方面受到的关注有限,尤其是在 RNA-Seq 数据中。EVEs 与原始病毒相似,这使得区分活性感染和整合残余病毒变得困难,影响了病毒分类并使下游分析出现偏差。在这里,我们系统地评估了 EVEs 对原型病毒发现管道的影响,评估了它们对数据完整性和分类准确性的影响,并为更好的实践提出了一些建议。我们研究了 13 个蚊子基因组数据集和 538 个转录组数据集中与正粘病毒科(Orthomyxoviridae)相关的 EVEs 和外源病毒序列。我们的分析在转录组数据集中发现了大量病毒序列。然而,有很大一部分似乎不是外源病毒,而是来自于 EVE 的转录本。在病毒丰度较低的样本中,区分转录的 EVE 和外源病毒序列尤其困难。例如,有三个转录的 EVEs 显示出全长片段,没有框架转换和无义突变,其平均读取深度足以使其成为外源病毒的命中片段。在组装前将读数映射到含有 EVE 的宿主基因组上在一定程度上减轻了 EVE 的负担,但却导致病毒命中率的急剧下降和组装质量的降低,尤其是在与 EVE 相对相似的病毒基因组区域。我们的研究突出表明,由于低估了转录组数据集中 EVE 的存在,我们对病毒遗传多样性的了解可能会被改变,从而导致假阳性和序列信息的改变或缺失。因此,认识并解决 EVEs 对病毒发现管道的影响将是提高我们全面捕捉病毒多样性能力的关键。
{"title":"A tale of caution: How endogenous viral elements affect virus discovery in transcriptomic data.","authors":"Nadja Brait, Thomas Hackl, Côme Morel, Antoni Exbrayat, Serafin Gutierrez, Sebastian Lequime","doi":"10.1093/ve/vead088","DOIUrl":"10.1093/ve/vead088","url":null,"abstract":"<p><p>Large-scale metagenomic and -transcriptomic studies have revolutionized our understanding of viral diversity and abundance. In contrast, endogenous viral elements (EVEs), remnants of viral sequences integrated into host genomes, have received limited attention in the context of virus discovery, especially in RNA-Seq data. EVEs resemble their original viruses, a challenge that makes distinguishing between active infections and integrated remnants difficult, affecting virus classification and biases downstream analyses. Here, we systematically assess the effects of EVEs on a prototypical virus discovery pipeline, evaluate their impact on data integrity and classification accuracy, and provide some recommendations for better practices. We examined EVEs and exogenous viral sequences linked to <i>Orthomyxoviridae</i>, a diverse family of negative-sense segmented RNA viruses, in 13 genomic and 538 transcriptomic datasets of Culicinae mosquitoes. Our analysis revealed a substantial number of viral sequences in transcriptomic datasets. However, a significant portion appeared not to be exogenous viruses but transcripts derived from EVEs. Distinguishing between transcribed EVEs and exogenous virus sequences was especially difficult in samples with low viral abundance. For example, three transcribed EVEs showed full-length segments, devoid of frameshift and nonsense mutations, exhibiting sufficient mean read depths that qualify them as exogenous virus hits. Mapping reads on a host genome containing EVEs before assembly somewhat alleviated the EVE burden, but it led to a drastic reduction of viral hits and reduced quality of assemblies, especially in regions of the viral genome relatively similar to EVEs. Our study highlights that our knowledge of the genetic diversity of viruses can be altered by the underestimated presence of EVEs in transcriptomic datasets, leading to false positives and altered or missing sequence information. Thus, recognizing and addressing the influence of EVEs in virus discovery pipelines will be key in enhancing our ability to capture the full spectrum of viral diversity.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"10 1","pages":"vead088"},"PeriodicalIF":5.5,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10956553/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140186397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abigail A Howell, Cyril J Versoza, Susanne P Pfeifer
The rapid emergence and spread of antimicrobial resistance across the globe have prompted the usage of bacteriophages (i.e., viruses that infect bacteria) in a variety of applications ranging from agriculture to biotechnology and medicine. In order to effectively guide the application of bacteriophages in these multifaceted areas, information about their host ranges – that is the bacterial strains or species that a bacteriophage can successfully infect and kill – is essential. Utilizing 16 broad-spectrum (polyvalent) bacteriophages with experimentally validated host ranges, we here benchmark the performance of 11 recently developed computational host range prediction tools that provide a promising and highly scalable supplement to traditional, but laborious, experimental procedures. We show that machine- and deep-learning approaches offer the highest levels of accuracy and precision – however, their predominant predictions at the species- or genus-level render them ill-suited for applications outside of an ecosystems metagenomics framework. In contrast, only moderate sensitivity (<80%) could be reached at the strain-level, albeit at low levels of precision (<40%). Taken together, these limitations demonstrate that there remains room for improvement in the active scientific field of in silico host prediction to combat the challenge of guiding experimental designs to identify the most promising bacteriophage candidates for any given application.
{"title":"Computational host range prediction – the good, the bad, and the ugly","authors":"Abigail A Howell, Cyril J Versoza, Susanne P Pfeifer","doi":"10.1093/ve/vead083","DOIUrl":"https://doi.org/10.1093/ve/vead083","url":null,"abstract":"The rapid emergence and spread of antimicrobial resistance across the globe have prompted the usage of bacteriophages (i.e., viruses that infect bacteria) in a variety of applications ranging from agriculture to biotechnology and medicine. In order to effectively guide the application of bacteriophages in these multifaceted areas, information about their host ranges – that is the bacterial strains or species that a bacteriophage can successfully infect and kill – is essential. Utilizing 16 broad-spectrum (polyvalent) bacteriophages with experimentally validated host ranges, we here benchmark the performance of 11 recently developed computational host range prediction tools that provide a promising and highly scalable supplement to traditional, but laborious, experimental procedures. We show that machine- and deep-learning approaches offer the highest levels of accuracy and precision – however, their predominant predictions at the species- or genus-level render them ill-suited for applications outside of an ecosystems metagenomics framework. In contrast, only moderate sensitivity (&lt;80%) could be reached at the strain-level, albeit at low levels of precision (&lt;40%). Taken together, these limitations demonstrate that there remains room for improvement in the active scientific field of in silico host prediction to combat the challenge of guiding experimental designs to identify the most promising bacteriophage candidates for any given application.","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"1 1","pages":""},"PeriodicalIF":5.3,"publicationDate":"2023-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138826161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Norma R Forero-Muñoz, Renata L Muylaert, Stephanie N Seifert, Gregory F Albery, Daniel J Becker, Colin J Carlson, Timothée Poisot
Pathogen evolution is one of the least predictable components of disease emergence, particularly in nature. Here, building on principles established by the geographic mosaic theory of coevolution, we develop a quantitative, spatially-explicit framework for mapping the evolutionary risk of viral emergence. Driven by interest in diseases like SARS, MERS, and COVID-19, we examine the global biogeography of bat-origin betacoronaviruses, and find that coevolutionary principles suggest geographies of risk that are distinct from the hotspots and coldspots of host richness. Further, our framework helps explain patterns like a unique pool of merbecoviruses in the Neotropics, a recently-discovered lineage of divergent nobecoviruses in Madagascar, and–most importantly–hotspots of diversification in southeast Asia, sub-Saharan Africa, and the Middle East that correspond to the site of previous zoonotic emergence events. Our framework may help identify hotspots of future risk that have also been previously overlooked, like west Africa and the Indian subcontinent, and may more broadly help researchers understand how host ecology shapes the evolution and diversity of pandemic threats.
{"title":"The coevolutionary mosaic of bat betacoronavirus emergence risk","authors":"Norma R Forero-Muñoz, Renata L Muylaert, Stephanie N Seifert, Gregory F Albery, Daniel J Becker, Colin J Carlson, Timothée Poisot","doi":"10.1093/ve/vead079","DOIUrl":"https://doi.org/10.1093/ve/vead079","url":null,"abstract":"Pathogen evolution is one of the least predictable components of disease emergence, particularly in nature. Here, building on principles established by the geographic mosaic theory of coevolution, we develop a quantitative, spatially-explicit framework for mapping the evolutionary risk of viral emergence. Driven by interest in diseases like SARS, MERS, and COVID-19, we examine the global biogeography of bat-origin betacoronaviruses, and find that coevolutionary principles suggest geographies of risk that are distinct from the hotspots and coldspots of host richness. Further, our framework helps explain patterns like a unique pool of merbecoviruses in the Neotropics, a recently-discovered lineage of divergent nobecoviruses in Madagascar, and–most importantly–hotspots of diversification in southeast Asia, sub-Saharan Africa, and the Middle East that correspond to the site of previous zoonotic emergence events. Our framework may help identify hotspots of future risk that have also been previously overlooked, like west Africa and the Indian subcontinent, and may more broadly help researchers understand how host ecology shapes the evolution and diversity of pandemic threats.","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"205 1","pages":""},"PeriodicalIF":5.3,"publicationDate":"2023-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138826728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
F Chevenet, D Fargette, P Bastide, T Vitré, S Guindon
EvoLaps is a user-friendly web application designed to visualize the spatial and temporal spread of pathogens. It takes an annotated tree as entry, such as a maximum clade credibility tree obtained through continuous phylogeographic inference. By following a ‘Top-Down’ reading of a tree recursively, transitions (latitude/longitude changes from a node to its children) are represented on a cartographic background using graphical paths. The complete set of paths forms the phylogeographic scenario. EvoLaps offers several features to analyze complex scenarios: (i) enhanced path display using multiple graphical variables with time-dependent gradients, (ii) cross-highlighting and selection capabilities between the phylogeographic scenario and the phylogenetic tree, (iii) production of specific spatio-temporal scales and synthetic views through dynamic and iterative clustering of localities into spatial clusters, (iv) animation of the phylogeographic scenario using tree brushing, which can be done manually or automatically, gradually over time or at specific time intervals, and for the entire tree or a specific clade, and (v) an evolving library of additional tools. EvoLaps is freely available for use at evolaps.org.
{"title":"EvoLaps 2: Advanced Phylogeographic Visualization","authors":"F Chevenet, D Fargette, P Bastide, T Vitré, S Guindon","doi":"10.1093/ve/vead078","DOIUrl":"https://doi.org/10.1093/ve/vead078","url":null,"abstract":"EvoLaps is a user-friendly web application designed to visualize the spatial and temporal spread of pathogens. It takes an annotated tree as entry, such as a maximum clade credibility tree obtained through continuous phylogeographic inference. By following a ‘Top-Down’ reading of a tree recursively, transitions (latitude/longitude changes from a node to its children) are represented on a cartographic background using graphical paths. The complete set of paths forms the phylogeographic scenario. EvoLaps offers several features to analyze complex scenarios: (i) enhanced path display using multiple graphical variables with time-dependent gradients, (ii) cross-highlighting and selection capabilities between the phylogeographic scenario and the phylogenetic tree, (iii) production of specific spatio-temporal scales and synthetic views through dynamic and iterative clustering of localities into spatial clusters, (iv) animation of the phylogeographic scenario using tree brushing, which can be done manually or automatically, gradually over time or at specific time intervals, and for the entire tree or a specific clade, and (v) an evolving library of additional tools. EvoLaps is freely available for use at evolaps.org.","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"13 1","pages":""},"PeriodicalIF":5.3,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138820036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ignacio Garcia, Yunsung Lee, Ola Brynildsrud, Vegard Eldholm, Per Magnus, Anita Blomfeldt, Truls M Leegaard, Fredrik Müller, Susanne Dudman, Dominique A Caugant
Vaccination against SARS-CoV-2 has greatly mitigated the impact of the COVID-19 pandemic. However, concerns have been raised about the degree to which vaccination might drive the emergence and selection of immune escape mutations that will hamper the efficacy of the vaccines. In this study we investigate whether vaccination impacted the micro-scale adaptive evolution of SARS-CoV-2 in the Oslo region of Norway, during the first nine months of 2021, a period in which the population went from near-zero to almost 90% vaccine coverage in the population over 50 years old. Weekly aggregated data stratified by age on vaccine uptake and number of SARS-CoV-2 cases in the area were obtained from the National Immunization Registry and the Norwegian Surveillance System for Communicable Diseases, respectively. A total of 6,438 virus sequences (7.5% of the registered cases) along with metadata were available. We used a causal-driven approach to investigate the relationship between vaccination progress and changes in the frequency of 362 mutations present in at least 10 samples, conditioned on the emergence of new lineages, time, and population vaccination coverage. After validating our approach, we identified 21 positive and 12 negative connections between vaccination progress and mutation prevalence, most of them were outside the Spike protein. We observed a tendency for the mutations that we identified as positively connected with vaccination to decrease as the vaccinated population increased. After modelling the fitness of different competing mutations in a population, we found that our observations could be explained by a clonal interference phenomenon in which high fitness mutations would be outcompeted by the emergence or introduction of other high-fitness mutations.
{"title":"Tracing the adaptive evolution of SARS-CoV-2 during vaccine roll-out in Norway","authors":"Ignacio Garcia, Yunsung Lee, Ola Brynildsrud, Vegard Eldholm, Per Magnus, Anita Blomfeldt, Truls M Leegaard, Fredrik Müller, Susanne Dudman, Dominique A Caugant","doi":"10.1093/ve/vead081","DOIUrl":"https://doi.org/10.1093/ve/vead081","url":null,"abstract":"Vaccination against SARS-CoV-2 has greatly mitigated the impact of the COVID-19 pandemic. However, concerns have been raised about the degree to which vaccination might drive the emergence and selection of immune escape mutations that will hamper the efficacy of the vaccines. In this study we investigate whether vaccination impacted the micro-scale adaptive evolution of SARS-CoV-2 in the Oslo region of Norway, during the first nine months of 2021, a period in which the population went from near-zero to almost 90% vaccine coverage in the population over 50 years old. Weekly aggregated data stratified by age on vaccine uptake and number of SARS-CoV-2 cases in the area were obtained from the National Immunization Registry and the Norwegian Surveillance System for Communicable Diseases, respectively. A total of 6,438 virus sequences (7.5% of the registered cases) along with metadata were available. We used a causal-driven approach to investigate the relationship between vaccination progress and changes in the frequency of 362 mutations present in at least 10 samples, conditioned on the emergence of new lineages, time, and population vaccination coverage. After validating our approach, we identified 21 positive and 12 negative connections between vaccination progress and mutation prevalence, most of them were outside the Spike protein. We observed a tendency for the mutations that we identified as positively connected with vaccination to decrease as the vaccinated population increased. After modelling the fitness of different competing mutations in a population, we found that our observations could be explained by a clonal interference phenomenon in which high fitness mutations would be outcompeted by the emergence or introduction of other high-fitness mutations.","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"270 1","pages":""},"PeriodicalIF":5.3,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138826158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Coxsackievirus A16 (CVA16) is a major pathogen that causes hand, foot, and mouth disease (HFMD). The recombination form (RF) shifts and global transmission dynamics of CVA16 remain unknown. In this retrospective study, global sequences of CVA16 were retrieved from the GenBank database and analyzed using comprehensive phylogenetic inference, RF surveys, and population structure. A total of 1663 sequences were collected, forming a 442-sequences dataset for VP1 coding region analysis and a 345-sequences dataset for RF identification. Based on the VP1 coding region used for serotyping, three genotypes (A, B, and D), two subgenotypes of genotype B (B1 and B2), and three clusters of subgenotype B1 (B1a, B1b, and B1c) were identified. Cluster B1b has dominated the global epidemics, B2 disappeared in 2000, and D is an emerging genotype dating back to August 2002. Globally, four oscillation phases of CVA16 evolution, with a peak in 2013, and three migration pathways were identified. Europe, China, and Japan have served as the seeds for the global transmission of CVA16. Based on the 3D coding region of the RFs, five clusters of RFs (RF-A to -E) were identified. The shift in RFs from RF-B and RF-C to RF-D was accompanied by a change in genotype from B2 to B1a and B1c and then to B1b. In conclusion, the evolution and population dynamics of CVA16, especially the coevolution of 3D and VP1 genes, revealed that genotype evolution and RF replacement were synergistic rather than stochastic.
{"title":"Synergetic association between coxsackievirus A16 genotype evolution and recombinant form shifts","authors":"Zhenzhi Han, Fangming Wang, Jinbo Xiao, Hanhaoyu Fu, Yang Song, Mingli Jiang, Huanhuan Lu, Jichen Li, Yanpeng Xu, Runan Zhu, Yong Zhang, Linqing Zhao","doi":"10.1093/ve/vead080","DOIUrl":"https://doi.org/10.1093/ve/vead080","url":null,"abstract":"Coxsackievirus A16 (CVA16) is a major pathogen that causes hand, foot, and mouth disease (HFMD). The recombination form (RF) shifts and global transmission dynamics of CVA16 remain unknown. In this retrospective study, global sequences of CVA16 were retrieved from the GenBank database and analyzed using comprehensive phylogenetic inference, RF surveys, and population structure. A total of 1663 sequences were collected, forming a 442-sequences dataset for VP1 coding region analysis and a 345-sequences dataset for RF identification. Based on the VP1 coding region used for serotyping, three genotypes (A, B, and D), two subgenotypes of genotype B (B1 and B2), and three clusters of subgenotype B1 (B1a, B1b, and B1c) were identified. Cluster B1b has dominated the global epidemics, B2 disappeared in 2000, and D is an emerging genotype dating back to August 2002. Globally, four oscillation phases of CVA16 evolution, with a peak in 2013, and three migration pathways were identified. Europe, China, and Japan have served as the seeds for the global transmission of CVA16. Based on the 3D coding region of the RFs, five clusters of RFs (RF-A to -E) were identified. The shift in RFs from RF-B and RF-C to RF-D was accompanied by a change in genotype from B2 to B1a and B1c and then to B1b. In conclusion, the evolution and population dynamics of CVA16, especially the coevolution of 3D and VP1 genes, revealed that genotype evolution and RF replacement were synergistic rather than stochastic.","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"31 1","pages":""},"PeriodicalIF":5.3,"publicationDate":"2023-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138827037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}