Pub Date : 2025-09-18eCollection Date: 2025-01-01DOI: 10.1093/ve/veaf068
Michael D Kupperman, Ruian Ke, Thomas Leitner
Robust sampling methods are foundational to inferences using phylogenies. Yet the impact of using contact tracing, a type of non-uniform sampling used in public health applications such as infectious disease outbreak investigations, has not been investigated in the molecular epidemiology field. To understand how contact tracing influences a recovered phylogeny, we developed a new simulation tool called SEEPS (Sequence Evolution and Epidemiological Process Simulator) that allows for the simulation of contact tracing and the resulting transmission tree, pathogen phylogeny, and corresponding virus genetic sequences. Importantly, SEEPS takes within-host evolution into account when generating pathogen phylogenies and sequences from transmission histories. Using SEEPS, we demonstrate that contact tracing can significantly impact the structure of the resulting tree, as described by popular tree statistics. Contact tracing generates phylogenies that are less balanced than the underlying transmission process, less representative of the larger epidemiological process, and affects the internal/external branch length ratios that characterize specific epidemiological scenarios. We also examined real data from a 2007-2008 Swedish HIV-1 outbreak and the broader 1998-2010 European HIV-1 epidemic to highlight the differences in contact tracing and expected phylogenies. Aided by SEEPS, we show that the data collection of the Swedish outbreak was strongly influenced by contact tracing even after downsampling, while the broader European Union epidemic showed little evidence of universal contact tracing, agreeing with the known epidemiological information about sampling and spread. Overall, our results highlight the importance of including possible non-uniform sampling schemes when examining phylogenetic trees. For that, SEEPS serves as a useful tool to evaluate such impacts, thereby facilitating better phylogenetic inferences of the characteristics of a disease outbreak. SEEPS is available at https://github.com/MolEvolEpid/SEEPS.
{"title":"Identifying impacts of contact tracing on HIV epidemiological inference from phylogenetic data.","authors":"Michael D Kupperman, Ruian Ke, Thomas Leitner","doi":"10.1093/ve/veaf068","DOIUrl":"10.1093/ve/veaf068","url":null,"abstract":"<p><p>Robust sampling methods are foundational to inferences using phylogenies. Yet the impact of using contact tracing, a type of non-uniform sampling used in public health applications such as infectious disease outbreak investigations, has not been investigated in the molecular epidemiology field. To understand how contact tracing influences a recovered phylogeny, we developed a new simulation tool called SEEPS (Sequence Evolution and Epidemiological Process Simulator) that allows for the simulation of contact tracing and the resulting transmission tree, pathogen phylogeny, and corresponding virus genetic sequences. Importantly, SEEPS takes within-host evolution into account when generating pathogen phylogenies and sequences from transmission histories. Using SEEPS, we demonstrate that contact tracing can significantly impact the structure of the resulting tree, as described by popular tree statistics. Contact tracing generates phylogenies that are less balanced than the underlying transmission process, less representative of the larger epidemiological process, and affects the internal/external branch length ratios that characterize specific epidemiological scenarios. We also examined real data from a 2007-2008 Swedish HIV-1 outbreak and the broader 1998-2010 European HIV-1 epidemic to highlight the differences in contact tracing and expected phylogenies. Aided by SEEPS, we show that the data collection of the Swedish outbreak was strongly influenced by contact tracing even after downsampling, while the broader European Union epidemic showed little evidence of universal contact tracing, agreeing with the known epidemiological information about sampling and spread. Overall, our results highlight the importance of including possible non-uniform sampling schemes when examining phylogenetic trees. For that, SEEPS serves as a useful tool to evaluate such impacts, thereby facilitating better phylogenetic inferences of the characteristics of a disease outbreak. SEEPS is available at https://github.com/MolEvolEpid/SEEPS.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"11 1","pages":"veaf068"},"PeriodicalIF":4.0,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12481701/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145208365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-09-18eCollection Date: 2025-01-01DOI: 10.1093/ve/veaf074
Katherine Brown, Andrew Edwin Firth
Public databases of protein sequences, such as the National Center for Biotechnology Information (NCBI) Protein repository and UniProt, contain millions of proteins identified in samples from specific species but named as uncharacterized or hypothetical due to a lack of information about their function. Many such sequences are actually derived from RNA viruses, either due to viral infection of the original sample, contamination, or endogenous viral elements (EVEs) integrated into the genome of the sample species. Many proteins from RNA virus discovery research are also deposited into these repositories but are labelled as uncharacterized and only classified taxonomically at a superkingdom or realm level. Sequences from protein repositories not labelled specifically as being derived from the RNA-viral RNA-dependent RNA polymerase (RdRp) protein are often used as negative controls when looking to identify viral RdRp sequences, so the presence of unlabelled viruses amongst these datasets is problematic. These sequences also represent a source of information about novel viruses and EVEs. In this study, we screened uncharacterized proteins from two large public protein repositories-NCBI Protein and UniProt-to identify sequences likely to be derived from RNA viral RdRp and to perform detailed characterization of sequences of interest. We identified 3560 such sequences, many derived from EVEs. Many are previously unknown EVEs, which led to characterization of additional, related sequences. For example, a group of orbi-like viruses infecting nematodes was uncovered that appears to have both ancient endogenous and circulating exogenous members. Many integrations of mito-like viruses into plant genomes were also found. In several host taxonomic groups, the first example of an EVE, and in some cases the first example of any RNA virus, was uncovered. The large number of EVEs uncovered by this relatively small-scale search suggests that only a fraction of the true diversity of EVEs is currently known. We also provide provisional taxonomic annotations for RdRps, currently only listed as members of the Riboviria realm. A number of sequences are identified that are indistinguishable from viruses but are labelled as bacteria, seemingly as a result of mislabelling or contamination. Non-RdRp sequences that share near-significant similarity with RdRp are also characterized. Finally, recommendations are made for generating useful negative controls and sets of negative control sequences are provided.
{"title":"Uncovering hundreds of exogenous and endogenous RNA viral RdRp sequences amongst uncharacterized sequences in public protein databases.","authors":"Katherine Brown, Andrew Edwin Firth","doi":"10.1093/ve/veaf074","DOIUrl":"10.1093/ve/veaf074","url":null,"abstract":"<p><p>Public databases of protein sequences, such as the National Center for Biotechnology Information (NCBI) Protein repository and UniProt, contain millions of proteins identified in samples from specific species but named as uncharacterized or hypothetical due to a lack of information about their function. Many such sequences are actually derived from RNA viruses, either due to viral infection of the original sample, contamination, or endogenous viral elements (EVEs) integrated into the genome of the sample species. Many proteins from RNA virus discovery research are also deposited into these repositories but are labelled as uncharacterized and only classified taxonomically at a superkingdom or realm level. Sequences from protein repositories not labelled specifically as being derived from the RNA-viral RNA-dependent RNA polymerase (RdRp) protein are often used as negative controls when looking to identify viral RdRp sequences, so the presence of unlabelled viruses amongst these datasets is problematic. These sequences also represent a source of information about novel viruses and EVEs. In this study, we screened uncharacterized proteins from two large public protein repositories-NCBI Protein and UniProt-to identify sequences likely to be derived from RNA viral RdRp and to perform detailed characterization of sequences of interest. We identified 3560 such sequences, many derived from EVEs. Many are previously unknown EVEs, which led to characterization of additional, related sequences. For example, a group of orbi-like viruses infecting nematodes was uncovered that appears to have both ancient endogenous and circulating exogenous members. Many integrations of mito-like viruses into plant genomes were also found. In several host taxonomic groups, the first example of an EVE, and in some cases the first example of any RNA virus, was uncovered. The large number of EVEs uncovered by this relatively small-scale search suggests that only a fraction of the true diversity of EVEs is currently known. We also provide provisional taxonomic annotations for RdRps, currently only listed as members of the <i>Riboviria</i> realm. A number of sequences are identified that are indistinguishable from viruses but are labelled as bacteria, seemingly as a result of mislabelling or contamination. Non-RdRp sequences that share near-significant similarity with RdRp are also characterized. Finally, recommendations are made for generating useful negative controls and sets of negative control sequences are provided.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"11 1","pages":"veaf074"},"PeriodicalIF":4.0,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12548735/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145379585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-09-17eCollection Date: 2025-01-01DOI: 10.1093/ve/veaf072
Thomas Höfler, Michaela Zeitlow, Ji Y Kim, Emanuel Wyler, Jakob Trimpert
Glycoproteins cover the surface of enveloped viruses such as herpes simplex virus 1 (HSV-1). Whilst essential for cellular attachment and entry, they also are excellent targets for host immune responses. This dichotomy culminates in an evolutionary struggle in which receptor recognition and immune escape are intricately balanced. Herpesviruses feature a variety of different glycoproteins with diverse molecular functions. Here, we describe the rapid evolution of HSV-1 towards syncytial plaque phenotypes in Vero cell culture, as well as anti-gD antibody resistance in human foreskin fibroblast cells. Using a mild hypermutator virus to accelerate experimental evolution, we identified multiple genetic variants leading to syncytial plaques. Strikingly, these variants differentially affect interactions within viral populations. Whilst gK mutants engage in collective syncytia formation upon entry, accelerate superinfection exclusion and maintain fitness advantages at high multiplicities of infection, gB and gD mutants do not. Furthermore, we find gE mutants which lead to mouse anti-gD antibody resistance and cross protect wt virus in mixed populations. Our findings suggest complex social interactions within herpesvirus populations and illustrate the evolutionary plasticity and diverse function of their glycoproteins.
{"title":"Rapid glycoprotein evolution enables variant interactions in herpes simplex virus type 1.","authors":"Thomas Höfler, Michaela Zeitlow, Ji Y Kim, Emanuel Wyler, Jakob Trimpert","doi":"10.1093/ve/veaf072","DOIUrl":"10.1093/ve/veaf072","url":null,"abstract":"<p><p>Glycoproteins cover the surface of enveloped viruses such as herpes simplex virus 1 (HSV-1). Whilst essential for cellular attachment and entry, they also are excellent targets for host immune responses. This dichotomy culminates in an evolutionary struggle in which receptor recognition and immune escape are intricately balanced. Herpesviruses feature a variety of different glycoproteins with diverse molecular functions. Here, we describe the rapid evolution of HSV-1 towards syncytial plaque phenotypes in Vero cell culture, as well as anti-gD antibody resistance in human foreskin fibroblast cells. Using a mild hypermutator virus to accelerate experimental evolution, we identified multiple genetic variants leading to syncytial plaques. Strikingly, these variants differentially affect interactions within viral populations. Whilst gK mutants engage in collective syncytia formation upon entry, accelerate superinfection exclusion and maintain fitness advantages at high multiplicities of infection, gB and gD mutants do not. Furthermore, we find gE mutants which lead to mouse anti-gD antibody resistance and cross protect wt virus in mixed populations. Our findings suggest complex social interactions within herpesvirus populations and illustrate the evolutionary plasticity and diverse function of their glycoproteins.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"11 1","pages":"veaf072"},"PeriodicalIF":4.0,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12499918/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145245840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-09-16eCollection Date: 2025-01-01DOI: 10.1093/ve/veaf069
Gayle F Petersen, Silvia Pavan, Daryl Ariawan, Ole Tietz, Sepehr Nematollahzadeh, Subir Sarker, Jade K Forwood, Gualtiero Alvisi
Anelloviridae members are ubiquitous viruses with a small, negative sense, single-stranded DNA genome which is replicated by host cell DNA polymerases. Anelloviruses are postulated to interact with the host cell nuclear transport machinery, however, the lack of reliable cell culture models strongly limits our knowledge regarding Anelloviridae-host interactions. In particular, capsid nuclear import is a largely uncharacterized process. We addressed this by investigating the relationship between host cell nuclear transport receptors (NTRs) and ORF1, the putative capsid protein from torque teno douroucouli virus (TTDoV). We identified the subcellular targeting signals and NTRs responsible for its nucleolar and nuclear localization, and characterized their relative contribution to ORF1 subcellular localization. In the absence of other viral proteins, ORF1 accumulated in the nucleoli. Bioinformatics analysis revealed a putative classical nuclear localization signal (cNLS) within the highly conserved N-terminal arginine rich motif (ARM) ('NLSn', 27-RRWRRRPRRRRRPYRRRPYRRYGRRRKVRRR-57), and an additional C-terminal cNLS ('NLSc', 632-LPPPEKRARWGF-643), which has been specifically acquired by Anelloviridae capsids with larger projection domains. Such NLSs play distinct roles in ORF1 subcellular localization by interacting with specific NTRs. NLSn, a non-classical NLS, features broad importin (IMP) binding affinity yet plays a minor role in nuclear import, being responsible for nucleolar targeting likely through interaction with nucleolar components. NLSc, a bona fide cNLS, specifically interacts with IMPα and is the main driver of active nuclear transport in an IMPα/β1-dependent fashion. These findings suggest an evolutionary correlation between the acquisition of progressively larger projection domains and the presence of additional cNLSs in Anelloviridae capsids, aimed at maximizing IMPα/β1-mediated nuclear import.
{"title":"Nuclear trafficking of <i>Anelloviridae</i> capsid protein ORF1 reflects modular evolution of subcellular targeting signals.","authors":"Gayle F Petersen, Silvia Pavan, Daryl Ariawan, Ole Tietz, Sepehr Nematollahzadeh, Subir Sarker, Jade K Forwood, Gualtiero Alvisi","doi":"10.1093/ve/veaf069","DOIUrl":"10.1093/ve/veaf069","url":null,"abstract":"<p><p><i>Anelloviridae</i> members are ubiquitous viruses with a small, negative sense, single-stranded DNA genome which is replicated by host cell DNA polymerases. Anelloviruses are postulated to interact with the host cell nuclear transport machinery, however, the lack of reliable cell culture models strongly limits our knowledge regarding <i>Anelloviridae</i>-host interactions. In particular, capsid nuclear import is a largely uncharacterized process. We addressed this by investigating the relationship between host cell nuclear transport receptors (NTRs) and ORF1, the putative capsid protein from torque teno douroucouli virus (TTDoV). We identified the subcellular targeting signals and NTRs responsible for its nucleolar and nuclear localization, and characterized their relative contribution to ORF1 subcellular localization. In the absence of other viral proteins, ORF1 accumulated in the nucleoli. Bioinformatics analysis revealed a putative classical nuclear localization signal (cNLS) within the highly conserved N-terminal arginine rich motif (ARM) ('NLSn', 27-RRWRRRPRRRRRPYRRRPYRRYGRRRKVRRR-57), and an additional C-terminal cNLS ('NLSc', 632-LPPPEKRARWGF-643), which has been specifically acquired by <i>Anelloviridae</i> capsids with larger projection domains. Such NLSs play distinct roles in ORF1 subcellular localization by interacting with specific NTRs. NLSn, a non-classical NLS, features broad importin (IMP) binding affinity yet plays a minor role in nuclear import, being responsible for nucleolar targeting likely through interaction with nucleolar components. NLSc, a <i>bona fide</i> cNLS, specifically interacts with IMPα and is the main driver of active nuclear transport in an IMPα/β1-dependent fashion. These findings suggest an evolutionary correlation between the acquisition of progressively larger projection domains and the presence of additional cNLSs in <i>Anelloviridae</i> capsids, aimed at maximizing IMPα/β1-mediated nuclear import.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"11 1","pages":"veaf069"},"PeriodicalIF":4.0,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12486385/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145214531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-09-06eCollection Date: 2025-01-01DOI: 10.1093/ve/veaf066
Arianna Bregoli, Dennis Benedetti, Mattia Calzolari, Roberto Benevenia, Antonello Di Nardo, Anna Castelli, Manuel Corsa, Santina Grazioli, Chiara Chiapponi, Emiliana Brocchi, Giulia Pezzoni
Swine vesicular disease virus (SVDV) was considered endemic in Italy until 2015, when eradication was achieved. From 1992, when the most recent antigenic-genomic variant of SVDV was identified, to 2015, ~685 SVD outbreaks occurred. In order to understand the evolutionary drivers of SVDV in Italy, 152 viral strains were selected and whole-genome sequencing was performed. Comparison of the polyprotein coding region revealed a minimum pairwise identity of 85% and 98% at the nucleotide and amino acid levels, respectively, consistent with the antigenic homogeneity of the analysed strains. Phylogenetic analysis confirmed the presence of two sublineages: sublineage 1, which evolved and circulated specifically in Italy since 1995, and sublineage 2, which also includes strains circulating in both Spain and Portugal in 1993 and in Portugal during 2003. Recombination analysis revealed a breakpoint site within the 2B coding region, resulting in a recombination strain originating from viruses belonging to the two sublineages, which was dated to the beginning of 2008. This single recombination event gave rise to at least 20 recombinant strains, which circulated together with their parental strains until 2010 and from then on circulated on their own until eradication in 2015. Our data showed that, apart from the recombination event, SVDV, which has been present on Italian territory for >20 years, has not been subject to positive selective pressures that would have conferred a possible evolutionary advantage.
{"title":"Genetic evolution of swine vesicular disease viruses circulating in Italy from 1992 to the eradication in 2015 and emergence of a recent recombinant strain.","authors":"Arianna Bregoli, Dennis Benedetti, Mattia Calzolari, Roberto Benevenia, Antonello Di Nardo, Anna Castelli, Manuel Corsa, Santina Grazioli, Chiara Chiapponi, Emiliana Brocchi, Giulia Pezzoni","doi":"10.1093/ve/veaf066","DOIUrl":"10.1093/ve/veaf066","url":null,"abstract":"<p><p>Swine vesicular disease virus (SVDV) was considered endemic in Italy until 2015, when eradication was achieved. From 1992, when the most recent antigenic-genomic variant of SVDV was identified, to 2015, ~685 SVD outbreaks occurred. In order to understand the evolutionary drivers of SVDV in Italy, 152 viral strains were selected and whole-genome sequencing was performed. Comparison of the polyprotein coding region revealed a minimum pairwise identity of 85% and 98% at the nucleotide and amino acid levels, respectively, consistent with the antigenic homogeneity of the analysed strains. Phylogenetic analysis confirmed the presence of two sublineages: sublineage 1, which evolved and circulated specifically in Italy since 1995, and sublineage 2, which also includes strains circulating in both Spain and Portugal in 1993 and in Portugal during 2003. Recombination analysis revealed a breakpoint site within the 2B coding region, resulting in a recombination strain originating from viruses belonging to the two sublineages, which was dated to the beginning of 2008. This single recombination event gave rise to at least 20 recombinant strains, which circulated together with their parental strains until 2010 and from then on circulated on their own until eradication in 2015. Our data showed that, apart from the recombination event, SVDV, which has been present on Italian territory for >20 years, has not been subject to positive selective pressures that would have conferred a possible evolutionary advantage.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"11 1","pages":"veaf066"},"PeriodicalIF":4.0,"publicationDate":"2025-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12476138/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145187541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-09-05eCollection Date: 2025-01-01DOI: 10.1093/ve/veaf067
Jackie E Mahar, Jonathon C O Mifsud, Kate Van Brussel, Anna E Lachenauer, Erin Harvey, Olivia M H Turnbull, Stefanie Bonat, Thomas M Newsome, Annabelle Olsson, Antje Chiu-Werner, Menna E Jones, Edward C Holmes, Solomon Maina
Ribgrass mosaic virus (RMV) and related viruses of the genus Tobamovirus (family Virgaviridae) are cruciferous plant pathogens that represent a threat to global horticultural systems. In Australia, they are considered exotic biosecurity threats, and an incursion of these viruses would require rapid and strict control efforts. However, current surveillance methods for these viruses are limited. We examined whether RMV and related tobamoviruses could be detected through the analysis of mammalian gut metatranscriptomes. Accordingly, we identified five different tobamoviruses in one or more gut metatranscriptomes of the Eastern grey kangaroo, fallow deer, domestic dog, spotted-tailed quoll, feral cat, and the Tasmanian devil. One of the tobamoviruses was also detected in a tick metatranscriptome. The five tobamoviruses detected were: (i) RMV, (ii) a novel relative of RMV, (iii and iv) two highly diverse novel tobamoviruses, and (v) the plant pathogen tobacco mild green mosaic virus (TMGMV) already known to be present in Australia. Subsequent phylogenetic analysis provided information on their origin and spread through Australia. RMV was detected at multiple sites in both the Australian Capital Territory (ACT) and Tasmania, two regions separated by ~700 km of land and 200 km of water. The novel relative of RMV was detected in the ACT and New South Wales (NSW), the highly divergent novel tobamoviruses were each detected in a single state (NSW and Queensland, QLD), while TMGMV was detected in QLD. This work highlights the potential utility of metatranscriptomic sequencing of wild animal gut samples for the surveillance of biosecurity threats to native and agricultural plant species and for studying their evolution in new environments.
脊草花叶病毒(RMV)和相关的托巴莫病毒属病毒是十字花科植物病原体,对全球园艺系统构成威胁。在澳大利亚,它们被认为是外来的生物安全威胁,这些病毒的入侵需要迅速和严格的控制努力。然而,目前对这些病毒的监测方法是有限的。我们通过分析哺乳动物肠道亚转录组来检测RMV和相关的乙型肝炎病毒。因此,我们在东部灰袋鼠、黇鹿、家狗、斑尾袋鼬、野猫和袋獾的一个或多个肠道亚转录组中鉴定出五种不同的多巴胺病毒。在蜱虫亚转录组中也检测到其中一种乙型病毒。检测到的五种烟叶病毒是:(i)烟叶病毒,(ii)烟叶病毒的一种新亲缘病毒,(iii和iv)两种高度多样化的新型烟叶病毒,以及(v)已知存在于澳大利亚的植物病原体烟草温和绿花叶病毒(TMGMV)。随后的系统发育分析提供了它们起源和在澳大利亚传播的信息。在澳大利亚首都地区(ACT)和塔斯马尼亚州的多个地点都检测到RMV,这两个地区相距约700公里的陆地和200公里的水域。在澳大利亚首都领地(ACT)和新南威尔士州(New South Wales, NSW)检测到RMV的新亲缘病毒,在新南威尔士州(NSW)和昆士兰州(Queensland, QLD)各检测到高度分化的新型托巴莫病毒,而在昆士兰州(QLD)检测到TMGMV。这项工作强调了野生动物肠道样本的亚转录组测序在监测本地和农业植物物种的生物安全威胁以及研究它们在新环境中的进化方面的潜在效用。
{"title":"Plant biosecurity threats detected using metatranscriptomic sequencing of animal gut contents.","authors":"Jackie E Mahar, Jonathon C O Mifsud, Kate Van Brussel, Anna E Lachenauer, Erin Harvey, Olivia M H Turnbull, Stefanie Bonat, Thomas M Newsome, Annabelle Olsson, Antje Chiu-Werner, Menna E Jones, Edward C Holmes, Solomon Maina","doi":"10.1093/ve/veaf067","DOIUrl":"10.1093/ve/veaf067","url":null,"abstract":"<p><p>Ribgrass mosaic virus (RMV) and related viruses of the genus <i>Tobamovirus</i> (family <i>Virgaviridae</i>) are cruciferous plant pathogens that represent a threat to global horticultural systems. In Australia, they are considered exotic biosecurity threats, and an incursion of these viruses would require rapid and strict control efforts. However, current surveillance methods for these viruses are limited. We examined whether RMV and related tobamoviruses could be detected through the analysis of mammalian gut metatranscriptomes. Accordingly, we identified five different tobamoviruses in one or more gut metatranscriptomes of the Eastern grey kangaroo, fallow deer, domestic dog, spotted-tailed quoll, feral cat, and the Tasmanian devil. One of the tobamoviruses was also detected in a tick metatranscriptome. The five tobamoviruses detected were: (i) RMV, (ii) a novel relative of RMV, (iii and iv) two highly diverse novel tobamoviruses, and (v) the plant pathogen tobacco mild green mosaic virus (TMGMV) already known to be present in Australia. Subsequent phylogenetic analysis provided information on their origin and spread through Australia. RMV was detected at multiple sites in both the Australian Capital Territory (ACT) and Tasmania, two regions separated by ~700 km of land and 200 km of water. The novel relative of RMV was detected in the ACT and New South Wales (NSW), the highly divergent novel tobamoviruses were each detected in a single state (NSW and Queensland, QLD), while TMGMV was detected in QLD. This work highlights the potential utility of metatranscriptomic sequencing of wild animal gut samples for the surveillance of biosecurity threats to native and agricultural plant species and for studying their evolution in new environments.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"11 1","pages":"veaf067"},"PeriodicalIF":4.0,"publicationDate":"2025-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12461698/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145187562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-08-30eCollection Date: 2025-01-01DOI: 10.1093/ve/veaf065
Sanni Översti, Emily Gaul, Björn-Erik Ole Jensen, Denise Kühnert
Time-stamped genomic sequences from rapidly evolving pathogens can be used to estimate the rates of evolution through molecular tip-dating. The validity of this approach, however, depends on whether detectable levels of genetic variation have accumulated over the given sampling interval, generating a temporal signal. Moreover, molecular dating methods have demonstrated varying degrees of systematic biases under different biologically realistic scenarios, such as the presence of phylo-temporal clustering. Persistent SARS-CoV-2 infections in immunocompromised individuals have been linked to accelerated intrahost molecular rates compared to those of global lineages, facilitating the emergence of novel viral lineages. Yet, studies reporting elevated rates lack assessment of data properties, such as evaluation of temporal signal and comparison of multiple methods of inference, both crucial for robust rate estimation. In this study, we applied a range of molecular dating approaches to reassess the rate of SARS-CoV-2 intrahost evolution in immunocompromised individuals using publicly available datasets. Our findings suggest that even during long-term infections, the limited number of genetic changes accumulating may pose a challenge for robust inference of within-host evolutionary rates, particularly when relying on consensus sequences and when datasets are small or unevenly sampled. Moreover, our results highlight that when certain methodological limitations are overlooked, evolutionary rates can be significantly overestimated. In general, our findings demonstrate that estimating within-host evolutionary rates is a challenging question necessitating thorough assessment of data quality, careful selection of appropriate methods, and cautious interpretation of the resulting estimates. Whereas our phylogenetic analyses of viral consensus sequences provide no evidence of elevated evolutionary rates across the complete genome during chronic SARS-CoV-2 infection, prolonged viral shedding may nevertheless promote the emergence of new viral variants in immunocompromised individuals.
{"title":"Phylogenetic meta-analysis of persistent SARS-CoV-2 infections in immunocompromised individuals highlights the challenges of robust evolutionary rate estimation caused by low genetic diversity.","authors":"Sanni Översti, Emily Gaul, Björn-Erik Ole Jensen, Denise Kühnert","doi":"10.1093/ve/veaf065","DOIUrl":"10.1093/ve/veaf065","url":null,"abstract":"<p><p>Time-stamped genomic sequences from rapidly evolving pathogens can be used to estimate the rates of evolution through molecular tip-dating. The validity of this approach, however, depends on whether detectable levels of genetic variation have accumulated over the given sampling interval, generating a temporal signal. Moreover, molecular dating methods have demonstrated varying degrees of systematic biases under different biologically realistic scenarios, such as the presence of phylo-temporal clustering. Persistent SARS-CoV-2 infections in immunocompromised individuals have been linked to accelerated intrahost molecular rates compared to those of global lineages, facilitating the emergence of novel viral lineages. Yet, studies reporting elevated rates lack assessment of data properties, such as evaluation of temporal signal and comparison of multiple methods of inference, both crucial for robust rate estimation. In this study, we applied a range of molecular dating approaches to reassess the rate of SARS-CoV-2 intrahost evolution in immunocompromised individuals using publicly available datasets. Our findings suggest that even during long-term infections, the limited number of genetic changes accumulating may pose a challenge for robust inference of within-host evolutionary rates, particularly when relying on consensus sequences and when datasets are small or unevenly sampled. Moreover, our results highlight that when certain methodological limitations are overlooked, evolutionary rates can be significantly overestimated. In general, our findings demonstrate that estimating within-host evolutionary rates is a challenging question necessitating thorough assessment of data quality, careful selection of appropriate methods, and cautious interpretation of the resulting estimates. Whereas our phylogenetic analyses of viral consensus sequences provide no evidence of elevated evolutionary rates across the complete genome during chronic SARS-CoV-2 infection, prolonged viral shedding may nevertheless promote the emergence of new viral variants in immunocompromised individuals.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"11 1","pages":"veaf065"},"PeriodicalIF":4.0,"publicationDate":"2025-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12477587/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145202226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-08-23eCollection Date: 2025-01-01DOI: 10.1093/ve/veaf064
Stephanie Bellman, Nima Shariatzadeh, Tyshawn Ferrell, Audrey Long, Leah Aeschleman, Ellie Fausett, Tim Walsh, Seana Cleary, Isabella Roeske, Erick Ojeda, Madison Schwab, Hannah Dakanay, Sam R Telford Iii, Heidi K Goethert, Gonzalo Vazquez-Prokopec, Anne Piantadosi
Heartland virus (HRTV) is an emerging tick-borne virus associated with severe illness in the USA. There are large gaps in knowledge of HRTV diversity, evolution, and transmission due to a paucity of HRTV-positive samples and genome sequences. We identified a focal site of HRTV-positive Amblyomma americanum ticks in central Georgia and developed a novel multiplex-amplicon sequencing assay to generate full HRTV genome sequences. By screening over 21 000 field-collected ticks from 2021 to 2023, we identified six positive pools. Five were collected from the site in central Georgia where our group first detected HRTV-positive ticks in 2019 and one from a site in western Georgia ~175 km away. The HRTV genome sequences from Georgia were highly related, even across this distance and over five years. Reference HRTV genome sequences from across the USA were also geographically clustered. Time-scaled phylogenetic analysis suggested a recent spread of HRTV in the USA, with all available sequences sharing a common ancestor within the last 75 years, since the mid-1900s, and sequences from Georgia sharing a common ancestor within the last 15 years, since 2010. Our observed spatial clustering of HRTV and the high degree of genetic conservation in our persistent focus suggest the importance of small spatial dynamics in HRTV transmission ecology.
{"title":"Focal persistence and phylodynamics of Heartland virus in Georgia.","authors":"Stephanie Bellman, Nima Shariatzadeh, Tyshawn Ferrell, Audrey Long, Leah Aeschleman, Ellie Fausett, Tim Walsh, Seana Cleary, Isabella Roeske, Erick Ojeda, Madison Schwab, Hannah Dakanay, Sam R Telford Iii, Heidi K Goethert, Gonzalo Vazquez-Prokopec, Anne Piantadosi","doi":"10.1093/ve/veaf064","DOIUrl":"10.1093/ve/veaf064","url":null,"abstract":"<p><p>Heartland virus (HRTV) is an emerging tick-borne virus associated with severe illness in the USA. There are large gaps in knowledge of HRTV diversity, evolution, and transmission due to a paucity of HRTV-positive samples and genome sequences. We identified a focal site of HRTV-positive <i>Amblyomma americanum</i> ticks in central Georgia and developed a novel multiplex-amplicon sequencing assay to generate full HRTV genome sequences. By screening over 21 000 field-collected ticks from 2021 to 2023, we identified six positive pools. Five were collected from the site in central Georgia where our group first detected HRTV-positive ticks in 2019 and one from a site in western Georgia ~175 km away. The HRTV genome sequences from Georgia were highly related, even across this distance and over five years. Reference HRTV genome sequences from across the USA were also geographically clustered. Time-scaled phylogenetic analysis suggested a recent spread of HRTV in the USA, with all available sequences sharing a common ancestor within the last 75 years, since the mid-1900s, and sequences from Georgia sharing a common ancestor within the last 15 years, since 2010. Our observed spatial clustering of HRTV and the high degree of genetic conservation in our persistent focus suggest the importance of small spatial dynamics in HRTV transmission ecology.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"11 1","pages":"veaf064"},"PeriodicalIF":4.0,"publicationDate":"2025-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12456176/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145139503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-08-23eCollection Date: 2025-01-01DOI: 10.1093/ve/veaf061
Christian Oehlmann, Jiangbin Fan, Michael J Rihlmann, Hannes Lemme, Jörg Müller, Birgit Ruoff, Jörg T Wennmann, Johannes A Jehle
Amplicon-based next-generation sequencing (aNGS) is a powerful tool in diagnostics and genetic studies. We developed an aNGS approach to study the population structure of the Lymantria dispar multiple nucleopolyhedrovirus (LdMNPV), a specific pathogen of the spongy moth Lymantria dispar, a devastating lepidopteran pest in European, Asian, and American deciduous forests. Naturally occurring pathogens, such as LdMNPV, are frequently reported to cause epizootics and a rapid decline of insect pest populations. DNA samples of pooled LdMNPV-infected larvae from forest regions in Northern Bavaria (Germany) were subjected to whole genome sequencing (WGS) and aNGS optimization. Then, five marker regions were identified in the genome of LdMNPV for PCR amplification, covering 21 highly specific single-nucleotide polymorphism (SNP) positions that enabled comprehensive analysis at the intra- and intersample levels. These markers were used in aNGS analyses of 70 single larvae collected in 12 forest sites, followed by SNP-based hierarchical clustering on principal components (HCPC). This approach identified three LdMNPV population clusters consisting of homogenous (pure) and heterogeneous (mixed) LdMNPV samples. To explain the genetic variability within each sample, a model based on linear optimization was developed and validated by comparing the predictions from aNGS and WGS data. The analyses showed that LdMNPV from Bavarian forests carried genetic variants highly similar to those present in the commercial product Gypchek®, developed for biocontrol. The distribution of genetic characteristics showed some trends of geographic and temporal prevalence, which are indicative of short-distance and long-distance transmission. The aNGS approach offers a fast, cost-effective, and comprehensive insight into the natural population structure of LdMNPV.
{"title":"Amplicon-based analyses of single-nucleotide polymorphisms reveal the genetic structure of a forest insect baculovirus.","authors":"Christian Oehlmann, Jiangbin Fan, Michael J Rihlmann, Hannes Lemme, Jörg Müller, Birgit Ruoff, Jörg T Wennmann, Johannes A Jehle","doi":"10.1093/ve/veaf061","DOIUrl":"10.1093/ve/veaf061","url":null,"abstract":"<p><p>Amplicon-based next-generation sequencing (aNGS) is a powerful tool in diagnostics and genetic studies. We developed an aNGS approach to study the population structure of the Lymantria dispar multiple nucleopolyhedrovirus (LdMNPV), a specific pathogen of the spongy moth <i>Lymantria dispar</i>, a devastating lepidopteran pest in European, Asian, and American deciduous forests. Naturally occurring pathogens, such as LdMNPV, are frequently reported to cause epizootics and a rapid decline of insect pest populations. DNA samples of pooled LdMNPV-infected larvae from forest regions in Northern Bavaria (Germany) were subjected to whole genome sequencing (WGS) and aNGS optimization. Then, five marker regions were identified in the genome of LdMNPV for PCR amplification, covering 21 highly specific single-nucleotide polymorphism (SNP) positions that enabled comprehensive analysis at the intra- and intersample levels. These markers were used in aNGS analyses of 70 single larvae collected in 12 forest sites, followed by SNP-based hierarchical clustering on principal components (HCPC). This approach identified three LdMNPV population clusters consisting of homogenous (pure) and heterogeneous (mixed) LdMNPV samples. To explain the genetic variability within each sample, a model based on linear optimization was developed and validated by comparing the predictions from aNGS and WGS data. The analyses showed that LdMNPV from Bavarian forests carried genetic variants highly similar to those present in the commercial product Gypchek®, developed for biocontrol. The distribution of genetic characteristics showed some trends of geographic and temporal prevalence, which are indicative of short-distance and long-distance transmission. The aNGS approach offers a fast, cost-effective, and comprehensive insight into the natural population structure of LdMNPV.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"11 1","pages":"veaf061"},"PeriodicalIF":4.0,"publicationDate":"2025-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12522038/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145309871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-08-22eCollection Date: 2025-01-01DOI: 10.1093/ve/veaf063
Cinzia Borgogna, Ilaria Cislaghi, Sarah Turati, Alessandra Mozzi, Diego Forni, Rachele Cagliani, Manuela Sironi, Marisa Gariglio
Betacoronaviruses (β-CoVs) display divergent mechanisms to evade host antiviral responses, yet the evolutionary origin and functional relevance of their strategies remain unclear. Stress granules (SGs), central to host defenses, are disrupted by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) nucleocapsid (N) protein via its interaction with G3BP1 mediated by an 15ITFG18 motif. This interaction inhibits SG assembly, enhancing viral replication and immune evasion. Here, we analyzed 179 N protein sequences across β-CoVs subgenera and identified the ITFG motif in sarbecoviruses but not in MERS-CoV or embecoviruses. Among tested CoVs, only SARS-CoV-2 N showed robust G3BP1 binding in vitro and in infected cells. Phylogenetic analyses revealed that the ΦxFG motif emerged independently in sarbecoviruses and a bat-infecting merbecovirus clade. The VGTF motif in these merbecoviruses also binds to G3BP1, suggesting convergent evolution in viral evasion strategies. The emergence of this motif was unrelated to 4a protein inactivation, another viral protein that inhibits SG formation.
{"title":"Convergent evolution of the G3BP1-binding motif in betacoronavirus nucleocapsid proteins.","authors":"Cinzia Borgogna, Ilaria Cislaghi, Sarah Turati, Alessandra Mozzi, Diego Forni, Rachele Cagliani, Manuela Sironi, Marisa Gariglio","doi":"10.1093/ve/veaf063","DOIUrl":"10.1093/ve/veaf063","url":null,"abstract":"<p><p>Betacoronaviruses (β-CoVs) display divergent mechanisms to evade host antiviral responses, yet the evolutionary origin and functional relevance of their strategies remain unclear. Stress granules (SGs), central to host defenses, are disrupted by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) nucleocapsid (N) protein <i>via</i> its interaction with G3BP1 mediated by an <sup>15</sup>ITFG<sup>18</sup> motif. This interaction inhibits SG assembly, enhancing viral replication and immune evasion. Here, we analyzed 179 N protein sequences across β-CoVs subgenera and identified the ITFG motif in sarbecoviruses but not in MERS-CoV or embecoviruses. Among tested CoVs, only SARS-CoV-2 N showed robust G3BP1 binding <i>in vitro</i> and in infected cells. Phylogenetic analyses revealed that the ΦxFG motif emerged independently in sarbecoviruses and a bat-infecting merbecovirus clade. The VGTF motif in these merbecoviruses also binds to G3BP1, suggesting convergent evolution in viral evasion strategies. The emergence of this motif was unrelated to 4a protein inactivation, another viral protein that inhibits SG formation.</p>","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"11 1","pages":"veaf063"},"PeriodicalIF":4.0,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12421999/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145042373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}