detectEVE: Fast, Sensitive and Precise Detection of Endogenous Viral Elements in Genomic Data.

Molecular Ecology Resources (IF 5.5; Zone 1, Biology; JCR Q1, Biochemistry & Molecular Biology) Pub Date: 2025-02-12 DOI: 10.1111/1755-0998.14083
Nadja Brait, Thomas Hackl, Sebastian Lequime
Citations: 0

Abstract

Endogenous viral elements (EVEs) are fragments of viral genomic material embedded within the host genome. Retroviruses account for the majority of EVEs because genomic integration is part of their life cycle; however, EVEs can also arise from non-retroviral RNA or DNA viruses, in which case they are collectively known as non-retroviral EVEs (nrEVEs). Detecting nrEVEs is challenging because of their diversity in sequence and genomic structure, which has contributed to the scarcity of tools designed specifically for nrEVE detection. Here, we introduce detectEVE, a user-friendly, open-source tool for the accurate identification of nrEVEs in genomic assemblies. detectEVE departs from other nrEVE detection pipelines, which usually classify sequences rigidly as either virus-associated or not. Instead, we implemented a scaling system that assigns confidence scores to hits in protein sequence similarity searches, using bit score distributions and search hints related to various viral characteristics, allowing for higher sensitivity and specificity. Our benchmarking shows that detectEVE is computationally efficient and accurate, and considerably faster than existing approaches because of its resource-efficient parallel execution. Our tool can help fill current gaps in both host-associated fields and virus-related studies. This includes (i) enhancing genome annotations with metadata for EVE loci, (ii) conducting large-scale paleo-virological studies to explore deep viral evolutionary histories, and (iii) aiding in the identification of actively expressed EVEs in transcriptomic data, reducing the risk of confusing exogenous viruses with EVEs.
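
The abstract contrasts a rigid viral/non-viral call with graded confidence scores derived from bit scores and search hints, but does not give the formula. The sketch below only illustrates that general idea and is not detectEVE's actual scoring. The `Hit` class, the hint words, the "share of bit score" rule, and the thresholds are all assumptions made up for this example.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    subject: str      # description line of the matched protein
    bitscore: float   # bit score reported by the similarity search


# Hypothetical hint words; the abstract does not list the real ones.
VIRAL_HINTS = ("virus", "viral", "phage")


def looks_viral(hit: Hit) -> bool:
    """Crude stand-in for 'search hints': match hint words in the description."""
    text = hit.subject.lower()
    return any(word in text for word in VIRAL_HINTS)


def confidence(hits: list[Hit]) -> float:
    """Share of the total bit score that comes from viral-looking hits (0.0 to 1.0)."""
    total = sum(h.bitscore for h in hits)
    if total <= 0:
        return 0.0
    viral = sum(h.bitscore for h in hits if looks_viral(h))
    return viral / total


def classify(score: float, high: float = 0.8, low: float = 0.3) -> str:
    """Map a score onto graded labels instead of a yes/no call (thresholds are made up)."""
    if score >= high:
        return "high-confidence"
    if score >= low:
        return "low-confidence"
    return "rejected"


if __name__ == "__main__":
    # Invented hits for one candidate locus, for illustration only.
    locus_hits = [
        Hit("nucleoprotein [hypothetical example virus]", 300.0),
        Hit("hypothetical protein [example host species]", 100.0),
    ]
    score = confidence(locus_hits)
    print(score, classify(score))
```

With these invented hits, 300 of the 400 total bits come from the viral-looking hit, so the locus scores 0.75 and is kept as a lower-confidence candidate rather than being discarded by a binary filter.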
