Fast algorithms for recognizing retroviruses

W. Ashlock, S. Datta
Published in: 2010 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS), 2010-11-01
DOI: 10.1109/GENSIPS.2010.5719668 (https://doi.org/10.1109/GENSIPS.2010.5719668)
Citation count: 2

Abstract

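Retroviruses have important roles to play in medicine, evolution, and biology. A key step towards understanding the effect of retroviruses on hosts is identifying them in the host genome. Detecting retroviruses using sequence alignment is difficult because they are very diverse and have high mutation rates. We propose a fast, accurate algorithm for detecting retroviruses that uses supervised machine learning and three sets of features. One set of novel features identifies the characteristic reading frame structure of retroviruses. The other two sets include features that have been used by other researchers for exon finding. Our algorithm distinguishes retroviral genomes from non-coding sequences, and endogenous retroviruses from both non-coding sequences and genes, with high accuracy. It also distinguishes endogenous retroviruses from intact retroviral genomes, and lentiviruses from other retroviruses, again with high accuracy.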
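The abstract does not spell out how the reading frame features are computed. As a rough illustration of what a feature of this kind could look like, the sketch below measures the longest stop-free run of codons in each of the three forward reading frames. The function names, the choice of statistic, and the use of the standard stop codons are all assumptions made for this example. It is not the authors' feature set.

```python
# Hypothetical reading-frame feature, for illustration only.
# Assumption: the standard genetic code stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_open_stretch(seq: str, frame: int) -> int:
    """Return the length, in codons, of the longest stop-free run
    when seq is read in the given forward frame (0, 1, or 2)."""
    best = 0
    current = 0
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(frame, len(seq) - 2, 3):
        codon = seq[i:i + 3].upper()
        if codon in STOP_CODONS:
            current = 0  # a stop codon ends the current open stretch
        else:
            current += 1
            best = max(best, current)
    return best


def reading_frame_features(seq: str) -> list[int]:
    """One feature per forward frame: the longest stop-free run in codons."""
    return [longest_open_stretch(seq, frame) for frame in range(3)]


if __name__ == "__main__":
    print(reading_frame_features("ATGAAATAGCCC"))
```

A vector like this, computed per sequence, could be passed to any supervised classifier along with other features. Which classifier and which additional features the paper uses is not stated in the abstract.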