Rapid Real-time Squiggle Classification for Read until using RawMap.

Archives of clinical and biomedical research Pub Date : 2023-01-01 Epub Date: 2023-01-28 DOI:10.26502/acbr.50170318
Harisankar Sadasivan, Jack Wadden, Kush Goliya, Piyush Ranjan, Robert P Dickson, David Blaauw, Reetuparna Das, Satish Narayanasamy
{"title":"Rapid Real-time Squiggle Classification for Read until using RawMap.","authors":"Harisankar Sadasivan,&nbsp;Jack Wadden,&nbsp;Kush Goliya,&nbsp;Piyush Ranjan,&nbsp;Robert P Dickson,&nbsp;David Blaauw,&nbsp;Reetuparna Das,&nbsp;Satish Narayanasamy","doi":"10.26502/acbr.50170318","DOIUrl":null,"url":null,"abstract":"<p><p>ReadUntil enables Oxford Nanopore Technology's (ONT) sequencers to selectively sequence reads of target species in real-time. This enables efficient microbial enrichment for applications such as microbial abundance estimation and is particularly beneficial for metagenomic samples with a very high fraction of non-target reads (> 99% can be human reads). However, read-until requires a fast and accurate software filter that analyzes a short prefix of a read and determines if it belongs to a microbe of interest (target) or not. The baseline Read Until pipeline uses a deep neural network-based basecaller called Guppy and is slow and inaccurate for this task (~60% of bases sequenced are unclassified). We present RawMap, an efficient CPU-only microbial species-agnostic Read Until classifier for filtering non-target human reads in the squiggle space. RawMap uses a Support Vector Machine (SVM), which is trained to distinguish human from microbe using non-linear and non-stationary characteristics of ONT's squiggle output (continuous electrical signals). Compared to the baseline Read Until pipeline, RawMap is a 1327X faster classifier and significantly improves the sequencing time and cost, and compute time savings. We show that RawMap augmented pipelines reduce sequencing time and cost by ~24% and computing cost by 22%. Additionally, since RawMap is agnostic to microbial species, it can also classify microbial species it is not trained on. We also discuss how RawMap may be used as an alternative to the RT-PCR test for viral load quantification of SARS-CoV-2.</p>","PeriodicalId":72279,"journal":{"name":"Archives of clinical and biomedical research","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10022530/pdf/nihms-1875752.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archives of clinical and biomedical research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26502/acbr.50170318","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/28 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

ReadUntil enables Oxford Nanopore Technology's (ONT) sequencers to selectively sequence reads of target species in real-time. This enables efficient microbial enrichment for applications such as microbial abundance estimation and is particularly beneficial for metagenomic samples with a very high fraction of non-target reads (> 99% can be human reads). However, read-until requires a fast and accurate software filter that analyzes a short prefix of a read and determines if it belongs to a microbe of interest (target) or not. The baseline Read Until pipeline uses a deep neural network-based basecaller called Guppy and is slow and inaccurate for this task (~60% of bases sequenced are unclassified). We present RawMap, an efficient CPU-only microbial species-agnostic Read Until classifier for filtering non-target human reads in the squiggle space. RawMap uses a Support Vector Machine (SVM), which is trained to distinguish human from microbe using non-linear and non-stationary characteristics of ONT's squiggle output (continuous electrical signals). Compared to the baseline Read Until pipeline, RawMap is a 1327X faster classifier and significantly improves the sequencing time and cost, and compute time savings. We show that RawMap augmented pipelines reduce sequencing time and cost by ~24% and computing cost by 22%. Additionally, since RawMap is agnostic to microbial species, it can also classify microbial species it is not trained on. We also discuss how RawMap may be used as an alternative to the RT-PCR test for viral load quantification of SARS-CoV-2.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用RawMap之前读取的快速实时波形分类。
ReadUntil使牛津纳米孔技术公司(ONT)的测序仪能够实时选择性地对目标物种的读数进行测序。这使得微生物富集能够有效地用于微生物丰度估计等应用,并且对于具有非常高比例的非靶标读数(>99%可以是人类读数)的宏基因组样品尤其有益。然而,读取直到需要一个快速准确的软件过滤器,该过滤器分析读取的短前缀,并确定它是否属于感兴趣的微生物(目标)。基线Read Until管道使用了一个名为Guppy的基于深度神经网络的basecaller,并且速度慢且不准确(约60%的测序碱基未分类)。我们提出了RawMap,这是一种高效的仅CPU的微生物物种不可知的Read Until分类器,用于在歪歪扭扭的空间中过滤非目标人类读数。RawMap使用支持向量机(SVM),该机器使用ONT的波形输出(连续电信号)的非线性和非平稳特性进行训练,以区分人类和微生物。与基线Read Until流水线相比,RawMap是一个速度快1327X的分类器,显著提高了测序时间和成本,并节省了计算时间。我们表明,RawMap增强的管道减少了约24%的测序时间和成本,减少了22%的计算成本。此外,由于RawMap对微生物物种不可知,它也可以对未经训练的微生物物种进行分类。我们还讨论了如何将RawMap用作RT-PCR检测的替代方法,以量化严重急性呼吸系统综合征冠状病毒2型的病毒载量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Herb Stevia Rebaudiana’s Functionalities, Safety, and Applications: A Review Strict Lockdown versus Flexible Social Distance Strategy for COVID-19 Disease: a Cost-Effectiveness Analysis. Prevalence, Trends, and Harm Perception Associated with E-Cigarettes and Vaping among Adolescents in Saudi Arabia. Rapid Real-time Squiggle Classification for Read until using RawMap. Comparative Analysis of Global Hepatic Gene Expression in Adolescents and Adults with Non-alcoholic Fatty Liver Disease.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1