AsaruSim: a single-cell and spatial RNA-Seq Nanopore long-reads simulation workflow.

Ali Hamraoui, Laurent Jourdren, Morgane Thomas-Chollier
{"title":"AsaruSim: a single-cell and spatial RNA-Seq Nanopore long-reads simulation workflow.","authors":"Ali Hamraoui, Laurent Jourdren, Morgane Thomas-Chollier","doi":"10.1093/bioinformatics/btaf087","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>The combination of long-read sequencing technologies like Oxford Nanopore with single-cell RNA sequencing (scRNAseq) assays enables the detailed exploration of transcriptomic complexity, including isoform detection and quantification, by capturing full-length cDNAs. However, challenges remain, including the lack of advanced simulation tools that can effectively mimic the unique complexities of scRNAseq long-read datasets. Such tools are essential for the evaluation and optimization of isoform detection methods dedicated to single-cell long-read studies.</p><p><strong>Results: </strong>We developed AsaruSim, a workflow that simulates synthetic single-cell long-read Nanopore datasets, closely mimicking real experimental data. AsaruSim employs a multi-step process that includes the creation of a synthetic count matrix, generation of perfect reads, optional PCR amplification, introduction of sequencing errors, and comprehensive quality control reporting. Applied to a dataset of human peripheral blood mononuclear cells, AsaruSim accurately reproduced experimental read characteristics.</p><p><strong>Availability and implementation: </strong>The source code and full documentation are available at https://github.com/GenomiqueENS/AsaruSim.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11897429/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf087","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: The combination of long-read sequencing technologies like Oxford Nanopore with single-cell RNA sequencing (scRNAseq) assays enables the detailed exploration of transcriptomic complexity, including isoform detection and quantification, by capturing full-length cDNAs. However, challenges remain, including the lack of advanced simulation tools that can effectively mimic the unique complexities of scRNAseq long-read datasets. Such tools are essential for the evaluation and optimization of isoform detection methods dedicated to single-cell long-read studies.

Results: We developed AsaruSim, a workflow that simulates synthetic single-cell long-read Nanopore datasets, closely mimicking real experimental data. AsaruSim employs a multi-step process that includes the creation of a synthetic count matrix, generation of perfect reads, optional PCR amplification, introduction of sequencing errors, and comprehensive quality control reporting. Applied to a dataset of human peripheral blood mononuclear cells, AsaruSim accurately reproduced experimental read characteristics.

Availability and implementation: The source code and full documentation are available at https://github.com/GenomiqueENS/AsaruSim.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
AsaruSim:单细胞和空间RNA-Seq纳米孔长读模拟工作流程。
动机:将牛津纳米孔等长读测序技术与单细胞RNA测序(scRNAseq)分析相结合,通过捕获全长cdna,可以详细探索转录组的复杂性,包括异型检测和定量。然而,挑战仍然存在,包括缺乏先进的模拟工具,可以有效地模拟scnaseq长读数据集的独特复杂性。这样的工具是必不可少的评估和优化同种异构体检测方法专用于单细胞长读研究。结果:我们开发了AsaruSim,这是一个模拟合成单细胞长读纳米孔数据集的工作流程,非常接近真实的实验数据。AsaruSim采用多步骤流程,包括创建合成UMI计数矩阵,生成完美读数,可选PCR扩增,引入测序错误,以及全面的质量控制报告。AsaruSim应用于人外周血单个核细胞(PBMCs)数据集,准确地再现了实验读数特征。可用性:源代码和完整的文档可在:https://github.com/GenomiqueENS/AsaruSim.Supplementary信息:补充数据可在Bioinformatics在线。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
VDJ-Insights: simplifying the annotation of genomic IG and TCR regions. CoMBCR: Co-Learning Multi-Modalities of BCRs and Gene Expressions. HXMS: a standardized file format for HX-MS data. SeOMLR: one-step multi-view latent representation with self-weighted ensemble learning for multi-omics cancer subtyping. PEtab-GUI: A graphical user interface to create, edit and inspect PEtab parameter estimation problems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1