QUIC-seq: Quick ultra-affordable high-throughput convenient RNA sequencing

IF 10.5 1区生物学 Q1 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Plant Biotechnology Journal Pub Date : 2024-10-24 DOI:10.1111/pbi.14496

Shouzhen Teng, Dan Wang, Yiheng Qian, Revocatus Bahitwa, Jinghong Shao, Mingrui Suo, Mingchi Xu, Luyuan Yang, Tianyi Li, Qiuying Yu, Hai Wang

{"title":"QUIC-seq: Quick ultra-affordable high-throughput convenient RNA sequencing","authors":"Shouzhen Teng, Dan Wang, Yiheng Qian, Revocatus Bahitwa, Jinghong Shao, Mingrui Suo, Mingchi Xu, Luyuan Yang, Tianyi Li, Qiuying Yu, Hai Wang","doi":"10.1111/pbi.14496","DOIUrl":null,"url":null,"abstract":"Quantifying transcript levels is essential for understanding the gene functions and regulatory networks. To achieve this, various techniques have been developed to measure gene transcription levels, including transcriptome sequencing (Mutz et al., 2013). Standard RNA-seq experiment involves several key steps: RNA extraction, mRNA purification, reverse transcription, second-strand cDNA synthesis, adapter ligation, and library amplification. Traditionally, RNA-seq covers the entire coding region and requires sufficient sequencing depth to produce reliable results. To address this, targeted 3′ end transcriptome sequencing methods like PAT-seq (Harrison et al., 2015) have been developed, reducing both the sequencing coverage and costs, but these methods can only process one RNA sample at a time, limiting their application in large-scale studies. Fortunately, advancements like SiPAS (Wang et al., 2022) and MP3RNA-seq (Chen et al., 2021) have made the construction of 3′ terminal libraries more convenient and suitable for high-throughput applications. Recent studies have shown that Tn5 can act on RNA/DNA hybrids (Choi et al., 2024; Lu et al., 2020), eliminating the need for cDNA second-strand synthesis. Thus, we developed QUIC-seq, a quick, ultra-affordable, high-throughput and convenient method for gene expression analysis.The workflow for QUIC-seq library preparation is illustrated in Figure 1a. Initially, total RNA is extracted from various samples and quantified. Approximately 500 ng RNA from each sample is transferred into a 96-well plate for reverse transcription using specific reverse transcription primers (Table S1). Next, half of the RNA/DNA hybrids from each well are pooled and recovered with magnetic beads. The RNA/DNA hybrids are then fragmented using Tn5 transposase, followed by direct PCR amplification. The amplified product is selected using magnetic beads and sequenced on illumina sequencing platform.Optimizing the QUIC-seq library conditions involved adjusting various steps to achieve the best results. First, we tested the effect of reverse transcriptase from different manufacturers (Figure S1). The AIII enzyme performed the best, followed by VIII enzyme and TIV enzyme. Despite AIII's superior performance, its cost was double that of VIII, making it less economical. We also examined the effect of inactivating the reverse transcription enzyme. The results indicated that enzyme inactivation negatively affected the subsequent data. This could be due to high temperatures causing the RNA/DNA hybrids to denature and separate, resulting in sub-optimal library construction. In addition, we found that whether SDS was added or not had little effect on the number of genes detected (Figure 1b; Figure S2).When Tn5 fragment RNA/DNA hybrids, it creates a 9 bp gap. Previous studies (Choi et al., 2024; Lu et al., 2020) suggested adding Bst3.0 or reverse transcriptase to fill this gap. However, we found that more detected genes (Figure 1b) were achieved without adding Bst3.0 or reverse transcriptase. This indicates that Takara Ex Premier polymerase effectively fills the hybrids' gap – a novel finding in this study. In summary, we eliminated the steps of reverse transcriptase inactivation, Tn5 transposase inactivation, and Bst3.0 or reverse transcriptase gap-filling, streamlining the procedure and reducing both time and cost.To assess the reproducibility of the library construction, we performed two independent QUIC-seq experiments. The results demonstrated a high correlation coefficient of 0.98 between the replicates (Figure 1c). TruSeq reads were evenly distributed across the entire gene, while QUIC-seq reads were concentrated at the 3′ end as expected (Figure 1d). QUIC-seq has fewer reads aligned to intergenic regions than TruSeq, improving read utilization (Figure S3). To test for sample mixing, we used RNA from Arabidopsis, rice and maize in a single experiment. Approximately 99.8% of the reads from these samples aligned with their respective reference genomes, indicating minimal cross-contamination between barcodes (Figure 1e; Figure S4). Theoretical predictions suggest that increasing read counts leads to more gene detections, but also raises costs. Our analysis showed that around 20 000 genes were detected at 1.5 × 106 reads, with the number plateauing thereafter (Figure 1f).The performances of QUIC-seq and other RNA-seq methods were compared. The correlation coefficient between the TruSeq and QUIC-seq library construction methods was 0.9 (Figure 1g), comparable to other 3′ terminal RNA-seq methods like MP3RNA-seq (Chen et al., 2021). In terms of gene identification, both methods detected 22 939 common genes (Figure 1h; Table S2). However, QUIC-seq offers significant advantages in cost and throughput. Unlike MP3RNA-seq, QUIC-seq simplifies the process by omitting RNA strand degradation and the conversion of single-stranded cDNA to double-stranded cDNA. While BOLT-seq requires the same amount of time for library preparation, it involves more complex steps such as adding Tn5 and performing PCR amplification in a 96-well plate (Choi et al., 2024). In contrast, QUIC-seq requires only reverse transcription in a 96-well plate, with Tn5 fragmentation and PCR amplification done in a few Eppendorf tubes, significantly reducing operational complexity. Additionally, QUIC-seq libraries includes three barcodes, further reducing costs. The incorporation of UMIs in the reverse transcription primers also enables more accurate gene expression quantification after UMI deduplication, a feature lacking in BOLT-seq. In summary, QUIC-seq is a simplified and efficient library preparation method that can process high-throughput samples (96 samples) within 4 h at a significantly reduced cost of only $0.823 per sample (Figure 1i; Tables S3 and S4).Given the method's stability and convenience, we conducted several experiments using QUIC-seq. Initially, we identified 2477 differentially expressed genes (DEGs) in response to nitrogen and phosphorus stress, with 1339 genes up-regulated and 1138 down-regulated (Figure 1j; Figure S5). To further explore the role of Opaque2 (O2) and prolamin-box binding factor 1 (PBF1) in endosperm formation and to understand the expression regulatory network, we used QUIC-seq to screen target genes. In a previous transcriptome analysis using O2-GFP and GFP, 1715 up-regulated and 772 down-regulated genes were identified with FDR < 0.05 (Zhu et al., 2023). Under stricter criteria (FDR < 0.05, |log2FoldChange| > 1), QUIC-seq identified 2281 O2-up-regulated and 2913 O2-down-regulated genes, revealing more DEGs compared to PER-seq (Figure S6). Additionally, we found 539 PBF1-activated and 480 PBF1-repressed genes, with 88 genes up-regulated and 217 genes down-regulated in both O2-GFP and PBF1-GFP datasets. In analysing the expression regulatory network of BBM, we identified four transcription factor genes with functions similar to BBM. QUIC-seq was employed for library construction and sequencing, leading to the identification of DEGs in bHLH48, WRKY95, GATA28, and IFA1, ultimately uncovering 50 co-expressed regulatory genes associated with these four transcription factors (Figure 1k; Figure S7).The authors declare they have submitted an innovation patent related to this work (application number: 202410796820X).HW conceived and designed the project. ST, DW and HQ conducted the experiments and analysed the data. HS, MS, MX, LY, TL and QY participated in some experiments. ST wrote the manuscript. HW and REVO revised the manuscript.","PeriodicalId":221,"journal":{"name":"Plant Biotechnology Journal","volume":"23 1","pages":"281-283"},"PeriodicalIF":10.5000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/pbi.14496","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Biotechnology Journal","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/pbi.14496","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Quantifying transcript levels is essential for understanding the gene functions and regulatory networks. To achieve this, various techniques have been developed to measure gene transcription levels, including transcriptome sequencing (Mutz et al., 2013). Standard RNA-seq experiment involves several key steps: RNA extraction, mRNA purification, reverse transcription, second-strand cDNA synthesis, adapter ligation, and library amplification. Traditionally, RNA-seq covers the entire coding region and requires sufficient sequencing depth to produce reliable results. To address this, targeted 3′ end transcriptome sequencing methods like PAT-seq (Harrison et al., 2015) have been developed, reducing both the sequencing coverage and costs, but these methods can only process one RNA sample at a time, limiting their application in large-scale studies. Fortunately, advancements like SiPAS (Wang et al., 2022) and MP3RNA-seq (Chen et al., 2021) have made the construction of 3′ terminal libraries more convenient and suitable for high-throughput applications. Recent studies have shown that Tn5 can act on RNA/DNA hybrids (Choi et al., 2024; Lu et al., 2020), eliminating the need for cDNA second-strand synthesis. Thus, we developed QUIC-seq, a quick, ultra-affordable, high-throughput and convenient method for gene expression analysis.

The workflow for QUIC-seq library preparation is illustrated in Figure 1a. Initially, total RNA is extracted from various samples and quantified. Approximately 500 ng RNA from each sample is transferred into a 96-well plate for reverse transcription using specific reverse transcription primers (Table S1). Next, half of the RNA/DNA hybrids from each well are pooled and recovered with magnetic beads. The RNA/DNA hybrids are then fragmented using Tn5 transposase, followed by direct PCR amplification. The amplified product is selected using magnetic beads and sequenced on illumina sequencing platform.

Optimizing the QUIC-seq library conditions involved adjusting various steps to achieve the best results. First, we tested the effect of reverse transcriptase from different manufacturers (Figure S1). The AIII enzyme performed the best, followed by VIII enzyme and TIV enzyme. Despite AIII's superior performance, its cost was double that of VIII, making it less economical. We also examined the effect of inactivating the reverse transcription enzyme. The results indicated that enzyme inactivation negatively affected the subsequent data. This could be due to high temperatures causing the RNA/DNA hybrids to denature and separate, resulting in sub-optimal library construction. In addition, we found that whether SDS was added or not had little effect on the number of genes detected (Figure 1b; Figure S2).

When Tn5 fragment RNA/DNA hybrids, it creates a 9 bp gap. Previous studies (Choi et al., 2024; Lu et al., 2020) suggested adding Bst3.0 or reverse transcriptase to fill this gap. However, we found that more detected genes (Figure 1b) were achieved without adding Bst3.0 or reverse transcriptase. This indicates that Takara Ex Premier polymerase effectively fills the hybrids' gap – a novel finding in this study. In summary, we eliminated the steps of reverse transcriptase inactivation, Tn5 transposase inactivation, and Bst3.0 or reverse transcriptase gap-filling, streamlining the procedure and reducing both time and cost.

To assess the reproducibility of the library construction, we performed two independent QUIC-seq experiments. The results demonstrated a high correlation coefficient of 0.98 between the replicates (Figure 1c). TruSeq reads were evenly distributed across the entire gene, while QUIC-seq reads were concentrated at the 3′ end as expected (Figure 1d). QUIC-seq has fewer reads aligned to intergenic regions than TruSeq, improving read utilization (Figure S3). To test for sample mixing, we used RNA from Arabidopsis, rice and maize in a single experiment. Approximately 99.8% of the reads from these samples aligned with their respective reference genomes, indicating minimal cross-contamination between barcodes (Figure 1e; Figure S4). Theoretical predictions suggest that increasing read counts leads to more gene detections, but also raises costs. Our analysis showed that around 20 000 genes were detected at 1.5 × 10⁶ reads, with the number plateauing thereafter (Figure 1f).

The performances of QUIC-seq and other RNA-seq methods were compared. The correlation coefficient between the TruSeq and QUIC-seq library construction methods was 0.9 (Figure 1g), comparable to other 3′ terminal RNA-seq methods like MP3RNA-seq (Chen et al., 2021). In terms of gene identification, both methods detected 22 939 common genes (Figure 1h; Table S2). However, QUIC-seq offers significant advantages in cost and throughput. Unlike MP3RNA-seq, QUIC-seq simplifies the process by omitting RNA strand degradation and the conversion of single-stranded cDNA to double-stranded cDNA. While BOLT-seq requires the same amount of time for library preparation, it involves more complex steps such as adding Tn5 and performing PCR amplification in a 96-well plate (Choi et al., 2024). In contrast, QUIC-seq requires only reverse transcription in a 96-well plate, with Tn5 fragmentation and PCR amplification done in a few Eppendorf tubes, significantly reducing operational complexity. Additionally, QUIC-seq libraries includes three barcodes, further reducing costs. The incorporation of UMIs in the reverse transcription primers also enables more accurate gene expression quantification after UMI deduplication, a feature lacking in BOLT-seq. In summary, QUIC-seq is a simplified and efficient library preparation method that can process high-throughput samples (96 samples) within 4 h at a significantly reduced cost of only $0.823 per sample (Figure 1i; Tables S3 and S4).

Given the method's stability and convenience, we conducted several experiments using QUIC-seq. Initially, we identified 2477 differentially expressed genes (DEGs) in response to nitrogen and phosphorus stress, with 1339 genes up-regulated and 1138 down-regulated (Figure 1j; Figure S5). To further explore the role of Opaque2 (O2) and prolamin-box binding factor 1 (PBF1) in endosperm formation and to understand the expression regulatory network, we used QUIC-seq to screen target genes. In a previous transcriptome analysis using O2-GFP and GFP, 1715 up-regulated and 772 down-regulated genes were identified with FDR < 0.05 (Zhu et al., 2023). Under stricter criteria (FDR < 0.05, |log2FoldChange| > 1), QUIC-seq identified 2281 O2-up-regulated and 2913 O2-down-regulated genes, revealing more DEGs compared to PER-seq (Figure S6). Additionally, we found 539 PBF1-activated and 480 PBF1-repressed genes, with 88 genes up-regulated and 217 genes down-regulated in both O2-GFP and PBF1-GFP datasets. In analysing the expression regulatory network of BBM, we identified four transcription factor genes with functions similar to BBM. QUIC-seq was employed for library construction and sequencing, leading to the identification of DEGs in bHLH48, WRKY95, GATA28, and IFA1, ultimately uncovering 50 co-expressed regulatory genes associated with these four transcription factors (Figure 1k; Figure S7).

The authors declare they have submitted an innovation patent related to this work (application number: 202410796820X).

HW conceived and designed the project. ST, DW and HQ conducted the experiments and analysed the data. HS, MS, MX, LY, TL and QY participated in some experiments. ST wrote the manuscript. HW and REVO revised the manuscript.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

QUIC-seq：超实惠的快速高通量便捷 RNA 测序

量化转录物水平对于理解基因功能和调控网络至关重要。为了实现这一目标，已经开发了各种技术来测量基因转录水平，包括转录组测序（Mutz et al., 2013）。标准RNA-seq实验包括几个关键步骤：RNA提取、mRNA纯化、反转录、第二链cDNA合成、适配器连接和文库扩增。传统上，RNA-seq覆盖了整个编码区，需要足够的测序深度才能产生可靠的结果。为了解决这个问题，已经开发出针对性的3 '端转录组测序方法，如PAT-seq (Harrison et al., 2015)，降低了测序覆盖率和成本，但这些方法一次只能处理一个RNA样本，限制了它们在大规模研究中的应用。幸运的是，SiPAS （Wang et al., 2022）和MP3RNA-seq （Chen et al., 2021）等技术的进步使得构建3′端库更加方便，更适合高通量应用。最近的研究表明，Tn5可以作用于RNA/DNA杂交体(Choi et al., 2024；Lu et al., 2020)，消除了cDNA第二链合成的需要。因此，我们开发了QUIC-seq，一种快速、超经济、高通量和方便的基因表达分析方法。QUIC-seq库准备的工作流程如图1a所示。首先，从各种样品中提取总RNA并进行定量。每个样品中约500 ng RNA转移到96孔板中，使用特定的逆转录引物进行逆转录（表S1）。接下来，将每口井中一半的RNA/DNA杂交体汇集起来，用磁珠回收。然后使用Tn5转座酶将RNA/DNA杂交体片段化，然后进行直接PCR扩增。用磁珠选择扩增产物，在illumina测序平台上测序。优化QUIC-seq库条件包括调整各个步骤以达到最佳结果。首先，我们测试了来自不同制造商的逆转录酶的效果（图S1）。AIII酶表现最好，VIII酶次之，TIV酶次之。尽管AIII的性能优越，但其成本是VIII的两倍，使其不太经济。我们还研究了灭活逆转录酶的效果。结果表明，酶失活对后续数据产生负面影响。这可能是由于高温导致RNA/DNA杂交体变性和分离，导致文库构建不理想。此外，我们发现是否添加SDS对检测到的基因数量影响不大(图1b；图S2)。当Tn5片段RNA/DNA杂交时，它会产生一个9bp的间隙。既往研究(Choi et al., 2024；Lu et al., 2020)建议添加Bst3.0或逆转录酶来填补这一空白。然而，我们发现在不添加Bst3.0或逆转录酶的情况下，可以检测到更多的基因（图1b）。这表明Takara Ex Premier聚合酶有效地填补了杂交种的空白——这是本研究的一个新发现。总之，我们消除了逆转录酶失活、Tn5转座酶失活和Bst3.0或逆转录酶缺口填充的步骤，简化了程序，降低了时间和成本。为了评估文库构建的可重复性，我们进行了两个独立的QUIC-seq实验。结果显示，重复之间的相关系数为0.98（图1c）。TruSeq reads均匀分布在整个基因中，而QUIC-seq reads如预期的那样集中在3 '端（图1d）。与TruSeq相比，QUIC-seq与基因间区对齐的reads更少，从而提高了读取利用率（图S3）。为了测试样品混合，我们在一个实验中使用了来自拟南芥、水稻和玉米的RNA。来自这些样本的大约99.8%的reads与各自的参考基因组对齐，表明条形码之间的交叉污染最小(图1e；图S4)。理论预测表明，读取次数的增加会导致更多的基因检测，但也会提高成本。我们的分析表明，在1.5 × 106 reads中检测到约20,000个基因，此后数量趋于稳定（图1f）。比较了QUIC-seq和其他RNA-seq方法的性能。TruSeq和QUIC-seq文库构建方法的相关系数为0.9（图1g），与MP3RNA-seq等其他3 '端RNA-seq方法相当（Chen et al., 2021）。在基因鉴定方面，两种方法共检测到共有基因22 939个(图1h；表S2)。然而，QUIC-seq在成本和吞吐量方面具有显著优势。与MP3RNA-seq不同，QUIC-seq通过省略RNA链降解和单链cDNA到双链cDNA的转化简化了这一过程。虽然BOLT-seq需要相同的文库准备时间，但它涉及更复杂的步骤，例如添加Tn5并在96孔板中进行PCR扩增（Choi等人，2024）。相比之下，QUIC-seq只需要在96孔板中进行逆转录，Tn5片段和PCR扩增在几个Eppendorf管中完成，大大降低了操作的复杂性。此外，QUIC-seq库包括三个条形码，进一步降低了成本。在逆转录引物中加入UMIs还可以在UMIs重复数据删除后更准确地进行基因表达量化，这是BOLT-seq所缺乏的功能。综上所述，QUIC-seq是一种简化高效的文库制备方法，可以在4小时内处理高通量样品（96个样品），成本显著降低，每个样品仅为0.823美元(图1i；表S3和S4)。考虑到该方法的稳定性和方便性，我们使用QUIC-seq进行了多次实验。最初，我们鉴定出2477个差异表达基因（DEGs）响应氮磷胁迫，其中1339个基因上调，1138个基因下调(图1j；图S5)。为了进一步探索不透明蛋白2 （O2）和prolamin-box binding factor 1 （PBF1）在胚乳形成中的作用，了解其表达调控网络，我们采用QUIC-seq筛选靶基因。在之前使用O2-GFP和GFP的转录组分析中，FDR &lt； 0.05鉴定出1715个上调基因和772个下调基因（Zhu et al., 2023）。在更严格的标准下（FDR < 0.05, |log2FoldChange| > 1）， QUIC-seq鉴定出2281个o2上调基因和2913个o2下调基因，与PER-seq相比，揭示出更多的DEGs（图S6）。此外，在O2-GFP和PBF1-GFP数据集中，我们发现539个pbf1激活基因和480个pbf1抑制基因，其中88个基因上调，217个基因下调。在分析BBM的表达调控网络时，我们发现了四个功能与BBM相似的转录因子基因。利用QUIC-seq进行文库构建和测序，鉴定出bHLH48、WRKY95、GATA28和IFA1中的deg，最终发现了与这四种转录因子相关的50个共表达调控基因(图1k；图S7)。作者声明已提交与本工作相关的创新专利（申请号：202410796820X）。HW构思并设计了这个项目。ST， DW和HQ进行了实验并分析了数据。HS、MS、MX、LY、TL、QY参与了部分实验。ST写了手稿。HW和REVO对稿件进行了修改。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Plant Biotechnology Journal 生物-生物工程与应用微生物

CiteScore

20.50

自引率

2.90%

发文量

201

审稿时长

1 months

期刊介绍： Plant Biotechnology Journal aspires to publish original research and insightful reviews of high impact, authored by prominent researchers in applied plant science. The journal places a special emphasis on molecular plant sciences and their practical applications through plant biotechnology. Our goal is to establish a platform for showcasing significant advances in the field, encompassing curiosity-driven studies with potential applications, strategic research in plant biotechnology, scientific analysis of crucial issues for the beneficial utilization of plant sciences, and assessments of the performance of plant biotechnology products in practical applications.