Abdullah Al Mohit, Niher Ranjan Das, Arushi Jain, Nazmir Binta Alam, Ananda Mustafiz
Next-generation RNA sequencing (RNA-seq) allows researchers to study gene expression across the whole genome. However, its analysis often needs powerful computers and advanced command-line skills, which can be challenging when resources are limited. This protocol provides a simple, start-to-finish RNA-seq data analysis method that is easy to follow, reproducible, and requires minimal local hardware. It uses free tools such as SRA Toolkit, FastQC, Trimmomatic, BWA/HISAT2, Samtools, and Subread, along with Python and R for further analysis using Google Colab. The process includes downloading raw data from NCBI GEO/SRA, checking data quality, trimming adapters and low-quality reads, aligning sequences to reference genomes, converting file formats, counting reads, normalizing to TPM, and creating visualizations such as heatmaps, bar plots, and volcano plots. Differential gene expression is analyzed with pyDESeq2, and functional enrichment is done using g:Profiler. Troubleshooting in RNA-seq generally involves configuring essential tools, resolving path and dependency issues, and ensuring proper handling of paired-end reads during analysis. By running the heavy computational steps on cloud platforms, this workflow makes RNA-seq analysis affordable and accessible to more researchers. © 2026 Wiley Periodicals LLC.
Basic Protocol 1: Extracting and processing a high-throughput RNA-seq dataset with the command prompt and Windows Subsystem for Linux
Basic Protocol 2: Normalization and visualization of processed RNA-seq dataset with Google Colab and Python 3
{"title":"Streamline Protocol for Bulk-RNA Sequencing: From Data Extraction to Expression Analysis","authors":"Abdullah Al Mohit, Niher Ranjan Das, Arushi Jain, Nazmir Binta Alam, Ananda Mustafiz","doi":"10.1002/cpz1.70304","DOIUrl":"10.1002/cpz1.70304","url":null,"abstract":"<p>Next-generation RNA sequencing (RNA-seq) allows researchers to study gene expression across the whole genome. However, its analysis often needs powerful computers and advanced command-line skills, which can be challenging when resources are limited. This protocol provides a simple, start-to-finish RNA-seq data analysis method that is easy to follow, reproducible, and requires minimal local hardware. It uses free tools such as SRA Toolkit, FastQC, Trimmomatic, BWA/HISAT2, Samtools, and Subread, along with Python and R for further analysis using Google Colab. The process includes downloading raw data from NCBI GEO/SRA, checking data quality, trimming adapters and low-quality reads, aligning sequences to reference genomes, converting file formats, counting reads, normalizing to TPM, and creating visualizations such as heatmaps, bar plots, and volcano plots. Differential gene expression is analyzed with pyDESeq2, and functional enrichment is done using g:Profiler. Troubleshooting in RNA-seq generally involves configuring essential tools, resolving path and dependency issues, and ensuring proper handling of paired-end reads during analysis. By running the heavy computational steps on cloud platforms, this workflow makes RNA-seq analysis affordable and accessible to more researchers. © 2026 Wiley Periodicals LLC.</p><p><b>Basic Protocol 1</b>: Extracting and processing a high-throughput RNA-seq dataset with the command prompt and Windows Subsystem for Linux</p><p><b>Basic Protocol 2</b>: Normalization and visualization of processed RNA-seq dataset with Google Colab and Python 3</p>","PeriodicalId":93970,"journal":{"name":"Current protocols","volume":"6 2","pages":""},"PeriodicalIF":2.2,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146121272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0