{"title":"Uncovering disease-related multicellular pathway modules on large-scale single-cell transcriptomes with scPAFA.","authors":"Zhuoli Huang, Yuhui Zheng, Weikai Wang, Wenwen Zhou, Yanbo Zhang, Chen Wei, Xiuqing Zhang, Xin Jin, Jianhua Yin","doi":"10.1038/s42003-024-07238-7","DOIUrl":null,"url":null,"abstract":"<p><p>Pathway analysis is a crucial analytical phase in disease research on single-cell RNA sequencing (scRNA-seq) data, offering biological interpretations based on prior knowledge. However, currently available tools for generating cell-level pathway activity scores (PAS) exhibit computational inefficacy in large-scale scRNA-seq datasets. Additionally, disease-related pathways are often identified through cross-condition comparisons within specific cell types, overlooking potential patterns that involve multiple cell types. Here, we present single-cell pathway activity factor analysis (scPAFA), a Python library designed for large-scale single-cell datasets allowing rapid PAS computation and uncovering biologically interpretable disease-related multicellular pathway modules, which are low-dimensional representations of disease-related PAS alterations in multiple cell types. Application on colorectal cancer (CRC) datasets and large-scale lupus atlas over 1.2 million cells demonstrated that scPAFA can achieve over 40-fold reductions in the runtime of PAS computation and further identified reliable and interpretable multicellular pathway modules that capture the heterogeneity of CRC and transcriptional abnormalities in lupus patients, respectively. Overall, scPAFA presents a valuable addition to existing research tools in disease research, with the potential to reveal complex disease mechanisms and support biomarker discovery at the pathway level.</p>","PeriodicalId":10552,"journal":{"name":"Communications Biology","volume":"7 1","pages":"1523"},"PeriodicalIF":5.2000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11569158/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s42003-024-07238-7","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Pathway analysis is a crucial analytical phase in disease research on single-cell RNA sequencing (scRNA-seq) data, offering biological interpretations based on prior knowledge. However, currently available tools for generating cell-level pathway activity scores (PAS) exhibit computational inefficacy in large-scale scRNA-seq datasets. Additionally, disease-related pathways are often identified through cross-condition comparisons within specific cell types, overlooking potential patterns that involve multiple cell types. Here, we present single-cell pathway activity factor analysis (scPAFA), a Python library designed for large-scale single-cell datasets allowing rapid PAS computation and uncovering biologically interpretable disease-related multicellular pathway modules, which are low-dimensional representations of disease-related PAS alterations in multiple cell types. Application on colorectal cancer (CRC) datasets and large-scale lupus atlas over 1.2 million cells demonstrated that scPAFA can achieve over 40-fold reductions in the runtime of PAS computation and further identified reliable and interpretable multicellular pathway modules that capture the heterogeneity of CRC and transcriptional abnormalities in lupus patients, respectively. Overall, scPAFA presents a valuable addition to existing research tools in disease research, with the potential to reveal complex disease mechanisms and support biomarker discovery at the pathway level.
期刊介绍:
Communications Biology is an open access journal from Nature Research publishing high-quality research, reviews and commentary in all areas of the biological sciences. Research papers published by the journal represent significant advances bringing new biological insight to a specialized area of research.