{"title":"利用 scPAFA 在大规模单细胞转录组上发现与疾病相关的多细胞通路模块。","authors":"Zhuoli Huang, Yuhui Zheng, Weikai Wang, Wenwen Zhou, Yanbo Zhang, Chen Wei, Xiuqing Zhang, Xin Jin, Jianhua Yin","doi":"10.1038/s42003-024-07238-7","DOIUrl":null,"url":null,"abstract":"<p><p>Pathway analysis is a crucial analytical phase in disease research on single-cell RNA sequencing (scRNA-seq) data, offering biological interpretations based on prior knowledge. However, currently available tools for generating cell-level pathway activity scores (PAS) exhibit computational inefficacy in large-scale scRNA-seq datasets. Additionally, disease-related pathways are often identified through cross-condition comparisons within specific cell types, overlooking potential patterns that involve multiple cell types. Here, we present single-cell pathway activity factor analysis (scPAFA), a Python library designed for large-scale single-cell datasets allowing rapid PAS computation and uncovering biologically interpretable disease-related multicellular pathway modules, which are low-dimensional representations of disease-related PAS alterations in multiple cell types. Application on colorectal cancer (CRC) datasets and large-scale lupus atlas over 1.2 million cells demonstrated that scPAFA can achieve over 40-fold reductions in the runtime of PAS computation and further identified reliable and interpretable multicellular pathway modules that capture the heterogeneity of CRC and transcriptional abnormalities in lupus patients, respectively. Overall, scPAFA presents a valuable addition to existing research tools in disease research, with the potential to reveal complex disease mechanisms and support biomarker discovery at the pathway level.</p>","PeriodicalId":10552,"journal":{"name":"Communications Biology","volume":"7 1","pages":"1523"},"PeriodicalIF":5.2000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11569158/pdf/","citationCount":"0","resultStr":"{\"title\":\"Uncovering disease-related multicellular pathway modules on large-scale single-cell transcriptomes with scPAFA.\",\"authors\":\"Zhuoli Huang, Yuhui Zheng, Weikai Wang, Wenwen Zhou, Yanbo Zhang, Chen Wei, Xiuqing Zhang, Xin Jin, Jianhua Yin\",\"doi\":\"10.1038/s42003-024-07238-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Pathway analysis is a crucial analytical phase in disease research on single-cell RNA sequencing (scRNA-seq) data, offering biological interpretations based on prior knowledge. However, currently available tools for generating cell-level pathway activity scores (PAS) exhibit computational inefficacy in large-scale scRNA-seq datasets. Additionally, disease-related pathways are often identified through cross-condition comparisons within specific cell types, overlooking potential patterns that involve multiple cell types. Here, we present single-cell pathway activity factor analysis (scPAFA), a Python library designed for large-scale single-cell datasets allowing rapid PAS computation and uncovering biologically interpretable disease-related multicellular pathway modules, which are low-dimensional representations of disease-related PAS alterations in multiple cell types. Application on colorectal cancer (CRC) datasets and large-scale lupus atlas over 1.2 million cells demonstrated that scPAFA can achieve over 40-fold reductions in the runtime of PAS computation and further identified reliable and interpretable multicellular pathway modules that capture the heterogeneity of CRC and transcriptional abnormalities in lupus patients, respectively. Overall, scPAFA presents a valuable addition to existing research tools in disease research, with the potential to reveal complex disease mechanisms and support biomarker discovery at the pathway level.</p>\",\"PeriodicalId\":10552,\"journal\":{\"name\":\"Communications Biology\",\"volume\":\"7 1\",\"pages\":\"1523\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2024-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11569158/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1038/s42003-024-07238-7\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s42003-024-07238-7","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
通路分析是单细胞 RNA 测序(scRNA-seq)数据疾病研究中的一个关键分析阶段,它能根据先前的知识提供生物学解释。然而,目前可用来生成细胞级通路活性评分(PAS)的工具在大规模 scRNA-seq 数据集中表现出计算效率低下的问题。此外,疾病相关通路通常是通过特定细胞类型内的跨条件比较来确定的,忽略了涉及多种细胞类型的潜在模式。在这里,我们介绍了单细胞通路活性因子分析(scPAFA),这是一个专为大规模单细胞数据集设计的 Python 库,可以快速计算 PAS 并发现可从生物学角度解释的疾病相关多细胞通路模块,这些模块是多种细胞类型中疾病相关 PAS 改变的低维表示。在结直肠癌(CRC)数据集和超过120万个细胞的大规模狼疮图谱上的应用表明,scPAFA能将PAS计算的运行时间缩短40倍以上,并进一步确定了可靠且可解释的多细胞通路模块,这些模块分别捕捉了CRC和狼疮患者转录异常的异质性。总之,scPAFA 是对现有疾病研究工具的宝贵补充,有望揭示复杂的疾病机制,支持通路水平的生物标记物发现。
Uncovering disease-related multicellular pathway modules on large-scale single-cell transcriptomes with scPAFA.
Pathway analysis is a crucial analytical phase in disease research on single-cell RNA sequencing (scRNA-seq) data, offering biological interpretations based on prior knowledge. However, currently available tools for generating cell-level pathway activity scores (PAS) exhibit computational inefficacy in large-scale scRNA-seq datasets. Additionally, disease-related pathways are often identified through cross-condition comparisons within specific cell types, overlooking potential patterns that involve multiple cell types. Here, we present single-cell pathway activity factor analysis (scPAFA), a Python library designed for large-scale single-cell datasets allowing rapid PAS computation and uncovering biologically interpretable disease-related multicellular pathway modules, which are low-dimensional representations of disease-related PAS alterations in multiple cell types. Application on colorectal cancer (CRC) datasets and large-scale lupus atlas over 1.2 million cells demonstrated that scPAFA can achieve over 40-fold reductions in the runtime of PAS computation and further identified reliable and interpretable multicellular pathway modules that capture the heterogeneity of CRC and transcriptional abnormalities in lupus patients, respectively. Overall, scPAFA presents a valuable addition to existing research tools in disease research, with the potential to reveal complex disease mechanisms and support biomarker discovery at the pathway level.
期刊介绍:
Communications Biology is an open access journal from Nature Research publishing high-quality research, reviews and commentary in all areas of the biological sciences. Research papers published by the journal represent significant advances bringing new biological insight to a specialized area of research.