Wenwen Min, Zhen Wang, Fangfang Zhu, Taosheng Xu, Shunfang Wang
{"title":"scASDC: Attention Enhanced Structural Deep Clustering for Single-cell RNA-seq Data","authors":"Wenwen Min, Zhen Wang, Fangfang Zhu, Taosheng Xu, Shunfang Wang","doi":"arxiv-2408.05258","DOIUrl":null,"url":null,"abstract":"Single-cell RNA sequencing (scRNA-seq) data analysis is pivotal for\nunderstanding cellular heterogeneity. However, the high sparsity and complex\nnoise patterns inherent in scRNA-seq data present significant challenges for\ntraditional clustering methods. To address these issues, we propose a deep\nclustering method, Attention-Enhanced Structural Deep Embedding Graph\nClustering (scASDC), which integrates multiple advanced modules to improve\nclustering accuracy and robustness.Our approach employs a multi-layer graph\nconvolutional network (GCN) to capture high-order structural relationships\nbetween cells, termed as the graph autoencoder module. To mitigate the\noversmoothing issue in GCNs, we introduce a ZINB-based autoencoder module that\nextracts content information from the data and learns latent representations of\ngene expression. These modules are further integrated through an attention\nfusion mechanism, ensuring effective combination of gene expression and\nstructural information at each layer of the GCN. Additionally, a\nself-supervised learning module is incorporated to enhance the robustness of\nthe learned embeddings. Extensive experiments demonstrate that scASDC\noutperforms existing state-of-the-art methods, providing a robust and effective\nsolution for single-cell clustering tasks. Our method paves the way for more\naccurate and meaningful analysis of single-cell RNA sequencing data,\ncontributing to better understanding of cellular heterogeneity and biological\nprocesses. All code and public datasets used in this paper are available at\n\\url{https://github.com/wenwenmin/scASDC} and\n\\url{https://zenodo.org/records/12814320}.","PeriodicalId":501070,"journal":{"name":"arXiv - QuanBio - Genomics","volume":"86 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.05258","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Single-cell RNA sequencing (scRNA-seq) data analysis is pivotal for
understanding cellular heterogeneity. However, the high sparsity and complex
noise patterns inherent in scRNA-seq data present significant challenges for
traditional clustering methods. To address these issues, we propose a deep
clustering method, Attention-Enhanced Structural Deep Embedding Graph
Clustering (scASDC), which integrates multiple advanced modules to improve
clustering accuracy and robustness.Our approach employs a multi-layer graph
convolutional network (GCN) to capture high-order structural relationships
between cells, termed as the graph autoencoder module. To mitigate the
oversmoothing issue in GCNs, we introduce a ZINB-based autoencoder module that
extracts content information from the data and learns latent representations of
gene expression. These modules are further integrated through an attention
fusion mechanism, ensuring effective combination of gene expression and
structural information at each layer of the GCN. Additionally, a
self-supervised learning module is incorporated to enhance the robustness of
the learned embeddings. Extensive experiments demonstrate that scASDC
outperforms existing state-of-the-art methods, providing a robust and effective
solution for single-cell clustering tasks. Our method paves the way for more
accurate and meaningful analysis of single-cell RNA sequencing data,
contributing to better understanding of cellular heterogeneity and biological
processes. All code and public datasets used in this paper are available at
\url{https://github.com/wenwenmin/scASDC} and
\url{https://zenodo.org/records/12814320}.