{"title":"GraphCVAE: Uncovering cell heterogeneity and therapeutic target discovery through residual and contrastive learning","authors":"Zhiwei Zhang, Mengqiu Wang, Ruoyan Dai, Zhenghui Wang, Lixin Lei, Xudong Zhao, Kaitai Han, Chaojing Shi, Qianjin Guo","doi":"10.1016/j.lfs.2024.123208","DOIUrl":null,"url":null,"abstract":"<div><div>Advancements in Spatial Transcriptomics (ST) technologies in recent years have transformed the analysis of tissue structure and function within spatial contexts. However, accurately identifying spatial domains remains challenging due to data sparsity and noise. Traditional clustering methods often fail to capture spatial dependencies, while spatial clustering methods struggle with batch effects and data integration. We introduce GraphCVAE, a model designed to enhance spatial domain identification by integrating spatial and morphological information, correcting batch effects, and managing heterogeneous data. GraphCVAE employs a multi-layer Graph Convolutional Network (GCN) and a variational autoencoder to improve the representation and integration of spatial information. Through contrastive learning, the model captures subtle differences between cell types and states. Extensive testing on various ST datasets demonstrates GraphCVAE's robustness and biological contributions. In the dorsolateral prefrontal cortex (DLPFC) dataset, it accurately delineates cortical layer boundaries. In glioblastoma, GraphCVAE reveals critical therapeutic targets such as TF and NFIB. In colorectal cancer, it explores the role of the extracellular matrix in colorectal cancer. The model's performance metrics consistently surpass existing methods, validating its effectiveness. GraphCVAE's advanced visualization capabilities further highlight its precision in resolving spatial structures, making it a powerful tool for spatial transcriptomics analysis and offering new insights into disease studies.</div></div>","PeriodicalId":18122,"journal":{"name":"Life sciences","volume":"359 ","pages":"Article 123208"},"PeriodicalIF":5.2000,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Life sciences","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0024320524007987","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Advancements in Spatial Transcriptomics (ST) technologies in recent years have transformed the analysis of tissue structure and function within spatial contexts. However, accurately identifying spatial domains remains challenging due to data sparsity and noise. Traditional clustering methods often fail to capture spatial dependencies, while spatial clustering methods struggle with batch effects and data integration. We introduce GraphCVAE, a model designed to enhance spatial domain identification by integrating spatial and morphological information, correcting batch effects, and managing heterogeneous data. GraphCVAE employs a multi-layer Graph Convolutional Network (GCN) and a variational autoencoder to improve the representation and integration of spatial information. Through contrastive learning, the model captures subtle differences between cell types and states. Extensive testing on various ST datasets demonstrates GraphCVAE's robustness and biological contributions. In the dorsolateral prefrontal cortex (DLPFC) dataset, it accurately delineates cortical layer boundaries. In glioblastoma, GraphCVAE reveals critical therapeutic targets such as TF and NFIB. In colorectal cancer, it explores the role of the extracellular matrix in colorectal cancer. The model's performance metrics consistently surpass existing methods, validating its effectiveness. GraphCVAE's advanced visualization capabilities further highlight its precision in resolving spatial structures, making it a powerful tool for spatial transcriptomics analysis and offering new insights into disease studies.
期刊介绍:
Life Sciences is an international journal publishing articles that emphasize the molecular, cellular, and functional basis of therapy. The journal emphasizes the understanding of mechanism that is relevant to all aspects of human disease and translation to patients. All articles are rigorously reviewed.
The Journal favors publication of full-length papers where modern scientific technologies are used to explain molecular, cellular and physiological mechanisms. Articles that merely report observations are rarely accepted. Recommendations from the Declaration of Helsinki or NIH guidelines for care and use of laboratory animals must be adhered to. Articles should be written at a level accessible to readers who are non-specialists in the topic of the article themselves, but who are interested in the research. The Journal welcomes reviews on topics of wide interest to investigators in the life sciences. We particularly encourage submission of brief, focused reviews containing high-quality artwork and require the use of mechanistic summary diagrams.