{"title":"Accurate Spatial Heterogeneity Dissection and Gene Regulation Interpretation for Spatial Transcriptomics using Dual Graph Contrastive Learning.","authors":"Zhuohan Yu, Yuning Yang, Xingjian Chen, Ka-Chun Wong, Zhaolei Zhang, Yuming Zhao, Xiangtao Li","doi":"10.1002/advs.202410081","DOIUrl":null,"url":null,"abstract":"<p><p>Recent advances in spatial transcriptomics have enabled simultaneous preservation of high-throughput gene expression profiles and the spatial context, enabling high-resolution exploration of distinct regional characterization in tissue. To effectively understand the underlying biological mechanisms within tissue microenvironments, there is a requisite for methods that can accurately capture external spatial heterogeneity and interpret internal gene regulation from spatial transcriptomics data. However, current methods for region identification often lack the simultaneous characterizing of spatial structure and gene regulation, thereby limiting the ability of spatial dissection and gene interpretation. Here, stDCL is developed, a dual graph contrastive learning method to identify spatial domains and interpret gene regulation in spatial transcriptomics data. stDCL adaptively incorporates gene expression data and spatial information via a graph embedding autoencoder, thereby preserving critical information within the latent embedding representations. In addition, dual graph contrastive learning is proposed to train the model, ensuring that the latent embedding representation closely resembles the actual spatial distribution and exhibits cluster similarity. Benchmarking stDCL against other state-of-the-art clustering methods using complex cortex datasets demonstrates its superior accuracy and effectiveness in identifying spatial domains. Our analysis of the imputation matrices generated by stDCL reveals its capability to reconstruct spatial hierarchical structures and refine differential expression assessment. Furthermore, it is demonstrated that the versatility of stDCL in interpretability of gene regulation, spatial heterogeneity at high resolution, and embryonic developmental patterns. In addition, it is also showed that stDCL can successfully annotate disease-associated astrocyte subtypes in Alzheimer's disease and unravel multiple relevant pathways and regulatory mechanisms.</p>","PeriodicalId":117,"journal":{"name":"Advanced Science","volume":" ","pages":"e2410081"},"PeriodicalIF":14.3000,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Science","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1002/advs.202410081","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Recent advances in spatial transcriptomics have enabled simultaneous preservation of high-throughput gene expression profiles and the spatial context, enabling high-resolution exploration of distinct regional characterization in tissue. To effectively understand the underlying biological mechanisms within tissue microenvironments, there is a requisite for methods that can accurately capture external spatial heterogeneity and interpret internal gene regulation from spatial transcriptomics data. However, current methods for region identification often lack the simultaneous characterizing of spatial structure and gene regulation, thereby limiting the ability of spatial dissection and gene interpretation. Here, stDCL is developed, a dual graph contrastive learning method to identify spatial domains and interpret gene regulation in spatial transcriptomics data. stDCL adaptively incorporates gene expression data and spatial information via a graph embedding autoencoder, thereby preserving critical information within the latent embedding representations. In addition, dual graph contrastive learning is proposed to train the model, ensuring that the latent embedding representation closely resembles the actual spatial distribution and exhibits cluster similarity. Benchmarking stDCL against other state-of-the-art clustering methods using complex cortex datasets demonstrates its superior accuracy and effectiveness in identifying spatial domains. Our analysis of the imputation matrices generated by stDCL reveals its capability to reconstruct spatial hierarchical structures and refine differential expression assessment. Furthermore, it is demonstrated that the versatility of stDCL in interpretability of gene regulation, spatial heterogeneity at high resolution, and embryonic developmental patterns. In addition, it is also showed that stDCL can successfully annotate disease-associated astrocyte subtypes in Alzheimer's disease and unravel multiple relevant pathways and regulatory mechanisms.
期刊介绍:
Advanced Science is a prestigious open access journal that focuses on interdisciplinary research in materials science, physics, chemistry, medical and life sciences, and engineering. The journal aims to promote cutting-edge research by employing a rigorous and impartial review process. It is committed to presenting research articles with the highest quality production standards, ensuring maximum accessibility of top scientific findings. With its vibrant and innovative publication platform, Advanced Science seeks to revolutionize the dissemination and organization of scientific knowledge.