DGSIST: Clustering spatial transcriptome data based on deep graph structure Infomax

IF 4.2 3区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Methods Pub Date : 2024-10-15 DOI:10.1016/j.ymeth.2024.10.002
Yu-Han Xiu , Si-Lin Sun , Bing-Wei Zhou , Ying Wan , Hua Tang , Hai-Xia Long
{"title":"DGSIST: Clustering spatial transcriptome data based on deep graph structure Infomax","authors":"Yu-Han Xiu ,&nbsp;Si-Lin Sun ,&nbsp;Bing-Wei Zhou ,&nbsp;Ying Wan ,&nbsp;Hua Tang ,&nbsp;Hai-Xia Long","doi":"10.1016/j.ymeth.2024.10.002","DOIUrl":null,"url":null,"abstract":"<div><div>Although spatial transcriptomics data provide valuable insights into gene expression profiles and the spatial structure of tissues, most studies rely solely on gene expression information, underutilizing the spatial data. To fully leverage the potential of spatial transcriptomics and graph neural networks, the DGSI (Deep Graph Structure Infomax) model is proposed. This innovative graph data processing model uses graph convolutional neural networks and employs an unsupervised learning approach. It maximizes the mutual information between graph-level and node-level representations, emphasizing flexible sampling and aggregation of nodes and their neighbors. This effectively captures and incorporates local information from nodes into the overall graph structure. Additionally, this paper developed the DGSIST framework, an unsupervised cell clustering method that integrates the DGSI model, SVD dimensionality reduction algorithm, and k-means++ clustering algorithm. This aims to identify cell types accurately. DGSIST fully uses spatial transcriptomics data and outperforms existing methods in accuracy. Demonstrations of DGSIST’s capability across various tissue types and technological platforms have shown its effectiveness in accurately identifying spatial domains in multiple tissue sections. Compared to other spatial clustering methods, DGSIST excels in cell clustering and effectively eliminates batch effects without needing batch correction. DGSIST excels in spatial clustering analysis, spatial variation identification, and differential gene expression detection and directly applies to graph analysis tasks, such as node classification, link prediction, or graph clustering. Anticipation lies in the contribution of the DGSIST framework to a deeper understanding of the spatial organizational structures of diseases such as cancer.</div></div>","PeriodicalId":390,"journal":{"name":"Methods","volume":"231 ","pages":"Pages 226-236"},"PeriodicalIF":4.2000,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1046202324002196","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Although spatial transcriptomics data provide valuable insights into gene expression profiles and the spatial structure of tissues, most studies rely solely on gene expression information, underutilizing the spatial data. To fully leverage the potential of spatial transcriptomics and graph neural networks, the DGSI (Deep Graph Structure Infomax) model is proposed. This innovative graph data processing model uses graph convolutional neural networks and employs an unsupervised learning approach. It maximizes the mutual information between graph-level and node-level representations, emphasizing flexible sampling and aggregation of nodes and their neighbors. This effectively captures and incorporates local information from nodes into the overall graph structure. Additionally, this paper developed the DGSIST framework, an unsupervised cell clustering method that integrates the DGSI model, SVD dimensionality reduction algorithm, and k-means++ clustering algorithm. This aims to identify cell types accurately. DGSIST fully uses spatial transcriptomics data and outperforms existing methods in accuracy. Demonstrations of DGSIST’s capability across various tissue types and technological platforms have shown its effectiveness in accurately identifying spatial domains in multiple tissue sections. Compared to other spatial clustering methods, DGSIST excels in cell clustering and effectively eliminates batch effects without needing batch correction. DGSIST excels in spatial clustering analysis, spatial variation identification, and differential gene expression detection and directly applies to graph analysis tasks, such as node classification, link prediction, or graph clustering. Anticipation lies in the contribution of the DGSIST framework to a deeper understanding of the spatial organizational structures of diseases such as cancer.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
DGSIST:基于深度图结构的空间转录组数据聚类 Infomax.
虽然空间转录组学数据为了解基因表达谱和组织的空间结构提供了宝贵的视角,但大多数研究仅依赖于基因表达信息,对空间数据利用不足。为了充分利用空间转录组学和图神经网络的潜力,我们提出了 DGSI(深度图结构 Infomax)模型。这种创新的图数据处理模型使用图卷积神经网络,并采用无监督学习方法。它最大化了图层和节点层表征之间的互信息,强调节点及其邻居的灵活采样和聚合。这能有效捕捉节点的局部信息并将其纳入整体图结构中。此外,本文还开发了 DGSIST 框架,这是一种无监督细胞聚类方法,集成了 DGSI 模型、SVD 降维算法和 k-means++ 聚类算法。其目的是准确识别细胞类型。DGSIST 充分利用了空间转录组学数据,其准确性优于现有方法。DGSIST 在各种组织类型和技术平台上的能力展示表明,它能有效准确地识别多个组织切片中的空间域。与其他空间聚类方法相比,DGSIST 在细胞聚类方面表现出色,能有效消除批次效应,无需批次校正。DGSIST 在空间聚类分析、空间变异识别和差异基因表达检测方面表现出色,并可直接应用于节点分类、链接预测或图聚类等图分析任务。我们期待 DGSIST 框架能为深入了解癌症等疾病的空间组织结构做出贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Methods
Methods 生物-生化研究方法
CiteScore
9.80
自引率
2.10%
发文量
222
审稿时长
11.3 weeks
期刊介绍: Methods focuses on rapidly developing techniques in the experimental biological and medical sciences. Each topical issue, organized by a guest editor who is an expert in the area covered, consists solely of invited quality articles by specialist authors, many of them reviews. Issues are devoted to specific technical approaches with emphasis on clear detailed descriptions of protocols that allow them to be reproduced easily. The background information provided enables researchers to understand the principles underlying the methods; other helpful sections include comparisons of alternative methods giving the advantages and disadvantages of particular methods, guidance on avoiding potential pitfalls, and suggestions for troubleshooting.
期刊最新文献
Ab-Amy 2.0: Predicting light chain amyloidogenic risk of therapeutic antibodies based on antibody language model. SITP: A single cell bioinformatics analysis flow captures proteasome markers in the development of breast cancer Data Preprocessing Methods for Selective Sweep Detection using Convolutional Neural Networks. Exploring drug-target interaction prediction on cold-start scenarios via meta-learning-based graph transformer. MVCLST: A spatial transcriptome data analysis pipeline for cell type classification based on multi-view comparative learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1