Xingyu Liao, Yanyan Li, Shuangyi Li, Long Wen, Xingyi Li, Bin Yu
{"title":"Enhanced Integration of Single-Cell Multi-Omics Data Using Graph Attention Networks.","authors":"Xingyu Liao, Yanyan Li, Shuangyi Li, Long Wen, Xingyi Li, Bin Yu","doi":"10.1021/acssynbio.4c00864","DOIUrl":null,"url":null,"abstract":"<p><p>The continuous advancement of single-cell multimodal omics (scMulti-omics) technologies offers unprecedented opportunities to measure various modalities, including RNA expression, protein abundance, gene perturbation, DNA methylation, and chromatin accessibility at single-cell resolution. These advances hold significant potential for breakthroughs by integrating diverse omics modalities. However, the data generated from different omics layers often face challenges due to high dimensionality, heterogeneity, and sparsity, which can adversely impact the accuracy and efficiency of data integration analyses. To address these challenges, we propose a high-precision analysis method called scMGAT (single-cell multiomics data analysis based on multihead graph attention networks). This method effectively coordinates reliable information across multiomics data sets using a multihead attention mechanism, allowing for better management of the heterogeneous characteristics inherent in scMulti-omics data. We evaluated scMGAT's performance on eight sets of real scMulti-omics data, including samples from both human and mouse. The experimental results demonstrate that scMGAT significantly enhances the quality of multiomics data and improves the accuracy of cell-type annotation compared to state-of-the-art methods. scMGAT is now freely accessible at https://github.com/Xingyu-Liao/scMGAT.</p>","PeriodicalId":26,"journal":{"name":"ACS Synthetic Biology","volume":" ","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2025-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Synthetic Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1021/acssynbio.4c00864","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
The continuous advancement of single-cell multimodal omics (scMulti-omics) technologies offers unprecedented opportunities to measure various modalities, including RNA expression, protein abundance, gene perturbation, DNA methylation, and chromatin accessibility at single-cell resolution. These advances hold significant potential for breakthroughs by integrating diverse omics modalities. However, the data generated from different omics layers often face challenges due to high dimensionality, heterogeneity, and sparsity, which can adversely impact the accuracy and efficiency of data integration analyses. To address these challenges, we propose a high-precision analysis method called scMGAT (single-cell multiomics data analysis based on multihead graph attention networks). This method effectively coordinates reliable information across multiomics data sets using a multihead attention mechanism, allowing for better management of the heterogeneous characteristics inherent in scMulti-omics data. We evaluated scMGAT's performance on eight sets of real scMulti-omics data, including samples from both human and mouse. The experimental results demonstrate that scMGAT significantly enhances the quality of multiomics data and improves the accuracy of cell-type annotation compared to state-of-the-art methods. scMGAT is now freely accessible at https://github.com/Xingyu-Liao/scMGAT.
期刊介绍:
The journal is particularly interested in studies on the design and synthesis of new genetic circuits and gene products; computational methods in the design of systems; and integrative applied approaches to understanding disease and metabolism.
Topics may include, but are not limited to:
Design and optimization of genetic systems
Genetic circuit design and their principles for their organization into programs
Computational methods to aid the design of genetic systems
Experimental methods to quantify genetic parts, circuits, and metabolic fluxes
Genetic parts libraries: their creation, analysis, and ontological representation
Protein engineering including computational design
Metabolic engineering and cellular manufacturing, including biomass conversion
Natural product access, engineering, and production
Creative and innovative applications of cellular programming
Medical applications, tissue engineering, and the programming of therapeutic cells
Minimal cell design and construction
Genomics and genome replacement strategies
Viral engineering
Automated and robotic assembly platforms for synthetic biology
DNA synthesis methodologies
Metagenomics and synthetic metagenomic analysis
Bioinformatics applied to gene discovery, chemoinformatics, and pathway construction
Gene optimization
Methods for genome-scale measurements of transcription and metabolomics
Systems biology and methods to integrate multiple data sources
in vitro and cell-free synthetic biology and molecular programming
Nucleic acid engineering.