{"title":"A Hashing-Based Framework for Enhancing Cluster Delineation of High-Dimensional Single-Cell Profiles.","authors":"Xiao Liu, Ting Zhang, Ziyang Tan, Antony R Warden, Shanhe Li, Edwin Cheung, Xianting Ding","doi":"10.1007/s43657-022-00056-z","DOIUrl":null,"url":null,"abstract":"<p><p>Although many methods have been developed to explore the function of cells by clustering high-dimensional (HD) single-cell omics data, the inconspicuously differential expressions of biomarkers of proteins or genes across all cells disturb the cell cluster delineation and downstream analysis. Here, we introduce a hashing-based framework to improve the delineation of cell clusters, which is based on the hypothesis that one variable with no significant differences can be decomposed into more diversely latent variables to distinguish cells. By projecting the original data into a sparse HD space, fly and densefly hashing preprocessing retain the local structure of data, and improve the cluster delineation of existing clustering methods, such as PhenoGraph. Moreover, the analyses on mass cytometry dataset show that our hashing-based framework manages to unveil new hidden heterogeneities in cell clusters. The proposed framework promotes the utilization of cell biomarkers and enriches the biological findings by introducing more latent variables.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s43657-022-00056-z.</p>","PeriodicalId":74435,"journal":{"name":"Phenomics (Cham, Switzerland)","volume":"2 5","pages":"323-335"},"PeriodicalIF":3.7000,"publicationDate":"2022-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9590516/pdf/43657_2022_Article_56.pdf","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Phenomics (Cham, Switzerland)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s43657-022-00056-z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/10/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 2
Abstract
Although many methods have been developed to explore the function of cells by clustering high-dimensional (HD) single-cell omics data, the inconspicuously differential expressions of biomarkers of proteins or genes across all cells disturb the cell cluster delineation and downstream analysis. Here, we introduce a hashing-based framework to improve the delineation of cell clusters, which is based on the hypothesis that one variable with no significant differences can be decomposed into more diversely latent variables to distinguish cells. By projecting the original data into a sparse HD space, fly and densefly hashing preprocessing retain the local structure of data, and improve the cluster delineation of existing clustering methods, such as PhenoGraph. Moreover, the analyses on mass cytometry dataset show that our hashing-based framework manages to unveil new hidden heterogeneities in cell clusters. The proposed framework promotes the utilization of cell biomarkers and enriches the biological findings by introducing more latent variables.
Supplementary information: The online version contains supplementary material available at 10.1007/s43657-022-00056-z.