{"title":"cellstruct: Metrics scores to quantify the biological preservation between two embeddings","authors":"Jui Wan Loh, John F Ouyang","doi":"10.1101/2023.11.13.566337","DOIUrl":null,"url":null,"abstract":"Single-cell transcriptomics (scRNA-seq) is extensively applied in uncovering biological heterogeneity. There are different dimensionality reduction techniques, but it is unclear which method works best in preserving biological information when creating a two-dimensional embedding. Therefore, we implemented cellstruct, which calculates three metrics scores to quantify the global or local biological similarity between a two-dimensional and its corresponding higher-dimensional PCA embeddings at either single-cell or cluster level. These scores pinpoint cell populations with low biological information preservation, in addition to visualizing the cell-cell or cluster-cluster relationships in the PCA embedding. Two study cases illustrate the usefulness of cellstruct in exploratory data analysis.","PeriodicalId":486943,"journal":{"name":"bioRxiv (Cold Spring Harbor Laboratory)","volume":"45 8","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv (Cold Spring Harbor Laboratory)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2023.11.13.566337","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Single-cell transcriptomics (scRNA-seq) is extensively applied in uncovering biological heterogeneity. There are different dimensionality reduction techniques, but it is unclear which method works best in preserving biological information when creating a two-dimensional embedding. Therefore, we implemented cellstruct, which calculates three metrics scores to quantify the global or local biological similarity between a two-dimensional and its corresponding higher-dimensional PCA embeddings at either single-cell or cluster level. These scores pinpoint cell populations with low biological information preservation, in addition to visualizing the cell-cell or cluster-cluster relationships in the PCA embedding. Two study cases illustrate the usefulness of cellstruct in exploratory data analysis.