{"title":"A subspace clustering method for satisfying stoimetric constraints in scRNA -seq","authors":"Angela Huang, Junhyong Kim","doi":"10.1109/BIBE52308.2021.9635324","DOIUrl":null,"url":null,"abstract":"With the advent of single-cell RNA-sequencing, researchers now have the ability to define cell types from large amounts of transcriptome information. Over the years, various clustering algorithms have been designed. Currently, most clustering algorithms measure cell-to-cell similarities using distance metrics based on the assumption that each cluster is comprised of “nearby” neighbors. In effect, clusters are a collection of similar cells in the embedded metric. Here, we propose that biological clusters should be comprised of sets of cells that satisfy a set of stochiometric constraints, whose intersections define a cell type. We propose to model each cell population with a single affine subspace, where all cells of the same type share a common set of linear constraints. We present an algorithm that leverages this subspace structure and learns a cell-to-cell affinity matrix based on notions of subspace similarity. We simulate scRNA-seq data according to the subspace model and benchmark our algorithm against preexisting methods. We further test our algorithm on an in-house C. elegans dataset and show recovery of information on both cell type and developmental time.","PeriodicalId":343724,"journal":{"name":"2021 IEEE 21st International Conference on Bioinformatics and Bioengineering (BIBE)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 21st International Conference on Bioinformatics and Bioengineering (BIBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE52308.2021.9635324","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the advent of single-cell RNA-sequencing, researchers now have the ability to define cell types from large amounts of transcriptome information. Over the years, various clustering algorithms have been designed. Currently, most clustering algorithms measure cell-to-cell similarities using distance metrics based on the assumption that each cluster is comprised of “nearby” neighbors. In effect, clusters are a collection of similar cells in the embedded metric. Here, we propose that biological clusters should be comprised of sets of cells that satisfy a set of stochiometric constraints, whose intersections define a cell type. We propose to model each cell population with a single affine subspace, where all cells of the same type share a common set of linear constraints. We present an algorithm that leverages this subspace structure and learns a cell-to-cell affinity matrix based on notions of subspace similarity. We simulate scRNA-seq data according to the subspace model and benchmark our algorithm against preexisting methods. We further test our algorithm on an in-house C. elegans dataset and show recovery of information on both cell type and developmental time.