{"title":"高维细胞计数数据全自动预门控方法的初步研究","authors":"A. Suwalska, J. Polańska","doi":"10.1109/BIBE52308.2021.9635492","DOIUrl":null,"url":null,"abstract":"Mass cytometry as an advanced single-cell analysis technology can produce high-dimensional data consisting of millions of cells and more than 50 features. Therefore the cell subtypes identification is difficult and impossible to be done manually. Each step of the analysis affect the results and may cause a loss of rare sub-populations of interest. One of the first steps in the analysis is pre-gating which involves filtering out unwanted measurements like debris or doublets. The existing semi-automated solutions for pre-gating require some parameters to be set which may lead to different results. Moreover, the tools often use downsampling from millions to thousands of cells. Despite the existing methods, there is still a need for a fully automated tool that will be independent of sample size. In the study, we developed a solution based on Gaussian Mixture Model (GMM) decomposition and grouping of its components into clusters. Based on the clusters we propose filtration criteria that identify measurements to be removed from the analysis. The algorithm was validated on two independent public datasets. The results are promising and reproducible, leaving intact, live cells that can be further analyzed.","PeriodicalId":343724,"journal":{"name":"2021 IEEE 21st International Conference on Bioinformatics and Bioengineering (BIBE)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Preliminary study for a fully automated pre-gating method for high-dimensional mass cytometry data\",\"authors\":\"A. Suwalska, J. Polańska\",\"doi\":\"10.1109/BIBE52308.2021.9635492\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Mass cytometry as an advanced single-cell analysis technology can produce high-dimensional data consisting of millions of cells and more than 50 features. Therefore the cell subtypes identification is difficult and impossible to be done manually. Each step of the analysis affect the results and may cause a loss of rare sub-populations of interest. One of the first steps in the analysis is pre-gating which involves filtering out unwanted measurements like debris or doublets. The existing semi-automated solutions for pre-gating require some parameters to be set which may lead to different results. Moreover, the tools often use downsampling from millions to thousands of cells. Despite the existing methods, there is still a need for a fully automated tool that will be independent of sample size. In the study, we developed a solution based on Gaussian Mixture Model (GMM) decomposition and grouping of its components into clusters. Based on the clusters we propose filtration criteria that identify measurements to be removed from the analysis. The algorithm was validated on two independent public datasets. The results are promising and reproducible, leaving intact, live cells that can be further analyzed.\",\"PeriodicalId\":343724,\"journal\":{\"name\":\"2021 IEEE 21st International Conference on Bioinformatics and Bioengineering (BIBE)\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 21st International Conference on Bioinformatics and Bioengineering (BIBE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBE52308.2021.9635492\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 21st International Conference on Bioinformatics and Bioengineering (BIBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE52308.2021.9635492","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Preliminary study for a fully automated pre-gating method for high-dimensional mass cytometry data
Mass cytometry as an advanced single-cell analysis technology can produce high-dimensional data consisting of millions of cells and more than 50 features. Therefore the cell subtypes identification is difficult and impossible to be done manually. Each step of the analysis affect the results and may cause a loss of rare sub-populations of interest. One of the first steps in the analysis is pre-gating which involves filtering out unwanted measurements like debris or doublets. The existing semi-automated solutions for pre-gating require some parameters to be set which may lead to different results. Moreover, the tools often use downsampling from millions to thousands of cells. Despite the existing methods, there is still a need for a fully automated tool that will be independent of sample size. In the study, we developed a solution based on Gaussian Mixture Model (GMM) decomposition and grouping of its components into clusters. Based on the clusters we propose filtration criteria that identify measurements to be removed from the analysis. The algorithm was validated on two independent public datasets. The results are promising and reproducible, leaving intact, live cells that can be further analyzed.