Preliminary study for a fully automated pre-gating method for high-dimensional mass cytometry data

A. Suwalska, J. Polańska
Published in: 2021 IEEE 21st International Conference on Bioinformatics and Bioengineering (BIBE)
Publication date: 2021-10-25
DOI: 10.1109/BIBE52308.2021.9635492 (https://doi.org/10.1109/BIBE52308.2021.9635492)

Abstract

Mass cytometry, an advanced single-cell analysis technology, can produce high-dimensional data comprising millions of cells and more than 50 features. Identifying cell subtypes in such data is therefore difficult and cannot be done manually. Each step of the analysis affects the results and may cause the loss of rare sub-populations of interest. One of the first steps is pre-gating, which filters out unwanted measurements such as debris or doublets. Existing semi-automated pre-gating solutions require some parameters to be set, and the chosen values may lead to different results. Moreover, these tools often downsample the data from millions to thousands of cells. There is therefore still a need for a fully automated tool that is independent of sample size. In this study, we developed a solution based on Gaussian Mixture Model (GMM) decomposition and the grouping of its components into clusters. Based on these clusters, we propose filtration criteria that identify the measurements to be removed from the analysis. The algorithm was validated on two independent public datasets. The results are promising and reproducible, leaving intact, live cells that can be further analyzed.
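The abstract does not spell out the decomposition, the grouping rule, or the filtration criteria, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' algorithm. It assumes a single one-dimensional channel, a fixed number of components, a hypothetical rule that merges components whose means lie close together relative to their spreads, and a hypothetical criterion that keeps only the cluster carrying the most mixture weight. The names `fit_gmm_1d`, `group_components`, `pregate`, and the parameter `merge_factor` are illustrative and do not come from the paper.

```python
import math
import random


def gauss_pdf(v, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-(v - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)


def fit_gmm_1d(x, k, n_iter=100):
    """Fit a k-component 1-D Gaussian mixture with plain EM."""
    n = len(x)
    xs = sorted(x)
    # Deterministic start: means at evenly spaced quantiles, shared variance.
    means = [xs[int((j + 0.5) * n / k)] for j in range(k)]
    mu = sum(x) / n
    variances = [max(sum((v - mu) ** 2 for v in x) / n, 1e-6)] * k
    weights = [1.0 / k] * k
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each event.
        resp = []
        for v in x:
            p = [w * gauss_pdf(v, m, s) for w, m, s in zip(weights, means, variances)]
            total = sum(p) or 1e-300
            resp.append([pj / total for pj in p])
        # M-step: re-estimate weights, means, and variances.
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-12
            weights[j] = nj / n
            means[j] = sum(r[j] * v for r, v in zip(resp, x)) / nj
            sq = sum(r[j] * (v - means[j]) ** 2 for r, v in zip(resp, x))
            variances[j] = max(sq / nj, 1e-6)
    return weights, means, variances


def group_components(means, variances, merge_factor=1.5):
    """Assumed grouping rule: neighbouring components share a cluster when
    the gap between their means is small relative to their combined spread."""
    order = sorted(range(len(means)), key=lambda j: means[j])
    cluster_of = {order[0]: 0}
    for prev, cur in zip(order, order[1:]):
        gap = means[cur] - means[prev]
        spread = math.sqrt(variances[prev]) + math.sqrt(variances[cur])
        same = gap < merge_factor * spread
        cluster_of[cur] = cluster_of[prev] if same else cluster_of[prev] + 1
    return cluster_of


def pregate(x, k=3):
    """Assumed filtration criterion: keep events whose most likely component
    belongs to the cluster with the largest total mixture weight."""
    weights, means, variances = fit_gmm_1d(x, k)
    cluster_of = group_components(means, variances)
    mass = {}
    for j, w in enumerate(weights):
        mass[cluster_of[j]] = mass.get(cluster_of[j], 0.0) + w
    keep_cluster = max(mass, key=mass.get)
    kept = []
    for v in x:
        dens = [w * gauss_pdf(v, m, s) for w, m, s in zip(weights, means, variances)]
        best = max(range(k), key=lambda j: dens[j])
        if cluster_of[best] == keep_cluster:
            kept.append(v)
    return kept


# Synthetic stand-in for one channel: a small low-signal "debris" mode
# and a larger "cell" mode. These values are made up for illustration.
rng = random.Random(0)
debris = [rng.gauss(0.0, 0.5) for _ in range(200)]
cells = [rng.gauss(8.0, 1.0) for _ in range(1000)]
kept = pregate(debris + cells)
print(len(kept), "of", len(debris) + len(cells), "events kept")
```

Because the sketch fits all events directly and fixes its settings in code, it needs no downsampling or user tuning on this toy input. Real mass cytometry data would call for multi-channel handling, a principled choice of the number of components, and the criteria described in the full paper.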