scATAcat: cell-type annotation for scATAC-seq data.

IF 2.8 Q1 GENETICS & HEREDITY NAR Genomics and Bioinformatics Pub Date : 2024-10-08 eCollection Date: 2024-09-01 DOI:10.1093/nargab/lqae135

Aybuge Altay, Martin Vingron

引用次数: 0

Abstract

Cells whose accessibility landscape has been profiled with scATAC-seq cannot readily be annotated to a particular cell type. In fact, annotating cell-types in scATAC-seq data is a challenging task since, unlike in scRNA-seq data, we lack knowledge of 'marker regions' which could be used for cell-type annotation. Current annotation methods typically translate accessibility to expression space and rely on gene expression patterns. We propose a novel approach, scATAcat, that leverages characterized bulk ATAC-seq data as prototypes to annotate scATAC-seq data. To mitigate the inherent sparsity of single-cell data, we aggregate cells that belong to the same cluster and create pseudobulk. To demonstrate the feasibility of our approach we collected a number of datasets with respective annotations to quantify the results and evaluate performance for scATAcat. scATAcat is available as a python package at https://github.com/aybugealtay/scATAcat.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

scATAcat：用于 scATAC-seq 数据的细胞类型注释。

用 scATAC-seq 分析了细胞的可及性图谱的细胞不能轻易地被注释为特定的细胞类型。事实上，在 scATAC-seq 数据中注释细胞类型是一项具有挑战性的任务，因为与 scRNA-seq 数据不同，我们缺乏可用于细胞类型注释的 "标记区 "知识。目前的注释方法通常将可及性转化为表达空间，并依赖于基因表达模式。我们提出了一种新方法--scATAcat，它利用特征化的大容量 ATAC-seq 数据作为原型来注释 scATAC-seq 数据。为了减轻单细胞数据固有的稀疏性，我们将属于同一群组的细胞聚合在一起，创建伪群组。为了证明我们方法的可行性，我们收集了一些数据集，并分别进行了注释，以量化结果并评估 scATAcat 的性能。scATAcat 是一个 python 软件包，可在 https://github.com/aybugealtay/scATAcat 上下载。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊