Directly selecting cell-type marker genes for single-cell clustering analyses.

IF 4.3 Q1 BIOCHEMICAL RESEARCH METHODS Cell Reports Methods Pub Date : 2024-07-15 Epub Date: 2024-07-08 DOI:10.1016/j.crmeth.2024.100810
Zihao Chen, Changhu Wang, Siyuan Huang, Yang Shi, Ruibin Xi
{"title":"Directly selecting cell-type marker genes for single-cell clustering analyses.","authors":"Zihao Chen, Changhu Wang, Siyuan Huang, Yang Shi, Ruibin Xi","doi":"10.1016/j.crmeth.2024.100810","DOIUrl":null,"url":null,"abstract":"<p><p>In single-cell RNA sequencing (scRNA-seq) studies, cell types and their marker genes are often identified by clustering and differentially expressed gene (DEG) analysis. A common practice is to select genes using surrogate criteria such as variance and deviance, then cluster them using selected genes and detect markers by DEG analysis assuming known cell types. The surrogate criteria can miss important genes or select unimportant genes, while DEG analysis has the selection-bias problem. We present Festem, a statistical method for the direct selection of cell-type markers for downstream clustering. Festem distinguishes marker genes with heterogeneous distribution across cells that are cluster informative. Simulation and scRNA-seq applications demonstrate that Festem can sensitively select markers with high precision and enables the identification of cell types often missed by other methods. In a large intrahepatic cholangiocarcinoma dataset, we identify diverse CD8<sup>+</sup> T cell types and potential prognostic marker genes.</p>","PeriodicalId":29773,"journal":{"name":"Cell Reports Methods","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11294843/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cell Reports Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.crmeth.2024.100810","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/8 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

In single-cell RNA sequencing (scRNA-seq) studies, cell types and their marker genes are often identified by clustering and differentially expressed gene (DEG) analysis. A common practice is to select genes using surrogate criteria such as variance and deviance, then cluster them using selected genes and detect markers by DEG analysis assuming known cell types. The surrogate criteria can miss important genes or select unimportant genes, while DEG analysis has the selection-bias problem. We present Festem, a statistical method for the direct selection of cell-type markers for downstream clustering. Festem distinguishes marker genes with heterogeneous distribution across cells that are cluster informative. Simulation and scRNA-seq applications demonstrate that Festem can sensitively select markers with high precision and enables the identification of cell types often missed by other methods. In a large intrahepatic cholangiocarcinoma dataset, we identify diverse CD8+ T cell types and potential prognostic marker genes.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
直接选择细胞类型标记基因进行单细胞聚类分析。
在单细胞 RNA 测序(scRNA-seq)研究中,通常通过聚类和差异表达基因(DEG)分析来确定细胞类型及其标记基因。常见的做法是利用方差和偏差等替代标准选择基因,然后利用所选基因进行聚类,并假定已知的细胞类型,通过 DEG 分析检测标记基因。代用标准可能会遗漏重要基因或选择不重要基因,而 DEG 分析则存在选择偏差问题。我们提出的 Festem 是一种直接选择细胞类型标记进行下游聚类的统计方法。Festem 能区分在细胞中分布不均、具有聚类信息的标记基因。模拟和 scRNA-seq 应用证明,Festem 可以灵敏地选择高精度的标记,并能识别其他方法经常遗漏的细胞类型。在一个大型肝内胆管癌数据集中,我们发现了多种 CD8+ T 细胞类型和潜在的预后标记基因。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Cell Reports Methods
Cell Reports Methods Chemistry (General), Biochemistry, Genetics and Molecular Biology (General), Immunology and Microbiology (General)
CiteScore
3.80
自引率
0.00%
发文量
0
审稿时长
111 days
期刊最新文献
Optimized full-spectrum flow cytometry panel for deep immunophenotyping of murine lungs. A deep learning framework combining molecular image and protein structural representations identifies candidate drugs for pain. Adult zebrafish can learn Morris water maze-like tasks in a two-dimensional virtual reality system. Recovering single-cell expression profiles from spatial transcriptomics with scResolve. Mimicking and analyzing the tumor microenvironment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1