Recovery of biological signals lost in single-cell batch integration with CellANOVA

IF 33.1 1区 生物学 Q1 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Nature biotechnology Pub Date : 2024-11-26 DOI:10.1038/s41587-024-02463-1
Zhaojun Zhang, Divij Mathew, Tristan L. Lim, Kaishu Mason, Clara Morral Martinez, Sijia Huang, E. John Wherry, Katalin Susztak, Andy J. Minn, Zongming Ma, Nancy R. Zhang
{"title":"Recovery of biological signals lost in single-cell batch integration with CellANOVA","authors":"Zhaojun Zhang, Divij Mathew, Tristan L. Lim, Kaishu Mason, Clara Morral Martinez, Sijia Huang, E. John Wherry, Katalin Susztak, Andy J. Minn, Zongming Ma, Nancy R. Zhang","doi":"10.1038/s41587-024-02463-1","DOIUrl":null,"url":null,"abstract":"<p>Data integration to align cells across batches has become a cornerstone of single-cell data analysis, critically affecting downstream results. Currently, there are no guidelines for when the biological differences between samples are separable from batch effects. Here we show that current paradigms for single-cell data integration remove biologically meaningful variation and introduce distortion. We present a statistical model and computationally scalable algorithm, CellANOVA (cell state space analysis of variance), that harnesses experimental design to explicitly recover biological signals that are erased during single-cell data integration. CellANOVA uses a ‘pool-of-controls’ design concept, applicable across diverse settings, to separate unwanted variation from biological variation of interest and allow the recovery of subtle biological signals. We apply CellANOVA to diverse contexts and validate the recovered biological signals by orthogonal assays. In particular, we show that CellANOVA is effective in the challenging case of single-cell and single-nucleus data integration, where it recovers subtle biological signals that can be validated and replicated by external data.</p>","PeriodicalId":19084,"journal":{"name":"Nature biotechnology","volume":"191 1","pages":""},"PeriodicalIF":33.1000,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature biotechnology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1038/s41587-024-02463-1","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Data integration to align cells across batches has become a cornerstone of single-cell data analysis, critically affecting downstream results. Currently, there are no guidelines for when the biological differences between samples are separable from batch effects. Here we show that current paradigms for single-cell data integration remove biologically meaningful variation and introduce distortion. We present a statistical model and computationally scalable algorithm, CellANOVA (cell state space analysis of variance), that harnesses experimental design to explicitly recover biological signals that are erased during single-cell data integration. CellANOVA uses a ‘pool-of-controls’ design concept, applicable across diverse settings, to separate unwanted variation from biological variation of interest and allow the recovery of subtle biological signals. We apply CellANOVA to diverse contexts and validate the recovered biological signals by orthogonal assays. In particular, we show that CellANOVA is effective in the challenging case of single-cell and single-nucleus data integration, where it recovers subtle biological signals that can be validated and replicated by external data.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用 CellANOVA 恢复单细胞批量整合中丢失的生物信号
整合数据以调整不同批次的细胞已成为单细胞数据分析的基石,对下游结果有着至关重要的影响。目前,对于样本间的生物差异何时可与批次效应分离尚无指导原则。在这里,我们展示了当前的单细胞数据整合范式会去除有生物意义的差异并引入失真。我们提出了一种统计模型和可扩展计算算法 CellANOVA(细胞状态空间方差分析),它利用实验设计明确恢复在单细胞数据整合过程中被抹去的生物信号。CellANOVA 采用 "控制池 "设计理念,适用于各种不同的环境,将不需要的变异与感兴趣的生物变异分离开来,从而恢复微妙的生物信号。我们将 CellANOVA 应用于各种环境,并通过正交试验验证了所恢复的生物信号。我们特别展示了 CellANOVA 在具有挑战性的单细胞和单核数据整合中的有效性,它能恢复微妙的生物信号,这些信号可以通过外部数据进行验证和复制。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Nature biotechnology
Nature biotechnology 工程技术-生物工程与应用微生物
CiteScore
63.00
自引率
1.70%
发文量
382
审稿时长
3 months
期刊介绍: Nature Biotechnology is a monthly journal that focuses on the science and business of biotechnology. It covers a wide range of topics including technology/methodology advancements in the biological, biomedical, agricultural, and environmental sciences. The journal also explores the commercial, political, ethical, legal, and societal aspects of this research. The journal serves researchers by providing peer-reviewed research papers in the field of biotechnology. It also serves the business community by delivering news about research developments. This approach ensures that both the scientific and business communities are well-informed and able to stay up-to-date on the latest advancements and opportunities in the field. Some key areas of interest in which the journal actively seeks research papers include molecular engineering of nucleic acids and proteins, molecular therapy, large-scale biology, computational biology, regenerative medicine, imaging technology, analytical biotechnology, applied immunology, food and agricultural biotechnology, and environmental biotechnology. In summary, Nature Biotechnology is a comprehensive journal that covers both the scientific and business aspects of biotechnology. It strives to provide researchers with valuable research papers and news while also delivering important scientific advancements to the business community.
期刊最新文献
Recovery of biological signals lost in single-cell batch integration with CellANOVA Intravenous administration of blood–brain barrier-crossing conjugates facilitate biomacromolecule transport into central nervous system Lipid nanoparticle-mediated mRNA delivery to CD34+ cells in rhesus monkeys Pooled CRISPR screens with joint single-nucleus chromatin accessibility and transcriptome profiling Lab-grown breast milk
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1