Depth-corrected multi-factor dissection of chromatin accessibility for scATAC-seq data with PACS

IF 15.7 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Nature Communications Pub Date : 2025-01-05 DOI:10.1038/s41467-024-55580-5
Zhen Miao, Jianqiao Wang, Kernyu Park, Da Kuang, Junhyong Kim
{"title":"Depth-corrected multi-factor dissection of chromatin accessibility for scATAC-seq data with PACS","authors":"Zhen Miao, Jianqiao Wang, Kernyu Park, Da Kuang, Junhyong Kim","doi":"10.1038/s41467-024-55580-5","DOIUrl":null,"url":null,"abstract":"<p>Single cell ATAC-seq (scATAC-seq) experimental designs have become increasingly complex, with multiple factors that might affect chromatin accessibility, including genotype, cell type, tissue of origin, sample location, batch, etc., whose compound effects are difficult to test by existing methods. In addition, current scATAC-seq data present statistical difficulties due to their sparsity and variations in individual sequence capture. To address these problems, we present a zero-adjusted statistical model, Probability model of Accessible Chromatin of Single cells (PACS), that allows complex hypothesis testing of accessibility-modulating factors while accounting for sparse and incomplete data. For differential accessibility analysis, PACS controls the false positive rate and achieves a 17% to 122% higher power on average than existing tools. We demonstrate the effectiveness of PACS through several analysis tasks, including supervised cell type annotation, compound hypothesis testing, batch effect correction, and spatiotemporal modeling. We apply PACS to datasets from various tissues and show its ability to reveal previously undiscovered insights in scATAC-seq data.</p>","PeriodicalId":19066,"journal":{"name":"Nature Communications","volume":"98 1","pages":""},"PeriodicalIF":15.7000,"publicationDate":"2025-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Communications","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41467-024-55580-5","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Single cell ATAC-seq (scATAC-seq) experimental designs have become increasingly complex, with multiple factors that might affect chromatin accessibility, including genotype, cell type, tissue of origin, sample location, batch, etc., whose compound effects are difficult to test by existing methods. In addition, current scATAC-seq data present statistical difficulties due to their sparsity and variations in individual sequence capture. To address these problems, we present a zero-adjusted statistical model, Probability model of Accessible Chromatin of Single cells (PACS), that allows complex hypothesis testing of accessibility-modulating factors while accounting for sparse and incomplete data. For differential accessibility analysis, PACS controls the false positive rate and achieves a 17% to 122% higher power on average than existing tools. We demonstrate the effectiveness of PACS through several analysis tasks, including supervised cell type annotation, compound hypothesis testing, batch effect correction, and spatiotemporal modeling. We apply PACS to datasets from various tissues and show its ability to reveal previously undiscovered insights in scATAC-seq data.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用PACS对scATAC-seq数据的染色质可及性进行深度校正的多因素剖析
单细胞ATAC-seq (scATAC-seq)实验设计越来越复杂,可能影响染色质可及性的因素有多种,包括基因型、细胞类型、组织来源、样品位置、批次等,现有方法难以检测其复合效应。此外,目前的scATAC-seq数据由于其稀疏性和单个序列捕获的变化而存在统计困难。为了解决这些问题,我们提出了一个零调整的统计模型,即单个细胞可接近染色质的概率模型(PACS),该模型允许在考虑稀疏和不完整数据的情况下对可接近性调节因子进行复杂的假设检验。对于差异可及性分析,PACS可以控制误报率,比现有工具平均提高17%至122%的功率。我们通过几个分析任务证明了PACS的有效性,包括监督细胞类型注释、复合假设检验、批量效果校正和时空建模。我们将PACS应用于来自各种组织的数据集,并展示了其揭示scATAC-seq数据中先前未被发现的见解的能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Nature Communications
Nature Communications Biological Science Disciplines-
CiteScore
24.90
自引率
2.40%
发文量
6928
审稿时长
3.7 months
期刊介绍: Nature Communications, an open-access journal, publishes high-quality research spanning all areas of the natural sciences. Papers featured in the journal showcase significant advances relevant to specialists in each respective field. With a 2-year impact factor of 16.6 (2022) and a median time of 8 days from submission to the first editorial decision, Nature Communications is committed to rapid dissemination of research findings. As a multidisciplinary journal, it welcomes contributions from biological, health, physical, chemical, Earth, social, mathematical, applied, and engineering sciences, aiming to highlight important breakthroughs within each domain.
期刊最新文献
Sustainable recycling of polyester wastes using a coordinatively unsaturated Zn catalyst. PGC-derived migrasomes couple PGC proliferation with migration. Polyfluorophenyl quinoxaline derived acceptors with long exciton diffusion length for efficient thin-film/thick-film organic solar cells. Maternal antibodies regulate the establishment of murine oral and salivary mucosal immunity. Primary cilia in the hypothalamic AgRP neurons mediate metabolic effects of butyrate
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1