基因表达模式的最大熵模型

Camilla Sarra, Leopoldo Sarra, Luca Di Carlo, Trevor GrandPre, Yaojun Zhang, Curtis G. Callan Jr., William Bialek
{"title":"基因表达模式的最大熵模型","authors":"Camilla Sarra, Leopoldo Sarra, Luca Di Carlo, Trevor GrandPre, Yaojun Zhang, Curtis G. Callan Jr., William Bialek","doi":"arxiv-2408.08037","DOIUrl":null,"url":null,"abstract":"New experimental methods make it possible to measure the expression levels of\nmany genes, simultaneously, in snapshots from thousands or even millions of\nindividual cells. Current approaches to analyze these experiments involve\nclustering or low-dimensional projections. Here we use the principle of maximum\nentropy to obtain a probabilistic description that captures the observed\npresence or absence of mRNAs from hundreds of genes in cells from the mammalian\nbrain. We construct the Ising model compatible with experimental means and\npairwise correlations, and validate it by showing that it gives good\npredictions for higher-order statistics. We notice that the probability\ndistribution of cell states has many local maxima. By labeling cell states\naccording to the associated maximum, we obtain a cell classification that\nagrees well with previous results that use traditional clustering techniques.\nOur results provide quantitative descriptions of gene expression statistics and\ninterpretable criteria for defining cell classes, supporting the hypothesis\nthat cell classes emerge from the collective interaction of gene expression\nlevels.","PeriodicalId":501325,"journal":{"name":"arXiv - QuanBio - Molecular Networks","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Maximum entropy models for patterns of gene expression\",\"authors\":\"Camilla Sarra, Leopoldo Sarra, Luca Di Carlo, Trevor GrandPre, Yaojun Zhang, Curtis G. Callan Jr., William Bialek\",\"doi\":\"arxiv-2408.08037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"New experimental methods make it possible to measure the expression levels of\\nmany genes, simultaneously, in snapshots from thousands or even millions of\\nindividual cells. Current approaches to analyze these experiments involve\\nclustering or low-dimensional projections. Here we use the principle of maximum\\nentropy to obtain a probabilistic description that captures the observed\\npresence or absence of mRNAs from hundreds of genes in cells from the mammalian\\nbrain. We construct the Ising model compatible with experimental means and\\npairwise correlations, and validate it by showing that it gives good\\npredictions for higher-order statistics. We notice that the probability\\ndistribution of cell states has many local maxima. By labeling cell states\\naccording to the associated maximum, we obtain a cell classification that\\nagrees well with previous results that use traditional clustering techniques.\\nOur results provide quantitative descriptions of gene expression statistics and\\ninterpretable criteria for defining cell classes, supporting the hypothesis\\nthat cell classes emerge from the collective interaction of gene expression\\nlevels.\",\"PeriodicalId\":501325,\"journal\":{\"name\":\"arXiv - QuanBio - Molecular Networks\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Molecular Networks\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.08037\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Molecular Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.08037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

新的实验方法使同时测量数千甚至数百万个单个细胞快照中许多基因的表达水平成为可能。目前分析这些实验的方法包括聚类或低维投影。在这里,我们利用最大熵原理获得了一种概率描述,它捕捉到了在哺乳动物脑细胞中观察到的数百个基因的 mRNA 的存在或不存在。我们构建了与实验均值和成对相关性兼容的伊辛模型,并通过证明它能很好地预测高阶统计量来验证它。我们注意到细胞状态的概率分布有许多局部最大值。我们的结果提供了基因表达统计的定量描述和可解释的细胞类别定义标准,支持了细胞类别产生于基因表达水平的集体相互作用这一假设。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Maximum entropy models for patterns of gene expression
New experimental methods make it possible to measure the expression levels of many genes, simultaneously, in snapshots from thousands or even millions of individual cells. Current approaches to analyze these experiments involve clustering or low-dimensional projections. Here we use the principle of maximum entropy to obtain a probabilistic description that captures the observed presence or absence of mRNAs from hundreds of genes in cells from the mammalian brain. We construct the Ising model compatible with experimental means and pairwise correlations, and validate it by showing that it gives good predictions for higher-order statistics. We notice that the probability distribution of cell states has many local maxima. By labeling cell states according to the associated maximum, we obtain a cell classification that agrees well with previous results that use traditional clustering techniques. Our results provide quantitative descriptions of gene expression statistics and interpretable criteria for defining cell classes, supporting the hypothesis that cell classes emerge from the collective interaction of gene expression levels.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Multi-variable control to mitigate loads in CRISPRa networks Some bounds on positive equilibria in mass action networks Non-explosivity of endotactic stochastic reaction systems Limits on the computational expressivity of non-equilibrium biophysical processes When lowering temperature, the in vivo circadian clock in cyanobacteria follows and surpasses the in vitro protein clock trough the Hopf bifurcation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1