首页 > 最新文献

Cell systems最新文献

英文 中文
An efficient, not-only-linear correlation coefficient based on clustering. 基于聚类的高效非线性相关系数
Pub Date : 2024-09-18 Epub Date: 2024-09-06 DOI: 10.1016/j.cels.2024.08.005
Milton Pividori, Marylyn D Ritchie, Diego H Milone, Casey S Greene

Identifying meaningful patterns in data is crucial for understanding complex biological processes, particularly in transcriptomics, where genes with correlated expression often share functions or contribute to disease mechanisms. Traditional correlation coefficients, which primarily capture linear relationships, may overlook important nonlinear patterns. We introduce the clustermatch correlation coefficient (CCC), a not-only-linear coefficient that utilizes clustering to efficiently detect both linear and nonlinear associations. CCC outperforms standard methods by revealing biologically meaningful patterns that linear-only coefficients miss and is faster than state-of-the-art coefficients such as the maximal information coefficient. When applied to human gene expression data from genotype-tissue expression (GTEx), CCC identified robust linear relationships and nonlinear patterns, such as sex-specific differences, that are undetectable by standard methods. Highly ranked gene pairs were enriched for interactions in integrated networks built from protein-protein interactions, transcription factor regulation, and chemical and genetic perturbations, suggesting that CCC can detect functional relationships missed by linear-only approaches. CCC is a highly efficient, next-generation, not-only-linear correlation coefficient for genome-scale data. A record of this paper's transparent peer review process is included in the supplemental information.

在数据中识别有意义的模式对于理解复杂的生物过程至关重要,特别是在转录组学中,具有相关表达的基因往往具有共同的功能或有助于疾病机制。传统的相关系数主要捕捉线性关系,可能会忽略重要的非线性模式。我们引入了聚类匹配相关系数(CCC),这是一种利用聚类有效检测线性和非线性关联的非线性系数。通过揭示纯线性系数所忽略的有生物意义的模式,CCC 优于标准方法,而且比最大信息系数等最先进的系数更快。当应用于基因型-组织表达(GTEx)的人类基因表达数据时,CCC 发现了标准方法无法检测到的稳健线性关系和非线性模式,如性别差异。在由蛋白质-蛋白质相互作用、转录因子调控以及化学和遗传扰动构建的整合网络中,高排序基因对富集了相互作用,这表明 CCC 可以发现纯线性方法所遗漏的功能关系。CCC 是适用于基因组规模数据的高效、新一代非线性相关系数。补充信息中包含了本文透明的同行评审过程记录。
{"title":"An efficient, not-only-linear correlation coefficient based on clustering.","authors":"Milton Pividori, Marylyn D Ritchie, Diego H Milone, Casey S Greene","doi":"10.1016/j.cels.2024.08.005","DOIUrl":"10.1016/j.cels.2024.08.005","url":null,"abstract":"<p><p>Identifying meaningful patterns in data is crucial for understanding complex biological processes, particularly in transcriptomics, where genes with correlated expression often share functions or contribute to disease mechanisms. Traditional correlation coefficients, which primarily capture linear relationships, may overlook important nonlinear patterns. We introduce the clustermatch correlation coefficient (CCC), a not-only-linear coefficient that utilizes clustering to efficiently detect both linear and nonlinear associations. CCC outperforms standard methods by revealing biologically meaningful patterns that linear-only coefficients miss and is faster than state-of-the-art coefficients such as the maximal information coefficient. When applied to human gene expression data from genotype-tissue expression (GTEx), CCC identified robust linear relationships and nonlinear patterns, such as sex-specific differences, that are undetectable by standard methods. Highly ranked gene pairs were enriched for interactions in integrated networks built from protein-protein interactions, transcription factor regulation, and chemical and genetic perturbations, suggesting that CCC can detect functional relationships missed by linear-only approaches. CCC is a highly efficient, next-generation, not-only-linear correlation coefficient for genome-scale data. A record of this paper's transparent peer review process is included in the supplemental information.</p>","PeriodicalId":93929,"journal":{"name":"Cell systems","volume":" ","pages":"854-868.e3"},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142147119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discovery of therapeutic targets in cancer using chromatin accessibility and transcriptomic data. 利用染色质可及性和转录组数据发现癌症治疗靶点。
Pub Date : 2024-09-18 Epub Date: 2024-09-04 DOI: 10.1016/j.cels.2024.08.004
Andre Neil Forbes, Duo Xu, Sandra Cohen, Priya Pancholi, Ekta Khurana

Most cancer types lack targeted therapeutic options, and when first-line targeted therapies are available, treatment resistance is a huge challenge. Recent technological advances enable the use of assay for transposase-accessible chromatin with sequencing (ATAC-seq) and RNA sequencing (RNA-seq) on patient tissue in a high-throughput manner. Here, we present a computational approach that leverages these datasets to identify drug targets based on tumor lineage. We constructed gene regulatory networks for 371 patients of 22 cancer types using machine learning approaches trained with three-dimensional genomic data for enhancer-to-promoter contacts. Next, we identified the key transcription factors (TFs) in these networks, which are used to find therapeutic vulnerabilities, by direct targeting of either TFs or the proteins that they interact with. We validated four candidates identified for neuroendocrine, liver, and renal cancers, which have a dismal prognosis with current therapeutic options.

大多数癌症类型都缺乏靶向治疗选择,即使有一线靶向治疗药物,耐药性也是一个巨大的挑战。最近的技术进步使我们能以高通量的方式在患者组织上使用转座酶可访问染色质测序(ATAC-seq)和RNA测序(RNA-seq)。在这里,我们提出了一种计算方法,利用这些数据集来识别基于肿瘤谱系的药物靶点。我们利用三维基因组数据训练的机器学习方法,为 22 种癌症类型的 371 名患者构建了基因调控网络,以了解增强子与启动子之间的联系。接下来,我们确定了这些网络中的关键转录因子(TFs),通过直接靶向TFs或与其相互作用的蛋白质,找到治疗漏洞。我们验证了为神经内分泌癌、肝癌和肾癌确定的四种候选药物,目前的治疗方案对这些癌症的预后效果不佳。
{"title":"Discovery of therapeutic targets in cancer using chromatin accessibility and transcriptomic data.","authors":"Andre Neil Forbes, Duo Xu, Sandra Cohen, Priya Pancholi, Ekta Khurana","doi":"10.1016/j.cels.2024.08.004","DOIUrl":"10.1016/j.cels.2024.08.004","url":null,"abstract":"<p><p>Most cancer types lack targeted therapeutic options, and when first-line targeted therapies are available, treatment resistance is a huge challenge. Recent technological advances enable the use of assay for transposase-accessible chromatin with sequencing (ATAC-seq) and RNA sequencing (RNA-seq) on patient tissue in a high-throughput manner. Here, we present a computational approach that leverages these datasets to identify drug targets based on tumor lineage. We constructed gene regulatory networks for 371 patients of 22 cancer types using machine learning approaches trained with three-dimensional genomic data for enhancer-to-promoter contacts. Next, we identified the key transcription factors (TFs) in these networks, which are used to find therapeutic vulnerabilities, by direct targeting of either TFs or the proteins that they interact with. We validated four candidates identified for neuroendocrine, liver, and renal cancers, which have a dismal prognosis with current therapeutic options.</p>","PeriodicalId":93929,"journal":{"name":"Cell systems","volume":" ","pages":"824-837.e6"},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11415227/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142142057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Promoter DNA methylation and transcription factor condensation are linked to transcriptional memory in mammalian cells. 启动子 DNA 甲基化和转录因子凝集与哺乳动物细胞的转录记忆有关。
Pub Date : 2024-09-18 Epub Date: 2024-09-06 DOI: 10.1016/j.cels.2024.08.007
Shenqi Fan, Liang Ma, Chengzhi Song, Xu Han, Bijunyao Zhong, Yihan Lin

The regulation of genes can be mathematically described by input-output functions that are typically assumed to be time invariant. This fundamental assumption underpins the design of synthetic gene circuits and the quantitative understanding of natural gene regulatory networks. Here, we found that this assumption is challenged in mammalian cells. We observed that a synthetic reporter gene can exhibit unexpected transcriptional memory, leading to a shift in the dose-response curve upon a second induction. Mechanistically, we investigated the cis-dependency of transcriptional memory, revealing the necessity of promoter DNA methylation in establishing memory. Furthermore, we showed that the synthetic transcription factor's effective DNA binding affinity underlies trans-dependency, which is associated with its capacity to undergo biomolecular condensation. These principles enabled modulating memory by perturbing either cis- or trans-regulation of genes. Together, our findings suggest the potential pervasiveness of transcriptional memory and implicate the need to model mammalian gene regulation with time-varying input-output functions. A record of this paper's transparent peer review process is included in the supplemental information.

基因的调控可以用输入-输出函数进行数学描述,这些函数通常被假定为时间不变。这一基本假设是设计合成基因回路和定量理解天然基因调控网络的基础。在这里,我们发现这一假设在哺乳动物细胞中受到了挑战。我们观察到,合成报告基因会表现出意想不到的转录记忆,导致剂量反应曲线在第二次诱导时发生移动。从机理上讲,我们研究了转录记忆的顺式依赖性,揭示了启动子 DNA 甲基化对建立记忆的必要性。此外,我们还发现合成转录因子的有效 DNA 结合亲和力是反式依赖性的基础,而反式依赖性与其进行生物分子缩聚的能力有关。这些原理使我们能够通过干扰基因的顺式或反式调控来调节记忆。总之,我们的研究结果表明转录记忆具有潜在的普遍性,并暗示了利用时变输入-输出功能来模拟哺乳动物基因调控的必要性。本文的同行评审过程透明,其记录见补充信息。
{"title":"Promoter DNA methylation and transcription factor condensation are linked to transcriptional memory in mammalian cells.","authors":"Shenqi Fan, Liang Ma, Chengzhi Song, Xu Han, Bijunyao Zhong, Yihan Lin","doi":"10.1016/j.cels.2024.08.007","DOIUrl":"10.1016/j.cels.2024.08.007","url":null,"abstract":"<p><p>The regulation of genes can be mathematically described by input-output functions that are typically assumed to be time invariant. This fundamental assumption underpins the design of synthetic gene circuits and the quantitative understanding of natural gene regulatory networks. Here, we found that this assumption is challenged in mammalian cells. We observed that a synthetic reporter gene can exhibit unexpected transcriptional memory, leading to a shift in the dose-response curve upon a second induction. Mechanistically, we investigated the cis-dependency of transcriptional memory, revealing the necessity of promoter DNA methylation in establishing memory. Furthermore, we showed that the synthetic transcription factor's effective DNA binding affinity underlies trans-dependency, which is associated with its capacity to undergo biomolecular condensation. These principles enabled modulating memory by perturbing either cis- or trans-regulation of genes. Together, our findings suggest the potential pervasiveness of transcriptional memory and implicate the need to model mammalian gene regulation with time-varying input-output functions. A record of this paper's transparent peer review process is included in the supplemental information.</p>","PeriodicalId":93929,"journal":{"name":"Cell systems","volume":" ","pages":"808-823.e6"},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142147121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Versatile system cores as a conceptual basis for generality in cell and developmental biology. 多功能系统核心是细胞和发育生物学通用性的概念基础。
Pub Date : 2024-09-18 Epub Date: 2024-09-04 DOI: 10.1016/j.cels.2024.08.001
Elisa Gallo, Stefano De Renzis, James Sharpe, Roberto Mayor, Jonas Hartmann

The discovery of general principles underlying the complexity and diversity of cellular and developmental systems is a central and long-standing aim of biology. While new technologies collect data at an ever-accelerating rate, there is growing concern that conceptual progress is not keeping pace. We contend that this is due to a paucity of conceptual frameworks that support meaningful generalizations. This led us to develop the core and periphery (C&P) hypothesis, which posits that many biological systems can be decomposed into a highly versatile core with a large behavioral repertoire and a specific periphery that configures said core to perform one particular function. Versatile cores tend to be widely reused across biology, which confers generality to theories describing them. Here, we introduce this concept and describe examples at multiple scales, including Turing patterning, actomyosin dynamics, multi-cellular morphogenesis, and vertebrate gastrulation. We also sketch its evolutionary basis and discuss key implications and open questions. We propose that the C&P hypothesis could unlock new avenues of conceptual progress in mesoscale biology.

发现细胞和发育系统的复杂性和多样性所蕴含的一般原理是生物学长期以来的核心目标。虽然新技术收集数据的速度不断加快,但人们越来越担心概念方面的进展跟不上步伐。我们认为,这是由于缺乏支持有意义的概括的概念框架。因此,我们提出了 "核心与外围"(C&P)假说,认为许多生物系统都可以分解为一个具有大量行为剧目的多功能核心和一个特定的外围,前者将上述核心配置为执行某一特定功能。多功能核心往往在整个生物学中被广泛重复使用,这就赋予了描述它们的理论以通用性。在这里,我们介绍了这一概念,并描述了多个尺度上的实例,包括图灵模式、肌动蛋白动力学、多细胞形态发生和脊椎动物胃形成。我们还概述了它的进化基础,并讨论了其主要影响和悬而未决的问题。我们认为,C&P假说可以为中尺度生物学的概念进步开辟新的途径。
{"title":"Versatile system cores as a conceptual basis for generality in cell and developmental biology.","authors":"Elisa Gallo, Stefano De Renzis, James Sharpe, Roberto Mayor, Jonas Hartmann","doi":"10.1016/j.cels.2024.08.001","DOIUrl":"10.1016/j.cels.2024.08.001","url":null,"abstract":"<p><p>The discovery of general principles underlying the complexity and diversity of cellular and developmental systems is a central and long-standing aim of biology. While new technologies collect data at an ever-accelerating rate, there is growing concern that conceptual progress is not keeping pace. We contend that this is due to a paucity of conceptual frameworks that support meaningful generalizations. This led us to develop the core and periphery (C&P) hypothesis, which posits that many biological systems can be decomposed into a highly versatile core with a large behavioral repertoire and a specific periphery that configures said core to perform one particular function. Versatile cores tend to be widely reused across biology, which confers generality to theories describing them. Here, we introduce this concept and describe examples at multiple scales, including Turing patterning, actomyosin dynamics, multi-cellular morphogenesis, and vertebrate gastrulation. We also sketch its evolutionary basis and discuss key implications and open questions. We propose that the C&P hypothesis could unlock new avenues of conceptual progress in mesoscale biology.</p>","PeriodicalId":93929,"journal":{"name":"Cell systems","volume":" ","pages":"790-807"},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142142059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Imputing abundance of over 2,500 surface proteins from single-cell transcriptomes with context-agnostic zero-shot deep ensembles. 利用上下文无关的零点深度集合,从单细胞转录组中推算出 2,500 多种表面蛋白质的丰度。
Pub Date : 2024-09-18 Epub Date: 2024-09-06 DOI: 10.1016/j.cels.2024.08.006
Ruoqiao Chen, Jiayu Zhou, Bin Chen

Cell surface proteins serve as primary drug targets and cell identity markers. Techniques such as CITE-seq (cellular indexing of transcriptomes and epitopes by sequencing) have enabled the simultaneous quantification of surface protein abundance and transcript expression within individual cells. The published data have been utilized to train machine learning models for predicting surface protein abundance solely from transcript expression. However, the small scale of proteins predicted and the poor generalization ability of these computational approaches across diverse contexts (e.g., different tissues/disease states) impede their widespread adoption. Here, we propose SPIDER (surface protein prediction using deep ensembles from single-cell RNA sequencing), a context-agnostic zero-shot deep ensemble model, which enables large-scale protein abundance prediction and generalizes better to various contexts. Comprehensive benchmarking shows that SPIDER outperforms other state-of-the-art methods. Using the predicted surface abundance of >2,500 proteins from single-cell transcriptomes, we demonstrate the broad applications of SPIDER, including cell type annotation, biomarker/target identification, and cell-cell interaction analysis in hepatocellular carcinoma and colorectal cancer. A record of this paper's transparent peer review process is included in the supplemental information.

细胞表面蛋白是主要的药物靶标和细胞身份标记。CITE-seq(通过测序对转录组和表位进行细胞索引)等技术实现了对单个细胞内表面蛋白丰度和转录物表达的同时量化。已发表的数据被用来训练机器学习模型,以便仅从转录本表达预测表面蛋白丰度。然而,由于预测的蛋白质规模较小,而且这些计算方法在不同环境(如不同组织/疾病状态)下的泛化能力较差,这阻碍了它们的广泛应用。在这里,我们提出了 SPIDER(利用单细胞 RNA 测序的深度集合进行表面蛋白质预测),这是一种与上下文无关的零次深度集合模型,它能进行大规模蛋白质丰度预测,并能更好地泛化到各种上下文中。综合基准测试表明,SPIDER优于其他最先进的方法。通过预测单细胞转录组中超过2500个蛋白质的表面丰度,我们展示了SPIDER的广泛应用,包括肝癌和结直肠癌的细胞类型注释、生物标记物/靶标识别以及细胞-细胞相互作用分析。补充信息中包含了本文透明的同行评审过程记录。
{"title":"Imputing abundance of over 2,500 surface proteins from single-cell transcriptomes with context-agnostic zero-shot deep ensembles.","authors":"Ruoqiao Chen, Jiayu Zhou, Bin Chen","doi":"10.1016/j.cels.2024.08.006","DOIUrl":"10.1016/j.cels.2024.08.006","url":null,"abstract":"<p><p>Cell surface proteins serve as primary drug targets and cell identity markers. Techniques such as CITE-seq (cellular indexing of transcriptomes and epitopes by sequencing) have enabled the simultaneous quantification of surface protein abundance and transcript expression within individual cells. The published data have been utilized to train machine learning models for predicting surface protein abundance solely from transcript expression. However, the small scale of proteins predicted and the poor generalization ability of these computational approaches across diverse contexts (e.g., different tissues/disease states) impede their widespread adoption. Here, we propose SPIDER (surface protein prediction using deep ensembles from single-cell RNA sequencing), a context-agnostic zero-shot deep ensemble model, which enables large-scale protein abundance prediction and generalizes better to various contexts. Comprehensive benchmarking shows that SPIDER outperforms other state-of-the-art methods. Using the predicted surface abundance of >2,500 proteins from single-cell transcriptomes, we demonstrate the broad applications of SPIDER, including cell type annotation, biomarker/target identification, and cell-cell interaction analysis in hepatocellular carcinoma and colorectal cancer. A record of this paper's transparent peer review process is included in the supplemental information.</p>","PeriodicalId":93929,"journal":{"name":"Cell systems","volume":" ","pages":"869-884.e6"},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11423933/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142147120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Environmental modulators of algae-bacteria interactions at scale. 大规模藻类与细菌相互作用的环境调节因素。
Pub Date : 2024-09-18 Epub Date: 2024-09-04 DOI: 10.1016/j.cels.2024.08.002
Chandana Gopalakrishnappa, Zeqian Li, Seppe Kuehn

Interactions between photosynthetic and heterotrophic microbes play a key role in global primary production. Understanding phototroph-heterotroph interactions remains challenging because these microbes reside in chemically complex environments. Here, we leverage a massively parallel droplet microfluidic platform that enables us to interrogate interactions between photosynthetic algae and heterotrophic bacteria in >100,000 communities across ∼525 environmental conditions with varying pH, carbon availability, and phosphorus availability. By developing a statistical framework to dissect interactions in this complex dataset, we reveal that the dependence of algae-bacteria interactions on nutrient availability is strongly modulated by pH and buffering capacity. Furthermore, we show that the chemical identity of the available organic carbon source controls how pH, buffering capacity, and nutrient availability modulate algae-bacteria interactions. Our study reveals the previously underappreciated role of pH in modulating phototroph-heterotroph interactions and provides a framework for thinking about interactions between phototrophs and heterotrophs in more natural contexts.

光合微生物和异养微生物之间的相互作用在全球初级生产中发挥着关键作用。由于光合微生物生活在化学性质复杂的环境中,因此了解光合微生物与异养微生物之间的相互作用仍然具有挑战性。在这里,我们利用大规模并行液滴微流控平台,在 pH 值、碳供应量和磷供应量不同的 525 种环境条件下,在 >100,000 个群落中分析光合藻类和异养细菌之间的相互作用。通过建立一个统计框架来剖析这一复杂数据集中的相互作用,我们发现藻类与细菌之间的相互作用对养分供应的依赖性受到 pH 值和缓冲能力的强烈调节。此外,我们还表明,可用有机碳源的化学特性控制着 pH 值、缓冲能力和营养物质的可用性如何调节藻类与细菌的相互作用。我们的研究揭示了酸碱度在调节光营养体与异养生物相互作用方面以前未被充分认识的作用,并为思考光营养体与异养生物在更多自然环境中的相互作用提供了一个框架。
{"title":"Environmental modulators of algae-bacteria interactions at scale.","authors":"Chandana Gopalakrishnappa, Zeqian Li, Seppe Kuehn","doi":"10.1016/j.cels.2024.08.002","DOIUrl":"10.1016/j.cels.2024.08.002","url":null,"abstract":"<p><p>Interactions between photosynthetic and heterotrophic microbes play a key role in global primary production. Understanding phototroph-heterotroph interactions remains challenging because these microbes reside in chemically complex environments. Here, we leverage a massively parallel droplet microfluidic platform that enables us to interrogate interactions between photosynthetic algae and heterotrophic bacteria in >100,000 communities across ∼525 environmental conditions with varying pH, carbon availability, and phosphorus availability. By developing a statistical framework to dissect interactions in this complex dataset, we reveal that the dependence of algae-bacteria interactions on nutrient availability is strongly modulated by pH and buffering capacity. Furthermore, we show that the chemical identity of the available organic carbon source controls how pH, buffering capacity, and nutrient availability modulate algae-bacteria interactions. Our study reveals the previously underappreciated role of pH in modulating phototroph-heterotroph interactions and provides a framework for thinking about interactions between phototrophs and heterotrophs in more natural contexts.</p>","PeriodicalId":93929,"journal":{"name":"Cell systems","volume":" ","pages":"838-853.e13"},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11412779/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142142058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The structure is the message: Preserving experimental context through tensor decomposition. 结构即信息:通过张量分解保留实验背景。
Pub Date : 2024-08-21 DOI: 10.1016/j.cels.2024.07.004
Zhixin Cyrillus Tan, Aaron S Meyer

Recent biological studies have been revolutionized in scale and granularity by multiplex and high-throughput assays. Profiling cell responses across several experimental parameters, such as perturbations, time, and genetic contexts, leads to richer and more generalizable findings. However, these multidimensional datasets necessitate a reevaluation of the conventional methods for their representation and analysis. Traditionally, experimental parameters are merged to flatten the data into a two-dimensional matrix, sacrificing crucial experiment context reflected by the structure. As Marshall McLuhan famously stated, "the medium is the message." In this work, we propose that the experiment structure is the medium in which subsequent analysis is performed, and the optimal choice of data representation must reflect the experiment structure. We review how tensor-structured analyses and decompositions can preserve this information. We contend that tensor methods are poised to become integral to the biomedical data sciences toolkit.

近期的生物学研究在规模和粒度上都发生了革命性的变化,这得益于多重和高通量检测方法。通过对多个实验参数(如扰动、时间和遗传背景)的细胞反应进行剖析,可以获得更丰富、更有普遍意义的发现。然而,这些多维数据集需要重新评估其表示和分析的传统方法。传统的方法是合并实验参数,将数据平铺成二维矩阵,从而牺牲了结构所反映的关键实验背景。正如马歇尔-麦克卢汉的名言:"媒介即信息"。在这项工作中,我们提出实验结构是进行后续分析的媒介,数据表示的最佳选择必须反映实验结构。我们回顾了张量结构分析和分解如何保留这一信息。我们认为,张量方法有望成为生物医学数据科学工具包的组成部分。
{"title":"The structure is the message: Preserving experimental context through tensor decomposition.","authors":"Zhixin Cyrillus Tan, Aaron S Meyer","doi":"10.1016/j.cels.2024.07.004","DOIUrl":"10.1016/j.cels.2024.07.004","url":null,"abstract":"<p><p>Recent biological studies have been revolutionized in scale and granularity by multiplex and high-throughput assays. Profiling cell responses across several experimental parameters, such as perturbations, time, and genetic contexts, leads to richer and more generalizable findings. However, these multidimensional datasets necessitate a reevaluation of the conventional methods for their representation and analysis. Traditionally, experimental parameters are merged to flatten the data into a two-dimensional matrix, sacrificing crucial experiment context reflected by the structure. As Marshall McLuhan famously stated, \"the medium is the message.\" In this work, we propose that the experiment structure is the medium in which subsequent analysis is performed, and the optimal choice of data representation must reflect the experiment structure. We review how tensor-structured analyses and decompositions can preserve this information. We contend that tensor methods are poised to become integral to the biomedical data sciences toolkit.</p>","PeriodicalId":93929,"journal":{"name":"Cell systems","volume":"15 8","pages":"679-693"},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11366223/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142038014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rationally reprogramming single-cell aging trajectories and lifespan through dynamic modulation of environmental inputs. 通过动态调节环境输入,合理重编程单细胞衰老轨迹和寿命。
Pub Date : 2024-08-21 DOI: 10.1016/j.cels.2024.07.008
Matthew Smart, David F Moreno, Murat Acar

How do variations in nutrient levels influence cellular lifespan? A dynamical systems model of a core circuit involved in yeast aging suggests principles underlying lifespan extension observed at static and alternating glucose levels that are reminiscent of intermittent fasting regimens.

营养水平的变化如何影响细胞寿命?一个参与酵母衰老的核心电路的动力学系统模型提出了在静态和交替葡萄糖水平下观察到的寿命延长的基本原理,这让人联想到间歇性禁食疗法。
{"title":"Rationally reprogramming single-cell aging trajectories and lifespan through dynamic modulation of environmental inputs.","authors":"Matthew Smart, David F Moreno, Murat Acar","doi":"10.1016/j.cels.2024.07.008","DOIUrl":"https://doi.org/10.1016/j.cels.2024.07.008","url":null,"abstract":"<p><p>How do variations in nutrient levels influence cellular lifespan? A dynamical systems model of a core circuit involved in yeast aging suggests principles underlying lifespan extension observed at static and alternating glucose levels that are reminiscent of intermittent fasting regimens.</p>","PeriodicalId":93929,"journal":{"name":"Cell systems","volume":"15 8","pages":"676-678"},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142038013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep-learning-based design of synthetic orthologs of SH3 signaling domains. 基于深度学习的 SH3 信号结构域合成同源物设计。
Pub Date : 2024-08-21 Epub Date: 2024-08-05 DOI: 10.1016/j.cels.2024.07.005
Xinran Lian, Nikša Praljak, Subu K Subramanian, Sarah Wasinger, Rama Ranganathan, Andrew L Ferguson

Evolution-based deep generative models represent an exciting direction in understanding and designing proteins. An open question is whether such models can learn specialized functional constraints that control fitness in specific biological contexts. Here, we examine the ability of generative models to produce synthetic versions of Src-homology 3 (SH3) domains that mediate signaling in the Sho1 osmotic stress response pathway of yeast. We show that a variational autoencoder (VAE) model produces artificial sequences that experimentally recapitulate the function of natural SH3 domains. More generally, the model organizes all fungal SH3 domains such that locality in the model latent space (but not simply locality in sequence space) enriches the design of synthetic orthologs and exposes non-obvious amino acid constraints distributed near and far from the SH3 ligand-binding site. The ability of generative models to design ortholog-like functions in vivo opens new avenues for engineering protein function in specific cellular contexts and environments.

基于进化的深度生成模型是理解和设计蛋白质的一个令人兴奋的方向。一个悬而未决的问题是,这类模型能否学习到在特定生物环境中控制适应性的专门功能约束。在这里,我们研究了生成模型生成Src-homology 3(SH3)结构域合成版本的能力,SH3结构域在酵母的Sho1渗透应激反应途径中介导信号传导。我们的研究表明,变异自动编码器(VAE)模型产生的人工序列在实验上再现了天然 SH3 结构域的功能。更广泛地说,该模型对所有真菌 SH3 结构域进行了组织,从而使模型潜在空间中的定位性(而不仅仅是序列空间中的定位性)丰富了合成直向同源物的设计,并暴露了分布在 SH3 配体结合位点附近和远处的非显而易见的氨基酸限制。生成模型设计体内同源物功能的能力为在特定细胞环境中设计蛋白质功能开辟了新途径。
{"title":"Deep-learning-based design of synthetic orthologs of SH3 signaling domains.","authors":"Xinran Lian, Nikša Praljak, Subu K Subramanian, Sarah Wasinger, Rama Ranganathan, Andrew L Ferguson","doi":"10.1016/j.cels.2024.07.005","DOIUrl":"10.1016/j.cels.2024.07.005","url":null,"abstract":"<p><p>Evolution-based deep generative models represent an exciting direction in understanding and designing proteins. An open question is whether such models can learn specialized functional constraints that control fitness in specific biological contexts. Here, we examine the ability of generative models to produce synthetic versions of Src-homology 3 (SH3) domains that mediate signaling in the Sho1 osmotic stress response pathway of yeast. We show that a variational autoencoder (VAE) model produces artificial sequences that experimentally recapitulate the function of natural SH3 domains. More generally, the model organizes all fungal SH3 domains such that locality in the model latent space (but not simply locality in sequence space) enriches the design of synthetic orthologs and exposes non-obvious amino acid constraints distributed near and far from the SH3 ligand-binding site. The ability of generative models to design ortholog-like functions in vivo opens new avenues for engineering protein function in specific cellular contexts and environments.</p>","PeriodicalId":93929,"journal":{"name":"Cell systems","volume":" ","pages":"725-737.e7"},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141899168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PanIN and CAF transitions in pancreatic carcinogenesis revealed with spatial data integration. 通过空间数据整合揭示胰腺癌发生过程中的 PanIN 和 CAF 转变。
Pub Date : 2024-08-21 Epub Date: 2024-08-07 DOI: 10.1016/j.cels.2024.07.001
Alexander T F Bell, Jacob T Mitchell, Ashley L Kiemen, Melissa Lyman, Kohei Fujikura, Jae W Lee, Erin Coyne, Sarah M Shin, Sushma Nagaraj, Atul Deshpande, Pei-Hsun Wu, Dimitrios N Sidiropoulos, Rossin Erbe, Jacob Stern, Rena Chan, Stephen Williams, James M Chell, Lauren Ciotti, Jacquelyn W Zimmerman, Denis Wirtz, Won Jin Ho, Neeha Zaidi, Elizabeth Thompson, Elizabeth M Jaffee, Laura D Wood, Elana J Fertig, Luciane T Kagohara

This study introduces a new imaging, spatial transcriptomics (ST), and single-cell RNA-sequencing integration pipeline to characterize neoplastic cell state transitions during tumorigenesis. We applied a semi-supervised analysis pipeline to examine premalignant pancreatic intraepithelial neoplasias (PanINs) that can develop into pancreatic ductal adenocarcinoma (PDAC). Their strict diagnosis on formalin-fixed and paraffin-embedded (FFPE) samples limited the single-cell characterization of human PanINs within their microenvironment. We leverage whole transcriptome FFPE ST to enable the study of a rare cohort of matched low-grade (LG) and high-grade (HG) PanIN lesions to track progression and map cellular phenotypes relative to single-cell PDAC datasets. We demonstrate that cancer-associated fibroblasts (CAFs), including antigen-presenting CAFs, are located close to PanINs. We further observed a transition from CAF-related inflammatory signaling to cellular proliferation during PanIN progression. We validate these findings with single-cell high-dimensional imaging proteomics and transcriptomics technologies. Altogether, our semi-supervised learning framework for spatial multi-omics has broad applicability across cancer types to decipher the spatiotemporal dynamics of carcinogenesis.

本研究介绍了一种新的成像、空间转录组学(ST)和单细胞 RNA 序列整合管道,用于描述肿瘤发生过程中肿瘤细胞状态的转变。我们应用半监督分析管道研究了可发展为胰腺导管腺癌(PDAC)的恶性胰腺上皮内瘤(PanINs)。对福尔马林固定和石蜡包埋(FFPE)样本的严格诊断限制了人类 PanINs 在其微环境中的单细胞表征。我们利用全转录组 FFPE ST 对匹配的低级别(LG)和高级别(HG)PanIN 病变进行了罕见的队列研究,以跟踪进展情况并绘制与单细胞 PDAC 数据集相对应的细胞表型图。我们证明,癌相关成纤维细胞(CAFs),包括抗原递呈CAFs,位于PanINs附近。我们进一步观察到,在 PanIN 进展过程中,与 CAF 相关的炎症信号转导向细胞增殖过渡。我们利用单细胞高维成像蛋白质组学和转录组学技术验证了这些发现。总之,我们的空间多组学半监督学习框架可广泛应用于各种癌症类型,以破译癌变的时空动态。
{"title":"PanIN and CAF transitions in pancreatic carcinogenesis revealed with spatial data integration.","authors":"Alexander T F Bell, Jacob T Mitchell, Ashley L Kiemen, Melissa Lyman, Kohei Fujikura, Jae W Lee, Erin Coyne, Sarah M Shin, Sushma Nagaraj, Atul Deshpande, Pei-Hsun Wu, Dimitrios N Sidiropoulos, Rossin Erbe, Jacob Stern, Rena Chan, Stephen Williams, James M Chell, Lauren Ciotti, Jacquelyn W Zimmerman, Denis Wirtz, Won Jin Ho, Neeha Zaidi, Elizabeth Thompson, Elizabeth M Jaffee, Laura D Wood, Elana J Fertig, Luciane T Kagohara","doi":"10.1016/j.cels.2024.07.001","DOIUrl":"10.1016/j.cels.2024.07.001","url":null,"abstract":"<p><p>This study introduces a new imaging, spatial transcriptomics (ST), and single-cell RNA-sequencing integration pipeline to characterize neoplastic cell state transitions during tumorigenesis. We applied a semi-supervised analysis pipeline to examine premalignant pancreatic intraepithelial neoplasias (PanINs) that can develop into pancreatic ductal adenocarcinoma (PDAC). Their strict diagnosis on formalin-fixed and paraffin-embedded (FFPE) samples limited the single-cell characterization of human PanINs within their microenvironment. We leverage whole transcriptome FFPE ST to enable the study of a rare cohort of matched low-grade (LG) and high-grade (HG) PanIN lesions to track progression and map cellular phenotypes relative to single-cell PDAC datasets. We demonstrate that cancer-associated fibroblasts (CAFs), including antigen-presenting CAFs, are located close to PanINs. We further observed a transition from CAF-related inflammatory signaling to cellular proliferation during PanIN progression. We validate these findings with single-cell high-dimensional imaging proteomics and transcriptomics technologies. Altogether, our semi-supervised learning framework for spatial multi-omics has broad applicability across cancer types to decipher the spatiotemporal dynamics of carcinogenesis.</p>","PeriodicalId":93929,"journal":{"name":"Cell systems","volume":" ","pages":"753-769.e5"},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11409191/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141908694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Cell systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1