首页 > 最新文献

bioRxiv - Systems Biology最新文献

英文 中文
Assessing the Relative Contributions of Mosaic and Regulatory Developmental Modes from Single-Cell Trajectories 从单细胞轨迹评估镶嵌式和调控式发育模式的相对贡献
Pub Date : 2024-07-25 DOI: 10.1101/2024.07.25.605053
Solene Song, Paul Villoutreix
Development is a highly complex process consisting of coordinated cell proliferation, cell differentiation and spatial organization. Classically, two ways to specify cell types during development are hypothesized : mosaic and regulative modes. In mosaic development, a particular cell isolated from the rest of the embryo will nevertheless give rise to cells with a fate identical to the ones expected in normal development, thus relying on lineage-inherited factors. In regulative development, the fate of a cell depends on its interactions with its environment, and thus relies on space-dependant factors. Both modes often coexist in the development of a given animal. We propose a method to quantify their respective contributions. C. elegans development provides a unique opportunity to elaborate such a measure. Indeed, its invariant lineage development allows to combine spatial positions, lineage relationships and protein expression data. Using the single cell protein expression profile as a readout of the cell state, we relate the contributions of the mosaic and the regulative modes to the following measurable quantities. The contribution of the mosaic mode, or lineage-inherited contribution is quantified by the strength of the relationship between the cell-cell lineage distance and the cell-cell expression distance. Similarly, the contribution of the regulative mode, or context-dependent contribution is quantified by the strength of the relationship between the cell-cell context distance and the cell-cell expression distance. The cell-cell context distance measures the similarity between the spatial neighborhoods of two cells based on the gene expression profiles of their neighbours. We assess the significance of these contributions by comparing the empirical results obtained on C. elegans data to artificial models generated using simple rules. With these measures, we show the co-existence of mosaic and regulative modes in the development of C. elegans. The relative contribution of these two modes varies across the different tissues and in time. In particular, we see in the skin tissue that during early development, the mosaic mode dominates while at later stages, regulative mode dominates, suggesting a convergence of single cell trajectories. These measures are general and can be applied to other datasets that will be made available with the progress of spatial transcriptomics and lineage-tracing, paving the way for a quantitative, unbiased and perturbation-free study of fundamental concepts in developmental biology.
发育是一个高度复杂的过程,包括协调的细胞增殖、细胞分化和空间组织。根据经典假设,发育过程中有两种指定细胞类型的方式:镶嵌模式和调节模式。在镶嵌式发育过程中,从胚胎其他部分分离出来的特定细胞会产生与正常发育过程中预期的细胞命运相同的细胞,从而依赖于系遗传因子。在调节性发育中,细胞的命运取决于其与环境的相互作用,因此依赖于空间相关因素。在特定动物的发育过程中,这两种模式往往同时存在。我们提出了一种量化它们各自贡献的方法。秀丽隐杆线虫的发育为制定这样的衡量标准提供了一个独特的机会。事实上,其不变的品系发育可以将空间位置、品系关系和蛋白质表达数据结合起来。利用单细胞蛋白质表达谱作为细胞状态的读数,我们将镶嵌模式和调节模式的贡献与以下可测量的量联系起来。镶嵌模式或系遗传模式的贡献是通过细胞-细胞系距离与细胞-细胞表达距离之间关系的强度来量化的。同样,调控模式(或语境依赖性贡献)的贡献是通过细胞-细胞语境距离与细胞-细胞表达距离之间关系的强度来量化的。细胞-细胞上下文距离根据两个细胞相邻区域的基因表达谱来衡量其空间邻域之间的相似性。我们通过比较从 elegans 数据中获得的经验结果和使用简单规则生成的人工模型,评估了这些贡献的重要性。通过这些测量方法,我们发现在秀丽隐杆线虫的发育过程中,镶嵌模式和调节模式并存。这两种模式在不同组织和不同时期的相对贡献各不相同。特别是在皮肤组织中,我们发现在早期发育阶段,镶嵌模式占主导地位,而在后期阶段,调节模式占主导地位,这表明单细胞轨迹趋于一致。这些测量方法是通用的,可应用于其他数据集,随着空间转录组学和系谱追踪技术的进步,这些数据集也将可用,从而为定量、无偏见和无扰动地研究发育生物学的基本概念铺平道路。
{"title":"Assessing the Relative Contributions of Mosaic and Regulatory Developmental Modes from Single-Cell Trajectories","authors":"Solene Song, Paul Villoutreix","doi":"10.1101/2024.07.25.605053","DOIUrl":"https://doi.org/10.1101/2024.07.25.605053","url":null,"abstract":"Development is a highly complex process consisting of coordinated cell proliferation, cell differentiation and spatial organization. Classically, two ways to specify cell types during development are hypothesized : mosaic and regulative modes. In mosaic development, a particular cell isolated from the rest of the embryo will nevertheless give rise to cells with a fate identical to the ones expected in normal development, thus relying on lineage-inherited factors. In regulative development, the fate of a cell depends on its interactions with its environment, and thus relies on space-dependant factors. Both modes often coexist in the development of a given animal. We propose a method to quantify their respective contributions. C. elegans development provides a unique opportunity to elaborate such a measure. Indeed, its invariant lineage development allows to combine spatial positions, lineage relationships and protein expression data. Using the single cell protein expression profile as a readout of the cell state, we relate the contributions of the mosaic and the regulative modes to the following measurable quantities. The contribution of the mosaic mode, or lineage-inherited contribution is quantified by the strength of the relationship between the cell-cell lineage distance and the cell-cell expression distance. Similarly, the contribution of the regulative mode, or context-dependent contribution is quantified by the strength of the relationship between the cell-cell context distance and the cell-cell expression distance. The cell-cell context distance measures the similarity between the spatial neighborhoods of two cells based on the gene expression profiles of their neighbours. We assess the significance of these contributions by comparing the empirical results obtained on C. elegans data to artificial models generated using simple rules. With these measures, we show the co-existence of mosaic and regulative modes in the development of C. elegans. The relative contribution of these two modes varies across the different tissues and in time. In particular, we see in the skin tissue that during early development, the mosaic mode dominates while at later stages, regulative mode dominates, suggesting a convergence of single cell trajectories. These measures are general and can be applied to other datasets that will be made available with the progress of spatial transcriptomics and lineage-tracing, paving the way for a quantitative, unbiased and perturbation-free study of fundamental concepts in developmental biology.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"430 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141781262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AP-1 Mediates Cellular Adaptation and Memory Formation During Therapy Resistance AP-1 在治疗抵抗过程中介导细胞适应和记忆形成
Pub Date : 2024-07-25 DOI: 10.1101/2024.07.25.604999
Jingxin Li, Pavithran T. Ravindran, Aoife O'Farrell, Gianna T. Busch, Ryan H. Boe, Zijian Niu, Sean Woo, Maraget C. Dunagin, Naveen Jain, Yogesh Goyal, Kavitha Sarma, Meenhard Herlyn, Arjun Raj
Cellular responses to environmental stimuli are typically thought to be governed by genetically encoded programs. We demonstrate that melanoma cells can form and maintain cellular memories during the acquisition of therapy resistance that exhibit characteristics of cellular learning and are dependent on the transcription factor AP-1. We show that cells exposed to a low dose of therapy adapt to become resistant to a high dose, demonstrating that resistance was not purely selective. The application of therapy itself results in the encoding of transient gene expression into cellular memory and that this encoding occurs for both transiently induced and probabilistically arising expression. Chromatin accessibility showed concomitant persistence. A two-color AP-1 reporter system showed that these memories are encoded in cis, constituting an example of activating cis epigenetics. Our findings establish the formation and maintenance of cellular memories as a critical aspect of gene regulation during the development of therapy resistance.
细胞对环境刺激的反应通常被认为是由基因编码程序控制的。我们的研究表明,黑色素瘤细胞在获得抗药性的过程中可以形成并维持细胞记忆,这种记忆表现出细胞学习的特征,并且依赖于转录因子 AP-1。我们发现,接受低剂量治疗的细胞会适应高剂量治疗,从而产生抗药性,这表明抗药性并非纯粹是选择性的。应用治疗本身会将瞬时基因表达编码为细胞记忆,而且这种编码同时发生在瞬时诱导表达和概率性表达上。染色质的可及性同时显示出持久性。双色 AP-1 报告系统显示,这些记忆是顺式编码的,是激活顺式表观遗传学的一个实例。我们的研究结果表明,细胞记忆的形成和维持是耐药性形成过程中基因调控的一个重要方面。
{"title":"AP-1 Mediates Cellular Adaptation and Memory Formation During Therapy Resistance","authors":"Jingxin Li, Pavithran T. Ravindran, Aoife O'Farrell, Gianna T. Busch, Ryan H. Boe, Zijian Niu, Sean Woo, Maraget C. Dunagin, Naveen Jain, Yogesh Goyal, Kavitha Sarma, Meenhard Herlyn, Arjun Raj","doi":"10.1101/2024.07.25.604999","DOIUrl":"https://doi.org/10.1101/2024.07.25.604999","url":null,"abstract":"Cellular responses to environmental stimuli are typically thought to be governed by genetically encoded programs. We demonstrate that melanoma cells can form and maintain cellular memories during the acquisition of therapy resistance that exhibit characteristics of cellular learning and are dependent on the transcription factor AP-1. We show that cells exposed to a low dose of therapy adapt to become resistant to a high dose, demonstrating that resistance was not purely selective. The application of therapy itself results in the encoding of transient gene expression into cellular memory and that this encoding occurs for both transiently induced and probabilistically arising expression. Chromatin accessibility showed concomitant persistence. A two-color AP-1 reporter system showed that these memories are encoded in cis, constituting an example of activating cis epigenetics. Our findings establish the formation and maintenance of cellular memories as a critical aspect of gene regulation during the development of therapy resistance.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"70 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141781258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigation of a Novel Mouse Model of Prader-Willi Syndrome with Invalidation of Necdin and Magel2 研究一种新型的普拉德-威利综合征小鼠模型,并对 Necdin 和 Magel2 进行验证
Pub Date : 2024-07-24 DOI: 10.1101/2024.07.24.604909
Pierre-Yves Barelle, Alicia Sicardi, Fabienne Schaller, Julie Buron, Denis Becquet, Felix Omnes, Francoise Watrin, Catarina Santos, Clement Menuet, Anne-Marie Francois-Bellan, Emilie Caron, Jessica Klucznik, Vincent Prevot, Sebastien G Bouret, Francoise Muscatelli
Prader-Willi syndrome (PWS) is a multigenic disorder caused by the loss of seven contiguous paternally expressed genes. Mouse models with inactivation of all PWS genes are lethal. Knockout (KO) mouse models for each candidate gene were generated, but they lack the functional interactions between PWS genes. Here, we revealed an interplay between Necdin and Magel2 PWS genes and generated a novel mouse model (named Madin) with a deletion including both genes. A subset of Madin KO mice showed neonatal lethality. Behaviorally, surviving mutant mice exhibited sensory delays during infancy and alterations in social exploration at adulthood. Madin KO mice had a lower body weight before weaning, persisting after weaning in males only, with reduced fat mass and improved glucose tolerance. Delayed sexual maturation and altered timing of puberty onset were observed in mutant mice. Adult Madin KO mice displayed increased ventilation and a persistent increase in apneas following a hypercapnic challenge. Transcriptomics analyses revealed a dysregulation of key circadian genes and alterations of genes associated with axonal function that were also found in the hypothalamus of patients with PWS. At neuroanatomical levels, we report an impaired maturation of oxytocin neurons and a disrupted development of melanocortin circuits. Together, these data indicate that the Madin KO mouse is a reliable and more genetically relevant model for the study of PWS.
普拉德-威利综合征(Prader-Willi Syndrome,PWS)是一种由父系表达的七个连续基因缺失引起的多基因疾病。所有PWS基因失活的小鼠模型都是致死的。针对每个候选基因的基因敲除(KO)小鼠模型已经产生,但它们缺乏 PWS 基因之间的功能性相互作用。在这里,我们揭示了Necdin和Magel2 PWS基因之间的相互作用,并生成了一个同时缺失这两个基因的新型小鼠模型(命名为Madin)。一部分Madin KO小鼠表现出新生儿致死性。在行为上,存活的突变小鼠在婴儿期表现出感官发育迟缓,成年后社会探索能力发生改变。Madin KO小鼠断奶前体重较轻,断奶后仅雄性小鼠体重持续较轻,脂肪量减少,葡萄糖耐量改善。在突变小鼠中观察到性成熟延迟和青春期开始时间的改变。成年的Madin KO小鼠通气量增加,高碳酸血症挑战后呼吸暂停持续增加。转录组学分析揭示了关键昼夜节律基因的失调以及与轴突功能相关的基因的改变,这些在PWS患者的下丘脑中也有发现。在神经解剖学层面,我们发现催产素神经元的成熟受到影响,黑皮质素回路的发育也受到破坏。这些数据共同表明,Madin KO小鼠是研究PWS的一个可靠且更具遗传相关性的模型。
{"title":"Investigation of a Novel Mouse Model of Prader-Willi Syndrome with Invalidation of Necdin and Magel2","authors":"Pierre-Yves Barelle, Alicia Sicardi, Fabienne Schaller, Julie Buron, Denis Becquet, Felix Omnes, Francoise Watrin, Catarina Santos, Clement Menuet, Anne-Marie Francois-Bellan, Emilie Caron, Jessica Klucznik, Vincent Prevot, Sebastien G Bouret, Francoise Muscatelli","doi":"10.1101/2024.07.24.604909","DOIUrl":"https://doi.org/10.1101/2024.07.24.604909","url":null,"abstract":"Prader-Willi syndrome (PWS) is a multigenic disorder caused by the loss of seven contiguous paternally expressed genes. Mouse models with inactivation of all PWS genes are lethal. Knockout (KO) mouse models for each candidate gene were generated, but they lack the functional interactions between PWS genes. Here, we revealed an interplay between Necdin and Magel2 PWS genes and generated a novel mouse model (named Madin) with a deletion including both genes. A subset of Madin KO mice showed neonatal lethality. Behaviorally, surviving mutant mice exhibited sensory delays during infancy and alterations in social exploration at adulthood. Madin KO mice had a lower body weight before weaning, persisting after weaning in males only, with reduced fat mass and improved glucose tolerance. Delayed sexual maturation and altered timing of puberty onset were observed in mutant mice. Adult Madin KO mice displayed increased ventilation and a persistent increase in apneas following a hypercapnic challenge. Transcriptomics analyses revealed a dysregulation of key circadian genes and alterations of genes associated with axonal function that were also found in the hypothalamus of patients with PWS. At neuroanatomical levels, we report an impaired maturation of oxytocin neurons and a disrupted development of melanocortin circuits. Together, these data indicate that the Madin KO mouse is a reliable and more genetically relevant model for the study of PWS.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"41 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141785890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Resolving artefacts in voltage-clamp experiments with computational modelling: an application to fast sodium current recordings 利用计算建模解决电压钳实验中的伪影:快速钠电流记录的应用
Pub Date : 2024-07-24 DOI: 10.1101/2024.07.23.604780
Chon Lok Lei, Alexander P Clark, Michael Clerx, Siyu Wei, Meye Bloothooft, Teun P de Boer, David J Christini, Trine Krogh-Madsen, Gary R Mirams
Cellular electrophysiology is the foundation of many fields, from basic science in neurology, cardiology, oncology to safety critical applications for drug safety testing, clinical phenotyping, etc. Patch-clamp voltage clamp is the gold standard technique for studying cellular electrophysiology. Yet, the quality of these experiments is not always transparent, which may lead to erroneous conclusions for studies and applications. Here, we have developed a new computational approach that allows us to explain and predict the experimental artefacts in voltage-clamp experiments. The computational model captures the experimental procedure and its inadequacies, including: voltage offset, series resistance, membrane capacitance and (imperfect) amplifier compensations, such as series resistance compensation and supercharging. The computational model was validated through a series of electrical model cell experiments. Using this computational approach, the artefacts in voltage-clamp experiments of cardiac fast sodium current, one of the most challenging currents to voltage clamp, were able to be resolved and explained through coupling the observed current and the simulated membrane voltage, including some typically observed shifts and delays in the recorded currents. We further demonstrated that the typical way of averaging data for current-voltage relationships would lead to biases in the peak current and shifts in the peak voltage, and such biases can be in the same order of magnitude as those differences reported for disease-causing mutations. Therefore, the presented new computational pipeline will provide a new standard of assessing the voltage-clamp experiments and interpreting the experimental data, which may be able to rectify and provide a better understanding of ion channel mutations and other related applications.
细胞电生理学是许多领域的基础,从神经学、心脏病学、肿瘤学等基础科学到药物安全性测试、临床表型等安全关键应用。膜片钳电压钳是研究细胞电生理学的黄金标准技术。然而,这些实验的质量并不总是透明的,这可能会导致研究和应用得出错误的结论。在这里,我们开发了一种新的计算方法,可以解释和预测电压钳实验中的实验伪影。计算模型捕捉了实验过程及其不足之处,包括:电压偏移、串联电阻、膜电容和(不完善的)放大器补偿,如串联电阻补偿和增压。计算模型通过一系列电模型电池实验进行了验证。利用这种计算方法,通过将观察到的电流与模拟膜电压耦合,包括记录电流中一些典型观察到的偏移和延迟,能够解决和解释电压钳制实验中心脏快速钠电流(最具挑战性的电流之一)的伪像问题。我们进一步证明,对电流-电压关系进行数据平均的典型方法会导致峰值电流的偏差和峰值电压的偏移,这种偏差的数量级与报告的致病突变差异的数量级相同。因此,所介绍的新计算管道将为评估电压钳实验和解释实验数据提供新的标准,从而可能纠正和更好地理解离子通道突变及其他相关应用。
{"title":"Resolving artefacts in voltage-clamp experiments with computational modelling: an application to fast sodium current recordings","authors":"Chon Lok Lei, Alexander P Clark, Michael Clerx, Siyu Wei, Meye Bloothooft, Teun P de Boer, David J Christini, Trine Krogh-Madsen, Gary R Mirams","doi":"10.1101/2024.07.23.604780","DOIUrl":"https://doi.org/10.1101/2024.07.23.604780","url":null,"abstract":"Cellular electrophysiology is the foundation of many fields, from basic science in neurology, cardiology, oncology to safety critical applications for drug safety testing, clinical phenotyping, etc. Patch-clamp voltage clamp is the gold standard technique for studying cellular electrophysiology. Yet, the quality of these experiments is not always transparent, which may lead to erroneous conclusions for studies and applications. Here, we have developed a new computational approach that allows us to explain and predict the experimental artefacts in voltage-clamp experiments. The computational model captures the experimental procedure and its inadequacies, including: voltage offset, series resistance, membrane capacitance and (imperfect) amplifier compensations, such as series resistance compensation and supercharging. The computational model was validated through a series of electrical model cell experiments. Using this computational approach, the artefacts in voltage-clamp experiments of cardiac fast sodium current, one of the most challenging currents to voltage clamp, were able to be resolved and explained through coupling the observed current and the simulated membrane voltage, including some typically observed shifts and delays in the recorded currents. We further demonstrated that the typical way of averaging data for current-voltage relationships would lead to biases in the peak current and shifts in the peak voltage, and such biases can be in the same order of magnitude as those differences reported for disease-causing mutations. Therefore, the presented new computational pipeline will provide a new standard of assessing the voltage-clamp experiments and interpreting the experimental data, which may be able to rectify and provide a better understanding of ion channel mutations and other related applications.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141781260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Epigenetic control of metabolic identity across cell types 跨细胞类型代谢特征的表观遗传控制
Pub Date : 2024-07-24 DOI: 10.1101/2024.07.24.604914
Maria Pires Pacheco, Deborah Gerard, Riley J. Mangan, Alec R. Chapman, Dennis Hecker, Manolis Kellis, Marcel H. Schulz, Lasse Sinkkonen, Thomas Sauter
Constraint-based network modelling is a powerful tool for analysing cellular metabolism at genomic scale. Here, we conducted an integrative analysis of metabolic networks reconstructed from RNA-seq data with paired epigenomic data from the EpiATLAS resource of the International Human Epigenome Consortium (IHEC). Applying a state-of-the-art contextualisation algorithm, we reconstructed metabolic networks across 1,555 samples corresponding to 58 tissues and cell types. Analysis of these networks revealed the distribution of metabolic functionalities across human cell types and provides a compendium of human metabolic activity. This integrative approach allowed us to define, across tissues and cell types, i) reactions that fulfil the basic metabolic processes (core metabolism), and ii) cell type-specific functions (unique metabolism), that shape the metabolic identity of a cell or a tissue. Integration with EpiATLAS-derived cell type-specific gene-level chromatin states and enhancer-gene interactions identified enhancers, transcription factors, and key nodes controlling core and unique metabolism. Transport and first reactions of pathways were enriched for high expression, active chromatin state, and Polycomb-mediated repression in cell types where pathways are inactive, suggesting that key nodes are targets of repression. This integrative analysis forms the basis for identifying regulation points that control metabolic identity in human cells.
基于约束的网络建模是在基因组尺度上分析细胞代谢的有力工具。在这里,我们对从RNA-seq数据重建的代谢网络与国际人类表观基因组联盟(IHEC)的EpiATLAS资源中的成对表观基因组数据进行了综合分析。我们应用最先进的情境化算法,重建了1,555个样本的代谢网络,这些样本对应58种组织和细胞类型。对这些网络的分析揭示了新陈代谢功能在人类细胞类型中的分布,并提供了人类新陈代谢活动汇编。这种综合方法使我们能够在不同组织和细胞类型中定义 i) 实现基本代谢过程的反应(核心代谢),以及 ii) 细胞类型特有的功能(独特代谢),从而形成细胞或组织的代谢特征。与 EpiATLAS 导出的细胞类型特异性基因水平染色质状态和增强子-基因相互作用相结合,确定了控制核心代谢和独特代谢的增强子、转录因子和关键节点。在通路不活跃的细胞类型中,通路的运输和第一反应富含高表达、活跃的染色质状态和多聚酶介导的抑制,这表明关键节点是抑制的目标。这一综合分析为确定控制人类细胞代谢特性的调控点奠定了基础。
{"title":"Epigenetic control of metabolic identity across cell types","authors":"Maria Pires Pacheco, Deborah Gerard, Riley J. Mangan, Alec R. Chapman, Dennis Hecker, Manolis Kellis, Marcel H. Schulz, Lasse Sinkkonen, Thomas Sauter","doi":"10.1101/2024.07.24.604914","DOIUrl":"https://doi.org/10.1101/2024.07.24.604914","url":null,"abstract":"Constraint-based network modelling is a powerful tool for analysing cellular metabolism at genomic scale. Here, we conducted an integrative analysis of metabolic networks reconstructed from RNA-seq data with paired epigenomic data from the EpiATLAS resource of the International Human Epigenome Consortium (IHEC). Applying a state-of-the-art contextualisation algorithm, we reconstructed metabolic networks across 1,555 samples corresponding to 58 tissues and cell types. Analysis of these networks revealed the distribution of metabolic functionalities across human cell types and provides a compendium of human metabolic activity. This integrative approach allowed us to define, across tissues and cell types, i) reactions that fulfil the basic metabolic processes (core metabolism), and ii) cell type-specific functions (unique metabolism), that shape the metabolic identity of a cell or a tissue. Integration with EpiATLAS-derived cell type-specific gene-level chromatin states and enhancer-gene interactions identified enhancers, transcription factors, and key nodes controlling core and unique metabolism. Transport and first reactions of pathways were enriched for high expression, active chromatin state, and Polycomb-mediated repression in cell types where pathways are inactive, suggesting that key nodes are targets of repression. This integrative analysis forms the basis for identifying regulation points that control metabolic identity in human cells.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"55 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141786330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Constructing Ensemble Gene Functional Networks Capturing Tissue/condition-specific Co-expression from Unlabled Transcriptomic Data with TEA-GCN 利用 TEA-GCN 构建集合基因功能网络,从未获授权的转录组数据中捕捉组织/条件特异性共表达
Pub Date : 2024-07-23 DOI: 10.1101/2024.07.22.604713
Peng Ken Lim, Ruoxi Wang, Jenet Princy Antony Velankanni, Marek Mutwil
Gene co-expression networks (GCNs) generated from public transcriptomic datasets can elucidate the co-regulatory and co-functional relationships between genes, making GCNs an important tool to predict gene functions. However, current GCN construction methods are sensitive to the quality of the data, and the interpretability of the identified relationships between genes is still difficult. To address this, we present a novel method: Two-Tier Ensemble Aggregation (TEA-) GCN. TEA-GCN utilizes unsupervised partitioning of big transcriptomic datasets and three correlation coefficients to generate ensemble GCNs in a two-step aggregation process. We show that TEA-GCN outperforms in finding correct functional relationships between genes over the current state-of-the-art across three model species, and is able to not only capture condition/tissue-specific gene co-expression but explain them through the use of natural language processing (NLP). In addition, we found TEA-GCN to be especially performant in identifying relationships between transcription factors and their activation targets, making it effective in inferring gene regulatory networks. TEA-GCN is available at https://github.com/pengkenlim/TEA-GCN.
从公共转录组数据集生成的基因共表达网络(GCN)可以阐明基因之间的共调控和共功能关系,使GCN成为预测基因功能的重要工具。然而,目前的 GCN 构建方法对数据质量很敏感,而且所发现的基因间关系的可解释性仍然很困难。为了解决这个问题,我们提出了一种新方法:两层集合聚合(TEA-)GCN。TEA-GCN 利用对大型转录组数据集的无监督分区和三个相关系数,通过两步聚合过程生成集合 GCN。我们的研究表明,TEA-GCN 在发现基因间正确的功能关系方面优于目前三种模式物种的最先进技术,而且不仅能捕捉条件/组织特异性基因共表达,还能通过使用自然语言处理(NLP)来解释它们。此外,我们还发现 TEA-GCN 在识别转录因子及其激活靶标之间的关系方面表现尤为突出,这使它在推断基因调控网络方面非常有效。TEA-GCN可在https://github.com/pengkenlim/TEA-GCN。
{"title":"Constructing Ensemble Gene Functional Networks Capturing Tissue/condition-specific Co-expression from Unlabled Transcriptomic Data with TEA-GCN","authors":"Peng Ken Lim, Ruoxi Wang, Jenet Princy Antony Velankanni, Marek Mutwil","doi":"10.1101/2024.07.22.604713","DOIUrl":"https://doi.org/10.1101/2024.07.22.604713","url":null,"abstract":"Gene co-expression networks (GCNs) generated from public transcriptomic datasets can elucidate the co-regulatory and co-functional relationships between genes, making GCNs an important tool to predict gene functions. However, current GCN construction methods are sensitive to the quality of the data, and the interpretability of the identified relationships between genes is still difficult. To address this, we present a novel method: Two-Tier Ensemble Aggregation (TEA-) GCN. TEA-GCN utilizes unsupervised partitioning of big transcriptomic datasets and three correlation coefficients to generate ensemble GCNs in a two-step aggregation process. We show that TEA-GCN outperforms in finding correct functional relationships between genes over the current state-of-the-art across three model species, and is able to not only capture condition/tissue-specific gene co-expression but explain them through the use of natural language processing (NLP). In addition, we found TEA-GCN to be especially performant in identifying relationships between transcription factors and their activation targets, making it effective in inferring gene regulatory networks. TEA-GCN is available at https://github.com/pengkenlim/TEA-GCN.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"21 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141781261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Systemic inflammation is a minor contributor to remnant cholesterol atherogenicity 全身炎症是残余胆固醇致动脉粥样硬化的一个次要因素
Pub Date : 2024-07-23 DOI: 10.1101/2024.07.18.604203
Jordan M Kraaijenhof, Marije Kerkvliet, Nick S. Nurmohamed, Aldo Grefhorst, Jeffrey Kroon, Nicholas J. Wareham, G. Kees Hovingh, Erik SG Stroes, S. Matthijs Boekholdt, Laurens F. Reeskamp
Background:Both plasma levels of remnant cholesterol and low-density lipoprotein cholesterol (LDL-C) levels are independent risk factors for atherosclerotic cardiovascular disease. However, only remnant cholesterol has consistently been associated with systemic inflammation. The extent to which inflammation mediates the effect of remnant cholesterol on major adverse cardiovascular events (MACE) remains unclear.Methods and Results:This study included 16,445 participants without prior atherosclerotic cardiovascular disease from the EPIC-Norfolk cohort, with a mean age of 58.8 +/- 9.1 years, of which 9,357 (56.9%) were women. Every 1 mmol/L higher remnant cholesterol was associated with 29.5% higher hsCRP levels (95% Confidence Interval (CI): 22.1, 37.4, p<0.001), whereas LDL-C was not significantly associated with hsCRP levels in the fully adjusted model. Additionally, each 1 mmol/L higher remnant cholesterol was associated with a hazard ratio (HR) of 1.31 (95% CI: 1.14, 1.50, p<0.001) for MACE, compared to a HR of 1.21 (95% CI: 1.13, 1.31, p<0.001) for LDL-C. Mediation analysis showed that hsCRP mediated 5.9% (95% CI: 1.2, 10.6%, p<0.001) of the effect of remnant cholesterol on MACE, whereas hsCRP did not mediate the effect of LDL-C.Conclusions:Plasma remnant cholesterol levels are independently associated with systemic inflammation and cardiovascular events. Inflammation, as measured with hsCRP, contributed minorly to the association between remnant cholesterol and MACE. This underscores the need to address both remnant cholesterol and systemic inflammation separately in the clinical management of cardiovascular disease.
背景:血浆中的残余胆固醇水平和低密度脂蛋白胆固醇(LDL-C)水平都是动脉粥样硬化性心血管疾病的独立危险因素。然而,只有残余胆固醇一直与全身炎症相关。方法与结果:该研究纳入了 EPIC-Norfolk 队列中 16,445 名既往未患动脉粥样硬化性心血管疾病的参与者,平均年龄为 58.8 +/- 9.1 岁,其中 9,357 人(56.9%)为女性。残余胆固醇每升高 1 毫摩尔/升,hsCRP 水平就会升高 29.5%(95% 置信区间 (CI):22.1, 37.4,p<0.001),而在完全调整模型中,低密度脂蛋白胆固醇与 hsCRP 水平无显著相关性。此外,残余胆固醇每升高 1 毫摩尔/升,MACE 的危险比 (HR) 为 1.31(95% CI:1.14, 1.50,p<0.001),而 LDL-C 的危险比为 1.21(95% CI:1.13, 1.31,p<0.001)。中介分析显示,hsCRP介导了残余胆固醇对MACE影响的5.9%(95% CI:1.2, 10.6%,p<0.001),而hsCRP并不介导低密度脂蛋白胆固醇的影响。用 hsCRP 测定的炎症对残余胆固醇与心血管事件之间的关系影响较小。这突出表明,在心血管疾病的临床治疗中,需要分别处理残余胆固醇和全身炎症。
{"title":"Systemic inflammation is a minor contributor to remnant cholesterol atherogenicity","authors":"Jordan M Kraaijenhof, Marije Kerkvliet, Nick S. Nurmohamed, Aldo Grefhorst, Jeffrey Kroon, Nicholas J. Wareham, G. Kees Hovingh, Erik SG Stroes, S. Matthijs Boekholdt, Laurens F. Reeskamp","doi":"10.1101/2024.07.18.604203","DOIUrl":"https://doi.org/10.1101/2024.07.18.604203","url":null,"abstract":"Background:\u0000Both plasma levels of remnant cholesterol and low-density lipoprotein cholesterol (LDL-C) levels are independent risk factors for atherosclerotic cardiovascular disease. However, only remnant cholesterol has consistently been associated with systemic inflammation. The extent to which inflammation mediates the effect of remnant cholesterol on major adverse cardiovascular events (MACE) remains unclear.\u0000Methods and Results:\u0000This study included 16,445 participants without prior atherosclerotic cardiovascular disease from the EPIC-Norfolk cohort, with a mean age of 58.8 +/- 9.1 years, of which 9,357 (56.9%) were women. Every 1 mmol/L higher remnant cholesterol was associated with 29.5% higher hsCRP levels (95% Confidence Interval (CI): 22.1, 37.4, p&lt;0.001), whereas LDL-C was not significantly associated with hsCRP levels in the fully adjusted model. Additionally, each 1 mmol/L higher remnant cholesterol was associated with a hazard ratio (HR) of 1.31 (95% CI: 1.14, 1.50, p&lt;0.001) for MACE, compared to a HR of 1.21 (95% CI: 1.13, 1.31, p&lt;0.001) for LDL-C. Mediation analysis showed that hsCRP mediated 5.9% (95% CI: 1.2, 10.6%, p&lt;0.001) of the effect of remnant cholesterol on MACE, whereas hsCRP did not mediate the effect of LDL-C.\u0000Conclusions:\u0000Plasma remnant cholesterol levels are independently associated with systemic inflammation and cardiovascular events. Inflammation, as measured with hsCRP, contributed minorly to the association between remnant cholesterol and MACE. This underscores the need to address both remnant cholesterol and systemic inflammation separately in the clinical management of cardiovascular disease.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"70 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141781420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Proteome-wide 4-hydroxy-2-nonenal signature of oxidative stress in the marine invasive tunicate Botryllus schlosseri 全蛋白质组的 4-羟基-2-壬烯醛氧化应激特征在海洋入侵鳞栉水母(Botryllus schlosseri)中的应用
Pub Date : 2024-07-23 DOI: 10.1101/2024.07.19.604351
Dietmar Kültz, Alison M. Gardell, Anthony DeTomaso, Greg Stoney, Baruch Rinkevich, Andy Qarri, Jens Hamar
The colonial ascidian Boytryllus schlosseri is an invasive marine chordate that thrives under conditions of anthropogenic climate change. We show that the B. schlosseri expressed proteome contains unusually high levels of proteins that are adducted with 4-hydroxy-2-nonenal (HNE). HNE represents a prominent posttranslational modification resulting from oxidative stress. Although numerous studies have assessed oxidative stress in marine organisms HNE protein modification has not previously been determined in any marine species. LC/MS proteomics was used to identify 1052 HNE adducted proteins in B. schlosseri field and laboratory populations. Adducted amino acid residues were ascertained for 1849 modified sites, of which 1195 had a maximum amino acid localization score. Most HNE modifications were at less reactive lysines (rather than more reactive cysteines). HNE prevelance on most sites was high. These observations suggest that B. schlosseri experiences and tolerates high intracellular reactive oxygen species levels, resulting in substantial lipid peroxidation. HNE adducted B. schlosseri proteins show enrichment in mitochondrial, proteostasis, and cytoskeletal functions. Based on these results we propose that redox signaling contributes to regulating energy metabolism, the blastogenic cycle, oxidative burst defenses, and cytoskeleton dynamics during B. schlosseri development and physiology. A DIA assay library was constructed to quantify HNE adduction at 72 sites across 60 proteins that represent a holistic network of functionally discernable oxidative stress bioindicators. We conclude that the vast amount of HNE protein adduction in this circumpolar tunicate is indicative of high oxidative stress tolerance contributing to its range expansion into diverse environments.
在人为气候变化的条件下,Boytryllus schlosseri 是一种外来入侵的海洋脊索动物。我们发现,B. schlosseri表达的蛋白质组中含有异常高水平的与4-羟基-2-壬烯醛(HNE)加成的蛋白质。HNE 是氧化应激导致的一种突出的翻译后修饰。尽管已有大量研究对海洋生物的氧化应激进行了评估,但此前还未在任何海洋物种中测定过 HNE 蛋白质修饰。利用 LC/MS 蛋白质组学鉴定了 B. schlosseri 野外种群和实验室种群中的 1052 个 HNE 加合物蛋白质。确定了 1849 个修饰位点的加成氨基酸残基,其中 1195 个位点的氨基酸定位得分最高。大多数 HNE 修饰位点是活性较低的赖氨酸(而不是活性较高的半胱氨酸)。大多数位点的 HNE 发生率都很高。这些观察结果表明,B. schlosseri 经历并耐受高水平的细胞内活性氧,从而导致大量脂质过氧化。HNE 加合物在线粒体、蛋白稳态和细胞骨架功能中显示出丰富的作用。基于这些结果,我们认为氧化还原信号在 B. schlosseri 的发育和生理过程中有助于调节能量代谢、胚泡形成周期、氧化猝灭防御和细胞骨架动态。我们构建了一个 DIA 检测库,以量化 60 个蛋白质中 72 个位点的 HNE 加成,这些位点代表了一个功能可辨的氧化应激生物指标整体网络。我们的结论是,这种环极鳞栉水母体内大量的 HNE 蛋白吸附表明它具有很强的氧化应激耐受性,这也是它能在多种环境中扩展活动范围的原因之一。
{"title":"Proteome-wide 4-hydroxy-2-nonenal signature of oxidative stress in the marine invasive tunicate Botryllus schlosseri","authors":"Dietmar Kültz, Alison M. Gardell, Anthony DeTomaso, Greg Stoney, Baruch Rinkevich, Andy Qarri, Jens Hamar","doi":"10.1101/2024.07.19.604351","DOIUrl":"https://doi.org/10.1101/2024.07.19.604351","url":null,"abstract":"The colonial ascidian <em>Boytryllus schlosseri</em> is an invasive marine chordate that thrives under conditions of anthropogenic climate change. We show that the <em>B. schlosseri</em> expressed proteome contains unusually high levels of proteins that are adducted with 4-hydroxy-2-nonenal (HNE). HNE represents a prominent posttranslational modification resulting from oxidative stress. Although numerous studies have assessed oxidative stress in marine organisms HNE protein modification has not previously been determined in any marine species. LC/MS proteomics was used to identify 1052 HNE adducted proteins in <em>B. schlosseri</em> field and laboratory populations. Adducted amino acid residues were ascertained for 1849 modified sites, of which 1195 had a maximum amino acid localization score. Most HNE modifications were at less reactive lysines (rather than more reactive cysteines). HNE prevelance on most sites was high. These observations suggest that <em>B. schlosseri</em> experiences and tolerates high intracellular reactive oxygen species levels, resulting in substantial lipid peroxidation. HNE adducted B. schlosseri proteins show enrichment in mitochondrial, proteostasis, and cytoskeletal functions. Based on these results we propose that redox signaling contributes to regulating energy metabolism, the blastogenic cycle, oxidative burst defenses, and cytoskeleton dynamics during <em>B. schlosseri</em> development and physiology. A DIA assay library was constructed to quantify HNE adduction at 72 sites across 60 proteins that represent a holistic network of functionally discernable oxidative stress bioindicators. We conclude that the vast amount of HNE protein adduction in this circumpolar tunicate is indicative of high oxidative stress tolerance contributing to its range expansion into diverse environments.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141781263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hysteresis and noise floor in gene expression optimised for persistence against lethal events 优化基因表达的滞后性和噪声底限,以持续抵御致死事件
Pub Date : 2024-07-23 DOI: 10.1101/2024.07.19.604229
Pavol Bokes, Abhyudai Singh
Bacterial cell persistence, crucial for survival under adverse conditions like antibiotic exposure, is intrinsically linked to stochastic fluctuations in gene expression. Certain genes, while inhibiting growth under normal circumstances, confer tolerance to antibiotics at elevated expression levels. The occurrence of antibiotic events lead to instantaneous cellular responses with varied survival probabilities correlated with gene expression levels. Notably, cells with lower protein concentrations face higher mortality rates. This study aims to elucidate an optimal strategy for protein expression conducive to cellular survival. Through comprehensive mathematical analysis, we determine the optimal burst size and frequency that maximise cell proliferation. Furthermore, we explore how the optimal expression distribution changes as the cost of protein expression to growth escalates. Our model reveals a hysteresis phenomenon, characterised by discontinuous transitions between deterministic and stochastic optima. Intriguingly, stochastic optima possess a noise floor, representing the minimal level of fluctuations essential for optimal cellular resilience.
细菌细胞的持久性对于在抗生素暴露等不利条件下的生存至关重要,它与基因表达的随机波动有着内在联系。某些基因虽然在正常情况下会抑制生长,但在表达水平升高时会产生对抗生素的耐受性。抗生素事件的发生会导致细胞瞬时反应,其存活概率与基因表达水平相关。值得注意的是,蛋白质浓度较低的细胞死亡率较高。本研究旨在阐明有利于细胞存活的最佳蛋白质表达策略。通过全面的数学分析,我们确定了能使细胞增殖最大化的最佳爆发大小和频率。此外,我们还探讨了最佳表达分布如何随着蛋白质表达对生长的成本增加而发生变化。我们的模型揭示了一种滞后现象,其特点是确定性最优和随机最优之间的不连续转换。耐人寻味的是,随机最佳值具有噪音底限,代表了细胞最佳复原力所必需的最低波动水平。
{"title":"Hysteresis and noise floor in gene expression optimised for persistence against lethal events","authors":"Pavol Bokes, Abhyudai Singh","doi":"10.1101/2024.07.19.604229","DOIUrl":"https://doi.org/10.1101/2024.07.19.604229","url":null,"abstract":"Bacterial cell persistence, crucial for survival under adverse conditions like antibiotic exposure, is intrinsically linked to stochastic fluctuations in gene expression. Certain genes, while inhibiting growth under normal circumstances, confer tolerance to antibiotics at elevated expression levels. The occurrence of antibiotic events lead to instantaneous cellular responses with varied survival probabilities correlated with gene expression levels. Notably, cells with lower protein concentrations face higher mortality rates. This study aims to elucidate an optimal strategy for protein expression conducive to cellular survival. Through comprehensive mathematical analysis, we determine the optimal burst size and frequency that maximise cell proliferation. Furthermore, we explore how the optimal expression distribution changes as the cost of protein expression to growth escalates. Our model reveals a hysteresis phenomenon, characterised by discontinuous transitions between deterministic and stochastic optima. Intriguingly, stochastic optima possess a noise floor, representing the minimal level of fluctuations essential for optimal cellular resilience.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"35 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141786278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Context-driven interaction retrieval and classification for modeling, curation, and reuse 情境驱动的交互检索和分类,用于建模、策划和再利用
Pub Date : 2024-07-23 DOI: 10.1101/2024.07.21.604448
Haomiao Luo, Casey Hansen, Cheryl A Telmer, Difei Tang, Niloofar Arazkhani, Gaoxiang Zhou, Peter Spirtes, Natasa Miskov-Zivanov
Computational modeling seeks to construct and simulate intracellular signaling networks to understand health and disease. The scientific literature contains descriptions of experimental results that can be interpreted by machines using NLP or LLMs to itemize molecular interactions. This machine readable output can then be used to assess, update or improve existing biological models if there is a tool for comparing the existing model with the information extracted from the papers. Here we describe VIOLIN a tool for classifying machine outputs of molecular interactions with respect to a biological model. VIOLIN classifies interactions as corroborations, contradictions, flagged or extensions with subcategories of each class. This paper analyzes 2 different models, 9 reading sets, 2 NLP and 2 LLM tools to test VIOLIN's capabilities. The results show that VIOLIN successfully classifies interaction types and can be combined with automated filtering to provide a versatile tool for use by the systems biology community.
计算建模旨在构建和模拟细胞内信号网络,以了解健康和疾病。科学文献包含对实验结果的描述,这些描述可由使用 NLP 或 LLM 的机器进行解读,以逐项列出分子间的相互作用。如果有一种工具可以将现有模型与从论文中提取的信息进行比较,那么这种机器可读的输出结果就可以用来评估、更新或改进现有的生物模型。在此,我们将介绍 VIOLIN 这一工具,它可根据生物模型对机器输出的分子相互作用进行分类。VIOLIN 将相互作用分为确证、矛盾、标记或扩展四类,每类又分为若干子类。本文分析了 2 种不同的模型、9 个阅读集、2 种 NLP 和 2 种 LLM 工具,以测试 VIOLIN 的能力。结果表明,VIOLIN 成功地对相互作用类型进行了分类,并可与自动过滤相结合,为系统生物学界提供一种多功能工具。
{"title":"Context-driven interaction retrieval and classification for modeling, curation, and reuse","authors":"Haomiao Luo, Casey Hansen, Cheryl A Telmer, Difei Tang, Niloofar Arazkhani, Gaoxiang Zhou, Peter Spirtes, Natasa Miskov-Zivanov","doi":"10.1101/2024.07.21.604448","DOIUrl":"https://doi.org/10.1101/2024.07.21.604448","url":null,"abstract":"Computational modeling seeks to construct and simulate intracellular signaling networks to understand health and disease. The scientific literature contains descriptions of experimental results that can be interpreted by machines using NLP or LLMs to itemize molecular interactions. This machine readable output can then be used to assess, update or improve existing biological models if there is a tool for comparing the existing model with the information extracted from the papers. Here we describe VIOLIN a tool for classifying machine outputs of molecular interactions with respect to a biological model. VIOLIN classifies interactions as corroborations, contradictions, flagged or extensions with subcategories of each class. This paper analyzes 2 different models, 9 reading sets, 2 NLP and 2 LLM tools to test VIOLIN's capabilities. The results show that VIOLIN successfully classifies interaction types and can be combined with automated filtering to provide a versatile tool for use by the systems biology community.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"45 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141781422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
bioRxiv - Systems Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1