首页 > 最新文献

PLoS Computational Biology最新文献

英文 中文
Structural identifiability of biomolecular controller motifs with and without flow measurements as model output. 具有和不具有作为模型输出的流量测量的生物分子控制器基序的结构可识别性。
IF 4.3 2区 生物学 Pub Date : 2023-08-28 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011398
Eivind S Haus, Tormod Drengstig, Kristian Thorsen

Controller motifs are simple biomolecular reaction networks with negative feedback. They can explain how regulatory function is achieved and are often used as building blocks in mathematical models of biological systems. In this paper we perform an extensive investigation into structural identifiability of controller motifs, specifically the so-called basic and antithetic controller motifs. Structural identifiability analysis is a useful tool in the creation and evaluation of mathematical models: it can be used to ensure that model parameters can be determined uniquely and to examine which measurements are necessary for this purpose. This is especially useful for biological models where parameter estimation can be difficult due to limited availability of measureable outputs. Our aim with this work is to investigate how structural identifiability is affected by controller motif complexity and choice of measurements. To increase the number of potential outputs we propose two methods for including flow measurements and show how this affects structural identifiability in combination with, or in the absence of, concentration measurements. In our investigation, we analyze 128 different controller motif structures using a combination of flow and/or concentration measurements, giving a total of 3648 instances. Among all instances, 34% of the measurement combinations provided structural identifiability. Our main findings for the controller motifs include: i) a single measurement is insufficient for structural identifiability, ii) measurements related to different chemical species are necessary for structural identifiability. Applying these findings result in a reduced subset of 1568 instances, where 80% are structurally identifiable, and more complex/interconnected motifs appear easier to structurally identify. The model structures we have investigated are commonly used in models of biological systems, and our results demonstrate how different model structures and measurement combinations affect structural identifiability of controller motifs.

控制器基序是具有负反馈的简单生物分子反应网络。它们可以解释调节功能是如何实现的,并且经常被用作生物系统数学模型的构建块。在本文中,我们对控制器基元的结构可识别性进行了广泛的研究,特别是所谓的基本和对偶控制器基元。结构可识别性分析是创建和评估数学模型的有用工具:它可以用于确保模型参数可以唯一确定,并检查为此目的需要进行哪些测量。这对于由于可测量输出的可用性有限而难以进行参数估计的生物模型尤其有用。我们这项工作的目的是研究结构可识别性如何受到控制器基序复杂性和测量选择的影响。为了增加潜在输出的数量,我们提出了两种包括流量测量的方法,并展示了这如何影响与浓度测量相结合或在没有浓度测量的情况下的结构可识别性。在我们的研究中,我们使用流量和/或浓度测量的组合分析了128种不同的控制器基序结构,总共给出了3648个实例。在所有实例中,34%的测量组合提供了结构可识别性。我们对控制器基序的主要发现包括:i)单个测量不足以实现结构可识别性,ii)与不同化学物种相关的测量对于结构可识别是必要的。应用这些发现减少了1568个实例的子集,其中80%在结构上是可识别的,并且更复杂/互连的基序似乎更容易在结构上识别。我们研究的模型结构通常用于生物系统的模型,我们的结果证明了不同的模型结构和测量组合如何影响控制器基元的结构可识别性。
{"title":"Structural identifiability of biomolecular controller motifs with and without flow measurements as model output.","authors":"Eivind S Haus,&nbsp;Tormod Drengstig,&nbsp;Kristian Thorsen","doi":"10.1371/journal.pcbi.1011398","DOIUrl":"10.1371/journal.pcbi.1011398","url":null,"abstract":"<p><p>Controller motifs are simple biomolecular reaction networks with negative feedback. They can explain how regulatory function is achieved and are often used as building blocks in mathematical models of biological systems. In this paper we perform an extensive investigation into structural identifiability of controller motifs, specifically the so-called basic and antithetic controller motifs. Structural identifiability analysis is a useful tool in the creation and evaluation of mathematical models: it can be used to ensure that model parameters can be determined uniquely and to examine which measurements are necessary for this purpose. This is especially useful for biological models where parameter estimation can be difficult due to limited availability of measureable outputs. Our aim with this work is to investigate how structural identifiability is affected by controller motif complexity and choice of measurements. To increase the number of potential outputs we propose two methods for including flow measurements and show how this affects structural identifiability in combination with, or in the absence of, concentration measurements. In our investigation, we analyze 128 different controller motif structures using a combination of flow and/or concentration measurements, giving a total of 3648 instances. Among all instances, 34% of the measurement combinations provided structural identifiability. Our main findings for the controller motifs include: i) a single measurement is insufficient for structural identifiability, ii) measurements related to different chemical species are necessary for structural identifiability. Applying these findings result in a reduced subset of 1568 instances, where 80% are structurally identifiable, and more complex/interconnected motifs appear easier to structurally identify. The model structures we have investigated are commonly used in models of biological systems, and our results demonstrate how different model structures and measurement combinations affect structural identifiability of controller motifs.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011398"},"PeriodicalIF":4.3,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491402/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10236547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Switching state-space modeling of neural signal dynamics. 神经信号动力学的切换状态空间建模。
IF 4.3 2区 生物学 Pub Date : 2023-08-28 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011395
Mingjian He, Proloy Das, Gladia Hotan, Patrick L Purdon

Linear parametric state-space models are a ubiquitous tool for analyzing neural time series data, providing a way to characterize the underlying brain dynamics with much greater statistical efficiency than non-parametric data analysis approaches. However, neural time series data are frequently time-varying, exhibiting rapid changes in dynamics, with transient activity that is often the key feature of interest in the data. Stationary methods can be adapted to time-varying scenarios by employing fixed-duration windows under an assumption of quasi-stationarity. But time-varying dynamics can be explicitly modeled by switching state-space models, i.e., by using a pool of state-space models with different dynamics selected by a probabilistic switching process. Unfortunately, exact solutions for state inference and parameter learning with switching state-space models are intractable. Here we revisit a switching state-space model inference approach first proposed by Ghahramani and Hinton. We provide explicit derivations for solving the inference problem iteratively after applying a variational approximation on the joint posterior of the hidden states and the switching process. We introduce a novel initialization procedure using an efficient leave-one-out strategy to compare among candidate models, which significantly improves performance compared to the existing method that relies on deterministic annealing. We then utilize this state inference solution within a generalized expectation-maximization algorithm to estimate model parameters of the switching process and the linear state-space models with dynamics potentially shared among candidate models. We perform extensive simulations under different settings to benchmark performance against existing switching inference methods and further validate the robustness of our switching inference solution outside the generative switching model class. Finally, we demonstrate the utility of our method for sleep spindle detection in real recordings, showing how switching state-space models can be used to detect and extract transient spindles from human sleep electroencephalograms in an unsupervised manner.

线性参数状态空间模型是一种普遍存在的分析神经时间序列数据的工具,它提供了一种比非参数数据分析方法更高的统计效率来表征潜在的大脑动力学的方法。然而,神经时间序列数据经常是时变的,表现出动力学的快速变化,瞬态活动通常是数据中感兴趣的关键特征。在拟平稳性假设下,通过使用固定持续时间窗口,平稳方法可以适应时变场景。但是,时变动力学可以通过切换状态空间模型来明确地建模,即,通过使用由概率切换过程选择的具有不同动力学的状态空间模型池。不幸的是,使用切换状态空间模型进行状态推理和参数学习的精确解是难以解决的。在这里,我们回顾了Ghahramani和Hinton首次提出的切换状态空间模型推理方法。在对隐藏状态和切换过程的联合后验应用变分近似后,我们提供了迭代求解推理问题的显式导数。我们介绍了一种新的初始化过程,该过程使用有效的留一策略在候选模型之间进行比较,与依赖于确定性退火的现有方法相比,该方法显著提高了性能。然后,我们在广义期望最大化算法中利用这种状态推理解决方案来估计切换过程的模型参数,以及候选模型之间可能共享动态的线性状态空间模型。我们在不同的设置下进行了广泛的模拟,以将性能与现有的切换推理方法进行比较,并进一步验证了我们的切换推理解决方案在生成切换模型类之外的稳健性。最后,我们展示了我们的方法在真实记录中用于睡眠纺锤波检测的实用性,展示了如何使用切换状态空间模型以无监督的方式从人类睡眠脑电图中检测和提取瞬态纺锤波。
{"title":"Switching state-space modeling of neural signal dynamics.","authors":"Mingjian He, Proloy Das, Gladia Hotan, Patrick L Purdon","doi":"10.1371/journal.pcbi.1011395","DOIUrl":"10.1371/journal.pcbi.1011395","url":null,"abstract":"<p><p>Linear parametric state-space models are a ubiquitous tool for analyzing neural time series data, providing a way to characterize the underlying brain dynamics with much greater statistical efficiency than non-parametric data analysis approaches. However, neural time series data are frequently time-varying, exhibiting rapid changes in dynamics, with transient activity that is often the key feature of interest in the data. Stationary methods can be adapted to time-varying scenarios by employing fixed-duration windows under an assumption of quasi-stationarity. But time-varying dynamics can be explicitly modeled by switching state-space models, i.e., by using a pool of state-space models with different dynamics selected by a probabilistic switching process. Unfortunately, exact solutions for state inference and parameter learning with switching state-space models are intractable. Here we revisit a switching state-space model inference approach first proposed by Ghahramani and Hinton. We provide explicit derivations for solving the inference problem iteratively after applying a variational approximation on the joint posterior of the hidden states and the switching process. We introduce a novel initialization procedure using an efficient leave-one-out strategy to compare among candidate models, which significantly improves performance compared to the existing method that relies on deterministic annealing. We then utilize this state inference solution within a generalized expectation-maximization algorithm to estimate model parameters of the switching process and the linear state-space models with dynamics potentially shared among candidate models. We perform extensive simulations under different settings to benchmark performance against existing switching inference methods and further validate the robustness of our switching inference solution outside the generative switching model class. Finally, we demonstrate the utility of our method for sleep spindle detection in real recordings, showing how switching state-space models can be used to detect and extract transient spindles from human sleep electroencephalograms in an unsupervised manner.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011395"},"PeriodicalIF":4.3,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491408/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10212762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating the epidemic reproduction number from temporally aggregated incidence data: A statistical modelling approach and software tool. 根据时间聚集的发病率数据估计流行病繁殖数量:一种统计建模方法和软件工具。
IF 4.3 2区 生物学 Pub Date : 2023-08-28 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011439
Rebecca K Nash, Samir Bhatt, Anne Cori, Pierre Nouvellet

The time-varying reproduction number (Rt) is an important measure of epidemic transmissibility that directly informs policy decisions and the optimisation of control measures. EpiEstim is a widely used opensource software tool that uses case incidence and the serial interval (SI, time between symptoms in a case and their infector) to estimate Rt in real-time. The incidence and the SI distribution must be provided at the same temporal resolution, which can limit the applicability of EpiEstim and other similar methods, e.g. for contexts where the time window of incidence reporting is longer than the mean SI. In the EpiEstim R package, we implement an expectation-maximisation algorithm to reconstruct daily incidence from temporally aggregated data, from which Rt can then be estimated. We assess the validity of our method using an extensive simulation study and apply it to COVID-19 and influenza data. For all datasets, the influence of intra-weekly variability in reported data was mitigated by using aggregated weekly data. Rt estimated on weekly sliding windows using incidence reconstructed from weekly data was strongly correlated with estimates from the original daily data. The simulation study revealed that Rt was well estimated in all scenarios and regardless of the temporal aggregation of the data. In the presence of weekend effects, Rt estimates from reconstructed data were more successful at recovering the true value of Rt than those obtained from reported daily data. These results show that this novel method allows Rt to be successfully recovered from aggregated data using a simple approach with very few data requirements. Additionally, by removing administrative noise when daily incidence data are reconstructed, the accuracy of Rt estimates can be improved.

时变繁殖数(Rt)是衡量流行病传播性的重要指标,直接为政策决策和控制措施的优化提供信息。EpiEstim是一种广泛使用的开源软件工具,它使用病例发生率和序列间隔(SI,病例中症状与其感染者之间的时间)来实时估计Rt。发病率和SI分布必须以相同的时间分辨率提供,这可能会限制EpiEstim和其他类似方法的适用性,例如,在发病率报告的时间窗口长于平均SI的情况下。在EpiEstim R包中,我们实现了期望最大化算法,以从时间聚合数据中重建每日发病率,由此可以估计Rt。我们使用广泛的模拟研究来评估我们的方法的有效性,并将其应用于新冠肺炎和流感数据。对于所有数据集,通过使用汇总的每周数据减轻了报告数据中周内变异性的影响。使用从每周数据重建的发病率在每周滑动窗口上估计的Rt与原始每日数据的估计值强相关。模拟研究表明,无论数据的时间聚集如何,在所有情况下都能很好地估计Rt。在存在周末效应的情况下,从重建数据中估计的Rt比从报告的每日数据中获得的Rt更能成功地恢复Rt的真实值。这些结果表明,这种新方法允许使用一种简单的方法从聚合数据中成功地恢复Rt,而数据需求很少。此外,通过在重建每日发病率数据时消除管理噪声,可以提高Rt估计的准确性。
{"title":"Estimating the epidemic reproduction number from temporally aggregated incidence data: A statistical modelling approach and software tool.","authors":"Rebecca K Nash, Samir Bhatt, Anne Cori, Pierre Nouvellet","doi":"10.1371/journal.pcbi.1011439","DOIUrl":"10.1371/journal.pcbi.1011439","url":null,"abstract":"<p><p>The time-varying reproduction number (Rt) is an important measure of epidemic transmissibility that directly informs policy decisions and the optimisation of control measures. EpiEstim is a widely used opensource software tool that uses case incidence and the serial interval (SI, time between symptoms in a case and their infector) to estimate Rt in real-time. The incidence and the SI distribution must be provided at the same temporal resolution, which can limit the applicability of EpiEstim and other similar methods, e.g. for contexts where the time window of incidence reporting is longer than the mean SI. In the EpiEstim R package, we implement an expectation-maximisation algorithm to reconstruct daily incidence from temporally aggregated data, from which Rt can then be estimated. We assess the validity of our method using an extensive simulation study and apply it to COVID-19 and influenza data. For all datasets, the influence of intra-weekly variability in reported data was mitigated by using aggregated weekly data. Rt estimated on weekly sliding windows using incidence reconstructed from weekly data was strongly correlated with estimates from the original daily data. The simulation study revealed that Rt was well estimated in all scenarios and regardless of the temporal aggregation of the data. In the presence of weekend effects, Rt estimates from reconstructed data were more successful at recovering the true value of Rt than those obtained from reported daily data. These results show that this novel method allows Rt to be successfully recovered from aggregated data using a simple approach with very few data requirements. Additionally, by removing administrative noise when daily incidence data are reconstructed, the accuracy of Rt estimates can be improved.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011439"},"PeriodicalIF":4.3,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491397/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10207476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Curated single cell multimodal landmark datasets for R/Bioconductor. 为R/Bioconductor绘制的单细胞多峰标志性数据集。
IF 4.3 2区 生物学 Pub Date : 2023-08-25 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011324
Kelly B Eckenrode, Dario Righelli, Marcel Ramos, Ricard Argelaguet, Christophe Vanderaa, Ludwig Geistlinger, Aedin C Culhane, Laurent Gatto, Vincent Carey, Martin Morgan, Davide Risso, Levi Waldron

Background: The majority of high-throughput single-cell molecular profiling methods quantify RNA expression; however, recent multimodal profiling methods add simultaneous measurement of genomic, proteomic, epigenetic, and/or spatial information on the same cells. The development of new statistical and computational methods in Bioconductor for such data will be facilitated by easy availability of landmark datasets using standard data classes.

Results: We collected, processed, and packaged publicly available landmark datasets from important single-cell multimodal protocols, including CITE-Seq, ECCITE-Seq, SCoPE2, scNMT, 10X Multiome, seqFISH, and G&T. We integrate data modalities via the MultiAssayExperiment Bioconductor class, document and re-distribute datasets as the SingleCellMultiModal package in Bioconductor's Cloud-based ExperimentHub. The result is single-command actualization of landmark datasets from seven single-cell multimodal data generation technologies, without need for further data processing or wrangling in order to analyze and develop methods within Bioconductor's ecosystem of hundreds of packages for single-cell and multimodal data.

Conclusions: We provide two examples of integrative analyses that are greatly simplified by SingleCellMultiModal. The package will facilitate development of bioinformatic and statistical methods in Bioconductor to meet the challenges of integrating molecular layers and analyzing phenotypic outputs including cell differentiation, activity, and disease.

背景:大多数高通量单细胞分子图谱方法都量化RNA的表达;然而,最近的多模式分析方法增加了对相同细胞的基因组、蛋白质组、表观遗传学和/或空间信息的同时测量。使用标准数据类的里程碑式数据集的易用性将有助于在Bioconductor中开发此类数据的新统计和计算方法。结果:我们从重要的单细胞多模式协议中收集、处理和打包了公开可用的里程碑式数据集,包括CITE-Seq、ECCITE-Seq、SCoPE2、scNMT、10X Multiome、seqFISH和G&T。我们通过MultiAssayExperiment-Bioconductor类集成数据模式,在Bioconductor基于云的ExperimentHub中以SingleCellMultiModal包的形式记录和重新分发数据集。其结果是通过七种单细胞多模式数据生成技术实现了具有里程碑意义的数据集,无需进一步的数据处理或争论,即可在Bioconductor的数百个单细胞和多模式数据包的生态系统中分析和开发方法。结论:我们提供了两个整合分析的例子,这些例子被SingleCellMultiModal大大简化了。该软件包将促进Bioconductor中生物信息学和统计方法的开发,以应对整合分子层和分析表型输出(包括细胞分化、活性和疾病)的挑战。
{"title":"Curated single cell multimodal landmark datasets for R/Bioconductor.","authors":"Kelly B Eckenrode, Dario Righelli, Marcel Ramos, Ricard Argelaguet, Christophe Vanderaa, Ludwig Geistlinger, Aedin C Culhane, Laurent Gatto, Vincent Carey, Martin Morgan, Davide Risso, Levi Waldron","doi":"10.1371/journal.pcbi.1011324","DOIUrl":"10.1371/journal.pcbi.1011324","url":null,"abstract":"<p><strong>Background: </strong>The majority of high-throughput single-cell molecular profiling methods quantify RNA expression; however, recent multimodal profiling methods add simultaneous measurement of genomic, proteomic, epigenetic, and/or spatial information on the same cells. The development of new statistical and computational methods in Bioconductor for such data will be facilitated by easy availability of landmark datasets using standard data classes.</p><p><strong>Results: </strong>We collected, processed, and packaged publicly available landmark datasets from important single-cell multimodal protocols, including CITE-Seq, ECCITE-Seq, SCoPE2, scNMT, 10X Multiome, seqFISH, and G&T. We integrate data modalities via the MultiAssayExperiment Bioconductor class, document and re-distribute datasets as the SingleCellMultiModal package in Bioconductor's Cloud-based ExperimentHub. The result is single-command actualization of landmark datasets from seven single-cell multimodal data generation technologies, without need for further data processing or wrangling in order to analyze and develop methods within Bioconductor's ecosystem of hundreds of packages for single-cell and multimodal data.</p><p><strong>Conclusions: </strong>We provide two examples of integrative analyses that are greatly simplified by SingleCellMultiModal. The package will facilitate development of bioinformatic and statistical methods in Bioconductor to meet the challenges of integrating molecular layers and analyzing phenotypic outputs including cell differentiation, activity, and disease.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011324"},"PeriodicalIF":4.3,"publicationDate":"2023-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10497156/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10262459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Disentangling Abstraction from Statistical Pattern Matching in Human and Machine Learning. 从人机学习中的统计模式匹配中分离抽象。
IF 4.3 2区 生物学 Pub Date : 2023-08-25 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011316
Sreejan Kumar, Ishita Dasgupta, Nathaniel D Daw, Jonathan D Cohen, Thomas L Griffiths

The ability to acquire abstract knowledge is a hallmark of human intelligence and is believed by many to be one of the core differences between humans and neural network models. Agents can be endowed with an inductive bias towards abstraction through meta-learning, where they are trained on a distribution of tasks that share some abstract structure that can be learned and applied. However, because neural networks are hard to interpret, it can be difficult to tell whether agents have learned the underlying abstraction, or alternatively statistical patterns that are characteristic of that abstraction. In this work, we compare the performance of humans and agents in a meta-reinforcement learning paradigm in which tasks are generated from abstract rules. We define a novel methodology for building "task metamers" that closely match the statistics of the abstract tasks but use a different underlying generative process, and evaluate performance on both abstract and metamer tasks. We find that humans perform better at abstract tasks than metamer tasks whereas common neural network architectures typically perform worse on the abstract tasks than the matched metamers. This work provides a foundation for characterizing differences between humans and machine learning that can be used in future work towards developing machines with more human-like behavior.

获得抽象知识的能力是人类智力的标志,许多人认为这是人类与神经网络模型之间的核心区别之一。通过元学习,可以赋予代理对抽象的归纳偏见,在元学习中,他们被训练在共享一些可以学习和应用的抽象结构的任务分布上。然而,由于神经网络很难解释,因此很难判断代理是否已经学习了底层抽象,或者学习了该抽象的统计模式。在这项工作中,我们比较了人类和智能体在元强化学习范式中的表现,在该范式中,任务是从抽象规则生成的。我们定义了一种新的方法来构建“任务元模型”,该方法与抽象任务的统计数据密切匹配,但使用不同的底层生成过程,并评估抽象任务和元模型任务的性能。我们发现,人类在抽象任务上的表现比元模型任务好,而常见的神经网络架构在抽象任务中的表现通常比匹配的元模型差。这项工作为表征人类和机器学习之间的差异提供了基础,可用于未来开发具有更类似人类行为的机器的工作。
{"title":"Disentangling Abstraction from Statistical Pattern Matching in Human and Machine Learning.","authors":"Sreejan Kumar,&nbsp;Ishita Dasgupta,&nbsp;Nathaniel D Daw,&nbsp;Jonathan D Cohen,&nbsp;Thomas L Griffiths","doi":"10.1371/journal.pcbi.1011316","DOIUrl":"10.1371/journal.pcbi.1011316","url":null,"abstract":"<p><p>The ability to acquire abstract knowledge is a hallmark of human intelligence and is believed by many to be one of the core differences between humans and neural network models. Agents can be endowed with an inductive bias towards abstraction through meta-learning, where they are trained on a distribution of tasks that share some abstract structure that can be learned and applied. However, because neural networks are hard to interpret, it can be difficult to tell whether agents have learned the underlying abstraction, or alternatively statistical patterns that are characteristic of that abstraction. In this work, we compare the performance of humans and agents in a meta-reinforcement learning paradigm in which tasks are generated from abstract rules. We define a novel methodology for building \"task metamers\" that closely match the statistics of the abstract tasks but use a different underlying generative process, and evaluate performance on both abstract and metamer tasks. We find that humans perform better at abstract tasks than metamer tasks whereas common neural network architectures typically perform worse on the abstract tasks than the matched metamers. This work provides a foundation for characterizing differences between humans and machine learning that can be used in future work towards developing machines with more human-like behavior.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011316"},"PeriodicalIF":4.3,"publicationDate":"2023-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10497163/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10586605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rapid memory encoding in a recurrent network model with behavioral time scale synaptic plasticity. 具有行为时间尺度突触可塑性的递归网络模型中的快速记忆编码。
IF 4.3 2区 生物学 Pub Date : 2023-08-25 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011139
Pan Ye Li, Alex Roxin

Episodic memories are formed after a single exposure to novel stimuli. The plasticity mechanisms underlying such fast learning still remain largely unknown. Recently, it was shown that cells in area CA1 of the hippocampus of mice could form or shift their place fields after a single traversal of a virtual linear track. In-vivo intracellular recordings in CA1 cells revealed that previously silent inputs from CA3 could be switched on when they occurred within a few seconds of a dendritic plateau potential (PP) in the post-synaptic cell, a phenomenon dubbed Behavioral Time-scale Plasticity (BTSP). A recently developed computational framework for BTSP in which the dynamics of synaptic traces related to the pre-synaptic activity and post-synaptic PP are explicitly modelled, can account for experimental findings. Here we show that this model of plasticity can be further simplified to a 1D map which describes changes to the synaptic weights after a single trial. We use a temporally symmetric version of this map to study the storage of a large number of spatial memories in a recurrent network, such as CA3. Specifically, the simplicity of the map allows us to calculate the correlation of the synaptic weight matrix with any given past environment analytically. We show that the calculated memory trace can be used to predict the emergence and stability of bump attractors in a high dimensional neural network model endowed with BTSP.

情节记忆是在一次接触新的刺激后形成的。这种快速学习的可塑性机制在很大程度上仍然未知。最近,研究表明,小鼠海马CA1区的细胞在单次穿过虚拟线性轨迹后可以形成或移动其位置场。CA1细胞的体内细胞内记录显示,当突触后细胞中的树突平台电位(PP)在几秒内发生时,来自CA3的先前沉默的输入可以被开启,这种现象被称为行为时间尺度可塑性(BTSP)。最近开发的BTSP计算框架可以解释实验结果,其中与突触前活动和突触后PP相关的突触轨迹的动力学被明确建模。在这里,我们表明,这种可塑性模型可以进一步简化为1D图,该图描述了单次试验后突触重量的变化。我们使用该映射的时间对称版本来研究循环网络(如CA3)中大量空间存储器的存储。具体来说,该图的简单性使我们能够分析计算突触权重矩阵与任何给定的过去环境的相关性。我们证明,在具有BTSP的高维神经网络模型中,计算的记忆轨迹可以用来预测凸点吸引子的出现和稳定性。
{"title":"Rapid memory encoding in a recurrent network model with behavioral time scale synaptic plasticity.","authors":"Pan Ye Li,&nbsp;Alex Roxin","doi":"10.1371/journal.pcbi.1011139","DOIUrl":"10.1371/journal.pcbi.1011139","url":null,"abstract":"<p><p>Episodic memories are formed after a single exposure to novel stimuli. The plasticity mechanisms underlying such fast learning still remain largely unknown. Recently, it was shown that cells in area CA1 of the hippocampus of mice could form or shift their place fields after a single traversal of a virtual linear track. In-vivo intracellular recordings in CA1 cells revealed that previously silent inputs from CA3 could be switched on when they occurred within a few seconds of a dendritic plateau potential (PP) in the post-synaptic cell, a phenomenon dubbed Behavioral Time-scale Plasticity (BTSP). A recently developed computational framework for BTSP in which the dynamics of synaptic traces related to the pre-synaptic activity and post-synaptic PP are explicitly modelled, can account for experimental findings. Here we show that this model of plasticity can be further simplified to a 1D map which describes changes to the synaptic weights after a single trial. We use a temporally symmetric version of this map to study the storage of a large number of spatial memories in a recurrent network, such as CA3. Specifically, the simplicity of the map allows us to calculate the correlation of the synaptic weight matrix with any given past environment analytically. We show that the calculated memory trace can be used to predict the emergence and stability of bump attractors in a high dimensional neural network model endowed with BTSP.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011139"},"PeriodicalIF":4.3,"publicationDate":"2023-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10484462/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10538163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TFOFinder: Python program for identifying purine-only double-stranded stretches in the predicted secondary structure(s) of RNA targets. TFOFinder:Python程序,用于识别预测的RNA靶标二级结构中仅嘌呤的双链延伸。
IF 4.3 2区 生物学 Pub Date : 2023-08-25 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011418
Atara Neugroschl, Irina E Catrina

Nucleic acid probes are valuable tools in biology and chemistry and are indispensable for PCR amplification of DNA, RNA quantification and visualization, and downregulation of gene expression. Recently, triplex-forming oligonucleotides (TFO) have received increased attention due to their improved selectivity and sensitivity in recognizing purine-rich double-stranded RNA regions at physiological pH by incorporating backbone and base modifications. For example, triplex-forming peptide nucleic acid (PNA) oligomers have been used for imaging a structured RNA in cells and inhibiting influenza A replication. Although a handful of programs are available to identify triplex target sites (TTS) in DNA, none are available that find such regions in structured RNAs. Here, we describe TFOFinder, a Python program that facilitates the identification of intramolecular purine-only RNA duplexes that are amenable to forming parallel triple helices (pyrimidine/purine/pyrimidine) and the design of the corresponding TFO(s). We performed genome- and transcriptome-wide analyses of TTS in Drosophila melanogaster and found that only 0.3% (123) of total unique transcripts (35,642) show the potential of forming 12-purine long triplex forming sites that contain at least one guanine. Using minimization algorithms, we predicted the secondary structure(s) of these transcripts, and using TFOFinder, we found that 97 (79%) of the identified 123 transcripts are predicted to fold to form at least one TTS for parallel triple helix formation. The number of transcripts with potential purine TTS increases when the strict search conditions are relaxed by decreasing the length of the probe or by allowing up to two pyrimidine inversions or 1-nucleotide bulge in the target site. These results are encouraging for the use of modified triplex forming probes for live imaging of endogenous structured RNA targets, such as pre-miRNAs, and inhibition of target-specific translation and viral replication.

核酸探针是生物学和化学中有价值的工具,对于DNA的PCR扩增、RNA的定量和可视化以及基因表达的下调都是必不可少的。最近,三链形成寡核苷酸(TFO)由于其在生理pH下通过结合主链和碱基修饰来识别富含嘌呤的双链RNA区域的选择性和敏感性提高而受到越来越多的关注。例如,三链形成肽核酸(PNA)低聚物已被用于对细胞中的结构化RNA进行成像并抑制甲型流感复制。尽管有少数程序可用于识别DNA中的三重靶位点(TTS),但没有一个程序可用于在结构化RNA中发现此类区域。在这里,我们描述了TFOFinder,这是一个Python程序,有助于识别分子内仅嘌呤的RNA双链体,该双链体可形成平行的三螺旋(嘧啶/嘌呤/嘧啶),并设计相应的TFO。我们对黑腹果蝇的TTS进行了全基因组和转录组分析,发现只有0.3%(123)的总独特转录物(35642)显示出形成12个嘌呤长三链形成位点的潜力,这些位点含有至少一种鸟嘌呤。使用最小化算法,我们预测了这些转录物的二级结构,并使用TFOFinder,我们发现识别的123个转录物中有97个(79%)被预测折叠以形成至少一个TTS,用于平行三螺旋形成。当通过减少探针的长度或通过在靶位点中允许最多两个嘧啶反转或1-核苷酸凸起来放松严格的搜索条件时,具有潜在嘌呤TTS的转录物的数量增加。这些结果对于使用修饰的三链形成探针对内源性结构RNA靶标(如前miRNA)进行实时成像以及抑制靶标特异性翻译和病毒复制是令人鼓舞的。
{"title":"TFOFinder: Python program for identifying purine-only double-stranded stretches in the predicted secondary structure(s) of RNA targets.","authors":"Atara Neugroschl,&nbsp;Irina E Catrina","doi":"10.1371/journal.pcbi.1011418","DOIUrl":"10.1371/journal.pcbi.1011418","url":null,"abstract":"<p><p>Nucleic acid probes are valuable tools in biology and chemistry and are indispensable for PCR amplification of DNA, RNA quantification and visualization, and downregulation of gene expression. Recently, triplex-forming oligonucleotides (TFO) have received increased attention due to their improved selectivity and sensitivity in recognizing purine-rich double-stranded RNA regions at physiological pH by incorporating backbone and base modifications. For example, triplex-forming peptide nucleic acid (PNA) oligomers have been used for imaging a structured RNA in cells and inhibiting influenza A replication. Although a handful of programs are available to identify triplex target sites (TTS) in DNA, none are available that find such regions in structured RNAs. Here, we describe TFOFinder, a Python program that facilitates the identification of intramolecular purine-only RNA duplexes that are amenable to forming parallel triple helices (pyrimidine/purine/pyrimidine) and the design of the corresponding TFO(s). We performed genome- and transcriptome-wide analyses of TTS in Drosophila melanogaster and found that only 0.3% (123) of total unique transcripts (35,642) show the potential of forming 12-purine long triplex forming sites that contain at least one guanine. Using minimization algorithms, we predicted the secondary structure(s) of these transcripts, and using TFOFinder, we found that 97 (79%) of the identified 123 transcripts are predicted to fold to form at least one TTS for parallel triple helix formation. The number of transcripts with potential purine TTS increases when the strict search conditions are relaxed by decreasing the length of the probe or by allowing up to two pyrimidine inversions or 1-nucleotide bulge in the target site. These results are encouraging for the use of modified triplex forming probes for live imaging of endogenous structured RNA targets, such as pre-miRNAs, and inhibition of target-specific translation and viral replication.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011418"},"PeriodicalIF":4.3,"publicationDate":"2023-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10484449/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10241462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flux balance analysis-based metabolic modeling of microbial secondary metabolism: Current status and outlook. 基于通量平衡分析的微生物二次代谢代谢建模:现状与展望。
IF 4.3 2区 生物学 Pub Date : 2023-08-24 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011391
Sizhe Qiu, Aidong Yang, Hong Zeng

In microorganisms, different from primary metabolism for cellular growth, secondary metabolism is for ecological interactions and stress responses and an important source of natural products widely used in various areas such as pharmaceutics and food additives. With advancements of sequencing technologies and bioinformatics tools, a large number of biosynthetic gene clusters of secondary metabolites have been discovered from microbial genomes. However, due to challenges from the difficulty of genome-scale pathway reconstruction and the limitation of conventional flux balance analysis (FBA) on secondary metabolism, the quantitative modeling of secondary metabolism is poorly established, in contrast to that of primary metabolism. This review first discusses current efforts on the reconstruction of secondary metabolic pathways in genome-scale metabolic models (GSMMs), as well as related FBA-based modeling techniques. Additionally, potential extensions of FBA are suggested to improve the prediction accuracy of secondary metabolite production. As this review posits, biosynthetic pathway reconstruction for various secondary metabolites will become automated and a modeling framework capturing secondary metabolism onset will enhance the predictive power. Expectedly, an improved FBA-based modeling workflow will facilitate quantitative study of secondary metabolism and in silico design of engineering strategies for natural product production.

在微生物中,与细胞生长的初级代谢不同,次级代谢用于生态相互作用和应激反应,是广泛应用于制药和食品添加剂等各个领域的天然产物的重要来源。随着测序技术和生物信息学工具的进步,从微生物基因组中发现了大量次生代谢产物的生物合成基因簇。然而,由于基因组规模通路重建的困难和传统通量平衡分析(FBA)对次级代谢的限制,与初级代谢相比,次级代谢的定量建模建立得很差。这篇综述首先讨论了目前在基因组规模代谢模型(GSMMs)中重建次级代谢途径的努力,以及相关的基于FBA的建模技术。此外,建议对FBA进行潜在的扩展,以提高次级代谢产物产生的预测准确性。正如这篇综述所述,各种次级代谢产物的生物合成途径重建将实现自动化,捕捉次级代谢开始的建模框架将增强预测能力。不出所料,一种改进的基于FBA的建模工作流程将有助于二次代谢的定量研究和天然产品生产工程策略的计算机设计。
{"title":"Flux balance analysis-based metabolic modeling of microbial secondary metabolism: Current status and outlook.","authors":"Sizhe Qiu,&nbsp;Aidong Yang,&nbsp;Hong Zeng","doi":"10.1371/journal.pcbi.1011391","DOIUrl":"10.1371/journal.pcbi.1011391","url":null,"abstract":"<p><p>In microorganisms, different from primary metabolism for cellular growth, secondary metabolism is for ecological interactions and stress responses and an important source of natural products widely used in various areas such as pharmaceutics and food additives. With advancements of sequencing technologies and bioinformatics tools, a large number of biosynthetic gene clusters of secondary metabolites have been discovered from microbial genomes. However, due to challenges from the difficulty of genome-scale pathway reconstruction and the limitation of conventional flux balance analysis (FBA) on secondary metabolism, the quantitative modeling of secondary metabolism is poorly established, in contrast to that of primary metabolism. This review first discusses current efforts on the reconstruction of secondary metabolic pathways in genome-scale metabolic models (GSMMs), as well as related FBA-based modeling techniques. Additionally, potential extensions of FBA are suggested to improve the prediction accuracy of secondary metabolite production. As this review posits, biosynthetic pathway reconstruction for various secondary metabolites will become automated and a modeling framework capturing secondary metabolism onset will enhance the predictive power. Expectedly, an improved FBA-based modeling workflow will facilitate quantitative study of secondary metabolism and in silico design of engineering strategies for natural product production.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011391"},"PeriodicalIF":4.3,"publicationDate":"2023-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10449171/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10135371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Mathematical modeling indicates that regulatory inhibition of CD8+ T cell cytotoxicity can limit efficacy of IL-15 immunotherapy in cases of high pre-treatment SIV viral load. 数学模型表明,在高预处理SIV病毒载量的情况下,CD8+T细胞毒性的调节性抑制可以限制IL-15免疫疗法的疗效。
IF 4.3 2区 生物学 Pub Date : 2023-08-24 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011425
Jonathan W Cody, Amy L Ellis-Connell, Shelby L O'Connor, Elsje Pienaar

Immunotherapeutic cytokines can activate immune cells against cancers and chronic infections. N-803 is an IL-15 superagonist that expands CD8+ T cells and increases their cytotoxicity. N-803 also temporarily reduced viral load in a limited subset of non-human primates infected with simian immunodeficiency virus (SIV), a model of HIV. However, viral suppression has not been observed in all SIV cohorts and may depend on pre-treatment viral load and the corresponding effects on CD8+ T cells. Starting from an existing mechanistic mathematical model of N-803 immunotherapy of SIV, we develop a model that includes activation of SIV-specific and non-SIV-specific CD8+ T cells by antigen, inflammation, and N-803. Also included is a regulatory counter-response that inhibits CD8+ T cell proliferation and function, representing the effects of immune checkpoint molecules and immunosuppressive cells. We simultaneously calibrate the model to two separate SIV cohorts. The first cohort had low viral loads prior to treatment (≈3-4 log viral RNA copy equivalents (CEQ)/mL), and N-803 treatment transiently suppressed viral load. The second had higher pre-treatment viral loads (≈5-7 log CEQ/mL) and saw no consistent virus suppression with N-803. The mathematical model can replicate the viral and CD8+ T cell dynamics of both cohorts based on different pre-treatment viral loads and different levels of regulatory inhibition of CD8+ T cells due to those viral loads (i.e. initial conditions of model). Our predictions are validated by additional data from these and other SIV cohorts. While both cohorts had high numbers of activated SIV-specific CD8+ T cells in simulations, viral suppression was precluded in the high viral load cohort due to elevated inhibition of cytotoxicity. Thus, we mathematically demonstrate how the pre-treatment viral load can influence immunotherapeutic efficacy, highlighting the in vivo conditions and combination therapies that could maximize efficacy and improve treatment outcomes.

免疫治疗性细胞因子可以激活免疫细胞对抗癌症和慢性感染。N-803是一种IL-15超级拮抗剂,可扩增CD8+T细胞并增加其细胞毒性。N-803还暂时降低了感染猴免疫缺陷病毒(SIV)的非人类灵长类动物的病毒载量,这是一种HIV模型。然而,尚未在所有SIV队列中观察到病毒抑制,这可能取决于治疗前的病毒载量和对CD8+T细胞的相应影响。从现有的SIV免疫治疗N-803的机制数学模型开始,我们开发了一个模型,该模型包括通过抗原、炎症和N-803激活SIV特异性和非SIV特异性CD8+T细胞。还包括抑制CD8+T细胞增殖和功能的调节性反反应,代表免疫检查点分子和免疫抑制细胞的作用。我们同时将模型校准为两个独立的SIV队列。第一个队列在治疗前病毒载量较低(≈3-4 log病毒RNA拷贝当量(CEQ)/mL),N-803治疗暂时抑制了病毒载量。第二种具有更高的预处理病毒载量(≈5-7 log CEQ/mL),并且与N-803没有一致的病毒抑制。该数学模型可以基于不同的预处理病毒载量和由于这些病毒载量对CD8+T细胞的不同调节抑制水平(即模型的初始条件)来复制两个队列的病毒和CD8+T淋巴细胞动力学。我们的预测得到了来自这些和其他SIV队列的额外数据的验证。虽然两个队列在模拟中都有大量活化的SIV特异性CD8+T细胞,但在高病毒载量队列中,由于细胞毒性抑制作用增强,病毒抑制被排除。因此,我们从数学上证明了治疗前病毒载量如何影响免疫治疗效果,强调了体内条件和联合治疗可以最大限度地提高疗效并改善治疗结果。
{"title":"Mathematical modeling indicates that regulatory inhibition of CD8+ T cell cytotoxicity can limit efficacy of IL-15 immunotherapy in cases of high pre-treatment SIV viral load.","authors":"Jonathan W Cody,&nbsp;Amy L Ellis-Connell,&nbsp;Shelby L O'Connor,&nbsp;Elsje Pienaar","doi":"10.1371/journal.pcbi.1011425","DOIUrl":"10.1371/journal.pcbi.1011425","url":null,"abstract":"<p><p>Immunotherapeutic cytokines can activate immune cells against cancers and chronic infections. N-803 is an IL-15 superagonist that expands CD8+ T cells and increases their cytotoxicity. N-803 also temporarily reduced viral load in a limited subset of non-human primates infected with simian immunodeficiency virus (SIV), a model of HIV. However, viral suppression has not been observed in all SIV cohorts and may depend on pre-treatment viral load and the corresponding effects on CD8+ T cells. Starting from an existing mechanistic mathematical model of N-803 immunotherapy of SIV, we develop a model that includes activation of SIV-specific and non-SIV-specific CD8+ T cells by antigen, inflammation, and N-803. Also included is a regulatory counter-response that inhibits CD8+ T cell proliferation and function, representing the effects of immune checkpoint molecules and immunosuppressive cells. We simultaneously calibrate the model to two separate SIV cohorts. The first cohort had low viral loads prior to treatment (≈3-4 log viral RNA copy equivalents (CEQ)/mL), and N-803 treatment transiently suppressed viral load. The second had higher pre-treatment viral loads (≈5-7 log CEQ/mL) and saw no consistent virus suppression with N-803. The mathematical model can replicate the viral and CD8+ T cell dynamics of both cohorts based on different pre-treatment viral loads and different levels of regulatory inhibition of CD8+ T cells due to those viral loads (i.e. initial conditions of model). Our predictions are validated by additional data from these and other SIV cohorts. While both cohorts had high numbers of activated SIV-specific CD8+ T cells in simulations, viral suppression was precluded in the high viral load cohort due to elevated inhibition of cytotoxicity. Thus, we mathematically demonstrate how the pre-treatment viral load can influence immunotherapeutic efficacy, highlighting the in vivo conditions and combination therapies that could maximize efficacy and improve treatment outcomes.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011425"},"PeriodicalIF":4.3,"publicationDate":"2023-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10482305/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10235074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inferring gene regulatory networks using transcriptional profiles as dynamical attractors. 利用转录谱作为动态引诱因子推断基因调控网络。
IF 4.3 2区 生物学 Pub Date : 2023-08-22 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1010991
Ruihao Li, Jordan C Rozum, Morgan M Quail, Mohammad N Qasim, Suzanne S Sindi, Clarissa J Nobile, Réka Albert, Aaron D Hernday

Genetic regulatory networks (GRNs) regulate the flow of genetic information from the genome to expressed messenger RNAs (mRNAs) and thus are critical to controlling the phenotypic characteristics of cells. Numerous methods exist for profiling mRNA transcript levels and identifying protein-DNA binding interactions at the genome-wide scale. These enable researchers to determine the structure and output of transcriptional regulatory networks, but uncovering the complete structure and regulatory logic of GRNs remains a challenge. The field of GRN inference aims to meet this challenge using computational modeling to derive the structure and logic of GRNs from experimental data and to encode this knowledge in Boolean networks, Bayesian networks, ordinary differential equation (ODE) models, or other modeling frameworks. However, most existing models do not incorporate dynamic transcriptional data since it has historically been less widely available in comparison to "static" transcriptional data. We report the development of an evolutionary algorithm-based ODE modeling approach (named EA) that integrates kinetic transcription data and the theory of attractor matching to infer GRN architecture and regulatory logic. Our method outperformed six leading GRN inference methods, none of which incorporate kinetic transcriptional data, in predicting regulatory connections among TFs when applied to a small-scale engineered synthetic GRN in Saccharomyces cerevisiae. Moreover, we demonstrate the potential of our method to predict unknown transcriptional profiles that would be produced upon genetic perturbation of the GRN governing a two-state cellular phenotypic switch in Candida albicans. We established an iterative refinement strategy to facilitate candidate selection for experimentation; the experimental results in turn provide validation or improvement for the model. In this way, our GRN inference approach can expedite the development of a sophisticated mathematical model that can accurately describe the structure and dynamics of the in vivo GRN.

遗传调控网络(GRNs)调节遗传信息从基因组流向表达信使核糖核酸(mRNAs),因此对控制细胞的表型特征至关重要。在全基因组范围内,存在许多用于分析mRNA转录水平和鉴定蛋白质-DNA结合相互作用的方法。这些使研究人员能够确定转录调控网络的结构和输出,但揭示GRN的完整结构和调控逻辑仍然是一个挑战。GRN推理领域旨在通过计算建模来应对这一挑战,从实验数据中推导GRN的结构和逻辑,并将这些知识编码在布尔网络、贝叶斯网络、常微分方程(ODE)模型或其他建模框架中。然而,大多数现有的模型都没有包含动态转录数据,因为与“静态”转录数据相比,动态转录数据在历史上的可用性较低。我们报告了一种基于进化算法的ODE建模方法(称为EA)的开发,该方法集成了动力学转录数据和吸引子匹配理论,以推断GRN结构和调控逻辑。当应用于酿酒酵母中的小规模工程合成GRN时,我们的方法在预测TF之间的调节联系方面优于六种领先的GRN推断方法,其中没有一种方法包含动力学转录数据。此外,我们证明了我们的方法预测未知转录谱的潜力,这些转录谱将在白色念珠菌中控制两态细胞表型转换的GRN的遗传扰动时产生。我们建立了一种迭代细化策略,以便于实验的候选选择;实验结果反过来为模型提供了验证或改进。这样,我们的GRN推理方法可以加快开发一个复杂的数学模型,该模型可以准确描述体内GRN的结构和动力学。
{"title":"Inferring gene regulatory networks using transcriptional profiles as dynamical attractors.","authors":"Ruihao Li,&nbsp;Jordan C Rozum,&nbsp;Morgan M Quail,&nbsp;Mohammad N Qasim,&nbsp;Suzanne S Sindi,&nbsp;Clarissa J Nobile,&nbsp;Réka Albert,&nbsp;Aaron D Hernday","doi":"10.1371/journal.pcbi.1010991","DOIUrl":"10.1371/journal.pcbi.1010991","url":null,"abstract":"<p><p>Genetic regulatory networks (GRNs) regulate the flow of genetic information from the genome to expressed messenger RNAs (mRNAs) and thus are critical to controlling the phenotypic characteristics of cells. Numerous methods exist for profiling mRNA transcript levels and identifying protein-DNA binding interactions at the genome-wide scale. These enable researchers to determine the structure and output of transcriptional regulatory networks, but uncovering the complete structure and regulatory logic of GRNs remains a challenge. The field of GRN inference aims to meet this challenge using computational modeling to derive the structure and logic of GRNs from experimental data and to encode this knowledge in Boolean networks, Bayesian networks, ordinary differential equation (ODE) models, or other modeling frameworks. However, most existing models do not incorporate dynamic transcriptional data since it has historically been less widely available in comparison to \"static\" transcriptional data. We report the development of an evolutionary algorithm-based ODE modeling approach (named EA) that integrates kinetic transcription data and the theory of attractor matching to infer GRN architecture and regulatory logic. Our method outperformed six leading GRN inference methods, none of which incorporate kinetic transcriptional data, in predicting regulatory connections among TFs when applied to a small-scale engineered synthetic GRN in Saccharomyces cerevisiae. Moreover, we demonstrate the potential of our method to predict unknown transcriptional profiles that would be produced upon genetic perturbation of the GRN governing a two-state cellular phenotypic switch in Candida albicans. We established an iterative refinement strategy to facilitate candidate selection for experimentation; the experimental results in turn provide validation or improvement for the model. In this way, our GRN inference approach can expedite the development of a sophisticated mathematical model that can accurately describe the structure and dynamics of the in vivo GRN.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1010991"},"PeriodicalIF":4.3,"publicationDate":"2023-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10473541/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10141142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
PLoS Computational Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1