首页 > 最新文献

PLoS Computational Biology最新文献

英文 中文
When does humoral memory enhance infection? 体液记忆何时会增强感染?
IF 4.3 2区 生物学 Pub Date : 2023-08-21 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011377
Ariel Nikas, Hasan Ahmed, Mia R Moore, Veronika I Zarnitsyna, Rustom Antia

Antibodies and humoral memory are key components of the adaptive immune system. We consider and computationally model mechanisms by which humoral memory present at baseline might increase rather than decrease infection load; we refer to this effect as EI-HM (enhancement of infection by humoral memory). We first consider antibody dependent enhancement (ADE) in which antibody enhances the growth of the pathogen, typically a virus, and typically at intermediate 'Goldilocks' levels of antibody. Our ADE model reproduces ADE in vitro and enhancement of infection in vivo from passive antibody transfer. But notably the simplest implementation of our ADE model never results in EI-HM. Adding complexity, by making the cross-reactive antibody much less neutralizing than the de novo generated antibody or by including a sufficiently strong non-antibody immune response, allows for ADE-mediated EI-HM. We next consider the possibility that cross-reactive memory causes EI-HM by crowding out a possibly superior de novo immune response. We show that, even without ADE, EI-HM can occur when the cross-reactive response is both less potent and 'directly' (i.e. independently of infection load) suppressive with regard to the de novo response. In this case adding a non-antibody immune response to our computational model greatly reduces or completely eliminates EI-HM, which suggests that 'crowding out' is unlikely to cause substantial EI-HM. Hence, our results provide examples in which simple models give qualitatively opposite results compared to models with plausible complexity. Our results may be helpful in interpreting and reconciling disparate experimental findings, especially from dengue, and for vaccination.

抗体和体液记忆是适应性免疫系统的关键组成部分。我们考虑了基线时存在的体液记忆可能增加而不是减少感染负荷的机制,并对其进行了计算建模;我们将这种作用称为EI-HM(通过体液记忆增强感染)。我们首先考虑抗体依赖性增强(ADE),其中抗体增强病原体(通常是病毒)的生长,并且通常处于中等水平的“金发姑娘”抗体。我们的ADE模型在体外复制ADE,并通过被动抗体转移增强体内感染。但值得注意的是,我们的ADE模型的最简单实现从未产生EI-HM。通过使交叉反应性抗体比从头产生的抗体的中和性低得多,或者通过包括足够强的非抗体免疫反应,增加复杂性,允许ADE介导的EI-HM。接下来,我们考虑交叉反应记忆通过挤出可能优越的从头免疫反应而导致EI-HM的可能性。我们表明,即使没有ADE,当交叉反应反应的效力较低且对从头反应具有“直接”(即独立于感染负荷)抑制作用时,也可能发生EI-HM。在这种情况下,在我们的计算模型中添加非抗体免疫反应大大减少或完全消除了EI-HM,这表明“挤出”不太可能导致显著的EI-HM。因此,我们的结果提供了一些例子,在这些例子中,与具有合理复杂性的模型相比,简单模型给出了定性相反的结果。我们的研究结果可能有助于解释和调和不同的实验结果,特别是登革热的实验结果和疫苗接种。
{"title":"When does humoral memory enhance infection?","authors":"Ariel Nikas,&nbsp;Hasan Ahmed,&nbsp;Mia R Moore,&nbsp;Veronika I Zarnitsyna,&nbsp;Rustom Antia","doi":"10.1371/journal.pcbi.1011377","DOIUrl":"10.1371/journal.pcbi.1011377","url":null,"abstract":"<p><p>Antibodies and humoral memory are key components of the adaptive immune system. We consider and computationally model mechanisms by which humoral memory present at baseline might increase rather than decrease infection load; we refer to this effect as EI-HM (enhancement of infection by humoral memory). We first consider antibody dependent enhancement (ADE) in which antibody enhances the growth of the pathogen, typically a virus, and typically at intermediate 'Goldilocks' levels of antibody. Our ADE model reproduces ADE in vitro and enhancement of infection in vivo from passive antibody transfer. But notably the simplest implementation of our ADE model never results in EI-HM. Adding complexity, by making the cross-reactive antibody much less neutralizing than the de novo generated antibody or by including a sufficiently strong non-antibody immune response, allows for ADE-mediated EI-HM. We next consider the possibility that cross-reactive memory causes EI-HM by crowding out a possibly superior de novo immune response. We show that, even without ADE, EI-HM can occur when the cross-reactive response is both less potent and 'directly' (i.e. independently of infection load) suppressive with regard to the de novo response. In this case adding a non-antibody immune response to our computational model greatly reduces or completely eliminates EI-HM, which suggests that 'crowding out' is unlikely to cause substantial EI-HM. Hence, our results provide examples in which simple models give qualitatively opposite results compared to models with plausible complexity. Our results may be helpful in interpreting and reconciling disparate experimental findings, especially from dengue, and for vaccination.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011377"},"PeriodicalIF":4.3,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10470880/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10147285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A subcomponent-guided deep learning method for interpretable cancer drug response prediction. 一种用于可解释癌症药物反应预测的子组件引导的深度学习方法。
IF 4.3 2区 生物学 Pub Date : 2023-08-21 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011382
Xuan Liu, Wen Zhang

Accurate prediction of cancer drug response (CDR) is a longstanding challenge in modern oncology that underpins personalized treatment. Current computational methods implement CDR prediction by modeling responses between entire drugs and cell lines, without the consideration that response outcomes may primarily attribute to a few finer-level 'subcomponents', such as privileged substructures of the drug or gene signatures of the cancer cell, thus producing predictions that are hard to explain. Herein, we present SubCDR, a subcomponent-guided deep learning method for interpretable CDR prediction, to recognize the most relevant subcomponents driving response outcomes. Technically, SubCDR is built upon a line of deep neural networks that enables a set of functional subcomponents to be extracted from each drug and cell line profile, and breaks the CDR prediction down to identifying pairwise interactions between subcomponents. Such a subcomponent interaction form can offer a traceable path to explicitly indicate which subcomponents contribute more to the response outcome. We verify the superiority of SubCDR over state-of-the-art CDR prediction methods through extensive computational experiments on the GDSC dataset. Crucially, we found many predicted cases that demonstrate the strength of SubCDR in finding the key subcomponents driving responses and exploiting these subcomponents to discover new therapeutic drugs. These results suggest that SubCDR will be highly useful for biomedical researchers, particularly in anti-cancer drug design.

准确预测癌症药物反应(CDR)是现代肿瘤学中的一个长期挑战,也是个性化治疗的基础。目前的计算方法通过对整个药物和细胞系之间的反应进行建模来实现CDR预测,而不考虑反应结果可能主要归因于少数精细水平的“子成分”,如药物的特权子结构或癌症细胞的基因特征,从而产生难以解释的预测。在此,我们提出了SubCDR,这是一种用于可解释CDR预测的子组件引导的深度学习方法,以识别驱动反应结果的最相关的子组件。从技术上讲,SubCDR建立在一系列深度神经网络的基础上,该网络能够从每种药物和细胞系图谱中提取一组功能性子成分,并将CDR预测分解为识别子成分之间的成对相互作用。这样的子组件交互表单可以提供一个可跟踪的路径,明确指示哪些子组件对响应结果的贡献更大。我们通过在GDSC数据集上进行大量计算实验,验证了SubCDR相对于最先进的CDR预测方法的优越性。至关重要的是,我们发现了许多预测病例,这些病例证明了亚CDR在寻找驱动反应的关键子成分并利用这些子成分发现新的治疗药物方面的优势。这些结果表明,SubCDR将对生物医学研究人员非常有用,特别是在抗癌药物设计方面。
{"title":"A subcomponent-guided deep learning method for interpretable cancer drug response prediction.","authors":"Xuan Liu,&nbsp;Wen Zhang","doi":"10.1371/journal.pcbi.1011382","DOIUrl":"10.1371/journal.pcbi.1011382","url":null,"abstract":"<p><p>Accurate prediction of cancer drug response (CDR) is a longstanding challenge in modern oncology that underpins personalized treatment. Current computational methods implement CDR prediction by modeling responses between entire drugs and cell lines, without the consideration that response outcomes may primarily attribute to a few finer-level 'subcomponents', such as privileged substructures of the drug or gene signatures of the cancer cell, thus producing predictions that are hard to explain. Herein, we present SubCDR, a subcomponent-guided deep learning method for interpretable CDR prediction, to recognize the most relevant subcomponents driving response outcomes. Technically, SubCDR is built upon a line of deep neural networks that enables a set of functional subcomponents to be extracted from each drug and cell line profile, and breaks the CDR prediction down to identifying pairwise interactions between subcomponents. Such a subcomponent interaction form can offer a traceable path to explicitly indicate which subcomponents contribute more to the response outcome. We verify the superiority of SubCDR over state-of-the-art CDR prediction methods through extensive computational experiments on the GDSC dataset. Crucially, we found many predicted cases that demonstrate the strength of SubCDR in finding the key subcomponents driving responses and exploiting these subcomponents to discover new therapeutic drugs. These results suggest that SubCDR will be highly useful for biomedical researchers, particularly in anti-cancer drug design.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011382"},"PeriodicalIF":4.3,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10470940/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10135107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Efficient sampling-based Bayesian Active Learning for synaptic characterization. 用于突触表征的高效基于采样的贝叶斯主动学习。
IF 4.3 2区 生物学 Pub Date : 2023-08-21 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011342
Camille Gontier, Simone Carlo Surace, Igor Delvendahl, Martin Müller, Jean-Pascal Pfister

Bayesian Active Learning (BAL) is an efficient framework for learning the parameters of a model, in which input stimuli are selected to maximize the mutual information between the observations and the unknown parameters. However, the applicability of BAL to experiments is limited as it requires performing high-dimensional integrations and optimizations in real time. Current methods are either too time consuming, or only applicable to specific models. Here, we propose an Efficient Sampling-Based Bayesian Active Learning (ESB-BAL) framework, which is efficient enough to be used in real-time biological experiments. We apply our method to the problem of estimating the parameters of a chemical synapse from the postsynaptic responses to evoked presynaptic action potentials. Using synthetic data and synaptic whole-cell patch-clamp recordings, we show that our method can improve the precision of model-based inferences, thereby paving the way towards more systematic and efficient experimental designs in physiology.

贝叶斯主动学习(BAL)是一种用于学习模型参数的有效框架,其中选择输入刺激以最大化观测值和未知参数之间的相互信息。然而,BAL对实验的适用性是有限的,因为它需要实时执行高维积分和优化。当前的方法要么过于耗时,要么只适用于特定的模型。在这里,我们提出了一个基于有效采样的贝叶斯主动学习(ESB-BAL)框架,该框架足够有效,可以用于实时生物实验。我们将我们的方法应用于从突触后对诱发的突触前动作电位的反应来估计化学突触的参数的问题。使用合成数据和突触全细胞膜片钳记录,我们表明我们的方法可以提高基于模型的推断的精度,从而为生理学中更系统、更有效的实验设计铺平道路。
{"title":"Efficient sampling-based Bayesian Active Learning for synaptic characterization.","authors":"Camille Gontier, Simone Carlo Surace, Igor Delvendahl, Martin Müller, Jean-Pascal Pfister","doi":"10.1371/journal.pcbi.1011342","DOIUrl":"10.1371/journal.pcbi.1011342","url":null,"abstract":"<p><p>Bayesian Active Learning (BAL) is an efficient framework for learning the parameters of a model, in which input stimuli are selected to maximize the mutual information between the observations and the unknown parameters. However, the applicability of BAL to experiments is limited as it requires performing high-dimensional integrations and optimizations in real time. Current methods are either too time consuming, or only applicable to specific models. Here, we propose an Efficient Sampling-Based Bayesian Active Learning (ESB-BAL) framework, which is efficient enough to be used in real-time biological experiments. We apply our method to the problem of estimating the parameters of a chemical synapse from the postsynaptic responses to evoked presynaptic action potentials. Using synthetic data and synaptic whole-cell patch-clamp recordings, we show that our method can improve the precision of model-based inferences, thereby paving the way towards more systematic and efficient experimental designs in physiology.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011342"},"PeriodicalIF":4.3,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10470935/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10135106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Metabolic modeling of sex-specific liver tissue suggests mechanism of differences in toxicological responses. 性别特异性肝组织的代谢模型提示了毒理学反应差异的机制。
IF 4.3 2区 生物学 Pub Date : 2023-08-21 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1010927
Connor J Moore, Christopher P Holstege, Jason A Papin

Male subjects in animal and human studies are disproportionately used for toxicological testing. This discrepancy is evidenced in clinical medicine where females are more likely than males to experience liver-related adverse events in response to xenobiotics. While previous work has shown gene expression differences between the sexes, there is a lack of systems-level approaches to understand the direct clinical impact of these differences. Here, we integrate gene expression data with metabolic network models to characterize the impact of transcriptional changes of metabolic genes in the context of sex differences and drug treatment. We used Tasks Inferred from Differential Expression (TIDEs), a reaction-centric approach to analyzing differences in gene expression, to discover that several metabolic pathways exhibit sex differences including glycolysis, fatty acid metabolism, nucleotide metabolism, and xenobiotics metabolism. When TIDEs is used to compare expression differences in treated and untreated hepatocytes, we find several subsystems with differential expression overlap with the sex-altered pathways such as fatty acid metabolism, purine and pyrimidine metabolism, and xenobiotics metabolism. Finally, using sex-specific transcriptomic data, we create individual and averaged male and female liver models and find differences in the pentose phosphate pathway and other metabolic pathways. These results suggest potential sex differences in the contribution of the pentose phosphate pathway to oxidative stress, and we recommend further research into how these reactions respond to hepatotoxic pharmaceuticals.

动物和人类研究中的男性受试者被不成比例地用于毒理学测试。这种差异在临床医学中得到了证明,女性比男性更有可能在对外源性药物的反应中出现与肝脏相关的不良事件。虽然先前的研究表明性别之间的基因表达存在差异,但缺乏系统层面的方法来理解这些差异的直接临床影响。在这里,我们将基因表达数据与代谢网络模型相结合,以表征性别差异和药物治疗背景下代谢基因转录变化的影响。我们使用从差异表达推断的任务(TIDE),这是一种以反应为中心的分析基因表达差异的方法,发现几种代谢途径表现出性别差异,包括糖酵解、脂肪酸代谢、核苷酸代谢和外源性代谢。当使用TIDE来比较处理和未处理肝细胞的表达差异时,我们发现几个具有差异表达的子系统与性别改变的途径重叠,如脂肪酸代谢、嘌呤和嘧啶代谢以及外源性代谢。最后,使用性别特异性转录组数据,我们创建了个体和平均的男性和女性肝脏模型,并发现了磷酸戊糖途径和其他代谢途径的差异。这些结果表明,磷酸戊糖途径对氧化应激的贡献存在潜在的性别差异,我们建议进一步研究这些反应对肝毒性药物的反应。
{"title":"Metabolic modeling of sex-specific liver tissue suggests mechanism of differences in toxicological responses.","authors":"Connor J Moore, Christopher P Holstege, Jason A Papin","doi":"10.1371/journal.pcbi.1010927","DOIUrl":"10.1371/journal.pcbi.1010927","url":null,"abstract":"<p><p>Male subjects in animal and human studies are disproportionately used for toxicological testing. This discrepancy is evidenced in clinical medicine where females are more likely than males to experience liver-related adverse events in response to xenobiotics. While previous work has shown gene expression differences between the sexes, there is a lack of systems-level approaches to understand the direct clinical impact of these differences. Here, we integrate gene expression data with metabolic network models to characterize the impact of transcriptional changes of metabolic genes in the context of sex differences and drug treatment. We used Tasks Inferred from Differential Expression (TIDEs), a reaction-centric approach to analyzing differences in gene expression, to discover that several metabolic pathways exhibit sex differences including glycolysis, fatty acid metabolism, nucleotide metabolism, and xenobiotics metabolism. When TIDEs is used to compare expression differences in treated and untreated hepatocytes, we find several subsystems with differential expression overlap with the sex-altered pathways such as fatty acid metabolism, purine and pyrimidine metabolism, and xenobiotics metabolism. Finally, using sex-specific transcriptomic data, we create individual and averaged male and female liver models and find differences in the pentose phosphate pathway and other metabolic pathways. These results suggest potential sex differences in the contribution of the pentose phosphate pathway to oxidative stress, and we recommend further research into how these reactions respond to hepatotoxic pharmaceuticals.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1010927"},"PeriodicalIF":4.3,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10470949/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10141819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
STREAK: A supervised cell surface receptor abundance estimation strategy for single cell RNA-sequencing data using feature selection and thresholded gene set scoring. STREAK:一种使用特征选择和阈值基因集评分的单细胞RNA测序数据的监督细胞表面受体丰度估计策略。
IF 4.3 2区 生物学 Pub Date : 2023-08-21 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011413
Azka Javaid, Hildreth Robert Frost

The accurate estimation of cell surface receptor abundance for single cell transcriptomics data is important for the tasks of cell type and phenotype categorization and cell-cell interaction quantification. We previously developed an unsupervised receptor abundance estimation technique named SPECK (Surface Protein abundance Estimation using CKmeans-based clustered thresholding) to address the challenges associated with accurate abundance estimation. In that paper, we concluded that SPECK results in improved concordance with Cellular Indexing of Transcriptomes and Epitopes by Sequencing (CITE-seq) data relative to comparative unsupervised abundance estimation techniques using only single-cell RNA-sequencing (scRNA-seq) data. In this paper, we outline a new supervised receptor abundance estimation method called STREAK (gene Set Testing-based Receptor abundance Estimation using Adjusted distances and cKmeans thresholding) that leverages associations learned from joint scRNA-seq/CITE-seq training data and a thresholded gene set scoring mechanism to estimate receptor abundance for scRNA-seq target data. We evaluate STREAK relative to both unsupervised and supervised receptor abundance estimation techniques using two evaluation approaches on six joint scRNA-seq/CITE-seq datasets that represent four human and mouse tissue types. We conclude that STREAK outperforms other abundance estimation strategies and provides a more biologically interpretable and transparent statistical model.

单细胞转录组学数据的细胞表面受体丰度的准确估计对于细胞类型和表型分类以及细胞-细胞相互作用定量的任务是重要的。我们之前开发了一种名为SPECK(使用基于CKmeans的聚类阈值的表面蛋白丰度估计)的无监督受体丰度估计技术,以解决与准确丰度估计相关的挑战。在那篇论文中,我们得出结论,与仅使用单细胞RNA测序(scRNA-seq)数据的比较无监督丰度估计技术相比,SPECK与通过测序的转录组和表位的细胞索引(CITE-seq)数据提高了一致性。在本文中,我们概述了一种新的监督受体丰度估计方法,称为STREAK(使用调整后的距离和cKmeans阈值的基于基因集测试的受体丰度估计),该方法利用从scRNA-seq/CITE-seq联合训练数据中学习到的关联和阈值基因集评分机制来估计scRNA-seq靶数据的受体丰度。我们在代表四种人类和小鼠组织类型的六个联合scRNA-seq/CITE-seq数据集上使用两种评估方法,相对于无监督和有监督的受体丰度估计技术来评估STREAK。我们得出的结论是,STREAK优于其他丰度估计策略,并提供了一个更具生物学可解释性和透明性的统计模型。
{"title":"STREAK: A supervised cell surface receptor abundance estimation strategy for single cell RNA-sequencing data using feature selection and thresholded gene set scoring.","authors":"Azka Javaid, Hildreth Robert Frost","doi":"10.1371/journal.pcbi.1011413","DOIUrl":"10.1371/journal.pcbi.1011413","url":null,"abstract":"<p><p>The accurate estimation of cell surface receptor abundance for single cell transcriptomics data is important for the tasks of cell type and phenotype categorization and cell-cell interaction quantification. We previously developed an unsupervised receptor abundance estimation technique named SPECK (Surface Protein abundance Estimation using CKmeans-based clustered thresholding) to address the challenges associated with accurate abundance estimation. In that paper, we concluded that SPECK results in improved concordance with Cellular Indexing of Transcriptomes and Epitopes by Sequencing (CITE-seq) data relative to comparative unsupervised abundance estimation techniques using only single-cell RNA-sequencing (scRNA-seq) data. In this paper, we outline a new supervised receptor abundance estimation method called STREAK (gene Set Testing-based Receptor abundance Estimation using Adjusted distances and cKmeans thresholding) that leverages associations learned from joint scRNA-seq/CITE-seq training data and a thresholded gene set scoring mechanism to estimate receptor abundance for scRNA-seq target data. We evaluate STREAK relative to both unsupervised and supervised receptor abundance estimation techniques using two evaluation approaches on six joint scRNA-seq/CITE-seq datasets that represent four human and mouse tissue types. We conclude that STREAK outperforms other abundance estimation strategies and provides a more biologically interpretable and transparent statistical model.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011413"},"PeriodicalIF":4.3,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10470905/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10143839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regulated bacterial interaction networks: A mathematical framework to describe competitive growth under inclusion of metabolite cross-feeding. 受调控的细菌相互作用网络:一个数学框架,用于描述在包含代谢物交叉喂养的情况下的竞争性生长。
IF 4.3 2区 生物学 Pub Date : 2023-08-21 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011402
Isaline Guex, Christian Mazza, Manupriyam Dubey, Maxime Batsch, Renyi Li, Jan Roelof van der Meer

When bacterial species with the same resource preferences share the same growth environment, it is commonly believed that direct competition will arise. A large variety of competition and more general 'interaction' models have been formulated, but what is currently lacking are models that link monoculture growth kinetics and community growth under inclusion of emerging biological interactions, such as metabolite cross-feeding. In order to understand and mathematically describe the nature of potential cross-feeding interactions, we design experiments where two bacterial species Pseudomonas putida and Pseudomonas veronii grow in liquid medium either in mono- or as co-culture in a resource-limited environment. We measure population growth under single substrate competition or with double species-specific substrates (substrate 'indifference'), and starting from varying cell ratios of either species. Using experimental data as input, we first consider a mean-field model of resource-based competition, which captures well the empirically observed growth rates for monocultures, but fails to correctly predict growth rates in co-culture mixtures, in particular for skewed starting species ratios. Based on this, we extend the model by cross-feeding interactions where the consumption of substrate by one consumer produces metabolites that in turn are resources for the other consumer, thus leading to positive feedback in the species system. Two different cross-feeding options were considered, which either lead to constant metabolite cross-feeding, or to a regulated form, where metabolite utilization is activated with rates according to either a threshold or a Hill function, dependent on metabolite concentration. Both mathematical proof and experimental data indicate regulated cross-feeding to be the preferred model to constant metabolite utilization, with best co-culture growth predictions in case of high Hill coefficients, close to binary (on/off) activation states. This suggests that species use the appearing metabolite concentrations only when they are becoming high enough; possibly as a consequence of their lower energetic content than the primary substrate. Metabolite sharing was particularly relevant at unbalanced starting cell ratios, causing the minority partner to proliferate more than expected from the competitive substrate because of metabolite release from the majority partner. This effect thus likely quells immediate substrate competition and may be important in natural communities with typical very skewed relative taxa abundances and slower-growing taxa. In conclusion, the regulated bacterial interaction network correctly describes species substrate growth reactions in mixtures with few kinetic parameters that can be obtained from monoculture growth experiments.

当具有相同资源偏好的细菌物种共享相同的生长环境时,通常认为会出现直接竞争。已经制定了各种各样的竞争和更通用的“相互作用”模型,但目前缺乏的是将单一栽培生长动力学和群落生长联系起来的模型,包括新出现的生物相互作用,如代谢物交叉喂养。为了理解和数学描述潜在交叉喂养相互作用的性质,我们设计了两种细菌在液体培养基中生长的实验,在资源有限的环境中,两种细菌分别以单培养或共培养的方式生长。我们测量了单底物竞争或双物种特异性底物(底物“无差异”)下的种群增长,并从两个物种不同的细胞比例开始。使用实验数据作为输入,我们首先考虑了基于资源的竞争的平均场模型,该模型很好地捕捉到了经验上观察到的单一栽培的增长率,但未能正确预测共同栽培混合物中的生长率,特别是对于偏斜的起始物种比率。基于此,我们通过交叉喂养相互作用扩展了模型,其中一个消费者对底物的消耗产生代谢物,而代谢物又是另一个消费者的资源,从而在物种系统中产生正反馈。考虑了两种不同的交叉喂养选择,这两种选择要么导致持续的代谢物交叉喂养,要么导致一种受调节的形式,其中代谢物的利用率根据阈值或Hill函数激活,取决于代谢物浓度。数学证明和实验数据都表明,调节交叉喂养是恒定代谢产物利用的首选模型,在Hill系数高、接近二元(开/关)激活状态的情况下,具有最佳的共培养生长预测。这表明,物种只有在达到足够高的浓度时才会使用出现的代谢物浓度;可能是由于它们的能量含量低于主要基质。代谢产物共享在起始细胞比例不平衡的情况下尤其重要,由于多数伴侣释放代谢产物,导致少数伴侣从竞争底物中增殖得比预期的更多。因此,这种效应可能会平息直接的基质竞争,并且在具有典型的非常偏斜的相对类群丰度和生长较慢的类群的自然群落中可能很重要。总之,受调控的细菌相互作用网络正确地描述了混合物中物种-底物的生长反应,而从单一栽培生长实验中可以获得的动力学参数很少。
{"title":"Regulated bacterial interaction networks: A mathematical framework to describe competitive growth under inclusion of metabolite cross-feeding.","authors":"Isaline Guex,&nbsp;Christian Mazza,&nbsp;Manupriyam Dubey,&nbsp;Maxime Batsch,&nbsp;Renyi Li,&nbsp;Jan Roelof van der Meer","doi":"10.1371/journal.pcbi.1011402","DOIUrl":"10.1371/journal.pcbi.1011402","url":null,"abstract":"<p><p>When bacterial species with the same resource preferences share the same growth environment, it is commonly believed that direct competition will arise. A large variety of competition and more general 'interaction' models have been formulated, but what is currently lacking are models that link monoculture growth kinetics and community growth under inclusion of emerging biological interactions, such as metabolite cross-feeding. In order to understand and mathematically describe the nature of potential cross-feeding interactions, we design experiments where two bacterial species Pseudomonas putida and Pseudomonas veronii grow in liquid medium either in mono- or as co-culture in a resource-limited environment. We measure population growth under single substrate competition or with double species-specific substrates (substrate 'indifference'), and starting from varying cell ratios of either species. Using experimental data as input, we first consider a mean-field model of resource-based competition, which captures well the empirically observed growth rates for monocultures, but fails to correctly predict growth rates in co-culture mixtures, in particular for skewed starting species ratios. Based on this, we extend the model by cross-feeding interactions where the consumption of substrate by one consumer produces metabolites that in turn are resources for the other consumer, thus leading to positive feedback in the species system. Two different cross-feeding options were considered, which either lead to constant metabolite cross-feeding, or to a regulated form, where metabolite utilization is activated with rates according to either a threshold or a Hill function, dependent on metabolite concentration. Both mathematical proof and experimental data indicate regulated cross-feeding to be the preferred model to constant metabolite utilization, with best co-culture growth predictions in case of high Hill coefficients, close to binary (on/off) activation states. This suggests that species use the appearing metabolite concentrations only when they are becoming high enough; possibly as a consequence of their lower energetic content than the primary substrate. Metabolite sharing was particularly relevant at unbalanced starting cell ratios, causing the minority partner to proliferate more than expected from the competitive substrate because of metabolite release from the majority partner. This effect thus likely quells immediate substrate competition and may be important in natural communities with typical very skewed relative taxa abundances and slower-growing taxa. In conclusion, the regulated bacterial interaction network correctly describes species substrate growth reactions in mixtures with few kinetic parameters that can be obtained from monoculture growth experiments.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011402"},"PeriodicalIF":4.3,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10470959/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10141824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing reinforcement learning models by including direct and indirect pathways improves performance on striatal dependent tasks. 通过包括直接和间接途径来增强强化学习模型可以提高纹状体依赖任务的表现。
IF 4.3 2区 生物学 Pub Date : 2023-08-18 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011385
Kim T Blackwell, Kenji Doya

A major advance in understanding learning behavior stems from experiments showing that reward learning requires dopamine inputs to striatal neurons and arises from synaptic plasticity of cortico-striatal synapses. Numerous reinforcement learning models mimic this dopamine-dependent synaptic plasticity by using the reward prediction error, which resembles dopamine neuron firing, to learn the best action in response to a set of cues. Though these models can explain many facets of behavior, reproducing some types of goal-directed behavior, such as renewal and reversal, require additional model components. Here we present a reinforcement learning model, TD2Q, which better corresponds to the basal ganglia with two Q matrices, one representing direct pathway neurons (G) and another representing indirect pathway neurons (N). Unlike previous two-Q architectures, a novel and critical aspect of TD2Q is to update the G and N matrices utilizing the temporal difference reward prediction error. A best action is selected for N and G using a softmax with a reward-dependent adaptive exploration parameter, and then differences are resolved using a second selection step applied to the two action probabilities. The model is tested on a range of multi-step tasks including extinction, renewal, discrimination; switching reward probability learning; and sequence learning. Simulations show that TD2Q produces behaviors similar to rodents in choice and sequence learning tasks, and that use of the temporal difference reward prediction error is required to learn multi-step tasks. Blocking the update rule on the N matrix blocks discrimination learning, as observed experimentally. Performance in the sequence learning task is dramatically improved with two matrices. These results suggest that including additional aspects of basal ganglia physiology can improve the performance of reinforcement learning models, better reproduce animal behaviors, and provide insight as to the role of direct- and indirect-pathway striatal neurons.

在理解学习行为方面的一个重大进展源于实验,该实验表明,奖励学习需要多巴胺输入到纹状体神经元,并源于皮质-纹状体突触的突触可塑性。许多强化学习模型通过使用类似于多巴胺神经元放电的奖励预测误差来学习对一组线索的最佳反应,从而模拟这种多巴胺依赖性突触可塑性。尽管这些模型可以解释行为的许多方面,但复制某些类型的目标导向行为,如更新和反转,需要额外的模型组件。在这里,我们提出了一个强化学习模型TD2Q,它更好地对应于具有两个Q矩阵的基底神经节,一个表示直接通路神经元(G),另一个表示间接通路神经元(N)。与之前的两个Q架构不同,TD2Q的一个新颖而关键的方面是利用时间差奖励预测误差来更新G和N矩阵。使用具有依赖于奖励的自适应探索参数的softmax为N和G选择最佳动作,然后使用应用于两个动作概率的第二选择步骤来解决差异。该模型在一系列多步骤任务中进行了测试,包括灭绝、更新、歧视;切换奖励概率学习;以及序列学习。仿真表明,TD2Q在选择和序列学习任务中产生类似啮齿动物的行为,并且需要使用时间差奖励预测误差来学习多步骤任务。如实验所观察到的,在N个矩阵上阻止更新规则会阻止区分学习。使用两个矩阵可以显著提高序列学习任务的性能。这些结果表明,包括基底节生理学的其他方面可以提高强化学习模型的性能,更好地再现动物行为,并深入了解直接和间接途径纹状体神经元的作用。
{"title":"Enhancing reinforcement learning models by including direct and indirect pathways improves performance on striatal dependent tasks.","authors":"Kim T Blackwell,&nbsp;Kenji Doya","doi":"10.1371/journal.pcbi.1011385","DOIUrl":"10.1371/journal.pcbi.1011385","url":null,"abstract":"<p><p>A major advance in understanding learning behavior stems from experiments showing that reward learning requires dopamine inputs to striatal neurons and arises from synaptic plasticity of cortico-striatal synapses. Numerous reinforcement learning models mimic this dopamine-dependent synaptic plasticity by using the reward prediction error, which resembles dopamine neuron firing, to learn the best action in response to a set of cues. Though these models can explain many facets of behavior, reproducing some types of goal-directed behavior, such as renewal and reversal, require additional model components. Here we present a reinforcement learning model, TD2Q, which better corresponds to the basal ganglia with two Q matrices, one representing direct pathway neurons (G) and another representing indirect pathway neurons (N). Unlike previous two-Q architectures, a novel and critical aspect of TD2Q is to update the G and N matrices utilizing the temporal difference reward prediction error. A best action is selected for N and G using a softmax with a reward-dependent adaptive exploration parameter, and then differences are resolved using a second selection step applied to the two action probabilities. The model is tested on a range of multi-step tasks including extinction, renewal, discrimination; switching reward probability learning; and sequence learning. Simulations show that TD2Q produces behaviors similar to rodents in choice and sequence learning tasks, and that use of the temporal difference reward prediction error is required to learn multi-step tasks. Blocking the update rule on the N matrix blocks discrimination learning, as observed experimentally. Performance in the sequence learning task is dramatically improved with two matrices. These results suggest that including additional aspects of basal ganglia physiology can improve the performance of reinforcement learning models, better reproduce animal behaviors, and provide insight as to the role of direct- and indirect-pathway striatal neurons.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011385"},"PeriodicalIF":4.3,"publicationDate":"2023-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10479916/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10165386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
BALDR: A Web-based platform for informed comparison and prioritization of biomarker candidates for type 2 diabetes mellitus. BALDR:一个基于Web的平台,用于2型糖尿病候选生物标志物的知情比较和优先级排序。
IF 4.3 2区 生物学 Pub Date : 2023-08-17 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011403
Agnete T Lundgaard, Frédéric Burdet, Troels Siggaard, David Westergaard, Danai Vagiaki, Lisa Cantwell, Timo Röder, Dorte Vistisen, Thomas Sparsø, Giuseppe N Giordano, Mark Ibberson, Karina Banasik, Søren Brunak

Novel biomarkers are key to addressing the ongoing pandemic of type 2 diabetes mellitus. While new technologies have improved the potential of identifying such biomarkers, at the same time there is an increasing need for informed prioritization to ensure efficient downstream verification. We have built BALDR, an automated pipeline for biomarker comparison and prioritization in the context of diabetes. BALDR includes protein, gene, and disease data from major public repositories, text-mining data, and human and mouse experimental data from the IMI2 RHAPSODY consortium. These data are provided as easy-to-read figures and tables enabling direct comparison of up to 20 biomarker candidates for diabetes through the public website https://baldr.cpr.ku.dk.

新型生物标志物是应对2型糖尿病持续流行的关键。虽然新技术提高了识别此类生物标志物的潜力,但同时也越来越需要知情的优先顺序,以确保有效的下游验证。我们已经建立了BALDR,这是一个在糖尿病背景下进行生物标志物比较和优先排序的自动化管道。BALDR包括来自主要公共存储库的蛋白质、基因和疾病数据、文本挖掘数据以及来自IMI2-RHAPSODY联盟的人类和小鼠实验数据。这些数据以易于阅读的图表形式提供,可以通过公共网站直接比较多达20种糖尿病候选生物标志物https://baldr.cpr.ku.dk.
{"title":"BALDR: A Web-based platform for informed comparison and prioritization of biomarker candidates for type 2 diabetes mellitus.","authors":"Agnete T Lundgaard,&nbsp;Frédéric Burdet,&nbsp;Troels Siggaard,&nbsp;David Westergaard,&nbsp;Danai Vagiaki,&nbsp;Lisa Cantwell,&nbsp;Timo Röder,&nbsp;Dorte Vistisen,&nbsp;Thomas Sparsø,&nbsp;Giuseppe N Giordano,&nbsp;Mark Ibberson,&nbsp;Karina Banasik,&nbsp;Søren Brunak","doi":"10.1371/journal.pcbi.1011403","DOIUrl":"10.1371/journal.pcbi.1011403","url":null,"abstract":"<p><p>Novel biomarkers are key to addressing the ongoing pandemic of type 2 diabetes mellitus. While new technologies have improved the potential of identifying such biomarkers, at the same time there is an increasing need for informed prioritization to ensure efficient downstream verification. We have built BALDR, an automated pipeline for biomarker comparison and prioritization in the context of diabetes. BALDR includes protein, gene, and disease data from major public repositories, text-mining data, and human and mouse experimental data from the IMI2 RHAPSODY consortium. These data are provided as easy-to-read figures and tables enabling direct comparison of up to 20 biomarker candidates for diabetes through the public website https://baldr.cpr.ku.dk.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011403"},"PeriodicalIF":4.3,"publicationDate":"2023-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10464978/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10117668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The specious art of single-cell genomics. 似是而非的单细胞基因组学艺术。
IF 4.3 2区 生物学 Pub Date : 2023-08-17 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1011288
Tara Chari, Lior Pachter

Dimensionality reduction is standard practice for filtering noise and identifying relevant features in large-scale data analyses. In biology, single-cell genomics studies typically begin with reduction to 2 or 3 dimensions to produce "all-in-one" visuals of the data that are amenable to the human eye, and these are subsequently used for qualitative and quantitative exploratory analysis. However, there is little theoretical support for this practice, and we show that extreme dimension reduction, from hundreds or thousands of dimensions to 2, inevitably induces significant distortion of high-dimensional datasets. We therefore examine the practical implications of low-dimensional embedding of single-cell data and find that extensive distortions and inconsistent practices make such embeddings counter-productive for exploratory, biological analyses. In lieu of this, we discuss alternative approaches for conducting targeted embedding and feature exploration to enable hypothesis-driven biological discovery.

降维是大规模数据分析中过滤噪音和识别相关特征的标准做法。在生物学中,单细胞基因组学研究通常首先将维度缩减到 2 或 3 维,以产生适合人眼的 "一体化 "数据视觉效果,然后再将其用于定性和定量探索性分析。然而,这种做法几乎没有理论支持,我们的研究表明,将维度从成百上千降到 2 维的极端降维做法不可避免地会导致高维数据集严重失真。因此,我们研究了单细胞数据低维嵌入的实际影响,发现广泛的失真和不一致的做法使这种嵌入对探索性生物分析适得其反。为此,我们讨论了进行有针对性的嵌入和特征探索的替代方法,以实现假设驱动的生物发现。
{"title":"The specious art of single-cell genomics.","authors":"Tara Chari, Lior Pachter","doi":"10.1371/journal.pcbi.1011288","DOIUrl":"10.1371/journal.pcbi.1011288","url":null,"abstract":"<p><p>Dimensionality reduction is standard practice for filtering noise and identifying relevant features in large-scale data analyses. In biology, single-cell genomics studies typically begin with reduction to 2 or 3 dimensions to produce \"all-in-one\" visuals of the data that are amenable to the human eye, and these are subsequently used for qualitative and quantitative exploratory analysis. However, there is little theoretical support for this practice, and we show that extreme dimension reduction, from hundreds or thousands of dimensions to 2, inevitably induces significant distortion of high-dimensional datasets. We therefore examine the practical implications of low-dimensional embedding of single-cell data and find that extensive distortions and inconsistent practices make such embeddings counter-productive for exploratory, biological analyses. In lieu of this, we discuss alternative approaches for conducting targeted embedding and feature exploration to enable hypothesis-driven biological discovery.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1011288"},"PeriodicalIF":4.3,"publicationDate":"2023-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10434946/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10037203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Elucidation of genome-wide understudied proteins targeted by PROTAC-induced degradation using interpretable machine learning. 使用可解释的机器学习阐明PROTAC诱导降解靶向的全基因组研究不足的蛋白质。
IF 4.3 2区 生物学 Pub Date : 2023-08-17 eCollection Date: 2023-08-01 DOI: 10.1371/journal.pcbi.1010974
Li Xie, Lei Xie

Proteolysis-targeting chimeras (PROTACs) are hetero-bifunctional molecules that induce the degradation of target proteins by recruiting an E3 ligase. PROTACs have the potential to inactivate disease-related genes that are considered undruggable by small molecules, making them a promising therapy for the treatment of incurable diseases. However, only a few hundred proteins have been experimentally tested for their amenability to PROTACs, and it remains unclear which other proteins in the entire human genome can be targeted by PROTACs. In this study, we have developed PrePROTAC, an interpretable machine learning model based on a transformer-based protein sequence descriptor and random forest classification. PrePROTAC predicts genome-wide targets that can be degraded by CRBN, one of the E3 ligases. In the benchmark studies, PrePROTAC achieved a ROC-AUC of 0.81, an average precision of 0.84, and over 40% sensitivity at a false positive rate of 0.05. When evaluated by an external test set which comprised proteins from different structural folds than those in the training set, the performance of PrePROTAC did not drop significantly, indicating its generalizability. Furthermore, we developed an embedding SHapley Additive exPlanations (eSHAP) method, which extends conventional SHAP analysis for original features to an embedding space through in silico mutagenesis. This method allowed us to identify key residues in the protein structure that play critical roles in PROTAC activity. The identified key residues were consistent with existing knowledge. Using PrePROTAC, we identified over 600 novel understudied proteins that are potentially degradable by CRBN and proposed PROTAC compounds for three novel drug targets associated with Alzheimer's disease.

蛋白质水解靶向嵌合体(PROTACs)是一种异质双功能分子,通过募集E3连接酶来诱导靶蛋白的降解。PROTAC有可能使被认为是小分子不可治疗的疾病相关基因失活,使其成为治疗不治之症的一种有前景的疗法。然而,只有几百种蛋白质对PROTAC的适应性进行了实验测试,目前尚不清楚整个人类基因组中的哪些其他蛋白质可以被PROTAC靶向。在这项研究中,我们开发了PrePROTAC,这是一个可解释的机器学习模型,基于基于转换器的蛋白质序列描述符和随机森林分类。PrePROTAC预测可以被E3连接酶之一的CRBN降解的全基因组靶标。在基准研究中,PrePROTAC的ROC-AUC为0.81,平均精密度为0.84,在假阳性率为0.05的情况下,灵敏度超过40%。当通过包含与训练集中不同结构折叠的蛋白质的外部测试集进行评估时,PrePROTAC的性能没有显著下降,表明其可推广性。此外,我们开发了一种嵌入SHapley Additive exPlanations(eSHAP)方法,该方法通过计算机诱变将原始特征的传统SHAP分析扩展到嵌入空间。这种方法使我们能够识别蛋白质结构中在PROTAC活性中起关键作用的关键残基。已鉴定的关键残留物与现有知识一致。使用PrePROTAC,我们鉴定了600多种新的研究不足的蛋白质,这些蛋白质可能被CRBN降解,并提出了用于与阿尔茨海默病相关的三个新药物靶点的PROTAC化合物。
{"title":"Elucidation of genome-wide understudied proteins targeted by PROTAC-induced degradation using interpretable machine learning.","authors":"Li Xie, Lei Xie","doi":"10.1371/journal.pcbi.1010974","DOIUrl":"10.1371/journal.pcbi.1010974","url":null,"abstract":"<p><p>Proteolysis-targeting chimeras (PROTACs) are hetero-bifunctional molecules that induce the degradation of target proteins by recruiting an E3 ligase. PROTACs have the potential to inactivate disease-related genes that are considered undruggable by small molecules, making them a promising therapy for the treatment of incurable diseases. However, only a few hundred proteins have been experimentally tested for their amenability to PROTACs, and it remains unclear which other proteins in the entire human genome can be targeted by PROTACs. In this study, we have developed PrePROTAC, an interpretable machine learning model based on a transformer-based protein sequence descriptor and random forest classification. PrePROTAC predicts genome-wide targets that can be degraded by CRBN, one of the E3 ligases. In the benchmark studies, PrePROTAC achieved a ROC-AUC of 0.81, an average precision of 0.84, and over 40% sensitivity at a false positive rate of 0.05. When evaluated by an external test set which comprised proteins from different structural folds than those in the training set, the performance of PrePROTAC did not drop significantly, indicating its generalizability. Furthermore, we developed an embedding SHapley Additive exPlanations (eSHAP) method, which extends conventional SHAP analysis for original features to an embedding space through in silico mutagenesis. This method allowed us to identify key residues in the protein structure that play critical roles in PROTAC activity. The identified key residues were consistent with existing knowledge. Using PrePROTAC, we identified over 600 novel understudied proteins that are potentially degradable by CRBN and proposed PROTAC compounds for three novel drug targets associated with Alzheimer's disease.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 8","pages":"e1010974"},"PeriodicalIF":4.3,"publicationDate":"2023-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10464998/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10134611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
PLoS Computational Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1