首页 > 最新文献

PLoS Computational Biology最新文献

英文 中文
How robust are estimates of key parameters in standard viral dynamic models? 标准病毒动态模型中关键参数的估计值有多可靠?
IF 4.3 2区 生物学 Pub Date : 2024-04-16 DOI: 10.1371/journal.pcbi.1011437
Carolin Zitzmann, Ruian Ke, R. Ribeiro, A. Perelson
Mathematical models of viral infection have been developed, fitted to data, and provide insight into disease pathogenesis for multiple agents that cause chronic infection, including HIV, hepatitis C, and B virus. However, for agents that cause acute infections or during the acute stage of agents that cause chronic infections, viral load data are often collected after symptoms develop, usually around or after the peak viral load. Consequently, we frequently lack data in the initial phase of viral growth, i.e., when pre-symptomatic transmission events occur. Missing data may make estimating the time of infection, the infectious period, and parameters in viral dynamic models, such as the cell infection rate, difficult. However, having extra information, such as the average time to peak viral load, may improve the robustness of the estimation. Here, we evaluated the robustness of estimates of key model parameters when viral load data prior to the viral load peak is missing, when we know the values of some parameters and/or the time from infection to peak viral load. Although estimates of the time of infection are sensitive to the quality and amount of available data, particularly pre-peak, other parameters important in understanding disease pathogenesis, such as the loss rate of infected cells, are less sensitive. Viral infectivity and the viral production rate are key parameters affecting the robustness of data fits. Fixing their values to literature values can help estimate the remaining model parameters when pre-peak data is missing or limited. We find a lack of data in the pre-peak growth phase underestimates the time to peak viral load by several days, leading to a shorter predicted growth phase. On the other hand, knowing the time of infection (e.g., from epidemiological data) and fixing it results in good estimates of dynamical parameters even in the absence of early data. While we provide ways to approximate model parameters in the absence of early viral load data, our results also suggest that these data, when available, are needed to estimate model parameters more precisely.
对于包括艾滋病病毒、丙型肝炎病毒和乙型肝炎病毒在内的多种导致慢性感染的病原体,人们已经建立了病毒感染的数学模型,并将其与数据进行了拟合,从而对疾病的发病机理有了深入的了解。然而,对于引起急性感染的病原体或在引起慢性感染的病原体的急性期,病毒载量数据通常是在出现症状后收集的,通常是在病毒载量峰值前后。因此,我们经常缺乏病毒生长初期的数据,即无症状前传播事件发生时的数据。缺失的数据可能会给估算感染时间、感染期和病毒动态模型参数(如细胞感染率)带来困难。然而,如果有额外的信息,如病毒载量达到峰值的平均时间,则可以提高估算的稳健性。在此,我们评估了当病毒载量峰值之前的病毒载量数据缺失时,当我们知道某些参数的值和/或从感染到病毒载量峰值的时间时,关键模型参数估计的稳健性。虽然对感染时间的估计对可用数据的质量和数量很敏感,尤其是高峰前的数据,但对了解疾病发病机制很重要的其他参数(如感染细胞的丢失率)则不太敏感。病毒传染性和病毒产生率是影响数据拟合稳健性的关键参数。将它们的值固定为文献值有助于在峰前数据缺失或有限的情况下估计其余模型参数。我们发现,如果缺乏高峰前生长阶段的数据,病毒载量达到峰值的时间就会被低估数天,从而导致预测的生长阶段缩短。另一方面,即使在缺乏早期数据的情况下,知道感染时间(例如从流行病学数据中)并将其固定下来,也能获得良好的动态参数估计。虽然我们提供了在没有早期病毒载量数据的情况下近似估计模型参数的方法,但我们的结果也表明,如果有这些数据,就需要更精确地估计模型参数。
{"title":"How robust are estimates of key parameters in standard viral dynamic models?","authors":"Carolin Zitzmann, Ruian Ke, R. Ribeiro, A. Perelson","doi":"10.1371/journal.pcbi.1011437","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1011437","url":null,"abstract":"Mathematical models of viral infection have been developed, fitted to data, and provide insight into disease pathogenesis for multiple agents that cause chronic infection, including HIV, hepatitis C, and B virus. However, for agents that cause acute infections or during the acute stage of agents that cause chronic infections, viral load data are often collected after symptoms develop, usually around or after the peak viral load. Consequently, we frequently lack data in the initial phase of viral growth, i.e., when pre-symptomatic transmission events occur. Missing data may make estimating the time of infection, the infectious period, and parameters in viral dynamic models, such as the cell infection rate, difficult. However, having extra information, such as the average time to peak viral load, may improve the robustness of the estimation. Here, we evaluated the robustness of estimates of key model parameters when viral load data prior to the viral load peak is missing, when we know the values of some parameters and/or the time from infection to peak viral load. Although estimates of the time of infection are sensitive to the quality and amount of available data, particularly pre-peak, other parameters important in understanding disease pathogenesis, such as the loss rate of infected cells, are less sensitive. Viral infectivity and the viral production rate are key parameters affecting the robustness of data fits. Fixing their values to literature values can help estimate the remaining model parameters when pre-peak data is missing or limited. We find a lack of data in the pre-peak growth phase underestimates the time to peak viral load by several days, leading to a shorter predicted growth phase. On the other hand, knowing the time of infection (e.g., from epidemiological data) and fixing it results in good estimates of dynamical parameters even in the absence of early data. While we provide ways to approximate model parameters in the absence of early viral load data, our results also suggest that these data, when available, are needed to estimate model parameters more precisely.","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"55 1","pages":"e1011437"},"PeriodicalIF":4.3,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140695584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatio-temporal spread of artemisinin resistance in Southeast Asia. 青蒿素抗药性在东南亚的时空传播。
IF 4.3 2区 生物学 Pub Date : 2024-04-16 DOI: 10.1371/journal.pcbi.1012017
Jennifer A Flegg, Sevvandi Kandanaarachchi, P. Guerin, A. Dondorp, François H. Nosten, S. D. Otienoburu, Nick Golding
Current malaria elimination targets must withstand a colossal challenge-resistance to the current gold standard antimalarial drug, namely artemisinin derivatives. If artemisinin resistance significantly expands to Africa or India, cases and malaria-related deaths are set to increase substantially. Spatial information on the changing levels of artemisinin resistance in Southeast Asia is therefore critical for health organisations to prioritise malaria control measures, but available data on artemisinin resistance are sparse. We use a comprehensive database from the WorldWide Antimalarial Resistance Network on the prevalence of non-synonymous mutations in the Kelch 13 (K13) gene, which are known to be associated with artemisinin resistance, and a Bayesian geostatistical model to produce spatio-temporal predictions of artemisinin resistance. Our maps of estimated prevalence show an expansion of the K13 mutation across the Greater Mekong Subregion from 2000 to 2022. Moreover, the period between 2010 and 2015 demonstrated the most spatial change across the region. Our model and maps provide important insights into the spatial and temporal trends of artemisinin resistance in a way that is not possible using data alone, thereby enabling improved spatial decision support systems on an unprecedented fine-scale spatial resolution. By predicting for the first time spatio-temporal patterns and extents of artemisinin resistance at the subcontinent level, this study provides critical information for supporting malaria elimination goals in Southeast Asia.
目前的消除疟疾目标必须经受住巨大的挑战--目前的金标准抗疟药物(即青蒿素衍生物)的抗药性。如果青蒿素抗药性大幅扩展到非洲或印度,病例和与疟疾相关的死亡人数必将大幅增加。因此,有关东南亚地区青蒿素抗药性水平变化的空间信息对于卫生机构优先采取疟疾控制措施至关重要,但现有的青蒿素抗药性数据却很稀少。我们利用世界抗疟网络(WorldWide Antimalarial Resistance Network)关于Kelch 13(K13)基因非同义突变流行率的综合数据库,以及贝叶斯地理统计模型,对青蒿素抗药性进行了时空预测。我们的估计流行率地图显示,从2000年到2022年,K13突变在整个大湄公河次区域都在扩大。此外,2010 年至 2015 年期间整个地区的空间变化最大。我们的模型和地图为了解青蒿素抗药性的时空趋势提供了重要依据,而这一点仅靠数据是无法实现的,因此可以在前所未有的精细空间分辨率上改进空间决策支持系统。这项研究首次在次大陆一级预测了青蒿素抗药性的时空模式和范围,为支持东南亚实现消除疟疾的目标提供了重要信息。
{"title":"Spatio-temporal spread of artemisinin resistance in Southeast Asia.","authors":"Jennifer A Flegg, Sevvandi Kandanaarachchi, P. Guerin, A. Dondorp, François H. Nosten, S. D. Otienoburu, Nick Golding","doi":"10.1371/journal.pcbi.1012017","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1012017","url":null,"abstract":"Current malaria elimination targets must withstand a colossal challenge-resistance to the current gold standard antimalarial drug, namely artemisinin derivatives. If artemisinin resistance significantly expands to Africa or India, cases and malaria-related deaths are set to increase substantially. Spatial information on the changing levels of artemisinin resistance in Southeast Asia is therefore critical for health organisations to prioritise malaria control measures, but available data on artemisinin resistance are sparse. We use a comprehensive database from the WorldWide Antimalarial Resistance Network on the prevalence of non-synonymous mutations in the Kelch 13 (K13) gene, which are known to be associated with artemisinin resistance, and a Bayesian geostatistical model to produce spatio-temporal predictions of artemisinin resistance. Our maps of estimated prevalence show an expansion of the K13 mutation across the Greater Mekong Subregion from 2000 to 2022. Moreover, the period between 2010 and 2015 demonstrated the most spatial change across the region. Our model and maps provide important insights into the spatial and temporal trends of artemisinin resistance in a way that is not possible using data alone, thereby enabling improved spatial decision support systems on an unprecedented fine-scale spatial resolution. By predicting for the first time spatio-temporal patterns and extents of artemisinin resistance at the subcontinent level, this study provides critical information for supporting malaria elimination goals in Southeast Asia.","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"4 19","pages":"e1012017"},"PeriodicalIF":4.3,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140697982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Network-neuron interactions underlying sensory responses of layer 5 pyramidal tract neurons in barrel cortex. 桶状皮层第 5 层锥体束神经元感觉反应所依赖的网络-神经元相互作用
IF 4.3 2区 生物学 Pub Date : 2024-04-16 DOI: 10.1371/journal.pcbi.1011468
A. Bast, Rieke Fruengel, C. D. de Kock, M. Oberlaender
Neurons in the cerebral cortex receive thousands of synaptic inputs per second from thousands of presynaptic neurons. How the dendritic location of inputs, their timing, strength, and presynaptic origin, in conjunction with complex dendritic physiology, impact the transformation of synaptic input into action potential (AP) output remains generally unknown for in vivo conditions. Here, we introduce a computational approach to reveal which properties of the input causally underlie AP output, and how this neuronal input-output computation is influenced by the morphology and biophysical properties of the dendrites. We demonstrate that this approach allows dissecting of how different input populations drive in vivo observed APs. For this purpose, we focus on fast and broadly tuned responses that pyramidal tract neurons in layer 5 (L5PTs) of the rat barrel cortex elicit upon passive single whisker deflections. By reducing a multi-scale model that we reported previously, we show that three features are sufficient to predict with high accuracy the sensory responses and receptive fields of L5PTs under these specific in vivo conditions: the count of active excitatory versus inhibitory synapses preceding the response, their spatial distribution on the dendrites, and the AP history. Based on these three features, we derive an analytically tractable description of the input-output computation of L5PTs, which enabled us to dissect how synaptic input from thalamus and different cell types in barrel cortex contribute to these responses. We show that the input-output computation is preserved across L5PTs despite morphological and biophysical diversity of their dendrites. We found that trial-to-trial variability in L5PT responses, and cell-to-cell variability in their receptive fields, are sufficiently explained by variability in synaptic input from the network, whereas variability in biophysical and morphological properties have minor contributions. Our approach to derive analytically tractable models of input-output computations in L5PTs provides a roadmap to dissect network-neuron interactions underlying L5PT responses across different in vivo conditions and for other cell types.
大脑皮层的神经元每秒从数千个突触前神经元接收数千次突触输入。在活体条件下,输入的树突位置、时间、强度和突触前来源如何与复杂的树突生理学相结合,影响突触输入到动作电位(AP)输出的转化,目前仍是一个未知数。在这里,我们引入了一种计算方法来揭示哪些输入属性是 AP 输出的因果基础,以及神经元的输入输出计算如何受到树突形态和生物物理属性的影响。我们证明,这种方法可以剖析不同的输入群如何驱动体内观察到的 AP。为此,我们重点研究了大鼠桶状皮层第 5 层锥体束神经元(L5PTs)在被动单须偏转时引起的快速和广泛调谐响应。通过还原我们之前报告过的多尺度模型,我们发现在这些特定的体内条件下,有三个特征足以高精度地预测 L5PT 的感觉反应和感受野:反应前活跃的兴奋性与抑制性突触的数量、它们在树突上的空间分布以及 AP 历史。基于这三个特征,我们得出了 L5PT 输入-输出计算的可分析描述,这使我们能够剖析丘脑和桶状皮层不同类型细胞的突触输入是如何促成这些反应的。我们的研究表明,尽管 L5PTs 树突的形态和生物物理具有多样性,但它们的输入-输出计算在不同的 L5PTs 之间保持不变。我们发现,L5PT 反应的试验间变异及其感受野的细胞间变异可通过来自网络的突触输入的变异得到充分解释,而生物物理和形态特性的变异所起的作用较小。我们推导 L5PT 输入-输出计算的可分析模型的方法,为剖析不同体内条件和其他细胞类型的 L5PT 反应所依赖的网络-神经元相互作用提供了路线图。
{"title":"Network-neuron interactions underlying sensory responses of layer 5 pyramidal tract neurons in barrel cortex.","authors":"A. Bast, Rieke Fruengel, C. D. de Kock, M. Oberlaender","doi":"10.1371/journal.pcbi.1011468","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1011468","url":null,"abstract":"Neurons in the cerebral cortex receive thousands of synaptic inputs per second from thousands of presynaptic neurons. How the dendritic location of inputs, their timing, strength, and presynaptic origin, in conjunction with complex dendritic physiology, impact the transformation of synaptic input into action potential (AP) output remains generally unknown for in vivo conditions. Here, we introduce a computational approach to reveal which properties of the input causally underlie AP output, and how this neuronal input-output computation is influenced by the morphology and biophysical properties of the dendrites. We demonstrate that this approach allows dissecting of how different input populations drive in vivo observed APs. For this purpose, we focus on fast and broadly tuned responses that pyramidal tract neurons in layer 5 (L5PTs) of the rat barrel cortex elicit upon passive single whisker deflections. By reducing a multi-scale model that we reported previously, we show that three features are sufficient to predict with high accuracy the sensory responses and receptive fields of L5PTs under these specific in vivo conditions: the count of active excitatory versus inhibitory synapses preceding the response, their spatial distribution on the dendrites, and the AP history. Based on these three features, we derive an analytically tractable description of the input-output computation of L5PTs, which enabled us to dissect how synaptic input from thalamus and different cell types in barrel cortex contribute to these responses. We show that the input-output computation is preserved across L5PTs despite morphological and biophysical diversity of their dendrites. We found that trial-to-trial variability in L5PT responses, and cell-to-cell variability in their receptive fields, are sufficiently explained by variability in synaptic input from the network, whereas variability in biophysical and morphological properties have minor contributions. Our approach to derive analytically tractable models of input-output computations in L5PTs provides a roadmap to dissect network-neuron interactions underlying L5PT responses across different in vivo conditions and for other cell types.","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"7 13","pages":"e1011468"},"PeriodicalIF":4.3,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140696221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SubGE-DDI: A new prediction model for drug-drug interaction established through biomedical texts and drug-pairs knowledge subgraph enhancement. SubGE-DDI:通过生物医学文本和药物对知识子图增强建立的药物相互作用新预测模型。
IF 4.3 2区 生物学 Pub Date : 2024-04-16 DOI: 10.1371/journal.pcbi.1011989
Yiyang Shi, Mingxiu He, Junheng Chen, Fangfang Han, Yongming Cai
Biomedical texts provide important data for investigating drug-drug interactions (DDIs) in the field of pharmacovigilance. Although researchers have attempted to investigate DDIs from biomedical texts and predict unknown DDIs, the lack of accurate manual annotations significantly hinders the performance of machine learning algorithms. In this study, a new DDI prediction framework, Subgraph Enhance model, was developed for DDI (SubGE-DDI) to improve the performance of machine learning algorithms. This model uses drug pairs knowledge subgraph information to achieve large-scale plain text prediction without many annotations. This model treats DDI prediction as a multi-class classification problem and predicts the specific DDI type for each drug pair (e.g. Mechanism, Effect, Advise, Interact and Negative). The drug pairs knowledge subgraph was derived from a huge drug knowledge graph containing various public datasets, such as DrugBank, TwoSIDES, OffSIDES, DrugCentral, EntrezeGene, SMPDB (The Small Molecule Pathway Database), CTD (The Comparative Toxicogenomics Database) and SIDER. The SubGE-DDI was evaluated from the public dataset (SemEval-2013 Task 9 dataset) and then compared with other state-of-the-art baselines. SubGE-DDI achieves 83.91% micro F1 score and 84.75% macro F1 score in the test dataset, outperforming the other state-of-the-art baselines. These findings show that the proposed drug pairs knowledge subgraph-assisted model can effectively improve the prediction performance of DDIs from biomedical texts.
生物医学文本为研究药物警戒领域的药物相互作用(DDI)提供了重要数据。尽管研究人员已经尝试从生物医学文本中研究 DDIs 并预测未知的 DDIs,但缺乏准确的人工标注极大地阻碍了机器学习算法的性能。本研究开发了一种新的 DDI 预测框架--DDI 子图增强模型(Subgraph Enhance model,简称 SubGE-DDI),以提高机器学习算法的性能。该模型利用药物配对知识子图信息,无需大量注释即可实现大规模纯文本预测。该模型将 DDI 预测视为多类分类问题,并预测每对药物的具体 DDI 类型(如机制、影响、建议、相互作用和阴性)。药物配对知识子图来自一个包含各种公共数据集的庞大药物知识图谱,如 DrugBank、TwoSIDES、OffSIDES、DrugCentral、EntrezeGene、SMPDB(小分子途径数据库)、CTD(比较毒物基因组学数据库)和 SIDER。通过公共数据集(SemEval-2013 Task 9 数据集)对 SubGE-DDI 进行了评估,然后与其他最先进的基线进行了比较。在测试数据集中,SubGE-DDI 获得了 83.91% 的微观 F1 分数和 84.75% 的宏观 F1 分数,表现优于其他先进基线。这些结果表明,所提出的药物配对知识子图辅助模型可以有效提高生物医学文本中 DDI 的预测性能。
{"title":"SubGE-DDI: A new prediction model for drug-drug interaction established through biomedical texts and drug-pairs knowledge subgraph enhancement.","authors":"Yiyang Shi, Mingxiu He, Junheng Chen, Fangfang Han, Yongming Cai","doi":"10.1371/journal.pcbi.1011989","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1011989","url":null,"abstract":"Biomedical texts provide important data for investigating drug-drug interactions (DDIs) in the field of pharmacovigilance. Although researchers have attempted to investigate DDIs from biomedical texts and predict unknown DDIs, the lack of accurate manual annotations significantly hinders the performance of machine learning algorithms. In this study, a new DDI prediction framework, Subgraph Enhance model, was developed for DDI (SubGE-DDI) to improve the performance of machine learning algorithms. This model uses drug pairs knowledge subgraph information to achieve large-scale plain text prediction without many annotations. This model treats DDI prediction as a multi-class classification problem and predicts the specific DDI type for each drug pair (e.g. Mechanism, Effect, Advise, Interact and Negative). The drug pairs knowledge subgraph was derived from a huge drug knowledge graph containing various public datasets, such as DrugBank, TwoSIDES, OffSIDES, DrugCentral, EntrezeGene, SMPDB (The Small Molecule Pathway Database), CTD (The Comparative Toxicogenomics Database) and SIDER. The SubGE-DDI was evaluated from the public dataset (SemEval-2013 Task 9 dataset) and then compared with other state-of-the-art baselines. SubGE-DDI achieves 83.91% micro F1 score and 84.75% macro F1 score in the test dataset, outperforming the other state-of-the-art baselines. These findings show that the proposed drug pairs knowledge subgraph-assisted model can effectively improve the prediction performance of DDIs from biomedical texts.","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"349 13","pages":"e1011989"},"PeriodicalIF":4.3,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140698313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mesoscale simulations predict the role of synergistic cerebellar plasticity during classical eyeblink conditioning. 中尺度模拟预测了小脑协同可塑性在经典眼动条件反射过程中的作用。
IF 4.3 2区 生物学 Pub Date : 2024-04-04 DOI: 10.1371/journal.pcbi.1011277
A. Geminiani, C. Casellato, H. Boele, A. Pedrocchi, C. D. De Zeeuw, E. D'Angelo
According to the motor learning theory by Albus and Ito, synaptic depression at the parallel fibre to Purkinje cells synapse (pf-PC) is the main substrate responsible for learning sensorimotor contingencies under climbing fibre control. However, recent experimental evidence challenges this relatively monopolistic view of cerebellar learning. Bidirectional plasticity appears crucial for learning, in which different microzones can undergo opposite changes of synaptic strength (e.g. downbound microzones-more likely depression, upbound microzones-more likely potentiation), and multiple forms of plasticity have been identified, distributed over different cerebellar circuit synapses. Here, we have simulated classical eyeblink conditioning (CEBC) using an advanced spiking cerebellar model embedding downbound and upbound modules that are subject to multiple plasticity rules. Simulations indicate that synaptic plasticity regulates the cascade of precise spiking patterns spreading throughout the cerebellar cortex and cerebellar nuclei. CEBC was supported by plasticity at the pf-PC synapses as well as at the synapses of the molecular layer interneurons (MLIs), but only the combined switch-off of both sites of plasticity compromised learning significantly. By differentially engaging climbing fibre information and related forms of synaptic plasticity, both microzones contributed to generate a well-timed conditioned response, but it was the downbound module that played the major role in this process. The outcomes of our simulations closely align with the behavioural and electrophysiological phenotypes of mutant mice suffering from cell-specific mutations that affect processing of their PC and/or MLI synapses. Our data highlight that a synergy of bidirectional plasticity rules distributed across the cerebellum can facilitate finetuning of adaptive associative behaviours at a high spatiotemporal resolution.
根据阿尔伯斯和伊藤的运动学习理论,平行纤维与浦肯野细胞突触(pf-PC)上的突触抑制是在爬行纤维控制下学习感觉运动或然性的主要基质。然而,最近的实验证据对小脑学习的这种相对垄断性观点提出了挑战。双向可塑性似乎对学习至关重要,其中不同微区的突触强度会发生相反的变化(例如,下行微区更可能是抑制,上行微区更可能是增效),而且已发现多种形式的可塑性分布在不同的小脑回路突触上。在这里,我们使用先进的尖峰小脑模型模拟了经典眼动调节(CEBC),该模型嵌入了受多种可塑性规则影响的下行和上行模块。模拟结果表明,突触可塑性调节了遍布整个小脑皮层和小脑核的一连串精确尖峰模式。CEBC得到了pf-PC突触和分子层中间神经元(MLIs)突触可塑性的支持,但只有这两个部位的可塑性联合关闭才会显著影响学习。通过不同程度地利用爬行纤维信息和相关形式的突触可塑性,两个微区都有助于产生适时的条件反应,但在这一过程中起主要作用的是下行模块。我们的模拟结果与细胞特异性突变影响 PC 和/或 MLI 突触处理的突变小鼠的行为和电生理学表型密切吻合。我们的数据突出表明,分布在整个小脑的双向可塑性规则的协同作用可以促进在高时空分辨率下对适应性联想行为进行微调。
{"title":"Mesoscale simulations predict the role of synergistic cerebellar plasticity during classical eyeblink conditioning.","authors":"A. Geminiani, C. Casellato, H. Boele, A. Pedrocchi, C. D. De Zeeuw, E. D'Angelo","doi":"10.1371/journal.pcbi.1011277","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1011277","url":null,"abstract":"According to the motor learning theory by Albus and Ito, synaptic depression at the parallel fibre to Purkinje cells synapse (pf-PC) is the main substrate responsible for learning sensorimotor contingencies under climbing fibre control. However, recent experimental evidence challenges this relatively monopolistic view of cerebellar learning. Bidirectional plasticity appears crucial for learning, in which different microzones can undergo opposite changes of synaptic strength (e.g. downbound microzones-more likely depression, upbound microzones-more likely potentiation), and multiple forms of plasticity have been identified, distributed over different cerebellar circuit synapses. Here, we have simulated classical eyeblink conditioning (CEBC) using an advanced spiking cerebellar model embedding downbound and upbound modules that are subject to multiple plasticity rules. Simulations indicate that synaptic plasticity regulates the cascade of precise spiking patterns spreading throughout the cerebellar cortex and cerebellar nuclei. CEBC was supported by plasticity at the pf-PC synapses as well as at the synapses of the molecular layer interneurons (MLIs), but only the combined switch-off of both sites of plasticity compromised learning significantly. By differentially engaging climbing fibre information and related forms of synaptic plasticity, both microzones contributed to generate a well-timed conditioned response, but it was the downbound module that played the major role in this process. The outcomes of our simulations closely align with the behavioural and electrophysiological phenotypes of mutant mice suffering from cell-specific mutations that affect processing of their PC and/or MLI synapses. Our data highlight that a synergy of bidirectional plasticity rules distributed across the cerebellum can facilitate finetuning of adaptive associative behaviours at a high spatiotemporal resolution.","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"77 5","pages":"e1011277"},"PeriodicalIF":4.3,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140741514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Method for cycle detection in sparse, irregularly sampled, long-term neuro-behavioral timeseries: Basis pursuit denoising with polynomial detrending of long-term, inter-ictal epileptiform activity 在稀疏、不规则采样的长期神经行为时间序列中检测周期的方法:对长期发作间癫痫样活动进行多项式去趋势的基线追踪去噪方法
IF 4.3 2区 生物学 Pub Date : 2024-04-01 DOI: 10.1371/journal.pcbi.1011152
I. Balzekas, Joshua Trzasko, Grace Yu, Thomas J. Richner, F. Mivalt, V. Sladky, N. Gregg, Jamie Van Gompel, Kai Miller, Paul E Croarkin, V. Kremen, Greg Worrell
Numerous physiological processes are cyclical, but sampling these processes densely enough to perform frequency decomposition and subsequent analyses can be challenging. Mathematical approaches for decomposition and reconstruction of sparsely and irregularly sampled signals are well established but have been under-utilized in physiological applications. We developed a basis pursuit denoising with polynomial detrending (BPWP) model that recovers oscillations and trends from sparse and irregularly sampled timeseries. We validated this model on a unique dataset of long-term inter-ictal epileptiform discharge (IED) rates from human hippocampus recorded with a novel investigational device with continuous local field potential sensing. IED rates have well established circadian and multiday cycles related to sleep, wakefulness, and seizure clusters. Given sparse and irregular samples of IED rates from multi-month intracranial EEG recordings from ambulatory humans, we used BPWP to compute narrowband spectral power and polynomial trend coefficients and identify IED rate cycles in three subjects. In select cases, we propose that random and irregular sampling may be leveraged for frequency decomposition of physiological signals. Trial Registration: NCT03946618.
许多生理过程都是周期性的,但要对这些过程进行足够密集的采样,以进行频率分解和后续分析,却很有难度。对稀疏和不规则采样信号进行分解和重建的数学方法已经成熟,但在生理应用中却未得到充分利用。我们开发了基于多项式去趋势的基序去噪(BPWP)模型,可以从稀疏和不规则采样的时间序列中恢复振荡和趋势。我们在一个独特的数据集上验证了这一模型,该数据集是用一种新型研究设备记录的人类海马长期发作间癫痫样放电(IED)率,该设备具有连续的局部场电位传感功能。痫样放电率具有与睡眠、觉醒和癫痫发作群相关的成熟的昼夜和多日周期。鉴于从伏卧者的多月颅内脑电图记录中获得的 IED 率样本稀少且不规则,我们使用 BPWP 计算了窄带频谱功率和多项式趋势系数,并识别了三个受试者的 IED 率周期。在特定情况下,我们建议利用随机和不规则采样对生理信号进行频率分解。试验注册:NCT03946618.
{"title":"Method for cycle detection in sparse, irregularly sampled, long-term neuro-behavioral timeseries: Basis pursuit denoising with polynomial detrending of long-term, inter-ictal epileptiform activity","authors":"I. Balzekas, Joshua Trzasko, Grace Yu, Thomas J. Richner, F. Mivalt, V. Sladky, N. Gregg, Jamie Van Gompel, Kai Miller, Paul E Croarkin, V. Kremen, Greg Worrell","doi":"10.1371/journal.pcbi.1011152","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1011152","url":null,"abstract":"Numerous physiological processes are cyclical, but sampling these processes densely enough to perform frequency decomposition and subsequent analyses can be challenging. Mathematical approaches for decomposition and reconstruction of sparsely and irregularly sampled signals are well established but have been under-utilized in physiological applications. We developed a basis pursuit denoising with polynomial detrending (BPWP) model that recovers oscillations and trends from sparse and irregularly sampled timeseries. We validated this model on a unique dataset of long-term inter-ictal epileptiform discharge (IED) rates from human hippocampus recorded with a novel investigational device with continuous local field potential sensing. IED rates have well established circadian and multiday cycles related to sleep, wakefulness, and seizure clusters. Given sparse and irregular samples of IED rates from multi-month intracranial EEG recordings from ambulatory humans, we used BPWP to compute narrowband spectral power and polynomial trend coefficients and identify IED rate cycles in three subjects. In select cases, we propose that random and irregular sampling may be leveraged for frequency decomposition of physiological signals. Trial Registration: NCT03946618.","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"98 ","pages":""},"PeriodicalIF":4.3,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140760949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Locations and structures of influenza A virus packaging-associated signals and other functional elements via an in silico pipeline for predicting constrained features in RNA viruses 通过用于预测 RNA 病毒受限特征的硅学管道确定甲型流感病毒包装相关信号和其他功能元素的位置和结构
IF 4.3 2区 生物学 Pub Date : 2024-04-01 DOI: 10.1371/journal.pcbi.1012009
Emma Beniston, J. Skittrall
Influenza A virus contains regions of its segmented genome associated with ability to package the segments into virions, but many such regions are poorly characterised. We provide detailed predictions of the key locations within these packaging-associated regions, and their structures, by applying a recently-improved pipeline for delineating constrained regions in RNA viruses and applying structural prediction algorithms. We find and characterise other known constrained regions within influenza A genomes, including the region associated with the PA-X frameshift, regions associated with alternative splicing, and constraint around the initiation motif for a truncated PB1 protein, PB1-N92, associated with avian viruses. We further predict the presence of constrained regions that have not previously been described. The extra characterisation our work provides allows investigation of these key regions for drug target potential, and points towards determinants of packaging compatibility between segments.
甲型流感病毒的分段基因组中有一些区域与将分段包装成病毒的能力有关,但许多此类区域的特征还不清楚。我们采用最近改进的 RNA 病毒受限区域划分方法,并应用结构预测算法,对这些包装相关区域的关键位置及其结构进行了详细预测。我们发现并描述了甲型流感基因组中的其他已知受限区域,包括与 PA-X 框变相关的区域、与替代剪接相关的区域,以及与禽类病毒相关的截短 PB1 蛋白 PB1-N92 启动基序周围的受限区域。我们还进一步预测了以前未曾描述过的受限区域的存在。我们的工作提供了额外的特征,使我们能够研究这些关键区域的药物靶点潜力,并指出片段之间包装兼容性的决定因素。
{"title":"Locations and structures of influenza A virus packaging-associated signals and other functional elements via an in silico pipeline for predicting constrained features in RNA viruses","authors":"Emma Beniston, J. Skittrall","doi":"10.1371/journal.pcbi.1012009","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1012009","url":null,"abstract":"Influenza A virus contains regions of its segmented genome associated with ability to package the segments into virions, but many such regions are poorly characterised. We provide detailed predictions of the key locations within these packaging-associated regions, and their structures, by applying a recently-improved pipeline for delineating constrained regions in RNA viruses and applying structural prediction algorithms. We find and characterise other known constrained regions within influenza A genomes, including the region associated with the PA-X frameshift, regions associated with alternative splicing, and constraint around the initiation motif for a truncated PB1 protein, PB1-N92, associated with avian viruses. We further predict the presence of constrained regions that have not previously been described. The extra characterisation our work provides allows investigation of these key regions for drug target potential, and points towards determinants of packaging compatibility between segments.","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"284 ","pages":""},"PeriodicalIF":4.3,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140782977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Benchmarking multi-ancestry prostate cancer polygenic risk scores in a real-world cohort 真实世界队列中多宗族前列腺癌多基因风险评分的基准分析
IF 4.3 2区 生物学 Pub Date : 2024-04-01 DOI: 10.1371/journal.pcbi.1011990
Yajas Shah, S. Kulm, J. Nauseef, Zhengming Chen, Olivier Elemento, K. Kensler, Ravi N Sharaf
Prostate cancer is a heritable disease with ancestry-biased incidence and mortality. Polygenic risk scores (PRSs) offer promising advancements in predicting disease risk, including prostate cancer. While their accuracy continues to improve, research aimed at enhancing their effectiveness within African and Asian populations remains key for equitable use. Recent algorithmic developments for PRS derivation have resulted in improved pan-ancestral risk prediction for several diseases. In this study, we benchmark the predictive power of six widely used PRS derivation algorithms, including four of which adjust for ancestry, against prostate cancer cases and controls from the UK Biobank and All of Us cohorts. We find modest improvement in discriminatory ability when compared with a simple method that prioritizes variants, clumping, and published polygenic risk scores. Our findings underscore the importance of improving upon risk prediction algorithms and the sampling of diverse cohorts.
前列腺癌是一种遗传性疾病,其发病率和死亡率与祖先有关。多基因风险评分(PRSs)在预测包括前列腺癌在内的疾病风险方面取得了令人鼓舞的进展。虽然多基因风险评分的准确性在不断提高,但如何提高其在非洲和亚洲人群中的有效性仍是公平使用的关键。最近,PRS推导算法的发展改善了几种疾病的泛宗族风险预测。在本研究中,我们以英国生物库和 "我们所有人 "队列中的前列腺癌病例和对照组为对象,对六种广泛使用的 PRS 推算算法的预测能力进行了基准测试,其中四种算法对血统进行了调整。我们发现,与优先考虑变异、结块和已公布的多基因风险评分的简单方法相比,我们的判别能力略有提高。我们的发现强调了改进风险预测算法和对不同队列采样的重要性。
{"title":"Benchmarking multi-ancestry prostate cancer polygenic risk scores in a real-world cohort","authors":"Yajas Shah, S. Kulm, J. Nauseef, Zhengming Chen, Olivier Elemento, K. Kensler, Ravi N Sharaf","doi":"10.1371/journal.pcbi.1011990","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1011990","url":null,"abstract":"Prostate cancer is a heritable disease with ancestry-biased incidence and mortality. Polygenic risk scores (PRSs) offer promising advancements in predicting disease risk, including prostate cancer. While their accuracy continues to improve, research aimed at enhancing their effectiveness within African and Asian populations remains key for equitable use. Recent algorithmic developments for PRS derivation have resulted in improved pan-ancestral risk prediction for several diseases. In this study, we benchmark the predictive power of six widely used PRS derivation algorithms, including four of which adjust for ancestry, against prostate cancer cases and controls from the UK Biobank and All of Us cohorts. We find modest improvement in discriminatory ability when compared with a simple method that prioritizes variants, clumping, and published polygenic risk scores. Our findings underscore the importance of improving upon risk prediction algorithms and the sampling of diverse cohorts.","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"40 11","pages":""},"PeriodicalIF":4.3,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140797449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HGCLAMIR: Hypergraph contrastive learning with attention mechanism and integrated multi-view representation for predicting miRNA-disease associations HGCLAMIR:具有注意力机制和综合多视图表示的超图对比学习,用于预测 miRNA 与疾病的联系
IF 4.3 2区 生物学 Pub Date : 2024-04-01 DOI: 10.1371/journal.pcbi.1011927
Ouyang Dong, Yong Liang, Jinfeng Wang, Le Li, Ning Ai, Junning Feng, Shan Lu, Shuilin liao, Xiaoying Liu, Shengli Xie
Existing studies have shown that the abnormal expression of microRNAs (miRNAs) usually leads to the occurrence and development of human diseases. Identifying disease-related miRNAs contributes to studying the pathogenesis of diseases at the molecular level. As traditional biological experiments are time-consuming and expensive, computational methods have been used as an effective complement to infer the potential associations between miRNAs and diseases. However, most of the existing computational methods still face three main challenges: (i) learning of high-order relations; (ii) insufficient representation learning ability; (iii) importance learning and integration of multi-view embedding representation. To this end, we developed a HyperGraph Contrastive Learning with view-aware Attention Mechanism and Integrated multi-view Representation (HGCLAMIR) model to discover potential miRNA-disease associations. First, hypergraph convolutional network (HGCN) was utilized to capture high-order complex relations from hypergraphs related to miRNAs and diseases. Then, we combined HGCN with contrastive learning to improve and enhance the embedded representation learning ability of HGCN. Moreover, we introduced view-aware attention mechanism to adaptively weight the embedded representations of different views, thereby obtaining the importance of multi-view latent representations. Next, we innovatively proposed integrated representation learning to integrate the embedded representation information of multiple views for obtaining more reasonable embedding information. Finally, the integrated representation information was fed into a neural network-based matrix completion method to perform miRNA-disease association prediction. Experimental results on the cross-validation set and independent test set indicated that HGCLAMIR can achieve better prediction performance than other baseline models. Furthermore, the results of case studies and enrichment analysis further demonstrated the accuracy of HGCLAMIR and unconfirmed potential associations had biological significance.
现有研究表明,微RNA(miRNA)的异常表达通常会导致人类疾病的发生和发展。鉴定与疾病相关的 miRNAs 有助于从分子水平研究疾病的发病机理。由于传统的生物学实验耗时且昂贵,计算方法已被用作推断 miRNA 与疾病之间潜在关联的有效补充。然而,现有的大多数计算方法仍面临三大挑战:(1)高阶关系学习;(2)表征学习能力不足;(3)多视图嵌入表征的重要性学习与整合。为此,我们开发了一种具有视图感知注意机制和集成多视图表示的超图对比学习(HGCLAMIR)模型来发现潜在的 miRNA 与疾病的关联。首先,我们利用超图卷积网络(HGCN)从与 miRNA 和疾病相关的超图中捕捉高阶复杂关系。然后,我们将超图卷积网络与对比学习相结合,改进并提高了超图卷积网络的嵌入式表征学习能力。此外,我们还引入了视图感知注意机制,对不同视图的嵌入表征进行自适应加权,从而获得多视图潜在表征的重要性。接着,我们创新性地提出了整合表征学习,将多个视图的嵌入表征信息进行整合,从而获得更合理的嵌入信息。最后,将整合后的表征信息输入基于神经网络的矩阵补全方法,进行 miRNA 与疾病的关联预测。交叉验证集和独立测试集的实验结果表明,HGCLAMIR 比其他基线模型具有更好的预测性能。此外,案例研究和富集分析的结果也进一步证明了 HGCLAMIR 的准确性,未证实的潜在关联具有生物学意义。
{"title":"HGCLAMIR: Hypergraph contrastive learning with attention mechanism and integrated multi-view representation for predicting miRNA-disease associations","authors":"Ouyang Dong, Yong Liang, Jinfeng Wang, Le Li, Ning Ai, Junning Feng, Shan Lu, Shuilin liao, Xiaoying Liu, Shengli Xie","doi":"10.1371/journal.pcbi.1011927","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1011927","url":null,"abstract":"Existing studies have shown that the abnormal expression of microRNAs (miRNAs) usually leads to the occurrence and development of human diseases. Identifying disease-related miRNAs contributes to studying the pathogenesis of diseases at the molecular level. As traditional biological experiments are time-consuming and expensive, computational methods have been used as an effective complement to infer the potential associations between miRNAs and diseases. However, most of the existing computational methods still face three main challenges: (i) learning of high-order relations; (ii) insufficient representation learning ability; (iii) importance learning and integration of multi-view embedding representation. To this end, we developed a HyperGraph Contrastive Learning with view-aware Attention Mechanism and Integrated multi-view Representation (HGCLAMIR) model to discover potential miRNA-disease associations. First, hypergraph convolutional network (HGCN) was utilized to capture high-order complex relations from hypergraphs related to miRNAs and diseases. Then, we combined HGCN with contrastive learning to improve and enhance the embedded representation learning ability of HGCN. Moreover, we introduced view-aware attention mechanism to adaptively weight the embedded representations of different views, thereby obtaining the importance of multi-view latent representations. Next, we innovatively proposed integrated representation learning to integrate the embedded representation information of multiple views for obtaining more reasonable embedding information. Finally, the integrated representation information was fed into a neural network-based matrix completion method to perform miRNA-disease association prediction. Experimental results on the cross-validation set and independent test set indicated that HGCLAMIR can achieve better prediction performance than other baseline models. Furthermore, the results of case studies and enrichment analysis further demonstrated the accuracy of HGCLAMIR and unconfirmed potential associations had biological significance.","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"5 9","pages":""},"PeriodicalIF":4.3,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140785277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Partial label learning for automated classification of single-cell transcriptomic profiles 部分标签学习用于单细胞转录组图谱的自动分类
IF 4.3 2区 生物学 Pub Date : 2024-04-01 DOI: 10.1371/journal.pcbi.1012006
Malek Senoussi, Thierry Artières, Paul Villoutreix
Single-cell RNA sequencing (scRNASeq) data plays a major role in advancing our understanding of developmental biology. An important current question is how to classify transcriptomic profiles obtained from scRNASeq experiments into the various cell types and identify the lineage relationship for individual cells. Because of the fast accumulation of datasets and the high dimensionality of the data, it has become challenging to explore and annotate single-cell transcriptomic profiles by hand. To overcome this challenge, automated classification methods are needed. Classical approaches rely on supervised training datasets. However, due to the difficulty of obtaining data annotated at single-cell resolution, we propose instead to take advantage of partial annotations. The partial label learning framework assumes that we can obtain a set of candidate labels containing the correct one for each data point, a simpler setting than requiring a fully supervised training dataset. We study and extend when needed state-of-the-art multi-class classification methods, such as SVM, kNN, prototype-based, logistic regression and ensemble methods, to the partial label learning framework. Moreover, we study the effect of incorporating the structure of the label set into the methods. We focus particularly on the hierarchical structure of the labels, as commonly observed in developmental processes. We show, on simulated and real datasets, that these extensions enable to learn from partially labeled data, and perform predictions with high accuracy, particularly with a nonlinear prototype-based method. We demonstrate that the performances of our methods trained with partially annotated data reach the same performance as fully supervised data. Finally, we study the level of uncertainty present in the partially annotated data, and derive some prescriptive results on the effect of this uncertainty on the accuracy of the partial label learning methods. Overall our findings show how hierarchical and non-hierarchical partial label learning strategies can help solve the problem of automated classification of single-cell transcriptomic profiles, interestingly these methods rely on a much less stringent type of annotated datasets compared to fully supervised learning methods.
单细胞 RNA 测序(scRNASeq)数据在促进我们对发育生物学的理解方面发挥着重要作用。当前的一个重要问题是如何将从 scRNASeq 实验中获得的转录组图谱分类到各种细胞类型中,并确定单个细胞的系谱关系。由于数据集的快速积累和数据的高维度,手工探索和注释单细胞转录组图谱已成为一项挑战。为了克服这一挑战,需要采用自动分类方法。经典方法依赖于有监督的训练数据集。然而,由于难以获得单细胞分辨率的注释数据,我们建议利用部分注释。部分标注学习框架假定我们可以获得一组候选标签,其中包含每个数据点的正确标签,这比需要完全监督的训练数据集更简单。我们研究了最先进的多类分类方法,如 SVM、kNN、基于原型、逻辑回归和集合方法,并在必要时将其扩展到部分标签学习框架。此外,我们还研究了将标签集结构纳入方法的效果。我们特别关注标签的分层结构,这在发展过程中很常见。我们在模拟和真实数据集上表明,这些扩展能够从部分标签数据中学习,并进行高精度预测,尤其是基于非线性原型的方法。我们证明,使用部分标注数据训练的方法的性能与完全监督数据的性能相同。最后,我们研究了部分标注数据中存在的不确定性水平,并得出了这种不确定性对部分标注学习方法准确性影响的一些规范性结果。总之,我们的研究结果表明了分层和非分层部分标签学习策略如何帮助解决单细胞转录组图谱的自动分类问题,有趣的是,与完全监督学习方法相比,这些方法所依赖的注释数据集的类型要宽松得多。
{"title":"Partial label learning for automated classification of single-cell transcriptomic profiles","authors":"Malek Senoussi, Thierry Artières, Paul Villoutreix","doi":"10.1371/journal.pcbi.1012006","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1012006","url":null,"abstract":"Single-cell RNA sequencing (scRNASeq) data plays a major role in advancing our understanding of developmental biology. An important current question is how to classify transcriptomic profiles obtained from scRNASeq experiments into the various cell types and identify the lineage relationship for individual cells. Because of the fast accumulation of datasets and the high dimensionality of the data, it has become challenging to explore and annotate single-cell transcriptomic profiles by hand. To overcome this challenge, automated classification methods are needed. Classical approaches rely on supervised training datasets. However, due to the difficulty of obtaining data annotated at single-cell resolution, we propose instead to take advantage of partial annotations. The partial label learning framework assumes that we can obtain a set of candidate labels containing the correct one for each data point, a simpler setting than requiring a fully supervised training dataset. We study and extend when needed state-of-the-art multi-class classification methods, such as SVM, kNN, prototype-based, logistic regression and ensemble methods, to the partial label learning framework. Moreover, we study the effect of incorporating the structure of the label set into the methods. We focus particularly on the hierarchical structure of the labels, as commonly observed in developmental processes. We show, on simulated and real datasets, that these extensions enable to learn from partially labeled data, and perform predictions with high accuracy, particularly with a nonlinear prototype-based method. We demonstrate that the performances of our methods trained with partially annotated data reach the same performance as fully supervised data. Finally, we study the level of uncertainty present in the partially annotated data, and derive some prescriptive results on the effect of this uncertainty on the accuracy of the partial label learning methods. Overall our findings show how hierarchical and non-hierarchical partial label learning strategies can help solve the problem of automated classification of single-cell transcriptomic profiles, interestingly these methods rely on a much less stringent type of annotated datasets compared to fully supervised learning methods.","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"796 ","pages":""},"PeriodicalIF":4.3,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140784128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
PLoS Computational Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1