首页 > 最新文献

PLoS Computational Biology最新文献

英文 中文
Maximizing multi-reaction dependencies provides more accurate and precise predictions of intracellular fluxes than the principle of parsimony. 与简约原理相比,最大化多反应依赖性提供了对细胞内流量的更准确和精确的预测。
IF 4.3 2区 生物学 Pub Date : 2023-09-18 eCollection Date: 2023-09-01 DOI: 10.1371/journal.pcbi.1011489
Seirana Hashemi, Zahra Razaghi-Moghadam, Zoran Nikoloski

Intracellular fluxes represent a joint outcome of cellular transcription and translation and reflect the availability and usage of nutrients from the environment. While approaches from the constraint-based metabolic framework can accurately predict cellular phenotypes, such as growth and exchange rates with the environment, accurate prediction of intracellular fluxes remains a pressing problem. Parsimonious flux balance analysis (pFBA) has become an approach of choice to predict intracellular fluxes by employing the principle of efficient usage of protein resources. Nevertheless, comparative analyses of intracellular flux predictions from pFBA against fluxes estimated from labeling experiments remain scarce. Here, we posited that steady-state flux distributions derived from the principle of maximizing multi-reaction dependencies are of improved accuracy and precision than those resulting from pFBA. To this end, we designed a constraint-based approach, termed complex-balanced FBA (cbFBA), to predict steady-state flux distributions that support the given specific growth rate and exchange fluxes. We showed that the steady-state flux distributions resulting from cbFBA in comparison to pFBA show better agreement with experimentally measured fluxes from 17 Escherichia coli strains and are more precise, due to the smaller space of alternative solutions. We also showed that the same principle holds in eukaryotes by comparing the predictions of pFBA and cbFBA against experimentally derived steady-state flux distributions from 26 knock-out mutants of Saccharomyces cerevisiae. Furthermore, our results showed that intracellular fluxes predicted by cbFBA provide better support for the principle of minimizing metabolic adjustment between mutants and wild types. Together, our findings point that other principles that consider the dynamics and coordination of steady states may govern the distribution of intracellular fluxes.

细胞内通量代表了细胞转录和翻译的共同结果,反映了环境中营养物质的可用性和使用情况。虽然基于约束的代谢框架的方法可以准确预测细胞表型,如生长和与环境的交换率,但准确预测细胞内流量仍然是一个紧迫的问题。简洁通量平衡分析(pFBA)已成为利用蛋白质资源有效利用原理预测细胞内通量的一种选择方法。然而,pFBA的细胞内流量预测与标记实验估计的流量的比较分析仍然很少。在这里,我们假设,根据多反应依赖性最大化原理推导的稳态通量分布比pFBA推导的稳态流量分布具有更高的准确性和精度。为此,我们设计了一种基于约束的方法,称为复杂平衡FBA(cbFBA),以预测支持给定特定增长率和交换通量的稳态通量分布。我们发现,与pFBA相比,cbFBA产生的稳态通量分布与17个大肠杆菌菌株的实验测量通量显示出更好的一致性,并且由于替代溶液的空间较小,因此更精确。我们还通过将pFBA和cbFBA的预测与来自酿酒酵母26个敲除突变体的实验推导的稳态通量分布进行比较,表明在真核生物中也适用相同的原理。此外,我们的结果表明,cbFBA预测的细胞内通量为突变体和野生型之间的代谢调节最小化的原理提供了更好的支持。总之,我们的发现指出,考虑稳态的动力学和协调的其他原理可能控制细胞内通量的分布。
{"title":"Maximizing multi-reaction dependencies provides more accurate and precise predictions of intracellular fluxes than the principle of parsimony.","authors":"Seirana Hashemi, Zahra Razaghi-Moghadam, Zoran Nikoloski","doi":"10.1371/journal.pcbi.1011489","DOIUrl":"10.1371/journal.pcbi.1011489","url":null,"abstract":"<p><p>Intracellular fluxes represent a joint outcome of cellular transcription and translation and reflect the availability and usage of nutrients from the environment. While approaches from the constraint-based metabolic framework can accurately predict cellular phenotypes, such as growth and exchange rates with the environment, accurate prediction of intracellular fluxes remains a pressing problem. Parsimonious flux balance analysis (pFBA) has become an approach of choice to predict intracellular fluxes by employing the principle of efficient usage of protein resources. Nevertheless, comparative analyses of intracellular flux predictions from pFBA against fluxes estimated from labeling experiments remain scarce. Here, we posited that steady-state flux distributions derived from the principle of maximizing multi-reaction dependencies are of improved accuracy and precision than those resulting from pFBA. To this end, we designed a constraint-based approach, termed complex-balanced FBA (cbFBA), to predict steady-state flux distributions that support the given specific growth rate and exchange fluxes. We showed that the steady-state flux distributions resulting from cbFBA in comparison to pFBA show better agreement with experimentally measured fluxes from 17 Escherichia coli strains and are more precise, due to the smaller space of alternative solutions. We also showed that the same principle holds in eukaryotes by comparing the predictions of pFBA and cbFBA against experimentally derived steady-state flux distributions from 26 knock-out mutants of Saccharomyces cerevisiae. Furthermore, our results showed that intracellular fluxes predicted by cbFBA provide better support for the principle of minimizing metabolic adjustment between mutants and wild types. Together, our findings point that other principles that consider the dynamics and coordination of steady states may govern the distribution of intracellular fluxes.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 9","pages":"e1011489"},"PeriodicalIF":4.3,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10538754/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10312490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nowcasting the 2022 mpox outbreak in England. 现在预测2022年猴痘在英格兰的爆发。
IF 4.3 2区 生物学 Pub Date : 2023-09-18 eCollection Date: 2023-09-01 DOI: 10.1371/journal.pcbi.1011463
Christopher E Overton, Sam Abbott, Rachel Christie, Fergus Cumming, Julie Day, Owen Jones, Rob Paton, Charlie Turner, Thomas Ward

In May 2022, a cluster of mpox cases were detected in the UK that could not be traced to recent travel history from an endemic region. Over the coming months, the outbreak grew, with over 3000 total cases reported in the UK, and similar outbreaks occurring worldwide. These outbreaks appeared linked to sexual contact networks between gay, bisexual and other men who have sex with men. Following the COVID-19 pandemic, local health systems were strained, and therefore effective surveillance for mpox was essential for managing public health policy. However, the mpox outbreak in the UK was characterised by substantial delays in the reporting of the symptom onset date and specimen collection date for confirmed positive cases. These delays led to substantial backfilling in the epidemic curve, making it challenging to interpret the epidemic trajectory in real-time. Many nowcasting models exist to tackle this challenge in epidemiological data, but these lacked sufficient flexibility. We have developed a nowcasting model using generalised additive models that makes novel use of individual-level patient data to correct the mpox epidemic curve in England. The aim of this model is to correct for backfilling in the epidemic curve and provide real-time characteristics of the state of the epidemic, including the real-time growth rate. This model benefited from close collaboration with individuals involved in collecting and processing the data, enabling temporal changes in the reporting structure to be built into the model, which improved the robustness of the nowcasts generated. The resulting model accurately captured the true shape of the epidemic curve in real time.

2022年5月,在英国发现了一组猴痘病例,这些病例无法追溯到流行地区的近期旅行史。在接下来的几个月里,疫情加剧,英国报告的总病例超过3000例,类似的疫情也在全球范围内发生。这些疫情似乎与同性恋、双性恋和其他与男性发生性关系的男性之间的性接触网络有关。新冠肺炎大流行后,当地卫生系统紧张,因此对猴痘的有效监测对于管理公共卫生政策至关重要。然而,英国猴痘疫情的特点是,确诊阳性病例的症状出现日期和样本采集日期的报告大幅延迟。这些延迟导致了疫情曲线的大幅回填,使得实时解释疫情轨迹具有挑战性。目前存在许多模型来应对流行病学数据中的这一挑战,但这些模型缺乏足够的灵活性。我们使用广义加性模型开发了一个即时预报模型,该模型新颖地利用了个体水平的患者数据来校正英格兰的猴痘流行病曲线。该模型的目的是校正流行病曲线中的回填,并提供流行病状态的实时特征,包括实时增长率。该模型得益于与参与收集和处理数据的个人的密切合作,使报告结构的时间变化能够纳入模型,从而提高了生成的即时预报的稳健性。由此产生的模型实时准确地捕捉到了流行病曲线的真实形状。
{"title":"Nowcasting the 2022 mpox outbreak in England.","authors":"Christopher E Overton, Sam Abbott, Rachel Christie, Fergus Cumming, Julie Day, Owen Jones, Rob Paton, Charlie Turner, Thomas Ward","doi":"10.1371/journal.pcbi.1011463","DOIUrl":"10.1371/journal.pcbi.1011463","url":null,"abstract":"<p><p>In May 2022, a cluster of mpox cases were detected in the UK that could not be traced to recent travel history from an endemic region. Over the coming months, the outbreak grew, with over 3000 total cases reported in the UK, and similar outbreaks occurring worldwide. These outbreaks appeared linked to sexual contact networks between gay, bisexual and other men who have sex with men. Following the COVID-19 pandemic, local health systems were strained, and therefore effective surveillance for mpox was essential for managing public health policy. However, the mpox outbreak in the UK was characterised by substantial delays in the reporting of the symptom onset date and specimen collection date for confirmed positive cases. These delays led to substantial backfilling in the epidemic curve, making it challenging to interpret the epidemic trajectory in real-time. Many nowcasting models exist to tackle this challenge in epidemiological data, but these lacked sufficient flexibility. We have developed a nowcasting model using generalised additive models that makes novel use of individual-level patient data to correct the mpox epidemic curve in England. The aim of this model is to correct for backfilling in the epidemic curve and provide real-time characteristics of the state of the epidemic, including the real-time growth rate. This model benefited from close collaboration with individuals involved in collecting and processing the data, enabling temporal changes in the reporting structure to be built into the model, which improved the robustness of the nowcasts generated. The resulting model accurately captured the true shape of the epidemic curve in real time.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 9","pages":"e1011463"},"PeriodicalIF":4.3,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10538717/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10307570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring tumor-normal cross-talk with TranNet: Role of the environment in tumor progression. 使用TranNet探索肿瘤正常串扰:环境在肿瘤进展中的作用。
IF 4.3 2区 生物学 Pub Date : 2023-09-18 eCollection Date: 2023-09-01 DOI: 10.1371/journal.pcbi.1011472
Bayarbaatar Amgalan, Chi-Ping Day, Teresa M Przytycka

There is a growing awareness that tumor-adjacent normal tissues used as control samples in cancer studies do not represent fully healthy tissues. Instead, they are intermediates between healthy tissues and tumors. The factors that contribute to the deviation of such control samples from healthy state include exposure to the tumor-promoting factors, tumor-related immune response, and other aspects of tumor microenvironment. Characterizing the relation between gene expression of tumor-adjacent control samples and tumors is fundamental for understanding roles of microenvironment in tumor initiation and progression, as well as for identification of diagnostic and prognostic biomarkers for cancers. To address the demand, we developed and validated TranNet, a computational approach that utilizes gene expression in matched control and tumor samples to study the relation between their gene expression profiles. TranNet infers a sparse weighted bipartite graph from gene expression profiles of matched control samples to tumors. The results allow us to identify predictors (potential regulators) of this transition. To our knowledge, TranNet is the first computational method to infer such dependencies. We applied TranNet to the data of several cancer types and their matched control samples from The Cancer Genome Atlas (TCGA). Many predictors identified by TranNet are genes associated with regulation by the tumor microenvironment as they are enriched in G-protein coupled receptor signaling, cell-to-cell communication, immune processes, and cell adhesion. Correspondingly, targets of inferred predictors are enriched in pathways related to tissue remodelling (including the epithelial-mesenchymal Transition (EMT)), immune response, and cell proliferation. This implies that the predictors are markers and potential stromal facilitators of tumor progression. Our results provide new insights into the relationships between tumor adjacent control sample, tumor and the tumor environment. Moreover, the set of predictors identified by TranNet will provide a valuable resource for future investigations.

人们越来越认识到,在癌症研究中用作对照样本的肿瘤邻近正常组织并不代表完全健康的组织。相反,它们是健康组织和肿瘤之间的中间产物。导致这种对照样品偏离健康状态的因素包括暴露于肿瘤促进因子、肿瘤相关免疫反应和肿瘤微环境的其他方面。表征肿瘤邻近对照样品和肿瘤的基因表达之间的关系,对于理解微环境在肿瘤发生和发展中的作用,以及鉴定癌症的诊断和预后生物标志物是至关重要的。为了满足这一需求,我们开发并验证了TranNet,这是一种利用匹配对照和肿瘤样本中的基因表达来研究其基因表达谱之间关系的计算方法。TranNet从匹配的对照样本到肿瘤的基因表达谱推断出稀疏加权二分图。这些结果使我们能够确定这种转变的预测因素(潜在的调节因素)。据我们所知,TranNet是第一个推断这种依赖关系的计算方法。我们将TranNet应用于癌症基因组图谱(TCGA)中几种癌症类型及其匹配对照样本的数据。TranNet确定的许多预测因子是与肿瘤微环境调节相关的基因,因为它们富含G蛋白偶联受体信号传导、细胞间通讯、免疫过程和细胞粘附。相应地,推断预测因子的靶点在与组织重塑(包括上皮-间充质转化(EMT))、免疫反应和细胞增殖相关的途径中富集。这意味着预测因子是肿瘤进展的标志物和潜在的基质促进因子。我们的研究结果为肿瘤邻近对照样本、肿瘤和肿瘤环境之间的关系提供了新的见解。此外,TranNet确定的一组预测因子将为未来的研究提供宝贵的资源。
{"title":"Exploring tumor-normal cross-talk with TranNet: Role of the environment in tumor progression.","authors":"Bayarbaatar Amgalan,&nbsp;Chi-Ping Day,&nbsp;Teresa M Przytycka","doi":"10.1371/journal.pcbi.1011472","DOIUrl":"10.1371/journal.pcbi.1011472","url":null,"abstract":"<p><p>There is a growing awareness that tumor-adjacent normal tissues used as control samples in cancer studies do not represent fully healthy tissues. Instead, they are intermediates between healthy tissues and tumors. The factors that contribute to the deviation of such control samples from healthy state include exposure to the tumor-promoting factors, tumor-related immune response, and other aspects of tumor microenvironment. Characterizing the relation between gene expression of tumor-adjacent control samples and tumors is fundamental for understanding roles of microenvironment in tumor initiation and progression, as well as for identification of diagnostic and prognostic biomarkers for cancers. To address the demand, we developed and validated TranNet, a computational approach that utilizes gene expression in matched control and tumor samples to study the relation between their gene expression profiles. TranNet infers a sparse weighted bipartite graph from gene expression profiles of matched control samples to tumors. The results allow us to identify predictors (potential regulators) of this transition. To our knowledge, TranNet is the first computational method to infer such dependencies. We applied TranNet to the data of several cancer types and their matched control samples from The Cancer Genome Atlas (TCGA). Many predictors identified by TranNet are genes associated with regulation by the tumor microenvironment as they are enriched in G-protein coupled receptor signaling, cell-to-cell communication, immune processes, and cell adhesion. Correspondingly, targets of inferred predictors are enriched in pathways related to tissue remodelling (including the epithelial-mesenchymal Transition (EMT)), immune response, and cell proliferation. This implies that the predictors are markers and potential stromal facilitators of tumor progression. Our results provide new insights into the relationships between tumor adjacent control sample, tumor and the tumor environment. Moreover, the set of predictors identified by TranNet will provide a valuable resource for future investigations.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 9","pages":"e1011472"},"PeriodicalIF":4.3,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10538798/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10360970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mathematical reconstruction of the metabolic network in an in-vitro multiple myeloma model. 体外多发性骨髓瘤模型中代谢网络的数学重建。
IF 4.3 2区 生物学 Pub Date : 2023-09-15 eCollection Date: 2023-09-01 DOI: 10.1371/journal.pcbi.1011374
Elias Vera-Siguenza, Cristina Escribano-Gonzalez, Irene Serrano-Gonzalo, Kattri-Liis Eskla, Fabian Spill, Daniel Tennant

It is increasingly apparent that cancer cells, in addition to remodelling their metabolism to survive and proliferate, adapt and manipulate the metabolism of other cells. This property may be a telling sign that pre-clinical tumour metabolism studies exclusively utilising in-vitro mono-culture models could prove to be limited for uncovering novel metabolic targets able to translate into clinical therapies. Although this is increasingly recognised, and work towards addressing the issue is becoming routinary much remains poorly understood. For instance, knowledge regarding the biochemical mechanisms through which cancer cells manipulate non-cancerous cell metabolism, and the subsequent impact on their survival and proliferation remains limited. Additionally, the variations in these processes across different cancer types and progression stages, and their implications for therapy, also remain largely unexplored. This study employs an interdisciplinary approach that leverages the predictive power of mathematical modelling to enrich experimental findings. We develop a functional multicellular in-silico model that facilitates the qualitative and quantitative analysis of the metabolic network spawned by an in-vitro co-culture model of bone marrow mesenchymal stem- and myeloma cell lines. To procure this model, we devised a bespoke human genome constraint-based reconstruction workflow that combines aspects from the legacy mCADRE & Metabotools algorithms, the novel redHuman algorithm, along with 13C-metabolic flux analysis. Our workflow transforms the latest human metabolic network matrix (Recon3D) into two cell-specific models coupled with a metabolic network spanning a shared growth medium. When cross-validating our in-silico model against the in-vitro model, we found that the in-silico model successfully reproduces vital metabolic behaviours of its in-vitro counterpart; results include cell growth predictions, respiration rates, as well as support for observations which suggest cross-shuttling of redox-active metabolites between cells.

越来越明显的是,癌症细胞除了重塑其新陈代谢以生存和增殖外,还适应和操纵其他细胞的新陈代谢。这一特性可能是一个明显的迹象,表明仅利用体外单培养模型的临床前肿瘤代谢研究可能被证明对于揭示能够转化为临床治疗的新代谢靶点是有限的。尽管人们越来越认识到这一点,而且解决这一问题的工作也越来越常规,但人们对这一点仍知之甚少。例如,关于癌症细胞操纵非癌细胞代谢的生化机制,以及随后对其生存和增殖的影响的知识仍然有限。此外,这些过程在不同癌症类型和进展阶段的变化,以及它们对治疗的影响,也在很大程度上未被探索。这项研究采用了一种跨学科的方法,利用数学建模的预测能力来丰富实验结果。我们开发了一种功能性多细胞计算机模型,该模型有助于对骨髓间充质干细胞和骨髓瘤细胞系体外共培养模型产生的代谢网络进行定性和定量分析。为了获得这个模型,我们设计了一个定制的基于人类基因组约束的重建工作流程,该工作流程结合了传统的mCADRE和Metabotools算法、新的redHuman算法以及13C代谢通量分析。我们的工作流程将最新的人类代谢网络矩阵(Recon3D)转换为两个细胞特异性模型,并与跨越共享生长培养基的代谢网络相结合。当将我们的计算机模型与体外模型进行交叉验证时,我们发现计算机模型成功地再现了体外模型的重要代谢行为;结果包括细胞生长预测、呼吸速率,以及对观察结果的支持,这些观察结果表明氧化还原活性代谢产物在细胞之间交叉穿梭。
{"title":"Mathematical reconstruction of the metabolic network in an in-vitro multiple myeloma model.","authors":"Elias Vera-Siguenza, Cristina Escribano-Gonzalez, Irene Serrano-Gonzalo, Kattri-Liis Eskla, Fabian Spill, Daniel Tennant","doi":"10.1371/journal.pcbi.1011374","DOIUrl":"10.1371/journal.pcbi.1011374","url":null,"abstract":"<p><p>It is increasingly apparent that cancer cells, in addition to remodelling their metabolism to survive and proliferate, adapt and manipulate the metabolism of other cells. This property may be a telling sign that pre-clinical tumour metabolism studies exclusively utilising in-vitro mono-culture models could prove to be limited for uncovering novel metabolic targets able to translate into clinical therapies. Although this is increasingly recognised, and work towards addressing the issue is becoming routinary much remains poorly understood. For instance, knowledge regarding the biochemical mechanisms through which cancer cells manipulate non-cancerous cell metabolism, and the subsequent impact on their survival and proliferation remains limited. Additionally, the variations in these processes across different cancer types and progression stages, and their implications for therapy, also remain largely unexplored. This study employs an interdisciplinary approach that leverages the predictive power of mathematical modelling to enrich experimental findings. We develop a functional multicellular in-silico model that facilitates the qualitative and quantitative analysis of the metabolic network spawned by an in-vitro co-culture model of bone marrow mesenchymal stem- and myeloma cell lines. To procure this model, we devised a bespoke human genome constraint-based reconstruction workflow that combines aspects from the legacy mCADRE & Metabotools algorithms, the novel redHuman algorithm, along with 13C-metabolic flux analysis. Our workflow transforms the latest human metabolic network matrix (Recon3D) into two cell-specific models coupled with a metabolic network spanning a shared growth medium. When cross-validating our in-silico model against the in-vitro model, we found that the in-silico model successfully reproduces vital metabolic behaviours of its in-vitro counterpart; results include cell growth predictions, respiration rates, as well as support for observations which suggest cross-shuttling of redox-active metabolites between cells.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 9","pages":"e1011374"},"PeriodicalIF":4.3,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10503963/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10309574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incorporating physics to overcome data scarcity in predictive modeling of protein function: A case study of BK channels. 结合物理学克服蛋白质功能预测建模中的数据短缺:BK通道的案例研究。
IF 4.3 2区 生物学 Pub Date : 2023-09-15 eCollection Date: 2023-09-01 DOI: 10.1371/journal.pcbi.1011460
Erik Nordquist, Guohui Zhang, Shrishti Barethiya, Nathan Ji, Kelli M White, Lu Han, Zhiguang Jia, Jingyi Shi, Jianmin Cui, Jianhan Chen

Machine learning has played transformative roles in numerous chemical and biophysical problems such as protein folding where large amount of data exists. Nonetheless, many important problems remain challenging for data-driven machine learning approaches due to the limitation of data scarcity. One approach to overcome data scarcity is to incorporate physical principles such as through molecular modeling and simulation. Here, we focus on the big potassium (BK) channels that play important roles in cardiovascular and neural systems. Many mutants of BK channel are associated with various neurological and cardiovascular diseases, but the molecular effects are unknown. The voltage gating properties of BK channels have been characterized for 473 site-specific mutations experimentally over the last three decades; yet, these functional data by themselves remain far too sparse to derive a predictive model of BK channel voltage gating. Using physics-based modeling, we quantify the energetic effects of all single mutations on both open and closed states of the channel. Together with dynamic properties derived from atomistic simulations, these physical descriptors allow the training of random forest models that could reproduce unseen experimentally measured shifts in gating voltage, ∆V1/2, with a RMSE ~ 32 mV and correlation coefficient of R ~ 0.7. Importantly, the model appears capable of uncovering nontrivial physical principles underlying the gating of the channel, including a central role of hydrophobic gating. The model was further evaluated using four novel mutations of L235 and V236 on the S5 helix, mutations of which are predicted to have opposing effects on V1/2 and suggest a key role of S5 in mediating voltage sensor-pore coupling. The measured ∆V1/2 agree quantitatively with prediction for all four mutations, with a high correlation of R = 0.92 and RMSE = 18 mV. Therefore, the model can capture nontrivial voltage gating properties in regions where few mutations are known. The success of predictive modeling of BK voltage gating demonstrates the potential of combining physics and statistical learning for overcoming data scarcity in nontrivial protein function prediction.

机器学习在许多化学和生物物理问题中发挥了变革性作用,例如存在大量数据的蛋白质折叠。尽管如此,由于数据稀缺的限制,数据驱动的机器学习方法仍然面临许多重要问题。克服数据匮乏的一种方法是结合物理原理,例如通过分子建模和模拟。在这里,我们关注在心血管和神经系统中发挥重要作用的大钾(BK)通道。BK通道的许多突变体与各种神经和心血管疾病有关,但其分子效应尚不清楚。在过去三十年中,BK通道的电压门控特性已经通过实验表征了473个位点特异性突变;然而,这些函数数据本身仍然过于稀疏,无法导出BK沟道电压门控的预测模型。使用基于物理的建模,我们量化了所有单个突变对通道打开和关闭状态的能量影响。结合原子模拟得出的动态特性,这些物理描述符允许训练随机森林模型,该模型可以再现未经实验测量的门控电压∆V1/2的偏移,RMSE~32mV,相关系数R~0.7。重要的是,该模型似乎能够揭示通道门控的重要物理原理,包括疏水门控的核心作用。使用S5螺旋上L235和V236的四个新突变对该模型进行了进一步评估,据预测,这两个突变对V1/2具有相反的影响,并表明S5在介导电压传感器-孔耦合中发挥着关键作用。测量的∆V1/2在数量上与所有四种突变的预测一致,具有R=0.92和RMSE=18mV的高度相关性。因此,该模型可以捕捉到已知突变很少的区域中的非平凡电压门控特性。BK电压门控预测建模的成功证明了将物理学和统计学习相结合以克服非平凡蛋白质功能预测中数据匮乏的潜力。
{"title":"Incorporating physics to overcome data scarcity in predictive modeling of protein function: A case study of BK channels.","authors":"Erik Nordquist,&nbsp;Guohui Zhang,&nbsp;Shrishti Barethiya,&nbsp;Nathan Ji,&nbsp;Kelli M White,&nbsp;Lu Han,&nbsp;Zhiguang Jia,&nbsp;Jingyi Shi,&nbsp;Jianmin Cui,&nbsp;Jianhan Chen","doi":"10.1371/journal.pcbi.1011460","DOIUrl":"10.1371/journal.pcbi.1011460","url":null,"abstract":"<p><p>Machine learning has played transformative roles in numerous chemical and biophysical problems such as protein folding where large amount of data exists. Nonetheless, many important problems remain challenging for data-driven machine learning approaches due to the limitation of data scarcity. One approach to overcome data scarcity is to incorporate physical principles such as through molecular modeling and simulation. Here, we focus on the big potassium (BK) channels that play important roles in cardiovascular and neural systems. Many mutants of BK channel are associated with various neurological and cardiovascular diseases, but the molecular effects are unknown. The voltage gating properties of BK channels have been characterized for 473 site-specific mutations experimentally over the last three decades; yet, these functional data by themselves remain far too sparse to derive a predictive model of BK channel voltage gating. Using physics-based modeling, we quantify the energetic effects of all single mutations on both open and closed states of the channel. Together with dynamic properties derived from atomistic simulations, these physical descriptors allow the training of random forest models that could reproduce unseen experimentally measured shifts in gating voltage, ∆V1/2, with a RMSE ~ 32 mV and correlation coefficient of R ~ 0.7. Importantly, the model appears capable of uncovering nontrivial physical principles underlying the gating of the channel, including a central role of hydrophobic gating. The model was further evaluated using four novel mutations of L235 and V236 on the S5 helix, mutations of which are predicted to have opposing effects on V1/2 and suggest a key role of S5 in mediating voltage sensor-pore coupling. The measured ∆V1/2 agree quantitatively with prediction for all four mutations, with a high correlation of R = 0.92 and RMSE = 18 mV. Therefore, the model can capture nontrivial voltage gating properties in regions where few mutations are known. The success of predictive modeling of BK voltage gating demonstrates the potential of combining physics and statistical learning for overcoming data scarcity in nontrivial protein function prediction.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 9","pages":"e1011460"},"PeriodicalIF":4.3,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10529646/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10265087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Excalibur: A new ensemble method based on an optimal combination of aggregation tests for rare-variant association testing for sequencing data. Excalibur:一种基于聚集测试最佳组合的新集成方法,用于测序数据的罕见变异关联测试。
IF 4.3 2区 生物学 Pub Date : 2023-09-14 eCollection Date: 2023-09-01 DOI: 10.1371/journal.pcbi.1011488
Simon Boutry, Raphaël Helaers, Tom Lenaerts, Miikka Vikkula

The development of high-throughput next-generation sequencing technologies and large-scale genetic association studies produced numerous advances in the biostatistics field. Various aggregation tests, i.e. statistical methods that analyze associations of a trait with multiple markers within a genomic region, have produced a variety of novel discoveries. Notwithstanding their usefulness, there is no single test that fits all needs, each suffering from specific drawbacks. Selecting the right aggregation test, while considering an unknown underlying genetic model of the disease, remains an important challenge. Here we propose a new ensemble method, called Excalibur, based on an optimal combination of 36 aggregation tests created after an in-depth study of the limitations of each test and their impact on the quality of result. Our findings demonstrate the ability of our method to control type I error and illustrate that it offers the best average power across all scenarios. The proposed method allows for novel advances in Whole Exome/Genome sequencing association studies, able to handle a wide range of association models, providing researchers with an optimal aggregation analysis for the genetic regions of interest.

高通量下一代测序技术的发展和大规模遗传关联研究在生物统计学领域取得了许多进展。各种聚集测试,即分析一个性状与基因组区域内多个标记的关联的统计方法,已经产生了各种新的发现。尽管它们很有用,但没有一种测试能满足所有需求,每种测试都有特定的缺点。选择正确的聚集性测试,同时考虑疾病的未知潜在遗传模型,仍然是一个重要的挑战。在这里,我们提出了一种新的集成方法,称为Excalibur,该方法基于对每个测试的局限性及其对结果质量的影响进行深入研究后创建的36个聚合测试的最佳组合。我们的发现证明了我们的方法控制I型误差的能力,并说明它在所有情况下都能提供最佳的平均功率。所提出的方法允许全外显子组/基因组测序关联研究取得新进展,能够处理广泛的关联模型,为研究人员提供感兴趣的遗传区域的最佳聚集分析。
{"title":"Excalibur: A new ensemble method based on an optimal combination of aggregation tests for rare-variant association testing for sequencing data.","authors":"Simon Boutry,&nbsp;Raphaël Helaers,&nbsp;Tom Lenaerts,&nbsp;Miikka Vikkula","doi":"10.1371/journal.pcbi.1011488","DOIUrl":"10.1371/journal.pcbi.1011488","url":null,"abstract":"<p><p>The development of high-throughput next-generation sequencing technologies and large-scale genetic association studies produced numerous advances in the biostatistics field. Various aggregation tests, i.e. statistical methods that analyze associations of a trait with multiple markers within a genomic region, have produced a variety of novel discoveries. Notwithstanding their usefulness, there is no single test that fits all needs, each suffering from specific drawbacks. Selecting the right aggregation test, while considering an unknown underlying genetic model of the disease, remains an important challenge. Here we propose a new ensemble method, called Excalibur, based on an optimal combination of 36 aggregation tests created after an in-depth study of the limitations of each test and their impact on the quality of result. Our findings demonstrate the ability of our method to control type I error and illustrate that it offers the best average power across all scenarios. The proposed method allows for novel advances in Whole Exome/Genome sequencing association studies, able to handle a wide range of association models, providing researchers with an optimal aggregation analysis for the genetic regions of interest.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 9","pages":"e1011488"},"PeriodicalIF":4.3,"publicationDate":"2023-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10522036/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10233141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Mixtures of strategies underlie rodent behavior during reversal learning. 在反向学习过程中,策略的混合是啮齿动物行为的基础。
IF 4.3 2区 生物学 Pub Date : 2023-09-14 eCollection Date: 2023-09-01 DOI: 10.1371/journal.pcbi.1011430
Nhat Minh Le, Murat Yildirim, Yizhi Wang, Hiroki Sugihara, Mehrdad Jazayeri, Mriganka Sur

In reversal learning tasks, the behavior of humans and animals is often assumed to be uniform within single experimental sessions to facilitate data analysis and model fitting. However, behavior of agents can display substantial variability in single experimental sessions, as they execute different blocks of trials with different transition dynamics. Here, we observed that in a deterministic reversal learning task, mice display noisy and sub-optimal choice transitions even at the expert stages of learning. We investigated two sources of the sub-optimality in the behavior. First, we found that mice exhibit a high lapse rate during task execution, as they reverted to unrewarded directions after choice transitions. Second, we unexpectedly found that a majority of mice did not execute a uniform strategy, but rather mixed between several behavioral modes with different transition dynamics. We quantified the use of such mixtures with a state-space model, block Hidden Markov Model (block HMM), to dissociate the mixtures of dynamic choice transitions in individual blocks of trials. Additionally, we found that blockHMM transition modes in rodent behavior can be accounted for by two different types of behavioral algorithms, model-free or inference-based learning, that might be used to solve the task. Combining these approaches, we found that mice used a mixture of both exploratory, model-free strategies and deterministic, inference-based behavior in the task, explaining their overall noisy choice sequences. Together, our combined computational approach highlights intrinsic sources of noise in rodent reversal learning behavior and provides a richer description of behavior than conventional techniques, while uncovering the hidden states that underlie the block-by-block transitions.

在反向学习任务中,通常假设人类和动物的行为在单个实验会话中是一致的,以便于数据分析和模型拟合。然而,代理人的行为在单个实验过程中可能表现出显著的可变性,因为他们执行具有不同过渡动力学的不同试验块。在这里,我们观察到,在确定性反向学习任务中,即使在专家学习阶段,小鼠也会表现出嘈杂和次优的选择转换。我们研究了行为中次最优性的两个来源。首先,我们发现老鼠在执行任务时表现出很高的失误率,因为它们在选择转换后会回到未回报的方向。其次,我们意外地发现,大多数小鼠并没有执行统一的策略,而是在具有不同过渡动力学的几种行为模式之间混合。我们用状态空间模型块隐马尔可夫模型(块HMM)量化了这种混合物的使用,以分离单个试验块中的动态选择转换的混合物。此外,我们发现啮齿动物行为中的blockHMM转换模式可以由两种不同类型的行为算法来解释,无模型或基于推理的学习,这两种算法可能用于解决任务。结合这些方法,我们发现小鼠在任务中使用了探索性的、无模型的策略和确定性的、基于推理的行为,解释了它们的总体噪声选择序列。总之,我们的组合计算方法突出了啮齿动物反向学习行为中的内在噪声源,并提供了比传统技术更丰富的行为描述,同时揭示了逐块转换背后的隐藏状态。
{"title":"Mixtures of strategies underlie rodent behavior during reversal learning.","authors":"Nhat Minh Le, Murat Yildirim, Yizhi Wang, Hiroki Sugihara, Mehrdad Jazayeri, Mriganka Sur","doi":"10.1371/journal.pcbi.1011430","DOIUrl":"10.1371/journal.pcbi.1011430","url":null,"abstract":"<p><p>In reversal learning tasks, the behavior of humans and animals is often assumed to be uniform within single experimental sessions to facilitate data analysis and model fitting. However, behavior of agents can display substantial variability in single experimental sessions, as they execute different blocks of trials with different transition dynamics. Here, we observed that in a deterministic reversal learning task, mice display noisy and sub-optimal choice transitions even at the expert stages of learning. We investigated two sources of the sub-optimality in the behavior. First, we found that mice exhibit a high lapse rate during task execution, as they reverted to unrewarded directions after choice transitions. Second, we unexpectedly found that a majority of mice did not execute a uniform strategy, but rather mixed between several behavioral modes with different transition dynamics. We quantified the use of such mixtures with a state-space model, block Hidden Markov Model (block HMM), to dissociate the mixtures of dynamic choice transitions in individual blocks of trials. Additionally, we found that blockHMM transition modes in rodent behavior can be accounted for by two different types of behavioral algorithms, model-free or inference-based learning, that might be used to solve the task. Combining these approaches, we found that mice used a mixture of both exploratory, model-free strategies and deterministic, inference-based behavior in the task, explaining their overall noisy choice sequences. Together, our combined computational approach highlights intrinsic sources of noise in rodent reversal learning behavior and provides a richer description of behavior than conventional techniques, while uncovering the hidden states that underlie the block-by-block transitions.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 9","pages":"e1011430"},"PeriodicalIF":4.3,"publicationDate":"2023-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10501641/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10276831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncovering specific mechanisms across cell types in dynamical models. 揭示动力学模型中跨细胞类型的特定机制。
IF 4.3 2区 生物学 Pub Date : 2023-09-13 eCollection Date: 2023-09-01 DOI: 10.1371/journal.pcbi.1010867
Adrian L Hauber, Marcus Rosenblatt, Jens Timmer

Ordinary differential equations are frequently employed for mathematical modeling of biological systems. The identification of mechanisms that are specific to certain cell types is crucial for building useful models and to gain insights into the underlying biological processes. Regularization techniques have been proposed and applied to identify mechanisms specific to two cell types, e.g., healthy and cancer cells, including the LASSO (least absolute shrinkage and selection operator). However, when analyzing more than two cell types, these approaches are not consistent, and require the selection of a reference cell type, which can affect the results. To make the regularization approach applicable to identifying cell-type specific mechanisms in any number of cell types, we propose to incorporate the clustered LASSO into the framework of ordinary differential equation modeling by penalizing the pairwise differences of the logarithmized fold-change parameters encoding a specific mechanism in different cell types. The symmetry introduced by this approach renders the results independent of the reference cell type. We discuss the necessary adaptations of state-of-the-art numerical optimization techniques and the process of model selection for this method. We assess the performance with realistic biological models and synthetic data, and demonstrate that it outperforms existing approaches. Finally, we also exemplify its application to published biological models including experimental data, and link the results to independent biological measurements.

常微分方程经常用于生物系统的数学建模。识别特定细胞类型的机制对于建立有用的模型和深入了解潜在的生物过程至关重要。已经提出并应用正则化技术来识别两种细胞类型(例如,健康细胞和癌症细胞)的特异性机制,包括LASSO(最小绝对收缩和选择算子)。然而,当分析两种以上的细胞类型时,这些方法并不一致,并且需要选择参考细胞类型,这可能会影响结果。为了使正则化方法适用于识别任何数量的细胞类型中的细胞类型特异性机制,我们建议通过惩罚编码不同细胞类型中特定机制的对数倍数变化参数的成对差异,将聚类LASSO纳入常微分方程建模的框架中。这种方法引入的对称性使得结果与参考细胞类型无关。我们讨论了最先进的数值优化技术的必要适应性以及该方法的模型选择过程。我们用真实的生物模型和合成数据评估了性能,并证明它优于现有方法。最后,我们还举例说明了它在包括实验数据在内的已发表生物模型中的应用,并将结果与独立的生物测量结果联系起来。
{"title":"Uncovering specific mechanisms across cell types in dynamical models.","authors":"Adrian L Hauber, Marcus Rosenblatt, Jens Timmer","doi":"10.1371/journal.pcbi.1010867","DOIUrl":"10.1371/journal.pcbi.1010867","url":null,"abstract":"<p><p>Ordinary differential equations are frequently employed for mathematical modeling of biological systems. The identification of mechanisms that are specific to certain cell types is crucial for building useful models and to gain insights into the underlying biological processes. Regularization techniques have been proposed and applied to identify mechanisms specific to two cell types, e.g., healthy and cancer cells, including the LASSO (least absolute shrinkage and selection operator). However, when analyzing more than two cell types, these approaches are not consistent, and require the selection of a reference cell type, which can affect the results. To make the regularization approach applicable to identifying cell-type specific mechanisms in any number of cell types, we propose to incorporate the clustered LASSO into the framework of ordinary differential equation modeling by penalizing the pairwise differences of the logarithmized fold-change parameters encoding a specific mechanism in different cell types. The symmetry introduced by this approach renders the results independent of the reference cell type. We discuss the necessary adaptations of state-of-the-art numerical optimization techniques and the process of model selection for this method. We assess the performance with realistic biological models and synthetic data, and demonstrate that it outperforms existing approaches. Finally, we also exemplify its application to published biological models including experimental data, and link the results to independent biological measurements.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 9","pages":"e1010867"},"PeriodicalIF":4.3,"publicationDate":"2023-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10519600/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10285169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Beyond ℓ1 sparse coding in V1. 超过ℓ1中的稀疏编码。
IF 4.3 2区 生物学 Pub Date : 2023-09-12 eCollection Date: 2023-09-01 DOI: 10.1371/journal.pcbi.1011459
Ilias Rentzeperis, Luca Calatroni, Laurent U Perrinet, Dario Prandi

Growing evidence indicates that only a sparse subset from a pool of sensory neurons is active for the encoding of visual stimuli at any instant in time. Traditionally, to replicate such biological sparsity, generative models have been using the ℓ1 norm as a penalty due to its convexity, which makes it amenable to fast and simple algorithmic solvers. In this work, we use biological vision as a test-bed and show that the soft thresholding operation associated to the use of the ℓ1 norm is highly suboptimal compared to other functions suited to approximating ℓp with 0 ≤ p < 1 (including recently proposed continuous exact relaxations), in terms of performance. We show that ℓ1 sparsity employs a pool with more neurons, i.e. has a higher degree of overcompleteness, in order to maintain the same reconstruction error as the other methods considered. More specifically, at the same sparsity level, the thresholding algorithm using the ℓ1 norm as a penalty requires a dictionary of ten times more units compared to the proposed approach, where a non-convex continuous relaxation of the ℓ0 pseudo-norm is used, to reconstruct the external stimulus equally well. At a fixed sparsity level, both ℓ0- and ℓ1-based regularization develop units with receptive field (RF) shapes similar to biological neurons in V1 (and a subset of neurons in V2), but ℓ0-based regularization shows approximately five times better reconstruction of the stimulus. Our results in conjunction with recent metabolic findings indicate that for V1 to operate efficiently it should follow a coding regime which uses a regularization that is closer to the ℓ0 pseudo-norm rather than the ℓ1 one, and suggests a similar mode of operation for the sensory cortex in general.

越来越多的证据表明,在任何时刻,只有来自感觉神经元池的稀疏子集对视觉刺激的编码是活跃的。传统上,为了复制这种生物稀疏性,生成模型一直在使用ℓ1范数作为惩罚,因为它具有凸性,这使得它适用于快速简单的算法求解器。在这项工作中,我们使用生物视觉作为试验台,并表明与使用ℓ与适用于近似的其他函数相比,1范数是高度次优的ℓp≤p<1(包括最近提出的连续精确弛豫)。我们展示了ℓ1稀疏性采用了具有更多神经元的池,即具有更高程度的过完全性,以便保持与所考虑的其他方法相同的重建误差。更具体地,在相同的稀疏性水平下,使用ℓ1范数作为惩罚需要一个比所提出的方法多十倍的单位字典,其中ℓ0伪范数,可以同样好地重构外部刺激。在固定的稀疏性级别上ℓ0和ℓ基于1的正则化开发具有与V1中的生物神经元(以及V2中的神经元子集)相似的感受野(RF)形状的单元,但是ℓ基于0的正则化显示了对刺激的大约五倍的更好的重建。我们的研究结果以及最近的代谢发现表明,为了使V1有效运行,它应该遵循一种编码机制,该机制使用更接近ℓ0伪范数,而不是ℓ1之一,并提出了一般感觉皮层的类似操作模式。
{"title":"Beyond ℓ1 sparse coding in V1.","authors":"Ilias Rentzeperis,&nbsp;Luca Calatroni,&nbsp;Laurent U Perrinet,&nbsp;Dario Prandi","doi":"10.1371/journal.pcbi.1011459","DOIUrl":"10.1371/journal.pcbi.1011459","url":null,"abstract":"<p><p>Growing evidence indicates that only a sparse subset from a pool of sensory neurons is active for the encoding of visual stimuli at any instant in time. Traditionally, to replicate such biological sparsity, generative models have been using the ℓ1 norm as a penalty due to its convexity, which makes it amenable to fast and simple algorithmic solvers. In this work, we use biological vision as a test-bed and show that the soft thresholding operation associated to the use of the ℓ1 norm is highly suboptimal compared to other functions suited to approximating ℓp with 0 ≤ p < 1 (including recently proposed continuous exact relaxations), in terms of performance. We show that ℓ1 sparsity employs a pool with more neurons, i.e. has a higher degree of overcompleteness, in order to maintain the same reconstruction error as the other methods considered. More specifically, at the same sparsity level, the thresholding algorithm using the ℓ1 norm as a penalty requires a dictionary of ten times more units compared to the proposed approach, where a non-convex continuous relaxation of the ℓ0 pseudo-norm is used, to reconstruct the external stimulus equally well. At a fixed sparsity level, both ℓ0- and ℓ1-based regularization develop units with receptive field (RF) shapes similar to biological neurons in V1 (and a subset of neurons in V2), but ℓ0-based regularization shows approximately five times better reconstruction of the stimulus. Our results in conjunction with recent metabolic findings indicate that for V1 to operate efficiently it should follow a coding regime which uses a regularization that is closer to the ℓ0 pseudo-norm rather than the ℓ1 one, and suggests a similar mode of operation for the sensory cortex in general.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 9","pages":"e1011459"},"PeriodicalIF":4.3,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516432/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10214238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the use of social contact data to produce age-specific short-term forecasts of SARS-CoV-2 incidence in England. 评估使用社会接触数据对英格兰严重急性呼吸系统综合征冠状病毒2型发病率进行特定年龄的短期预测。
IF 4.3 2区 生物学 Pub Date : 2023-09-12 eCollection Date: 2023-09-01 DOI: 10.1371/journal.pcbi.1011453
James D Munday, Sam Abbott, Sophie Meakin, Sebastian Funk

Mathematical and statistical models can be used to make predictions of how epidemics may progress in the near future and form a central part of outbreak mitigation and control. Renewal equation based models allow inference of epidemiological parameters from historical data and forecast future epidemic dynamics without requiring complex mechanistic assumptions. However, these models typically ignore interaction between age groups, partly due to challenges in parameterising a time varying interaction matrix. Social contact data collected regularly during the COVID-19 epidemic provide a means to inform interaction between age groups in real-time. We developed an age-specific forecasting framework and applied it to two age-stratified time-series: incidence of SARS-CoV-2 infection, estimated from a national infection and antibody prevalence survey; and, reported cases according to the UK national COVID-19 dashboard. Jointly fitting our model to social contact data from the CoMix study, we inferred a time-varying next generation matrix which we used to project infections and cases in the four weeks following each of 29 forecast dates between October 2020 and November 2021. We evaluated the forecasts using proper scoring rules and compared performance with three other models with alternative data and specifications alongside two naive baseline models. Overall, incorporating age interaction improved forecasts of infections and the CoMix-data-informed model was the best performing model at time horizons between two and four weeks. However, this was not true when forecasting cases. We found that age group interaction was most important for predicting cases in children and older adults. The contact-data-informed models performed best during the winter months of 2020-2021, but performed comparatively poorly in other periods. We highlight challenges regarding the incorporation of contact data in forecasting and offer proposals as to how to extend and adapt our approach, which may lead to more successful forecasts in future.

数学和统计模型可用于预测流行病在不久的将来可能如何发展,并构成疫情缓解和控制的核心部分。基于更新方程的模型允许从历史数据推断流行病学参数,并预测未来的流行病动态,而不需要复杂的机制假设。然而,这些模型通常忽略了年龄组之间的相互作用,部分原因是在参数化时变相互作用矩阵方面存在挑战。新冠肺炎疫情期间定期收集的社交接触数据提供了一种实时告知年龄组之间互动的手段。我们开发了一个特定年龄的预测框架,并将其应用于两个年龄分层的时间序列:根据全国感染和抗体流行率调查估计的严重急性呼吸系统综合征冠状病毒2型感染的发病率;以及,根据英国国家新冠肺炎仪表盘报告的病例。我们将我们的模型与CoMix研究的社会接触数据联合拟合,推断出一个时变的下一代矩阵,我们使用该矩阵来预测2020年10月至2021年11月期间29个预测日期后的四周内的感染和病例。我们使用适当的评分规则对预测进行了评估,并将性能与其他三个具有替代数据和规范的模型以及两个天真的基线模型进行了比较。总的来说,结合年龄相互作用改善了对感染的预测,CoMix数据知情模型是在两到四周的时间范围内表现最好的模型。然而,在预测案例时,情况并非如此。我们发现,年龄组的互动对于预测儿童和老年人的病例最为重要。接触数据表明,模型在2020-2021年冬季表现最好,但在其他时期表现相对较差。我们强调了在预测中纳入联系数据方面的挑战,并就如何扩展和调整我们的方法提出了建议,这可能会在未来带来更成功的预测。
{"title":"Evaluating the use of social contact data to produce age-specific short-term forecasts of SARS-CoV-2 incidence in England.","authors":"James D Munday,&nbsp;Sam Abbott,&nbsp;Sophie Meakin,&nbsp;Sebastian Funk","doi":"10.1371/journal.pcbi.1011453","DOIUrl":"10.1371/journal.pcbi.1011453","url":null,"abstract":"<p><p>Mathematical and statistical models can be used to make predictions of how epidemics may progress in the near future and form a central part of outbreak mitigation and control. Renewal equation based models allow inference of epidemiological parameters from historical data and forecast future epidemic dynamics without requiring complex mechanistic assumptions. However, these models typically ignore interaction between age groups, partly due to challenges in parameterising a time varying interaction matrix. Social contact data collected regularly during the COVID-19 epidemic provide a means to inform interaction between age groups in real-time. We developed an age-specific forecasting framework and applied it to two age-stratified time-series: incidence of SARS-CoV-2 infection, estimated from a national infection and antibody prevalence survey; and, reported cases according to the UK national COVID-19 dashboard. Jointly fitting our model to social contact data from the CoMix study, we inferred a time-varying next generation matrix which we used to project infections and cases in the four weeks following each of 29 forecast dates between October 2020 and November 2021. We evaluated the forecasts using proper scoring rules and compared performance with three other models with alternative data and specifications alongside two naive baseline models. Overall, incorporating age interaction improved forecasts of infections and the CoMix-data-informed model was the best performing model at time horizons between two and four weeks. However, this was not true when forecasting cases. We found that age group interaction was most important for predicting cases in children and older adults. The contact-data-informed models performed best during the winter months of 2020-2021, but performed comparatively poorly in other periods. We highlight challenges regarding the incorporation of contact data in forecasting and offer proposals as to how to extend and adapt our approach, which may lead to more successful forecasts in future.</p>","PeriodicalId":49688,"journal":{"name":"PLoS Computational Biology","volume":"19 9","pages":"e1011453"},"PeriodicalIF":4.3,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516435/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10263500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
PLoS Computational Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1