Genomic surveillance of pathogen evolution is essential for public health response, treatment strategies, and vaccine development. In the context of SARS-CoV-2, multiple models have been developed, including Multinomial Logistic Regression (MLR) describing variant frequency growth, as well as the Fixed Growth Advantage (FGA), Growth Advantage Random Walk (GARW) and Piantham parameterizations describing variant Rt. These models provide estimates of variant fitness and can be used to forecast changes in variant frequency. We introduce a framework for evaluating real-time forecasts of variant frequencies, and apply this framework to the evolution of SARS-CoV-2 during 2022, in which multiple new viral variants emerged and rapidly spread through the population. We compare models across representative countries with different intensities of genomic surveillance. Retrospective assessment of model accuracy highlights that most models of variant frequency perform well and are able to produce reasonable forecasts. We find that the simple MLR model provides ∼0.6% median absolute error and ∼6% mean absolute error when forecasting 30 days out for countries with robust genomic surveillance. We investigate the impacts of sequence quantity and quality across countries on forecast accuracy and conduct systematic downsampling to identify that 1000 sequences per week is sufficient for accurate short-term forecasts. We conclude that fitness models represent a useful prognostic tool for short-term evolutionary forecasting.
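The MLR model described above treats variant frequencies as a softmax over variant-specific intercepts and growth rates. A minimal sketch of the forecasting step, using hypothetical fitted coefficients (the function name and values below are illustrative, not from the paper):

```python
import numpy as np

def mlr_frequency_forecast(intercepts, growth_rates, t):
    """Forecast variant frequencies at time t (days) under an MLR model:
    p_v(t) = softmax(a_v + b_v * t), where b_v encodes each variant's
    growth advantage relative to a reference variant."""
    logits = np.asarray(intercepts) + np.asarray(growth_rates) * t
    logits -= logits.max()              # for numerical stability
    weights = np.exp(logits)
    return weights / weights.sum()

# Hypothetical coefficients for three variants (variant 0 is the reference)
a = [0.0, -1.0, -3.0]
b = [0.0, 0.05, 0.12]                   # per-day relative growth rates
freqs_30 = mlr_frequency_forecast(a, b, t=30.0)
```

In practice the coefficients would be fit by maximum likelihood to sequence counts per day; the forecast itself is then just this softmax evaluated at a future time.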
Effective analysis of single-cell RNA sequencing (scRNA-seq) data requires a rigorous distinction between technical noise and biological variation. In this work, we propose a simple feature selection model, termed "Differentially Distributed Genes" or DDGs, where a binomial sampling process for each mRNA species produces a null model of technical variation. Using scRNA-seq data where cell identities have been established a priori, we find that the DDG model of biological variation outperforms existing methods. We demonstrate that DDGs distinguish a validated set of real biologically varying genes, minimize neighborhood distortion, and enable accurate partitioning of cells into their established cell-type groups.
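The DDG idea of a binomial sampling null can be sketched concretely: if sequencing draws each cell's reads as a multinomial sample over genes, a gene's per-cell counts under pure technical noise are approximately binomial, with a variance fixed by sequencing depth and the gene's pooled proportion. The scoring function below is an illustrative simplification of that null, not the paper's exact statistic:

```python
import numpy as np

def ddg_scores(counts):
    """Score genes by excess variance over a binomial-sampling null.

    counts: (cells x genes) integer matrix. Under the null, cell c's count
    for gene g is Binomial(n_c, p_g), with n_c the cell's total depth and
    p_g the gene's pooled proportion, so the null variance in cell c is
    n_c * p_g * (1 - p_g). Scores far above 1 suggest variation beyond
    technical noise, i.e. candidate differentially distributed genes.
    """
    counts = np.asarray(counts, dtype=float)
    depths = counts.sum(axis=1)                # n_c per cell
    p = counts.sum(axis=0) / depths.sum()      # pooled p_g per gene
    null_var = (depths[:, None] * p[None, :] * (1 - p[None, :])).mean(axis=0)
    obs_var = counts.var(axis=0, ddof=1)
    return obs_var / np.maximum(null_var, 1e-12)
```

A gene whose true expression differs between cell types will show between-group variance far above this binomial floor, while a uniformly expressed gene scores near 1.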
Reverse epidemiology is a mathematical modelling tool used to ascertain information about the source of a pathogen, given the spatial and temporal distribution of cases, hospitalisations and deaths. In the context of a deliberately released pathogen, such as Bacillus anthracis (the disease-causing organism of anthrax), this can allow responders to quickly identify the location and timing of the release, as well as other factors such as the strength of the release and the realized wind speed and direction at release. These estimates can then be used to parameterise a predictive mechanistic model, allowing for estimation of the potential scale of the release and optimisation of the distribution of prophylaxis. In this paper we present two novel approaches to reverse epidemiology, demonstrate their utility in responding to a simulated deliberate release of B. anthracis in ten locations in the UK, and compare these to the standard grid-search approach. The two methods, a modified MCMC and a Recurrent Convolutional Neural Network, are able to identify the source location and timing of the release with significantly better accuracy than the grid-search approach. Further, the neural network method is able to perform inference on new data significantly faster than either the grid-search or novel MCMC methods, allowing for rapid deployment in time-sensitive outbreaks.
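The inference skeleton of the MCMC approach can be illustrated with a deliberately simplified toy: here the "dispersal model" is just an isotropic Gaussian scatter of cases around the source, standing in for the mechanistic atmospheric-dispersal likelihood the paper uses. Everything below (function name, likelihood, parameters) is an assumption for illustration:

```python
import numpy as np

def mh_source_inference(case_xy, n_iter=5000, step=0.05, seed=0):
    """Toy Metropolis-Hastings sampler for a release location (x, y).

    Assumes case locations scatter isotropically around the source with
    known spread sigma; a real reverse-epidemiology likelihood would
    instead come from a mechanistic dispersal/dose-response model.
    `step` should be matched to the posterior scale for good mixing."""
    rng = np.random.default_rng(seed)
    sigma = 1.0

    def log_lik(theta):
        d2 = ((case_xy - theta) ** 2).sum(axis=1)
        return -0.5 * d2.sum() / sigma**2

    theta = case_xy.mean(axis=0)        # initialize at the case centroid
    ll = log_lik(theta)
    samples = []
    for _ in range(n_iter):
        prop = theta + rng.normal(scale=step, size=2)
        ll_prop = log_lik(prop)
        if np.log(rng.uniform()) < ll_prop - ll:   # Metropolis accept step
            theta, ll = prop, ll_prop
        samples.append(theta)
    return np.array(samples)
```

Swapping in a plume-based likelihood and adding release time and strength as parameters would recover the structure of the full inference problem.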
Learning to read poses a strong challenge to the visual system. Years of expertise lead to a remarkable capacity to separate similar letters and encode their relative positions, thus distinguishing words such as FORM and FROM, invariantly over a large range of positions, sizes and fonts. How neural circuits achieve invariant word recognition remains unknown. Here, we address this issue by recycling deep neural network models initially trained for image recognition. We retrain them to recognize written words and then analyze how reading-specialized units emerge and operate across the successive layers. With literacy, a small subset of units becomes specialized for word recognition in the learned script, similar to the visual word form area (VWFA) in the human brain. We show that these units are sensitive to specific letter identities and their ordinal position from the left or the right of a word. The transition from retinotopic to ordinal position coding is achieved by a hierarchy of "space bigram" units that detect the position of a letter relative to a blank space and that pool across low- and high-frequency-sensitive units from early layers of the network. The proposed scheme provides a plausible neural code for written words in the VWFA, and leads to predictions for reading behavior, error patterns, and the neurophysiology of reading.
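The ordinal "space bigram" code described above can be sketched as a simple feature extractor: each letter is tagged with its identity plus its position counted from the left or right word edge (the blank space). The function below is an illustrative toy, not the network's actual units, but it shows why anagrams such as FORM and FROM receive distinct codes:

```python
def space_bigram_features(word, max_pos=4):
    """Toy ordinal 'space bigram' code: each letter is encoded by its
    identity together with its distance from the left ('L') or right
    ('R') word edge, up to max_pos positions in from each edge."""
    feats = set()
    n = len(word)
    for i, ch in enumerate(word):
        if i < max_pos:
            feats.add(('L', i, ch))          # position from the left space
        if n - 1 - i < max_pos:
            feats.add(('R', n - 1 - i, ch))  # position from the right space
    return feats
```

Because positions are anchored to the word edges rather than to retinal location, the same code is produced wherever the word appears in the visual field, giving position-invariant recognition for free.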
In nature, most microbial populations have complex spatial structures that can affect their evolution. Evolutionary graph theory predicts that some spatial structures modelled by placing individuals on the nodes of a graph affect the probability that a mutant will fix. Evolution experiments are beginning to explicitly address the impact of graph structures on mutant fixation. However, the assumptions of evolutionary graph theory differ from the conditions of modern evolution experiments, making the comparison between theory and experiment challenging. Here, we aim to bridge this gap by using our new model of spatially structured populations. This model considers connected subpopulations that lie on the nodes of a graph, and allows asymmetric migrations. It can handle large populations, and explicitly models serial passage events with migrations, thus closely mimicking experimental conditions. We analyze recent experiments in light of this model. We suggest useful parameter regimes for future experiments, and we make quantitative predictions for these experiments. In particular, we propose experiments to directly test our recent prediction that the star graph with asymmetric migrations suppresses natural selection and can accelerate mutant fixation or extinction, compared to a well-mixed population.
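The core mechanics of the model described above, demes on graph nodes, serial passage with dilution, and possibly asymmetric migrations, can be sketched in one update step. The function below is a simplified illustration (deterministic within-deme selection, binomial bottleneck sampling), not the paper's exact model:

```python
import numpy as np

def passage_with_migration(mutant_frac, deme_size, fitness, mig, rng):
    """One serial-passage step for subpopulations on the nodes of a graph
    (illustrative sketch). Growth is deterministic selection within each
    deme; dilution is binomial sampling at the bottleneck; `mig` is a
    row-stochastic migration matrix, mig[i, j] = fraction of deme i's
    bottleneck drawn from deme j, so asymmetric migrations are allowed."""
    x = np.asarray(mutant_frac, dtype=float)
    grown = fitness * x / (fitness * x + (1 - x))  # selection within demes
    pooled = mig @ grown                           # migration between demes
    return rng.binomial(deme_size, pooled) / deme_size
```

Iterating this map until all demes reach frequency 0 or 1 yields fixation probabilities and times; choosing `mig` to encode a star graph with unequal center-to-leaf and leaf-to-center rates is how the suppression-of-selection prediction mentioned above would be probed.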
Surveillance systems that monitor pathogen genome sequences are critical for rapidly detecting the introduction and emergence of pathogen variants. To evaluate how interactions between surveillance capacity, variant properties, and the epidemiological context influence the timeliness of pathogen variant detection, we developed a geographically explicit stochastic compartmental model to simulate the transmission of a novel SARS-CoV-2 variant in New York City. We measured the impact of (1) testing and sequencing volume, (2) geographic targeting of testing, (3) the timing and location of variant emergence, and (4) the relative variant transmissibility on detection speed and on the undetected disease burden. Improvements in detection times and reduction of undetected infections were driven primarily by increases in the number of sequenced samples. The relative transmissibility of the new variant and the epidemic context of variant emergence also influenced detection times, showing that individual surveillance strategies can result in a wide range of detection outcomes, depending on the underlying dynamics of the circulating variants. These findings help contextualize the design, interpretation, and trade-offs of genomic surveillance strategies of pandemic respiratory pathogens.
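The central quantity in this kind of analysis, time to first detection as a function of sequencing volume, can be illustrated with a stripped-down Monte Carlo: each day a fixed number of samples is sequenced uniformly at random from all cases, and the variant is detected the first day a variant case is among them. The inputs and function below are hypothetical, not the paper's NYC model:

```python
import numpy as np

def expected_detection_day(variant_daily, total_daily, seqs_per_day,
                           rng, n_sim=2000):
    """Monte Carlo estimate of the mean day on which a growing variant is
    first detected, given daily variant and total case counts and a fixed
    daily sequencing volume. Returns the horizon length if undetected."""
    days = len(variant_daily)
    hits = np.empty(n_sim)
    for s in range(n_sim):
        t = days                     # sentinel: undetected within horizon
        for d in range(days):
            p = min(variant_daily[d] / total_daily[d], 1.0)
            if rng.binomial(seqs_per_day, p) > 0:
                t = d
                break
        hits[s] = t
    return hits.mean()
```

Running this with a larger `seqs_per_day` shifts detection earlier, mirroring the abstract's finding that sequenced-sample volume is the primary driver of detection timeliness.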