首页 > 最新文献

PLoS Computational Biology最新文献

英文 中文
Information uncertainty influences learning strategy from sequentially delayed rewards. 信息不确定性影响顺序延迟奖励的学习策略。
IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-02-02 eCollection Date: 2026-02-01 DOI: 10.1371/journal.pcbi.1013879
Sean R Maulhardt, Alec Solway, Caroline J Charpentier

When receiving a reward after a sequence of multiple events, how do we determine which event caused the reward? This problem, known as temporal credit assignment, can be difficult for humans to solve given the temporal uncertainty in the environment. Research to date has attempted to isolate dimensions of delay and reward during decision-making, but algorithmic solutions to temporal learning problems and the effect of uncertainty on learning remain underexplored. To further our understanding, we adapted a reward learning task that creates a temporal credit assignment problem by combining sequentially delayed rewards, intervening events, and varying uncertainty via the amount of information presented during feedback. Using computational modeling, two learning strategies were developed: an eligibility trace, whereby previously selected actions are updated as a function of the temporal sequence, and a tabular update, whereby only systematically related past actions (rather than unrelated intervening events) are updated. We hypothesized that reduced information uncertainty would correlate with increased use of the tabular strategy, given the model's capacity to incorporate additional feedback information. Both models effectively learned the task, and predicted choices made by participants (N = 142) as well as specific behavioral signatures of credit assignment. Consistent with our hypothesis, the tabular model outperformed the eligibility model under low information uncertainty, as evidenced by more accurate predictions of participants' behavior and an increase in tabular weight. These findings provide new insights into the mechanisms implemented by humans to solve temporal credit assignment and adapt their strategy in varying environments.

当在一系列多个事件之后获得奖励时,我们如何确定哪个事件导致了奖励?这个问题被称为时间信用分配,由于环境的时间不确定性,人类很难解决这个问题。迄今为止的研究试图在决策过程中隔离延迟和奖励的维度,但对时间学习问题的算法解决方案和不确定性对学习的影响仍未得到充分探索。为了进一步理解,我们调整了一个奖励学习任务,该任务通过结合顺序延迟奖励、干预事件和通过反馈期间提供的信息量而变化的不确定性,创造了一个时间信用分配问题。使用计算建模,开发了两种学习策略:一种是资格跟踪,即以前选择的行动作为时间序列的函数更新;另一种是表格更新,即仅更新系统相关的过去行动(而不是不相关的干预事件)。我们假设,减少信息不确定性将与增加表格策略的使用相关,因为模型有能力纳入额外的反馈信息。两种模型都有效地学习了任务,并预测了参与者(N = 142)的选择以及信用分配的特定行为特征。与我们的假设一致,表格模型在低信息不确定性下优于资格模型,这可以通过更准确地预测参与者的行为和表格权重的增加来证明。这些发现为人类解决时间信用分配和适应不同环境策略的机制提供了新的见解。
{"title":"Information uncertainty influences learning strategy from sequentially delayed rewards.","authors":"Sean R Maulhardt, Alec Solway, Caroline J Charpentier","doi":"10.1371/journal.pcbi.1013879","DOIUrl":"10.1371/journal.pcbi.1013879","url":null,"abstract":"<p><p>When receiving a reward after a sequence of multiple events, how do we determine which event caused the reward? This problem, known as temporal credit assignment, can be difficult for humans to solve given the temporal uncertainty in the environment. Research to date has attempted to isolate dimensions of delay and reward during decision-making, but algorithmic solutions to temporal learning problems and the effect of uncertainty on learning remain underexplored. To further our understanding, we adapted a reward learning task that creates a temporal credit assignment problem by combining sequentially delayed rewards, intervening events, and varying uncertainty via the amount of information presented during feedback. Using computational modeling, two learning strategies were developed: an eligibility trace, whereby previously selected actions are updated as a function of the temporal sequence, and a tabular update, whereby only systematically related past actions (rather than unrelated intervening events) are updated. We hypothesized that reduced information uncertainty would correlate with increased use of the tabular strategy, given the model's capacity to incorporate additional feedback information. Both models effectively learned the task, and predicted choices made by participants (N = 142) as well as specific behavioral signatures of credit assignment. Consistent with our hypothesis, the tabular model outperformed the eligibility model under low information uncertainty, as evidenced by more accurate predictions of participants' behavior and an increase in tabular weight. These findings provide new insights into the mechanisms implemented by humans to solve temporal credit assignment and adapt their strategy in varying environments.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 2","pages":"e1013879"},"PeriodicalIF":3.6,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12885371/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146106951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiscale segmentation using hierarchical phase-contrast tomography and deep learning. 使用分层相衬断层扫描和深度学习的多尺度分割。
IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-02-02 eCollection Date: 2026-02-01 DOI: 10.1371/journal.pcbi.1013923
Yang Zhou, Shahab Aslani, Yousef Javanmardi, Joseph Brunet, David Stansby, Saskia Carroll, Alexandre Bellier, Maximilian Ackermann, Paul Tafforeau, Peter D Lee, Claire L Walsh

Biomedical systems span multiple spatial scales, encompassing tiny functional units to entire organs. Interpreting these systems through image segmentation requires the effective propagation and integration of information across different scales. However, most existing segmentation methods are optimised for single-scale imaging modalities, limiting their ability to capture and analyse small functional units throughout complete human organs. To facilitate multiscale biomedical image segmentation, we utilised Hierarchical Phase-Contrast Tomography (HiP-CT), an advanced imaging modality that can generate 3D multiscale datasets from high-resolution volumes of interest (VOIs) at ca. 1 [Formula: see text]/voxel to whole-organ scans at ca. 20 [Formula: see text]/voxel. Building on these hierarchical multiscale datasets, we developed a deep learning-based segmentation pipeline that is initially trained on manually annotated high-resolution HiP-CT data and then extended to lower-resolution whole-organ scans using pseudo-labels generated from high-resolution predictions and multiscale image registration. As a case study, we focused on glomeruli in human kidneys, benchmarking four 3D deep learning models for biomedical image segmentation on a manually annotated high-resolution dataset extracted from VOIs, at 2.58 to ca. 5 [Formula: see text]/voxel, of four human kidneys. Among them, nnUNet demonstrated the best performance, achieving an average test Dice score of 0.906, and was subsequently used as the baseline model for multiscale segmentation in the pipeline. Applying this pipeline to two low-resolution full-organ data at ca. 25 [Formula: see text]/voxel, the model identified 1,019,890 and 231,179 glomeruli in a 62-year-old donor without kidney diseases and a 94-year-old hypertensive donor, enabling comprehensive morphological analyses, including cortical spatial statistics and glomerular distributions, which aligned well with previous anatomical studies. Our results highlight the effectiveness of the proposed pipeline for segmenting small functional units in multiscale bioimaging datasets and suggest its broader applicability to other organ systems.

生物医学系统跨越多个空间尺度,包括微小的功能单位到整个器官。通过图像分割来解释这些系统需要有效地传播和整合不同尺度的信息。然而,大多数现有的分割方法针对单尺度成像模式进行了优化,限制了它们在整个完整人体器官中捕获和分析小功能单元的能力。为了促进多尺度生物医学图像分割,我们使用了分层相对比断层扫描(hict),这是一种先进的成像方式,可以从高分辨率感兴趣体积(voi)生成3D多尺度数据集,其速度约为1[公式:见文本]/体素到全器官扫描,速度约为20[公式:见文本]/体素。在这些分层多尺度数据集的基础上,我们开发了一种基于深度学习的分割管道,该管道最初在手动注释的高分辨率HiP-CT数据上进行训练,然后使用高分辨率预测和多尺度图像配准生成的伪标签扩展到低分辨率的全器官扫描。作为一个案例研究,我们专注于人类肾脏的肾小球,在人工注释的高分辨率数据集上对四种用于生物医学图像分割的3D深度学习模型进行基准测试,这些数据集从voi中提取,为2.58至ca. 5 /体素,四个人类肾脏。其中,nnUNet表现出最好的性能,平均测试Dice得分为0.906,随后被用作流水线中多尺度分割的基线模型。将该管道应用于两个低分辨率的全器官数据(约25 /体素),该模型在一名62岁无肾脏疾病的供体和一名94岁高血压供体中分别识别出1,019,890和231,179个肾小球,从而实现了全面的形态学分析,包括皮质空间统计和肾小球分布,这与之前的解剖学研究非常吻合。我们的研究结果强调了该方法在多尺度生物成像数据集中分割小功能单元的有效性,并表明其更广泛地适用于其他器官系统。
{"title":"Multiscale segmentation using hierarchical phase-contrast tomography and deep learning.","authors":"Yang Zhou, Shahab Aslani, Yousef Javanmardi, Joseph Brunet, David Stansby, Saskia Carroll, Alexandre Bellier, Maximilian Ackermann, Paul Tafforeau, Peter D Lee, Claire L Walsh","doi":"10.1371/journal.pcbi.1013923","DOIUrl":"10.1371/journal.pcbi.1013923","url":null,"abstract":"<p><p>Biomedical systems span multiple spatial scales, encompassing tiny functional units to entire organs. Interpreting these systems through image segmentation requires the effective propagation and integration of information across different scales. However, most existing segmentation methods are optimised for single-scale imaging modalities, limiting their ability to capture and analyse small functional units throughout complete human organs. To facilitate multiscale biomedical image segmentation, we utilised Hierarchical Phase-Contrast Tomography (HiP-CT), an advanced imaging modality that can generate 3D multiscale datasets from high-resolution volumes of interest (VOIs) at ca. 1 [Formula: see text]/voxel to whole-organ scans at ca. 20 [Formula: see text]/voxel. Building on these hierarchical multiscale datasets, we developed a deep learning-based segmentation pipeline that is initially trained on manually annotated high-resolution HiP-CT data and then extended to lower-resolution whole-organ scans using pseudo-labels generated from high-resolution predictions and multiscale image registration. As a case study, we focused on glomeruli in human kidneys, benchmarking four 3D deep learning models for biomedical image segmentation on a manually annotated high-resolution dataset extracted from VOIs, at 2.58 to ca. 5 [Formula: see text]/voxel, of four human kidneys. Among them, nnUNet demonstrated the best performance, achieving an average test Dice score of 0.906, and was subsequently used as the baseline model for multiscale segmentation in the pipeline. Applying this pipeline to two low-resolution full-organ data at ca. 25 [Formula: see text]/voxel, the model identified 1,019,890 and 231,179 glomeruli in a 62-year-old donor without kidney diseases and a 94-year-old hypertensive donor, enabling comprehensive morphological analyses, including cortical spatial statistics and glomerular distributions, which aligned well with previous anatomical studies. Our results highlight the effectiveness of the proposed pipeline for segmenting small functional units in multiscale bioimaging datasets and suggest its broader applicability to other organ systems.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 2","pages":"e1013923"},"PeriodicalIF":3.6,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12880754/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146107023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning genetic perturbation effects with variational causal inference. 用变分因果推理学习遗传扰动效应。
IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-02-02 eCollection Date: 2026-02-01 DOI: 10.1371/journal.pcbi.1013194
Emily Liu, Jiaqi Zhang, Caroline Uhler

Advances in sequencing technologies have enhanced the understanding of gene regulation in cells. In particular, Perturb-seq has enabled high-resolution profiling of the transcriptomic response to genetic perturbations at the single-cell level. This understanding has implications in functional genomics and potentially for identifying therapeutic targets. Various computational models have been developed to predict perturbational effects. While deep learning models excel at interpolating observed perturbational data, they tend to overfit in the lack of enough data and may not generalize well to unseen perturbations. In contrast, mechanistic models, such as linear causal models based on gene regulatory networks, hold greater potential for extrapolation, as they encapsulate regulatory information that can predict responses to unseen perturbations. However, their application has been limited to small studies due to overly simplistic assumptions, making them less effective in handling noisy, large-scale single-cell data. We propose a hybrid approach that combines a mechanistic causal model with variational deep learning, termed Single Cell Causal Variational Autoencoder (SCCVAE). The mechanistic model employs a learned regulatory network to represent perturbational changes as shift interventions that propagate through the learned network. SCCVAE integrates this mechanistic causal model into a variational autoencoder, generating rich, comprehensive transcriptomic responses. Our results indicate that SCCVAE exhibits superior performance over current state-of-the-art baselines for extrapolating to predict unseen perturbational responses. Additionally, for the observed perturbations, the latent space learned by SCCVAE allows for the identification of functional perturbation modules and simulation of single-gene knockdown experiments of varying penetrance, presenting a robust tool for interpreting and interpolating perturbational responses at the single-cell level.

测序技术的进步提高了对细胞中基因调控的认识。特别是,Perturb-seq能够在单细胞水平上对遗传扰动的转录组反应进行高分辨率分析。这种理解对功能基因组学和潜在的治疗靶点的识别具有重要意义。已经开发了各种计算模型来预测微扰效应。虽然深度学习模型擅长插值观察到的扰动数据,但它们往往在缺乏足够数据的情况下过度拟合,并且可能无法很好地推广到看不见的扰动。相比之下,机械模型,如基于基因调控网络的线性因果模型,具有更大的外推潜力,因为它们封装了可以预测对看不见的扰动的反应的调控信息。然而,由于过于简单的假设,它们的应用仅限于小型研究,这使得它们在处理嘈杂的大规模单细胞数据时效率较低。我们提出了一种混合方法,将机械因果模型与变分深度学习相结合,称为单细胞因果变分自编码器(SCCVAE)。机制模型采用一个习得的调节网络,将扰动变化表示为通过习得网络传播的移位干预。SCCVAE将这种机制因果模型集成到变分自编码器中,生成丰富、全面的转录组反应。我们的研究结果表明,SCCVAE在外推预测看不见的扰动响应方面表现出优于当前最先进的基线的性能。此外,对于观察到的扰动,SCCVAE学习的潜在空间允许识别功能扰动模块和模拟不同外显率的单基因敲除实验,为解释和插值单细胞水平的扰动响应提供了一个强大的工具。
{"title":"Learning genetic perturbation effects with variational causal inference.","authors":"Emily Liu, Jiaqi Zhang, Caroline Uhler","doi":"10.1371/journal.pcbi.1013194","DOIUrl":"10.1371/journal.pcbi.1013194","url":null,"abstract":"<p><p>Advances in sequencing technologies have enhanced the understanding of gene regulation in cells. In particular, Perturb-seq has enabled high-resolution profiling of the transcriptomic response to genetic perturbations at the single-cell level. This understanding has implications in functional genomics and potentially for identifying therapeutic targets. Various computational models have been developed to predict perturbational effects. While deep learning models excel at interpolating observed perturbational data, they tend to overfit in the lack of enough data and may not generalize well to unseen perturbations. In contrast, mechanistic models, such as linear causal models based on gene regulatory networks, hold greater potential for extrapolation, as they encapsulate regulatory information that can predict responses to unseen perturbations. However, their application has been limited to small studies due to overly simplistic assumptions, making them less effective in handling noisy, large-scale single-cell data. We propose a hybrid approach that combines a mechanistic causal model with variational deep learning, termed Single Cell Causal Variational Autoencoder (SCCVAE). The mechanistic model employs a learned regulatory network to represent perturbational changes as shift interventions that propagate through the learned network. SCCVAE integrates this mechanistic causal model into a variational autoencoder, generating rich, comprehensive transcriptomic responses. Our results indicate that SCCVAE exhibits superior performance over current state-of-the-art baselines for extrapolating to predict unseen perturbational responses. Additionally, for the observed perturbations, the latent space learned by SCCVAE allows for the identification of functional perturbation modules and simulation of single-gene knockdown experiments of varying penetrance, presenting a robust tool for interpreting and interpolating perturbational responses at the single-cell level.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 2","pages":"e1013194"},"PeriodicalIF":3.6,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12885384/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146106998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cluster dispersal shapes microbial diversity during community assembly. 群落聚集过程中,群落分散形成微生物多样性。
IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-02-02 eCollection Date: 2026-02-01 DOI: 10.1371/journal.pcbi.1013918
Loïc Marrec, Sonja Lehtinen

Identifying the drivers of diversity remains a central challenge in microbial ecology. In microbiota, within-community diversity is often linked to host health, which makes it all the more important to understand. Since many communities assemble de novo, microbial dispersal plays a critical role in shaping community structure during the early stages of assembly. While theoretical models typically assume microbes disperse individually, this overlooks cases where microbes disperse in clusters, such as, for example, during host feeding. Here, we investigate how cluster dispersal impacts species richness, between-community dissimilarity, and species abundance in the initial steps of microbial community assembly. We developed a model in which microbes disperse from a pool into communities as clusters and then replicate locally. Using both analytical and numerical approaches, we show that cluster dispersal promotes community homogenization by increasing within-community richness and reducing dissimilarity across communities, even at low dispersal rates. Moreover, it modulates the influence of local selection on microbial community assembly and, consequently, on species abundance. Our results demonstrate that cluster dispersal has distinct effects from simply increasing the dispersal rate. This work reveals new evidence for the role of cluster dispersal in the early dynamics of microbial community assembly.

确定多样性的驱动因素仍然是微生物生态学的核心挑战。在微生物群中,群落内的多样性往往与宿主的健康有关,这使得了解它变得更加重要。由于许多群落都是从头开始组装的,因此在组装的早期阶段,微生物的扩散在形成群落结构方面起着关键作用。虽然理论模型通常假设微生物是单独分散的,但这忽略了微生物成群分散的情况,例如,在宿主摄食期间。在这里,我们研究了群落扩散如何影响微生物群落组装初始阶段的物种丰富度、群落间差异和物种丰度。我们开发了一个模型,在这个模型中,微生物从一个池中分散成群体,然后在当地复制。通过分析和数值方法,我们发现即使在低扩散率下,集群扩散也会通过增加群落内部丰富度和减少群落间的差异性来促进群落同质化。此外,它调节了局部选择对微生物群落聚集的影响,从而调节了物种丰度。我们的研究结果表明,集群扩散与简单地增加扩散速率具有明显的效果。这项工作揭示了新的证据,在微生物群落组装的早期动态集群分散的作用。
{"title":"Cluster dispersal shapes microbial diversity during community assembly.","authors":"Loïc Marrec, Sonja Lehtinen","doi":"10.1371/journal.pcbi.1013918","DOIUrl":"10.1371/journal.pcbi.1013918","url":null,"abstract":"<p><p>Identifying the drivers of diversity remains a central challenge in microbial ecology. In microbiota, within-community diversity is often linked to host health, which makes it all the more important to understand. Since many communities assemble de novo, microbial dispersal plays a critical role in shaping community structure during the early stages of assembly. While theoretical models typically assume microbes disperse individually, this overlooks cases where microbes disperse in clusters, such as, for example, during host feeding. Here, we investigate how cluster dispersal impacts species richness, between-community dissimilarity, and species abundance in the initial steps of microbial community assembly. We developed a model in which microbes disperse from a pool into communities as clusters and then replicate locally. Using both analytical and numerical approaches, we show that cluster dispersal promotes community homogenization by increasing within-community richness and reducing dissimilarity across communities, even at low dispersal rates. Moreover, it modulates the influence of local selection on microbial community assembly and, consequently, on species abundance. Our results demonstrate that cluster dispersal has distinct effects from simply increasing the dispersal rate. This work reveals new evidence for the role of cluster dispersal in the early dynamics of microbial community assembly.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 2","pages":"e1013918"},"PeriodicalIF":3.6,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12885383/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146106981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A 2D Gabor-wavelet baseline model out-performs a 3D surface model in scene-responsive cortex. 在场景反应皮层中,二维gabor -小波基线模型优于三维表面模型。
IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-02-02 eCollection Date: 2026-02-01 DOI: 10.1371/journal.pcbi.1013888
Anna Shafer-Skelton, Timothy F Brady, John T Serences

Understanding 3D representations of spatial information, particularly in naturalistic scenes, remains a significant challenge in vision science. This is largely because of conceptual difficulties in disentangling higher-level 3D information from co-occurring features and cues (e.g., the 3D shape of a scene image is necessarily defined by "low-level" spatial frequency and orientation information). Recent work has employed newer models and analysis techniques that attempt to mitigate these difficulties within a model-comparison framework. For example, one such study reported 3D-surface features were uniquely present in areas OPA, PPA, and MPA/RSC (areas typically referred to as 'scene-selective'), above and beyond a Gabor-wavelet baseline model. Here, we tested whether these findings generalized to a new stimulus set that, on average, dissociated static Gabor-wavelet baseline features from 3D scene-surface features. Surprisingly, we found evidence that a Gabor-wavelet baseline model-commonly thought of as a "low-level" or "2D" model-better fit voxel responses in areas OPA, PPA and MPA/RSC compared to a model with 3D-surface information. We highlight that this difference in results could be due to differences in the baseline conditions used across studies. These findings emphasize that much of the information in "scene-selective" regions-potentially even information about 3D surfaces-may be in the form of spatial frequency and orientation information often considered 2D or low-level. Disentangling lower-level and higher-level visual information is a continuing fundamental challenge for model-comparison approaches in visual cognition, and it motivates future work investigating which visual features could cue higher-level properties in our real-world visual experience-both within and beyond current model comparison frameworks.

理解空间信息的三维表示,特别是在自然场景中,仍然是视觉科学的重大挑战。这主要是因为从共同发生的特征和线索(例如,场景图像的3D形状必须由“低级”空间频率和方向信息定义)中分离高级3D信息的概念困难。最近的工作采用了较新的模型和分析技术,试图在模型比较框架内减轻这些困难。例如,一项这样的研究报告称,3d表面特征在OPA、PPA和MPA/RSC区域(通常被称为“场景选择性”区域)中是唯一存在的,超出了gabor -小波基线模型。在这里,我们测试了这些发现是否推广到一个新的刺激集,平均而言,将静态gabor -小波基线特征与3D场景表面特征分离开来。令人惊讶的是,我们发现的证据表明,与具有3d表面信息的模型相比,通常被认为是“低水平”或“2D”模型的Gabor-wavelet基线模型更适合OPA, PPA和MPA/RSC区域的体素响应。我们强调,结果的差异可能是由于不同研究中使用的基线条件的差异。这些发现强调了“场景选择”区域中的大部分信息——甚至可能是关于3D表面的信息——可能以空间频率和方向信息的形式存在,通常被认为是2D或低级别的。解开低层次和高层次视觉信息是视觉认知中模型比较方法的一个持续的基本挑战,它激发了未来的工作,研究哪些视觉特征可以在我们的现实世界视觉经验中提示更高层次的属性——无论是在当前的模型比较框架内还是之外。
{"title":"A 2D Gabor-wavelet baseline model out-performs a 3D surface model in scene-responsive cortex.","authors":"Anna Shafer-Skelton, Timothy F Brady, John T Serences","doi":"10.1371/journal.pcbi.1013888","DOIUrl":"10.1371/journal.pcbi.1013888","url":null,"abstract":"<p><p>Understanding 3D representations of spatial information, particularly in naturalistic scenes, remains a significant challenge in vision science. This is largely because of conceptual difficulties in disentangling higher-level 3D information from co-occurring features and cues (e.g., the 3D shape of a scene image is necessarily defined by \"low-level\" spatial frequency and orientation information). Recent work has employed newer models and analysis techniques that attempt to mitigate these difficulties within a model-comparison framework. For example, one such study reported 3D-surface features were uniquely present in areas OPA, PPA, and MPA/RSC (areas typically referred to as 'scene-selective'), above and beyond a Gabor-wavelet baseline model. Here, we tested whether these findings generalized to a new stimulus set that, on average, dissociated static Gabor-wavelet baseline features from 3D scene-surface features. Surprisingly, we found evidence that a Gabor-wavelet baseline model-commonly thought of as a \"low-level\" or \"2D\" model-better fit voxel responses in areas OPA, PPA and MPA/RSC compared to a model with 3D-surface information. We highlight that this difference in results could be due to differences in the baseline conditions used across studies. These findings emphasize that much of the information in \"scene-selective\" regions-potentially even information about 3D surfaces-may be in the form of spatial frequency and orientation information often considered 2D or low-level. Disentangling lower-level and higher-level visual information is a continuing fundamental challenge for model-comparison approaches in visual cognition, and it motivates future work investigating which visual features could cue higher-level properties in our real-world visual experience-both within and beyond current model comparison frameworks.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 2","pages":"e1013888"},"PeriodicalIF":3.6,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12880747/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146106966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Capturing individual variation in children's electroencephalograms during nREM sleep. 捕捉儿童在非快速眼动睡眠期间脑电图的个体差异。
IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-30 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pcbi.1013931
Verna Heikkinen, Susanne Merz, Riitta Salmelin, Sampsa Vanhatalo, Leena Lauronen, Mia Liljeström, Hanna Renvall

Human brain dynamics are highly unique between individuals: functional neuroimaging studies have recently described functional features that can be used as neural fingerprints. However, the stability of these fingerprints is affected by aging and disease. As such, the stability of brain fingerprints may be a useful metric when studying normal and pathological neurodevelopment. Before examining clinically relevant deviations, the individual stability and variation of neuroimaging features across brain maturation in normally developing children need to be addressed with real clinical data. Here we applied Bayesian reduced-rank regression (BRRR) to extract low-dimensional representations of electroencephalography (EEG) power spectra measured during different non-REM sleep stages (N1 and N2) from 782 normally developing children aged between 6 weeks to 19 years. The representations learned within specific sleep stages successfully separated between subjects and generalized across sleep stages. Fingerprint stability increased with the age of the subjects. Compared to correlation-based fingerprinting methods, the BRRR model performed better, especially in fingerprinting across sleep stages, highlighting the usefulness of dimensionality reduction when the noise and signal of interest are correlated. While further studies are needed to address the possible non-linear maturation effects over developmental periods, our results demonstrate the existence of stable within-session neurofunctional fingerprints in pediatric populations.

人类大脑动力学在个体之间是高度独特的:功能神经成像研究最近描述了可以用作神经指纹的功能特征。然而,这些指纹的稳定性受到年龄和疾病的影响。因此,在研究正常和病理神经发育时,脑指纹的稳定性可能是一个有用的指标。在检查临床相关偏差之前,正常发育儿童脑成熟过程中神经影像学特征的个体稳定性和变化需要用真实的临床数据来解决。本文采用贝叶斯降秩回归(BRRR)方法提取了782名年龄在6周至19岁之间的正常发育儿童在不同非快速眼动睡眠阶段(N1和N2)的脑电图功率谱的低维表示。在特定睡眠阶段学习的表征成功地在受试者之间分离,并在整个睡眠阶段进行概括。指纹稳定性随受试者年龄的增长而增加。与基于相关性的指纹识别方法相比,BRRR模型表现更好,特别是在跨睡眠阶段的指纹识别中,这突出了当噪声和感兴趣的信号相关时降维的有效性。虽然需要进一步的研究来解决在发育期间可能的非线性成熟效应,但我们的研究结果表明,在儿科人群中存在稳定的会话内神经功能指纹。
{"title":"Capturing individual variation in children's electroencephalograms during nREM sleep.","authors":"Verna Heikkinen, Susanne Merz, Riitta Salmelin, Sampsa Vanhatalo, Leena Lauronen, Mia Liljeström, Hanna Renvall","doi":"10.1371/journal.pcbi.1013931","DOIUrl":"10.1371/journal.pcbi.1013931","url":null,"abstract":"<p><p>Human brain dynamics are highly unique between individuals: functional neuroimaging studies have recently described functional features that can be used as neural fingerprints. However, the stability of these fingerprints is affected by aging and disease. As such, the stability of brain fingerprints may be a useful metric when studying normal and pathological neurodevelopment. Before examining clinically relevant deviations, the individual stability and variation of neuroimaging features across brain maturation in normally developing children need to be addressed with real clinical data. Here we applied Bayesian reduced-rank regression (BRRR) to extract low-dimensional representations of electroencephalography (EEG) power spectra measured during different non-REM sleep stages (N1 and N2) from 782 normally developing children aged between 6 weeks to 19 years. The representations learned within specific sleep stages successfully separated between subjects and generalized across sleep stages. Fingerprint stability increased with the age of the subjects. Compared to correlation-based fingerprinting methods, the BRRR model performed better, especially in fingerprinting across sleep stages, highlighting the usefulness of dimensionality reduction when the noise and signal of interest are correlated. While further studies are needed to address the possible non-linear maturation effects over developmental periods, our results demonstrate the existence of stable within-session neurofunctional fingerprints in pediatric populations.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013931"},"PeriodicalIF":3.6,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12885382/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146093860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TEvarSim: A genome simulator for transposable element (TE) variants. TEvarSim:一个转座因子(TE)变异的基因组模拟器。
IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-30 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pcbi.1013933
Jian Miao, Dawei Li

Transposable element (TE) variants, the presence or absence of TE sequences such as LINE-1, Alu, SVA, and endogenous retroviruses, are a major source of genomic diversity and play critical roles in human health, evolution, and disease. As interest in TE variants grows, developing related methods and tools for detection has become increasingly important. However, rigorous benchmarking of TE variant detection methods remains limited due to the lack of accurate and scalable TE variant simulation platforms and the absence of reliable ground truth data. Here, we developed TEvarSim, a novel TE variant simulator that generates TE-containing genomic data in multiple formats, including genomes, short- and long-read sequencing data, and VCF files. TEvarSim supports both random and real-world TE insertions and deletions, including variants derived from pangenome graphs. It can rapidly simulate hundreds to thousands of synthetic chromosomes or genomes and model natural variation at the haplotype, individual, and population levels, making it well suited for large-scale studies. In addition, TEvarSim can directly compare simulated VCF files with TEs reported by TE detection tools, streamlining the benchmarking of TE genotyping methods. TEvarSim provides an all-in-one toolkit for simulating, evaluating, and improving TE variant detection, advancing our ability to accurately study TEs in health and disease in various species.

转座因子(TE)变异,如LINE-1、Alu、SVA和内源性逆转录病毒等TE序列的存在或缺失,是基因组多样性的主要来源,在人类健康、进化和疾病中发挥着关键作用。随着人们对TE变异的兴趣日益增长,开发相关的检测方法和工具变得越来越重要。然而,由于缺乏准确和可扩展的TE变体仿真平台以及缺乏可靠的地面真值数据,严格的TE变体检测方法的基准测试仍然有限。在这里,我们开发了TEvarSim,这是一种新型的TE变体模拟器,可以生成多种格式的包含TE的基因组数据,包括基因组、短读和长读测序数据以及VCF文件。TEvarSim支持随机和真实的TE插入和删除,包括来自泛基因组图的变体。它可以快速模拟数百到数千个合成染色体或基因组,并在单倍型,个体和群体水平上模拟自然变异,使其非常适合大规模研究。此外,TEvarSim可以直接将模拟的VCF文件与TE检测工具报告的TE进行比较,简化了TE基因分型方法的基准测试。TEvarSim提供了一个模拟、评估和改进TE变异检测的一体化工具包,提高了我们准确研究TE在不同物种健康和疾病中的能力。
{"title":"TEvarSim: A genome simulator for transposable element (TE) variants.","authors":"Jian Miao, Dawei Li","doi":"10.1371/journal.pcbi.1013933","DOIUrl":"10.1371/journal.pcbi.1013933","url":null,"abstract":"<p><p>Transposable element (TE) variants, the presence or absence of TE sequences such as LINE-1, Alu, SVA, and endogenous retroviruses, are a major source of genomic diversity and play critical roles in human health, evolution, and disease. As interest in TE variants grows, developing related methods and tools for detection has become increasingly important. However, rigorous benchmarking of TE variant detection methods remains limited due to the lack of accurate and scalable TE variant simulation platforms and the absence of reliable ground truth data. Here, we developed TEvarSim, a novel TE variant simulator that generates TE-containing genomic data in multiple formats, including genomes, short- and long-read sequencing data, and VCF files. TEvarSim supports both random and real-world TE insertions and deletions, including variants derived from pangenome graphs. It can rapidly simulate hundreds to thousands of synthetic chromosomes or genomes and model natural variation at the haplotype, individual, and population levels, making it well suited for large-scale studies. In addition, TEvarSim can directly compare simulated VCF files with TEs reported by TE detection tools, streamlining the benchmarking of TE genotyping methods. TEvarSim provides an all-in-one toolkit for simulating, evaluating, and improving TE variant detection, advancing our ability to accurately study TEs in health and disease in various species.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013933"},"PeriodicalIF":3.6,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12875575/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146093892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SpaConTDS: A multimodal contrastive learning framework for identifying spatial domains by applying tuple disturbing strategy. SpaConTDS:一种多模态对比学习框架,用于应用元组干扰策略识别空间域。
IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-29 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pcbi.1013893
Ruiwen Xu, Xiaoqing Cheng, Waiki Ching, Siyao Wu, Yuanben Zhang, Yidan Zhang

The rational utilization of multimodal spatial transcriptomics (ST) data enables accurate identification of spatial domains, which is essential for investigating cellular structure and functions. In this study, we proposed SpaConTDS, a novel framework that integrates reinforcement learning with self-supervised multimodal contrastive learning. SpaConTDS generates positive and negative samples through data augmentation and a pseudo-label tuple perturbation strategy, enabling the learning of fused representations that capture global semantics and cross-modal interactions. The model's hyper-parameters are dynamically optimized using reinforcement learning. Extensive experiments across various resolutions and platforms demonstrate that SpaConTDS achieves state-of-the-art accuracy in spatial domain identification and outperforms existing methods in downstream tasks such as denoising, trajectory inference, and UMAP visualization. Moreover, SpaConTDS effectively integrates multiple tissue sections and corrects batch effects without requiring prior alignment. Compared to existing approaches, SpaConTDS offers more robust fused representations of multimodal data, providing researchers with a flexible and powerful tool for a wide range of spatial transcriptomics analyses.

合理利用多模态空间转录组学(ST)数据可以准确识别空间域,这对研究细胞结构和功能至关重要。在这项研究中,我们提出了SpaConTDS,这是一个将强化学习与自监督多模态对比学习相结合的新框架。SpaConTDS通过数据增强和伪标签元组扰动策略生成正样本和负样本,从而能够学习捕获全局语义和跨模态交互的融合表示。模型的超参数使用强化学习进行动态优化。在各种分辨率和平台上进行的大量实验表明,SpaConTDS在空间域识别方面达到了最先进的精度,并且在下游任务(如去噪、轨迹推断和UMAP可视化)中优于现有方法。此外,SpaConTDS有效地整合了多个组织切片,并在不需要事先对齐的情况下纠正批量效果。与现有的方法相比,SpaConTDS提供了更强大的多模态数据融合表示,为研究人员提供了一个灵活而强大的工具,用于广泛的空间转录组学分析。
{"title":"SpaConTDS: A multimodal contrastive learning framework for identifying spatial domains by applying tuple disturbing strategy.","authors":"Ruiwen Xu, Xiaoqing Cheng, Waiki Ching, Siyao Wu, Yuanben Zhang, Yidan Zhang","doi":"10.1371/journal.pcbi.1013893","DOIUrl":"10.1371/journal.pcbi.1013893","url":null,"abstract":"<p><p>The rational utilization of multimodal spatial transcriptomics (ST) data enables accurate identification of spatial domains, which is essential for investigating cellular structure and functions. In this study, we proposed SpaConTDS, a novel framework that integrates reinforcement learning with self-supervised multimodal contrastive learning. SpaConTDS generates positive and negative samples through data augmentation and a pseudo-label tuple perturbation strategy, enabling the learning of fused representations that capture global semantics and cross-modal interactions. The model's hyper-parameters are dynamically optimized using reinforcement learning. Extensive experiments across various resolutions and platforms demonstrate that SpaConTDS achieves state-of-the-art accuracy in spatial domain identification and outperforms existing methods in downstream tasks such as denoising, trajectory inference, and UMAP visualization. Moreover, SpaConTDS effectively integrates multiple tissue sections and corrects batch effects without requiring prior alignment. Compared to existing approaches, SpaConTDS offers more robust fused representations of multimodal data, providing researchers with a flexible and powerful tool for a wide range of spatial transcriptomics analyses.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013893"},"PeriodicalIF":3.6,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12854462/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146087059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Persistence diagrams as morphological signatures of cells: A method to measure and compare cells within a population. 作为细胞形态特征的持久性图:一种在种群中测量和比较细胞的方法。
IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-28 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pcbi.1013890
Yossi Bokor Bleile, Pooja Yadav, Patrice Koehl, Florian Rehfeldt

Quantifying cell morphology is central to understanding cellular regulation, fate, and heterogeneity, yet conventional image-based analyses often struggle with diverse or irregular shapes. We present a computational framework that uses topological data analysis to characterise and compare single-cell morphologies from fluorescence microscopy. Each cell is represented by its contour together with the position of its nucleus, from which we construct a filtration based on a radial distance function and derive a persistence diagram encoding the shape's topological evolution. The similarity between two cells is quantified using the 2-Wasserstein distance between their diagrams, yielding a shape distance we call the PH distance. We apply this method to two representative experimental systems-primary human mesenchymal stem cells (hMSCs) and HeLa cells-and show that PH distances enable the detection of outliers in those systems, the identification of sub-populations, and the quantification of shape heterogeneity. We benchmark PH against three established contour-based distances (aspect ratio, Fourier descriptors, and elastic shape analysis) and show that PH offers better separation between cell types and greater robustness when clustering heterogeneous populations. Together, these results demonstrate that persistent-homology-based signatures provide a principled and sensitive approach for analysing cell morphology in settings where traditional geometric or image-based descriptors are insufficient.

定量细胞形态是理解细胞调控、命运和异质性的核心,然而传统的基于图像的分析经常与不同或不规则的形状作斗争。我们提出了一个计算框架,使用拓扑数据分析来表征和比较荧光显微镜的单细胞形态。每个细胞由其轮廓及其核的位置表示,并以此构建基于径向距离函数的过滤,并推导出编码形状拓扑演化的持久性图。两个细胞之间的相似性使用图之间的2-Wasserstein距离来量化,产生我们称之为PH距离的形状距离。我们将这种方法应用于两个具有代表性的实验系统-原代人间充质干细胞(hMSCs)和HeLa细胞-并表明PH距离能够检测这些系统中的异常值,识别亚群,并量化形状异质性。我们将PH值与三种已建立的基于轮廓的距离(长宽比、傅立叶描述符和弹性形状分析)进行基准测试,并表明PH值在异质种群聚类时提供了更好的细胞类型分离和更强的鲁棒性。总之,这些结果表明,基于持续同源的签名提供了一种原则和敏感的方法,用于在传统的几何或基于图像的描述符不足的情况下分析细胞形态。
{"title":"Persistence diagrams as morphological signatures of cells: A method to measure and compare cells within a population.","authors":"Yossi Bokor Bleile, Pooja Yadav, Patrice Koehl, Florian Rehfeldt","doi":"10.1371/journal.pcbi.1013890","DOIUrl":"10.1371/journal.pcbi.1013890","url":null,"abstract":"<p><p>Quantifying cell morphology is central to understanding cellular regulation, fate, and heterogeneity, yet conventional image-based analyses often struggle with diverse or irregular shapes. We present a computational framework that uses topological data analysis to characterise and compare single-cell morphologies from fluorescence microscopy. Each cell is represented by its contour together with the position of its nucleus, from which we construct a filtration based on a radial distance function and derive a persistence diagram encoding the shape's topological evolution. The similarity between two cells is quantified using the 2-Wasserstein distance between their diagrams, yielding a shape distance we call the PH distance. We apply this method to two representative experimental systems-primary human mesenchymal stem cells (hMSCs) and HeLa cells-and show that PH distances enable the detection of outliers in those systems, the identification of sub-populations, and the quantification of shape heterogeneity. We benchmark PH against three established contour-based distances (aspect ratio, Fourier descriptors, and elastic shape analysis) and show that PH offers better separation between cell types and greater robustness when clustering heterogeneous populations. Together, these results demonstrate that persistent-homology-based signatures provide a principled and sensitive approach for analysing cell morphology in settings where traditional geometric or image-based descriptors are insufficient.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013890"},"PeriodicalIF":3.6,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12871990/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146113939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical analysis of RNA secondary structures with pseudoknots based on sections. 基于切片的假结RNA二级结构的层次分析。
IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-27 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pcbi.1013904
Ryota Masuki, Donn Liew, Ee Hou Yong

Predicting RNA structures containing pseudoknots remains computationally challenging due to high processing costs and complexity. While standard methods for pseudoknot prediction require O(N6) time complexity, we present a hierarchical approach that significantly reduces computational cost while maintaining prediction accuracy. Our method analyzes RNA structures by dividing them into contiguous regions of unpaired bases ("sections") derived from known secondary structures. We examine pseudoknot interactions between sections using a nearest-neighbor energy model with dynamic programming. Our algorithm scales as [Formula: see text], offering substantial computational advantages over existing global prediction methods. Analysis of 726 transfer messenger RNA and 454 Ribonuclease P RNA sequences reveals that biologically relevant pseudoknots are highly concentrated among section pairs with large minimum free energy (MFE) gain. Over 90% of connected section pairs appear within just the top 3% of section pairs ranked by MFE gain. For 2-clusters, our method achieves high prediction accuracy with sensitivity exceeding 0.9 and positive predictive value above 0.8. For 3-clusters, we discovered asymmetric behavior where "former" section pairs (formed early in the sequence) are predicted accurately, while "latter" section pairs do not follow local energy predictions. This asymmetry suggests that complex pseudoknot formation follows sequential co-transcriptional folding rather than global energy minimization, providing insights into RNA folding dynamics.

由于高处理成本和复杂性,预测含有假结的RNA结构在计算上仍然具有挑战性。虽然假结预测的标准方法需要0 (N6)时间复杂度,但我们提出了一种分层方法,在保持预测精度的同时显着降低了计算成本。我们的方法通过将RNA结构划分为来自已知二级结构的未配对碱基(“片段”)的连续区域来分析RNA结构。我们使用具有动态规划的最近邻能量模型来检查截面之间的伪结相互作用。我们的算法规模为[公式:见文本],与现有的全局预测方法相比,提供了实质性的计算优势。对726条传递信使RNA和454条核糖核酸酶P RNA序列的分析表明,具有生物学相关性的假结高度集中在最小自由能(MFE)增益较大的区段对上。超过90%的连接区段对出现在MFE增益排名前3%的区段对中。对于2聚类,我们的方法具有较高的预测精度,灵敏度超过0.9,阳性预测值在0.8以上。对于3-簇,我们发现了不对称行为,其中“前”部分对(在序列早期形成)被准确预测,而“后”部分对不遵循局部能量预测。这种不对称性表明复杂的假结形成遵循顺序的共转录折叠,而不是全局能量最小化,为RNA折叠动力学提供了见解。
{"title":"Hierarchical analysis of RNA secondary structures with pseudoknots based on sections.","authors":"Ryota Masuki, Donn Liew, Ee Hou Yong","doi":"10.1371/journal.pcbi.1013904","DOIUrl":"10.1371/journal.pcbi.1013904","url":null,"abstract":"<p><p>Predicting RNA structures containing pseudoknots remains computationally challenging due to high processing costs and complexity. While standard methods for pseudoknot prediction require O(N6) time complexity, we present a hierarchical approach that significantly reduces computational cost while maintaining prediction accuracy. Our method analyzes RNA structures by dividing them into contiguous regions of unpaired bases (\"sections\") derived from known secondary structures. We examine pseudoknot interactions between sections using a nearest-neighbor energy model with dynamic programming. Our algorithm scales as [Formula: see text], offering substantial computational advantages over existing global prediction methods. Analysis of 726 transfer messenger RNA and 454 Ribonuclease P RNA sequences reveals that biologically relevant pseudoknots are highly concentrated among section pairs with large minimum free energy (MFE) gain. Over 90% of connected section pairs appear within just the top 3% of section pairs ranked by MFE gain. For 2-clusters, our method achieves high prediction accuracy with sensitivity exceeding 0.9 and positive predictive value above 0.8. For 3-clusters, we discovered asymmetric behavior where \"former\" section pairs (formed early in the sequence) are predicted accurately, while \"latter\" section pairs do not follow local energy predictions. This asymmetry suggests that complex pseudoknot formation follows sequential co-transcriptional folding rather than global energy minimization, providing insights into RNA folding dynamics.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013904"},"PeriodicalIF":3.6,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12858078/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146066246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
PLoS Computational Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1