首页 > 最新文献

ArXiv最新文献

英文 中文
Cedalion Tutorial: A Python-based framework for comprehensive analysis of multimodal fNIRS & DOT from the lab to the everyday world. Cedalion教程:一个基于python的框架,用于从实验室到日常世界的多模态fNIRS和DOT综合分析。
Pub Date : 2026-01-09
E Middell, L Carlton, S Moradi, T Codina, T Fischer, J Cutler, S Kelley, J Behrendt, T Dissanayake, N Harmening, M A Yücel, D A Boas, A von Lühmann

Functional near-infrared spectroscopy (fNIRS) and diffuse optical tomography (DOT) are rapidly evolving toward wearable, multimodal, and data-driven, AI-supported neuroimaging in the everyday world. However, current analytical tools are fragmented across platforms, limiting reproducibility, interoperability, and integration with modern machine learning (ML) workflows. Cedalion is a Python-based open-source framework designed to unify advanced model-based and data-driven analysis of multimodal fNIRS and DOT data within a reproducible, extensible, and community-driven environment. Cedalion integrates forward modelling, photogrammetric optode co-registration, signal processing, GLM Analysis, DOT image reconstruction, and ML-based data-driven methods within a single standardized architecture based on the Python ecosystem. It adheres to SNIRF and BIDS standards, supports cloud-executable Jupyter notebooks, and provides containerized workflows for scalable, fully reproducible analysis pipelines that can be provided alongside original research publications. Cedalion connects established optical-neuroimaging pipelines with ML frameworks such as scikit-learn and PyTorch, enabling seamless multimodal fusion with EEG, MEG, and physiological data. It implements validated algorithms for signal-quality assessment, motion correction, GLM modelling, and DOT reconstruction, complemented by modules for simulation, data augmentation, and multimodal physiology analysis. Automated documentation links each method to its source publication, and continuous-integration testing ensures robustness. This tutorial paper provides seven fully executable notebooks that demonstrate core features. Cedalion offers an open, transparent, and community extensible foundation that supports reproducible, scalable, cloud- and ML-ready fNIRS/DOT workflows for laboratory-based and real-world neuroimaging.

功能性近红外光谱(fNIRS)和漫射光学断层扫描(DOT)正在迅速向日常生活中可穿戴、多模态、数据驱动、人工智能支持的神经成像发展。然而,当前的分析工具跨平台分散,限制了再现性、互操作性以及与现代机器学习(ML)工作流的集成。Cedalion是一个基于python的开源框架,用于在可复制、可扩展和社区驱动的环境中统一多模态fNIRS和DOT数据的高级基于模型和数据驱动分析。Cedalion将前向建模、摄影测量光电共配准、信号处理、GLM分析、DOT图像重建和基于ml的数据驱动方法集成在基于Python生态系统的单一标准化架构中。它遵循SNIRF和BIDS标准,支持云执行的Jupyter笔记本,并为可扩展的、完全可重复的分析管道提供容器化工作流,可以与原始研究出版物一起提供。Cedalion将已建立的光学神经成像管道与ML框架(如scikit-learn和PyTorch)连接起来,实现与EEG, MEG和生理数据的无缝多模式融合。它实现了经过验证的算法,用于信号质量评估、运动校正、GLM建模和DOT重建,并辅以仿真、数据增强和多模态生理分析模块。自动化文档将每个方法链接到它的源发布,持续集成测试确保了健壮性。本教程提供了七个完全可执行的笔记本,演示了核心功能。Cedalion提供了一个开放、透明和社区可扩展的基础,支持可重复、可扩展、云和ml就绪的fNIRS/DOT工作流程,用于基于实验室和现实世界的神经成像。
{"title":"Cedalion Tutorial: A Python-based framework for comprehensive analysis of multimodal fNIRS & DOT from the lab to the everyday world.","authors":"E Middell, L Carlton, S Moradi, T Codina, T Fischer, J Cutler, S Kelley, J Behrendt, T Dissanayake, N Harmening, M A Yücel, D A Boas, A von Lühmann","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Functional near-infrared spectroscopy (fNIRS) and diffuse optical tomography (DOT) are rapidly evolving toward wearable, multimodal, and data-driven, AI-supported neuroimaging in the everyday world. However, current analytical tools are fragmented across platforms, limiting reproducibility, interoperability, and integration with modern machine learning (ML) workflows. Cedalion is a Python-based open-source framework designed to unify advanced model-based and data-driven analysis of multimodal fNIRS and DOT data within a reproducible, extensible, and community-driven environment. Cedalion integrates forward modelling, photogrammetric optode co-registration, signal processing, GLM Analysis, DOT image reconstruction, and ML-based data-driven methods within a single standardized architecture based on the Python ecosystem. It adheres to SNIRF and BIDS standards, supports cloud-executable Jupyter notebooks, and provides containerized workflows for scalable, fully reproducible analysis pipelines that can be provided alongside original research publications. Cedalion connects established optical-neuroimaging pipelines with ML frameworks such as scikit-learn and PyTorch, enabling seamless multimodal fusion with EEG, MEG, and physiological data. It implements validated algorithms for signal-quality assessment, motion correction, GLM modelling, and DOT reconstruction, complemented by modules for simulation, data augmentation, and multimodal physiology analysis. Automated documentation links each method to its source publication, and continuous-integration testing ensures robustness. This tutorial paper provides seven fully executable notebooks that demonstrate core features. Cedalion offers an open, transparent, and community extensible foundation that supports reproducible, scalable, cloud- and ML-ready fNIRS/DOT workflows for laboratory-based and real-world neuroimaging.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12803311/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fold-switching proteins push the boundaries of conformational ensemble prediction. 折叠开关蛋白推动了构象集合预测的边界。
Pub Date : 2026-01-08
Myeongsang Lee, Lauren L Porter

A protein's function depends critically on its conformational ensemble, a collection of energy weighted structures whose balance depends on temperature and environment. Though recent deep learning (DL) methods have substantially advanced predictions of single protein structures, computationally modeling conformational ensembles remains a challenge. Here, we focus on modeling fold-switching proteins, which remodel their secondary and/or tertiary structures and change their functions in response to cellular stimuli. These underrepresented members of the protein universe serve as test cases for a method's generalizability. They reveal that DL models often predict conformational ensembles by association with training-set structures, limiting generalizability. These observations suggest use cases for when DL methods will likely succeed or fail. Developing computational methods that successfully identify new fold-switching proteins from large pools of candidates may advance modeling conformational ensembles more broadly.

蛋白质的功能主要取决于它的构象集合,这是一种能量加权结构的集合,其平衡取决于温度和环境。尽管最近的深度学习(DL)方法已经大大提高了对单个蛋白质结构的预测,但计算建模构象集成仍然是一个挑战。在这里,我们的重点是建模折叠开关蛋白,其重塑其二级和/或三级结构,并改变其功能,以响应细胞刺激。这些蛋白质宇宙中未被充分代表的成员可以作为测试方法可泛化性的用例。他们发现深度学习模型经常通过与训练集结构的关联来预测构象集成,限制了泛化能力。这些观察结果为深度学习方法可能成功或失败的用例提供了建议。开发计算方法,成功地从大量候选蛋白质中识别新的折叠开关蛋白质,可能会更广泛地推进构象集成的建模。
{"title":"Fold-switching proteins push the boundaries of conformational ensemble prediction.","authors":"Myeongsang Lee, Lauren L Porter","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>A protein's function depends critically on its conformational ensemble, a collection of energy weighted structures whose balance depends on temperature and environment. Though recent deep learning (DL) methods have substantially advanced predictions of single protein structures, computationally modeling conformational ensembles remains a challenge. Here, we focus on modeling fold-switching proteins, which remodel their secondary and/or tertiary structures and change their functions in response to cellular stimuli. These underrepresented members of the protein universe serve as test cases for a method's generalizability. They reveal that DL models often predict conformational ensembles by association with training-set structures, limiting generalizability. These observations suggest use cases for when DL methods will likely succeed or fail. Developing computational methods that successfully identify new fold-switching proteins from large pools of candidates may advance modeling conformational ensembles more broadly.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12803323/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effect of Right Ventricular Outflow Tract Material Properties on Simulated Transcatheter Pulmonary Placement. 右心室流出道材料特性对模拟经导管肺动脉置入术的影响。
Pub Date : 2026-01-08
Jalaj Maheshwari, Wensi Wu, Christopher N Zelonis, Steve A Maas, Kyle Sunderland, Yuval Barak-Corren, Stephen Ching, Patricia Sabin, Andras Lasso, Matthew J Gillespie, Jeffrey A Weiss, Matthew A Jolley

Finite element (FE) simulations emulating transcatheter pulmonary valve (TPV) system deployment in patient-specific right ventricular outflow tracts (RVOT) assume material properties for the RVOT and adjacent tissues. Sensitivity of the deployment to variation in RVOT material properties is unknown. Moreover, the effect of a transannular patch stiffness and location on simulated TPV deployment has not been explored. A sensitivity analysis on the material properties of a patient-specific RVOT during TPV deployment, modeled as an uncoupled HGO material, was conducted using FEBioUncertainSCI. Further, the effects of a transannular patch during TPV deployment were analyzed by considering two patch locations and four patch stiffnesses. Visualization of results and quantification were performed using custom metrics implemented in SlicerHeart and FEBio. Sensitivity analysis revealed that the shear modulus of the ground matrix (c), fiber modulus (k1), and fiber mean orientation angle (gamma) had the greatest effect on 95th %ile stress, whereas only c had the greatest effect on 95th %ile Lagrangian strain. First-order sensitivity indices contributed the greatest to the total-order sensitivity indices. Simulations using a transannular patch revealed that peak stress and strain were dependent on patch location. As stiffness of the patch increased, greater stress was observed at the interface connecting the patch to the RVOT, and stress in the patch itself increased while strain decreased. The total enclosed volume by the TPV device remained unchanged across all simulated patch cases. This study highlights that while uncertainties in tissue material properties and patch locations may influence functional outcomes, FE simulations provide a reliable framework for evaluating these outcomes in TPVR.

有限元(FE)模拟经导管肺动脉瓣(TPV)系统在患者特定右心室流出道(RVOT)中的部署,假设RVOT和邻近组织的材料特性。部署对RVOT材料特性变化的敏感性尚不清楚。此外,跨环形贴片刚度和位置对模拟TPV部署的影响尚未得到探讨。在TPV部署过程中,对患者特异性RVOT的材料特性进行了敏感性分析,建模为不耦合的HGO材料,使用febiunsuresci进行了分析。此外,通过考虑两种贴片位置和四种贴片刚度,分析了TPV部署过程中跨环形贴片的影响。使用SlicerHeart和FEBio实现的自定义指标对结果进行可视化和量化。敏感性分析表明,基底剪切模量(c)、纤维模量(k1)和纤维平均取向角(gamma)对95% ile应力的影响最大,而只有c对95% ile拉格朗日应变的影响最大。一阶灵敏度指标对全阶灵敏度指标的贡献最大。通过环形贴片的模拟表明,峰值应力和应变依赖于贴片的位置。随着贴片刚度的增加,在贴片与RVOT的界面处观察到更大的应力,并且贴片本身的应力增加而应变减小。TPV装置的总封闭体积在所有模拟贴片病例中保持不变。该研究强调,虽然组织材料特性和贴片位置的不确定性可能会影响功能结果,但FE模拟为评估TPVR的这些结果提供了可靠的框架。
{"title":"Effect of Right Ventricular Outflow Tract Material Properties on Simulated Transcatheter Pulmonary Placement.","authors":"Jalaj Maheshwari, Wensi Wu, Christopher N Zelonis, Steve A Maas, Kyle Sunderland, Yuval Barak-Corren, Stephen Ching, Patricia Sabin, Andras Lasso, Matthew J Gillespie, Jeffrey A Weiss, Matthew A Jolley","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Finite element (FE) simulations emulating transcatheter pulmonary valve (TPV) system deployment in patient-specific right ventricular outflow tracts (RVOT) assume material properties for the RVOT and adjacent tissues. Sensitivity of the deployment to variation in RVOT material properties is unknown. Moreover, the effect of a transannular patch stiffness and location on simulated TPV deployment has not been explored. A sensitivity analysis on the material properties of a patient-specific RVOT during TPV deployment, modeled as an uncoupled HGO material, was conducted using FEBioUncertainSCI. Further, the effects of a transannular patch during TPV deployment were analyzed by considering two patch locations and four patch stiffnesses. Visualization of results and quantification were performed using custom metrics implemented in SlicerHeart and FEBio. Sensitivity analysis revealed that the shear modulus of the ground matrix (c), fiber modulus (k1), and fiber mean orientation angle (gamma) had the greatest effect on 95th %ile stress, whereas only c had the greatest effect on 95th %ile Lagrangian strain. First-order sensitivity indices contributed the greatest to the total-order sensitivity indices. Simulations using a transannular patch revealed that peak stress and strain were dependent on patch location. As stiffness of the patch increased, greater stress was observed at the interface connecting the patch to the RVOT, and stress in the patch itself increased while strain decreased. The total enclosed volume by the TPV device remained unchanged across all simulated patch cases. This study highlights that while uncertainties in tissue material properties and patch locations may influence functional outcomes, FE simulations provide a reliable framework for evaluating these outcomes in TPVR.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12803331/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The rights and wrongs of rescaling in population genetics simulations. 群体遗传学模拟中缩放的对与错。
Pub Date : 2026-01-08
Parul Johri, Fanny Pouyet, Brian Charlesworth

Computer simulations of complex population genetic models are an essential tool for making sense of the large-scale datasets of multiple genome sequences from a single species that are becoming increasingly available. A widely used approach for reducing computing time is to simulate populations that are much smaller than the natural populations that they are intended to represent, by using parameters such as selection coefficients and mutation rates, whose products with the population size correspond to those of the natural populations. This approach has come to be known as rescaling, and is justified by the theory of the genetics of finite populations. Recently, however, there have been criticisms of this practice, which have brought to light situations in which it can lead to erroneous conclusions. This paper reviews the theoretical basis for rescaling, and relates it to current practice in population genetics simulations. It shows that some population genetic statistics are scaleable while others are not. Additionally, it shows that there are likely to be problems with rescaling when simulating large chromosomal regions, due to the non-linear relation between the physical distance between a pair of separate nucleotide sites and the frequency of recombination between them. Other difficulties with rescaling can arise in connection with simulations of selection on complex traits, and with populations that reproduce partly by self-fertilization or asexual reproduction. A number of recommendations are made for good practice in relation to rescaling.

复杂种群遗传模型的计算机模拟是理解来自单一物种的多个基因组序列的大规模数据集的重要工具,这些数据集正变得越来越可用。减少计算时间的一种广泛使用的方法是,通过使用选择系数和突变率等参数来模拟比它们所要表示的自然种群小得多的种群,其种群大小的乘积与自然种群的乘积相对应。这种方法被称为重新缩放,并被有限种群的遗传学理论所证实。然而,最近出现了对这种做法的批评,这些批评暴露了这种做法可能导致错误结论的情况。本文综述了重标度的理论基础,并将其与当前群体遗传学模拟的实践联系起来。这表明一些群体遗传统计是可扩展的,而另一些则不是。此外,它表明,由于一对独立核苷酸位点之间的物理距离与它们之间的重组频率之间的非线性关系,在模拟大染色体区域时可能存在重新缩放问题。重定尺度的其他困难可能与复杂性状的选择模拟有关,也可能与部分通过自交受精或无性繁殖繁殖的种群有关。本文提出了一些关于重新缩放的良好做法的建议。
{"title":"The rights and wrongs of rescaling in population genetics simulations.","authors":"Parul Johri, Fanny Pouyet, Brian Charlesworth","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Computer simulations of complex population genetic models are an essential tool for making sense of the large-scale datasets of multiple genome sequences from a single species that are becoming increasingly available. A widely used approach for reducing computing time is to simulate populations that are much smaller than the natural populations that they are intended to represent, by using parameters such as selection coefficients and mutation rates, whose products with the population size correspond to those of the natural populations. This approach has come to be known as rescaling, and is justified by the theory of the genetics of finite populations. Recently, however, there have been criticisms of this practice, which have brought to light situations in which it can lead to erroneous conclusions. This paper reviews the theoretical basis for rescaling, and relates it to current practice in population genetics simulations. It shows that some population genetic statistics are scaleable while others are not. Additionally, it shows that there are likely to be problems with rescaling when simulating large chromosomal regions, due to the non-linear relation between the physical distance between a pair of separate nucleotide sites and the frequency of recombination between them. Other difficulties with rescaling can arise in connection with simulations of selection on complex traits, and with populations that reproduce partly by self-fertilization or asexual reproduction. A number of recommendations are made for good practice in relation to rescaling.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12803318/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge Distillation of a Protein Language Model Yields a Foundational Implicit Solvent Model. 蛋白质语言模型的知识蒸馏产生了一个基本的隐式溶剂模型。
Pub Date : 2026-01-08
Justin Airas, Bin Zhang

Implicit solvent models (ISMs) promise to deliver the accuracy of explicit solvent simulations at a fraction of the computational cost. However, despite decades of development, their accuracy has remained insufficient for many critical applications, particularly for simulating protein folding and the behavior of intrinsically disordered proteins. Developing a transferable, data-driven ISM that overcomes the limitations of traditional analytical formulas remains a central challenge in computational chemistry. Here we address this challenge by introducing a novel strategy that distills the evolutionary information learned by a protein language model, ESM3, into a computationally efficient graph neural network (GNN). We show that this GNN potential, trained on effective energies from ESM3, is robust enough to drive stable, long-timescale molecular dynamics simulations. When combined with a standard electrostatics term, our hybrid model accurately reproduces protein folding free-energy landscapes and predicts the structural ensembles of intrinsically disordered proteins. This approach yields a single, unified model that is transferable across both folded and disordered protein states, resolving a long-standing limitation of conventional ISMs. By successfully distilling evolutionary knowledge into a physical potential, our work delivers a foundational implicit solvent model poised to accelerate the development of predictive, large-scale simulation tools.

隐式溶剂模型(ISMs)承诺以计算成本的一小部分提供显式溶剂模拟的准确性。然而,尽管经过几十年的发展,它们的准确性仍然不足以用于许多关键应用,特别是模拟蛋白质折叠和内在无序蛋白质的行为。开发一种可转移的、数据驱动的ISM,克服传统分析公式的局限性,仍然是计算化学的核心挑战。在这里,我们通过引入一种新的策略来解决这一挑战,该策略将蛋白质语言模型ESM3学习到的进化信息提取到计算效率高的图神经网络(GNN)中。我们证明,这种GNN势,在ESM3有效能量的训练下,足够强大,可以驱动稳定的、长时间尺度的分子动力学模拟。当与标准静电术语相结合时,我们的混合模型准确地再现了蛋白质折叠的自由能景观,并预测了内在无序蛋白质的结构集合。这种方法产生了一个单一的,统一的模型,可以在折叠和无序的蛋白质状态之间转移,解决了传统ISMs长期存在的局限性。通过成功地将进化知识提炼成物理势,我们的工作提供了一个基本的隐式溶剂模型,有望加速预测性大规模模拟工具的开发。
{"title":"Knowledge Distillation of a Protein Language Model Yields a Foundational Implicit Solvent Model.","authors":"Justin Airas, Bin Zhang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Implicit solvent models (ISMs) promise to deliver the accuracy of explicit solvent simulations at a fraction of the computational cost. However, despite decades of development, their accuracy has remained insufficient for many critical applications, particularly for simulating protein folding and the behavior of intrinsically disordered proteins. Developing a transferable, data-driven ISM that overcomes the limitations of traditional analytical formulas remains a central challenge in computational chemistry. Here we address this challenge by introducing a novel strategy that distills the evolutionary information learned by a protein language model, ESM3, into a computationally efficient graph neural network (GNN). We show that this GNN potential, trained on effective energies from ESM3, is robust enough to drive stable, long-timescale molecular dynamics simulations. When combined with a standard electrostatics term, our hybrid model accurately reproduces protein folding free-energy landscapes and predicts the structural ensembles of intrinsically disordered proteins. This approach yields a single, unified model that is transferable across both folded and disordered protein states, resolving a long-standing limitation of conventional ISMs. By successfully distilling evolutionary knowledge into a physical potential, our work delivers a foundational implicit solvent model poised to accelerate the development of predictive, large-scale simulation tools.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12803332/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessment of scoring functions for computational models of protein-protein interfaces. 蛋白质-蛋白质界面计算模型的评分函数评估。
Pub Date : 2026-01-07
Jacob Sumner, Grace Meng, Naomi Brandt, Alex T Grigas, Andrés Córdoba, Mark D Shattuck, Corey S O'Hern

A goal of computational studies of protein-protein interfaces (PPIs) is to predict the binding site between two monomers that form a heterodimer. The simplest version of this problem is to rigidly re-dock the bound forms of the monomers, which involves generating computational models of the heterodimer and then scoring them to determine the most native-like models. Scoring functions have been assessed previously using rank- and classification-based metrics, however, these methods are sensitive to the number and quality of models in the scoring function training set. We assess the accuracy of seven PPI scoring functions by comparing their scores to a measure of structural similarity to the x-ray crystal structure (i.e. the DockQ score) for a non-redundant set of heterodimers from the Protein Data Bank. For each heterodimer, we generate re-docked models uniformly sampled over DockQ and calculate the Spearman correlation between the PPI scores and DockQ. For some targets, the scores and DockQ are highly correlated; however, for many targets, there are weak correlations. Several physical features can explain the difference between difficult- and easy-to-score targets. For example, strong correlations exist between the score and DockQ for targets with highly intertwined monomers and many interface contacts. We also develop a new score based on only three physical features that matches or exceeds the performance of current PPI scoring functions. These results emphasize that PPI prediction can be improved by focusing on correlations between the PPI score and DockQ and incorporating more discriminating physical features into PPI scoring functions.

蛋白质-蛋白质界面(PPIs)计算研究的一个目标是预测形成异源二聚体的两个单体之间的结合位点。这个问题的最简单的版本是严格地重新对接单体的结合形式,这涉及到生成异源二聚体的计算模型,然后对它们进行评分,以确定最像原生的模型。以前已经使用基于排名和分类的度量来评估评分函数,然而,这些方法对评分函数训练集中模型的数量和质量很敏感。我们通过将七个PPI评分函数的分数与来自蛋白质数据库的一组非冗余异二聚体的x射线晶体结构的结构相似性(即DockQ评分)进行比较,来评估其准确性。对于每个异源二聚体,我们在DockQ上均匀采样生成重新对接模型,并计算PPI评分与DockQ之间的Spearman相关性。对于某些目标,得分与DockQ高度相关;然而,对于许多目标,存在弱相关性。一些身体特征可以解释难得分和易得分目标之间的差异。例如,对于具有高度缠绕单体和许多界面接触的目标,得分和DockQ之间存在很强的相关性。我们还开发了一个新的评分,仅基于三个物理特征,匹配或超过当前PPI评分功能的性能。这些结果强调,通过关注PPI评分与DockQ之间的相关性,并在PPI评分函数中加入更多具有区别性的身体特征,可以改善PPI预测。
{"title":"Assessment of scoring functions for computational models of protein-protein interfaces.","authors":"Jacob Sumner, Grace Meng, Naomi Brandt, Alex T Grigas, Andrés Córdoba, Mark D Shattuck, Corey S O'Hern","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>A goal of computational studies of protein-protein interfaces (PPIs) is to predict the binding site between two monomers that form a heterodimer. The simplest version of this problem is to rigidly re-dock the bound forms of the monomers, which involves generating computational models of the heterodimer and then scoring them to determine the most native-like models. Scoring functions have been assessed previously using rank- and classification-based metrics, however, these methods are sensitive to the number and quality of models in the scoring function training set. We assess the accuracy of seven PPI scoring functions by comparing their scores to a measure of structural similarity to the x-ray crystal structure (i.e. the DockQ score) for a non-redundant set of heterodimers from the Protein Data Bank. For each heterodimer, we generate re-docked models uniformly sampled over DockQ and calculate the Spearman correlation between the PPI scores and DockQ. For some targets, the scores and DockQ are highly correlated; however, for many targets, there are weak correlations. Several physical features can explain the difference between difficult- and easy-to-score targets. For example, strong correlations exist between the score and DockQ for targets with highly intertwined monomers and many interface contacts. We also develop a new score based on only three physical features that matches or exceeds the performance of current PPI scoring functions. These results emphasize that PPI prediction can be improved by focusing on correlations between the PPI score and DockQ and incorporating more discriminating physical features into PPI scoring functions.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12803336/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145991921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Allostery Beyond Amplification: Temporal Regulation of Signaling Information. 超越放大的变构:信号信息的时间调节。
Pub Date : 2026-01-07
Pedro Pessoa, Steve Pressé, S Banu Ozkan

Allostery is a fundamental mechanism of protein regulation and is commonly interpreted as modulating enzymatic activity or product abundance. Here we show that this view is incomplete. Using a stochastic model of allosteric regulation combined with an information-theoretic analysis, we quantify the mutual information between an enzyme's regulatory state and the states of downstream signaling components. Beyond controlling steady-state production levels, allostery also regulates the timing and duration over which information is transmitted. By tuning the temporal operating regime of signaling pathways, allosteric regulation enables distinct dynamical outcomes from identical molecular components, providing a physical mechanism for temporal information flow, signaling specificity, and coordination without changes in metabolic pathways.

变构是蛋白质调控的基本机制,通常被解释为调节酶活性或产物丰度。这里我们证明这种观点是不完整的。利用变构调节的随机模型结合信息论分析,我们量化了酶的调节状态和下游信号成分状态之间的相互信息。除了控制稳态生产水平外,变构还调节信息传输的时间和持续时间。通过调节信号通路的时间运作机制,变构调节使相同的分子成分产生不同的动态结果,为时间信息流、信号特异性和协调提供了一种物理机制,而不改变代谢途径。
{"title":"Allostery Beyond Amplification: Temporal Regulation of Signaling Information.","authors":"Pedro Pessoa, Steve Pressé, S Banu Ozkan","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Allostery is a fundamental mechanism of protein regulation and is commonly interpreted as modulating enzymatic activity or product abundance. Here we show that this view is incomplete. Using a stochastic model of allosteric regulation combined with an information-theoretic analysis, we quantify the mutual information between an enzyme's regulatory state and the states of downstream signaling components. Beyond controlling steady-state production levels, allostery also regulates the timing and duration over which information is transmitted. By tuning the temporal operating regime of signaling pathways, allosteric regulation enables distinct dynamical outcomes from identical molecular components, providing a physical mechanism for temporal information flow, signaling specificity, and coordination without changes in metabolic pathways.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12803326/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Restoring information in aged gene regulatory networks by single knock-ins. 通过单敲入恢复老化基因调控网络中的信息。
Pub Date : 2026-01-07
Ryan LeFebre, Fabrisia Ambrosio, Andrew Mugler

A hallmark of aging is loss of information in gene regulatory networks. These networks are tightly connected, raising the question of whether information could be restored by perturbing single genes. We develop a simple theoretical framework for information transmission in gene regulatory networks that describes the information gained or lost when a gene is "knocked in" (exogenously expressed). Applying the framework to gene expression data from muscle cells in young and old mice, we find that single knock-ins can restore network information by up to 10%. Our work advances the study of information flow in networks and identifies potential gene targets for rejuvenation.

衰老的一个标志是基因调控网络中的信息丢失。这些网络是紧密相连的,这就提出了一个问题:是否可以通过扰乱单个基因来恢复信息?我们为基因调控网络中的信息传递开发了一个简单的理论框架,该框架描述了当基因“敲入”(外源表达)时获得或丢失的信息。将该框架应用于年轻和年老小鼠肌肉细胞的基因表达数据,我们发现单敲入可以恢复高达10%的网络信息。我们的工作推进了网络信息流的研究,并确定了复壮的潜在基因靶点。
{"title":"Restoring information in aged gene regulatory networks by single knock-ins.","authors":"Ryan LeFebre, Fabrisia Ambrosio, Andrew Mugler","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>A hallmark of aging is loss of information in gene regulatory networks. These networks are tightly connected, raising the question of whether information could be restored by perturbing single genes. We develop a simple theoretical framework for information transmission in gene regulatory networks that describes the information gained or lost when a gene is \"knocked in\" (exogenously expressed). Applying the framework to gene expression data from muscle cells in young and old mice, we find that single knock-ins can restore network information by up to 10%. Our work advances the study of information flow in networks and identifies potential gene targets for rejuvenation.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12803314/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Biomechanically Informed Image Registration for Patient-Specific Aortic Valve Strain Analysis. 生物力学信息图像配准用于患者特异性主动脉瓣应变分析。
Pub Date : 2026-01-07
Mohsen Nakhaei, Alison Pouch, Silvani Amin, Matthew Daemer, Christian Herz, Natalie Yushkevich, Lourdes Al Ghofaily, Nimesh Desai, Joseph Bavaria, Matthew Jolley, Wensi Wu

Aortic valve (AV) biomechanics play a critical role in maintaining normal cardiac function. Pathological variations, particularly in bicuspid aortic valves (BAVs), alter leaflet loading, increase strain, and accelerate disease progression. Accurate, patient-specific characterization of valve geometry and deformation is essential for predicting disease progression and guiding durable repair. Current imaging and computational methods often fail to capture rapid valve motion and complex patient-specific features. To address these challenges, we combined image registration with finite element method (FEM) to enhance AV tracking and biomechanical assessment. Patient-specific valve geometries from 4D transesophageal echocardiography (TEE) and CT were used in FEM to model AV closure and generate intermediate deformation states. The FEM-generated states facilitated leaflet tracking, while the registration algorithm corrected mismatches between simulation and image. Across 20 patients, FEM-augmented registration improved accuracy by 40% compared with direct registration (33% for TEE, 46% for CT). This improvement enabled more reliable strain estimation directly from imaging and reducing uncertainties from boundary conditions and material assumptions. Areal and Green-Lagrange strains, as well as effective strain, were quantified in adult trileaflet/bicuspid, and pediatric patients. Trileaflet adults showed uniform deformation, BAVs exhibited asymmetric strain, and pediatric valves had low mean areal strain with high variability. Convergence between trileaflet adult and pediatric valves in mean effective strain suggests volumetric deformation drives age- and size-related differences. The FEM-augmented registration framework enhances geometric tracking and provides clinically relevant insights into patient-specific AV deformation, supporting individualized intervention planning.

主动脉瓣生物力学在维持正常心功能中起着至关重要的作用。病理变异,特别是在二尖瓣主动脉瓣(bav),改变小叶负荷,增加应变,并加速疾病进展。准确的、患者特异性的瓣膜几何形状和变形特征对于预测疾病进展和指导持久修复至关重要。目前的成像和计算方法往往不能捕捉快速的瓣膜运动和复杂的患者特异性特征。为了解决这些挑战,我们将图像配准与有限元法(FEM)相结合,以增强AV跟踪和生物力学评估。通过4D经食管超声心动图(TEE)和CT的患者特异性瓣膜几何形状在FEM中模拟房室关闭并产生中间变形状态。fem生成的状态促进了传单的跟踪,而配准算法则纠正了模拟与图像之间的不匹配。在20名患者中,与直接登记相比,fem增强登记的准确性提高了40% (TEE为33%,CT为46%)。这种改进可以直接从成像中获得更可靠的应变估计,并减少边界条件和材料假设的不确定性。在成人三叶/二尖叶和儿童患者中定量测定Areal和Green-Lagrange菌株以及有效菌株。成年三叶瓣膜表现为均匀变形,bav表现为不对称应变,儿童瓣膜具有低平均面应变和高变异性。成人和儿童三叶瓣膜的平均有效应变趋同表明体积变形驱动年龄和尺寸相关的差异。fem增强的注册框架增强了几何跟踪,并为患者特异性房室变形提供了临床相关的见解,支持个性化干预计划。
{"title":"Biomechanically Informed Image Registration for Patient-Specific Aortic Valve Strain Analysis.","authors":"Mohsen Nakhaei, Alison Pouch, Silvani Amin, Matthew Daemer, Christian Herz, Natalie Yushkevich, Lourdes Al Ghofaily, Nimesh Desai, Joseph Bavaria, Matthew Jolley, Wensi Wu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Aortic valve (AV) biomechanics play a critical role in maintaining normal cardiac function. Pathological variations, particularly in bicuspid aortic valves (BAVs), alter leaflet loading, increase strain, and accelerate disease progression. Accurate, patient-specific characterization of valve geometry and deformation is essential for predicting disease progression and guiding durable repair. Current imaging and computational methods often fail to capture rapid valve motion and complex patient-specific features. To address these challenges, we combined image registration with finite element method (FEM) to enhance AV tracking and biomechanical assessment. Patient-specific valve geometries from 4D transesophageal echocardiography (TEE) and CT were used in FEM to model AV closure and generate intermediate deformation states. The FEM-generated states facilitated leaflet tracking, while the registration algorithm corrected mismatches between simulation and image. Across 20 patients, FEM-augmented registration improved accuracy by 40% compared with direct registration (33% for TEE, 46% for CT). This improvement enabled more reliable strain estimation directly from imaging and reducing uncertainties from boundary conditions and material assumptions. Areal and Green-Lagrange strains, as well as effective strain, were quantified in adult trileaflet/bicuspid, and pediatric patients. Trileaflet adults showed uniform deformation, BAVs exhibited asymmetric strain, and pediatric valves had low mean areal strain with high variability. Convergence between trileaflet adult and pediatric valves in mean effective strain suggests volumetric deformation drives age- and size-related differences. The FEM-augmented registration framework enhances geometric tracking and provides clinically relevant insights into patient-specific AV deformation, supporting individualized intervention planning.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12803325/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical temporal receptive windows and zero-shot timescale generalization in biologically constrained scale-invariant deep networks. 生物约束尺度不变深度网络的分层时间接受窗和零时间尺度泛化。
Pub Date : 2026-01-06
Aakash Sarkar, Marc W Howard

Human cognition integrates information across nested timescales. While the cortex exhibits hierarchical Temporal Receptive Windows (TRWs), local circuits often display heterogeneous time constants. To reconcile this, we trained biologically constrained deep networks, based on scale-invariant hippocampal time cells, on a language classification task mimicking the hierarchical structure of language (e.g., 'letters' forming 'words'). First, using a feedforward model (SITHCon), we found that a hierarchy of TRWs emerged naturally across layers, despite the network having an identical spectrum of time constants within layers. We then distilled these inductive priors into a biologically plausible recurrent architecture, SITH-RNN. Training a sequence of architectures ranging from generic RNNs to this restricted subset showed that the scale-invariant SITH-RNN learned faster with orders-of-magnitude fewer parameters, and generalized zero-shot to out-of-distribution timescales. These results suggest the brain employs scale-invariant, sequential priors - coding "what" happened "when" - making recurrent networks with such priors particularly well-suited to describe human cognition.

人类的认知整合了嵌套时间尺度上的信息。当皮层表现出分层的时间接受窗(TRWs)时,局部电路往往表现出不均匀的时间常数。为了调和这一点,我们训练了基于尺度不变海马时间细胞的生物约束深度网络,在模拟语言层次结构的语言分类任务上(例如,“字母”形成“单词”)。首先,使用前馈模型(SITHCon),我们发现,尽管网络在各层内具有相同的时间常数谱,但trw的层次结构在各层之间自然出现。然后,我们将这些归纳先验提炼成生物学上合理的循环结构,即th - rnn。从通用rnn到该受限子集的一系列体系结构的训练表明,尺度不变的SITH-RNN在参数少的数量级上学习得更快,并且将零射击广义到分布外时间尺度。这些结果表明,大脑使用尺度不变的顺序先验——编码“什么”“什么时候”发生了——使具有这种先验的循环网络特别适合描述人类认知。
{"title":"Hierarchical temporal receptive windows and zero-shot timescale generalization in biologically constrained scale-invariant deep networks.","authors":"Aakash Sarkar, Marc W Howard","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Human cognition integrates information across nested timescales. While the cortex exhibits hierarchical Temporal Receptive Windows (TRWs), local circuits often display heterogeneous time constants. To reconcile this, we trained biologically constrained deep networks, based on scale-invariant hippocampal time cells, on a language classification task mimicking the hierarchical structure of language (e.g., 'letters' forming 'words'). First, using a feedforward model (SITHCon), we found that a hierarchy of TRWs emerged naturally across layers, despite the network having an identical spectrum of time constants within layers. We then distilled these inductive priors into a biologically plausible recurrent architecture, SITH-RNN. Training a sequence of architectures ranging from generic RNNs to this restricted subset showed that the scale-invariant SITH-RNN learned faster with orders-of-magnitude fewer parameters, and generalized zero-shot to out-of-distribution timescales. These results suggest the brain employs scale-invariant, sequential priors - coding \"what\" happened \"when\" - making recurrent networks with such priors particularly well-suited to describe human cognition.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12803329/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ArXiv
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1