首页 > 最新文献

Physical review. E最新文献

英文 中文
Evolution of invasion patterns due to surfactant adsorption in non-Gaussian pore distribution: Role of mass transfer and Laplace pressure. 表面活性剂在非高斯孔隙分布中吸附的入侵模式演化:传质和拉普拉斯压力的作用。
IF 2.4 3区 物理与天体物理 Q1 Mathematics Pub Date : 2025-12-01 DOI: 10.1103/b2wy-b918
Debanik Bhattacharjee, Guy Z Ramon, Yaniv Edery

Immiscible two-phase flow in porous media occurs in many processes, such as enhanced oil recovery (EOR), as well as oil spill and soil remediation. These processes involve a fluid displacing another immiscible fluid within the confines of a heterogeneous porous structure. The invasion pattern generally remains the same under constant conditions but can also evolve over time in the presence of surfactants, which alter the interfacial tension (IFT) and surface wettability. The dynamics under such conditions extend beyond the usual way in which such immiscible displacement is modeled. Here, we develop a time-dependent pore network model (PNM) to simulate the effects of surfactant-induced IFT reduction on immiscible displacement driven by constant inlet pressure, with pressure drops across the network calculated using a random resistor network and mass conservation equations. Node-specific flux and velocity are derived using the Hagen-Poiseuille equation, and surfactant adsorption is modeled using the Langmuir isotherm, capturing its impact on fluid-fluid and solid-fluid interfaces within the invaded path. Since the evolution of the invasion pattern comprises the cooperative mechanisms of surfactant mass transfer to the interfaces and the resulting changes in capillary and Laplace pressures, we employ two strategies to quantify this complex feedback behavior: mass transfer based, introducing a mass transfer timescale, and Laplace pressure based, scaling with the inlet pressure. Results reveal that heavy-tailed pore throat distribution accelerates the onset of secondary invasions, which enhances the dominance of Laplace pressure. As the distribution becomes more symmetric or Gaussian, mass transfer becomes the dominant mechanism. This interplay highlights the intricate balance between mass transfer and capillary effects in governing the spatiotemporal evolution of immiscible fluid invasion.

多孔介质中的非混相两相流存在于提高采收率(EOR)、溢油和土壤修复等诸多过程中。这些过程涉及一种流体在非均质多孔结构的范围内取代另一种不混相流体。在一定条件下,入侵模式通常保持不变,但在表面活性剂的存在下,入侵模式也会随着时间的推移而变化,从而改变界面张力(IFT)和表面润湿性。在这种条件下的动力学超出了通常模拟这种非混相位移的方法。在这里,我们开发了一个时间相关的孔隙网络模型(PNM)来模拟表面活性剂诱导的IFT减少对恒定进口压力驱动的非混相位移的影响,并使用随机电阻网络和质量守恒方程计算网络中的压降。节点特定通量和速度使用Hagen-Poiseuille方程推导,表面活性剂吸附使用Langmuir等温线建模,捕获其对侵入路径内流体-流体和固体-流体界面的影响。由于入侵模式的演变包括表面活性剂传质到界面的合作机制以及由此产生的毛细压力和拉普拉斯压力的变化,因此我们采用两种策略来量化这种复杂的反馈行为:基于传质,引入传质时间标度;基于拉普拉斯压力,随入口压力标度。结果表明,重尾孔喉分布加速了次生侵入的发生,增强了拉普拉斯压力的主导作用。当分布变得更加对称或高斯分布时,传质成为主要的机制。这种相互作用突出了在控制非混相流体侵入的时空演变中,质传递和毛细效应之间的复杂平衡。
{"title":"Evolution of invasion patterns due to surfactant adsorption in non-Gaussian pore distribution: Role of mass transfer and Laplace pressure.","authors":"Debanik Bhattacharjee, Guy Z Ramon, Yaniv Edery","doi":"10.1103/b2wy-b918","DOIUrl":"https://doi.org/10.1103/b2wy-b918","url":null,"abstract":"<p><p>Immiscible two-phase flow in porous media occurs in many processes, such as enhanced oil recovery (EOR), as well as oil spill and soil remediation. These processes involve a fluid displacing another immiscible fluid within the confines of a heterogeneous porous structure. The invasion pattern generally remains the same under constant conditions but can also evolve over time in the presence of surfactants, which alter the interfacial tension (IFT) and surface wettability. The dynamics under such conditions extend beyond the usual way in which such immiscible displacement is modeled. Here, we develop a time-dependent pore network model (PNM) to simulate the effects of surfactant-induced IFT reduction on immiscible displacement driven by constant inlet pressure, with pressure drops across the network calculated using a random resistor network and mass conservation equations. Node-specific flux and velocity are derived using the Hagen-Poiseuille equation, and surfactant adsorption is modeled using the Langmuir isotherm, capturing its impact on fluid-fluid and solid-fluid interfaces within the invaded path. Since the evolution of the invasion pattern comprises the cooperative mechanisms of surfactant mass transfer to the interfaces and the resulting changes in capillary and Laplace pressures, we employ two strategies to quantify this complex feedback behavior: mass transfer based, introducing a mass transfer timescale, and Laplace pressure based, scaling with the inlet pressure. Results reveal that heavy-tailed pore throat distribution accelerates the onset of secondary invasions, which enhances the dominance of Laplace pressure. As the distribution becomes more symmetric or Gaussian, mass transfer becomes the dominant mechanism. This interplay highlights the intricate balance between mass transfer and capillary effects in governing the spatiotemporal evolution of immiscible fluid invasion.</p>","PeriodicalId":20085,"journal":{"name":"Physical review. E","volume":"112 6-2","pages":"065108"},"PeriodicalIF":2.4,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146011909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Emergent Berezinskii-Kosterlitz-Thouless deconfinement in super-Coulombic plasmas. 超库仑等离子体中的突发Berezinskii-Kosterlitz-Thouless约束。
IF 2.4 3区 物理与天体物理 Q1 Mathematics Pub Date : 2025-12-01 DOI: 10.1103/4q5z-4wkn
Ayush De, Leo Radzihovsky, Snir Gazit

We study the statistical mechanics of two-dimensional "super-Coulombic" plasmas, namely, neutral plasmas with power-law interactions longer ranged than Coulomb. To that end, we employ numerically exact large-scale Monte Carlo simulations. Contrary to naive energy-entropy arguments, we observe a charge confinement-deconfinement transition as a function of temperature. Remarkably, the transition lies in the Berezinskii-Kosterlitz-Thouless (BKT) universality class. Our results corroborate recent dielectric medium and renormalization group calculations predicting effective long-scale Coulomb interactions in microscopically super-Coulombic gases. We explicitly showcase this novel dielectric screening phenomenon, capturing the emergent Coulomb potential and the associated crossover length scale. This is achieved by utilizing a new test charge based methodology for determining effective inter-particle interactions. Lastly, we show that this Coulomb emergence and the associated BKT transition occur universally across generic interactions and densities.

我们研究了二维“超库仑”等离子体的统计力学,即幂律相互作用范围大于库仑的中性等离子体。为此,我们采用数值精确的大规模蒙特卡罗模拟。与朴素的能量熵论证相反,我们观察到电荷约束-定义跃迁是温度的函数。值得注意的是,这种转变在于别列津斯基-科斯特利茨-索利斯(BKT)普适类。我们的结果证实了最近的介电介质和重整化组计算预测微观超库仑气体中有效的长尺度库仑相互作用。我们明确地展示了这种新的介电屏蔽现象,捕获了涌现的库仑电位和相关的交叉长度尺度。这是通过利用一种新的基于测试电荷的方法来确定有效的粒子间相互作用来实现的。最后,我们证明这种库仑突现和相关的BKT跃迁在一般相互作用和密度中普遍发生。
{"title":"Emergent Berezinskii-Kosterlitz-Thouless deconfinement in super-Coulombic plasmas.","authors":"Ayush De, Leo Radzihovsky, Snir Gazit","doi":"10.1103/4q5z-4wkn","DOIUrl":"https://doi.org/10.1103/4q5z-4wkn","url":null,"abstract":"<p><p>We study the statistical mechanics of two-dimensional \"super-Coulombic\" plasmas, namely, neutral plasmas with power-law interactions longer ranged than Coulomb. To that end, we employ numerically exact large-scale Monte Carlo simulations. Contrary to naive energy-entropy arguments, we observe a charge confinement-deconfinement transition as a function of temperature. Remarkably, the transition lies in the Berezinskii-Kosterlitz-Thouless (BKT) universality class. Our results corroborate recent dielectric medium and renormalization group calculations predicting effective long-scale Coulomb interactions in microscopically super-Coulombic gases. We explicitly showcase this novel dielectric screening phenomenon, capturing the emergent Coulomb potential and the associated crossover length scale. This is achieved by utilizing a new test charge based methodology for determining effective inter-particle interactions. Lastly, we show that this Coulomb emergence and the associated BKT transition occur universally across generic interactions and densities.</p>","PeriodicalId":20085,"journal":{"name":"Physical review. E","volume":"112 6","pages":"L062102"},"PeriodicalIF":2.4,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146011937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speed-performance tradeoff and its effect in the game of cricket: Predictive modeling from a statistical mechanics perspective. 板球运动中速度-性能的权衡及其影响:从统计力学角度的预测建模。
IF 2.4 3区 物理与天体物理 Q1 Mathematics Pub Date : 2025-12-01 DOI: 10.1103/kglf-lj6k
Mohd Suhail Rizvi

Speed-accuracy tradeoffs are a common feature in several physical and biological systems. Here, we demonstrate analogous speed-performance tradeoffs in the game of cricket, where batters score runs off the balls delivered by bowlers. Leveraging extensive data from cricket, we show that a batter's run-scoring rate and probability of dismissal are related, and this relation can be approximated by a power law. Similar relation is observed for bowlers as well that links their run-conceding rate and wicket taking abilities. The exponents in these power-law relations quantify player adaptability under varying conditions and serve as a robust performance indicator. Using a drift-diffusion-decay model for run scoring and conceding, we find that players with extreme exponent values are better suited to specific game formats and match conditions than those with moderate values. These findings provide a quantitative framework for evaluating player potential, optimizing team strategy, and extending the analysis to other systems exhibiting similar speed-accuracy or performance tradeoffs.

在一些物理和生物系统中,速度和精度的权衡是一个共同的特征。在这里,我们在板球比赛中展示了类似的速度-性能权衡,击球手在投球手投出的球上得分。利用板球的大量数据,我们表明击球手的得分率和被解雇的概率是相关的,这种关系可以用幂律来近似。在投球手中也观察到类似的关系,将他们的失分率和三柱射门能力联系起来。这些幂律关系中的指数量化了玩家在不同条件下的适应性,并作为一个稳健的表现指标。利用漂移-扩散-衰减模型对跑位得分和失分进行了分析,发现极端指数的球员比中等指数的球员更适合特定的比赛形式和比赛条件。这些发现为评估球员潜力、优化球队策略提供了定量框架,并将分析扩展到其他表现出类似速度-准确性或性能权衡的系统。
{"title":"Speed-performance tradeoff and its effect in the game of cricket: Predictive modeling from a statistical mechanics perspective.","authors":"Mohd Suhail Rizvi","doi":"10.1103/kglf-lj6k","DOIUrl":"https://doi.org/10.1103/kglf-lj6k","url":null,"abstract":"<p><p>Speed-accuracy tradeoffs are a common feature in several physical and biological systems. Here, we demonstrate analogous speed-performance tradeoffs in the game of cricket, where batters score runs off the balls delivered by bowlers. Leveraging extensive data from cricket, we show that a batter's run-scoring rate and probability of dismissal are related, and this relation can be approximated by a power law. Similar relation is observed for bowlers as well that links their run-conceding rate and wicket taking abilities. The exponents in these power-law relations quantify player adaptability under varying conditions and serve as a robust performance indicator. Using a drift-diffusion-decay model for run scoring and conceding, we find that players with extreme exponent values are better suited to specific game formats and match conditions than those with moderate values. These findings provide a quantitative framework for evaluating player potential, optimizing team strategy, and extending the analysis to other systems exhibiting similar speed-accuracy or performance tradeoffs.</p>","PeriodicalId":20085,"journal":{"name":"Physical review. E","volume":"112 6-1","pages":"064121"},"PeriodicalIF":2.4,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146011946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pólya urn model for analysis of football passes. Pólya用于足球传球分析的urn模型。
IF 2.4 3区 物理与天体物理 Q1 Mathematics Pub Date : 2025-12-01 DOI: 10.1103/w5wk-mmcc
Ken Yamamoto

This study analyzes pass networks in football (soccer) using a stochastic model known as the Pólya urn. By focusing on preferential selection, it theoretically demonstrates that the time evolution of networks can be characterized by a single parameter. Building on this result, a data analysis method is proposed and applied to a large-scale public dataset of professional football matches. The statistical properties of the preferential-selection parameter are examined, demonstrating its correlation with pass accuracy and with mean pass difficulty. This method is applicable to various evolving networks.

本研究分析了传球网络在足球(足球)使用随机模型称为Pólya回合。通过关注优先选择,从理论上证明了网络的时间演化可以用单个参数来表征。在此基础上,提出了一种数据分析方法,并将其应用于大规模的职业足球比赛公共数据集。研究了优先选择参数的统计特性,证明了其与通过率和平均通过率的相关性。该方法适用于各种演化网络。
{"title":"Pólya urn model for analysis of football passes.","authors":"Ken Yamamoto","doi":"10.1103/w5wk-mmcc","DOIUrl":"https://doi.org/10.1103/w5wk-mmcc","url":null,"abstract":"<p><p>This study analyzes pass networks in football (soccer) using a stochastic model known as the Pólya urn. By focusing on preferential selection, it theoretically demonstrates that the time evolution of networks can be characterized by a single parameter. Building on this result, a data analysis method is proposed and applied to a large-scale public dataset of professional football matches. The statistical properties of the preferential-selection parameter are examined, demonstrating its correlation with pass accuracy and with mean pass difficulty. This method is applicable to various evolving networks.</p>","PeriodicalId":20085,"journal":{"name":"Physical review. E","volume":"112 6","pages":"L062303"},"PeriodicalIF":2.4,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146011988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Landscape complexity for the empirical risk of generalized linear models: Discrimination between structured data. 广义线性模型经验风险的景观复杂性:结构化数据之间的区别。
IF 2.4 3区 物理与天体物理 Q1 Mathematics Pub Date : 2025-12-01 DOI: 10.1103/3mbj-xkgk
Theodoros G Tsironis, Aris L Moustakas

We use the Kac-Rice formula and results from random matrix theory to obtain the average number of critical points of a family of high-dimensional empirical loss functions, where the data are correlated d-dimensional Gaussian vectors, whose number has a fixed ratio with their dimension. The correlations are introduced to model the existence of structure in the data, as is common in current machine learning systems. Under a technical hypothesis, our results are exact in the large-d limit, and characterize the annealed landscape complexity, namely the logarithm of the expected number of critical points at a given value of the loss. We first address in detail the landscape of the loss function of a single perceptron and then generalize it to the case where two competing datasets with different covariance matrices are present, with the perceptron seeking to discriminate between them. The latter model can be applied to understand the interplay between adversity and nontrivial data structures. For completeness, we also treat the case of a loss function used in training generalized linear models in the presence of correlated input data.

我们利用Kac-Rice公式和随机矩阵理论的结果得到了一类高维经验损失函数的平均临界点数,其中数据是相关的d维高斯向量,其数量与它们的维数有固定的比值。引入相关性来模拟数据中结构的存在,这在当前的机器学习系统中很常见。在一个技术假设下,我们的结果在大d极限下是精确的,并且表征了退火景观的复杂性,即在给定损失值下预期临界点数量的对数。我们首先详细介绍了单个感知器损失函数的情况,然后将其推广到存在两个具有不同协方差矩阵的竞争数据集的情况,感知器试图区分它们。后一种模型可用于理解逆境和重要数据结构之间的相互作用。为了完整起见,我们还处理了在存在相关输入数据的情况下用于训练广义线性模型的损失函数的情况。
{"title":"Landscape complexity for the empirical risk of generalized linear models: Discrimination between structured data.","authors":"Theodoros G Tsironis, Aris L Moustakas","doi":"10.1103/3mbj-xkgk","DOIUrl":"https://doi.org/10.1103/3mbj-xkgk","url":null,"abstract":"<p><p>We use the Kac-Rice formula and results from random matrix theory to obtain the average number of critical points of a family of high-dimensional empirical loss functions, where the data are correlated d-dimensional Gaussian vectors, whose number has a fixed ratio with their dimension. The correlations are introduced to model the existence of structure in the data, as is common in current machine learning systems. Under a technical hypothesis, our results are exact in the large-d limit, and characterize the annealed landscape complexity, namely the logarithm of the expected number of critical points at a given value of the loss. We first address in detail the landscape of the loss function of a single perceptron and then generalize it to the case where two competing datasets with different covariance matrices are present, with the perceptron seeking to discriminate between them. The latter model can be applied to understand the interplay between adversity and nontrivial data structures. For completeness, we also treat the case of a loss function used in training generalized linear models in the presence of correlated input data.</p>","PeriodicalId":20085,"journal":{"name":"Physical review. E","volume":"112 6-2","pages":"065307"},"PeriodicalIF":2.4,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146012003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating the relation between elastic and relaxation properties of dry, frictional granular media during shear deformation. 研究干摩擦颗粒介质在剪切变形过程中的弹性和松弛特性的关系。
IF 2.4 3区 物理与天体物理 Q1 Mathematics Pub Date : 2025-12-01 DOI: 10.1103/rp3b-lzvt
Aurélien Rigotti, Véronique Dansereau, Jérôme Weiss

Using discrete element simulations based on molecular dynamics, we investigate the mechanical behavior of two-dimensional sheared, dry, frictional granular media in the "dense" and "critical" regimes. We find that this behavior is partitioned between transient stages and a final stationary stage. While the latter is macroscopically consistent with the predictions of the viscous μ(I) rheology, both the macroscopic behavior during the transient stages and the overall microscopic behavior suggest a more complex picture. Indeed, the simulated granular medium exhibits a finite elastic stiffness throughout its entire shear deformation history, although topological rearrangements of the grains at the microscale translate into a partial degradation of this stiffness, which can be interpreted as a form of elastic damage. When letting the system relax under constant volume at different stages of shear deformation, the relaxation of stresses follows a compressed exponential, also highlighting the role of elastic interactions in the medium, with residual stresses that depend on the level of elastic damage. The relations we establish between elastic and relaxation properties point to a complex rheology, characterized by a damage-dependent transition between a viscoelastoplastic and a viscous behavior.

利用基于分子动力学的离散元模拟,我们研究了二维剪切、干燥、摩擦颗粒介质在“密集”和“临界”状态下的力学行为。我们发现这种行为在暂态阶段和最终平稳阶段之间被划分。虽然后者在宏观上与粘性μ(I)流变学的预测一致,但瞬态阶段的宏观行为和整体微观行为都表明了一个更复杂的图景。事实上,模拟的颗粒介质在其整个剪切变形历史中表现出有限的弹性刚度,尽管颗粒在微观尺度上的拓扑重排转化为这种刚度的部分退化,这可以解释为弹性损伤的一种形式。当系统在剪切变形的不同阶段以恒定体积松弛时,应力松弛遵循压缩指数,也突出了介质中弹性相互作用的作用,残余应力取决于弹性损伤的水平。我们建立的弹性和松弛特性之间的关系指向一个复杂的流变学,其特征是粘弹塑性和粘性行为之间的依赖于损伤的过渡。
{"title":"Investigating the relation between elastic and relaxation properties of dry, frictional granular media during shear deformation.","authors":"Aurélien Rigotti, Véronique Dansereau, Jérôme Weiss","doi":"10.1103/rp3b-lzvt","DOIUrl":"https://doi.org/10.1103/rp3b-lzvt","url":null,"abstract":"<p><p>Using discrete element simulations based on molecular dynamics, we investigate the mechanical behavior of two-dimensional sheared, dry, frictional granular media in the \"dense\" and \"critical\" regimes. We find that this behavior is partitioned between transient stages and a final stationary stage. While the latter is macroscopically consistent with the predictions of the viscous μ(I) rheology, both the macroscopic behavior during the transient stages and the overall microscopic behavior suggest a more complex picture. Indeed, the simulated granular medium exhibits a finite elastic stiffness throughout its entire shear deformation history, although topological rearrangements of the grains at the microscale translate into a partial degradation of this stiffness, which can be interpreted as a form of elastic damage. When letting the system relax under constant volume at different stages of shear deformation, the relaxation of stresses follows a compressed exponential, also highlighting the role of elastic interactions in the medium, with residual stresses that depend on the level of elastic damage. The relations we establish between elastic and relaxation properties point to a complex rheology, characterized by a damage-dependent transition between a viscoelastoplastic and a viscous behavior.</p>","PeriodicalId":20085,"journal":{"name":"Physical review. E","volume":"112 6-2","pages":"065415"},"PeriodicalIF":2.4,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146012006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scaling laws and representation learning in simple hierarchical languages: Transformers versus convolutional architectures. 简单层次语言中的缩放定律和表示学习:变形金刚与卷积架构。
IF 2.4 3区 物理与天体物理 Q1 Mathematics Pub Date : 2025-12-01 DOI: 10.1103/qtd6-nl8p
Francesco Cagnetta, Alessandro Favero, Antonio Sclocchi, Matthieu Wyart

How do neural language models acquire a language's structure when trained for next-token prediction? We address this question by deriving theoretical scaling laws for neural network performance on synthetic datasets generated by the random hierarchy model (RHM)-an ensemble of probabilistic context-free grammars designed to capture the hierarchical structure of natural language while remaining analytically tractable. Previously, we developed a theory of representation learning based on data correlations that explains how deep learning models capture the hierarchical structure of the data sequentially, one layer at a time. Here, we extend our theoretical framework to account for architectural differences. In particular, we predict and empirically validate that convolutional networks, whose structure aligns with that of the generative process through locality and weight sharing, enjoy a faster scaling of performance compared to transformer models, which rely on global self-attention mechanisms. This finding clarifies the architectural biases underlying neural scaling laws and highlights how representation learning is shaped by the interaction between model architecture and the statistical properties of data.

神经语言模型如何在训练下一个标记预测时获得语言的结构?我们通过推导由随机层次模型(RHM)生成的合成数据集上神经网络性能的理论标度定律来解决这个问题。随机层次模型是一种概率上下文无关语法的集合,旨在捕捉自然语言的层次结构,同时保持分析可处理性。在此之前,我们开发了一种基于数据相关性的表示学习理论,该理论解释了深度学习模型如何按顺序捕获数据的层次结构,一次捕获一层。在这里,我们扩展了我们的理论框架来解释架构上的差异。特别是,我们预测并经验验证了卷积网络,其结构通过局部性和权重共享与生成过程保持一致,与依赖全局自关注机制的变压器模型相比,具有更快的性能扩展。这一发现澄清了神经尺度定律背后的架构偏差,并强调了模型架构和数据统计属性之间的相互作用如何塑造表征学习。
{"title":"Scaling laws and representation learning in simple hierarchical languages: Transformers versus convolutional architectures.","authors":"Francesco Cagnetta, Alessandro Favero, Antonio Sclocchi, Matthieu Wyart","doi":"10.1103/qtd6-nl8p","DOIUrl":"https://doi.org/10.1103/qtd6-nl8p","url":null,"abstract":"<p><p>How do neural language models acquire a language's structure when trained for next-token prediction? We address this question by deriving theoretical scaling laws for neural network performance on synthetic datasets generated by the random hierarchy model (RHM)-an ensemble of probabilistic context-free grammars designed to capture the hierarchical structure of natural language while remaining analytically tractable. Previously, we developed a theory of representation learning based on data correlations that explains how deep learning models capture the hierarchical structure of the data sequentially, one layer at a time. Here, we extend our theoretical framework to account for architectural differences. In particular, we predict and empirically validate that convolutional networks, whose structure aligns with that of the generative process through locality and weight sharing, enjoy a faster scaling of performance compared to transformer models, which rely on global self-attention mechanisms. This finding clarifies the architectural biases underlying neural scaling laws and highlights how representation learning is shaped by the interaction between model architecture and the statistical properties of data.</p>","PeriodicalId":20085,"journal":{"name":"Physical review. E","volume":"112 6-2","pages":"065312"},"PeriodicalIF":2.4,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146012027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Choosing observables that capture critical slowing down before tipping points: A Fokker-Planck operator approach. 选择在引爆点之前捕获临界减速的观测值:福克-普朗克算子方法。
IF 2.4 3区 物理与天体物理 Q1 Mathematics Pub Date : 2025-12-01 DOI: 10.1103/l2v2-xndy
Johannes Lohmann, Georg A Gottwald

Tipping points (TP) are abrupt transitions between metastable states in complex systems, most often described by a bifurcation or crisis of a multistable system induced by a slowly changing control parameter. An avenue for predicting TPs in real-world systems is critical slowing down (CSD), which is a decrease in the relaxation rate after perturbations prior to a TP that can be measured by statistical early warning signals (EWS) in the autocovariance of observational time series. In high-dimensional systems, we cannot expect a priori chosen scalar observables to show significant EWS, and some may even show an opposite signal. Thus, to avoid false negative or positive early warnings, it is desirable to monitor fluctuations only in observables that are designed to capture CSD. Here we propose that a natural observable for this purpose can be obtained by a data-driven approximation of the first nontrivial eigenfunction of the backward Fokker-Planck (or Kolmogorov) operator, using the diffusion map algorithm.

引爆点(TP)是复杂系统中亚稳态之间的突变,通常由缓慢变化的控制参数引起的多稳定系统的分岔或危机来描述。在现实世界系统中预测TP的一个途径是临界减速(CSD),它是在TP之前的扰动后弛豫率的降低,可以通过观测时间序列的自协方差中的统计预警信号(EWS)来测量。在高维系统中,我们不能期望先验选择的标量观测值显示显著的EWS,有些甚至可能显示相反的信号。因此,为了避免假阴性或假阳性的早期预警,最好只监测旨在捕捉可持续发展的可观测值的波动。在这里,我们提出了一个自然的可观测值,可以通过使用扩散映射算法对后向Fokker-Planck(或Kolmogorov)算子的第一个非平凡特征函数的数据驱动近似来获得。
{"title":"Choosing observables that capture critical slowing down before tipping points: A Fokker-Planck operator approach.","authors":"Johannes Lohmann, Georg A Gottwald","doi":"10.1103/l2v2-xndy","DOIUrl":"https://doi.org/10.1103/l2v2-xndy","url":null,"abstract":"<p><p>Tipping points (TP) are abrupt transitions between metastable states in complex systems, most often described by a bifurcation or crisis of a multistable system induced by a slowly changing control parameter. An avenue for predicting TPs in real-world systems is critical slowing down (CSD), which is a decrease in the relaxation rate after perturbations prior to a TP that can be measured by statistical early warning signals (EWS) in the autocovariance of observational time series. In high-dimensional systems, we cannot expect a priori chosen scalar observables to show significant EWS, and some may even show an opposite signal. Thus, to avoid false negative or positive early warnings, it is desirable to monitor fluctuations only in observables that are designed to capture CSD. Here we propose that a natural observable for this purpose can be obtained by a data-driven approximation of the first nontrivial eigenfunction of the backward Fokker-Planck (or Kolmogorov) operator, using the diffusion map algorithm.</p>","PeriodicalId":20085,"journal":{"name":"Physical review. E","volume":"112 6-1","pages":"064204"},"PeriodicalIF":2.4,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146012052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unified gas-kinetic wave-particle method for multiscale phonon transport. 多尺度声子输运的统一气动波粒法。
IF 2.4 3区 物理与天体物理 Q1 Mathematics Pub Date : 2025-12-01 DOI: 10.1103/hz9s-5qbm
Hongyu Liu, Xiaojian Yang, Chuang Zhang, Xing Ji, Kun Xu

Over the past seven decades, the classical Monte Carlo method has played a huge role in the fields of rarefied gas flow and micro/nanoscale heat transfer, but it also has shortcomings: the time step and cell size are limited by the relaxation time and mean free path, making it difficult to efficiently simulate multiscale heat and mass transfer problems from the ballistic to diffusion limit. To overcome this drawback, a unified gas-kinetic wave-particle (UGKWP) method is developed for solving the phonon Boltzmann transport equation (BTE) in all regimes covering both ballistic and diffusive limits. This method is built upon the space-time coupled evolution model of the phonon BTE, which provides the framework for constructing a multiscale flux at the cell interfaces. At the same time, in order to capture nonequilibrium transport efficiently, the multiscale flux comprises two distinct components: a deterministic part for capturing the near-equilibrium or diffusive transport, and a statistical particle part for recovering nonequilibrium or ballistic transport phenomena. The UGKWP method exhibits remarkable multiscale adaptability and versatility, seamlessly bridging the gap between the diffusive and ballistic transport phenomena. In the diffusive limit, the present method naturally converges to Fourier's law, with the diminishing particle contribution, whereas in the ballistic limit, the nonequilibrium flux is fully described by the free-streaming particles. This inherent adaptability not only allows for precise capturing of both equilibrium and nonequilibrium heat transfer processes, but it also guarantees that the model adheres strictly to the underlying physical laws in each phonon transport regime. A series of numerical tests fully demonstrate the excellent performance of the UGKWP method in all Knudsen regimes, where the time step and cell size are not constrained by the relaxation time and mean free path in the diffusive regime. The present method is an efficient and accurate computational tool for simulating multiscale nonequilibrium heat transfer, and offering significant advantages over traditional methods in terms of numerical performance and physical applicability.

在过去的70年里,经典蒙特卡罗方法在稀薄气体流动和微纳米尺度传热领域发挥了巨大的作用,但它也存在缺点:时间步长和单元大小受弛豫时间和平均自由程的限制,难以有效地模拟从弹道极限到扩散极限的多尺度传热传质问题。为了克服这一缺点,开发了一种统一的气体动力学波粒(UGKWP)方法,用于解决所有覆盖弹道和扩散极限的声子玻尔兹曼输运方程(BTE)。该方法建立在声子BTE的时空耦合演化模型的基础上,为构建细胞界面处的多尺度通量提供了框架。同时,为了有效地捕获非平衡输运,多尺度通量包括两个不同的部分:确定性部分用于捕获近平衡或扩散输运,统计粒子部分用于恢复非平衡或弹道输运现象。UGKWP方法表现出显著的多尺度适应性和多功能性,无缝地弥合了扩散和弹道输运现象之间的差距。在扩散极限下,该方法自然收敛于傅立叶定律,粒子的贡献逐渐减少,而在弹道极限下,非平衡通量完全由自由流动的粒子描述。这种固有的适应性不仅允许精确捕获平衡和非平衡传热过程,而且还保证该模型严格遵守每个声子输运机制的基本物理定律。一系列的数值试验充分证明了UGKWP方法在所有Knudsen区域的优异性能,其中时间步长和单元大小不受扩散区域弛豫时间和平均自由程的约束。该方法是模拟多尺度非平衡传热的一种高效、准确的计算工具,在数值性能和物理适用性方面均优于传统方法。
{"title":"Unified gas-kinetic wave-particle method for multiscale phonon transport.","authors":"Hongyu Liu, Xiaojian Yang, Chuang Zhang, Xing Ji, Kun Xu","doi":"10.1103/hz9s-5qbm","DOIUrl":"https://doi.org/10.1103/hz9s-5qbm","url":null,"abstract":"<p><p>Over the past seven decades, the classical Monte Carlo method has played a huge role in the fields of rarefied gas flow and micro/nanoscale heat transfer, but it also has shortcomings: the time step and cell size are limited by the relaxation time and mean free path, making it difficult to efficiently simulate multiscale heat and mass transfer problems from the ballistic to diffusion limit. To overcome this drawback, a unified gas-kinetic wave-particle (UGKWP) method is developed for solving the phonon Boltzmann transport equation (BTE) in all regimes covering both ballistic and diffusive limits. This method is built upon the space-time coupled evolution model of the phonon BTE, which provides the framework for constructing a multiscale flux at the cell interfaces. At the same time, in order to capture nonequilibrium transport efficiently, the multiscale flux comprises two distinct components: a deterministic part for capturing the near-equilibrium or diffusive transport, and a statistical particle part for recovering nonequilibrium or ballistic transport phenomena. The UGKWP method exhibits remarkable multiscale adaptability and versatility, seamlessly bridging the gap between the diffusive and ballistic transport phenomena. In the diffusive limit, the present method naturally converges to Fourier's law, with the diminishing particle contribution, whereas in the ballistic limit, the nonequilibrium flux is fully described by the free-streaming particles. This inherent adaptability not only allows for precise capturing of both equilibrium and nonequilibrium heat transfer processes, but it also guarantees that the model adheres strictly to the underlying physical laws in each phonon transport regime. A series of numerical tests fully demonstrate the excellent performance of the UGKWP method in all Knudsen regimes, where the time step and cell size are not constrained by the relaxation time and mean free path in the diffusive regime. The present method is an efficient and accurate computational tool for simulating multiscale nonequilibrium heat transfer, and offering significant advantages over traditional methods in terms of numerical performance and physical applicability.</p>","PeriodicalId":20085,"journal":{"name":"Physical review. E","volume":"112 6-2","pages":"065304"},"PeriodicalIF":2.4,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146012098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Central limit behavior at the edge of chaos in the z-logistic map. z-logistic图混沌边缘的中心极限行为。
IF 2.4 3区 物理与天体物理 Q1 Mathematics Pub Date : 2025-12-01 DOI: 10.1103/gtlz-67cf
Abbas Ali Saberi, Ugur Tirnakli, Constantino Tsallis

We focus on the Feigenbaum-Coullet-Tresser point of the dissipative one-dimensional z-logistic map x_{t+1}=1-a|x_{t}|^{z}(z≥1). We show that sums of iterates converge to q-Gaussian distributions P_{q}(y)=P_{q}(0)exp_{q}(-β_{q}y^{2})=P_{q}(0)[1+(q-1)β_{q}y^{2}]^{1/(1-q)}(q≥1;β_{q}>0), which optimize the nonadditive entropic functional S_{q} under simple constraints. We propose and justify heuristically a closed-form prediction for the entropic index, q(z)=1+2/(z+1), and validate it numerically via data collapse for typical z values. The formula captures how the limiting law depends on the nonlinearity order and implies finite variance for z>2 and divergent variance for 1≤z≤2. These results extend edge-of-chaos central limit behavior beyond the standard (z=2) case and provide a simple predictive law for unimodal maps with varying maximum order.

研究了耗散一维z-logistic映射x_{t+1}=1-a|x_{t}|^{z}(z≥1)的Feigenbaum-Coullet-Tresser点。我们证明迭代和收敛于q-高斯分布P_{q}(y)=P_{q}(0)exp_{q}(-β_{q}y^{2})=P_{q}(0)[1+(q-1)β_{q}y^{2}]^{1/(1-q)}(q≥1;β_{q}>0),在简单约束下优化了非加性熵泛函S_{q}。我们提出并证明启发式地对熵指数q(z)=1+2/(z+1)的封闭形式预测,并通过典型z值的数据崩溃进行数值验证。该公式捕捉了极限律如何依赖于非线性阶,并暗示了z>2的有限方差和1≤z≤2的发散方差。这些结果将混沌边缘中心极限行为扩展到标准(z=2)情况之外,并为变化最大阶的单峰映射提供了一个简单的预测律。
{"title":"Central limit behavior at the edge of chaos in the z-logistic map.","authors":"Abbas Ali Saberi, Ugur Tirnakli, Constantino Tsallis","doi":"10.1103/gtlz-67cf","DOIUrl":"https://doi.org/10.1103/gtlz-67cf","url":null,"abstract":"<p><p>We focus on the Feigenbaum-Coullet-Tresser point of the dissipative one-dimensional z-logistic map x_{t+1}=1-a|x_{t}|^{z}(z≥1). We show that sums of iterates converge to q-Gaussian distributions P_{q}(y)=P_{q}(0)exp_{q}(-β_{q}y^{2})=P_{q}(0)[1+(q-1)β_{q}y^{2}]^{1/(1-q)}(q≥1;β_{q}>0), which optimize the nonadditive entropic functional S_{q} under simple constraints. We propose and justify heuristically a closed-form prediction for the entropic index, q(z)=1+2/(z+1), and validate it numerically via data collapse for typical z values. The formula captures how the limiting law depends on the nonlinearity order and implies finite variance for z>2 and divergent variance for 1≤z≤2. These results extend edge-of-chaos central limit behavior beyond the standard (z=2) case and provide a simple predictive law for unimodal maps with varying maximum order.</p>","PeriodicalId":20085,"journal":{"name":"Physical review. E","volume":"112 6-1","pages":"064209"},"PeriodicalIF":2.4,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146012100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Physical review. E
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1