首页 > 最新文献

Computer Physics Communications最新文献

英文 中文
CoOMBE: A suite of open-source programs for the integration of the optical Bloch equations and Maxwell-Bloch equations CoOMBE:一套用于整合布洛赫光学方程和麦克斯韦-布洛赫方程的开源程序
IF 7.2 2区 物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-11 DOI: 10.1016/j.cpc.2024.109374
R.M. Potvliege, S.A. Wrathmall
<div><p>The programs described in this article and distributed with it aim (1) at integrating the optical Bloch equations governing the time evolution of the density matrix representing the quantum state of an atomic system driven by laser or microwave fields, and (2) at integrating the 1D Maxwell-Bloch equations for one or two laser fields co-propagating in an atomic vapour. The rotating wave approximation is assumed. These programs can also be used for more general quantum dynamical systems governed by the Lindblad master equation. They are written in Fortran 90; however, their use does not require any knowledge of Fortran programming. Methods for solving the optical Bloch equations in the rate equations limit, for calculating the steady-state density matrix and for formulating the optical Bloch equations in the weak probe approximation are also described.</p></div><div><h3>Program summary</h3><p><em>Program Title:</em> CoOMBE</p><p><em>CPC Library link to program files:</em> <span><span>https://doi.org/10.17632/5wsg9d52dk.1</span><svg><path></path></svg></span></p><p><em>Developers' repository link:</em> <span><span>https://github.com/durham-qlm/CoOMBE</span><svg><path></path></svg></span></p><p><em>Licensing provisions:</em> GPLv3</p><p><em>Programming language:</em> Fortran 90</p><p><em>Nature of problem:</em> The present programs can be used for the following operations: (1) Integrating the optical-Bloch equations within the rotating wave approximation for a multi-state atomic system. At the choice of the user, the calculation will return either the time-dependent density matrix at given times or the density matrix in the long time limit if the system evolves into a steady state in that limit. The calculation can be done with or without averaging over the thermal velocity distribution of the atoms. The number of atomic states which can be included in the calculation is limited only by the CPU time available and possibly by memory requirements. An arbitrarily large number of laser or microwave fields can be included in the calculation if these fields are all CW. This number is currently limited to one or two for fields that are not all CW. The calculation can be done in the weak probe approximation, or in the rate equations approximation, or without assuming either of these two approximations. Calculating refractive indexes, absorption coefficients and complex susceptibilities is also possible. (2) Integrating the 1D Maxwell-Bloch equations in the slowly varying envelope approximation for one or two fields co-propagating in a single-species atomic vapour. Although geared towards the case of atoms interacting with laser fields, this code can also be used for more general quantum systems with similar equations of motion (e.g., molecular systems, spin systems, etc.).</p><p><em>Solution method:</em> The Lindblad master equation is expressed as a system of homogeneous first order linear differential equations, which are transformed as required an
本文中描述的程序以及与之一起发布的程序旨在:(1)对代表激光或微波场驱动下原子系统量子态的密度矩阵的时间演化进行光学布洛赫方程积分;(2)对在原子蒸汽中共同传播的一个或两个激光场进行一维麦克斯韦-布洛赫方程积分。假设采用旋转波近似。这些程序也可用于林德布拉德主方程支配的更一般的量子动力学系统。这些程序是用 Fortran 90 编写的;不过,使用它们不需要任何 Fortran 编程知识。此外,还介绍了在速率方程极限中求解光学布洛赫方程、计算稳态密度矩阵以及在弱探针近似中拟定光学布洛赫方程的方法:CoOMBECPC 库与程序文件的链接:https://doi.org/10.17632/5wsg9d52dk.1Developers' 存储库链接:https://github.com/durham-qlm/CoOMBELicensing 规定:GPLv3 编程语言:问题性质:本程序可用于以下操作:(1)在旋转波近似中积分多态原子系统的光布洛赫方程。根据用户的选择,计算将返回给定时间内随时间变化的密度矩阵,或者返回长时限内的密度矩阵(如果系统在长时限内进入稳态)。计算时可以对原子的热速度分布进行平均,也可以不进行平均。计算中可包含的原子态数量仅受 CPU 可用时间的限制,也可能受内存要求的限制。如果激光或微波场都是连续波场,那么计算中可以包含任意数量的激光或微波场。目前,对于不全是 CW 的场,这一数量仅限于一个或两个。计算可以采用弱探针近似法或速率方程近似法,也可以不采用这两种近似法。也可以计算折射率、吸收系数和复感性。(2) 在缓慢变化的包络近似中积分一维麦克斯韦-布洛赫方程,用于单种原子蒸汽中一个或两个场的共传播。虽然该代码针对的是原子与激光场相互作用的情况,但也可用于具有类似运动方程的更一般量子系统(如分子系统、自旋系统等):林德布拉德主方程表示为一个均质一阶线性微分方程系统,根据需要对其进行变换和求解,以获得代表原子系统状态的密度矩阵。为此提供了多种方法。在计算介质的极化时,也采用了同样的方法对麦克斯韦-布洛赫方程进行积分。后者使用预测器-校正器方法进行空间积分。该程序库包含一个通用驱动程序,无需额外开发程序即可使用这些代码。发行版还包括使用容器运行这些程序的示例,而无需预装 Fortran 编译器。
{"title":"CoOMBE: A suite of open-source programs for the integration of the optical Bloch equations and Maxwell-Bloch equations","authors":"R.M. Potvliege,&nbsp;S.A. Wrathmall","doi":"10.1016/j.cpc.2024.109374","DOIUrl":"10.1016/j.cpc.2024.109374","url":null,"abstract":"&lt;div&gt;&lt;p&gt;The programs described in this article and distributed with it aim (1) at integrating the optical Bloch equations governing the time evolution of the density matrix representing the quantum state of an atomic system driven by laser or microwave fields, and (2) at integrating the 1D Maxwell-Bloch equations for one or two laser fields co-propagating in an atomic vapour. The rotating wave approximation is assumed. These programs can also be used for more general quantum dynamical systems governed by the Lindblad master equation. They are written in Fortran 90; however, their use does not require any knowledge of Fortran programming. Methods for solving the optical Bloch equations in the rate equations limit, for calculating the steady-state density matrix and for formulating the optical Bloch equations in the weak probe approximation are also described.&lt;/p&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Program summary&lt;/h3&gt;&lt;p&gt;&lt;em&gt;Program Title:&lt;/em&gt; CoOMBE&lt;/p&gt;&lt;p&gt;&lt;em&gt;CPC Library link to program files:&lt;/em&gt; &lt;span&gt;&lt;span&gt;https://doi.org/10.17632/5wsg9d52dk.1&lt;/span&gt;&lt;svg&gt;&lt;path&gt;&lt;/path&gt;&lt;/svg&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;em&gt;Developers' repository link:&lt;/em&gt; &lt;span&gt;&lt;span&gt;https://github.com/durham-qlm/CoOMBE&lt;/span&gt;&lt;svg&gt;&lt;path&gt;&lt;/path&gt;&lt;/svg&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;em&gt;Licensing provisions:&lt;/em&gt; GPLv3&lt;/p&gt;&lt;p&gt;&lt;em&gt;Programming language:&lt;/em&gt; Fortran 90&lt;/p&gt;&lt;p&gt;&lt;em&gt;Nature of problem:&lt;/em&gt; The present programs can be used for the following operations: (1) Integrating the optical-Bloch equations within the rotating wave approximation for a multi-state atomic system. At the choice of the user, the calculation will return either the time-dependent density matrix at given times or the density matrix in the long time limit if the system evolves into a steady state in that limit. The calculation can be done with or without averaging over the thermal velocity distribution of the atoms. The number of atomic states which can be included in the calculation is limited only by the CPU time available and possibly by memory requirements. An arbitrarily large number of laser or microwave fields can be included in the calculation if these fields are all CW. This number is currently limited to one or two for fields that are not all CW. The calculation can be done in the weak probe approximation, or in the rate equations approximation, or without assuming either of these two approximations. Calculating refractive indexes, absorption coefficients and complex susceptibilities is also possible. (2) Integrating the 1D Maxwell-Bloch equations in the slowly varying envelope approximation for one or two fields co-propagating in a single-species atomic vapour. Although geared towards the case of atoms interacting with laser fields, this code can also be used for more general quantum systems with similar equations of motion (e.g., molecular systems, spin systems, etc.).&lt;/p&gt;&lt;p&gt;&lt;em&gt;Solution method:&lt;/em&gt; The Lindblad master equation is expressed as a system of homogeneous first order linear differential equations, which are transformed as required an","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"306 ","pages":"Article 109374"},"PeriodicalIF":7.2,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0010465524002972/pdfft?md5=08cacafc480e8377627083ad6c353bac&pid=1-s2.0-S0010465524002972-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142243054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A quantum algorithm for the lattice-Boltzmann method advection-diffusion equation 晶格-玻尔兹曼法平流扩散方程的量子算法
IF 7.2 2区 物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-10 DOI: 10.1016/j.cpc.2024.109373
David Wawrzyniak , Josef Winter , Steffen Schmidt , Thomas Indinger , Christian F. Janßen , Uwe Schramm , Nikolaus A. Adams

We present a versatile and efficient quantum algorithm based on the Lattice Boltzmann method (LBM) approximate solution of the linear advection-diffusion equation (ADE). We emphasize that the LBM approximation modifies the diffusion term of the underlying exact ADE and leads to a modified equation (mADE). Due to its versatility in terms of operator splitting, the proposed quantum LBM algorithm for the mADE provides a building block for future quantum algorithms to solve the linearized Navier-Stokes equation on quantum computers. We split the algorithm into four operations: initialization, collision, streaming, and calculation of the macroscopic quantities. We propose general quantum building blocks for each operator, which adapt intrinsically from the general three-dimensional case to smaller dimensions and apply to arbitrary lattice-velocity sets. Based on (sub-linear) amplitude data encoding, we propose improved initialization and collision operations with reduced complexity and efficient sampling-based simulation. Quantum streaming algorithms are based on previous developments. The proposed quantum algorithm allows for the computation of successive time steps, requiring full state measurement and reinitialization after every time step. It is validated by comparison with a digital implementation and based on analytical solutions in one and two dimensions. Furthermore, we demonstrate the versatility of the quantum algorithm for two cases with non-uniform advection velocities in two and three dimensions. Various velocity sets are considered to further highlight the flexibility of the algorithm. We benchmark our optimized quantum algorithm against previous methods employed in sampling-based quantum simulators. We demonstrate sampling efficiency, with sampling accelerated convergence requiring fewer shots.

我们提出了一种基于晶格玻尔兹曼法(LBM)近似解线性平流扩散方程(ADE)的多功能高效量子算法。我们强调,LBM 近似方法修改了底层精确 ADE 的扩散项,并导致一个修正方程 (mADE)。由于其在算子拆分方面的多功能性,针对 mADE 提出的量子 LBM 算法为未来在量子计算机上求解线性化纳维-斯托克斯方程的量子算法提供了一个基石。我们将算法分为四个操作:初始化、碰撞、流和宏观量的计算。我们为每个算子提出了通用量子构件,这些构件从一般三维情况本质上适应于更小的维度,并适用于任意晶格速度集。基于(亚线性)振幅数据编码,我们提出了改进的初始化和碰撞操作,降低了复杂性,并实现了基于采样的高效模拟。量子流算法基于之前的发展。所提出的量子算法允许计算连续的时间步长,要求在每个时间步长后进行完整的状态测量和重新初始化。通过与数字实现的比较,并基于一维和二维的分析解,我们对该算法进行了验证。此外,我们还展示了量子算法在二维和三维非均匀平流速度两种情况下的多功能性。为了进一步突出算法的灵活性,我们考虑了各种速度集。我们将优化后的量子算法与之前基于采样的量子模拟器所采用的方法进行对比。我们证明了采样效率,采样加速收敛所需的次数更少。
{"title":"A quantum algorithm for the lattice-Boltzmann method advection-diffusion equation","authors":"David Wawrzyniak ,&nbsp;Josef Winter ,&nbsp;Steffen Schmidt ,&nbsp;Thomas Indinger ,&nbsp;Christian F. Janßen ,&nbsp;Uwe Schramm ,&nbsp;Nikolaus A. Adams","doi":"10.1016/j.cpc.2024.109373","DOIUrl":"10.1016/j.cpc.2024.109373","url":null,"abstract":"<div><p>We present a versatile and efficient quantum algorithm based on the Lattice Boltzmann method (LBM) approximate solution of the linear advection-diffusion equation (ADE). We emphasize that the LBM approximation modifies the diffusion term of the underlying exact ADE and leads to a modified equation (mADE). Due to its versatility in terms of operator splitting, the proposed quantum LBM algorithm for the mADE provides a building block for future quantum algorithms to solve the linearized Navier-Stokes equation on quantum computers. We split the algorithm into four operations: initialization, collision, streaming, and calculation of the macroscopic quantities. We propose general quantum building blocks for each operator, which adapt intrinsically from the general three-dimensional case to smaller dimensions and apply to arbitrary lattice-velocity sets. Based on (sub-linear) amplitude data encoding, we propose improved initialization and collision operations with reduced complexity and efficient sampling-based simulation. Quantum streaming algorithms are based on previous developments. The proposed quantum algorithm allows for the computation of successive time steps, requiring full state measurement and reinitialization after every time step. It is validated by comparison with a digital implementation and based on analytical solutions in one and two dimensions. Furthermore, we demonstrate the versatility of the quantum algorithm for two cases with non-uniform advection velocities in two and three dimensions. Various velocity sets are considered to further highlight the flexibility of the algorithm. We benchmark our optimized quantum algorithm against previous methods employed in sampling-based quantum simulators. We demonstrate sampling efficiency, with sampling accelerated convergence requiring fewer shots.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"306 ","pages":"Article 109373"},"PeriodicalIF":7.2,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0010465524002960/pdfft?md5=e4e628fb077f6863c2f65ab258218bf8&pid=1-s2.0-S0010465524002960-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142229068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MaRTIn – Massive Recursive Tensor Integration MaRTIn - 大规模递归张量集成
IF 7.2 2区 物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-10 DOI: 10.1016/j.cpc.2024.109372
Joachim Brod , Lorenz Hüdepohl , Emmanuel Stamou , Tom Steudtner

We present MaRTIn, an extendable all-in-one package for calculating amplitudes up to two loops in an expansion in external momenta or using the method of infrared rearrangement. Renormalisable and non-renormalisable models can be supplied by the user; an implementation of the Standard Model is included in the package. In this manual, we discuss the scope and functionality of the software, and give instructions of its use.

我们介绍的 MaRTIn 是一个可扩展的多合一软件包,用于计算外部力矩展开或使用红外重排方法中最多两个环的振幅。用户可以提供可重正化和不可重正化的模型;软件包中包括标准模型的实现。在本手册中,我们将讨论软件的范围和功能,并给出使用说明。
{"title":"MaRTIn – Massive Recursive Tensor Integration","authors":"Joachim Brod ,&nbsp;Lorenz Hüdepohl ,&nbsp;Emmanuel Stamou ,&nbsp;Tom Steudtner","doi":"10.1016/j.cpc.2024.109372","DOIUrl":"10.1016/j.cpc.2024.109372","url":null,"abstract":"<div><p>We present <span>MaRTIn</span>, an extendable all-in-one package for calculating amplitudes up to two loops in an expansion in external momenta or using the method of infrared rearrangement. Renormalisable and non-renormalisable models can be supplied by the user; an implementation of the Standard Model is included in the package. In this manual, we discuss the scope and functionality of the software, and give instructions of its use.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"306 ","pages":"Article 109372"},"PeriodicalIF":7.2,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0010465524002959/pdfft?md5=0ae4f72d734fc596bec3b0c120968a3d&pid=1-s2.0-S0010465524002959-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142243050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GPU-enabled extreme-scale turbulence simulations: Fourier pseudo-spectral algorithms at the exascale using OpenMP offloading GPU 支持的极端尺度湍流模拟:使用 OpenMP 卸载在 exascale 上运行傅立叶伪光谱算法
IF 7.2 2区 物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-05 DOI: 10.1016/j.cpc.2024.109364
P.K. Yeung , Kiran Ravikumar , Stephen Nichols , Rohini Uma-Vaideswaran
<div><p>Fourier pseudo-spectral methods for nonlinear partial differential equations are of wide interest in many areas of advanced computational science, including direct numerical simulation of three-dimensional (3-D) turbulence governed by the Navier-Stokes equations in fluid dynamics. This paper presents a new capability for simulating turbulence at a new record resolution up to 35 trillion grid points, on the world's first exascale computer, <em>Frontier</em>, comprising AMD MI250x GPUs with HPE's Slingshot interconnect and operated by the US Department of Energy's Oak Ridge Leadership Computing Facility (OLCF). Key programming strategies designed to take maximum advantage of the machine architecture involve performing almost all computations on the GPU which has the same memory capacity as the CPU, performing all-to-all communication among sets of parallel processes directly on the GPU, and targeting GPUs efficiently using OpenMP offloading for intensive number-crunching including 1-D Fast Fourier Transforms (FFT) performed using AMD ROCm library calls. With 99% of computing power on Frontier being on the GPU, leaving the CPU idle leads to a net performance gain via avoiding the overhead of data movement between host and device except when needed for some I/O purposes. Memory footprint including the size of communication buffers for MPI_ALLTOALL is managed carefully to maximize the largest problem size possible for a given node count.</p><p>Detailed performance data including separate contributions from different categories of operations to the elapsed wall time per step are reported for five grid resolutions, from 2048<sup>3</sup> on a single node to 32768<sup>3</sup> on 4096 or 8192 nodes out of 9408 on the system. Both 1D and 2D domain decompositions which divide a 3D periodic domain into slabs and pencils respectively are implemented. The present code suite (labeled by the acronym GESTS, GPUs for Extreme Scale Turbulence Simulations) achieves a figure of merit (in grid points per second) exceeding goals set in the Center for Accelerated Application Readiness (CAAR) program for Frontier. The performance attained is highly favorable in both weak scaling and strong scaling, with notable departures only for 2048<sup>3</sup> where communication is entirely intra-node, and for 32768<sup>3</sup>, where a challenge due to small message sizes does arise. Communication performance is addressed further using a lightweight test code that performs all-to-all communication in a manner matching the full turbulence simulation code. Performance at large problem sizes is affected by both small message size due to high node counts as well as dragonfly network topology features on the machine, but is consistent with official expectations of sustained performance on Frontier. Overall, although not perfect, the scalability achieved at the extreme problem size of 32768<sup>3</sup> (and up to 8192 nodes — which corresponds to hardware rated at just under 1 exa
用于非线性偏微分方程的傅立叶伪谱方法在先进计算科学的许多领域都受到广泛关注,其中包括流体动力学中纳维-斯托克斯方程控制的三维(3-D)湍流的直接数值模拟。本文介绍了在世界首台超大规模计算机 Frontier 上以创纪录的 35 万亿网格点分辨率模拟湍流的新功能,Frontier 由 AMD MI250x GPU 和 HPE 的 Slingshot 互连组成,由美国能源部橡树岭领先计算设施 (OLCF) 负责运行。为最大限度地利用计算机架构而设计的主要编程策略包括:在 GPU 上执行几乎所有计算(GPU 具有与 CPU 相同的内存容量);直接在 GPU 上执行并行进程集之间的全对全通信;使用 OpenMP 卸载高效地针对 GPU 进行密集型数字运算,包括使用 AMD ROCm 库调用执行的一维快速傅立叶变换 (FFT)。由于 Frontier 上 99% 的计算能力都在 GPU 上,CPU 闲置可避免主机和设备之间的数据移动开销(某些 I/O 用途需要时除外),从而实现净性能提升。详细的性能数据(包括不同类别的操作对每一步所耗费的壁时间的贡献)针对五种网格分辨率进行了报告,从单个节点上的 20483 到 4096 节点上的 327683 或系统上 9408 个节点中的 8192 个。实现了一维和二维域分解,分别将三维周期域划分为板块和铅笔。本代码套件(缩写为 GESTS,GPUs for Extreme Scale Turbulence Simulations)的性能指标(每秒网格点数)超过了加速应用准备中心(CAAR)为前沿计划设定的目标。无论是弱扩展还是强扩展,所取得的性能都非常出色,只有 20483 和 327683 的性能存在明显差异,前者完全是节点内通信,而后者则因信息规模较小而面临挑战。使用轻量级测试代码进一步解决了通信性能问题,该代码以与完整湍流模拟代码相匹配的方式执行全对全通信。大问题规模下的性能会受到高节点数导致的小信息量以及机器上蜻蜓网络拓扑特征的影响,但与官方对 Frontier 持续性能的预期一致。总体而言,尽管并不完美,但在 327683(最多 8192 个节点--相当于理论峰值计算性能略低于 1 exaflop/秒的额定硬件)的极端问题规模下实现的可扩展性,可以说优于在 OLCF 的 Frontier 前代机器(Summit)上使用先前最先进算法观察到的可扩展性。在不久的将来,将单独报告利用该代码及其扩展功能研究湍流间歇性的新科学成果。
{"title":"GPU-enabled extreme-scale turbulence simulations: Fourier pseudo-spectral algorithms at the exascale using OpenMP offloading","authors":"P.K. Yeung ,&nbsp;Kiran Ravikumar ,&nbsp;Stephen Nichols ,&nbsp;Rohini Uma-Vaideswaran","doi":"10.1016/j.cpc.2024.109364","DOIUrl":"10.1016/j.cpc.2024.109364","url":null,"abstract":"&lt;div&gt;&lt;p&gt;Fourier pseudo-spectral methods for nonlinear partial differential equations are of wide interest in many areas of advanced computational science, including direct numerical simulation of three-dimensional (3-D) turbulence governed by the Navier-Stokes equations in fluid dynamics. This paper presents a new capability for simulating turbulence at a new record resolution up to 35 trillion grid points, on the world's first exascale computer, &lt;em&gt;Frontier&lt;/em&gt;, comprising AMD MI250x GPUs with HPE's Slingshot interconnect and operated by the US Department of Energy's Oak Ridge Leadership Computing Facility (OLCF). Key programming strategies designed to take maximum advantage of the machine architecture involve performing almost all computations on the GPU which has the same memory capacity as the CPU, performing all-to-all communication among sets of parallel processes directly on the GPU, and targeting GPUs efficiently using OpenMP offloading for intensive number-crunching including 1-D Fast Fourier Transforms (FFT) performed using AMD ROCm library calls. With 99% of computing power on Frontier being on the GPU, leaving the CPU idle leads to a net performance gain via avoiding the overhead of data movement between host and device except when needed for some I/O purposes. Memory footprint including the size of communication buffers for MPI_ALLTOALL is managed carefully to maximize the largest problem size possible for a given node count.&lt;/p&gt;&lt;p&gt;Detailed performance data including separate contributions from different categories of operations to the elapsed wall time per step are reported for five grid resolutions, from 2048&lt;sup&gt;3&lt;/sup&gt; on a single node to 32768&lt;sup&gt;3&lt;/sup&gt; on 4096 or 8192 nodes out of 9408 on the system. Both 1D and 2D domain decompositions which divide a 3D periodic domain into slabs and pencils respectively are implemented. The present code suite (labeled by the acronym GESTS, GPUs for Extreme Scale Turbulence Simulations) achieves a figure of merit (in grid points per second) exceeding goals set in the Center for Accelerated Application Readiness (CAAR) program for Frontier. The performance attained is highly favorable in both weak scaling and strong scaling, with notable departures only for 2048&lt;sup&gt;3&lt;/sup&gt; where communication is entirely intra-node, and for 32768&lt;sup&gt;3&lt;/sup&gt;, where a challenge due to small message sizes does arise. Communication performance is addressed further using a lightweight test code that performs all-to-all communication in a manner matching the full turbulence simulation code. Performance at large problem sizes is affected by both small message size due to high node counts as well as dragonfly network topology features on the machine, but is consistent with official expectations of sustained performance on Frontier. Overall, although not perfect, the scalability achieved at the extreme problem size of 32768&lt;sup&gt;3&lt;/sup&gt; (and up to 8192 nodes — which corresponds to hardware rated at just under 1 exa","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"306 ","pages":"Article 109364"},"PeriodicalIF":7.2,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142163977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Universally Adaptable Multiscale Molecular Dynamics (UAMMD). A native-GPU software ecosystem for complex fluids, soft matter, and beyond 通用适应性多尺度分子动力学 (UAMMD)。用于复杂流体、软物质及其他领域的本地 GPU 软件生态系统
IF 7.2 2区 物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-04 DOI: 10.1016/j.cpc.2024.109363
Raúl P. Peláez , Pablo Ibáñez-Freire , Pablo Palacios-Alonso , Aleksandar Donev , Rafael Delgado-Buscalioni
<div><p>We introduce UAMMD (Universally Adaptable Multiscale Molecular Dynamics), a novel software infrastructure tailored for mesoscale complex fluid simulations on GPUs. The UAMMD library encompasses a comprehensive range of computational schemes optimized for the GPU, spanning from molecular dynamics to immersed boundary fluctuating-hydrodynamics. Developed in CUDA/C++14, this header-only open-source software serves both as a simulation engine and as a library with a modular architecture, offering a vast array of independent modules, categorized as <em>interactors</em> (neighbor search, bonded, non-bonded and electrostatic interactions, etc.) and <em>integrators</em> (molecular dynamics, dissipative particle dynamics, smooth particle hydrodynamics, Brownian hydrodynamics and a rather complete array of Immersed Boundary -IB- schemes). UAMMD excels in schemes that couple particle-based elastic structures with continuum fields in different regions of the mesoscale. To that end, thermal fluctuations can be added in physically consistent ways, and fast modes can be eliminated to adapt UAMMD to different regimes (compressible or incompressible flow, inertial or Stokesian dynamics, etc.). Thus, UAMMD is extremely useful for coarse-grained simulations of nanoparticles, and soft and biological matter (from proteins to viruses and micro-swimmers). Importantly, all UAMMD developments are hand-to-hand validated against experimental techniques, and it has proven to <em>quantitatively</em> reproduce experimental signals from quartz-crystal microbalance, atomic force microscopy, magnetic sensors, optic-matter interaction and ultrasound.</p></div><div><h3>Program summary</h3><p><em>Program Title:</em> UAMMD</p><p><em>CPC Library link to program files:</em> <span><span>https://doi.org/10.17632/srrt2y5s4m.1</span><svg><path></path></svg></span></p><p><em>Developer's repository link:</em> <span><span>https://github.com/RaulPPelaez/UAMMD/</span><svg><path></path></svg></span></p><p><em>Licensing provisions:</em> GPLv3</p><p><em>Programming language:</em> C++/CUDA</p><p><em>Nature of problem:</em> The key problem addressed in computational physics is simulating the behavior of matter at various scales, encompassing both discrete (particle-based) and continuum (field-based) approaches. The challenge lies in accurately and efficiently modeling interactions at different spatio-temporal scales, ranging from atomic (microscopic) to fluid dynamics (macroscopic). This complexity is further amplified in mesoscale regions, where different physics domains intersect, necessitating advanced computational techniques to capture the nuanced dynamics of systems such as colloids, polymers, and biological structures.</p><p><em>Solution method:</em> The present solution consists in the creation of UAMMD (Universally Adaptable Multiscale Molecular Dynamics), a CUDA/C++14 library designed for GPU-accelerated complex fluid simulations. UAMMD offers a flexible platform that integrates d
我们介绍了 UAMMD(通用可适应多尺度分子动力学),这是一种专为 GPU 中尺度复杂流体模拟定制的新型软件基础设施。UAMMD 库包含一系列针对 GPU 优化的计算方案,从分子动力学到浸没边界波动流体力学。该开源软件采用 CUDA/C++14 开发,既是一个仿真引擎,也是一个模块化架构的库,提供大量独立模块,分为相互作用器(邻域搜索、键合、非键合和静电相互作用等)和积分器(分子动力学、耗散粒子动力学、平滑粒子流体力学、布朗流体力学和相当完整的沉浸边界-IB-方案)。UAMMD 擅长将基于粒子的弹性结构与中尺度不同区域的连续场耦合在一起的方案。为此,可以以物理上一致的方式添加热波动,并消除快速模式,使 UAMMD 适应不同状态(可压缩或不可压缩流、惯性或斯托克斯动力学等)。因此,UAMMD 对于纳米粒子、软物质和生物物质(从蛋白质到病毒和微游泳者)的粗粒度模拟非常有用。重要的是,UAMMD 的所有开发成果都经过了实验技术的手把手验证,事实证明它可以定量再现石英晶体微天平、原子力显微镜、磁传感器、光物质相互作用和超声波的实验信号:UAMMDCPC 库程序文件链接:https://doi.org/10.17632/srrt2y5s4m.1Developer's 资源库链接:https://github.com/RaulPPelaez/UAMMD/Licensing 规定:GPLv3 编程语言问题性质:计算物理学中的关键问题是模拟各种尺度的物质行为,包括离散(基于粒子)和连续(基于场)方法。挑战在于如何准确、高效地模拟从原子(微观)到流体动力学(宏观)等不同时空尺度的相互作用。这种复杂性在不同物理领域交叉的中尺度区域进一步放大,需要先进的计算技术来捕捉胶体、聚合物和生物结构等系统的细微动态:本解决方案包括创建 UAMMD(Universally Adaptable Multiscale Molecular Dynamics),这是一个专为 GPU 加速复杂流体模拟而设计的 CUDA/C++14 库。UAMMD 提供了一个灵活的平台,将离散粒子动力学与连续流体动力学整合在一起。它支持多种计算方案,每种方案都针对特定的时空机制。该库的模块化架构可无缝引入新算法,并轻松集成到现有代码库中:UAMMD 的设计强调模块化和 GPU 原生架构,优化了计算效率和灵活性。然而,UAMMD 专注于 GPU 加速和低级特性,这意味着它需要兼容的硬件和熟悉 CUDA 编程。虽然 UAMMD 在处理各种物理机制方面具有多样性,但它目前缺乏某些标准力场势能和多 GPU 支持。尽管如此,UAMMD 的持续开发和开源特性保证了它的不断改进。
{"title":"Universally Adaptable Multiscale Molecular Dynamics (UAMMD). A native-GPU software ecosystem for complex fluids, soft matter, and beyond","authors":"Raúl P. Peláez ,&nbsp;Pablo Ibáñez-Freire ,&nbsp;Pablo Palacios-Alonso ,&nbsp;Aleksandar Donev ,&nbsp;Rafael Delgado-Buscalioni","doi":"10.1016/j.cpc.2024.109363","DOIUrl":"10.1016/j.cpc.2024.109363","url":null,"abstract":"&lt;div&gt;&lt;p&gt;We introduce UAMMD (Universally Adaptable Multiscale Molecular Dynamics), a novel software infrastructure tailored for mesoscale complex fluid simulations on GPUs. The UAMMD library encompasses a comprehensive range of computational schemes optimized for the GPU, spanning from molecular dynamics to immersed boundary fluctuating-hydrodynamics. Developed in CUDA/C++14, this header-only open-source software serves both as a simulation engine and as a library with a modular architecture, offering a vast array of independent modules, categorized as &lt;em&gt;interactors&lt;/em&gt; (neighbor search, bonded, non-bonded and electrostatic interactions, etc.) and &lt;em&gt;integrators&lt;/em&gt; (molecular dynamics, dissipative particle dynamics, smooth particle hydrodynamics, Brownian hydrodynamics and a rather complete array of Immersed Boundary -IB- schemes). UAMMD excels in schemes that couple particle-based elastic structures with continuum fields in different regions of the mesoscale. To that end, thermal fluctuations can be added in physically consistent ways, and fast modes can be eliminated to adapt UAMMD to different regimes (compressible or incompressible flow, inertial or Stokesian dynamics, etc.). Thus, UAMMD is extremely useful for coarse-grained simulations of nanoparticles, and soft and biological matter (from proteins to viruses and micro-swimmers). Importantly, all UAMMD developments are hand-to-hand validated against experimental techniques, and it has proven to &lt;em&gt;quantitatively&lt;/em&gt; reproduce experimental signals from quartz-crystal microbalance, atomic force microscopy, magnetic sensors, optic-matter interaction and ultrasound.&lt;/p&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Program summary&lt;/h3&gt;&lt;p&gt;&lt;em&gt;Program Title:&lt;/em&gt; UAMMD&lt;/p&gt;&lt;p&gt;&lt;em&gt;CPC Library link to program files:&lt;/em&gt; &lt;span&gt;&lt;span&gt;https://doi.org/10.17632/srrt2y5s4m.1&lt;/span&gt;&lt;svg&gt;&lt;path&gt;&lt;/path&gt;&lt;/svg&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;em&gt;Developer's repository link:&lt;/em&gt; &lt;span&gt;&lt;span&gt;https://github.com/RaulPPelaez/UAMMD/&lt;/span&gt;&lt;svg&gt;&lt;path&gt;&lt;/path&gt;&lt;/svg&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;em&gt;Licensing provisions:&lt;/em&gt; GPLv3&lt;/p&gt;&lt;p&gt;&lt;em&gt;Programming language:&lt;/em&gt; C++/CUDA&lt;/p&gt;&lt;p&gt;&lt;em&gt;Nature of problem:&lt;/em&gt; The key problem addressed in computational physics is simulating the behavior of matter at various scales, encompassing both discrete (particle-based) and continuum (field-based) approaches. The challenge lies in accurately and efficiently modeling interactions at different spatio-temporal scales, ranging from atomic (microscopic) to fluid dynamics (macroscopic). This complexity is further amplified in mesoscale regions, where different physics domains intersect, necessitating advanced computational techniques to capture the nuanced dynamics of systems such as colloids, polymers, and biological structures.&lt;/p&gt;&lt;p&gt;&lt;em&gt;Solution method:&lt;/em&gt; The present solution consists in the creation of UAMMD (Universally Adaptable Multiscale Molecular Dynamics), a CUDA/C++14 library designed for GPU-accelerated complex fluid simulations. UAMMD offers a flexible platform that integrates d","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"306 ","pages":"Article 109363"},"PeriodicalIF":7.2,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142163979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TorchDA: A Python package for performing data assimilation with deep learning forward and transformation functions TorchDA:利用深度学习前向和转换函数执行数据同化的 Python 软件包
IF 7.2 2区 物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-04 DOI: 10.1016/j.cpc.2024.109359
Sibo Cheng , Jinyang Min , Che Liu , Rossella Arcucci

Data assimilation techniques are often confronted with challenges handling complex high dimensional physical systems, because high precision simulation in complex high dimensional physical systems is computationally expensive and the exact observation functions that can be applied in these systems are difficult to obtain. It prompts growing interest in integrating deep learning models within data assimilation workflows, but current software packages for data assimilation cannot handle deep learning models inside. This study presents a novel Python package seamlessly combining data assimilation with deep neural networks to serve as models for state transition and observation functions. The package, named TorchDA, implements Kalman Filter, Ensemble Kalman Filter (EnKF), 3D Variational (3DVar), and 4D Variational (4DVar) algorithms, allowing flexible algorithm selection based on application requirements. Comprehensive experiments conducted on the Lorenz 63 and a two-dimensional shallow water system demonstrate significantly enhanced performance over standalone model predictions without assimilation. The shallow water analysis validates data assimilation capabilities mapping between different physical quantity spaces in either full space or reduced order space. Overall, this innovative software package enables flexible integration of deep learning representations within data assimilation, conferring a versatile tool to tackle complex high dimensional dynamical systems across scientific domains.

Program summary

Program Title: TorchDA

CPC Library link to program files: https://doi.org/10.17632/bm5d7xk6gw.1

Developer's repository link: https://github.com/acse-jm122/torchda

Licensing provisions: GNU General Public License version 3

Programming language: Python3

External routines/libraries: Pytorch.

Nature of problem: Deep learning has recently emerged as a potent tool for establishing data-driven predictive and observation functions within data assimilation workflows. Existing data assimilation tools like OpenDA and ADAO are not well-suited for handling predictive and observation models represented by deep neural networks. This gap necessitates the development of a comprehensive package that harmonizes deep learning and data assimilation.

Solution method: This project introduces TorchDA, a novel computational tool based on the PyTorch framework, addressing the challenges posed by predictive and observation functions represented by deep neural networks. It enables users to train their custom neural networks and effortlessly incorporate them into data assimilation processes. This integration facilitates the incorporation of real-time observational data in both full and reduced physical spaces.

数据同化技术在处理复杂的高维物理系统时往往面临挑战,因为复杂的高维物理系统的高精度模拟计算成本高昂,而且难以获得可用于这些系统的精确观测函数。这促使人们越来越关注在数据同化工作流程中集成深度学习模型,但目前的数据同化软件包无法在内部处理深度学习模型。本研究提出了一个新颖的 Python 软件包,将数据同化与深度神经网络无缝结合,作为状态转换和观测函数的模型。该软件包名为 TorchDA,实现了卡尔曼滤波、集合卡尔曼滤波(EnKF)、三维变分(3DVar)和四维变分(4DVar)算法,可根据应用需求灵活选择算法。在洛伦兹 63 和二维浅水系统上进行的综合实验表明,与没有同化的独立模型预测相比,该算法的性能显著提高。浅水分析验证了数据同化功能在全空间或缩减阶空间不同物理量空间之间的映射。总之,这一创新软件包能够在数据同化中灵活集成深度学习表示法,为解决跨科学领域的复杂高维动态系统问题提供了一个多功能工具:TorchDACPC Library 程序文件链接:https://doi.org/10.17632/bm5d7xk6gw.1Developer's repository 链接:https://github.com/acse-jm122/torchdaLicensing provisions:GNU 通用公共许可证版本 3编程语言:Python3外部例程/库:Pytorch.问题性质:深度学习最近已成为在数据同化工作流程中建立数据驱动的预测和观测功能的有力工具。现有的数据同化工具(如 OpenDA 和 ADAO)并不适合处理以深度神经网络为代表的预测和观测模型。因此,有必要开发一个能协调深度学习和数据同化的综合软件包:本项目介绍了基于PyTorch框架的新型计算工具TorchDA,以应对深度神经网络所代表的预测和观测功能所带来的挑战。它使用户能够训练自己定制的神经网络,并毫不费力地将其纳入数据同化过程。这种整合有助于将实时观测数据纳入完整和缩小的物理空间。
{"title":"TorchDA: A Python package for performing data assimilation with deep learning forward and transformation functions","authors":"Sibo Cheng ,&nbsp;Jinyang Min ,&nbsp;Che Liu ,&nbsp;Rossella Arcucci","doi":"10.1016/j.cpc.2024.109359","DOIUrl":"10.1016/j.cpc.2024.109359","url":null,"abstract":"<div><p>Data assimilation techniques are often confronted with challenges handling complex high dimensional physical systems, because high precision simulation in complex high dimensional physical systems is computationally expensive and the exact observation functions that can be applied in these systems are difficult to obtain. It prompts growing interest in integrating deep learning models within data assimilation workflows, but current software packages for data assimilation cannot handle deep learning models inside. This study presents a novel Python package seamlessly combining data assimilation with deep neural networks to serve as models for state transition and observation functions. The package, named TorchDA, implements Kalman Filter, Ensemble Kalman Filter (EnKF), 3D Variational (3DVar), and 4D Variational (4DVar) algorithms, allowing flexible algorithm selection based on application requirements. Comprehensive experiments conducted on the Lorenz 63 and a two-dimensional shallow water system demonstrate significantly enhanced performance over standalone model predictions without assimilation. The shallow water analysis validates data assimilation capabilities mapping between different physical quantity spaces in either full space or reduced order space. Overall, this innovative software package enables flexible integration of deep learning representations within data assimilation, conferring a versatile tool to tackle complex high dimensional dynamical systems across scientific domains.</p></div><div><h3>Program summary</h3><p><em>Program Title:</em> TorchDA</p><p><em>CPC Library link to program files:</em> <span><span>https://doi.org/10.17632/bm5d7xk6gw.1</span><svg><path></path></svg></span></p><p><em>Developer's repository link:</em> <span><span>https://github.com/acse-jm122/torchda</span><svg><path></path></svg></span></p><p><em>Licensing provisions:</em> GNU General Public License version 3</p><p><em>Programming language:</em> Python3</p><p><em>External routines/libraries:</em> Pytorch.</p><p><em>Nature of problem:</em> Deep learning has recently emerged as a potent tool for establishing data-driven predictive and observation functions within data assimilation workflows. Existing data assimilation tools like OpenDA and ADAO are not well-suited for handling predictive and observation models represented by deep neural networks. This gap necessitates the development of a comprehensive package that harmonizes deep learning and data assimilation.</p><p><em>Solution method:</em> This project introduces TorchDA, a novel computational tool based on the PyTorch framework, addressing the challenges posed by predictive and observation functions represented by deep neural networks. It enables users to train their custom neural networks and effortlessly incorporate them into data assimilation processes. This integration facilitates the incorporation of real-time observational data in both full and reduced physical spaces.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"306 ","pages":"Article 109359"},"PeriodicalIF":7.2,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0010465524002820/pdfft?md5=cf361b3ec606c0abf6dee1adecdebe6d&pid=1-s2.0-S0010465524002820-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142148822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hybrid Eulerian-Lagrangian Vlasov method for nonlinear wave-particle interaction in weakly inhomogeneous magnetic field 弱不均匀磁场中非线性波粒相互作用的欧拉-拉格朗日混合 Vlasov 方法
IF 7.2 2区 物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-03 DOI: 10.1016/j.cpc.2024.109362
Jiangshan Zheng , Ge Wang , Bo Li

We present a hybrid Eulerian-Lagrangian (HEL) Vlasov method for nonlinear resonant wave-particle interactions in weakly inhomogeneous magnetic field. The governing Vlasov equation is derived from a recently proposed resonance tracking Hamiltonian theory. It gives the evolution of the distribution function with a scale-separated Hamiltonian that contains the fast-varying coherent wave-particle interaction and slowly-varying motion about the resonance frame of reference. The hybrid scheme solves the fast-varying phase space evolution on Eulerian grid with an adaptive time step and then advances the slowly-varying dynamics by Lagrangian method along the resonance trajectory. We apply the HEL method to study the frequency chirping of whistler-mode chorus wave in the magnetosphere and the self-consistent simulations reproduce the chirping chorus wave and give high-resolution phase space dynamics of energetic particles at low computational cost. The scale-separated HEL approach could provide additional insights of the wave instabilities and wave-particle nonlinear coherence compared to the conventional Vlasov and particle-in-cell methods.

我们提出了一种欧拉-拉格朗日混合(HEL)弗拉索夫方法,用于弱不均匀磁场中的非线性共振波粒相互作用。支配弗拉索夫方程是从最近提出的共振跟踪哈密顿理论中推导出来的。它给出了分布函数与尺度分离哈密顿的演化,后者包含快速变化的相干波粒相互作用和围绕共振参照系的慢速变化运动。混合方案在欧拉网格上以自适应时间步长求解快变相空间演化,然后用拉格朗日方法沿共振轨迹推进慢变动力学。我们将 HEL 方法用于研究磁层中啸模合声波的频率啁啾,自洽模拟再现了啁啾合声波,并以较低的计算成本给出了高能粒子的高分辨率相空间动力学。与传统的 Vlasov 方法和粒子入胞方法相比,尺度分离的 HEL 方法可以对波的不稳定性和波粒非线性相干性提供更多的见解。
{"title":"A hybrid Eulerian-Lagrangian Vlasov method for nonlinear wave-particle interaction in weakly inhomogeneous magnetic field","authors":"Jiangshan Zheng ,&nbsp;Ge Wang ,&nbsp;Bo Li","doi":"10.1016/j.cpc.2024.109362","DOIUrl":"10.1016/j.cpc.2024.109362","url":null,"abstract":"<div><p>We present a hybrid Eulerian-Lagrangian (HEL) Vlasov method for nonlinear resonant wave-particle interactions in weakly inhomogeneous magnetic field. The governing Vlasov equation is derived from a recently proposed resonance tracking Hamiltonian theory. It gives the evolution of the distribution function with a scale-separated Hamiltonian that contains the fast-varying coherent wave-particle interaction and slowly-varying motion about the resonance frame of reference. The hybrid scheme solves the fast-varying phase space evolution on Eulerian grid with an adaptive time step and then advances the slowly-varying dynamics by Lagrangian method along the resonance trajectory. We apply the HEL method to study the frequency chirping of whistler-mode chorus wave in the magnetosphere and the self-consistent simulations reproduce the chirping chorus wave and give high-resolution phase space dynamics of energetic particles at low computational cost. The scale-separated HEL approach could provide additional insights of the wave instabilities and wave-particle nonlinear coherence compared to the conventional Vlasov and particle-in-cell methods.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"306 ","pages":"Article 109362"},"PeriodicalIF":7.2,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142148820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling and geometrization in PGNAA 在 PGNAA 中建模和绘制几何图形
IF 7.2 2区 物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-02 DOI: 10.1016/j.cpc.2024.109360
Halisson Alberdan Cavalcanti Cardoso , Silvio de Barros Melo , Ilker Meric

Prompt Gamma Neutron Analysis Activation is a widely used technique for analyzing materials. This technique defines graphs (reference spectrum collection, or libraries) of spectral intensity as a function of energy (channels) for the elements inserted in a sample. The Monte Carlo Library Least Squares (MCLLS) is the dominant approach in the PGNAA technique. The main difficulties faced in the MCLLS domain are (1) numerical instabilities in the least-squares stage (Library Least Squares (LLS)); (2) overdetermination of the system of equations; (3) linear dependence in the libraries; (4) gamma radiation scattering; (5) high computational costs. The present work proposes optimizing the LLS module to face the abovementioned problems using the Greedy Randomized Adaptive Search Procedure (GRASP) and Continuous Greedy Randomized Adaptive Search Procedure (CGRASP) algorithms. The search for the spectral count peaks of the libraries leads to a partitioning of the data before applying the GRASP and CGRASP algorithms. The methodological procedures also address estimating the spectral counts of an unknown library possibly integrates the sample. The results show (1) efficient partitioning of the input data (2) evidence of suitable precision of the weight fractions of the libraries that make up the sample (average precision of the order of 3.16% against 8.8% of other methods); (3) success in the approximation and estimation of the unknown library (average precision of 4.25%) present in the sample. Our method proved to be promising in improving the determination of percentage count fractions by the least-squares module and showing the advantages of data partitioning.

即时伽马中子分析活化是一种广泛使用的材料分析技术。该技术定义了插入样品中的元素的光谱强度与能量(通道)的函数关系图(参考谱收集或库)。蒙特卡洛最小二乘法(MCLLS)是 PGNAA 技术的主要方法。MCLLS 领域面临的主要困难是:(1)最小二乘法阶段(库最小二乘法(LLS))的数值不稳定性;(2)方程组的过度确定;(3)库的线性依赖性;(4)伽马辐射散射;(5)计算成本高。针对上述问题,本研究提出利用贪婪随机自适应搜索程序(GRASP)和连续贪婪随机自适应搜索程序(CGRASP)算法对 LLS 模块进行优化。在应用 GRASP 和 CGRASP 算法之前,对库的光谱计数峰进行搜索,从而对数据进行分区。这些方法程序还可以估算可能整合样本的未知文库的光谱计数。结果表明:(1) 对输入数据进行了有效的分区;(2) 证明了组成样本的库的权重分数具有适当的精度(平均精度为 3.16%,而其他方法为 8.8%);(3) 成功逼近和估计了样本中的未知库(平均精度为 4.25%)。事实证明,我们的方法在改进用最小二乘模块确定百分比计数分数方面大有可为,并显示了数据分区的优势。
{"title":"Modeling and geometrization in PGNAA","authors":"Halisson Alberdan Cavalcanti Cardoso ,&nbsp;Silvio de Barros Melo ,&nbsp;Ilker Meric","doi":"10.1016/j.cpc.2024.109360","DOIUrl":"10.1016/j.cpc.2024.109360","url":null,"abstract":"<div><p>Prompt Gamma Neutron Analysis Activation is a widely used technique for analyzing materials. This technique defines graphs (reference spectrum collection, or libraries) of spectral intensity as a function of energy (channels) for the elements inserted in a sample. The Monte Carlo Library Least Squares (MCLLS) is the dominant approach in the PGNAA technique. The main difficulties faced in the MCLLS domain are (1) numerical instabilities in the least-squares stage (Library Least Squares (LLS)); (2) overdetermination of the system of equations; (3) linear dependence in the libraries; (4) gamma radiation scattering; (5) high computational costs. The present work proposes optimizing the LLS module to face the abovementioned problems using the Greedy Randomized Adaptive Search Procedure (GRASP) and Continuous Greedy Randomized Adaptive Search Procedure (CGRASP) algorithms. The search for the spectral count peaks of the libraries leads to a partitioning of the data before applying the GRASP and CGRASP algorithms. The methodological procedures also address estimating the spectral counts of an unknown library possibly integrates the sample. The results show (1) efficient partitioning of the input data (2) evidence of suitable precision of the weight fractions of the libraries that make up the sample (average precision of the order of 3.16% against 8.8% of other methods); (3) success in the approximation and estimation of the unknown library (average precision of 4.25%) present in the sample. Our method proved to be promising in improving the determination of percentage count fractions by the least-squares module and showing the advantages of data partitioning.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"306 ","pages":"Article 109360"},"PeriodicalIF":7.2,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142135953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Python tool for parameter estimation of “black box” macro- and micro-kinetic models with Bayesian optimization – petBOA 利用贝叶斯优化对 "黑箱 "宏观和微观动力学模型进行参数估计的 Python 工具 - petBOA
IF 7.2 2区 物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-31 DOI: 10.1016/j.cpc.2024.109358
Sashank Kasiraju , Yifan Wang , Saurabh Bhandari , Aayush R. Singh , Dionisios G. Vlachos

We develop an open-source Python-based Parameter Estimation Tool utilizing Bayesian Optimization (petBOA) with a unique wrapper interface for gradient-free parameter estimation of expensive black-box kinetic models. We provide examples for Python macrokinetic and microkinetic modeling (MKM) tools, such as Cantera and OpenMKM. petBOA leverages surrogate Gaussian processes to approximate and minimize the objective function designed for parameter estimation. Bayesian Optimization (BO) is implemented using the open-source BoTorch toolkit. petBOA employs local and global sensitivity analyses to identify important parameters optimized against experimental data, and leverages pMuTT for consistent kinetic and thermodynamic parameters while perturbing species binding energies within the typical error of conventional DFT exchange-correlation functionals (20-30 kJ/mol). The source code and documentation are hosted on GitHub.

Program summary

Program title: petBOA

Developer's repository link: https://github.com/VlachosGroup/petBOA

Licensing provisions: MIT license

Programming language: Python

External routines: NEXTorch, PyTorch, GPyTorch, BoTorch, Matplotlib, PyDOE2, NumPy, SciPy, pandas, pMuTT, SALib, docker.

Nature of the problem: An open-source, gradient-free parameter estimation of black-box microkinetic modeling tools, such as OpenMKM is lacking.

Solution method: petBOA is a Python-based tool that utilizes Bayesian Optimization and offers a unique wrapper interface for expensive black-box kinetic models. It leverages the pMuTT library for consistent kinetic and thermodynamic parameter estimation and employs both local and global sensitivity analyses to identify crucial parameters.

我们开发了一种基于 Python 的开源参数估计工具--贝叶斯优化(petBOA),它具有独特的包装界面,可对昂贵的黑盒动力学模型进行无梯度参数估计。我们提供了用于 Python 宏动力学和微动力学建模(MKM)工具(如 Cantera 和 OpenMKM)的示例。petBOA 利用代理高斯过程来近似并最小化为参数估计设计的目标函数。petBOA 采用局部和全局敏感性分析来确定根据实验数据优化的重要参数,并利用 pMuTT 获得一致的动力学和热力学参数,同时将物种结合能的扰动控制在传统 DFT 交换相关函数的典型误差(20-30 kJ/mol)范围内。源代码和文档托管在 GitHub 上。程序摘要程序标题:petBOAD开发者资源库链接:https://github.com/VlachosGroup/petBOALicensing provisions:MIT 许可证编程语言:Python外部例程:NEXTorch、PyTorch、GPyTorch、BoTorch、Matplotlib、PyDOE2、NumPy、SciPy、pandas、pMuTT、SALib、docker.问题性质:缺乏开源、无梯度参数估计的黑盒微动力学建模工具,如 OpenMKM。解决方法:petBOA 是一款基于 Python 的工具,它利用贝叶斯优化(Bayesian Optimization)技术,为昂贵的黑盒微观动力学模型提供了一个独特的封装接口。它利用 pMuTT 库进行一致的动力学和热力学参数估计,并采用局部和全局敏感性分析来确定关键参数。
{"title":"A Python tool for parameter estimation of “black box” macro- and micro-kinetic models with Bayesian optimization – petBOA","authors":"Sashank Kasiraju ,&nbsp;Yifan Wang ,&nbsp;Saurabh Bhandari ,&nbsp;Aayush R. Singh ,&nbsp;Dionisios G. Vlachos","doi":"10.1016/j.cpc.2024.109358","DOIUrl":"10.1016/j.cpc.2024.109358","url":null,"abstract":"<div><p>We develop an open-source Python-based Parameter Estimation Tool utilizing Bayesian Optimization (petBOA) with a unique wrapper interface for gradient-free parameter estimation of expensive black-box kinetic models. We provide examples for Python macrokinetic and microkinetic modeling (MKM) tools, such as Cantera and OpenMKM. petBOA leverages surrogate Gaussian processes to approximate and minimize the objective function designed for parameter estimation. Bayesian Optimization (BO) is implemented using the open-source BoTorch toolkit. petBOA employs local and global sensitivity analyses to identify important parameters optimized against experimental data, and leverages pMuTT for consistent kinetic and thermodynamic parameters while perturbing species binding energies within the typical error of conventional DFT exchange-correlation functionals (20-30 kJ/mol). The source code and documentation are hosted on GitHub.</p></div><div><h3>Program summary</h3><p><em>Program title</em>: petBOA</p><p><em>Developer's repository link</em>: <span><span>https://github.com/VlachosGroup/petBOA</span><svg><path></path></svg></span></p><p><em>Licensing provisions</em>: MIT license</p><p><em>Programming language</em>: Python</p><p><em>External routines</em>: NEXTorch, PyTorch, GPyTorch, BoTorch, Matplotlib, PyDOE2, NumPy, SciPy, pandas, pMuTT, SALib, docker.</p><p><em>Nature of the problem</em>: An open-source, gradient-free parameter estimation of black-box microkinetic modeling tools, such as OpenMKM is lacking.</p><p><em>Solution method</em>: petBOA is a Python-based tool that utilizes Bayesian Optimization and offers a unique wrapper interface for expensive black-box kinetic models. It leverages the pMuTT library for consistent kinetic and thermodynamic parameter estimation and employs both local and global sensitivity analyses to identify crucial parameters.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"306 ","pages":"Article 109358"},"PeriodicalIF":7.2,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142163978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CoupledElectricMagneticDipoles.jl - Julia modules for coupled electric and magnetic dipoles method for light scattering, and optical forces in three dimensions CoupledElectricMagneticDipoles.jl - 用于光散射和三维光学力的耦合电偶极子和磁偶极子方法的 Julia 模块
IF 7.2 2区 物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-31 DOI: 10.1016/j.cpc.2024.109361
Augustin Muster, Diego R. Abujetas, Frank Scheffold, Luis S. Froufe-Pérez

CoupledElectricMagneticDipoles.jl is a set of modules implemented in the Julia language. Several modules are provided to solve typical problems encountered in nano-optics and nano-photonics including light emission by point sources in complex environments, electromagnetic wave scattering by single objects with complex geometry or collections of them. Optical forces can also be computed with this software package.

Two closely related computational methods are implemented in this library, the discrete dipole approach (DDA) and the coupled electric and magnetic dipoles (CEMD) method.

CoupledElectricMagneticDipoles.jl 是一套用 Julia 语言实现的模块。它提供了多个模块来解决纳米光学和纳米光子学中遇到的典型问题,包括复杂环境中点光源的光发射、具有复杂几何形状的单个物体或物体集合的电磁波散射。本软件包还可以计算光学力。本库中实现了两种密切相关的计算方法,即离散偶极子方法(DDA)和电偶极子与磁偶极子耦合方法(CEMD)。
{"title":"CoupledElectricMagneticDipoles.jl - Julia modules for coupled electric and magnetic dipoles method for light scattering, and optical forces in three dimensions","authors":"Augustin Muster,&nbsp;Diego R. Abujetas,&nbsp;Frank Scheffold,&nbsp;Luis S. Froufe-Pérez","doi":"10.1016/j.cpc.2024.109361","DOIUrl":"10.1016/j.cpc.2024.109361","url":null,"abstract":"<div><p>CoupledElectricMagneticDipoles.jl is a set of modules implemented in the Julia language. Several modules are provided to solve typical problems encountered in nano-optics and nano-photonics including light emission by point sources in complex environments, electromagnetic wave scattering by single objects with complex geometry or collections of them. Optical forces can also be computed with this software package.</p><p>Two closely related computational methods are implemented in this library, the discrete dipole approach (DDA) and the coupled electric and magnetic dipoles (CEMD) method.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"306 ","pages":"Article 109361"},"PeriodicalIF":7.2,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0010465524002844/pdfft?md5=f3f950fa5a78f74712f99c9054fb7264&pid=1-s2.0-S0010465524002844-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142243051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computer Physics Communications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1