A simplified fast multipole method based on strong recursive skeletonization

IF 3.8 2区 物理与天体物理 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Journal of Computational Physics Pub Date : 2025-03-01 Epub Date: 2024-12-30 DOI:10.1016/j.jcp.2024.113707
Anna Yesypenko , Chao Chen , Per-Gunnar Martinsson
{"title":"A simplified fast multipole method based on strong recursive skeletonization","authors":"Anna Yesypenko ,&nbsp;Chao Chen ,&nbsp;Per-Gunnar Martinsson","doi":"10.1016/j.jcp.2024.113707","DOIUrl":null,"url":null,"abstract":"<div><div>This work introduces a kernel-independent, multilevel, adaptive algorithm for efficiently evaluating a discrete convolution kernel with a given source distribution. The method is based on linear algebraic tools such as low rank approximation and “skeleton representations” to approximate far-field interactions. While this work is related to previous linear algebraic formulations of the fast multipole method, the proposed algorithm is distinguished by relying on simpler data structures.</div><div>The proposed algorithm eliminates the need for explicit interaction lists by restructuring computations to operate exclusively on the near-neighbor list at each level of the tree, thereby simplifying both implementation and data structures. This work also introduces novel translation operators that significantly simplify the handling of adaptive point distributions. As a kernel-independent approach, it only requires evaluation of the kernel function, making it easily adaptable to a variety of kernels. By using operations on the neighbor list (of size at most 27 in 3D) rather than the interaction list (of size up to 189 in 3D), the algorithm is particularly well-suited for parallel implementation on modern hardware.</div><div>Numerical experiments on uniform and non-uniform point distributions in 2D and 3D demonstrate the effectiveness of the proposed parallel algorithm for Laplace and (low-frequency) Helmholtz kernels. The algorithm constructs a tailored skeleton representation for the given geometry during a precomputation stage. After precomputation, the fast summation achieves high efficiency on the GPU using batched linear algebra operations.</div></div>","PeriodicalId":352,"journal":{"name":"Journal of Computational Physics","volume":"524 ","pages":"Article 113707"},"PeriodicalIF":3.8000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Physics","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0021999124009550","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/30 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

This work introduces a kernel-independent, multilevel, adaptive algorithm for efficiently evaluating a discrete convolution kernel with a given source distribution. The method is based on linear algebraic tools such as low rank approximation and “skeleton representations” to approximate far-field interactions. While this work is related to previous linear algebraic formulations of the fast multipole method, the proposed algorithm is distinguished by relying on simpler data structures.
The proposed algorithm eliminates the need for explicit interaction lists by restructuring computations to operate exclusively on the near-neighbor list at each level of the tree, thereby simplifying both implementation and data structures. This work also introduces novel translation operators that significantly simplify the handling of adaptive point distributions. As a kernel-independent approach, it only requires evaluation of the kernel function, making it easily adaptable to a variety of kernels. By using operations on the neighbor list (of size at most 27 in 3D) rather than the interaction list (of size up to 189 in 3D), the algorithm is particularly well-suited for parallel implementation on modern hardware.
Numerical experiments on uniform and non-uniform point distributions in 2D and 3D demonstrate the effectiveness of the proposed parallel algorithm for Laplace and (low-frequency) Helmholtz kernels. The algorithm constructs a tailored skeleton representation for the given geometry during a precomputation stage. After precomputation, the fast summation achieves high efficiency on the GPU using batched linear algebra operations.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于强递归骨架化的简化快速多极法
这项工作介绍了一种核无关的,多级别的,自适应算法,用于有效地评估具有给定源分布的离散卷积核。该方法基于线性代数工具,如低秩近似和“骨架表示”来近似远场相互作用。虽然这项工作与先前的快速多极方法的线性代数公式有关,但所提出的算法的特点是依赖于更简单的数据结构。该算法通过重构计算,在树的每一层只对近邻表进行操作,从而消除了对显式交互列表的需要,从而简化了实现和数据结构。这项工作还引入了新的平移算子,大大简化了自适应点分布的处理。作为一种与内核无关的方法,它只需要对内核函数进行评估,这使得它很容易适应各种内核。通过使用邻居列表(在3D中大小最多为27)而不是交互列表(在3D中大小最多为189)上的操作,该算法特别适合在现代硬件上并行实现。二维和三维均匀和非均匀点分布的数值实验证明了所提出的拉普拉斯核和(低频)亥姆霍兹核并行算法的有效性。该算法在预计算阶段为给定几何构造一个定制的骨架表示。预计算后,快速求和通过批处理线性代数运算在GPU上实现了高效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Computational Physics
Journal of Computational Physics 物理-计算机:跨学科应用
CiteScore
7.60
自引率
14.60%
发文量
763
审稿时长
5.8 months
期刊介绍: Journal of Computational Physics thoroughly treats the computational aspects of physical problems, presenting techniques for the numerical solution of mathematical equations arising in all areas of physics. The journal seeks to emphasize methods that cross disciplinary boundaries. The Journal of Computational Physics also publishes short notes of 4 pages or less (including figures, tables, and references but excluding title pages). Letters to the Editor commenting on articles already published in this Journal will also be considered. Neither notes nor letters should have an abstract.
期刊最新文献
Energy-Stable, L2-Stable, and weakly H1-Stable BDF2 viscosity splitting method for the penalized incompressible navier–Stokes equations Zeta expansion for long-range interactions under periodic boundary conditions with applications to micromagnetics Numerical boundary flux functions that give provable bounds for nonlinear initial boundary value problems with open boundaries A second-order cell-centered Lagrangian scheme for 2D ideal magnetohydrodynamics on unstructured meshes A structure-preserving semi-implicit four-split scheme for continuum mechanics
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1