Anna Yesypenko , Chao Chen , Per-Gunnar Martinsson
{"title":"A simplified fast multipole method based on strong recursive skeletonization","authors":"Anna Yesypenko , Chao Chen , Per-Gunnar Martinsson","doi":"10.1016/j.jcp.2024.113707","DOIUrl":null,"url":null,"abstract":"<div><div>This work introduces a kernel-independent, multilevel, adaptive algorithm for efficiently evaluating a discrete convolution kernel with a given source distribution. The method is based on linear algebraic tools such as low rank approximation and “skeleton representations” to approximate far-field interactions. While this work is related to previous linear algebraic formulations of the fast multipole method, the proposed algorithm is distinguished by relying on simpler data structures.</div><div>The proposed algorithm eliminates the need for explicit interaction lists by restructuring computations to operate exclusively on the near-neighbor list at each level of the tree, thereby simplifying both implementation and data structures. This work also introduces novel translation operators that significantly simplify the handling of adaptive point distributions. As a kernel-independent approach, it only requires evaluation of the kernel function, making it easily adaptable to a variety of kernels. By using operations on the neighbor list (of size at most 27 in 3D) rather than the interaction list (of size up to 189 in 3D), the algorithm is particularly well-suited for parallel implementation on modern hardware.</div><div>Numerical experiments on uniform and non-uniform point distributions in 2D and 3D demonstrate the effectiveness of the proposed parallel algorithm for Laplace and (low-frequency) Helmholtz kernels. The algorithm constructs a tailored skeleton representation for the given geometry during a precomputation stage. After precomputation, the fast summation achieves high efficiency on the GPU using batched linear algebra operations.</div></div>","PeriodicalId":352,"journal":{"name":"Journal of Computational Physics","volume":"524 ","pages":"Article 113707"},"PeriodicalIF":3.8000,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Physics","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0021999124009550","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
This work introduces a kernel-independent, multilevel, adaptive algorithm for efficiently evaluating a discrete convolution kernel with a given source distribution. The method is based on linear algebraic tools such as low rank approximation and “skeleton representations” to approximate far-field interactions. While this work is related to previous linear algebraic formulations of the fast multipole method, the proposed algorithm is distinguished by relying on simpler data structures.
The proposed algorithm eliminates the need for explicit interaction lists by restructuring computations to operate exclusively on the near-neighbor list at each level of the tree, thereby simplifying both implementation and data structures. This work also introduces novel translation operators that significantly simplify the handling of adaptive point distributions. As a kernel-independent approach, it only requires evaluation of the kernel function, making it easily adaptable to a variety of kernels. By using operations on the neighbor list (of size at most 27 in 3D) rather than the interaction list (of size up to 189 in 3D), the algorithm is particularly well-suited for parallel implementation on modern hardware.
Numerical experiments on uniform and non-uniform point distributions in 2D and 3D demonstrate the effectiveness of the proposed parallel algorithm for Laplace and (low-frequency) Helmholtz kernels. The algorithm constructs a tailored skeleton representation for the given geometry during a precomputation stage. After precomputation, the fast summation achieves high efficiency on the GPU using batched linear algebra operations.
期刊介绍:
Journal of Computational Physics thoroughly treats the computational aspects of physical problems, presenting techniques for the numerical solution of mathematical equations arising in all areas of physics. The journal seeks to emphasize methods that cross disciplinary boundaries.
The Journal of Computational Physics also publishes short notes of 4 pages or less (including figures, tables, and references but excluding title pages). Letters to the Editor commenting on articles already published in this Journal will also be considered. Neither notes nor letters should have an abstract.