基于Hankel矩阵的大规模流动数据精确并行动态模态分解

IF 2.2 3区工程技术 Q2 MECHANICS Theoretical and Computational Fluid Dynamics Pub Date : 2024-12-07 DOI:10.1007/s00162-024-00730-0

Hiroyuki Asada, Soshi Kawai

{"title":"基于Hankel矩阵的大规模流动数据精确并行动态模态分解","authors":"Hiroyuki Asada, Soshi Kawai","doi":"10.1007/s00162-024-00730-0","DOIUrl":null,"url":null,"abstract":"An exact parallel algorithm of dynamic mode decomposition (DMD) with Hankel matrices for large-scale flow data is proposed. The proposed algorithm enables the DMD and the Hankel DMD for large-scale data obtained by high-fidelity flow simulations, such as large-eddy simulations or direct numerical simulations using more than a billion grid points, on parallel computations without any approximations. The proposed algorithm completes the computations of the DMD by utilizing block matrices of \\(X^TX\\in \\mathbb {R}^{k\\times k}\\) (where \\(X\\in \\mathbb {R}^{n\\times k}\\) is a large data matrix obtained by high-fidelity simulations, the number of snapshot data is \\(n > rsim 10^9\\), and the number of snapshots is \\(k\\lesssim O(10^3)\\)) without any approximations: for example, the singular value decomposition of X is replaced by the eigenvalue decomposition of \\(X^TX\\). Then, the computation of \\(X^TX\\) is parallelized by utilizing the domain decomposition often used in flow simulations, which reduces the memory consumption for each parallel process and wall-clock time in the DMD by a factor approximately equal to the number of parallel processes. The parallel computation with communication is performed only for \\(X^TX\\), allowing for high parallel efficiency under massively parallel computations. Furthermore, the proposed exact parallel algorithm is extended to the Hankel DMD without any additional parallel computations, realizing the Hankel DMD of large-scale data collected by over a billion grid points with comparable cost and memory to the DMD without Hankel matrices. Moreover, this study shows that the Hankel DMD, which has been employed to enrich information and augment rank, is advantageous for large-scale high-dimensional data collected by high-fidelity simulations in data reconstruction and predictions of future states (while prior studies have reported such advantages for low-dimensional data). Several numerical experiments using large-scale data, including laminar and turbulent flows around a cylinder and transonic buffeting flow around a full aircraft configuration, demonstrate that (i) the proposed exact parallel algorithm reproduces the existing non-parallelized Hankel DMD, (ii) the Hankel DMD for large-scale data consisting of over a billion grid points is feasible by using the proposed exact parallel algorithm with high parallel efficiency on more than 6 thousand CPU cores, and (iii) the Hankel DMD has advantages for high-dimensional data such as \\(n > rsim 10^9\\).","PeriodicalId":795,"journal":{"name":"Theoretical and Computational Fluid Dynamics","volume":"39 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s00162-024-00730-0.pdf","citationCount":"0","resultStr":"{\"title\":\"Exact parallelized dynamic mode decomposition with Hankel matrix for large-scale flow data\",\"authors\":\"Hiroyuki Asada, Soshi Kawai\",\"doi\":\"10.1007/s00162-024-00730-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An exact parallel algorithm of dynamic mode decomposition (DMD) with Hankel matrices for large-scale flow data is proposed. The proposed algorithm enables the DMD and the Hankel DMD for large-scale data obtained by high-fidelity flow simulations, such as large-eddy simulations or direct numerical simulations using more than a billion grid points, on parallel computations without any approximations. The proposed algorithm completes the computations of the DMD by utilizing block matrices of \\\\(X^TX\\\\in \\\\mathbb {R}^{k\\\\times k}\\\\) (where \\\\(X\\\\in \\\\mathbb {R}^{n\\\\times k}\\\\) is a large data matrix obtained by high-fidelity simulations, the number of snapshot data is \\\\(n > rsim 10^9\\\\), and the number of snapshots is \\\\(k\\\\lesssim O(10^3)\\\\)) without any approximations: for example, the singular value decomposition of X is replaced by the eigenvalue decomposition of \\\\(X^TX\\\\). Then, the computation of \\\\(X^TX\\\\) is parallelized by utilizing the domain decomposition often used in flow simulations, which reduces the memory consumption for each parallel process and wall-clock time in the DMD by a factor approximately equal to the number of parallel processes. The parallel computation with communication is performed only for \\\\(X^TX\\\\), allowing for high parallel efficiency under massively parallel computations. Furthermore, the proposed exact parallel algorithm is extended to the Hankel DMD without any additional parallel computations, realizing the Hankel DMD of large-scale data collected by over a billion grid points with comparable cost and memory to the DMD without Hankel matrices. Moreover, this study shows that the Hankel DMD, which has been employed to enrich information and augment rank, is advantageous for large-scale high-dimensional data collected by high-fidelity simulations in data reconstruction and predictions of future states (while prior studies have reported such advantages for low-dimensional data). Several numerical experiments using large-scale data, including laminar and turbulent flows around a cylinder and transonic buffeting flow around a full aircraft configuration, demonstrate that (i) the proposed exact parallel algorithm reproduces the existing non-parallelized Hankel DMD, (ii) the Hankel DMD for large-scale data consisting of over a billion grid points is feasible by using the proposed exact parallel algorithm with high parallel efficiency on more than 6 thousand CPU cores, and (iii) the Hankel DMD has advantages for high-dimensional data such as \\\\(n > rsim 10^9\\\\).\",\"PeriodicalId\":795,\"journal\":{\"name\":\"Theoretical and Computational Fluid Dynamics\",\"volume\":\"39 1\",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s00162-024-00730-0.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Theoretical and Computational Fluid Dynamics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s00162-024-00730-0\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MECHANICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical and Computational Fluid Dynamics","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s00162-024-00730-0","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MECHANICS","Score":null,"Total":0}

引用次数: 0

摘要

提出了一种基于Hankel矩阵的大规模流动数据动态模态分解（DMD）精确并行算法。提出的算法使DMD和Hankel DMD能够在并行计算中无需任何近似的情况下，通过高保真流动模拟获得大规模数据，例如大涡模拟或使用超过10亿个网格点的直接数值模拟。该算法利用\(X^TX\in \mathbb {R}^{k\times k}\)的块矩阵（其中\(X\in \mathbb {R}^{n\times k}\)为高保真仿真得到的大数据矩阵，快照数据个数为\(n > rsim 10^9\)，快照个数为\(k\lesssim O(10^3)\)），不做任何近似，完成DMD的计算，例如将X的奇异值分解替换为\(X^TX\)的特征值分解。然后，利用流模拟中常用的域分解对\(X^TX\)的计算进行并行化，将每个并行进程的内存消耗和DMD中的挂钟时间减少了大约等于并行进程数量的因子。具有通信的并行计算仅在\(X^TX\)上执行，允许在大规模并行计算下的高并行效率。此外，将该精确并行算法扩展到Hankel DMD中，无需额外的并行计算，实现了超过10亿个网格点的大规模数据的Hankel DMD，其成本和内存与不使用Hankel矩阵的DMD相当。此外，本研究表明，用于丰富信息和增强秩的Hankel DMD对于高保真仿真收集的大规模高维数据在数据重建和未来状态预测方面具有优势（而先前的研究已经报道了对低维数据的优势）。几个大规模数据的数值实验，包括圆柱周围的层流和湍流以及全飞机结构周围的跨音速抖振流，表明：(i)所提出的精确并行算法再现了现有的非并行化Hankel DMD；（ii）使用所提出的精确并行算法在超过6000个CPU内核上具有很高的并行效率，可以实现由超过10亿个网格点组成的大规模数据的Hankel DMD。（iii）汉高DMD对于高维数据（如\(n > rsim 10^9\)）具有优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Exact parallelized dynamic mode decomposition with Hankel matrix for large-scale flow data

An exact parallel algorithm of dynamic mode decomposition (DMD) with Hankel matrices for large-scale flow data is proposed. The proposed algorithm enables the DMD and the Hankel DMD for large-scale data obtained by high-fidelity flow simulations, such as large-eddy simulations or direct numerical simulations using more than a billion grid points, on parallel computations without any approximations. The proposed algorithm completes the computations of the DMD by utilizing block matrices of \(X^TX\in \mathbb {R}^{k\times k}\) (where \(X\in \mathbb {R}^{n\times k}\) is a large data matrix obtained by high-fidelity simulations, the number of snapshot data is \(n > rsim 10^9\), and the number of snapshots is \(k\lesssim O(10^3)\)) without any approximations: for example, the singular value decomposition of X is replaced by the eigenvalue decomposition of \(X^TX\). Then, the computation of \(X^TX\) is parallelized by utilizing the domain decomposition often used in flow simulations, which reduces the memory consumption for each parallel process and wall-clock time in the DMD by a factor approximately equal to the number of parallel processes. The parallel computation with communication is performed only for \(X^TX\), allowing for high parallel efficiency under massively parallel computations. Furthermore, the proposed exact parallel algorithm is extended to the Hankel DMD without any additional parallel computations, realizing the Hankel DMD of large-scale data collected by over a billion grid points with comparable cost and memory to the DMD without Hankel matrices. Moreover, this study shows that the Hankel DMD, which has been employed to enrich information and augment rank, is advantageous for large-scale high-dimensional data collected by high-fidelity simulations in data reconstruction and predictions of future states (while prior studies have reported such advantages for low-dimensional data). Several numerical experiments using large-scale data, including laminar and turbulent flows around a cylinder and transonic buffeting flow around a full aircraft configuration, demonstrate that (i) the proposed exact parallel algorithm reproduces the existing non-parallelized Hankel DMD, (ii) the Hankel DMD for large-scale data consisting of over a billion grid points is feasible by using the proposed exact parallel algorithm with high parallel efficiency on more than 6 thousand CPU cores, and (iii) the Hankel DMD has advantages for high-dimensional data such as \(n > rsim 10^9\).

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Theoretical and Computational Fluid Dynamics 物理-力学

CiteScore

5.80

自引率

2.90%

发文量

审稿时长

>12 weeks

期刊介绍： Theoretical and Computational Fluid Dynamics provides a forum for the cross fertilization of ideas, tools and techniques across all disciplines in which fluid flow plays a role. The focus is on aspects of fluid dynamics where theory and computation are used to provide insights and data upon which solid physical understanding is revealed. We seek research papers, invited review articles, brief communications, letters and comments addressing flow phenomena of relevance to aeronautical, geophysical, environmental, material, mechanical and life sciences. Papers of a purely algorithmic, experimental or engineering application nature, and papers without significant new physical insights, are outside the scope of this journal. For computational work, authors are responsible for ensuring that any artifacts of discretization and/or implementation are sufficiently controlled such that the numerical results unambiguously support the conclusions drawn. Where appropriate, and to the extent possible, such papers should either include or reference supporting documentation in the form of verification and validation studies.

期刊最新文献

Experimentally informed, linear mean-field modelling of circular cylinder aeroacoustics Porous plates at incidence Performance investigations of the two-phase mixer for liquid metal magnetohydrodynamic generator Active learning of data-assimilation closures using graph neural networks Simple shape model for normal shock trains in straight channels