{"title":"基于Hankel矩阵的大规模流动数据精确并行动态模态分解","authors":"Hiroyuki Asada, Soshi Kawai","doi":"10.1007/s00162-024-00730-0","DOIUrl":null,"url":null,"abstract":"<p>An exact parallel algorithm of dynamic mode decomposition (DMD) with Hankel matrices for large-scale flow data is proposed. The proposed algorithm enables the DMD and the Hankel DMD for large-scale data obtained by high-fidelity flow simulations, such as large-eddy simulations or direct numerical simulations using more than a billion grid points, on parallel computations without any approximations. The proposed algorithm completes the computations of the DMD by utilizing block matrices of <span>\\(X^TX\\in \\mathbb {R}^{k\\times k}\\)</span> (where <span>\\(X\\in \\mathbb {R}^{n\\times k}\\)</span> is a large data matrix obtained by high-fidelity simulations, the number of snapshot data is <span>\\(n > rsim 10^9\\)</span>, and the number of snapshots is <span>\\(k\\lesssim O(10^3)\\)</span>) without any approximations: for example, the singular value decomposition of <i>X</i> is replaced by the eigenvalue decomposition of <span>\\(X^TX\\)</span>. Then, the computation of <span>\\(X^TX\\)</span> is parallelized by utilizing the domain decomposition often used in flow simulations, which reduces the memory consumption for each parallel process and wall-clock time in the DMD by a factor approximately equal to the number of parallel processes. The parallel computation with communication is performed only for <span>\\(X^TX\\)</span>, allowing for high parallel efficiency under massively parallel computations. Furthermore, the proposed exact parallel algorithm is extended to the Hankel DMD without any additional parallel computations, realizing the Hankel DMD of large-scale data collected by over a billion grid points with comparable cost and memory to the DMD without Hankel matrices. Moreover, this study shows that the Hankel DMD, which has been employed to enrich information and augment rank, is advantageous for large-scale high-dimensional data collected by high-fidelity simulations in data reconstruction and predictions of future states (while prior studies have reported such advantages for low-dimensional data). Several numerical experiments using large-scale data, including laminar and turbulent flows around a cylinder and transonic buffeting flow around a full aircraft configuration, demonstrate that (i) the proposed exact parallel algorithm reproduces the existing non-parallelized Hankel DMD, (ii) the Hankel DMD for large-scale data consisting of over a billion grid points is feasible by using the proposed exact parallel algorithm with high parallel efficiency on more than 6 thousand CPU cores, and (iii) the Hankel DMD has advantages for high-dimensional data such as <span>\\(n > rsim 10^9\\)</span>.</p>","PeriodicalId":795,"journal":{"name":"Theoretical and Computational Fluid Dynamics","volume":"39 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s00162-024-00730-0.pdf","citationCount":"0","resultStr":"{\"title\":\"Exact parallelized dynamic mode decomposition with Hankel matrix for large-scale flow data\",\"authors\":\"Hiroyuki Asada, Soshi Kawai\",\"doi\":\"10.1007/s00162-024-00730-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>An exact parallel algorithm of dynamic mode decomposition (DMD) with Hankel matrices for large-scale flow data is proposed. The proposed algorithm enables the DMD and the Hankel DMD for large-scale data obtained by high-fidelity flow simulations, such as large-eddy simulations or direct numerical simulations using more than a billion grid points, on parallel computations without any approximations. The proposed algorithm completes the computations of the DMD by utilizing block matrices of <span>\\\\(X^TX\\\\in \\\\mathbb {R}^{k\\\\times k}\\\\)</span> (where <span>\\\\(X\\\\in \\\\mathbb {R}^{n\\\\times k}\\\\)</span> is a large data matrix obtained by high-fidelity simulations, the number of snapshot data is <span>\\\\(n > rsim 10^9\\\\)</span>, and the number of snapshots is <span>\\\\(k\\\\lesssim O(10^3)\\\\)</span>) without any approximations: for example, the singular value decomposition of <i>X</i> is replaced by the eigenvalue decomposition of <span>\\\\(X^TX\\\\)</span>. Then, the computation of <span>\\\\(X^TX\\\\)</span> is parallelized by utilizing the domain decomposition often used in flow simulations, which reduces the memory consumption for each parallel process and wall-clock time in the DMD by a factor approximately equal to the number of parallel processes. The parallel computation with communication is performed only for <span>\\\\(X^TX\\\\)</span>, allowing for high parallel efficiency under massively parallel computations. Furthermore, the proposed exact parallel algorithm is extended to the Hankel DMD without any additional parallel computations, realizing the Hankel DMD of large-scale data collected by over a billion grid points with comparable cost and memory to the DMD without Hankel matrices. Moreover, this study shows that the Hankel DMD, which has been employed to enrich information and augment rank, is advantageous for large-scale high-dimensional data collected by high-fidelity simulations in data reconstruction and predictions of future states (while prior studies have reported such advantages for low-dimensional data). Several numerical experiments using large-scale data, including laminar and turbulent flows around a cylinder and transonic buffeting flow around a full aircraft configuration, demonstrate that (i) the proposed exact parallel algorithm reproduces the existing non-parallelized Hankel DMD, (ii) the Hankel DMD for large-scale data consisting of over a billion grid points is feasible by using the proposed exact parallel algorithm with high parallel efficiency on more than 6 thousand CPU cores, and (iii) the Hankel DMD has advantages for high-dimensional data such as <span>\\\\(n > rsim 10^9\\\\)</span>.</p>\",\"PeriodicalId\":795,\"journal\":{\"name\":\"Theoretical and Computational Fluid Dynamics\",\"volume\":\"39 1\",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s00162-024-00730-0.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Theoretical and Computational Fluid Dynamics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s00162-024-00730-0\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MECHANICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical and Computational Fluid Dynamics","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s00162-024-00730-0","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MECHANICS","Score":null,"Total":0}
Exact parallelized dynamic mode decomposition with Hankel matrix for large-scale flow data
An exact parallel algorithm of dynamic mode decomposition (DMD) with Hankel matrices for large-scale flow data is proposed. The proposed algorithm enables the DMD and the Hankel DMD for large-scale data obtained by high-fidelity flow simulations, such as large-eddy simulations or direct numerical simulations using more than a billion grid points, on parallel computations without any approximations. The proposed algorithm completes the computations of the DMD by utilizing block matrices of \(X^TX\in \mathbb {R}^{k\times k}\) (where \(X\in \mathbb {R}^{n\times k}\) is a large data matrix obtained by high-fidelity simulations, the number of snapshot data is \(n > rsim 10^9\), and the number of snapshots is \(k\lesssim O(10^3)\)) without any approximations: for example, the singular value decomposition of X is replaced by the eigenvalue decomposition of \(X^TX\). Then, the computation of \(X^TX\) is parallelized by utilizing the domain decomposition often used in flow simulations, which reduces the memory consumption for each parallel process and wall-clock time in the DMD by a factor approximately equal to the number of parallel processes. The parallel computation with communication is performed only for \(X^TX\), allowing for high parallel efficiency under massively parallel computations. Furthermore, the proposed exact parallel algorithm is extended to the Hankel DMD without any additional parallel computations, realizing the Hankel DMD of large-scale data collected by over a billion grid points with comparable cost and memory to the DMD without Hankel matrices. Moreover, this study shows that the Hankel DMD, which has been employed to enrich information and augment rank, is advantageous for large-scale high-dimensional data collected by high-fidelity simulations in data reconstruction and predictions of future states (while prior studies have reported such advantages for low-dimensional data). Several numerical experiments using large-scale data, including laminar and turbulent flows around a cylinder and transonic buffeting flow around a full aircraft configuration, demonstrate that (i) the proposed exact parallel algorithm reproduces the existing non-parallelized Hankel DMD, (ii) the Hankel DMD for large-scale data consisting of over a billion grid points is feasible by using the proposed exact parallel algorithm with high parallel efficiency on more than 6 thousand CPU cores, and (iii) the Hankel DMD has advantages for high-dimensional data such as \(n > rsim 10^9\).
期刊介绍:
Theoretical and Computational Fluid Dynamics provides a forum for the cross fertilization of ideas, tools and techniques across all disciplines in which fluid flow plays a role. The focus is on aspects of fluid dynamics where theory and computation are used to provide insights and data upon which solid physical understanding is revealed. We seek research papers, invited review articles, brief communications, letters and comments addressing flow phenomena of relevance to aeronautical, geophysical, environmental, material, mechanical and life sciences. Papers of a purely algorithmic, experimental or engineering application nature, and papers without significant new physical insights, are outside the scope of this journal. For computational work, authors are responsible for ensuring that any artifacts of discretization and/or implementation are sufficiently controlled such that the numerical results unambiguously support the conclusions drawn. Where appropriate, and to the extent possible, such papers should either include or reference supporting documentation in the form of verification and validation studies.