Yi Ju, Adalberto Perez, Stefano Markidis, Philipp Schlatter, Erwin Laure
{"title":"了解同步、异步和混合原位技术在计算流体动力学应用中的影响","authors":"Yi Ju, Adalberto Perez, Stefano Markidis, Philipp Schlatter, Erwin Laure","doi":"arxiv-2407.20717","DOIUrl":null,"url":null,"abstract":"High-Performance Computing (HPC) systems provide input/output (IO)\nperformance growing relatively slowly compared to peak computational\nperformance and have limited storage capacity. Computational Fluid Dynamics\n(CFD) applications aiming to leverage the full power of Exascale HPC systems,\nsuch as the solver Nek5000, will generate massive data for further processing.\nThese data need to be efficiently stored via the IO subsystem. However, limited\nIO performance and storage capacity may result in performance, and thus\nscientific discovery, bottlenecks. In comparison to traditional post-processing\nmethods, in-situ techniques can reduce or avoid writing and reading the data\nthrough the IO subsystem, promising to be a solution to these problems. In this\npaper, we study the performance and resource usage of three in-situ use cases:\ndata compression, image generation, and uncertainty quantification. We\nfurthermore analyze three approaches when these in-situ tasks and the\nsimulation are executed synchronously, asynchronously, or in a hybrid manner.\nIn-situ compression can be used to reduce the IO time and storage requirements\nwhile maintaining data accuracy. Furthermore, in-situ visualization and\nanalysis can save Terabytes of data from being routed through the IO subsystem\nto storage. However, the overall efficiency is crucially dependent on the\ncharacteristics of both, the in-situ task and the simulation. In some cases,\nthe overhead introduced by the in-situ tasks can be substantial. Therefore, it\nis essential to choose the proper in-situ approach, synchronous, asynchronous,\nor hybrid, to minimize overhead and maximize the benefits of concurrent\nexecution.","PeriodicalId":501291,"journal":{"name":"arXiv - CS - Performance","volume":"77 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Understanding the Impact of Synchronous, Asynchronous, and Hybrid In-Situ Techniques in Computational Fluid Dynamics Applications\",\"authors\":\"Yi Ju, Adalberto Perez, Stefano Markidis, Philipp Schlatter, Erwin Laure\",\"doi\":\"arxiv-2407.20717\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"High-Performance Computing (HPC) systems provide input/output (IO)\\nperformance growing relatively slowly compared to peak computational\\nperformance and have limited storage capacity. Computational Fluid Dynamics\\n(CFD) applications aiming to leverage the full power of Exascale HPC systems,\\nsuch as the solver Nek5000, will generate massive data for further processing.\\nThese data need to be efficiently stored via the IO subsystem. However, limited\\nIO performance and storage capacity may result in performance, and thus\\nscientific discovery, bottlenecks. In comparison to traditional post-processing\\nmethods, in-situ techniques can reduce or avoid writing and reading the data\\nthrough the IO subsystem, promising to be a solution to these problems. In this\\npaper, we study the performance and resource usage of three in-situ use cases:\\ndata compression, image generation, and uncertainty quantification. We\\nfurthermore analyze three approaches when these in-situ tasks and the\\nsimulation are executed synchronously, asynchronously, or in a hybrid manner.\\nIn-situ compression can be used to reduce the IO time and storage requirements\\nwhile maintaining data accuracy. Furthermore, in-situ visualization and\\nanalysis can save Terabytes of data from being routed through the IO subsystem\\nto storage. However, the overall efficiency is crucially dependent on the\\ncharacteristics of both, the in-situ task and the simulation. In some cases,\\nthe overhead introduced by the in-situ tasks can be substantial. Therefore, it\\nis essential to choose the proper in-situ approach, synchronous, asynchronous,\\nor hybrid, to minimize overhead and maximize the benefits of concurrent\\nexecution.\",\"PeriodicalId\":501291,\"journal\":{\"name\":\"arXiv - CS - Performance\",\"volume\":\"77 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Performance\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2407.20717\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Performance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.20717","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Understanding the Impact of Synchronous, Asynchronous, and Hybrid In-Situ Techniques in Computational Fluid Dynamics Applications
High-Performance Computing (HPC) systems provide input/output (IO)
performance growing relatively slowly compared to peak computational
performance and have limited storage capacity. Computational Fluid Dynamics
(CFD) applications aiming to leverage the full power of Exascale HPC systems,
such as the solver Nek5000, will generate massive data for further processing.
These data need to be efficiently stored via the IO subsystem. However, limited
IO performance and storage capacity may result in performance, and thus
scientific discovery, bottlenecks. In comparison to traditional post-processing
methods, in-situ techniques can reduce or avoid writing and reading the data
through the IO subsystem, promising to be a solution to these problems. In this
paper, we study the performance and resource usage of three in-situ use cases:
data compression, image generation, and uncertainty quantification. We
furthermore analyze three approaches when these in-situ tasks and the
simulation are executed synchronously, asynchronously, or in a hybrid manner.
In-situ compression can be used to reduce the IO time and storage requirements
while maintaining data accuracy. Furthermore, in-situ visualization and
analysis can save Terabytes of data from being routed through the IO subsystem
to storage. However, the overall efficiency is crucially dependent on the
characteristics of both, the in-situ task and the simulation. In some cases,
the overhead introduced by the in-situ tasks can be substantial. Therefore, it
is essential to choose the proper in-situ approach, synchronous, asynchronous,
or hybrid, to minimize overhead and maximize the benefits of concurrent
execution.