Numerical Simulation of Particulate Matter Transport in the Atmospheric Urban Boundary Layer Using the Lagrangian Approach: Physical Problems and Parallel Implementation
A. I. Varentsov, O. A. Imeev, A. V. Glazunov, E. V. Mortikov, V. M. Stepanenko
{"title":"Numerical Simulation of Particulate Matter Transport in the Atmospheric Urban Boundary Layer Using the Lagrangian Approach: Physical Problems and Parallel Implementation","authors":"A. I. Varentsov, O. A. Imeev, A. V. Glazunov, E. V. Mortikov, V. M. Stepanenko","doi":"10.1134/s0361768823080248","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>This paper presents results of development of a numerical model of Lagrangian particle transport, as well as results of application of parallel computation methods to improve the efficiency of the software implementation of this model. The model is a software package that allows the transport and deposition of aerosol particles to be calculated taking into account properties of particles and the input data that describe atmospheric conditions and underlying surface geometry. The dynamic core, physical parameterizations, numerical implementation, and algorithm of the model are described. Results of successful verification of the model on analytical solutions are presented. Initially, the model was used for less computationally intensive problems. In this paper, given the need to use the model in more computationally intensive problems, we optimize the sequential software implementation of the model, as well as develop its software implementations that use parallel computing technologies (OpenMP, MPI, and CUDA). The results of testing different implementations of the model show that the optimization of the most computationally complex blocks in its sequential version can reduce the execution time by 27%. At the same time, the use of parallel computing technologies allows us to achieve acceleration by several orders of magnitude. The use of OpenMP in the dynamic block of the model provides almost 4-fold acceleration of this block; the use of MPI, almost 8-fold acceleration; and the use of CUDA, almost 16-fold acceleration (all other conditions being equal). We also give some recommendations on the choice of a parallel computing technology depending on the properties of a computing system.</p>","PeriodicalId":54555,"journal":{"name":"Programming and Computer Software","volume":null,"pages":null},"PeriodicalIF":0.7000,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Programming and Computer Software","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1134/s0361768823080248","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
This paper presents results of development of a numerical model of Lagrangian particle transport, as well as results of application of parallel computation methods to improve the efficiency of the software implementation of this model. The model is a software package that allows the transport and deposition of aerosol particles to be calculated taking into account properties of particles and the input data that describe atmospheric conditions and underlying surface geometry. The dynamic core, physical parameterizations, numerical implementation, and algorithm of the model are described. Results of successful verification of the model on analytical solutions are presented. Initially, the model was used for less computationally intensive problems. In this paper, given the need to use the model in more computationally intensive problems, we optimize the sequential software implementation of the model, as well as develop its software implementations that use parallel computing technologies (OpenMP, MPI, and CUDA). The results of testing different implementations of the model show that the optimization of the most computationally complex blocks in its sequential version can reduce the execution time by 27%. At the same time, the use of parallel computing technologies allows us to achieve acceleration by several orders of magnitude. The use of OpenMP in the dynamic block of the model provides almost 4-fold acceleration of this block; the use of MPI, almost 8-fold acceleration; and the use of CUDA, almost 16-fold acceleration (all other conditions being equal). We also give some recommendations on the choice of a parallel computing technology depending on the properties of a computing system.
期刊介绍:
Programming and Computer Software is a peer reviewed journal devoted to problems in all areas of computer science: operating systems, compiler technology, software engineering, artificial intelligence, etc.