{"title":"Lattice Boltzmann method for two-dimensional shallow water equations with CUDA","authors":"Jittavat Suksumlarn, W. Suwannik, M. Maleewong","doi":"10.1109/ICAICTA.2015.7335362","DOIUrl":null,"url":null,"abstract":"This paper presents the Lattice Boltzmann method (LBM) for two dimensional shallow water equations to simulate dam break problem. We implemented sequential code and compared it with the exact solution. Then, we implemented three parallel programs with the Compute Unified Device Architecture (CUDA). The first parallel version is a straightforward implementation. The second parallel version reduces the calculations of distribution function in the LBM. The third parallel program has fixed the branch divergence problem with branch distribution. Speed up for the third parallel program is increased approximately 2.1x from the first parallel program, and it is increased approximately 22.7x from the sequential program version.","PeriodicalId":319020,"journal":{"name":"2015 2nd International Conference on Advanced Informatics: Concepts, Theory and Applications (ICAICTA)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 2nd International Conference on Advanced Informatics: Concepts, Theory and Applications (ICAICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAICTA.2015.7335362","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper presents the Lattice Boltzmann method (LBM) for two dimensional shallow water equations to simulate dam break problem. We implemented sequential code and compared it with the exact solution. Then, we implemented three parallel programs with the Compute Unified Device Architecture (CUDA). The first parallel version is a straightforward implementation. The second parallel version reduces the calculations of distribution function in the LBM. The third parallel program has fixed the branch divergence problem with branch distribution. Speed up for the third parallel program is increased approximately 2.1x from the first parallel program, and it is increased approximately 22.7x from the sequential program version.