{"title":"二维浅水方程的格子玻尔兹曼方法","authors":"Jittavat Suksumlarn, W. Suwannik, M. Maleewong","doi":"10.1109/ICAICTA.2015.7335362","DOIUrl":null,"url":null,"abstract":"This paper presents the Lattice Boltzmann method (LBM) for two dimensional shallow water equations to simulate dam break problem. We implemented sequential code and compared it with the exact solution. Then, we implemented three parallel programs with the Compute Unified Device Architecture (CUDA). The first parallel version is a straightforward implementation. The second parallel version reduces the calculations of distribution function in the LBM. The third parallel program has fixed the branch divergence problem with branch distribution. Speed up for the third parallel program is increased approximately 2.1x from the first parallel program, and it is increased approximately 22.7x from the sequential program version.","PeriodicalId":319020,"journal":{"name":"2015 2nd International Conference on Advanced Informatics: Concepts, Theory and Applications (ICAICTA)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Lattice Boltzmann method for two-dimensional shallow water equations with CUDA\",\"authors\":\"Jittavat Suksumlarn, W. Suwannik, M. Maleewong\",\"doi\":\"10.1109/ICAICTA.2015.7335362\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents the Lattice Boltzmann method (LBM) for two dimensional shallow water equations to simulate dam break problem. We implemented sequential code and compared it with the exact solution. Then, we implemented three parallel programs with the Compute Unified Device Architecture (CUDA). The first parallel version is a straightforward implementation. The second parallel version reduces the calculations of distribution function in the LBM. The third parallel program has fixed the branch divergence problem with branch distribution. Speed up for the third parallel program is increased approximately 2.1x from the first parallel program, and it is increased approximately 22.7x from the sequential program version.\",\"PeriodicalId\":319020,\"journal\":{\"name\":\"2015 2nd International Conference on Advanced Informatics: Concepts, Theory and Applications (ICAICTA)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 2nd International Conference on Advanced Informatics: Concepts, Theory and Applications (ICAICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAICTA.2015.7335362\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 2nd International Conference on Advanced Informatics: Concepts, Theory and Applications (ICAICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAICTA.2015.7335362","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Lattice Boltzmann method for two-dimensional shallow water equations with CUDA
This paper presents the Lattice Boltzmann method (LBM) for two dimensional shallow water equations to simulate dam break problem. We implemented sequential code and compared it with the exact solution. Then, we implemented three parallel programs with the Compute Unified Device Architecture (CUDA). The first parallel version is a straightforward implementation. The second parallel version reduces the calculations of distribution function in the LBM. The third parallel program has fixed the branch divergence problem with branch distribution. Speed up for the third parallel program is increased approximately 2.1x from the first parallel program, and it is increased approximately 22.7x from the sequential program version.