Mishal Almazrooie, R. Abdullah, L. Yi, Ibrahim Venkat, Zahraa Adnan
{"title":"基于CUDA的GP-GPU并行拉普拉斯滤波","authors":"Mishal Almazrooie, R. Abdullah, L. Yi, Ibrahim Venkat, Zahraa Adnan","doi":"10.1109/ICIMU.2014.7066604","DOIUrl":null,"url":null,"abstract":"Parallel programming has been extensively applied to different fields, such as medicine, security, and image processing. This paper focuses on parallelizing the Laplacian filter, an edge detection algorithm, using CUDA. We have conducted a performance analysis to benchmark the sequential Laplacian version against the CUDA parallel approach. Results show that the parallelization of Laplacian filter achieves better performance than sequential code. CUDA approach achieved 200 fold speedup for an image size of 50MB when 216 processing elements were deployed.","PeriodicalId":408534,"journal":{"name":"Proceedings of the 6th International Conference on Information Technology and Multimedia","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Parallel Laplacian filter using CUDA on GP-GPU\",\"authors\":\"Mishal Almazrooie, R. Abdullah, L. Yi, Ibrahim Venkat, Zahraa Adnan\",\"doi\":\"10.1109/ICIMU.2014.7066604\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Parallel programming has been extensively applied to different fields, such as medicine, security, and image processing. This paper focuses on parallelizing the Laplacian filter, an edge detection algorithm, using CUDA. We have conducted a performance analysis to benchmark the sequential Laplacian version against the CUDA parallel approach. Results show that the parallelization of Laplacian filter achieves better performance than sequential code. CUDA approach achieved 200 fold speedup for an image size of 50MB when 216 processing elements were deployed.\",\"PeriodicalId\":408534,\"journal\":{\"name\":\"Proceedings of the 6th International Conference on Information Technology and Multimedia\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 6th International Conference on Information Technology and Multimedia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIMU.2014.7066604\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Information Technology and Multimedia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIMU.2014.7066604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Parallel programming has been extensively applied to different fields, such as medicine, security, and image processing. This paper focuses on parallelizing the Laplacian filter, an edge detection algorithm, using CUDA. We have conducted a performance analysis to benchmark the sequential Laplacian version against the CUDA parallel approach. Results show that the parallelization of Laplacian filter achieves better performance than sequential code. CUDA approach achieved 200 fold speedup for an image size of 50MB when 216 processing elements were deployed.