In this paper, OpenCL is used to target a general purpose graphics processing unit (GPGPU) for acceleration of 2 modules used in a synthetic aperture radar (SAR) simulator. Two of the most computationally complex modules, the Back Projection and Generate Return modules, are targeted to an AMD FirePro M5100 GPGPU. The resulting speedup is 3X over multi-threaded C++ implementations of those algorithms running on an 4-core Intel I7 2.8GHz processor, 4X and 7X over single-threaded C++ implementations, and 19X and 29X over native MATLAB implementations, respectively.
{"title":"GPGPU Acceleration using OpenCL for a Spotlight SAR Simulator","authors":"E. Balster, M. Hoffman, J. P. Skeans, David Fan","doi":"10.1145/3078155.3078157","DOIUrl":"https://doi.org/10.1145/3078155.3078157","url":null,"abstract":"In this paper, OpenCL is used to target a general purpose graphics processing unit (GPGPU) for acceleration of 2 modules used in a synthetic aperture radar (SAR) simulator. Two of the most computationally complex modules, the Back Projection and Generate Return modules, are targeted to an AMD FirePro M5100 GPGPU. The resulting speedup is 3X over multi-threaded C++ implementations of those algorithms running on an 4-core Intel I7 2.8GHz processor, 4X and 7X over single-threaded C++ implementations, and 19X and 29X over native MATLAB implementations, respectively.","PeriodicalId":267581,"journal":{"name":"Proceedings of the 5th International Workshop on OpenCL","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132065158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Multi-FPGA acceleration has already shown several orders of magnitude speedups in today's analytics, scientific, financial, image processing and machine learning applications. However, the scalability of FPGA-based HPC demanded by the future avalanche of data and network traffic is still in its infancy. OpenCL memory model holds the promise of straightforward and pragmatic scalability of FPGA computing beyond single FPGA within the framework of today's OpenCL FPGA compilers. Scientific Concepts International develops novel Smart Cell Interconnect (SCI) optimized for OpenCL global memory, streaming data accesses and network packets encapsulated into switched cells. Recent advances in Open Source tools based on polyhedral model and IR enable development of source-to-source coarse grain code and data partitioning of the HPC workloads written initially in OpenCL and followed by C/C++, SYCL. Evolution of the multi-FPGA partitioning tools and SCI interconnect IP will enable true scalability of the computing fabric of arrays and clusters of FPGAs. Cloud FPGA computing, fog computing at the source of the generated data as well as fusion of networking, security, and computing are addressed by our architecture, partitioner tools, and future product roadmap.
{"title":"Scalable OpenCL FPGA Computing Evolution","authors":"A. Vassiliev","doi":"10.1145/3078155.3078165","DOIUrl":"https://doi.org/10.1145/3078155.3078165","url":null,"abstract":"Multi-FPGA acceleration has already shown several orders of magnitude speedups in today's analytics, scientific, financial, image processing and machine learning applications. However, the scalability of FPGA-based HPC demanded by the future avalanche of data and network traffic is still in its infancy. OpenCL memory model holds the promise of straightforward and pragmatic scalability of FPGA computing beyond single FPGA within the framework of today's OpenCL FPGA compilers. Scientific Concepts International develops novel Smart Cell Interconnect (SCI) optimized for OpenCL global memory, streaming data accesses and network packets encapsulated into switched cells. Recent advances in Open Source tools based on polyhedral model and IR enable development of source-to-source coarse grain code and data partitioning of the HPC workloads written initially in OpenCL and followed by C/C++, SYCL. Evolution of the multi-FPGA partitioning tools and SCI interconnect IP will enable true scalability of the computing fabric of arrays and clusters of FPGAs. Cloud FPGA computing, fog computing at the source of the generated data as well as fusion of networking, security, and computing are addressed by our architecture, partitioner tools, and future product roadmap.","PeriodicalId":267581,"journal":{"name":"Proceedings of the 5th International Workshop on OpenCL","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129260627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Proceedings of the 5th International Workshop on OpenCL","authors":"","doi":"10.1145/3078155","DOIUrl":"https://doi.org/10.1145/3078155","url":null,"abstract":"","PeriodicalId":267581,"journal":{"name":"Proceedings of the 5th International Workshop on OpenCL","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133071401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}