Massively parallel programming models used as hardware description languages: The OpenCL case

2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) Pub Date : 2011-11-07 DOI:10.1109/ICCAD.2011.6105349

Muhsen Owaida, Nikolaos Bellas, C. Antonopoulos, Konstantis Daloukas, C. Antoniadis

{"title":"Massively parallel programming models used as hardware description languages: The OpenCL case","authors":"Muhsen Owaida, Nikolaos Bellas, C. Antonopoulos, Konstantis Daloukas, C. Antoniadis","doi":"10.1109/ICCAD.2011.6105349","DOIUrl":null,"url":null,"abstract":"The problem of automatically generating hardware modules from high level application representations has been at the forefront of EDA research during the last few years. In this paper, we introduce a methodology to automatically synthesize hardware accelerators from OpenCL applications. OpenCL is a recent industry supported standard for writing programs that execute on multicore platforms and accelerators such as GPUs. Our methodology maps OpenCL kernels into hardware accelerators, based on architectural templates that explicitly decouple computation from memory communication whenever this is possible. The templates can be tuned to provide a wide repertoire of accelerators that meet user performance requirements and FPGA device characteristics. Furthermore, a set of high- and low-level compiler optimizations is applied to generate optimized accelerators. Our experimental evaluation shows that the generated accelerators are tuned efficiently to match the applications memory access pattern and computational complexity, and to achieve user performance requirements. An important objective of our tool is to expand the FPGA development user base to software engineers, thereby expanding the scope of FPGAs beyond the realm of hardware design.","PeriodicalId":6357,"journal":{"name":"2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"15 1","pages":"326-333"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAD.2011.6105349","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 14

Abstract

The problem of automatically generating hardware modules from high level application representations has been at the forefront of EDA research during the last few years. In this paper, we introduce a methodology to automatically synthesize hardware accelerators from OpenCL applications. OpenCL is a recent industry supported standard for writing programs that execute on multicore platforms and accelerators such as GPUs. Our methodology maps OpenCL kernels into hardware accelerators, based on architectural templates that explicitly decouple computation from memory communication whenever this is possible. The templates can be tuned to provide a wide repertoire of accelerators that meet user performance requirements and FPGA device characteristics. Furthermore, a set of high- and low-level compiler optimizations is applied to generate optimized accelerators. Our experimental evaluation shows that the generated accelerators are tuned efficiently to match the applications memory access pattern and computational complexity, and to achieve user performance requirements. An important objective of our tool is to expand the FPGA development user base to software engineers, thereby expanding the scope of FPGAs beyond the realm of hardware design.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

作为硬件描述语言的大规模并行编程模型:OpenCL案例

从高级应用程序表示中自动生成硬件模块的问题在过去几年中一直是EDA研究的前沿。本文介绍了一种从OpenCL应用程序中自动合成硬件加速器的方法。OpenCL是最近业界支持的一种标准，用于编写在多核平台和gpu等加速器上执行的程序。我们的方法将OpenCL内核映射到硬件加速器中，基于架构模板，在可能的情况下显式地将计算与内存通信解耦。可以对模板进行调优，以提供满足用户性能要求和FPGA器件特性的各种加速器。此外，还应用了一组高级和低级编译器优化来生成优化的加速器。我们的实验评估表明，所生成的加速器可以有效地调整以匹配应用程序的内存访问模式和计算复杂度，并达到用户的性能要求。我们的工具的一个重要目标是将FPGA开发用户群扩展到软件工程师，从而将FPGA的范围扩展到硬件设计领域之外。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

自引率

0.00%

发文量