Low-latency double-precision floating-point division for FPGAs

2014 International Conference on Field-Programmable Technology (FPT) Pub Date : 2014-12-01 DOI:10.1109/FPT.2014.7082762

B. Liebig, A. Koch

引用次数: 5

Abstract

With growing FPGA capacities, applications requiring more intensive use of floating-point arithmetic become feasible candidates for acceleration using reconfigurable logic. Still among the more uncommon operations, however, are fast double-precision divider units. Since our application domain (acceleration of custom-compiled convex solvers) heavily relies on these blocks, we have implemented low-latency dividers based on the Goldschmidt algorithm that are accurate up to 1 bit of least precision (1-ULP). On Virtex-6 devices, our units operate at 200 MHz and significantly outperform other state-of-the-art 1-ULP dividers. We evaluate our blocks both stand-alone, as well as on the application-level when used for the high-level synthesis of the convex solver cores.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

fpga的低延迟双精度浮点除法

随着FPGA容量的增长，需要更多使用浮点运算的应用成为使用可重构逻辑加速的可行候选。然而，在比较不常见的操作中，还有快速双精度除法单元。由于我们的应用领域(自定义编译凸求解器的加速)严重依赖于这些块，因此我们基于Goldschmidt算法实现了低延迟分频器，其精度可达最低精度的1位(1- ulp)。在Virtex-6设备上，我们的单元工作频率为200 MHz，明显优于其他最先进的1-ULP分压器。我们既独立地评估我们的块，也在应用程序级别评估用于凸求解器核心的高级合成时的块。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 International Conference on Field-Programmable Technology (FPT)

自引率

0.00%

发文量

期刊最新文献

Message from the General Chair and Program Co-Chairs Accelerator-in-Switch: A Novel Cooperation Framework for FPGAs and GPUs FPGA Accelerated HPC and Data Analytics Novel Neural Network Applications on New Python Enabled Platforms High-level synthesis - the right side of history