Improving clock-rate of hard-macro designs

2013 International Conference on Field-Programmable Technology (FPT) Pub Date : 2013-12-01 DOI:10.1109/FPT.2013.6718361

C. Lavin, B. Nelson, B. Hutchings

引用次数: 7

Abstract

HMFlow reuses precompiled circuit modules (hard macros) and other techniques to rapidly compile large designs in a few seconds - many times faster than standard Xilinx flows. However, the clock rates of designs rapidly compiled by HMFlow are often significantly lower than those compiled by the Xilinx flow. To improve clock rates, HMFlow algorithms were modified as follows: (1) the router was modified to take advantage of longer routing wires in the FPGA devices, (2) the original greedy placer was replaced with an annealing-based placer, and (3) certain registers were removed from the hard-macro and moved into the fabric to reduce critical-path delays. Benchmark circuits compiled with these modifications can achieve clock rates that are about 75% as fast as those achieved by Xilinx, on average. Fast run-times are also preserved; the improved algorithms only increase HMFlow run-times by about 50% across the benchmark suite so that HMFlow remains more than 30× faster than the standard Xilinx flow for the benchmarks tested in this paper.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

提高硬宏设计的时钟率

HMFlow重用预编译电路模块(硬宏)和其他技术，可以在几秒钟内快速编译大型设计——比标准Xilinx流程快很多倍。然而，由HMFlow快速编译的设计的时钟速率通常明显低于由Xilinx流编译的设计。为了提高时钟速率，对HMFlow算法进行了如下修改:(1)修改了路由器，以利用FPGA设备中更长的路由线;(2)将原始的贪婪砂矿替换为基于退火的砂矿;(3)从硬宏中删除某些寄存器，并将其移到结构中，以减少关键路径延迟。通过这些修改编译的基准电路可以实现比Xilinx平均快75%的时钟速率。还保留了快速运行时间;改进的算法在整个基准测试套件中只增加了大约50%的HMFlow运行时间，因此在本文测试的基准测试中，HMFlow仍然比标准Xilinx流快30倍以上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2013 International Conference on Field-Programmable Technology (FPT)

自引率

0.00%

发文量