Loop acceleration and instruction repeat support for application specific instruction-set processors

2015 28th IEEE International System-on-Chip Conference (SOCC) Pub Date : 2015-09-01 DOI:10.1109/SOCC.2015.7406957

Zhenzhi Wu, Dake Liu, Xiaoyang Li

引用次数: 1

Abstract

Computation intensive tasks which consist of nested short loops usually suffer from massive control overhead, or memory size increasing when employing loop unrolling. In this approach, by introducing a modified instruction fetch unit with instruction FIFO and multiple loop controllers, loops can be performed in hardware, and single execution-cycle instructions can be executed in self-loop. Therefore no loop overhead exists for the optimized processor. The flexibility and the instruction granularity are maintained. Special domains for loop and repeat indications are added in the application-specific instructions. The proposed approach achieves dramatically performance and area benefits for many nested short loop dominated programs where the loops are determinable.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

针对特定于应用程序的指令集处理器的循环加速和指令重复支持

由嵌套短循环组成的计算密集型任务通常会遭受巨大的控制开销，或者在使用循环展开时增加内存大小。在这种方法中，通过引入带有指令FIFO和多循环控制器的修改指令提取单元，可以在硬件中执行循环，而在自循环中执行单执行周期的指令。因此，优化后的处理器不存在循环开销。保持了灵活性和指令粒度。在特定于应用程序的指令中添加了循环和重复指示的特殊域。对于许多嵌套短循环主导的程序，该方法在循环是可确定的情况下获得了显著的性能和面积优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2015 28th IEEE International System-on-Chip Conference (SOCC)

自引率

0.00%

发文量

期刊最新文献

Per-flow state management technique for high-speed networks A 5-b 1-GS/s 2.7-mW binary-search ADC in 90nm digital CMOS Low-latency power-efficient adaptive router design for network-on-chip A multi-level collaboration low-power design based on embedded system A high speed and low power content-addressable memory(CAM) using pipelined scheme