Exploitation of operation-level parallelism in a processor of the CRAY X-MP

Proceedings., 1990 IEEE International Conference on Computer Design: VLSI in Computers and Processors Pub Date : 1990-09-17 DOI:10.1109/ICCD.1990.130149

S. Vajapeyam, G. Sohi, W. Hsu

引用次数: 2

Abstract

Available operation-level parallelism and its exploitation in the CRAY X-MP processor are studied. Considered are the sizes and contributions to execution time of basic blocks, instruction and operation issue rates and issue stalls, and operation execution overlap for entire executions of three large programs, FLO52, TRFD, and QCD1, taken from the Perfect Club benchmark set. The large basic blocks account for a significant portion of the overall execution time. It is also found that with the use of vector instructions, the X-MP is able to issue more than one operation per clock cycle, even though it can issue a maximum of one instruction per cycle.<>

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

CRAY X-MP处理器中操作级并行性的开发

研究了CRAY X-MP处理器中可用的操作级并行性及其开发。考虑了基本块的大小和对执行时间的贡献，指令和操作的发布率和发布延迟，以及三个大型程序(FLO52、TRFD和QCD1)的整个执行的操作执行重叠，取自Perfect Club基准集。大的基本块占整个执行时间的很大一部分。它还发现，使用向量指令，X-MP能够发出一个以上的操作，每个时钟周期，即使它可以发出最多一个指令。>

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings., 1990 IEEE International Conference on Computer Design: VLSI in Computers and Processors

自引率

0.00%

发文量

期刊最新文献

Design aids and test results for laser-programmable logic arrays An analog parallel distributed solution to the shortest path problem Exploitation of operation-level parallelism in a processor of the CRAY X-MP Pin assignment for improved performance in standard cell design The observability don't-care set and its approximations