Superlinear speedup for matrix multiplication

Proceedings of the ITI 2012 34th International Conference on Information Technology Interfaces Pub Date : 2012-06-25 DOI:10.2498/iti.2012.0376

S. Ristov, M. Gusev

引用次数: 18

Abstract

Amdahl has shown that multiprocessor execution performance is not proportional to the number of processors. Gustafson has found a way to show that there are algorithms which can have almost linear speedup. In this article we have found algorithms which can achieve a superlinear speedup. The idea is not based on changing the algorithm or executing smaller number of operations like in the parallel search. It is based on characteristics of using an structure persistent algorithm which efficiently exploits the cache in a shared multiprocessor and avoids cache misses as much as possible. Our experimental research shows results of superlinear speedup for algorithms which run on modern multicore and multi-chip architectures and perform beyond expectations of maximum linear speedup.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

矩阵乘法的超线性加速

Amdahl已经表明，多处理器执行性能与处理器数量不成比例。Gustafson找到了一种方法来证明有些算法几乎可以实现线性加速。在本文中，我们找到了可以实现超线性加速的算法。这个想法不是基于改变算法或执行少量的操作，比如并行搜索。它基于使用结构持久算法的特点，有效地利用了共享多处理器中的缓存，并尽可能地避免了缓存丢失。我们的实验研究显示了在现代多核和多芯片架构上运行的算法的超线性加速结果，并且超出了最大线性加速的预期。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊