Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU

2019 IEEE High Performance Extreme Computing Conference (HPEC) Pub Date : 2019-09-01 DOI:10.1109/HPEC.2019.8916473

Mark P. Blanco, Tze Meng Low, Kyungjoo Kim

引用次数: 16

Abstract

In this work we present a performance exploration on Eager K-truss, a linear-algebraic formulation of the K-truss graph algorithm. We address performance issues related to load imbalance of parallel tasks in symmetric, triangular graphs by presenting a fine-grained parallel approach to executing the support computation. This approach also increases available parallelism, making it amenable to GPU execution. We demonstrate our fine-grained parallel approach using implementations in Kokkos and evaluate them on an Intel Skylake CPU and an Nvidia Tesla V100 GPU. Overall, we observe between a 1.261. 48x improvement on the CPU and a 9.97-16.92x improvement on the GPU due to our fine-grained parallel formulation.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

GPU和CPU负载均衡的细粒度并行性研究

在这项工作中，我们提出了对热切k -桁架的性能探索，这是k -桁架图算法的线性代数公式。我们通过提供细粒度并行方法来执行支持计算，解决了与对称三角形图中并行任务的负载不平衡相关的性能问题。这种方法还增加了可用的并行性，使其适合GPU执行。我们使用Kokkos中的实现演示了我们的细粒度并行方法，并在英特尔Skylake CPU和Nvidia Tesla V100 GPU上对它们进行了评估。总的来说，我们观察到1.261之间。由于我们的细粒度并行公式，CPU提高了48倍，GPU提高了9.97-16.92倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2019 IEEE High Performance Extreme Computing Conference (HPEC)

自引率

0.00%

发文量