HPSM: A Programming Framework for Multi-CPU and Multi-GPU Systems

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW). Pub Date: 2017-10-01. DOI: 10.1109/SBAC-PADW.2017.14

J. F. Lima, D. D. Domenico

Citations: 3

Abstract

This paper presents HPSM, a high-level C++ framework for programming multi-CPU and multi-GPU systems. HPSM provides parallel loops and reductions implemented over three backends: Serial, OpenMP (with the GCC and libKOMP runtimes), and StarPU. We evaluated the development effort of HPSM with an AXPY program, and its performance with three parallel benchmarks: N-Body, Hotspot, and a CFD solver. On a CPU-GPU system, the CPU-GPU combination attained better performance than GPU-only execution in all cases. Still, our findings provide evidence that NUMA affinity at the framework level may produce different results.
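
For readers unfamiliar with the two constructs named in the abstract, the following is a minimal sketch of an AXPY kernel written as a parallel loop, together with a sum written as a parallel reduction. It uses plain OpenMP pragmas in standard C++ and does not use or reproduce the HPSM API, which the abstract does not describe; the function names `axpy` and `sum` are illustrative assumptions. Without OpenMP enabled, the pragmas are ignored and the code runs serially.

```cpp
#include <cstddef>
#include <vector>

// Illustrative only: plain OpenMP, not the HPSM interface.

// AXPY as a parallel loop: y[i] = a * x[i] + y[i].
// Only the first min(x.size(), y.size()) elements are updated.
void axpy(float a, const std::vector<float>& x, std::vector<float>& y) {
    const long n = static_cast<long>(x.size() < y.size() ? x.size() : y.size());
    #pragma omp parallel for
    for (long i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}

// Sum of all elements as a parallel reduction.
// Each thread accumulates a private partial sum; OpenMP combines them.
double sum(const std::vector<float>& v) {
    const long n = static_cast<long>(v.size());
    double total = 0.0;
    #pragma omp parallel for reduction(+ : total)
    for (long i = 0; i < n; ++i) {
        total += v[i];
    }
    return total;
}
```

With GCC, the parallel version would typically be built with `-fopenmp`. A framework with several backends, as the abstract describes, would presumably hide this choice behind a single loop and reduction interface, but that design is not detailed in the text above.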
