Update on Triangle Counting on GPU

2019 IEEE High Performance Extreme Computing Conference (HPEC) Pub Date : 2019-09-01 DOI:10.1109/HPEC.2019.8916547

Carl Pearson, M. Almasri, Omer Anjum, Vikram Sharma Mailthody, Zaid Qureshi, R. Nagi, Jinjun Xiong, Wen-mei W. Hwu

引用次数: 13

Abstract

This work presents an update to the triangle-counting portion of the subgraph isomorphism static graph challenge. This work is motivated by a desire to understand the impact of CUDA unified memory on the triangle-counting problem. First, CUDA unified memory is used to overlap reading large graph data from disk with graph data structures in GPU memory. Second, we use CUDA unified memory hints to solve multi-GPU performance scaling challenges present in our last submission. Finally, we improve the single-GPU kernel performance from our past submission by introducing a work-stealing dynamic algorithm GPU kernel with persistent threads, which makes performance adaptive for large graphs without requiring a graph analysis phase.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

更新了GPU上的三角形计数

这项工作提出了对子图同构静态图挑战的三角形计数部分的更新。这项工作的动机是想了解CUDA统一内存对三角形计数问题的影响。首先，使用CUDA统一内存将从磁盘读取的大型图形数据与GPU内存中的图形数据结构重叠。其次，我们使用CUDA统一内存提示来解决我们上次提交的多gpu性能扩展挑战。最后，我们通过引入具有持久线程的工作窃取动态算法GPU内核来改进过去提交的单GPU内核性能，这使得性能自适应于大型图形而无需图形分析阶段。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2019 IEEE High Performance Extreme Computing Conference (HPEC)

自引率

0.00%

发文量