Autonomous, independent management of dynamic graphs on GPUs

2017 IEEE High Performance Extreme Computing Conference (HPEC) Pub Date : 2017-09-01 DOI:10.1109/HPEC.2017.8091058

Martin Winter, Rhaleb Zayer, M. Steinberger

引用次数: 29

Abstract

In this paper, we present a new, dynamic graph data structure, built to deliver high update rates while keeping a low memory footprint using autonomous memory management directly on the GPU. By transferring the memory management to the GPU, efficient updating of the graph structure and fast initialization times are enabled as no additional memory allocation calls or reallocation procedures are necessary since they are handled directly on the device. In comparison to previous work, this optimized approach allows for significantly lower initialization times (up to 300× faster) and much higher update rates for significant changes to the graph structure and equal rates for small changes. The framework provides different update implementations tailored specifically to different graph properties, enabling over 100 million of updates per second and keeping tens of millions of vertices and hundreds of millions of edges in memory without transferring data back and forth between device and host.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

自主、独立地管理gpu上的动态图形

在本文中，我们提出了一种新的动态图形数据结构，该结构旨在提供高更新率，同时直接在GPU上使用自主内存管理来保持低内存占用。通过将内存管理转移到GPU，可以有效地更新图形结构和快速初始化时间，因为不需要额外的内存分配调用或重新分配过程，因为它们直接在设备上处理。与以前的工作相比，这种优化的方法允许更低的初始化时间(最多快300倍)和更高的更新率，用于对图结构的重大更改和对小更改的相同速率。该框架提供了针对不同图形属性量身定制的不同更新实现，支持每秒超过1亿次更新，并在内存中保留数千万个顶点和数亿个边，而无需在设备和主机之间来回传输数据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2017 IEEE High Performance Extreme Computing Conference (HPEC)

自引率

0.00%

发文量

期刊最新文献

Optimized task graph mapping on a many-core neuromorphic supercomputer Software-defined extreme scale networks for bigdata applications Power-aware computing: Measurement, control, and performance analysis for Intel Xeon Phi xDCI, a data science cyberinfrastructure for interdisciplinary research Leakage energy reduction for hard real-time caches