文献互助智能选刊最新文献

高级搜索发布求助登录注册

A GPU Implementation of the Sparse Deep Neural Network Graph Challenge

2019 IEEE High Performance Extreme Computing Conference (HPEC) Pub Date : 2019-09-01 DOI:10.1109/HPEC.2019.8916223

M. Bisson, M. Fatica

引用次数: 16

Abstract

This paper presents a CUDA implementation of the latest addition to the Graph Challenge, the inference computation on a collection of large sparse deep neural networks. A single Tesla V100 can compute the inference at 3.7 TeraEdges/s. Using the managed memory API available in CUDA allows for simple and efficient distribution of these computations across a multiGPU NVIDIA DGX-2 server.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

稀疏深度神经网络图挑战的GPU实现

本文介绍了图形挑战最新添加的CUDA实现，该挑战是对大型稀疏深度神经网络集合的推理计算。一台Tesla V100可以以3.7 TeraEdges/s的速度计算推理。使用CUDA中可用的托管内存API可以在多gpu NVIDIA DGX-2服务器上简单有效地分配这些计算。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2019 IEEE High Performance Extreme Computing Conference (HPEC)

2019 IEEE High Performance Extreme Computing Conference (HPEC)

自引率

0.00%

发文量

0

期刊最新文献

[HPEC 2019 Copyright notice] Concurrent Katz Centrality for Streaming Graphs Cyber Baselining: Statistical properties of cyber time series and the search for stability Emerging Applications of 3D Integration and Approximate Computing in High-Performance Computing Systems: Unique Security Vulnerabilities Target-based Resource Allocation for Deep Learning Applications in a Multi-tenancy System

0

微信

客服QQ

Book学术公众号

扫码关注我们

反馈

Book学术官方微信

Book学术文献互助

Book学术文献互助群
群号：604180095

文献互助智能选刊最新文献互助须知联系我们：info@booksci.cn

Book学术提供免费学术资源搜索服务，方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。

Copyright © 2023 Book学术 All rights reserved.

京公网安备 11010802042870号京ICP备2023020795号-1