VIP: A Versatile Inference Processor

2019 IEEE International Symposium on High Performance Computer Architecture (HPCA). Publication date: 2019-02-01. DOI: 10.1109/HPCA.2019.00049

Skand Hurkat, José F. Martínez

Citations: 8

Abstract

We present the Versatile Inference Processor (VIP), a highly programmable architecture for machine learning inference. VIP consists of 128 lightweight processing engines employing a vector processing paradigm, with a simple ISA and carefully chosen microarchitecture features. It is coupled with a modern, lightly customized, 3D-stacked memory system. Through detailed execution-driven simulations backed by RTL synthesis, we show that we can achieve online, real-time vision throughput (24 fps) at low power consumption, both for full-HD depth-from-stereo using belief propagation and for the VGG-16 and VGG-19 deep neural networks (batch size of 1). Our RTL synthesis of a VIP processing engine in TSMC 28 nm technology, using a commercial standard-cell library supplied by ARM, results in 18 mm² of silicon area and 3.5 W to 4.8 W of power consumption for all 128 VIP processing engines combined.
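To put the reported totals in perspective, the following is a minimal back-of-the-envelope sketch in Python. It uses only the figures quoted in the abstract. Two assumptions are not from the paper: that area and power divide evenly across the 128 engines, and that the engines draw the quoted power continuously while running at exactly 24 fps. The function names are hypothetical, and the results cover the processing engines only, not the 3D-stacked memory system.

```python
# Figures quoted in the abstract.
NUM_ENGINES = 128
TOTAL_AREA_MM2 = 18.0          # all 128 engines combined
TOTAL_POWER_W = (3.5, 4.8)     # low and high end of the reported range
TARGET_FPS = 24                # real-time vision throughput


def per_engine(total, engines=NUM_ENGINES):
    """Share of a total per engine (assumption: an even split)."""
    return total / engines


def frame_budget_ms(fps=TARGET_FPS):
    """Time available to process one frame at the given frame rate."""
    return 1000.0 / fps


def engine_energy_per_frame_j(power_w, fps=TARGET_FPS):
    """Energy spent by the engines on one frame.

    Assumption: constant power draw at exactly `fps` frames per second.
    """
    return power_w / fps


if __name__ == "__main__":
    print(f"Area per engine: {per_engine(TOTAL_AREA_MM2):.4f} mm^2")
    for p in TOTAL_POWER_W:
        print(f"At {p} W total: {per_engine(p) * 1000:.1f} mW per engine, "
              f"{engine_energy_per_frame_j(p):.3f} J per frame")
    print(f"Frame budget at {TARGET_FPS} fps: {frame_budget_ms():.2f} ms")
```

Under these assumptions each engine occupies roughly 0.14 mm² and draws a few tens of milliwatts, and each frame must complete in under about 42 ms.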

