Automated Neural Network Accelerator Generation Framework for Multiple Neural Network Applications

TENCON 2018 - 2018 IEEE Region 10 Conference Pub Date : 2018-10-01 DOI:10.1109/TENCON.2018.8650190

Inho Lee, Seongmin Hong, G. Ryu, Yongjun Park

引用次数: 1

Abstract

Neural networks are widely used in various applications, but general neural network accelerators support only one application at a time. Therefore, information for each application, such as synaptic weights and bias data, must be loaded quickly to use multiple neural network applications. Field-programmable gate array (FPGA)-based implementation has huge performance overhead owing to low data transmission bandwidth. In order to solve this problem, this paper presents an automated FPGA-based multi-neural network accelerator generation framework that can quickly support several applications by storing neural network application data in an on-chip memory inside the FPGA. To do this, we first design a shared custom hardware accelerator that can support rapid changes in multiple target neural network applications. Then, we introduce an automated multi-neural network accelerator generation framework that performs training, weight quantization, and neural accelerator synthesis.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

多神经网络应用的自动神经网络加速器生成框架

神经网络广泛应用于各种应用中，但一般的神经网络加速器一次只能支持一个应用。因此，每个应用程序的信息，如突触权重和偏置数据，必须快速加载以使用多个神经网络应用程序。基于现场可编程门阵列(FPGA)的实现由于数据传输带宽低而造成巨大的性能开销。为了解决这一问题，本文提出了一种基于FPGA的自动化多神经网络加速器生成框架，该框架通过将神经网络应用数据存储在FPGA内部的片上存储器中，可以快速支持多种应用。为此，我们首先设计了一个共享的自定义硬件加速器，它可以支持多个目标神经网络应用程序的快速更改。然后，我们介绍了一个自动化的多神经网络加速器生成框架，该框架执行训练、权重量化和神经加速器合成。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

TENCON 2018 - 2018 IEEE Region 10 Conference

自引率

0.00%

发文量