Accelerating Deep Neural Networks Using FPGA

2018 30th International Conference on Microelectronics (ICM) Pub Date : 2018-12-01 DOI:10.1109/ICM.2018.8704085

Esraa M Adel, Rana Magdy, Sara Mohamed, Mona Mamdouh, Eman El Mandouh, H. Mostafa

Citations: 5

Abstract

Deep Convolutional Neural Networks (CNNs) are the state-of-the-art systems for image classification and scene understanding. They are widely used for their superior accuracy, but that accuracy comes at the cost of high computational complexity. A current goal in the field is to accelerate CNNs so that they can be used in real-time applications. Graphics Processing Units (GPUs) are the usual solution, but their high power consumption prevents their use in everyday equipment. The Field Programmable Gate Array (FPGA) is an alternative platform for CNN implementation because of its low power consumption and flexible architecture. This work discusses this problem and provides a solution that trades off the speed of the CNN against the power consumption of the FPGA. The solution relies on two main speed-up techniques: parallelism across layer resources and pipelining inside some layers.
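The abstract gives no implementation details, so the following is only a rough cycle-count sketch of why the two named techniques help, not the authors' architecture. It assumes an idealized layer in which every pipeline stage takes one clock cycle, there are no stalls or memory bottlenecks, and work divides evenly across identical hardware units. The function names and the example sizes are hypothetical.

```python
import math


def cycles_sequential(num_items: int, num_stages: int) -> int:
    """No pipelining: each item finishes all stages before the next starts."""
    return num_items * num_stages


def cycles_pipelined(num_items: int, num_stages: int) -> int:
    """Ideal pipeline: one new item enters per cycle after the pipeline fills."""
    if num_items == 0:
        return 0
    return num_stages + num_items - 1


def cycles_parallel_pipelined(num_items: int, num_stages: int, num_units: int) -> int:
    """Pipelining plus parallelism: items are split across identical pipelines."""
    items_per_unit = math.ceil(num_items / num_units)
    return cycles_pipelined(items_per_unit, num_stages)


if __name__ == "__main__":
    # Hypothetical workload: 1024 output pixels, a 4-stage datapath, 8 parallel units.
    items, stages, units = 1024, 4, 8
    print("sequential:          ", cycles_sequential(items, stages))
    print("pipelined:           ", cycles_pipelined(items, stages))
    print("parallel + pipelined:", cycles_parallel_pipelined(items, stages, units))
```

Under these assumptions, pipelining removes most of the per-stage latency from the total, while parallel units divide the remaining work at the cost of more FPGA resources and power, which is the trade-off the abstract refers to.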


