8-bit Convolutional Neural Network Accelerator for Face Recognition

2020 11th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON) Pub Date : 2020-10-28 DOI:10.1109/UEMCON51285.2020.9298114

Wei Pang, Yufeng Li, Shengli Lu

引用次数: 0

Abstract

With the development of convolutional neural network (CNN), the accuracy of face recognition has been greatly improved. But the huge amount of weights and calculations hinders its implementation in portable devices. Designing hardware accelerator is an effective solution to the problem. In this paper, a face recognition algorithm is designed based on deep separable convolution. The weights and activations are quantified to 8 bits, reducing the requirement of data access and bandwidth. In addition, a generic CNN accelerator based on systolic array is designed and validated on Xilinx Zynq-XC7Z035 FPGA. The face recognition algorithm achieved an accuracy of 94.4% in the LFW dataset. The performance and power efficiency of the accelerator are 52.9 GOPS and 9.71GOPS/W at 100MHz, respectively. And the accelerator can process 160×160 face image at 25FPS.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于人脸识别的8位卷积神经网络加速器

随着卷积神经网络(CNN)的发展，人脸识别的准确率有了很大的提高。但是巨大的重量和计算量阻碍了它在便携式设备中的实现。设计硬件加速器是解决这一问题的有效途径。本文设计了一种基于深度可分离卷积的人脸识别算法。权重和激活量化为8位，减少了对数据访问和带宽的要求。此外，在Xilinx Zynq-XC7Z035 FPGA上设计并验证了一种基于收缩阵列的通用CNN加速器。人脸识别算法在LFW数据集上的准确率达到了94.4%。在100MHz时，加速器的性能和功率效率分别为52.9 GOPS/W和9.71GOPS/W。加速器可以以25FPS的速度处理160×160人脸图像。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 11th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON)

自引率

0.00%

发文量