A Ternary Based Bit Scalable, 8.80 TOPS/W CNN accelerator with Many-core Processing-in-memory Architecture with 896K synapses/mm2

2019 Symposium on VLSI Technology Pub Date : 2019-06-09 DOI:10.23919/VLSIT.2019.8776544

S. Okumura, M. Yabuuchi, K. Hijioka, Koichi Nose

Citations: 34

Abstract

A processing-in-memory (PIM) accelerator with ternary SRAM is proposed for low-power, large-scale deep neural network (DNN) processing. The accelerator consists of Ternary Neural Arithmetic Memory (TNAM), which is capable of bit-scalable multiply-and-accumulate (MAC) operation in accordance with the target accuracy and power limit. ADC-less readout circuits that reduce analog-to-digital conversion power and a system-level variation avoidance technique that utilizes features of TNAM are also proposed. A test chip with large-scale PIM was fabricated and successfully runs convolutional neural networks (CNNs) at 8.8 TOPS/W, achieving the highest accuracy and area density among recent SRAM-type PIMs.
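To make the idea of a bit-scalable ternary MAC concrete, the following is a minimal software sketch, not the circuit described in the paper. It assumes that a multi-bit signed weight can be split into several ternary planes (each digit in {-1, 0, +1}) scaled by powers of two, and that precision is chosen by the number of planes. The function names `ternary_mac`, `to_ternary_planes`, and `bit_scalable_mac`, and the power-of-two decomposition itself, are illustrative assumptions; the abstract does not specify how TNAM composes ternary cells.

```python
import numpy as np


def ternary_mac(activations, weights):
    """MAC with weights restricted to {-1, 0, +1} (no multiplier needed)."""
    a = np.asarray(activations, dtype=np.int64)
    w = np.asarray(weights, dtype=np.int64)
    if not np.isin(w, (-1, 0, 1)).all():
        raise ValueError("weights must be ternary: -1, 0, or +1")
    # Add activations where the weight is +1, subtract where it is -1.
    return int(a[w == 1].sum() - a[w == -1].sum())


def to_ternary_planes(weights, n_planes):
    """Split signed integer weights into n_planes ternary planes.

    Assumed decomposition (hypothetical, for illustration only):
        w = sum_k 2**k * t_k,  with t_k in {-1, 0, +1}
    where t_k is the k-th magnitude bit of w carrying the sign of w.
    """
    w = np.asarray(weights, dtype=np.int64)
    mag = np.abs(w)
    if (mag >= 2 ** n_planes).any():
        raise ValueError("weight magnitude does not fit in n_planes")
    sign = np.sign(w)
    return [sign * ((mag >> k) & 1) for k in range(n_planes)]


def bit_scalable_mac(activations, weights, n_planes):
    """Combine per-plane ternary MACs with power-of-two scaling."""
    planes = to_ternary_planes(weights, n_planes)
    return sum((2 ** k) * ternary_mac(activations, p)
               for k, p in enumerate(planes))


if __name__ == "__main__":
    acts = [3, 1, 4, 1, 5]
    wts = [5, -3, 0, 7, -6]
    # Matches the ordinary dot product of acts and wts.
    print(bit_scalable_mac(acts, wts, n_planes=3))
```

In this sketch, raising `n_planes` widens the representable weight range at the cost of more ternary MAC passes, which is one plausible way to picture trading accuracy against power.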


