A 1W8R 20T SRAM Codebook for 20% Energy Reduction in Mixed-Precision Deep-Learning Inference Processor System

2023 IEEE 5th International Conference on Artificial Intelligence Circuits and Systems (AICAS) Pub Date : 2023-06-11 DOI:10.1109/AICAS57966.2023.10168555

Ryotaro Ohara, Masaya Kabuto, Masakazu Taichi, Atsushi Fukunaga, Yuto Yasuda, Riku Hamabe, S. Izumi, H. Kawaguchi

引用次数: 0

Abstract

This study introduces a 1W8R 20T multiport memory for codebook quantization in deep-learning processors. We manufactured the memory in a 40 nm process and achieved memory read-access time at 2.75 ns and 2.7-pj/byte power consumption. In addition, we used NVDLA, which was NVIDIA’s deep-learning processor, as a motif and simulated it based on the power obtained from the actual proposed memory. The obtained power and area reduction results are 20.24% and 26.24%, respectively.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于混合精度深度学习推理处理器系统能耗降低20%的1w8r20t SRAM码本

本研究介绍一种用于深度学习处理器码本量化的1w8r20t多端口存储器。我们在40 nm工艺中制造了存储器，并实现了存储器读取访问时间为2.75 ns和2.7 pj/byte的功耗。此外，我们使用NVIDIA的深度学习处理器NVDLA作为母题，并根据从实际提议的存储器中获得的功率进行模拟。得到的功率和面积分别降低20.24%和26.24%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2023 IEEE 5th International Conference on Artificial Intelligence Circuits and Systems (AICAS)

自引率

0.00%

发文量