- Book学术

2020 International SoC Design Conference (ISOCC) Pub Date : 2020-10-21 DOI:10.1109/ISOCC50952.2020.9333012

Eunchong Lee, Yongseok Lee, Sang-Seol Lee, Byoung-Ho Choi

引用次数: 1

摘要

为了提高PE的性能，我们关注最小化外部内存访问和并行化的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Implementation of a Round Robin Processing Element for Deep Learning Accelerator

The deep learning acceleration hardwareperformance is greatly affected by Processing Elements (PEs). In order to apply deep learning accelerators to mobile devices, optimized PE must be designed as ASIC. To improve the performance of PE, we focused on methods of minimizing external memory access and parallelization. As a result, a deep learning accelerator architecture consisting of 512 PEs in parallel is proposed and the results of FPGA implementation is presented.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 International SoC Design Conference (ISOCC)

自引率

0.00%

发文量