Learning to Quantize Deep Neural Networks: A Competitive-Collaborative Approach

2020 57th ACM/IEEE Design Automation Conference (DAC) Pub Date : 2020-07-01 DOI:10.1109/DAC18072.2020.9218576

Md Fahim Faysal Khan, Mohammad Mahdi Kamani, M. Mahdavi, V. Narayanan

引用次数: 11

Abstract

Reducing the model size and computation costs for dedicated AI accelerator designs, neural network quantization methods have attracted momentous attention recently. Unfortunately, merely minimizing quantization loss using constant discretization causes accuracy deterioration. In this paper, we propose an iterative accuracy-driven learning framework of competitive-collaborative quantization (CCQ) to gradually adapt the bit-precision of each individual layer. Orthogonal to prior quantization policies working with full precision for the first and last layers of the network, CCQ offers layer-wise competition for any target quantization policy with holistic layer fine-tuning to recover accuracy, where the state-of-the-art networks can be entirely quantized without any significant accuracy degradation.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

学习量化深度神经网络:一种竞争-协作的方法

神经网络量化方法在减少人工智能专用加速器设计的模型尺寸和计算成本方面受到了广泛关注。不幸的是，仅仅使用常数离散化最小化量化损失会导致精度下降。在本文中，我们提出了一个迭代精度驱动的竞争-协作量化(CCQ)学习框架，以逐步适应每一层的比特精度。CCQ与先前的量化策略正交，在网络的第一层和最后一层以完全精确的方式工作，CCQ为任何目标量化策略提供分层竞争，并进行整体层微调以恢复精度，其中最先进的网络可以完全量化而不会出现任何显著的精度下降。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 57th ACM/IEEE Design Automation Conference (DAC)

自引率

0.00%

发文量