{"title":"Learning the number of filters in convolutional neural networks","authors":"Jue Li, F. Cao, Honghong Cheng, Yuhua Qian","doi":"10.1504/IJBIC.2021.114101","DOIUrl":null,"url":null,"abstract":"Convolutional networks bring the performance of many computer vision tasks to unprecedented heights, but at the cost of enormous computation load. To reduce this cost, many model compression tasks have been proposed by eliminating insignificant model structures. For example, convolution filters with small absolute weights are pruned and then fine-tuned to restore reasonable accuracy. However, most of these works rely on pre-trained models without specific analysis of the changes in filters during the training process, resulting in sizable model retraining costs. Different from previous works, we interpret the change of filter behaviour during training from the associated angle, and propose a novel filter pruning method utilising the change rule, which can remove filters with similar functions later in training. According to this strategy, not only can we achieve model compression without fine-tuning, but we can also find a novel perspective to interpret the changing behaviour of the filter during training. Moreover, our approach has been proved to be effective for many advanced CNN architectures.","PeriodicalId":13636,"journal":{"name":"Int. J. Bio Inspired Comput.","volume":"86 1","pages":"75-84"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Bio Inspired Comput.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJBIC.2021.114101","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Convolutional networks have brought the performance of many computer vision tasks to unprecedented heights, but at the cost of an enormous computational load. To reduce this cost, many model compression methods have been proposed that eliminate insignificant model structures. For example, convolution filters with small absolute weights are pruned, and the network is then fine-tuned to restore reasonable accuracy. However, most of these works rely on pre-trained models and do not specifically analyse how the filters change during training, which results in sizable model retraining costs. Different from previous works, we interpret the change in filter behaviour during training through the associated angles between filters, and propose a novel filter pruning method utilising this change rule, which removes filters with similar functions late in training. With this strategy, not only can we achieve model compression without fine-tuning, but we also gain a novel perspective for interpreting the changing behaviour of filters during training. Moreover, our approach has proved effective on many advanced CNN architectures.
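To make the two selection criteria mentioned in the abstract concrete, the sketch below shows (i) the baseline magnitude criterion, which ranks filters by the L1 norm of their weights, and (ii) a generic similarity criterion, which flags filters whose weight vectors are nearly parallel as functionally redundant. This is a minimal illustrative sketch in PyTorch, not the authors' algorithm; the function names, the keep ratio, and the similarity threshold are hypothetical choices for demonstration.

```python
# Illustrative sketch only: two common filter-selection criteria for pruning.
# Not the method proposed in the paper; thresholds and layer shapes are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


def filters_by_l1_norm(conv: nn.Conv2d, keep_ratio: float = 0.5):
    """Baseline criterion: keep the filters with the largest L1 weight norm;
    small-magnitude filters are the usual pruning candidates."""
    w = conv.weight.detach()                       # (out_channels, in_channels, k, k)
    l1 = w.abs().sum(dim=(1, 2, 3))                # one score per output filter
    n_keep = max(1, int(keep_ratio * w.shape[0]))
    keep = torch.topk(l1, n_keep).indices
    return sorted(keep.tolist())


def redundant_filters_by_similarity(conv: nn.Conv2d, threshold: float = 0.95):
    """Similarity criterion: flag filters whose weight vectors point in almost
    the same direction (high cosine similarity), i.e. filters with similar functions."""
    w = conv.weight.detach().flatten(start_dim=1)  # (out_channels, in*k*k)
    w = F.normalize(w, dim=1)
    sim = w @ w.t()                                # pairwise cosine similarities
    redundant = set()
    n = sim.shape[0]
    for i in range(n):
        for j in range(i + 1, n):
            if j not in redundant and sim[i, j] > threshold:
                redundant.add(j)                   # keep filter i, drop its near-duplicate j
    return sorted(redundant)


if __name__ == "__main__":
    conv = nn.Conv2d(in_channels=16, out_channels=32, kernel_size=3)
    print("filters kept by L1 norm:", filters_by_l1_norm(conv))
    print("redundant (similar) filters:", redundant_filters_by_similarity(conv))
```

In practice, the indices returned by either criterion would be used to build a smaller convolutional layer; the paper's contribution is to derive such a redundancy signal from how the filters evolve during training, so that no fine-tuning is needed afterwards.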