PRF: deep neural network compression by systematic pruning of redundant filters

C. H. Sarvani, Mrinmoy Ghorai, S. H. Shabbeer Basha

Neural Computing and Applications, published 2024-08-14. DOI: 10.1007/s00521-024-10256-5
Abstract
In deep neural networks, the filters of convolutional layers play an important role in extracting features from the input. Redundant filters often extract similar features, leading to increased computational overhead and larger model size. To address this issue, a two-step approach is proposed in this paper. First, clusters of redundant filters are identified based on the cosine distance between them using hierarchical agglomerative clustering (HAC). Next, instead of pruning all the redundant filters from every cluster in a single shot, we propose to prune the filters in a systematic manner. The importance of each cluster, and of each filter within a cluster, is measured using an \(\ell _1\)-norm based criterion. Then, according to the pruning ratio, filters are pruned systematically from the least important cluster to the most important one. The proposed method shows better results than other clustering-based works. The benchmark datasets CIFAR-10 and ImageNet are used in the experiments. After pruning 83.92% of the parameters from the VGG-16 architecture, an improvement over the baseline is observed. After pruning 54.59% and 49.33% of the FLOPs from ResNet-56 and ResNet-110, respectively, both show an improvement in accuracy. After pruning 52.97% of the FLOPs from ResNet-50, its top-5 accuracy on ImageNet drops by only 0.56%.
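The two steps described above map naturally to a short sketch. The following is a minimal, illustrative implementation for a single convolutional layer using NumPy and SciPy; the function name `prune_redundant_filters`, the average-linkage choice, the distance threshold, and the use of the summed \(\ell _1\) norm as cluster importance are assumptions made for illustration, not details confirmed by the paper.

```python
# Illustrative sketch of clustering-then-pruning for one conv layer.
# Assumes the layer's weights have shape (out_channels, in_channels, k, k).
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster
from scipy.spatial.distance import pdist


def prune_redundant_filters(weights: np.ndarray,
                            distance_threshold: float,
                            pruning_ratio: float) -> np.ndarray:
    """Return indices of filters selected for pruning (hypothetical helper)."""
    n_filters = weights.shape[0]
    flat = weights.reshape(n_filters, -1)

    # Step 1: group redundant filters by cosine distance using
    # hierarchical agglomerative clustering (HAC).
    dists = pdist(flat, metric="cosine")
    tree = linkage(dists, method="average")  # linkage method is an assumption
    labels = fcluster(tree, t=distance_threshold, criterion="distance")

    # Step 2: rank filters within each cluster, and clusters against each
    # other, with an l1-norm based importance criterion. Summing member
    # norms as the cluster score is one plausible choice, not the paper's
    # stated definition.
    l1 = np.abs(flat).sum(axis=1)  # per-filter importance
    clusters = sorted(set(labels), key=lambda c: l1[labels == c].sum())

    # Prune systematically: least important filters from the least
    # important clusters first, sparing the strongest filter in each
    # cluster so the group's shared feature stays represented.
    n_to_prune = int(pruning_ratio * n_filters)
    to_prune = []
    for c in clusters:
        members = np.where(labels == c)[0]
        ranked = members[np.argsort(l1[members])]
        to_prune.extend(ranked[:-1])  # keep the highest-l1 member
        if len(to_prune) >= n_to_prune:
            break
    return np.array(to_prune[:n_to_prune], dtype=int)
```

Sparing the strongest filter per cluster is one plausible reading of "not pruning all the redundant filters in a single shot": duplicates are removed cluster by cluster, starting from the least important cluster, while each cluster's best representative survives.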