A One-Shot Reparameterization Method for Reducing the Loss of Tile Pruning on DNNs

2022 International Joint Conference on Neural Networks (IJCNN) Pub Date : 2022-07-18 DOI:10.1109/IJCNN55064.2022.9889789

Yancheng Li, Qingzhong Ai, Fumihiko Ino

{"title":"A One-Shot Reparameterization Method for Reducing the Loss of Tile Pruning on DNNs","authors":"Yancheng Li, Qingzhong Ai, Fumihiko Ino","doi":"10.1109/IJCNN55064.2022.9889789","DOIUrl":null,"url":null,"abstract":"Recently, tile pruning has been widely studied to accelerate the inference of deep neural networks (DNNs). However, we found that the loss due to tile pruning, which can eliminate important elements together with unimportant elements, is large on trained DNNs. In this study, we propose a one-shot reparameterization method, called TileTrans, to reduce the loss of tile pruning. Specifically, we repermute the rows or columns of the weight matrix such that the model architecture can be kept unchanged after reparameterization. This repermutation realizes the reparameterization of the DNN model without any retraining. The proposed reparameterization method combines important elements into the same tile; thus, preserving the important elements after the tile pruning. Furthermore, TileTrans can be seamlessly integrated into existing tile pruning methods because it is a pre-processing method executed before pruning, which is orthogonal to most existing methods. The experimental results demonstrate that our method is essential in reducing the loss of tile pruning on DNNs. Specifically, the accuracy is improved by up to 17% for AlexNet while 5% for ResNet-34, where both models are pre-trained on ImageNet.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Joint Conference on Neural Networks (IJCNN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJCNN55064.2022.9889789","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Recently, tile pruning has been widely studied to accelerate the inference of deep neural networks (DNNs). However, we found that the loss due to tile pruning, which can eliminate important elements together with unimportant elements, is large on trained DNNs. In this study, we propose a one-shot reparameterization method, called TileTrans, to reduce the loss of tile pruning. Specifically, we repermute the rows or columns of the weight matrix such that the model architecture can be kept unchanged after reparameterization. This repermutation realizes the reparameterization of the DNN model without any retraining. The proposed reparameterization method combines important elements into the same tile; thus, preserving the important elements after the tile pruning. Furthermore, TileTrans can be seamlessly integrated into existing tile pruning methods because it is a pre-processing method executed before pruning, which is orthogonal to most existing methods. The experimental results demonstrate that our method is essential in reducing the loss of tile pruning on DNNs. Specifically, the accuracy is improved by up to 17% for AlexNet while 5% for ResNet-34, where both models are pre-trained on ImageNet.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一种减少dnn剪枝损失的单次重参数化方法

近年来，为了加速深度神经网络(dnn)的推理，瓦片修剪得到了广泛的研究。然而，我们发现，在训练好的dnn上，由于瓷砖修剪可以去除重要元素和不重要元素而造成的损失很大。在这项研究中，我们提出了一种一次性重新参数化方法，称为TileTrans，以减少瓷砖修剪的损失。具体来说，我们重新调整权重矩阵的行或列，使模型架构在重新参数化后保持不变。这种再突变在不进行再训练的情况下实现了DNN模型的再参数化。提出的重新参数化方法将重要元素合并到同一块图中;因此，在瓷砖修剪后保留了重要的元素。此外，TileTrans可以无缝集成到现有的瓷砖修剪方法中，因为它是在修剪之前执行的预处理方法，与大多数现有方法正交。实验结果表明，我们的方法对于减少dnn上的剪枝损失是必不可少的。具体来说，AlexNet的准确率提高了17%，而ResNet-34的准确率提高了5%，这两个模型都是在ImageNet上进行预训练的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 International Joint Conference on Neural Networks (IJCNN)

自引率

0.00%

发文量