Dual-Grained Lightweight Strategy
Debin Liu; Xiang Bai; Ruonan Zhao; Xianjun Deng; Laurence T. Yang
IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 46, no. 12, pp. 10228-10245. Published online: 2024-10-10.
DOI: 10.1109/TPAMI.2024.3437421
URL: https://ieeexplore.ieee.org/document/10713325/
Abstract
Removing redundant parameters and computations before model training has attracted great interest, since it reduces the model's storage footprint, speeds up training and inference, and lowers energy consumption at run time. Simplifying deep neural networks also allows high-performance models to be deployed on resource-constrained edge devices, thereby promoting the development of the intelligent world. However, current pruning-at-initialization methods perform poorly at extreme sparsity. To improve model performance under extreme sparsity, this paper proposes a dual-grained lightweight strategy, TEDEPR, the first pruning-at-initialization method to use tensor theory to optimize the structure of a sparse sub-network and improve its performance. Specifically, at the coarse-grained level, we represent each weight matrix or weight tensor of the model in a low-rank tensor-decomposition form and use multi-step chain operations to strengthen the feature-extraction capability of the base module, yielding a low-rank compact network. At the fine-grained level, before training begins, unimportant weights are pruned according to their trainability in the low-rank model, producing the final compressed model. To evaluate TEDEPR, we conducted extensive experiments on the MNIST, UCF11, CIFAR-10, CIFAR-100, Tiny-ImageNet, and ImageNet datasets with LeNet, LSTM, VGGNet, ResNet, and Transformer architectures, comparing against state-of-the-art methods. The experimental results show that, under extreme sparsity, TEDEPR achieves higher accuracy, faster training and inference, and a smaller storage footprint than other pruning-at-initialization methods.
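The paper's code is not reproduced on this page, so the following is only a minimal PyTorch sketch of the two-stage pipeline the abstract describes: a low-rank factorized layer standing in for the coarse-grained tensor-decomposition step, and a SNIP-style saliency score |w · ∂L/∂w| standing in for the fine-grained trainability criterion. The class name `LowRankLinear`, the chosen ranks, and the saliency proxy are illustrative assumptions, not TEDEPR's actual construction.

```python
import torch
import torch.nn as nn

class LowRankLinear(nn.Module):
    """Coarse-grained stage (sketch): replace a dense weight W (out x in)
    with a rank-r factorization W ~= U @ V, cutting the parameter count
    from out*in to r*(out + in). Name and ranks are hypothetical."""
    def __init__(self, in_features: int, out_features: int, rank: int):
        super().__init__()
        self.U = nn.Parameter(torch.randn(out_features, rank) * 0.1)
        self.V = nn.Parameter(torch.randn(rank, in_features) * 0.1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # (B, in) @ (in, r) @ (r, out) -> (B, out)
        return x @ self.V.t() @ self.U.t()

def prune_at_init(model, loss_fn, batch, sparsity: float):
    """Fine-grained stage (sketch): score every weight on one batch with
    the SNIP-style proxy |w * dL/dw|, then zero out the lowest-scoring
    fraction before training starts. The paper's actual trainability
    criterion may differ from this proxy."""
    x, y = batch
    loss = loss_fn(model(x), y)
    params = list(model.parameters())
    grads = torch.autograd.grad(loss, params)
    scores = torch.cat([(p * g).abs().flatten()
                        for p, g in zip(params, grads)])
    k = max(1, int(sparsity * scores.numel()))
    threshold = scores.kthvalue(k).values  # k-th smallest score
    masks = []
    with torch.no_grad():
        for p, g in zip(params, grads):
            mask = ((p * g).abs() > threshold).float()
            p.mul_(mask)        # zero the pruned weights in place
            masks.append(mask)  # keep masks to reapply during training
    return masks

# Hypothetical usage: build a low-rank model, then prune 95% of its weights.
model = nn.Sequential(nn.Flatten(), LowRankLinear(784, 256, rank=16),
                      nn.ReLU(), LowRankLinear(256, 10, rank=8))
batch = (torch.randn(32, 1, 28, 28), torch.randint(0, 10, (32,)))
masks = prune_at_init(model, nn.CrossEntropyLoss(), batch, sparsity=0.95)
```

In a full pruning-at-initialization pipeline the returned masks would be re-applied after every optimizer step so that pruned weights stay at zero throughout training.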