Sparse Bitmap Compression for Memory-Efficient Training on the Edge

2021 IEEE/ACM Symposium on Edge Computing (SEC) Pub Date : 2021-10-29 DOI:10.1145/3453142.3491290

Abdelrahman Hosny, Marina Neseem, S. Reda

{"title":"Sparse Bitmap Compression for Memory-Efficient Training on the Edge","authors":"Abdelrahman Hosny, Marina Neseem, S. Reda","doi":"10.1145/3453142.3491290","DOIUrl":null,"url":null,"abstract":"Training on the Edge enables neural networks to learn continu-ously from new data after deployment on memory-constrained edge devices. Previous work is mostly concerned with reducing the number of model parameters which is only beneficial for in-ference. However, memory footprint from activations is the main bottleneck for training on the edge. Existing incremental training methods fine-tune the last few layers sacrificing accuracy gains from re-training the whole model. In this work, we investigate the memory footprint of training deep learning models, and use our observations to propose BitTrain. In BitTrain, we exploit activation sparsity and propose a novel bitmap compression technique that reduces the memory footprint during training. We save the activations in our proposed bitmap compression format during the forward pass of the training, and restore them during the backward pass for the optimizer computations. The proposed method can be integrated seamlessly in the computation graph of modern deep learning frameworks. Our implementation is safe by construction, and has no negative impact on the accuracy of model training. Experimental results show up to 34% reduction in the memory footprint at a sparsity level of 50%. Further pruning during training results in more than 70% sparsity, which can lead to up to 56% re-duction in memory footprint. BitTrain advances the efforts towards bringing more machine learning capabilities to edge devices. Our source code is available at https://github.com/scale-lab/BitTrain.","PeriodicalId":6779,"journal":{"name":"2021 IEEE/ACM Symposium on Edge Computing (SEC)","volume":"14 1","pages":"14-25"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/ACM Symposium on Edge Computing (SEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3453142.3491290","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Training on the Edge enables neural networks to learn continu-ously from new data after deployment on memory-constrained edge devices. Previous work is mostly concerned with reducing the number of model parameters which is only beneficial for in-ference. However, memory footprint from activations is the main bottleneck for training on the edge. Existing incremental training methods fine-tune the last few layers sacrificing accuracy gains from re-training the whole model. In this work, we investigate the memory footprint of training deep learning models, and use our observations to propose BitTrain. In BitTrain, we exploit activation sparsity and propose a novel bitmap compression technique that reduces the memory footprint during training. We save the activations in our proposed bitmap compression format during the forward pass of the training, and restore them during the backward pass for the optimizer computations. The proposed method can be integrated seamlessly in the computation graph of modern deep learning frameworks. Our implementation is safe by construction, and has no negative impact on the accuracy of model training. Experimental results show up to 34% reduction in the memory footprint at a sparsity level of 50%. Further pruning during training results in more than 70% sparsity, which can lead to up to 56% re-duction in memory footprint. BitTrain advances the efforts towards bringing more machine learning capabilities to edge devices. Our source code is available at https://github.com/scale-lab/BitTrain.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

稀疏位图压缩在边缘上的记忆效率训练

边缘上的训练使神经网络能够在部署到内存受限的边缘设备上后，从新数据中持续学习。以往的工作主要集中在减少模型参数的数量，这只有利于推理。然而，激活的内存占用是边缘训练的主要瓶颈。现有的增量训练方法对最后几层进行微调，牺牲了重新训练整个模型所带来的精度增益。在这项工作中，我们研究了训练深度学习模型的内存占用，并利用我们的观察结果提出了BitTrain。在BitTrain中，我们利用激活稀疏性，提出了一种新的位图压缩技术，减少了训练过程中的内存占用。在训练的前向传递期间，我们将激活保存在我们建议的位图压缩格式中，并在优化器计算的后向传递期间恢复它们。该方法可以无缝集成到现代深度学习框架的计算图中。我们的实现是安全的，并且对模型训练的准确性没有负面影响。实验结果表明，在50%的稀疏度水平下，内存占用最多减少34%。在训练期间进一步修剪会导致超过70%的稀疏性，这可能导致内存占用减少多达56%。BitTrain致力于为边缘设备带来更多的机器学习功能。我们的源代码可从https://github.com/scale-lab/BitTrain获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2021 IEEE/ACM Symposium on Edge Computing (SEC)

自引率

0.00%

发文量