Analysis of Layer Efficiency and Layer Reduction on Pre-Trained Deep Learning Models

B. T. Nugraha, S. Su
{"title":"Analysis of Layer Efficiency and Layer Reduction on Pre-Trained Deep Learning Models","authors":"B. T. Nugraha, S. Su","doi":"10.1109/ICSSE.2018.8520080","DOIUrl":null,"url":null,"abstract":"Recent technologies in the deep learning area enable many industries and practitioners fastening the development processes of their products. However, deep learning still encounters some potential issues like overfitting and huge size. The huge size greatly constrains performance and portability of the deep learning model in embedded devices with limited environments. Due to the paradigm of it mixed with the meaning of “deep” layers, many researchers tend to derive the pre-trained model into building deeper layers to solve their problems without knowing whether they are actually needed or not. To address these issues, we exploit the activation and gradient output and weight in each layer of the pre-trained models to measure its efficiencies. By exploiting them, we estimate the efficiencies using our measurements and compare it with the manual layer reduction to validate the most relevant method. We also use the method for continuous layer reductions for validation. With this approach, we save up to 12x and 26x of the time of one manual layer reduction and re-training on VGG-16 and custom AlexNet respectively.","PeriodicalId":431387,"journal":{"name":"2018 International Conference on System Science and Engineering (ICSSE)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on System Science and Engineering (ICSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSSE.2018.8520080","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Recent technologies in the deep learning area enable many industries and practitioners fastening the development processes of their products. However, deep learning still encounters some potential issues like overfitting and huge size. The huge size greatly constrains performance and portability of the deep learning model in embedded devices with limited environments. Due to the paradigm of it mixed with the meaning of “deep” layers, many researchers tend to derive the pre-trained model into building deeper layers to solve their problems without knowing whether they are actually needed or not. To address these issues, we exploit the activation and gradient output and weight in each layer of the pre-trained models to measure its efficiencies. By exploiting them, we estimate the efficiencies using our measurements and compare it with the manual layer reduction to validate the most relevant method. We also use the method for continuous layer reductions for validation. With this approach, we save up to 12x and 26x of the time of one manual layer reduction and re-training on VGG-16 and custom AlexNet respectively.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
预训练深度学习模型的层效率与层约简分析
深度学习领域的最新技术使许多行业和从业者能够加快其产品的开发过程。然而,深度学习仍然会遇到一些潜在的问题,如过拟合和庞大的规模。巨大的尺寸极大地限制了深度学习模型在有限环境下的嵌入式设备中的性能和可移植性。由于它的范式与“深层”层的含义混合在一起,许多研究人员倾向于在不知道是否实际需要的情况下,将预训练的模型导出到构建更深层来解决他们的问题。为了解决这些问题,我们利用预训练模型的每一层的激活和梯度输出和权重来衡量其效率。通过利用它们,我们使用我们的测量来估计效率,并将其与手动层减少进行比较,以验证最相关的方法。我们还使用连续层减少的方法进行验证。通过这种方法,我们分别在VGG-16和自定义AlexNet上节省了12倍和26倍的手动层减少和重新训练时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Fuzzy Risk Assessment Strategy Based on Big Data for Multinational Financial Markets Evaluation of Indoor Positioning Based on iBeacon and Pi-Beacon A Mechanism for Adjustable-Delay-Buffer Selection to Dynamically Control Clock Skew A Mixed Reality System to Improve Walking Experience Intelligent Mobile Robot Controller Design for Hotel Room Service with Deep Learning Arm-Based Elevator Manipulator
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1