{"title":"加速卷积神经网络的研究","authors":"Hsien-I Lin, Chung-Sheng Cheng","doi":"10.1063/1.5138068","DOIUrl":null,"url":null,"abstract":"Recent deep-learning methods have been paid more attention than shallow-learning ones because they have deep and complex structures to approximate functions. The salient feature of deep neural networks is to use many layers where many of them are used to extract data features and few are for classification or regression. The most severe problem of a deep neural network is using too many parameters that cause too much memory usage and computing resources for both training and inference. Thus, deep learning approaches are not suitable for real-time industrial applications that have limited computing resources such as memory and CPU. For example, a famous convolutional neural network (CNN), AlexNet, uses up to 60 million parameters to train ImageNet dataset and many imaging projects apply AlexNet to their own applications as transfer learning. Thus, this work proposes a feasible solution to trim the CNN, speed it up, and keep the accuracy rate similar. Two main types of CNNs and AlexNet, were validated, respectively, in THUR15K, Caltech-101, Caltech-256, and GHIM10k datasets. The results show that the parameter amount greatly decreased (76%) but the recognition rate dropped slightly (1.34%).Recent deep-learning methods have been paid more attention than shallow-learning ones because they have deep and complex structures to approximate functions. The salient feature of deep neural networks is to use many layers where many of them are used to extract data features and few are for classification or regression. The most severe problem of a deep neural network is using too many parameters that cause too much memory usage and computing resources for both training and inference. Thus, deep learning approaches are not suitable for real-time industrial applications that have limited computing resources such as memory and CPU. For example, a famous convolutional neural network (CNN), AlexNet, uses up to 60 million parameters to train ImageNet dataset and many imaging projects apply AlexNet to their own applications as transfer learning. Thus, this work proposes a feasible solution to trim the CNN, speed it up, and keep the accuracy rate similar. Two main types of CNNs and AlexNet, were validated, resp...","PeriodicalId":20565,"journal":{"name":"PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2019 (ICCMSE-2019)","volume":"7 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A study on accelerating convolutional neural networks\",\"authors\":\"Hsien-I Lin, Chung-Sheng Cheng\",\"doi\":\"10.1063/1.5138068\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent deep-learning methods have been paid more attention than shallow-learning ones because they have deep and complex structures to approximate functions. The salient feature of deep neural networks is to use many layers where many of them are used to extract data features and few are for classification or regression. The most severe problem of a deep neural network is using too many parameters that cause too much memory usage and computing resources for both training and inference. 
Thus, deep learning approaches are not suitable for real-time industrial applications that have limited computing resources such as memory and CPU. For example, a famous convolutional neural network (CNN), AlexNet, uses up to 60 million parameters to train ImageNet dataset and many imaging projects apply AlexNet to their own applications as transfer learning. Thus, this work proposes a feasible solution to trim the CNN, speed it up, and keep the accuracy rate similar. Two main types of CNNs and AlexNet, were validated, respectively, in THUR15K, Caltech-101, Caltech-256, and GHIM10k datasets. The results show that the parameter amount greatly decreased (76%) but the recognition rate dropped slightly (1.34%).Recent deep-learning methods have been paid more attention than shallow-learning ones because they have deep and complex structures to approximate functions. The salient feature of deep neural networks is to use many layers where many of them are used to extract data features and few are for classification or regression. The most severe problem of a deep neural network is using too many parameters that cause too much memory usage and computing resources for both training and inference. Thus, deep learning approaches are not suitable for real-time industrial applications that have limited computing resources such as memory and CPU. For example, a famous convolutional neural network (CNN), AlexNet, uses up to 60 million parameters to train ImageNet dataset and many imaging projects apply AlexNet to their own applications as transfer learning. Thus, this work proposes a feasible solution to trim the CNN, speed it up, and keep the accuracy rate similar. Two main types of CNNs and AlexNet, were validated, resp...\",\"PeriodicalId\":20565,\"journal\":{\"name\":\"PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2019 (ICCMSE-2019)\",\"volume\":\"7 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2019 (ICCMSE-2019)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1063/1.5138068\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2019 (ICCMSE-2019)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1063/1.5138068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A study on accelerating convolutional neural networks
Recent deep-learning methods have received more attention than shallow-learning ones because their deep, complex structures can approximate functions well. The salient feature of deep neural networks is their use of many layers, most of which extract data features while only a few perform classification or regression. The most severe problem with a deep neural network is that it uses too many parameters, which consumes excessive memory and computing resources during both training and inference. Thus, deep-learning approaches are not suitable for real-time industrial applications with limited computing resources such as memory and CPU. For example, AlexNet, a famous convolutional neural network (CNN), uses up to 60 million parameters to train on the ImageNet dataset, and many imaging projects apply AlexNet to their own applications through transfer learning. This work therefore proposes a feasible solution to trim a CNN, speed it up, and keep its accuracy nearly unchanged. Two main types of CNN, including AlexNet, were validated on the THUR15K, Caltech-101, Caltech-256, and GHIM10k datasets. The results show that the number of parameters decreased greatly (by 76%) while the recognition rate dropped only slightly (by 1.34%).
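The abstract does not say which trimming technique the authors use, so the sketch below should not be read as their method. It shows one common way to "trim" a CNN along these lines: magnitude-based weight pruning in PyTorch, applied to AlexNet, with the 76% figure from the abstract borrowed as an illustrative sparsity target. The L1 pruning criterion and the layer selection are assumptions.

```python
# A minimal sketch of magnitude-based weight pruning, assuming PyTorch and
# torchvision are available. The pruning criterion (L1 magnitude) and the
# 76% sparsity target are illustrative assumptions, not the paper's method.
import torch
import torch.nn.utils.prune as prune
import torchvision.models as models

model = models.alexnet(weights=None)  # AlexNet: roughly 60M parameters

# Prune 76% of the weights in every convolutional and fully connected
# layer, zeroing out those with the smallest L1 magnitude.
for module in model.modules():
    if isinstance(module, (torch.nn.Conv2d, torch.nn.Linear)):
        prune.l1_unstructured(module, name="weight", amount=0.76)
        prune.remove(module, "weight")  # make the pruning permanent

# Count the surviving (non-zero) parameters to verify the reduction.
total = sum(p.numel() for p in model.parameters())
nonzero = sum((p != 0).sum().item() for p in model.parameters())
print(f"kept {nonzero}/{total} parameters ({nonzero / total:.1%})")
```

In practice, pruning of this kind is usually followed by fine-tuning on the target dataset to recover most of the lost accuracy, which is consistent with the abstract's report of only a 1.34% drop in recognition rate.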