{"title":"Optimization of CNN model for image classification","authors":"Meriam Dhouibi, A. K. Ben Salem, S. Ben Saoud","doi":"10.1109/DTS52014.2021.9497988","DOIUrl":null,"url":null,"abstract":"Convolutional Neural Networks (CNNs) are particularly precise in several fields, especially computer vision where image classification is one of the most researched and commercialized application. Deploying these models on embedded devices requires high throughput and low latency even with limited resources and energy budgets. The complexity of the architecture of CNN models implies a very high computation cost. We are looking in this paper for determining the optimal topology (the number of layers and the number of neurons per layer) that allows us to reduce the model and deploy it in embedded platforms. We have proposed a small CNN architecture that achieves high level accuracy 81.50% on CIFAR-10 with fewer parameters based on the growing approach. For more optimization we used the pruning technique and the results showed that with more optimal architecture we obtained 82.43% accuracy and 15% reduction of the number of parameters.","PeriodicalId":158426,"journal":{"name":"2021 IEEE International Conference on Design & Test of Integrated Micro & Nano-Systems (DTS)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Design & Test of Integrated Micro & Nano-Systems (DTS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DTS52014.2021.9497988","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Convolutional Neural Networks (CNNs) are particularly precise in several fields, especially computer vision where image classification is one of the most researched and commercialized application. Deploying these models on embedded devices requires high throughput and low latency even with limited resources and energy budgets. The complexity of the architecture of CNN models implies a very high computation cost. We are looking in this paper for determining the optimal topology (the number of layers and the number of neurons per layer) that allows us to reduce the model and deploy it in embedded platforms. We have proposed a small CNN architecture that achieves high level accuracy 81.50% on CIFAR-10 with fewer parameters based on the growing approach. For more optimization we used the pruning technique and the results showed that with more optimal architecture we obtained 82.43% accuracy and 15% reduction of the number of parameters.