{"title":"基于卷积神经网络的快速图像识别加速器设计","authors":"Yu Liu, Min Xiang, Xiaoxiang Zhao, Run Zhou","doi":"10.1109/ICDSBA51020.2020.00085","DOIUrl":null,"url":null,"abstract":"Due to the limited resources of the Internet of things terminal, the speed of image recognition is difficult to meet the application requirements. A design method of a fast image recognition accelerator based on a convolutional neural network (CNN) is proposed. A pipeline processing scheme combining software and hardware is designed. The operation strategy of the parallel image block, parallel input channel, and parallel output channel is adopted. Based on this strategy, a model of terminal resources and recognition time is established. By solving the model, the optimal number of image partition blocks and convolution parallel parameters are obtained. The experimental results show that the computational performance of the proposed accelerator is improved from 8.86 GOPs to 12.26 GOPs, which effectively improves the speed of image recognition.","PeriodicalId":354742,"journal":{"name":"2020 4th Annual International Conference on Data Science and Business Analytics (ICDSBA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Design of Fast Image Recognition Accelerator Based on Convolutional Neural Network\",\"authors\":\"Yu Liu, Min Xiang, Xiaoxiang Zhao, Run Zhou\",\"doi\":\"10.1109/ICDSBA51020.2020.00085\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Due to the limited resources of the Internet of things terminal, the speed of image recognition is difficult to meet the application requirements. A design method of a fast image recognition accelerator based on a convolutional neural network (CNN) is proposed. A pipeline processing scheme combining software and hardware is designed. The operation strategy of the parallel image block, parallel input channel, and parallel output channel is adopted. Based on this strategy, a model of terminal resources and recognition time is established. By solving the model, the optimal number of image partition blocks and convolution parallel parameters are obtained. The experimental results show that the computational performance of the proposed accelerator is improved from 8.86 GOPs to 12.26 GOPs, which effectively improves the speed of image recognition.\",\"PeriodicalId\":354742,\"journal\":{\"name\":\"2020 4th Annual International Conference on Data Science and Business Analytics (ICDSBA)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 4th Annual International Conference on Data Science and Business Analytics (ICDSBA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSBA51020.2020.00085\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 4th Annual International Conference on Data Science and Business Analytics (ICDSBA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSBA51020.2020.00085","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Design of Fast Image Recognition Accelerator Based on Convolutional Neural Network
Due to the limited resources of the Internet of things terminal, the speed of image recognition is difficult to meet the application requirements. A design method of a fast image recognition accelerator based on a convolutional neural network (CNN) is proposed. A pipeline processing scheme combining software and hardware is designed. The operation strategy of the parallel image block, parallel input channel, and parallel output channel is adopted. Based on this strategy, a model of terminal resources and recognition time is established. By solving the model, the optimal number of image partition blocks and convolution parallel parameters are obtained. The experimental results show that the computational performance of the proposed accelerator is improved from 8.86 GOPs to 12.26 GOPs, which effectively improves the speed of image recognition.