轻量级卷积神经网络YOLO v3- Tiny算法在FPGA上的目标检测

Jitong Xin, Meiyi Cha, Luojia Shi, Chunyu Long, Hairong Li, Fangcong Wang, Peng Wang
{"title":"轻量级卷积神经网络YOLO v3- Tiny算法在FPGA上的目标检测","authors":"Jitong Xin, Meiyi Cha, Luojia Shi, Chunyu Long, Hairong Li, Fangcong Wang, Peng Wang","doi":"10.1109/CSAIEE54046.2021.9543128","DOIUrl":null,"url":null,"abstract":"With the rapid development of artificial intelligence, the convolution of the neural network has been deploying across a range of industries, and has certain applications in national defense security, transportation monitoring and medical research. However, due to the constraints of speed and the power consumption, convolutional neural networks are still limited to a great extent in the implementation of the edge computing and mobile devices, etc. For this reason, we design a lightweight convolutional neural network based on FPGA. In this paper, we use the YOLOv3- Tiny algorithm, which is fast in execution, small in computation and small in size, is suitable for deployments on embedded devices such as FPGA. This paper uses 16-bit fixed-point quantization, special data storage and hardware circuit for convolutional computation written in verilog - hardware description language, consumes 512 DSPs, consumes 37.037 ms to recognize a single image frame, and consumes 9.611W. The system basically achieves the design goal of target detection.","PeriodicalId":376014,"journal":{"name":"2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Lightweight convolutional neural network of YOLO v3- Tiny algorithm on FPGA for target detection\",\"authors\":\"Jitong Xin, Meiyi Cha, Luojia Shi, Chunyu Long, Hairong Li, Fangcong Wang, Peng Wang\",\"doi\":\"10.1109/CSAIEE54046.2021.9543128\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid development of artificial intelligence, the convolution of the neural network has been deploying across a range of industries, and has certain applications in national defense security, transportation monitoring and medical research. However, due to the constraints of speed and the power consumption, convolutional neural networks are still limited to a great extent in the implementation of the edge computing and mobile devices, etc. For this reason, we design a lightweight convolutional neural network based on FPGA. In this paper, we use the YOLOv3- Tiny algorithm, which is fast in execution, small in computation and small in size, is suitable for deployments on embedded devices such as FPGA. This paper uses 16-bit fixed-point quantization, special data storage and hardware circuit for convolutional computation written in verilog - hardware description language, consumes 512 DSPs, consumes 37.037 ms to recognize a single image frame, and consumes 9.611W. The system basically achieves the design goal of target detection.\",\"PeriodicalId\":376014,\"journal\":{\"name\":\"2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSAIEE54046.2021.9543128\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSAIEE54046.2021.9543128","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

随着人工智能的快速发展,神经网络的卷积已经在多个行业得到部署,在国防安全、交通监控、医学研究等方面都有一定的应用。然而,由于速度和功耗的限制,卷积神经网络在边缘计算和移动设备等方面的实现仍然受到很大程度的限制。为此,我们设计了一个基于FPGA的轻量级卷积神经网络。本文采用YOLOv3- Tiny算法,该算法具有执行速度快、计算量小、体积小等特点,适合部署在FPGA等嵌入式设备上。本文采用16位定点量化、专用数据存储和用verilog -硬件描述语言编写的卷积计算硬件电路,消耗512个dsp,单帧图像识别耗时37.037 ms,功耗9.611W。该系统基本实现了目标检测的设计目标。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Lightweight convolutional neural network of YOLO v3- Tiny algorithm on FPGA for target detection
With the rapid development of artificial intelligence, the convolution of the neural network has been deploying across a range of industries, and has certain applications in national defense security, transportation monitoring and medical research. However, due to the constraints of speed and the power consumption, convolutional neural networks are still limited to a great extent in the implementation of the edge computing and mobile devices, etc. For this reason, we design a lightweight convolutional neural network based on FPGA. In this paper, we use the YOLOv3- Tiny algorithm, which is fast in execution, small in computation and small in size, is suitable for deployments on embedded devices such as FPGA. This paper uses 16-bit fixed-point quantization, special data storage and hardware circuit for convolutional computation written in verilog - hardware description language, consumes 512 DSPs, consumes 37.037 ms to recognize a single image frame, and consumes 9.611W. The system basically achieves the design goal of target detection.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Res-Attention Net: An Image Dehazing Network Teacher-Student Network for Low-quality Remote Sensing Ship Detection Optimization of GNSS Signals Acquisition Algorithm Complexity Using Comb Decimation Filter Basic Ensemble Learning of Encoder Representations from Transformer for Disaster-mentioning Tweets Classification Measuring Hilbert-Schmidt Independence Criterion with Different Kernels
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1