FPGA Hardware Implementation of the Yolo Subclass Convolutional Neural Network Model in Computer Vision Systems

N. G. Markov, I. V. Zoev, E. Mytsko
{"title":"FPGA Hardware Implementation of the Yolo Subclass Convolutional Neural Network Model in Computer Vision Systems","authors":"N. G. Markov, I. V. Zoev, E. Mytsko","doi":"10.1109/SIBCON56144.2022.10003015","DOIUrl":null,"url":null,"abstract":"The computing unit (CU) of the computer vision system (CVS) has been developed based on a system on a chip (SOC) with the Xilinx Field-Programmable Gate Array (FPGA). A new model of a convolutional neural network (CNN) tiny-YOLO-Inception-ResNet2 related to neural networks of the YOLO subclass was proposed. A feature of this model is the presence of two Inception-ResNet modules. The weight coefficients of the trained software-implemented CNN of the same model were used. Images from the Pascal VOC 2007 dataset were used to train and test these CNN models. The research of the CU effectiveness was carried out. Firstly, we study the detection time of one image which spent in each layer of the CNN model and of the whole model depending on the number of universal computing units in the CU. Also, the study of the detecting objects accuracy depending on the bit depth (16 or 32 bits) of floating-point numbers was carried out. It is concluded that it is necessary to perform calculations using 32-bit floating-point numbers. It is shown that the power consumption of the CU did not exceed 12 Watts during all experiments.","PeriodicalId":265523,"journal":{"name":"2022 International Siberian Conference on Control and Communications (SIBCON)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Siberian Conference on Control and Communications (SIBCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIBCON56144.2022.10003015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The computing unit (CU) of the computer vision system (CVS) has been developed based on a system on a chip (SOC) with the Xilinx Field-Programmable Gate Array (FPGA). A new model of a convolutional neural network (CNN) tiny-YOLO-Inception-ResNet2 related to neural networks of the YOLO subclass was proposed. A feature of this model is the presence of two Inception-ResNet modules. The weight coefficients of the trained software-implemented CNN of the same model were used. Images from the Pascal VOC 2007 dataset were used to train and test these CNN models. The research of the CU effectiveness was carried out. Firstly, we study the detection time of one image which spent in each layer of the CNN model and of the whole model depending on the number of universal computing units in the CU. Also, the study of the detecting objects accuracy depending on the bit depth (16 or 32 bits) of floating-point numbers was carried out. It is concluded that it is necessary to perform calculations using 32-bit floating-point numbers. It is shown that the power consumption of the CU did not exceed 12 Watts during all experiments.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
计算机视觉系统中Yolo子类卷积神经网络模型的FPGA硬件实现
计算机视觉系统(CVS)的计算单元(CU)是基于带有赛灵思现场可编程门阵列(FPGA)的片上系统(SOC)开发的。提出了一种与YOLO子类神经网络相关的卷积神经网络(CNN) tiny-YOLO-Inception-ResNet2模型。该模型的一个特点是存在两个Inception-ResNet模块。采用同种模型训练后的软件实现CNN的权系数。来自Pascal VOC 2007数据集的图像被用来训练和测试这些CNN模型。对CU的有效性进行了研究。首先,我们根据CU中通用计算单元的数量,研究了CNN模型每层和整个模型中单个图像的检测时间。此外,还研究了基于浮点数位深(16位或32位)的检测对象精度。得出的结论是,有必要使用32位浮点数进行计算。结果表明,在所有实验过程中,CU的功耗均不超过12瓦。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
SCMA-LDPC coded communication system Spectral X-ray Detectors Based on Multi-Element Chromium Compensated Gallium Arsenide Sensors and Application Specific Integrated Circuits: : Keynote Paper Devoted 75th Anniversary of the Transistor Optimization of Strip Modal Filters by Random Search Method Room Frequency Response Equalization Using Modified X-Filtered Adaptive Algorithms Laboratory Stand for Study Process Dynamics and Control
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1