Hardware Deployment of HBONext using NXP Bluebox 2.0

S. Joshi, M. El-Sharkawy
2021 IEEE World AI IoT Congress (AIIoT), published 2021-05-10
DOI: 10.1109/AIIoT52608.2021.9454210 (https://doi.org/10.1109/AIIoT52608.2021.9454210)
Citations: 1

Abstract

Deep learning models demand substantial computation and memory, so they are typically run on high-performance computing platforms such as CPUs or GPUs. Because of resource, energy, and real-time constraints, however, they often cannot meet the requirements of portable devices. As a result, interest is growing in real-time object recognition based on CNNs implemented on embedded systems with limited resources and energy budgets. Hardware accelerators have recently been developed to supply the computing power that AI and machine learning tools need; these edge accelerators deliver high performance while maintaining the accuracy the task requires. This paper proposes a design approach for porting CNNs to low-resource embedded systems, bridging the gap between deep learning models and embedded edge systems. We employ closer computing approaches to minimize the computational load and memory consumption on the device while maintaining strong deployment performance. HBONext is one such model, designed to be easily deployable on embedded and mobile devices. In this work, we demonstrate how to deploy a real-time HBONext image classifier on NXP BlueBox 2.0. Deployment on this hardware succeeded largely because of the model's small size of 3 MB. The model was trained and validated on the CIFAR10 data set, where it combined a small size with high accuracy.
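
To give the 3 MB figure some scale, the sketch below converts a model size into an approximate parameter count. It is illustrative arithmetic only: the function names are hypothetical, and it assumes weights are stored as 32-bit floats with no file-format overhead and that 1 MB means 2^20 bytes. None of these assumptions comes from the paper, and the result is not a reported parameter count for HBONext.

```python
# Rough relation between model size and parameter count.
# Assumptions (not from the paper): 4 bytes per weight (float32),
# no serialization overhead, 1 MB = 1024 * 1024 bytes.

BYTES_PER_MB = 1024 * 1024


def params_for_size(size_mb: float, bytes_per_param: int = 4) -> int:
    """Approximate number of parameters that fit in size_mb megabytes."""
    return int(size_mb * BYTES_PER_MB) // bytes_per_param


def size_mb_for_params(num_params: int, bytes_per_param: int = 4) -> float:
    """Approximate storage in megabytes for num_params parameters."""
    return num_params * bytes_per_param / BYTES_PER_MB


if __name__ == "__main__":
    # A 3 MB budget at float32 versus a hypothetical 8-bit quantized format.
    print(params_for_size(3.0))
    print(params_for_size(3.0, bytes_per_param=1))
```

Under these assumptions, a 3 MB budget holds a little under 0.8 million float32 weights, which is why a model of this size is a plausible fit for memory-constrained embedded targets.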