从高级深度学习框架自动生成单个镜头检测器C库

Luca Ranalli, L. D. Stefano, Emanuele Plebani, M. Falchetto, D. Pau, Viviana D'Alto
{"title":"从高级深度学习框架自动生成单个镜头检测器C库","authors":"Luca Ranalli, L. D. Stefano, Emanuele Plebani, M. Falchetto, D. Pau, Viviana D'Alto","doi":"10.1109/RTSI.2018.8548427","DOIUrl":null,"url":null,"abstract":"In designing accurate and high-performance artificial neural networks (ANN) topologies for embedded systems, several weeks of engineering work are required to properly condition the input data, implement architectures optimized both in term of memory and operations using any of the off-the-shelf deep learning frameworks and then test the models in proper-scale data driven experiments. Moreover, code implementing the layers needs to be mapped and validated on the target systems, requiring additional months of hard and hand-made engineering work. To shorten this inefficient and un-productive development procedure, an efficient and automated C library generation workflow has been created. This work presents the phases involved in the automated mapping of a Single Shot Object Detector (SSD) model on an embedded library, where low power consumption is the focus and memory usage needs to be properly handled and minimized when possible. The implementation aspects, dealing with the mapping of high-level functions and dynamic data structures into low-level logical equivalents in ANSI C are presented; in addition, a brief explanation of the validation process as well as a short summary on the link between memory usage and the implementations details of the detector are also provided.","PeriodicalId":363896,"journal":{"name":"2018 IEEE 4th International Forum on Research and Technology for Society and Industry (RTSI)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated Generation of a Single Shot Detector C Library from High Level Deep Learning Frameworks\",\"authors\":\"Luca Ranalli, L. D. Stefano, Emanuele Plebani, M. Falchetto, D. Pau, Viviana D'Alto\",\"doi\":\"10.1109/RTSI.2018.8548427\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In designing accurate and high-performance artificial neural networks (ANN) topologies for embedded systems, several weeks of engineering work are required to properly condition the input data, implement architectures optimized both in term of memory and operations using any of the off-the-shelf deep learning frameworks and then test the models in proper-scale data driven experiments. Moreover, code implementing the layers needs to be mapped and validated on the target systems, requiring additional months of hard and hand-made engineering work. To shorten this inefficient and un-productive development procedure, an efficient and automated C library generation workflow has been created. This work presents the phases involved in the automated mapping of a Single Shot Object Detector (SSD) model on an embedded library, where low power consumption is the focus and memory usage needs to be properly handled and minimized when possible. The implementation aspects, dealing with the mapping of high-level functions and dynamic data structures into low-level logical equivalents in ANSI C are presented; in addition, a brief explanation of the validation process as well as a short summary on the link between memory usage and the implementations details of the detector are also provided.\",\"PeriodicalId\":363896,\"journal\":{\"name\":\"2018 IEEE 4th International Forum on Research and Technology for Society and Industry (RTSI)\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 4th International Forum on Research and Technology for Society and Industry (RTSI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RTSI.2018.8548427\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 4th International Forum on Research and Technology for Society and Industry (RTSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RTSI.2018.8548427","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在为嵌入式系统设计精确和高性能的人工神经网络(ANN)拓扑时,需要数周的工程工作来适当地调整输入数据,使用任何现成的深度学习框架在内存和操作方面实现优化的架构,然后在适当规模的数据驱动实验中测试模型。此外,实现这些层的代码需要在目标系统上进行映射和验证,这需要额外数月的艰苦手工工程工作。为了缩短这个效率低下且没有生产力的开发过程,我们创建了一个高效且自动化的C库生成工作流。本工作介绍了在嵌入式库上自动映射单镜头对象检测器(SSD)模型所涉及的各个阶段,其中低功耗是重点,并且需要在可能的情况下适当处理和最小化内存使用。在实现方面,将高级函数和动态数据结构映射到ANSI C中的低级逻辑等价;此外,还提供了验证过程的简要说明,以及内存使用与检测器的实现细节之间的联系的简短总结。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Automated Generation of a Single Shot Detector C Library from High Level Deep Learning Frameworks
In designing accurate and high-performance artificial neural networks (ANN) topologies for embedded systems, several weeks of engineering work are required to properly condition the input data, implement architectures optimized both in term of memory and operations using any of the off-the-shelf deep learning frameworks and then test the models in proper-scale data driven experiments. Moreover, code implementing the layers needs to be mapped and validated on the target systems, requiring additional months of hard and hand-made engineering work. To shorten this inefficient and un-productive development procedure, an efficient and automated C library generation workflow has been created. This work presents the phases involved in the automated mapping of a Single Shot Object Detector (SSD) model on an embedded library, where low power consumption is the focus and memory usage needs to be properly handled and minimized when possible. The implementation aspects, dealing with the mapping of high-level functions and dynamic data structures into low-level logical equivalents in ANSI C are presented; in addition, a brief explanation of the validation process as well as a short summary on the link between memory usage and the implementations details of the detector are also provided.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Data-Driven Approaches to Predict States in a Food Technology Case Study Automating Lung Cancer Identification in PET/CT Imaging Spectral Repeatability of a Hyperspectral System for Human Iris Imaging Hybrid Observer for Indoor Localization with Random Time-of-Arrival Measurments A LiDAR Prototype with Silicon Photomultiplier and MEMS Mirrors
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1