海市蜃楼:基于机器学习的Jetson AGX嵌入式平台相同复制品建模

Hassan Halawa, Hazem A. Abdelhafez, M. O. Ahmed, K. Pattabiraman, M. Ripeanu
{"title":"海市蜃楼:基于机器学习的Jetson AGX嵌入式平台相同复制品建模","authors":"Hassan Halawa, Hazem A. Abdelhafez, M. O. Ahmed, K. Pattabiraman, M. Ripeanu","doi":"10.1145/3453142.3491284","DOIUrl":null,"url":null,"abstract":"A common feature of devices deployed at the edge today is their configurability. The NVIDIA Jetson AGX, for example, has a user-configurable frequency range larger than one order of magnitude for the CPU, the GPU, and the memory controller. Key to make effective use of this configurability is the ability to anticipate the application-level impact of a frequency configuration choice. To this end, this paper presents a novel modeling approach for predicting the runtime and power consumption for convolutional neural net-works (CNNs). This modeling approach is: (i) effective - i.e., makes predictions with low error (models achieve an average relative error of 15.4% for runtime and 14.9% for energy); (ii) efficient - i.e., has a low cost to make predictions; (iii) generic - i.e., supports deploying updated and possibly different deep learning inference models without the need for retraining, and (iv) practical - i.e., requires a low training cost. Three features, all geared towards meeting the challenges of deploying in a real-world environment, set this work apart: (i) the focus on predicting the impact of the frequency configuration choice, (ii) the methodological choice to aggregate predictions at fine (i.e., kernel level) granularity which provides generality; and (iii) taking into account the inter-node variability among nominally identical devices.","PeriodicalId":6779,"journal":{"name":"2021 IEEE/ACM Symposium on Edge Computing (SEC)","volume":"1 1","pages":"26-40"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"MIRAGE: Machine Learning-based Modeling of Identical Replicas of the Jetson AGX Embedded Platform\",\"authors\":\"Hassan Halawa, Hazem A. Abdelhafez, M. O. Ahmed, K. Pattabiraman, M. Ripeanu\",\"doi\":\"10.1145/3453142.3491284\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A common feature of devices deployed at the edge today is their configurability. The NVIDIA Jetson AGX, for example, has a user-configurable frequency range larger than one order of magnitude for the CPU, the GPU, and the memory controller. Key to make effective use of this configurability is the ability to anticipate the application-level impact of a frequency configuration choice. To this end, this paper presents a novel modeling approach for predicting the runtime and power consumption for convolutional neural net-works (CNNs). This modeling approach is: (i) effective - i.e., makes predictions with low error (models achieve an average relative error of 15.4% for runtime and 14.9% for energy); (ii) efficient - i.e., has a low cost to make predictions; (iii) generic - i.e., supports deploying updated and possibly different deep learning inference models without the need for retraining, and (iv) practical - i.e., requires a low training cost. Three features, all geared towards meeting the challenges of deploying in a real-world environment, set this work apart: (i) the focus on predicting the impact of the frequency configuration choice, (ii) the methodological choice to aggregate predictions at fine (i.e., kernel level) granularity which provides generality; and (iii) taking into account the inter-node variability among nominally identical devices.\",\"PeriodicalId\":6779,\"journal\":{\"name\":\"2021 IEEE/ACM Symposium on Edge Computing (SEC)\",\"volume\":\"1 1\",\"pages\":\"26-40\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE/ACM Symposium on Edge Computing (SEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3453142.3491284\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/ACM Symposium on Edge Computing (SEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3453142.3491284","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

如今部署在边缘的设备的一个共同特征是它们的可配置性。例如,NVIDIA Jetson AGX的用户可配置频率范围大于CPU、GPU和内存控制器的一个数量级。有效利用这种可配置性的关键是能够预测频率配置选择对应用程序级的影响。为此,本文提出了一种预测卷积神经网络(cnn)运行时间和功耗的新颖建模方法。这种建模方法是:(i)有效的——即以低误差进行预测(模型在运行时间和能源方面的平均相对误差分别为15.4%和14.9%);(ii)高效——即进行预测的成本低;(iii)通用性——即支持部署更新的和可能不同的深度学习推理模型,而不需要再训练;(iv)实用性——即需要较低的训练成本。三个特点,都是为了应对在现实环境中部署的挑战,使这项工作与众不同:(i)专注于预测频率配置选择的影响,(ii)在精细(即内核级别)粒度上聚合预测的方法选择,提供了通用性;(iii)考虑到名义上相同的设备之间的节点间可变性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
MIRAGE: Machine Learning-based Modeling of Identical Replicas of the Jetson AGX Embedded Platform
A common feature of devices deployed at the edge today is their configurability. The NVIDIA Jetson AGX, for example, has a user-configurable frequency range larger than one order of magnitude for the CPU, the GPU, and the memory controller. Key to make effective use of this configurability is the ability to anticipate the application-level impact of a frequency configuration choice. To this end, this paper presents a novel modeling approach for predicting the runtime and power consumption for convolutional neural net-works (CNNs). This modeling approach is: (i) effective - i.e., makes predictions with low error (models achieve an average relative error of 15.4% for runtime and 14.9% for energy); (ii) efficient - i.e., has a low cost to make predictions; (iii) generic - i.e., supports deploying updated and possibly different deep learning inference models without the need for retraining, and (iv) practical - i.e., requires a low training cost. Three features, all geared towards meeting the challenges of deploying in a real-world environment, set this work apart: (i) the focus on predicting the impact of the frequency configuration choice, (ii) the methodological choice to aggregate predictions at fine (i.e., kernel level) granularity which provides generality; and (iii) taking into account the inter-node variability among nominally identical devices.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Data-Driven Optimal Control Decision-Making System for Multiple Autonomous Vehicles The Performance Argument for Blockchain-based Edge DNS Caching LotteryFL: Empower Edge Intelligence with Personalized and Communication-Efficient Federated Learning Collaborative Cloud-Edge-Local Computation Offloading for Multi-Component Applications Poster: Enabling Flexible Edge-assisted XR
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1