{"title":"Neural architecture search for resource constrained hardware devices: A survey","authors":"Yongjia Yang, Jinyu Zhan, Wei Jiang, Yucheng Jiang, Antai Yu","doi":"10.1049/cps2.12058","DOIUrl":null,"url":null,"abstract":"<p>With the emergence of powerful and low-energy Internet of Things devices, deep learning computing is increasingly applied to resource-constrained edge devices. However, the mismatch between hardware devices with low computing capacity and the increasing complexity of Deep Neural Network models, as well as the growing real-time requirements, bring challenges to the design and deployment of deep learning models. For example, autonomous driving technologies rely on real-time object detection of the environment, which cannot tolerate the extra latency of sending data to the cloud, processing and then sending the results back to edge devices. Many studies aim to find innovative ways to reduce the size of deep learning models, the number of Floating-point Operations per Second, and the time overhead of inference. Neural Architecture Search (NAS) makes it possible to automatically generate efficient neural network models. The authors summarise the existing NAS methods on resource-constrained devices and categorise them according to single-objective or multi-objective optimisation. We review the search space, the search algorithm and the constraints of NAS on hardware devices. We also explore the challenges and open problems of hardware NAS.</p>","PeriodicalId":36881,"journal":{"name":"IET Cyber-Physical Systems: Theory and Applications","volume":"8 3","pages":"149-159"},"PeriodicalIF":1.7000,"publicationDate":"2023-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cps2.12058","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Cyber-Physical Systems: Theory and Applications","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cps2.12058","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Abstract
With the emergence of powerful and low-energy Internet of Things devices, deep learning computing is increasingly applied to resource-constrained edge devices. However, the mismatch between hardware devices with low computing capacity and the increasing complexity of Deep Neural Network models, together with growing real-time requirements, brings challenges to the design and deployment of deep learning models. For example, autonomous driving technologies rely on real-time object detection of the environment and cannot tolerate the extra latency of sending data to the cloud, processing it there, and then sending the results back to edge devices. Many studies aim to find innovative ways to reduce the size of deep learning models, the number of floating-point operations (FLOPs), and the time overhead of inference. Neural Architecture Search (NAS) makes it possible to automatically generate efficient neural network models. The authors summarise the existing NAS methods on resource-constrained devices and categorise them according to single-objective or multi-objective optimisation. They review the search space, the search algorithms, and the hardware constraints of NAS, and they also explore the challenges and open problems of hardware-aware NAS.
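To make the idea of hardware-constrained NAS concrete, the following is a minimal illustrative sketch, not the authors' method: a random search over a toy search space that maximises a proxy accuracy while rejecting candidates whose estimated latency exceeds a device budget. The search space, the FLOPs/latency/accuracy estimators, and the budget value are all assumptions made for illustration only.

```python
import random

# Toy search space: each candidate architecture is a choice of depth, width, and kernel size.
SEARCH_SPACE = {
    "depth": [4, 8, 12],
    "width": [16, 32, 64],
    "kernel": [3, 5, 7],
}

def sample_architecture(rng):
    """Sample one candidate architecture from the search space."""
    return {k: rng.choice(v) for k, v in SEARCH_SPACE.items()}

def estimated_flops(arch):
    """Hypothetical proxy for the number of floating-point operations."""
    return arch["depth"] * arch["width"] ** 2 * arch["kernel"] ** 2

def estimated_latency_ms(arch):
    """Hypothetical latency model of the target edge device (assumed, not measured)."""
    return estimated_flops(arch) / 5e4

def estimated_accuracy(arch, rng):
    """Stand-in for the expensive train-and-validate step: larger models
    tend to score higher, with some noise."""
    capacity = arch["depth"] * arch["width"]
    return min(0.95, 0.5 + 0.0004 * capacity + rng.uniform(-0.02, 0.02))

def search(num_trials=200, latency_budget_ms=20.0, seed=0):
    """Single-objective search (maximise accuracy) under a hard latency constraint."""
    rng = random.Random(seed)
    best = None
    for _ in range(num_trials):
        arch = sample_architecture(rng)
        if estimated_latency_ms(arch) > latency_budget_ms:
            continue  # reject candidates that violate the hardware constraint
        acc = estimated_accuracy(arch, rng)
        if best is None or acc > best[0]:
            best = (acc, arch)
    return best

if __name__ == "__main__":
    acc, arch = search()
    print(f"best architecture {arch} with estimated accuracy {acc:.3f}")
```

A multi-objective variant would instead keep a Pareto front over (accuracy, latency, model size) rather than discarding infeasible candidates outright; the survey's categorisation distinguishes these two formulations.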