Approximate inference systems (AxIS): end-to-end approximations for energy-efficient inference at the edge

Proceedings of the ACM/IEEE International Symposium on Low Power Electronics and Design Pub Date : 2020-08-10 DOI:10.1145/3370748.3406575

Soumendu Kumar Ghosh, Arnab Raha, V. Raghunathan

{"title":"Approximate inference systems (AxIS): end-to-end approximations for energy-efficient inference at the edge","authors":"Soumendu Kumar Ghosh, Arnab Raha, V. Raghunathan","doi":"10.1145/3370748.3406575","DOIUrl":null,"url":null,"abstract":"The rapid proliferation of the Internet-of-Things (IoT) and the dramatic resurgence of artificial intelligence (AI) based application workloads has led to immense interest in performing inference on energy-constrained edge devices. Approximate computing (a design paradigm that yields large energy savings at the cost of a small degradation in application quality) is a promising technique to enable energy-efficient inference at the edge. This paper introduces the concept of an approximate inference system (AxIS) and proposes a systematic methodology to perform joint approximations across different subsystems in a deep neural network-based inference system, leading to significant energy benefits compared to approximating individual subsystems in isolation. We use a smart camera system that executes various convolutional neural network (CNN) based image recognition applications to illustrate how the sensor, memory, compute, and communication subsystems can all be approximated synergistically. We demonstrate our proposed methodology using two variants of a smart camera system: (a) Camedge, where the CNN executes locally on the edge device, and (b) Camcloud, where the edge device sends the captured image to a remote cloud server that executes the CNN. We have prototyped such an approximate inference system using an Altera Stratix IV GX-based Terasic TR4-230 FPGA development board. Experimental results obtained using six CNNs demonstrate significant energy savings (around 1.7× for Camedge and 3.5× for Camcloud) for minimal (< 1%) loss in application quality. Compared to approximating a single subsystem in isolation, AxIS achieves additional energy benefits of 1.6×--1.7× (Camedge) and 1.4×--3.4× (Camcloud) on average for minimal application-level quality loss.","PeriodicalId":116486,"journal":{"name":"Proceedings of the ACM/IEEE International Symposium on Low Power Electronics and Design","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ACM/IEEE International Symposium on Low Power Electronics and Design","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3370748.3406575","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

The rapid proliferation of the Internet-of-Things (IoT) and the dramatic resurgence of artificial intelligence (AI) based application workloads has led to immense interest in performing inference on energy-constrained edge devices. Approximate computing (a design paradigm that yields large energy savings at the cost of a small degradation in application quality) is a promising technique to enable energy-efficient inference at the edge. This paper introduces the concept of an approximate inference system (AxIS) and proposes a systematic methodology to perform joint approximations across different subsystems in a deep neural network-based inference system, leading to significant energy benefits compared to approximating individual subsystems in isolation. We use a smart camera system that executes various convolutional neural network (CNN) based image recognition applications to illustrate how the sensor, memory, compute, and communication subsystems can all be approximated synergistically. We demonstrate our proposed methodology using two variants of a smart camera system: (a) Camedge, where the CNN executes locally on the edge device, and (b) Camcloud, where the edge device sends the captured image to a remote cloud server that executes the CNN. We have prototyped such an approximate inference system using an Altera Stratix IV GX-based Terasic TR4-230 FPGA development board. Experimental results obtained using six CNNs demonstrate significant energy savings (around 1.7× for Camedge and 3.5× for Camcloud) for minimal (< 1%) loss in application quality. Compared to approximating a single subsystem in isolation, AxIS achieves additional energy benefits of 1.6×--1.7× (Camedge) and 1.4×--3.4× (Camcloud) on average for minimal application-level quality loss.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

近似推理系统(AxIS):在边缘进行节能推理的端到端近似

物联网(IoT)的快速扩散和基于人工智能(AI)的应用工作负载的急剧复苏，导致人们对在能量受限的边缘设备上执行推理产生了极大的兴趣。近似计算(一种以应用程序质量的小下降为代价产生大量节能的设计范例)是一种很有前途的技术，可以在边缘实现节能推断。本文介绍了近似推理系统(AxIS)的概念，并提出了一种系统的方法，在基于深度神经网络的推理系统中跨不同子系统执行联合逼近，与孤立地逼近单个子系统相比，可以带来显着的能量效益。我们使用一个智能相机系统，该系统执行各种基于卷积神经网络(CNN)的图像识别应用程序，以说明传感器，内存，计算和通信子系统如何都可以近似协同。我们使用智能相机系统的两种变体来演示我们提出的方法:(a) Camedge，其中CNN在边缘设备上本地执行，以及(b) Camcloud，其中边缘设备将捕获的图像发送到执行CNN的远程云服务器。我们使用基于Altera Stratix IV gx的Terasic TR4-230 FPGA开发板制作了这样一个近似推理系统的原型。使用六个cnn获得的实验结果表明，在应用质量损失最小(< 1%)的情况下，显著节能(Camedge约1.7倍，Camcloud约3.5倍)。与孤立的近似单个子系统相比，AxIS实现了1.6×—1.7× (Camedge)和1.4×—3.4× (Camcloud)的额外能量效益，平均可实现最小的应用级质量损失。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the ACM/IEEE International Symposium on Low Power Electronics and Design

自引率

0.00%

发文量