Koustabh Dolui, Sam Michiels, D. Hughes, H. Hallez
Title: Context Aware Adaptive ML Inference in Mobile-Cloud Applications
Published in: 2022 IEEE 23rd International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM), June 2022
DOI: 10.1109/WoWMoM54355.2022.00031 (https://doi.org/10.1109/WoWMoM54355.2022.00031)
Citations: 1
Abstract
As mobile devices gain sufficient resources to execute real-time ML inference, opportunities arise to deploy inference on-device, keeping privacy-sensitive data close to its source and reducing server load. Conversely, offloading inference to a cloud server facilitates the deployment of neural network-based applications on resource-constrained devices. Depending on the application's goals and execution context, the optimal deployment, on either the cloud server or the mobile device, varies during the lifetime of an application. In this paper, we propose a context-aware middleware that optimizes deployed application software to satisfy the application's functional goals under changing execution context and environmental conditions. We facilitate system design by abstracting deployed software components as states, and use finite state machines and contextual triggers to model the reconfiguration of the system. We evaluate our framework on a real-world nutritional monitoring application based on food image recognition, deployed in a two-tier mobile and cloud architecture. We compare the proposed solution with various static deployments of the application and show that our approach can react to changing application goals at run-time to reduce server load and thereby increase scalability.
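The abstract describes modeling deployed software components as states of a finite state machine, with contextual triggers driving reconfiguration between mobile and cloud deployments. As an illustration only, a minimal Python sketch of that idea might look like the following; the state names, trigger names, and transition table are assumptions for exposition, not details taken from the paper:

```python
# Minimal sketch of FSM-based deployment reconfiguration.
# State names, triggers, and transitions below are illustrative
# assumptions, not taken from the paper.

ON_DEVICE = "on_device"  # ML inference runs on the mobile device
ON_CLOUD = "on_cloud"    # inference is offloaded to the cloud server

# Transition table: (current state, contextual trigger) -> next state
TRANSITIONS = {
    (ON_DEVICE, "battery_low"): ON_CLOUD,
    (ON_CLOUD, "server_overloaded"): ON_DEVICE,
    (ON_CLOUD, "privacy_sensitive_input"): ON_DEVICE,
}


class DeploymentFSM:
    """Finite state machine over deployment configurations."""

    def __init__(self, initial=ON_DEVICE):
        self.state = initial

    def on_context_change(self, trigger):
        # Apply a matching transition; otherwise keep the current state.
        self.state = TRANSITIONS.get((self.state, trigger), self.state)
        return self.state


fsm = DeploymentFSM()
fsm.on_context_change("battery_low")        # moves inference to the cloud
fsm.on_context_change("server_overloaded")  # moves inference back on-device
```

In this sketch, each state stands for a concrete deployment of the inference component, and context monitoring (battery level, server load, input sensitivity) emits triggers that the FSM maps to reconfiguration actions, mirroring the middleware's run-time adaptation described above.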