Davide Callegaro, Yoshitomo Matsubara, M. Levorato
{"title":"Optimal Task Allocation for Time-Varying Edge Computing Systems with Split DNNs","authors":"Davide Callegaro, Yoshitomo Matsubara, M. Levorato","doi":"10.1109/GLOBECOM42002.2020.9322344","DOIUrl":null,"url":null,"abstract":"Many modern applications rely on complex machine learning algorithms, such as Deep Neural Networks (DNNs), to analyze images. However, both mobile and edge computing strategies may fail to provide satisfactory performance in some parameter regions. To mitigate this issue, the research community recently proposed methods to split the execution of DNNs to optimize the balance between computing load allocation and channel usage. Building on this set of results, this paper presents an optimization framework that enables the dynamic control of how images are processed in mobile device-edge server systems. The system is modeled as a Markov process, and a Linear Fractional Program is defined to identify the optimal stationary state-action distribution minimizing the overall average inference time under a constraint on the number of discarded images. Results indicate the advantage of using a dynamic control strategy with respect to available fixed strategies.","PeriodicalId":12759,"journal":{"name":"GLOBECOM 2020 - 2020 IEEE Global Communications Conference","volume":"83 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"GLOBECOM 2020 - 2020 IEEE Global Communications Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GLOBECOM42002.2020.9322344","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Many modern applications rely on complex machine learning algorithms, such as Deep Neural Networks (DNNs), to analyze images. However, both mobile and edge computing strategies may fail to provide satisfactory performance in some parameter regions. To mitigate this issue, the research community recently proposed methods to split the execution of DNNs to optimize the balance between computing load allocation and channel usage. Building on this set of results, this paper presents an optimization framework that enables the dynamic control of how images are processed in mobile device-edge server systems. The system is modeled as a Markov process, and a Linear Fractional Program is defined to identify the optimal stationary state-action distribution minimizing the overall average inference time under a constraint on the number of discarded images. Results indicate the advantage of using a dynamic control strategy with respect to available fixed strategies.