{"title":"Computation-Aware Offloading for DNN Inference Tasks in Semantic Communication Assisted MEC Systems","authors":"Guangyuan Zheng;Miaowen Wen;Zhaolong Ning;Zhiguo Ding","doi":"10.1109/TWC.2024.3523517","DOIUrl":null,"url":null,"abstract":"In this paper, we focus on computation-aware offloading for executing deep neural network (DNN) inference tasks in a mobile edge computing (MEC) system. To cope with the challenges of insufficient wireless resources during task offloading, we resort to semantic communications (SCs), through which the users can offload the compressed task data to the edge server for remote execution. Specifically, we establish the relationship between the compression ratio and computation ratio for different DNN tasks. To achieve energy-efficient offloading, we formulate an optimization problem to minimize the energy consumption of all users by jointly optimizing the compression ratio, computation allocation, uploading time, and DNN layer selection. We first consider a special case with the preconfigured time scheduling and derive closed-form solutions to computation allocation and offloading time, which yield a threshold-based structure determined by users’ channel conditions and local computation consumption. Inspired by the characteristics of these optimal solutions, a general low-complexity iterative algorithm is then designed to solve the original non-convex problem. Simulation results demonstrate that our proposed SC-based computation-aware offloading scheme can substantially reduce users’ energy consumption compared to the conventional offloading and full offloading, especially with scarce wireless resources.","PeriodicalId":13431,"journal":{"name":"IEEE Transactions on Wireless Communications","volume":"24 4","pages":"2693-2706"},"PeriodicalIF":10.7000,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Wireless Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10832517/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we focus on computation-aware offloading for executing deep neural network (DNN) inference tasks in a mobile edge computing (MEC) system. To cope with the challenges of insufficient wireless resources during task offloading, we resort to semantic communications (SCs), through which the users can offload the compressed task data to the edge server for remote execution. Specifically, we establish the relationship between the compression ratio and computation ratio for different DNN tasks. To achieve energy-efficient offloading, we formulate an optimization problem to minimize the energy consumption of all users by jointly optimizing the compression ratio, computation allocation, uploading time, and DNN layer selection. We first consider a special case with the preconfigured time scheduling and derive closed-form solutions to computation allocation and offloading time, which yield a threshold-based structure determined by users’ channel conditions and local computation consumption. Inspired by the characteristics of these optimal solutions, a general low-complexity iterative algorithm is then designed to solve the original non-convex problem. Simulation results demonstrate that our proposed SC-based computation-aware offloading scheme can substantially reduce users’ energy consumption compared to the conventional offloading and full offloading, especially with scarce wireless resources.
期刊介绍:
The IEEE Transactions on Wireless Communications is a prestigious publication that showcases cutting-edge advancements in wireless communications. It welcomes both theoretical and practical contributions in various areas. The scope of the Transactions encompasses a wide range of topics, including modulation and coding, detection and estimation, propagation and channel characterization, and diversity techniques. The journal also emphasizes the physical and link layer communication aspects of network architectures and protocols.
The journal is open to papers on specific topics or non-traditional topics related to specific application areas. This includes simulation tools and methodologies, orthogonal frequency division multiplexing, MIMO systems, and wireless over optical technologies.
Overall, the IEEE Transactions on Wireless Communications serves as a platform for high-quality manuscripts that push the boundaries of wireless communications and contribute to advancements in the field.