Joint Service Placement and Model Partitioning for Accelerating DNN Inference in Edge Intelligence Empowered Vehicle Networks

IF 7.1 2区计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Vehicular Technology Pub Date : 2025-04-25 DOI:10.1109/TVT.2025.3564471

Wenzhao Zhang;Shujun Han;Xiaodong Xu;Ping Zhang

{"title":"Joint Service Placement and Model Partitioning for Accelerating DNN Inference in Edge Intelligence Empowered Vehicle Networks","authors":"Wenzhao Zhang;Shujun Han;Xiaodong Xu;Ping Zhang","doi":"10.1109/TVT.2025.3564471","DOIUrl":null,"url":null,"abstract":"This paper investigates the joint service placement and model partitioning problem for accelerating DNN inference in edge intelligence empowered vehicle networks, considering the vehicle mobility and the caching capability of Road Side Unit (RSU). To avoid the degradation of DNN inference performance due to the cold start during the vehicle's service node switching, we propose three different service placement solutions, including processing locally, directly selecting one RSU as server node for collaborative offloading, and selecting one RSU as relay to maintain the connection with original server node. We model the RSU network as an undirected weighted graph and formulate an optimization problem to minimize the DNN inference latency by jointly optimizing the DNN service placement and DNN model partitioning. To solve this problem, we propose a Graph Neural Network Reinforcement learning-based service Placement and model Partitioning algorithm (GRPP). Initially, we extract the feature embedding of RSU nodes and obtain the DNN service placement solution based on Graph Neural Network (GNN). Furthermore, we design a Multilayer Perceptron (MLP) to generate the DNN model partitioning point. Specifically, the parameters of GNN and MLP are learned through an actor-critic based reinforcement learning structure. Finally, we generate the simulation model via traffic simulator Simulation for Urban MObility (SUMO) and give a comprehensive performance evaluation. Simulation results indicate that our proposed scheme can decrease the DNN inference latency by up to 64.51% compared with the benchmark.","PeriodicalId":13421,"journal":{"name":"IEEE Transactions on Vehicular Technology","volume":"74 9","pages":"14645-14660"},"PeriodicalIF":7.1000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Vehicular Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10976629/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

This paper investigates the joint service placement and model partitioning problem for accelerating DNN inference in edge intelligence empowered vehicle networks, considering the vehicle mobility and the caching capability of Road Side Unit (RSU). To avoid the degradation of DNN inference performance due to the cold start during the vehicle's service node switching, we propose three different service placement solutions, including processing locally, directly selecting one RSU as server node for collaborative offloading, and selecting one RSU as relay to maintain the connection with original server node. We model the RSU network as an undirected weighted graph and formulate an optimization problem to minimize the DNN inference latency by jointly optimizing the DNN service placement and DNN model partitioning. To solve this problem, we propose a Graph Neural Network Reinforcement learning-based service Placement and model Partitioning algorithm (GRPP). Initially, we extract the feature embedding of RSU nodes and obtain the DNN service placement solution based on Graph Neural Network (GNN). Furthermore, we design a Multilayer Perceptron (MLP) to generate the DNN model partitioning point. Specifically, the parameters of GNN and MLP are learned through an actor-critic based reinforcement learning structure. Finally, we generate the simulation model via traffic simulator Simulation for Urban MObility (SUMO) and give a comprehensive performance evaluation. Simulation results indicate that our proposed scheme can decrease the DNN inference latency by up to 64.51% compared with the benchmark.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于边缘智能的车辆网络中加速DNN推理的联合服务布局和模型划分

在考虑车辆移动性和路旁单元（Road Side Unit， RSU）缓存能力的情况下，研究了基于边缘智能的车辆网络中加速DNN推理的联合服务布局和模型划分问题。为了避免车辆服务节点切换过程中冷启动对DNN推理性能的影响，我们提出了三种不同的服务放置方案，包括本地处理、直接选择一个RSU作为服务器节点进行协同卸载、选择一个RSU作为中继保持与原服务器节点的连接。我们将RSU网络建模为无向加权图，并通过联合优化DNN服务布局和DNN模型划分，提出了最小化DNN推理延迟的优化问题。为了解决这个问题，我们提出了一种基于图神经网络强化学习的服务布局和模型划分算法（GRPP）。首先，我们提取RSU节点的特征嵌入，得到基于图神经网络（GNN）的DNN服务布局方案。此外，我们设计了一个多层感知器（MLP）来生成DNN模型划分点。具体来说，GNN和MLP的参数是通过基于行为者批评的强化学习结构来学习的。最后，通过交通模拟器仿真城市交通（SUMO）生成仿真模型，并对其进行综合性能评价。仿真结果表明，与基准算法相比，我们提出的方案可将DNN推理延迟降低64.51%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Vehicular Technology 工程技术-电信学

CiteScore

6.00

自引率

8.80%

发文量

1245

审稿时长

6.3 months

期刊介绍： The scope of the Transactions is threefold (which was approved by the IEEE Periodicals Committee in 1967) and is published on the journal website as follows: Communications: The use of mobile radio on land, sea, and air, including cellular radio, two-way radio, and one-way radio, with applications to dispatch and control vehicles, mobile radiotelephone, radio paging, and status monitoring and reporting. Related areas include spectrum usage, component radio equipment such as cavities and antennas, compute control for radio systems, digital modulation and transmission techniques, mobile radio circuit design, radio propagation for vehicular communications, effects of ignition noise and radio frequency interference, and consideration of the vehicle as part of the radio operating environment. Transportation Systems: The use of electronic technology for the control of ground transportation systems including, but not limited to, traffic aid systems; traffic control systems; automatic vehicle identification, location, and monitoring systems; automated transport systems, with single and multiple vehicle control; and moving walkways or people-movers. Vehicular Electronics: The use of electronic or electrical components and systems for control, propulsion, or auxiliary functions, including but not limited to, electronic controls for engineer, drive train, convenience, safety, and other vehicle systems; sensors, actuators, and microprocessors for onboard use; electronic fuel control systems; vehicle electrical components and systems collision avoidance systems; electromagnetic compatibility in the vehicle environment; and electric vehicles and controls.