Joint Service Placement and Model Partitioning for Accelerating DNN Inference in Edge Intelligence Empowered Vehicle Networks

IF 7.1 2区 计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Vehicular Technology Pub Date : 2025-04-25 DOI:10.1109/TVT.2025.3564471
Wenzhao Zhang;Shujun Han;Xiaodong Xu;Ping Zhang
{"title":"Joint Service Placement and Model Partitioning for Accelerating DNN Inference in Edge Intelligence Empowered Vehicle Networks","authors":"Wenzhao Zhang;Shujun Han;Xiaodong Xu;Ping Zhang","doi":"10.1109/TVT.2025.3564471","DOIUrl":null,"url":null,"abstract":"This paper investigates the joint service placement and model partitioning problem for accelerating DNN inference in edge intelligence empowered vehicle networks, considering the vehicle mobility and the caching capability of Road Side Unit (RSU). To avoid the degradation of DNN inference performance due to the cold start during the vehicle's service node switching, we propose three different service placement solutions, including processing locally, directly selecting one RSU as server node for collaborative offloading, and selecting one RSU as relay to maintain the connection with original server node. We model the RSU network as an undirected weighted graph and formulate an optimization problem to minimize the DNN inference latency by jointly optimizing the DNN service placement and DNN model partitioning. To solve this problem, we propose a Graph Neural Network Reinforcement learning-based service Placement and model Partitioning algorithm (GRPP). Initially, we extract the feature embedding of RSU nodes and obtain the DNN service placement solution based on Graph Neural Network (GNN). Furthermore, we design a Multilayer Perceptron (MLP) to generate the DNN model partitioning point. Specifically, the parameters of GNN and MLP are learned through an actor-critic based reinforcement learning structure. Finally, we generate the simulation model via traffic simulator Simulation for Urban MObility (SUMO) and give a comprehensive performance evaluation. Simulation results indicate that our proposed scheme can decrease the DNN inference latency by up to 64.51% compared with the benchmark.","PeriodicalId":13421,"journal":{"name":"IEEE Transactions on Vehicular Technology","volume":"74 9","pages":"14645-14660"},"PeriodicalIF":7.1000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Vehicular Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10976629/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

This paper investigates the joint service placement and model partitioning problem for accelerating DNN inference in edge intelligence empowered vehicle networks, considering the vehicle mobility and the caching capability of Road Side Unit (RSU). To avoid the degradation of DNN inference performance due to the cold start during the vehicle's service node switching, we propose three different service placement solutions, including processing locally, directly selecting one RSU as server node for collaborative offloading, and selecting one RSU as relay to maintain the connection with original server node. We model the RSU network as an undirected weighted graph and formulate an optimization problem to minimize the DNN inference latency by jointly optimizing the DNN service placement and DNN model partitioning. To solve this problem, we propose a Graph Neural Network Reinforcement learning-based service Placement and model Partitioning algorithm (GRPP). Initially, we extract the feature embedding of RSU nodes and obtain the DNN service placement solution based on Graph Neural Network (GNN). Furthermore, we design a Multilayer Perceptron (MLP) to generate the DNN model partitioning point. Specifically, the parameters of GNN and MLP are learned through an actor-critic based reinforcement learning structure. Finally, we generate the simulation model via traffic simulator Simulation for Urban MObility (SUMO) and give a comprehensive performance evaluation. Simulation results indicate that our proposed scheme can decrease the DNN inference latency by up to 64.51% compared with the benchmark.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于边缘智能的车辆网络中加速DNN推理的联合服务布局和模型划分
在考虑车辆移动性和路旁单元(Road Side Unit, RSU)缓存能力的情况下,研究了基于边缘智能的车辆网络中加速DNN推理的联合服务布局和模型划分问题。为了避免车辆服务节点切换过程中冷启动对DNN推理性能的影响,我们提出了三种不同的服务放置方案,包括本地处理、直接选择一个RSU作为服务器节点进行协同卸载、选择一个RSU作为中继保持与原服务器节点的连接。我们将RSU网络建模为无向加权图,并通过联合优化DNN服务布局和DNN模型划分,提出了最小化DNN推理延迟的优化问题。为了解决这个问题,我们提出了一种基于图神经网络强化学习的服务布局和模型划分算法(GRPP)。首先,我们提取RSU节点的特征嵌入,得到基于图神经网络(GNN)的DNN服务布局方案。此外,我们设计了一个多层感知器(MLP)来生成DNN模型划分点。具体来说,GNN和MLP的参数是通过基于行为者批评的强化学习结构来学习的。最后,通过交通模拟器仿真城市交通(SUMO)生成仿真模型,并对其进行综合性能评价。仿真结果表明,与基准算法相比,我们提出的方案可将DNN推理延迟降低64.51%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.00
自引率
8.80%
发文量
1245
审稿时长
6.3 months
期刊介绍: The scope of the Transactions is threefold (which was approved by the IEEE Periodicals Committee in 1967) and is published on the journal website as follows: Communications: The use of mobile radio on land, sea, and air, including cellular radio, two-way radio, and one-way radio, with applications to dispatch and control vehicles, mobile radiotelephone, radio paging, and status monitoring and reporting. Related areas include spectrum usage, component radio equipment such as cavities and antennas, compute control for radio systems, digital modulation and transmission techniques, mobile radio circuit design, radio propagation for vehicular communications, effects of ignition noise and radio frequency interference, and consideration of the vehicle as part of the radio operating environment. Transportation Systems: The use of electronic technology for the control of ground transportation systems including, but not limited to, traffic aid systems; traffic control systems; automatic vehicle identification, location, and monitoring systems; automated transport systems, with single and multiple vehicle control; and moving walkways or people-movers. Vehicular Electronics: The use of electronic or electrical components and systems for control, propulsion, or auxiliary functions, including but not limited to, electronic controls for engineer, drive train, convenience, safety, and other vehicle systems; sensors, actuators, and microprocessors for onboard use; electronic fuel control systems; vehicle electrical components and systems collision avoidance systems; electromagnetic compatibility in the vehicle environment; and electric vehicles and controls.
期刊最新文献
Scalable Distributed User Scheduling for Low-Altitude Wideband Cell-Free RAN A Tchebycheff-Guided MARL-Based Channel Access in IEEE 802.11ah Networks Division-based Receiver-agnostic RFF Identification in WiFi Systems AdaDQ-KD: An Adaptive Dithering Quantization with Knowledge Distillation in Privacy-Preserving Federated Learning Simultaneous Self-Localization and Base Station Localization with Resonant Beam
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1