DeCo-MeSC: Deep Compression-Based Memory-Constrained Split Computing Framework for Cooperative Inference of Neural Network

IF 7.1 2区 计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Vehicular Technology Pub Date : 2025-03-31 DOI:10.1109/TVT.2025.3556459
Mingyu Sung;Vikas Palakonda;Il-Min Kim;Sangseok Yun;Jae-Mo Kang
{"title":"DeCo-MeSC: Deep Compression-Based Memory-Constrained Split Computing Framework for Cooperative Inference of Neural Network","authors":"Mingyu Sung;Vikas Palakonda;Il-Min Kim;Sangseok Yun;Jae-Mo Kang","doi":"10.1109/TVT.2025.3556459","DOIUrl":null,"url":null,"abstract":"Split computing (SC) of a deep neural network (DNN) across end nodes is a key enabling technology to realize energy-efficient and low-latency cooperative inference in wireless networks and Internet-of-Things (IoT). In this paper, we propose a novel SC framework based on the concept of deep compression (DC) of DNN considering the strictly limited memory footprint of a mobile device, namely, <italic>DeCo-MeSC</i>. In our proposed DeCo-MeSC framework, an initial part of a target DNN (up to so-called the split layer) for the mobile device is compressed with the DC technique to satisfy the memory constraint, while the remaining part of the target DNN (after the split layer) for a cloud server is uncompressed, yet fine-tuned to compensate for the performance loss due to the compression of the initial part. Furthermore, the jointly optimal pair of the split layer and data rate is determined efficiently to maximize the end-to-end inference accuracy under both the end-to-end inference latency and memory constraints. Extensive experimental results demonstrate that the proposed scheme performs markedly better than the existing schemes.","PeriodicalId":13421,"journal":{"name":"IEEE Transactions on Vehicular Technology","volume":"74 8","pages":"13319-13324"},"PeriodicalIF":7.1000,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Vehicular Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10946233/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Split computing (SC) of a deep neural network (DNN) across end nodes is a key enabling technology to realize energy-efficient and low-latency cooperative inference in wireless networks and Internet-of-Things (IoT). In this paper, we propose a novel SC framework based on the concept of deep compression (DC) of DNN considering the strictly limited memory footprint of a mobile device, namely, DeCo-MeSC. In our proposed DeCo-MeSC framework, an initial part of a target DNN (up to so-called the split layer) for the mobile device is compressed with the DC technique to satisfy the memory constraint, while the remaining part of the target DNN (after the split layer) for a cloud server is uncompressed, yet fine-tuned to compensate for the performance loss due to the compression of the initial part. Furthermore, the jointly optimal pair of the split layer and data rate is determined efficiently to maximize the end-to-end inference accuracy under both the end-to-end inference latency and memory constraints. Extensive experimental results demonstrate that the proposed scheme performs markedly better than the existing schemes.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
DeCo-MeSC:基于深度压缩的神经网络协同推理内存约束分割计算框架
跨端节点的深度神经网络(DNN)分离计算(SC)是实现无线网络和物联网(IoT)中节能、低延迟协同推理的关键使能技术。在本文中,我们提出了一种基于深度神经网络深度压缩(DC)概念的新型SC框架,考虑到移动设备严格限制的内存占用,即DeCo-MeSC。在我们提出的DeCo-MeSC框架中,用于移动设备的目标DNN的初始部分(直到所谓的分割层)使用DC技术进行压缩以满足内存约束,而用于云服务器的目标DNN的剩余部分(在分割层之后)未被压缩,但经过微调以补偿由于初始部分压缩而导致的性能损失。此外,在端到端推理延迟和内存约束下,有效地确定了分割层和数据速率的联合最优对,以最大限度地提高端到端推理精度。大量的实验结果表明,该方案的性能明显优于现有方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.00
自引率
8.80%
发文量
1245
审稿时长
6.3 months
期刊介绍: The scope of the Transactions is threefold (which was approved by the IEEE Periodicals Committee in 1967) and is published on the journal website as follows: Communications: The use of mobile radio on land, sea, and air, including cellular radio, two-way radio, and one-way radio, with applications to dispatch and control vehicles, mobile radiotelephone, radio paging, and status monitoring and reporting. Related areas include spectrum usage, component radio equipment such as cavities and antennas, compute control for radio systems, digital modulation and transmission techniques, mobile radio circuit design, radio propagation for vehicular communications, effects of ignition noise and radio frequency interference, and consideration of the vehicle as part of the radio operating environment. Transportation Systems: The use of electronic technology for the control of ground transportation systems including, but not limited to, traffic aid systems; traffic control systems; automatic vehicle identification, location, and monitoring systems; automated transport systems, with single and multiple vehicle control; and moving walkways or people-movers. Vehicular Electronics: The use of electronic or electrical components and systems for control, propulsion, or auxiliary functions, including but not limited to, electronic controls for engineer, drive train, convenience, safety, and other vehicle systems; sensors, actuators, and microprocessors for onboard use; electronic fuel control systems; vehicle electrical components and systems collision avoidance systems; electromagnetic compatibility in the vehicle environment; and electric vehicles and controls.
期刊最新文献
Channel Covariance CSI-based Indoor Localization: Machine Leaning versus Neighbourhood Geometric Approaches Integrated Cost-optimal Convex Optimization for Eco-Driving of Connected Hybrid Electric Vehicles on Sloped and Curved Roads Age of Information in Time-Slotted Wireless Status Updating System Under Generic Updating Interval: Statistical Characteristics and Optimal Updating Interval Energy Efficiency Optimization for RIS-Assisted Task Offloading in Vehicular MEC Networks Safety-Enhanced Trajectory Planning for Autonomous Vehicles: Optimization Based on Dynamic Safety Corridors
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1