Dynamic Semantic Compression for CNN Inference in Multi-Access Edge Computing: A Graph Reinforcement Learning-Based Autoencoder

IF 10.7 1区 计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Wireless Communications Pub Date : 2024-12-23 DOI:10.1109/TWC.2024.3518399
Nan Li;Alexandros Iosifidis;Qi Zhang
{"title":"Dynamic Semantic Compression for CNN Inference in Multi-Access Edge Computing: A Graph Reinforcement Learning-Based Autoencoder","authors":"Nan Li;Alexandros Iosifidis;Qi Zhang","doi":"10.1109/TWC.2024.3518399","DOIUrl":null,"url":null,"abstract":"This paper studies the computational offloading of CNN inference in dynamic multi-access edge computing (MEC) networks. To address the uncertainties in communication time and edge servers’ available capacity, we propose a novel semantic compression method, autoencoder-based CNN architecture (AECNN), for effective semantic extraction and compression in partial offloading. In the semantic encoder, we introduce a feature compression module based on the channel attention mechanism in CNNs, to compress intermediate data by selecting the most informative features. Additionally, to further reduce communication overhead, we leverage entropy encoding to remove the statistical redundancy in the compressed data. In the semantic decoder, we design a lightweight decoder to reconstruct the intermediate data through learning from the received compressed data to improve accuracy. To effectively trade-off communication, computation, and inference accuracy, we design a reward function and formulate the offloading problem of CNN inference as a maximization problem with the goal of maximizing the average inference accuracy and throughput over the long term. To address this maximization problem, we propose a graph reinforcement learning-based AECNN (GRL-AECNN) method, which outperforms existing works DROO-AECNN, GRL-BottleNet++ and GRL-DeepJSCC under different dynamic scenarios. This highlights the advantages of GRL-AECNN in offloading decision-making for CNN inference tasks in dynamic MEC.","PeriodicalId":13431,"journal":{"name":"IEEE Transactions on Wireless Communications","volume":"24 3","pages":"2157-2172"},"PeriodicalIF":10.7000,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Wireless Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10812766/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

This paper studies the computational offloading of CNN inference in dynamic multi-access edge computing (MEC) networks. To address the uncertainties in communication time and edge servers’ available capacity, we propose a novel semantic compression method, autoencoder-based CNN architecture (AECNN), for effective semantic extraction and compression in partial offloading. In the semantic encoder, we introduce a feature compression module based on the channel attention mechanism in CNNs, to compress intermediate data by selecting the most informative features. Additionally, to further reduce communication overhead, we leverage entropy encoding to remove the statistical redundancy in the compressed data. In the semantic decoder, we design a lightweight decoder to reconstruct the intermediate data through learning from the received compressed data to improve accuracy. To effectively trade-off communication, computation, and inference accuracy, we design a reward function and formulate the offloading problem of CNN inference as a maximization problem with the goal of maximizing the average inference accuracy and throughput over the long term. To address this maximization problem, we propose a graph reinforcement learning-based AECNN (GRL-AECNN) method, which outperforms existing works DROO-AECNN, GRL-BottleNet++ and GRL-DeepJSCC under different dynamic scenarios. This highlights the advantages of GRL-AECNN in offloading decision-making for CNN inference tasks in dynamic MEC.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多访问边缘计算中CNN推理的动态语义压缩:基于图强化学习的自编码器
研究了动态多址边缘计算(MEC)网络中CNN推理的计算卸载问题。为了解决通信时间和边缘服务器可用容量的不确定性,我们提出了一种新的语义压缩方法——基于自编码器的CNN架构(AECNN),用于部分卸载的有效语义提取和压缩。在语义编码器中,我们引入了一个基于cnn频道注意机制的特征压缩模块,通过选择信息量最大的特征来压缩中间数据。此外,为了进一步减少通信开销,我们利用熵编码来消除压缩数据中的统计冗余。在语义解码器中,我们设计了一个轻量级的解码器,通过学习接收到的压缩数据来重构中间数据,以提高解码器的精度。为了有效地权衡通信、计算和推理精度,我们设计了一个奖励函数,并将CNN推理的卸载问题制定为最大化问题,其目标是在长期内最大化平均推理精度和吞吐量。为了解决这一最大化问题,我们提出了一种基于图强化学习的AECNN (GRL-AECNN)方法,该方法在不同的动态场景下优于现有的DROO-AECNN、GRL-BottleNet++和GRL-DeepJSCC。这突出了GRL-AECNN在动态MEC中为CNN推理任务卸载决策的优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
18.60
自引率
10.60%
发文量
708
审稿时长
5.6 months
期刊介绍: The IEEE Transactions on Wireless Communications is a prestigious publication that showcases cutting-edge advancements in wireless communications. It welcomes both theoretical and practical contributions in various areas. The scope of the Transactions encompasses a wide range of topics, including modulation and coding, detection and estimation, propagation and channel characterization, and diversity techniques. The journal also emphasizes the physical and link layer communication aspects of network architectures and protocols. The journal is open to papers on specific topics or non-traditional topics related to specific application areas. This includes simulation tools and methodologies, orthogonal frequency division multiplexing, MIMO systems, and wireless over optical technologies. Overall, the IEEE Transactions on Wireless Communications serves as a platform for high-quality manuscripts that push the boundaries of wireless communications and contribute to advancements in the field.
期刊最新文献
Downlink Sum-Rate Maximization of 5G Networks With Metasurface-Based Reconfigurable Antennas Tensor-Based Channel Estimation for Terahertz Ultra-Massive MIMO Systems Learning-Aided Iterative Receiver for Superimposed Pilots: Design and Experimental Evaluation MA-enhanced Mixed Near-field and Far-field Covert Communications Hybrid Near/Far-Field Frequency-Dependent Beamforming via Phase-Time Arrays With Single RF Chain
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1