Optimizing task offloading in MIMO-enabled vehicular networks through deep reinforcement learning

IF 5.8 2区 计算机科学 Q1 TELECOMMUNICATIONS Vehicular Communications Pub Date : 2025-02-25 DOI:10.1016/j.vehcom.2025.100901
Jian Xu, Shengchao Su
{"title":"Optimizing task offloading in MIMO-enabled vehicular networks through deep reinforcement learning","authors":"Jian Xu,&nbsp;Shengchao Su","doi":"10.1016/j.vehcom.2025.100901","DOIUrl":null,"url":null,"abstract":"<div><div>Mobile Edge Computing (MEC) effectively alleviates the computational burden faced by vehicles in processing compute-intensive tasks due to resource limitations. However, traditional approaches typically employ coarse-grained task offloading strategies that utilize sequential protocols and discrete action spaces, resulting in high latency and increased energy consumption. These limitations render such strategies unsuitable for real-time applications. To address these challenges, an innovative computation offloading strategy is proposed, specifically designed to minimize the long-term average computation cost in a multi-vehicle, multi-server Internet of Vehicles (IoV) system. The MEC system model is constructed using Multiple-Input Multiple-Output (MIMO) technology, which facilitates simultaneous uplink transmissions from all vehicles, significantly reducing the time required for data uploads. Subsequently, a continuous action space is adopted to enhance both the flexibility and precision of decision-making. Additionally, Batch-Constrained Q-learning (BCQ) is introduced to further constrain the actions taken by the policy, mitigating overly optimistic estimates through a batch constraint mechanism. Finally, the Twin Delayed Deep Deterministic Policy Gradient with Batch-Constrained Q-learning (TD3BCQ) framework is developed to enable fine-grained decision-making for local execution and power allocation during task offloading within a continuous action space. Experimental results demonstrate that the proposed scheme achieves a more balanced offloading strategy and better exploits the available computing resources, leading to an approximate 20% improvement compared to the baselines.</div></div>","PeriodicalId":54346,"journal":{"name":"Vehicular Communications","volume":"53 ","pages":"Article 100901"},"PeriodicalIF":5.8000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vehicular Communications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214209625000282","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Mobile Edge Computing (MEC) effectively alleviates the computational burden faced by vehicles in processing compute-intensive tasks due to resource limitations. However, traditional approaches typically employ coarse-grained task offloading strategies that utilize sequential protocols and discrete action spaces, resulting in high latency and increased energy consumption. These limitations render such strategies unsuitable for real-time applications. To address these challenges, an innovative computation offloading strategy is proposed, specifically designed to minimize the long-term average computation cost in a multi-vehicle, multi-server Internet of Vehicles (IoV) system. The MEC system model is constructed using Multiple-Input Multiple-Output (MIMO) technology, which facilitates simultaneous uplink transmissions from all vehicles, significantly reducing the time required for data uploads. Subsequently, a continuous action space is adopted to enhance both the flexibility and precision of decision-making. Additionally, Batch-Constrained Q-learning (BCQ) is introduced to further constrain the actions taken by the policy, mitigating overly optimistic estimates through a batch constraint mechanism. Finally, the Twin Delayed Deep Deterministic Policy Gradient with Batch-Constrained Q-learning (TD3BCQ) framework is developed to enable fine-grained decision-making for local execution and power allocation during task offloading within a continuous action space. Experimental results demonstrate that the proposed scheme achieves a more balanced offloading strategy and better exploits the available computing resources, leading to an approximate 20% improvement compared to the baselines.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
求助全文
约1分钟内获得全文 去求助
来源期刊
Vehicular Communications
Vehicular Communications Engineering-Electrical and Electronic Engineering
CiteScore
12.70
自引率
10.40%
发文量
88
审稿时长
62 days
期刊介绍: Vehicular communications is a growing area of communications between vehicles and including roadside communication infrastructure. Advances in wireless communications are making possible sharing of information through real time communications between vehicles and infrastructure. This has led to applications to increase safety of vehicles and communication between passengers and the Internet. Standardization efforts on vehicular communication are also underway to make vehicular transportation safer, greener and easier. The aim of the journal is to publish high quality peer–reviewed papers in the area of vehicular communications. The scope encompasses all types of communications involving vehicles, including vehicle–to–vehicle and vehicle–to–infrastructure. The scope includes (but not limited to) the following topics related to vehicular communications: Vehicle to vehicle and vehicle to infrastructure communications Channel modelling, modulating and coding Congestion Control and scalability issues Protocol design, testing and verification Routing in vehicular networks Security issues and countermeasures Deployment and field testing Reducing energy consumption and enhancing safety of vehicles Wireless in–car networks Data collection and dissemination methods Mobility and handover issues Safety and driver assistance applications UAV Underwater communications Autonomous cooperative driving Social networks Internet of vehicles Standardization of protocols.
期刊最新文献
Intelligent and efficient Metaverse rendering and caching in UAV-aided vehicular edge computing 5G NR sidelink time domain based resource allocation in C-V2X Task offloading and multi-cache placement based on DRL in UAV-assisted MEC networks A question-centric review on DRL-based optimization for UAV-assisted MEC sensor and IoT applications, challenges, and future directions Optimizing task offloading in MIMO-enabled vehicular networks through deep reinforcement learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1