{"title":"Optimizing task offloading in MIMO-enabled vehicular networks through deep reinforcement learning","authors":"Jian Xu, Shengchao Su","doi":"10.1016/j.vehcom.2025.100901","DOIUrl":null,"url":null,"abstract":"<div><div>Mobile Edge Computing (MEC) effectively alleviates the computational burden faced by vehicles in processing compute-intensive tasks due to resource limitations. However, traditional approaches typically employ coarse-grained task offloading strategies that utilize sequential protocols and discrete action spaces, resulting in high latency and increased energy consumption. These limitations render such strategies unsuitable for real-time applications. To address these challenges, an innovative computation offloading strategy is proposed, specifically designed to minimize the long-term average computation cost in a multi-vehicle, multi-server Internet of Vehicles (IoV) system. The MEC system model is constructed using Multiple-Input Multiple-Output (MIMO) technology, which facilitates simultaneous uplink transmissions from all vehicles, significantly reducing the time required for data uploads. Subsequently, a continuous action space is adopted to enhance both the flexibility and precision of decision-making. Additionally, Batch-Constrained Q-learning (BCQ) is introduced to further constrain the actions taken by the policy, mitigating overly optimistic estimates through a batch constraint mechanism. Finally, the Twin Delayed Deep Deterministic Policy Gradient with Batch-Constrained Q-learning (TD3BCQ) framework is developed to enable fine-grained decision-making for local execution and power allocation during task offloading within a continuous action space. Experimental results demonstrate that the proposed scheme achieves a more balanced offloading strategy and better exploits the available computing resources, leading to an approximate 20% improvement compared to the baselines.</div></div>","PeriodicalId":54346,"journal":{"name":"Vehicular Communications","volume":"53 ","pages":"Article 100901"},"PeriodicalIF":5.8000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vehicular Communications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214209625000282","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Mobile Edge Computing (MEC) effectively alleviates the computational burden faced by vehicles in processing compute-intensive tasks due to resource limitations. However, traditional approaches typically employ coarse-grained task offloading strategies that utilize sequential protocols and discrete action spaces, resulting in high latency and increased energy consumption. These limitations render such strategies unsuitable for real-time applications. To address these challenges, an innovative computation offloading strategy is proposed, specifically designed to minimize the long-term average computation cost in a multi-vehicle, multi-server Internet of Vehicles (IoV) system. The MEC system model is constructed using Multiple-Input Multiple-Output (MIMO) technology, which facilitates simultaneous uplink transmissions from all vehicles, significantly reducing the time required for data uploads. Subsequently, a continuous action space is adopted to enhance both the flexibility and precision of decision-making. Additionally, Batch-Constrained Q-learning (BCQ) is introduced to further constrain the actions taken by the policy, mitigating overly optimistic estimates through a batch constraint mechanism. Finally, the Twin Delayed Deep Deterministic Policy Gradient with Batch-Constrained Q-learning (TD3BCQ) framework is developed to enable fine-grained decision-making for local execution and power allocation during task offloading within a continuous action space. Experimental results demonstrate that the proposed scheme achieves a more balanced offloading strategy and better exploits the available computing resources, leading to an approximate 20% improvement compared to the baselines.
期刊介绍:
Vehicular communications is a growing area of communications between vehicles and including roadside communication infrastructure. Advances in wireless communications are making possible sharing of information through real time communications between vehicles and infrastructure. This has led to applications to increase safety of vehicles and communication between passengers and the Internet. Standardization efforts on vehicular communication are also underway to make vehicular transportation safer, greener and easier.
The aim of the journal is to publish high quality peer–reviewed papers in the area of vehicular communications. The scope encompasses all types of communications involving vehicles, including vehicle–to–vehicle and vehicle–to–infrastructure. The scope includes (but not limited to) the following topics related to vehicular communications:
Vehicle to vehicle and vehicle to infrastructure communications
Channel modelling, modulating and coding
Congestion Control and scalability issues
Protocol design, testing and verification
Routing in vehicular networks
Security issues and countermeasures
Deployment and field testing
Reducing energy consumption and enhancing safety of vehicles
Wireless in–car networks
Data collection and dissemination methods
Mobility and handover issues
Safety and driver assistance applications
UAV
Underwater communications
Autonomous cooperative driving
Social networks
Internet of vehicles
Standardization of protocols.