{"title":"基于多智能体深度强化学习的动态异构V2V网络传输设计","authors":"Nong Qu, Chao Wang, Zuxing Li, Fuqiang Liu","doi":"10.23919/JCC.fa.2021-0825.202307","DOIUrl":null,"url":null,"abstract":"In highly dynamic and heterogeneous vehicular communication networks, it is challenging to efficiently utilize network resources and ensure demanding performance requirements of safety-related applications. This paper investigates machine-learning-assisted transmission design in a typical multi-user vehicle-to-vehicle (V2V) communication scenario. The transmission process proceeds sequentially along the discrete time steps, where several source nodes intend to deliver multiple different types of messages to their respective destinations within the same spectrum. Due to rapid movement of vehicles, real-time acquirement of channel knowledge and central coordination of all transmission actions are in general hard to realize. We consider applying multi-agent deep reinforcement learning (MADRL) to handle this issue. By transforming the transmission design problem into a stochastic game, a multi-agent proximal policy optimization (MAPPO) algorithm under a centralized training and decentralized execution framework is proposed such that each source decides its own transmission message type, power level, and data rate, based on local observations of the environment and feedback, to maximize its energy efficiency. Via simulations we show that our method achieves better performance over conventional methods.","PeriodicalId":9814,"journal":{"name":"China Communications","volume":"20 1","pages":"273-289"},"PeriodicalIF":3.1000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A transmission design in dynamic heterogeneous V2V networks through multi-agent deep reinforcement learning\",\"authors\":\"Nong Qu, Chao Wang, Zuxing Li, Fuqiang Liu\",\"doi\":\"10.23919/JCC.fa.2021-0825.202307\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In highly dynamic and heterogeneous vehicular communication networks, it is challenging to efficiently utilize network resources and ensure demanding performance requirements of safety-related applications. This paper investigates machine-learning-assisted transmission design in a typical multi-user vehicle-to-vehicle (V2V) communication scenario. The transmission process proceeds sequentially along the discrete time steps, where several source nodes intend to deliver multiple different types of messages to their respective destinations within the same spectrum. Due to rapid movement of vehicles, real-time acquirement of channel knowledge and central coordination of all transmission actions are in general hard to realize. We consider applying multi-agent deep reinforcement learning (MADRL) to handle this issue. By transforming the transmission design problem into a stochastic game, a multi-agent proximal policy optimization (MAPPO) algorithm under a centralized training and decentralized execution framework is proposed such that each source decides its own transmission message type, power level, and data rate, based on local observations of the environment and feedback, to maximize its energy efficiency. Via simulations we show that our method achieves better performance over conventional methods.\",\"PeriodicalId\":9814,\"journal\":{\"name\":\"China Communications\",\"volume\":\"20 1\",\"pages\":\"273-289\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2023-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"China Communications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.23919/JCC.fa.2021-0825.202307\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"TELECOMMUNICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"China Communications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.23919/JCC.fa.2021-0825.202307","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
A transmission design in dynamic heterogeneous V2V networks through multi-agent deep reinforcement learning
In highly dynamic and heterogeneous vehicular communication networks, it is challenging to efficiently utilize network resources and ensure demanding performance requirements of safety-related applications. This paper investigates machine-learning-assisted transmission design in a typical multi-user vehicle-to-vehicle (V2V) communication scenario. The transmission process proceeds sequentially along the discrete time steps, where several source nodes intend to deliver multiple different types of messages to their respective destinations within the same spectrum. Due to rapid movement of vehicles, real-time acquirement of channel knowledge and central coordination of all transmission actions are in general hard to realize. We consider applying multi-agent deep reinforcement learning (MADRL) to handle this issue. By transforming the transmission design problem into a stochastic game, a multi-agent proximal policy optimization (MAPPO) algorithm under a centralized training and decentralized execution framework is proposed such that each source decides its own transmission message type, power level, and data rate, based on local observations of the environment and feedback, to maximize its energy efficiency. Via simulations we show that our method achieves better performance over conventional methods.
期刊介绍:
China Communications (ISSN 1673-5447) is an English-language monthly journal cosponsored by the China Institute of Communications (CIC) and IEEE Communications Society (IEEE ComSoc). It is aimed at readers in industry, universities, research and development organizations, and government agencies in the field of Information and Communications Technologies (ICTs) worldwide.
The journal's main objective is to promote academic exchange in the ICTs sector and publish high-quality papers to contribute to the global ICTs industry. It provides instant access to the latest articles and papers, presenting leading-edge research achievements, tutorial overviews, and descriptions of significant practical applications of technology.
China Communications has been indexed in SCIE (Science Citation Index-Expanded) since January 2007. Additionally, all articles have been available in the IEEE Xplore digital library since January 2013.