{"title":"Collaborative Content Caching Algorithm for Large-Scale ISTNs Based on MAPPO","authors":"Ranshu Peng;Shi Chen;Changbin Xue","doi":"10.1109/LWC.2024.3443154","DOIUrl":null,"url":null,"abstract":"In this letter, we investigate the content caching problem within large-scale integrated satellite-terrestrial networks, focusing on a fusion scenario of future large-scale remote sensing constellations and communication satellite networks. Our investigation relies on deep reinforcement learning techniques aimed at minimizing the long-term average content delivery delay. To address the inherent challenge of convergence in single-agent algorithms, we propose clustering intelligent remote sensing satellites, with each cluster headed by an intelligent agent. Based on the characteristics of the model, we modify the multi-agent proximal policy optimization (MAPPO) algorithm by integrating long short-term memory (LSTM) to capture the correlation of the state information of different agents in the time domain. Simulation results show that the proposed LSTM-MAPPO outperforms the benchmarks, exhibiting faster convergence speed and lower standard deviation.","PeriodicalId":13343,"journal":{"name":"IEEE Wireless Communications Letters","volume":"13 11","pages":"3069-3073"},"PeriodicalIF":5.5000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Wireless Communications Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10634974/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In this letter, we investigate the content caching problem within large-scale integrated satellite-terrestrial networks, focusing on a fusion scenario of future large-scale remote sensing constellations and communication satellite networks. Our investigation relies on deep reinforcement learning techniques aimed at minimizing the long-term average content delivery delay. To address the inherent challenge of convergence in single-agent algorithms, we propose clustering intelligent remote sensing satellites, with each cluster headed by an intelligent agent. Based on the characteristics of the model, we modify the multi-agent proximal policy optimization (MAPPO) algorithm by integrating long short-term memory (LSTM) to capture the correlation of the state information of different agents in the time domain. Simulation results show that the proposed LSTM-MAPPO outperforms the benchmarks, exhibiting faster convergence speed and lower standard deviation.
期刊介绍:
IEEE Wireless Communications Letters publishes short papers in a rapid publication cycle on advances in the state-of-the-art of wireless communications. Both theoretical contributions (including new techniques, concepts, and analyses) and practical contributions (including system experiments and prototypes, and new applications) are encouraged. This journal focuses on the physical layer and the link layer of wireless communication systems.