Yibo Chen;Zhijin Zhao;Xueyi Ye;Shilian Zheng;Xiaoniu Yang
{"title":"Intelligent Decision-Making for Asynchronous Dynamic Orthogonal Networking Based on DO-QMIX Algorithm","authors":"Yibo Chen;Zhijin Zhao;Xueyi Ye;Shilian Zheng;Xiaoniu Yang","doi":"10.1109/LCOMM.2025.3529531","DOIUrl":null,"url":null,"abstract":"In order to intelligently select the frequency points of each subnet in the asynchronous dynamic orthogonal networking (ADON), we propose the QMIX algorithm based on dataset aggregation and options architecture (DO-QMIX). Joint reward is maximized to mitigate the problem of partially observable environment. Specifically, we first pre-train the network by dataset aggregation (DAgger) to improve the sample utilization. Then, we fine-tune the policy via experiences generated by options architecture (OA) to avoid getting trapped in local optima. Numerical results show that the proposed DO-QMIX outperforms the comparison algorithms in the three complex electromagnetic environments.","PeriodicalId":13197,"journal":{"name":"IEEE Communications Letters","volume":"29 3","pages":"537-541"},"PeriodicalIF":3.7000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Communications Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10841382/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
In order to intelligently select the frequency points of each subnet in the asynchronous dynamic orthogonal networking (ADON), we propose the QMIX algorithm based on dataset aggregation and options architecture (DO-QMIX). Joint reward is maximized to mitigate the problem of partially observable environment. Specifically, we first pre-train the network by dataset aggregation (DAgger) to improve the sample utilization. Then, we fine-tune the policy via experiences generated by options architecture (OA) to avoid getting trapped in local optima. Numerical results show that the proposed DO-QMIX outperforms the comparison algorithms in the three complex electromagnetic environments.
期刊介绍:
The IEEE Communications Letters publishes short papers in a rapid publication cycle on advances in the state-of-the-art of communication over different media and channels including wire, underground, waveguide, optical fiber, and storage channels. Both theoretical contributions (including new techniques, concepts, and analyses) and practical contributions (including system experiments and prototypes, and new applications) are encouraged. This journal focuses on the physical layer and the link layer of communication systems.