{"title":"Priority-based Task Scheduling in Dynamic Environments for Cognitive MFR via Transfer DRL","authors":"Sunila Akbar, R. Adve, Z. Ding, P. Moo","doi":"10.1109/RadarConf2351548.2023.10149670","DOIUrl":null,"url":null,"abstract":"A radar resource management module in a cognitive multifunction radar manages the resources by first prioritizing and then scheduling the tasks. Apart from scheduling the tasks, the task scheduler of a cognitive radar requires the scheduling to be adaptable to the changing environment. We formulate a gen-eral model for the distributions of task parameters, specifically, task priorities and delay tolerance, to ensure priority-based task scheduling. We develop the use of transfer learning (TL) within a deep reinforcement learning (DRL) framework to address the challenge of adaptability to a varying environment. Our approach builds on using a Monte Carlo Tree Search (MCTS) aided by a deep neural network (DNN). We show that TL allows accelerated training by transferring the policy learned by training the D NN-based MCTS on initial parameter distribution (environment) to the policy required for a new environment. Our results show that the high priority tasks are least delayed and dropped with the new formulation, whereas TL ensures the respective adaptation to the dynamic environment.","PeriodicalId":168311,"journal":{"name":"2023 IEEE Radar Conference (RadarConf23)","volume":"3 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Radar Conference (RadarConf23)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RadarConf2351548.2023.10149670","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A radar resource management module in a cognitive multifunction radar manages the resources by first prioritizing and then scheduling the tasks. Apart from scheduling the tasks, the task scheduler of a cognitive radar requires the scheduling to be adaptable to the changing environment. We formulate a gen-eral model for the distributions of task parameters, specifically, task priorities and delay tolerance, to ensure priority-based task scheduling. We develop the use of transfer learning (TL) within a deep reinforcement learning (DRL) framework to address the challenge of adaptability to a varying environment. Our approach builds on using a Monte Carlo Tree Search (MCTS) aided by a deep neural network (DNN). We show that TL allows accelerated training by transferring the policy learned by training the D NN-based MCTS on initial parameter distribution (environment) to the policy required for a new environment. Our results show that the high priority tasks are least delayed and dropped with the new formulation, whereas TL ensures the respective adaptation to the dynamic environment.