{"title":"Cooperative Navigation via Relational Graphs and State Abstraction","authors":"Salwa Mostafa;Mohamed K. Abdel-Aziz;Mehdi Bennis","doi":"10.1109/LNET.2023.3285295","DOIUrl":null,"url":null,"abstract":"We consider a cooperative-navigation problem in a partially observable MADRL framework. We investigate how agents cooperate to learn a communication protocol given a very large state space while generalizing to a new environment. The proposed solution leverages the notion of structured observation and abstraction, in which the raw-pixel observations are converted into a relational graph that is then used for learning abstraction. Abstraction is performed based on compression using a relational graph autoencoder (RGAE) and a multilayer perceptron (MLP) to remove irrelevant information. The results show the effectiveness of the proposed MLP and RGAE in learning better policies with better generalization capabilities. It is also shown that communication among agents is instrumental in improving the navigation task performance.","PeriodicalId":100628,"journal":{"name":"IEEE Networking Letters","volume":"5 4","pages":"184-188"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Networking Letters","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10149094/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We consider a cooperative-navigation problem in a partially observable MADRL framework. We investigate how agents cooperate to learn a communication protocol given a very large state space while generalizing to a new environment. The proposed solution leverages the notion of structured observation and abstraction, in which the raw-pixel observations are converted into a relational graph that is then used for learning abstraction. Abstraction is performed based on compression using a relational graph autoencoder (RGAE) and a multilayer perceptron (MLP) to remove irrelevant information. The results show the effectiveness of the proposed MLP and RGAE in learning better policies with better generalization capabilities. It is also shown that communication among agents is instrumental in improving the navigation task performance.