Collective learning of action sequences
Gerhard Weiss
[1993] Proceedings. The 13th International Conference on Distributed Computing Systems
Published: 1993-05-25
DOI: 10.1109/ICDCS.1993.287707 (https://doi.org/10.1109/ICDCS.1993.287707)
Citations: 3
Abstract
Learning in multiagent systems is a new research field in distributed artificial intelligence. The author investigates an action-oriented approach to delayed reinforcement learning in reactive multiagent systems and focuses on the question of how the agents can learn to coordinate their actions. Two basic algorithms for the collective learning of appropriate action sequences are introduced: the ACE algorithm and the AGE algorithm (ACE and AGE stand for Action Estimation and Action Group Estimation, respectively). Both algorithms explicitly take into consideration that (i) each agent typically knows only a fraction of its environment, (ii) the agents typically have to cooperate in solving tasks, and (iii) actions carried out by the agents can be incompatible. The experiments described illustrate these algorithms and their learning capacities.