{"title":"Multiple rewards fuzzy reinforcement learning algorithm in RoboCup environment","authors":"Li Shi, Yao Jinyi, Ye Zhen, S. Zeng-qi","doi":"10.1109/CCA.2001.973884","DOIUrl":null,"url":null,"abstract":"In order to achieve the competition tasks for multicooperating robots through learning, the paper discusses a kind of method that is designed for multi-agent systems (MAS), called the multi-reward fuzzy Q-learning algorithm (MRFQLA), which can be applied to the environment of the Robot World Cup Tournament (RoboCup). In MRFQLA., multiple reinforcement functions are established, based on the different characters of multi-agent systems. When the learning robot executes an action, these functions create multiple reinforcement signals that give the criteria of this action from different points of view. A Takagi-Sugeno (TS) model of a fuzzy inference system is built, which integrates these multiple rewards into one signal as the feedback of the learning robot. This method enhances the efficiency of learning because multiple rewards increase TD error and eliminates the conflict between the short-term target and the long-term one. Computer simulations in the RoboCup environment are shown and a discussion is given.","PeriodicalId":365390,"journal":{"name":"Proceedings of the 2001 IEEE International Conference on Control Applications (CCA'01) (Cat. No.01CH37204)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2001 IEEE International Conference on Control Applications (CCA'01) (Cat. No.01CH37204)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCA.2001.973884","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
In order to achieve the competition tasks for multicooperating robots through learning, the paper discusses a kind of method that is designed for multi-agent systems (MAS), called the multi-reward fuzzy Q-learning algorithm (MRFQLA), which can be applied to the environment of the Robot World Cup Tournament (RoboCup). In MRFQLA., multiple reinforcement functions are established, based on the different characters of multi-agent systems. When the learning robot executes an action, these functions create multiple reinforcement signals that give the criteria of this action from different points of view. A Takagi-Sugeno (TS) model of a fuzzy inference system is built, which integrates these multiple rewards into one signal as the feedback of the learning robot. This method enhances the efficiency of learning because multiple rewards increase TD error and eliminates the conflict between the short-term target and the long-term one. Computer simulations in the RoboCup environment are shown and a discussion is given.