{"title":"State generalization method based on uncertainty minimization of behavior outcomes for reactive agents","authors":"T. Yairi, K. Hori, S. Nakasuka","doi":"10.1109/ICCIMA.1999.798522","DOIUrl":null,"url":null,"abstract":"Autonomous state generalization is one of the key issues in the behavior acquisition problem of reactive agents. The paper proposes a novel state generalization method which discretizes the continuous sensor space based on entropy minimization of agents' behavior outcomes. This general framework unifies the heuristic criteria for state generalization used in conventional works. An experimental study in the latter part suggests that our method increases the adaptability of agents to the environment and improves the overall behavior performance.","PeriodicalId":110736,"journal":{"name":"Proceedings Third International Conference on Computational Intelligence and Multimedia Applications. ICCIMA'99 (Cat. No.PR00300)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Third International Conference on Computational Intelligence and Multimedia Applications. ICCIMA'99 (Cat. No.PR00300)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIMA.1999.798522","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Autonomous state generalization is one of the key issues in the behavior acquisition problem of reactive agents. The paper proposes a novel state generalization method which discretizes the continuous sensor space based on entropy minimization of agents' behavior outcomes. This general framework unifies the heuristic criteria for state generalization used in conventional works. An experimental study in the latter part suggests that our method increases the adaptability of agents to the environment and improves the overall behavior performance.