{"title":"Associative Criteria in Mutually Dependent Markov Decision Processes","authors":"Toshiharu Fujita","doi":"10.1109/IIAI-AAI.2014.39","DOIUrl":null,"url":null,"abstract":"In this paper, we consider associative criteria in mutually dependent Markov decision processes (MDMDP). The MDMDP model is structured upon two types of finite-stage Markov decision process: main-process and sub-process. At each stage, the reward in one process is given by the optimal value of the alternative process problem, whose initial state is determined by the current state and decision in the original process. We introduce an associative criterion to each MDMDP and derive mutually dependent recursive equations by dynamic programming with an invariant imbedding technique.","PeriodicalId":432222,"journal":{"name":"2014 IIAI 3rd International Conference on Advanced Applied Informatics","volume":"7 Suppl 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IIAI 3rd International Conference on Advanced Applied Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIAI-AAI.2014.39","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In this paper, we consider associative criteria in mutually dependent Markov decision processes (MDMDP). The MDMDP model is structured upon two types of finite-stage Markov decision process: main-process and sub-process. At each stage, the reward in one process is given by the optimal value of the alternative process problem, whose initial state is determined by the current state and decision in the original process. We introduce an associative criterion to each MDMDP and derive mutually dependent recursive equations by dynamic programming with an invariant imbedding technique.