{"title":"Adaptive control of a class of nonidentifiable Markov chains","authors":"A. Jalali, M. Ferguson","doi":"10.1109/CDC.1990.203606","DOIUrl":null,"url":null,"abstract":"Adaptive control of finite unknown Markov chains is considered. A new performance criterion from the theory of bandit processes has recently been introduced for adaptive control of Markov chains. The new performance criterion is stronger than the expected average cost criterion and is more appropriate when the identifiability condition does not hold. An adaptive controller is derived to achieve optimality for a modified version of the new performance criterion.<<ETX>>","PeriodicalId":287089,"journal":{"name":"29th IEEE Conference on Decision and Control","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1990-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"29th IEEE Conference on Decision and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CDC.1990.203606","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Adaptive control of finite unknown Markov chains is considered. A new performance criterion from the theory of bandit processes has recently been introduced for adaptive control of Markov chains. The new performance criterion is stronger than the expected average cost criterion and is more appropriate when the identifiability condition does not hold. An adaptive controller is derived to achieve optimality for a modified version of the new performance criterion.<>