Games with imperfect information: theory and algorithms

L. Doyen, Jean-François Raskin
{"title":"Games with imperfect information: theory and algorithms","authors":"L. Doyen, Jean-François Raskin","doi":"10.1017/CBO9780511973468.007","DOIUrl":null,"url":null,"abstract":"We study observation-based strategies for two-player turn-based games played on graphs with parity objectives. An observation-based strategy relies on imperfect information about the history of a play, namely, on the past sequence of observations. Such games occur in the synthesis of a controller that does not see the private state of the plant. Our main results are twofold. First, we give a fixed-point algorithm for computing the set of states from which a player can win with a deterministic observation-based strategy for a parity objective. Second, we give an algorithm for computing the set of states from which a player can win with probability 1 with a randomised observation-based strategy for a reachability objective. This set is of interest because in the absence of perfect information, randomised strategies are more powerful than deterministic ones. Games are natural models for reactive systems. We consider zero-sum two player turn-based games of infinite duration played on finite graphs. One player represents a control program, and the second player represents its environment. The graph describes the possible interactions of the system, and the game is of infinite duration because reactive systems are usually not expected to terminate. In the simplest setting, the game is turn-based and with perfect information, meaning that the players have full knowledge of both the game structure and the sequence of moves played by the adversary. The winning condition in a zero-sum graph game is defined by a set of plays that the first player aims to enforce, and that the second player aims to avoid.","PeriodicalId":431931,"journal":{"name":"Lectures in Game Theory for Computer Scientists","volume":"485 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lectures in Game Theory for Computer Scientists","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1017/CBO9780511973468.007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 37

Abstract

We study observation-based strategies for two-player turn-based games played on graphs with parity objectives. An observation-based strategy relies on imperfect information about the history of a play, namely, on the past sequence of observations. Such games occur in the synthesis of a controller that does not see the private state of the plant. Our main results are twofold. First, we give a fixed-point algorithm for computing the set of states from which a player can win with a deterministic observation-based strategy for a parity objective. Second, we give an algorithm for computing the set of states from which a player can win with probability 1 with a randomised observation-based strategy for a reachability objective. This set is of interest because in the absence of perfect information, randomised strategies are more powerful than deterministic ones. Games are natural models for reactive systems. We consider zero-sum two player turn-based games of infinite duration played on finite graphs. One player represents a control program, and the second player represents its environment. The graph describes the possible interactions of the system, and the game is of infinite duration because reactive systems are usually not expected to terminate. In the simplest setting, the game is turn-based and with perfect information, meaning that the players have full knowledge of both the game structure and the sequence of moves played by the adversary. The winning condition in a zero-sum graph game is defined by a set of plays that the first player aims to enforce, and that the second player aims to avoid.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
不完全信息博弈:理论与算法
我们研究了基于观察的基于回合制的双玩家游戏策略。基于观察的策略依赖于关于戏剧历史的不完全信息,即过去的观察序列。这样的博弈发生在控制器的合成中,控制器看不到植物的私有状态。我们的主要结果是双重的。首先,我们给出了一个定点算法,用于计算玩家可以通过基于确定性观察的策略赢得奇偶性目标的状态集。其次,我们给出了一种算法,用于计算玩家在可达性目标下使用随机观察策略以1概率获胜的状态集。这个集合很有趣,因为在缺乏完美信息的情况下,随机策略比确定性策略更强大。游戏是反应性系统的自然模型。我们考虑的是在有限图上进行无限持续时间的零和二人回合制游戏。一个玩家代表一个控制程序,第二个玩家代表它的环境。图表描述了系统可能的交互作用,游戏的持续时间是无限的,因为反应性系统通常不会终止。在最简单的设置中,游戏是回合制的,拥有完善的信息,这意味着玩家能够完全了解游戏结构和对手的行动顺序。零和图形游戏中的获胜条件是由第一个玩家想要执行而第二个玩家想要避免的一系列玩法所定义的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Turn-based stochastic games Games with imperfect information: theory and algorithms
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1