Learning in Games with Progressive Hiding

Heymann Benjamin, Lanctot Marc
DOI: arxiv-2409.03875 (https://doi.org/arxiv-2409.03875)
Journal: arXiv - CS - Computer Science and Game Theory
Publication date: 2024-09-05
Citations: 0

Abstract

When learning to play an imperfect-information game, it is often easier to start with the basic mechanics of the game rules. For example, one can play several example rounds with private cards revealed to all players to better understand the basic actions and their effects. Building on this intuition, this paper introduces progressive hiding, an algorithm that learns to play imperfect-information games by first learning the basic mechanics and then progressively adding information constraints over time. Progressive hiding is inspired by methods from stochastic multistage optimization, such as scenario decomposition and progressive hedging. We prove that it enables the adaptation of counterfactual regret minimization to games where perfect recall is not satisfied. Numerical experiments illustrate that progressive hiding can achieve the optimal payoff in a benchmark emergent-communication trading game.
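The abstract names progressive hedging as one inspiration. The sketch below is not the paper's progressive hiding algorithm; it is a minimal illustration of classical progressive hedging on a toy problem chosen here for clarity. The function name `progressive_hedging`, the quadratic per-scenario cost, and the values of `rho` and `iterations` are all assumptions made for this example. Each scenario is first solved as if its outcome were known in advance, and a penalty then progressively forces the per-scenario decisions to agree on a single decision that does not depend on hidden information.

```python
from typing import List, Tuple


def progressive_hedging(
    targets: List[float],
    probs: List[float],
    rho: float = 0.5,
    iterations: int = 60,
) -> Tuple[float, List[float]]:
    """Toy progressive hedging for min_x sum_s p_s * 0.5 * (x - a_s)**2.

    targets[s] is a_s, the ideal decision if scenario s were known.
    probs[s] is p_s, the scenario probability (assumed to sum to 1).
    Returns the consensus decision and the final per-scenario decisions.
    """
    # Step 0: solve every scenario with full information (x_s = a_s).
    xs = list(targets)
    consensus = sum(p * x for p, x in zip(probs, xs))
    weights = [0.0 for _ in targets]  # prices on disagreement

    for _ in range(iterations):
        # Raise the price of deviating from the current consensus.
        weights = [w + rho * (x - consensus) for w, x in zip(weights, xs)]
        # Re-solve each scenario subproblem
        #   min_x 0.5*(x - a_s)**2 + w_s*x + 0.5*rho*(x - consensus)**2,
        # which has this closed-form minimizer for the quadratic cost.
        xs = [
            (a - w + rho * consensus) / (1.0 + rho)
            for a, w in zip(targets, weights)
        ]
        # Project back onto a single, information-independent decision.
        consensus = sum(p * x for p, x in zip(probs, xs))

    return consensus, xs
```

For example, `progressive_hedging([0.0, 4.0], [0.25, 0.75])` starts from the two full-information decisions 0 and 4 and drives both toward the probability-weighted mean 3, which minimizes the expected cost of this toy problem.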