{"title":"完美信息蒙特卡洛与延迟推理","authors":"Jérôme Arjonilla, Abdallah Saffidine, Tristan Cazenave","doi":"arxiv-2408.02380","DOIUrl":null,"url":null,"abstract":"Imperfect information games, such as Bridge and Skat, present challenges due\nto state-space explosion and hidden information, posing formidable obstacles\nfor search algorithms. Determinization-based algorithms offer a resolution by\nsampling hidden information and solving the game in a perfect information\nsetting, facilitating rapid and effective action estimation. However,\ntransitioning to perfect information introduces challenges, notably one called\nstrategy fusion.This research introduces `Extended Perfect Information Monte\nCarlo' (EPIMC), an online algorithm inspired by the state-of-the-art\ndeterminization-based approach Perfect Information Monte Carlo (PIMC). EPIMC\nenhances the capabilities of PIMC by postponing the perfect information\nresolution, reducing alleviating issues related to strategy fusion. However,\nthe decision to postpone the leaf evaluator introduces novel considerations,\nsuch as the interplay between prior levels of reasoning and the newly deferred\nresolution. In our empirical analysis, we investigate the performance of EPIMC\nacross a range of games, with a particular focus on those characterized by\nvarying degrees of strategy fusion. Our results demonstrate notable performance\nenhancements, particularly in games where strategy fusion significantly impacts\ngameplay. Furthermore, our research contributes to the theoretical foundation\nof determinization-based algorithms addressing challenges associated with\nstrategy fusion.%, thereby enhancing our understanding of these algorithms\nwithin the context of imperfect information game scenarios.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"15 Suppl 1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Perfect Information Monte Carlo with Postponing Reasoning\",\"authors\":\"Jérôme Arjonilla, Abdallah Saffidine, Tristan Cazenave\",\"doi\":\"arxiv-2408.02380\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Imperfect information games, such as Bridge and Skat, present challenges due\\nto state-space explosion and hidden information, posing formidable obstacles\\nfor search algorithms. Determinization-based algorithms offer a resolution by\\nsampling hidden information and solving the game in a perfect information\\nsetting, facilitating rapid and effective action estimation. However,\\ntransitioning to perfect information introduces challenges, notably one called\\nstrategy fusion.This research introduces `Extended Perfect Information Monte\\nCarlo' (EPIMC), an online algorithm inspired by the state-of-the-art\\ndeterminization-based approach Perfect Information Monte Carlo (PIMC). EPIMC\\nenhances the capabilities of PIMC by postponing the perfect information\\nresolution, reducing alleviating issues related to strategy fusion. However,\\nthe decision to postpone the leaf evaluator introduces novel considerations,\\nsuch as the interplay between prior levels of reasoning and the newly deferred\\nresolution. In our empirical analysis, we investigate the performance of EPIMC\\nacross a range of games, with a particular focus on those characterized by\\nvarying degrees of strategy fusion. Our results demonstrate notable performance\\nenhancements, particularly in games where strategy fusion significantly impacts\\ngameplay. Furthermore, our research contributes to the theoretical foundation\\nof determinization-based algorithms addressing challenges associated with\\nstrategy fusion.%, thereby enhancing our understanding of these algorithms\\nwithin the context of imperfect information game scenarios.\",\"PeriodicalId\":501479,\"journal\":{\"name\":\"arXiv - CS - Artificial Intelligence\",\"volume\":\"15 Suppl 1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.02380\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.02380","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Perfect Information Monte Carlo with Postponing Reasoning
Imperfect information games, such as Bridge and Skat, present challenges due
to state-space explosion and hidden information, posing formidable obstacles
for search algorithms. Determinization-based algorithms offer a resolution by
sampling hidden information and solving the game in a perfect information
setting, facilitating rapid and effective action estimation. However,
transitioning to perfect information introduces challenges, notably one called
strategy fusion.This research introduces `Extended Perfect Information Monte
Carlo' (EPIMC), an online algorithm inspired by the state-of-the-art
determinization-based approach Perfect Information Monte Carlo (PIMC). EPIMC
enhances the capabilities of PIMC by postponing the perfect information
resolution, reducing alleviating issues related to strategy fusion. However,
the decision to postpone the leaf evaluator introduces novel considerations,
such as the interplay between prior levels of reasoning and the newly deferred
resolution. In our empirical analysis, we investigate the performance of EPIMC
across a range of games, with a particular focus on those characterized by
varying degrees of strategy fusion. Our results demonstrate notable performance
enhancements, particularly in games where strategy fusion significantly impacts
gameplay. Furthermore, our research contributes to the theoretical foundation
of determinization-based algorithms addressing challenges associated with
strategy fusion.%, thereby enhancing our understanding of these algorithms
within the context of imperfect information game scenarios.