{"title":"斯塔克尔伯格博弈中的满意与遗憾","authors":"Langford White, Duong Nguyen, Hung Nguyen","doi":"arxiv-2408.11340","DOIUrl":null,"url":null,"abstract":"This paper introduces the new concept of (follower) satisfaction in\nStackelberg games and compares the standard Stackelberg game with its\nsatisfaction version. Simulation results are presented which suggest that the\nfollower adopting satisfaction generally increases leader utility. This\nimportant new result is proven for the case where leader strategies to commit\nto are restricted to be deterministic (pure strategies). The paper then\naddresses the application of regret based algorithms to the Stackelberg\nproblem. Although it is known that the follower adopts a no-regret position in\na Stackelberg solution, this is not generally the case for the leader. The\nreport examines the convergence behaviour of unconditional and conditional\nregret matching (RM) algorithms in the Stackelberg setting. The paper shows\nthat, in the examples considered, that these algorithms either converge to any\npure Nash equilibria for the simultaneous move game, or to some mixed\nstrategies which do not have the \"no-regret\" property. In one case, convergence\nof the conditional RM algorithm over both players to a solution \"close\" to the\nStackelberg case was observed. The paper argues that further research in this\narea, in particular when applied in the satisfaction setting could be fruitful.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"204 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Satisfaction and Regret in Stackelberg Games\",\"authors\":\"Langford White, Duong Nguyen, Hung Nguyen\",\"doi\":\"arxiv-2408.11340\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper introduces the new concept of (follower) satisfaction in\\nStackelberg games and compares the standard Stackelberg game with its\\nsatisfaction version. Simulation results are presented which suggest that the\\nfollower adopting satisfaction generally increases leader utility. This\\nimportant new result is proven for the case where leader strategies to commit\\nto are restricted to be deterministic (pure strategies). The paper then\\naddresses the application of regret based algorithms to the Stackelberg\\nproblem. Although it is known that the follower adopts a no-regret position in\\na Stackelberg solution, this is not generally the case for the leader. The\\nreport examines the convergence behaviour of unconditional and conditional\\nregret matching (RM) algorithms in the Stackelberg setting. The paper shows\\nthat, in the examples considered, that these algorithms either converge to any\\npure Nash equilibria for the simultaneous move game, or to some mixed\\nstrategies which do not have the \\\"no-regret\\\" property. In one case, convergence\\nof the conditional RM algorithm over both players to a solution \\\"close\\\" to the\\nStackelberg case was observed. The paper argues that further research in this\\narea, in particular when applied in the satisfaction setting could be fruitful.\",\"PeriodicalId\":501316,\"journal\":{\"name\":\"arXiv - CS - Computer Science and Game Theory\",\"volume\":\"204 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Computer Science and Game Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.11340\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computer Science and Game Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.11340","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This paper introduces the new concept of (follower) satisfaction in
Stackelberg games and compares the standard Stackelberg game with its
satisfaction version. Simulation results are presented which suggest that the
follower adopting satisfaction generally increases leader utility. This
important new result is proven for the case where leader strategies to commit
to are restricted to be deterministic (pure strategies). The paper then
addresses the application of regret based algorithms to the Stackelberg
problem. Although it is known that the follower adopts a no-regret position in
a Stackelberg solution, this is not generally the case for the leader. The
report examines the convergence behaviour of unconditional and conditional
regret matching (RM) algorithms in the Stackelberg setting. The paper shows
that, in the examples considered, that these algorithms either converge to any
pure Nash equilibria for the simultaneous move game, or to some mixed
strategies which do not have the "no-regret" property. In one case, convergence
of the conditional RM algorithm over both players to a solution "close" to the
Stackelberg case was observed. The paper argues that further research in this
area, in particular when applied in the satisfaction setting could be fruitful.