首页 > 最新文献

arXiv - CS - Computer Science and Game Theory最新文献

英文 中文
GPU-Accelerated Counterfactual Regret Minimization GPU 加速的反事实遗憾最小化
Pub Date : 2024-08-27 DOI: arxiv-2408.14778
Juho Kim
Counterfactual regret minimization (CFR) is a family of algorithms ofno-regret learning dynamics capable of solving large-scale imperfectinformation games. There has been a notable lack of work on making CFR morecomputationally efficient. We propose implementing this algorithm as a seriesof dense and sparse matrix and vector operations, thereby making it highlyparallelizable for a graphical processing unit. Our experiments show that ourimplementation performs up to about 352.5 times faster than OpenSpiel's Pythonimplementation and up to about 22.2 times faster than OpenSpiel's C++implementation and the speedup becomes more pronounced as the size of the gamebeing solved grows.
反事实遗憾最小化(CFR)是一种无遗憾学习动态算法,能够解决大规模的不完全信息博弈。在提高反事实遗憾最小化算法的计算效率方面,我们的研究明显不足。我们建议将该算法作为一系列密集和稀疏矩阵及向量运算来实现,从而使其在图形处理单元上具有高度可并行性。我们的实验表明,我们的实现比 OpenSpiel 的 Python 实现快约 352.5 倍,比 OpenSpiel 的 C++ 实现快约 22.2 倍。
{"title":"GPU-Accelerated Counterfactual Regret Minimization","authors":"Juho Kim","doi":"arxiv-2408.14778","DOIUrl":"https://doi.org/arxiv-2408.14778","url":null,"abstract":"Counterfactual regret minimization (CFR) is a family of algorithms of\u0000no-regret learning dynamics capable of solving large-scale imperfect\u0000information games. There has been a notable lack of work on making CFR more\u0000computationally efficient. We propose implementing this algorithm as a series\u0000of dense and sparse matrix and vector operations, thereby making it highly\u0000parallelizable for a graphical processing unit. Our experiments show that our\u0000implementation performs up to about 352.5 times faster than OpenSpiel's Python\u0000implementation and up to about 22.2 times faster than OpenSpiel's C++\u0000implementation and the speedup becomes more pronounced as the size of the game\u0000being solved grows.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"11 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ReLExS: Reinforcement Learning Explanations for Stackelberg No-Regret Learners ReLExS:针对 Stackelberg 无悔学习者的强化学习解释
Pub Date : 2024-08-26 DOI: arxiv-2408.14086
Xiangge Huang, Jingyuan Li, Jiaqing Xie
With the constraint of a no regret follower, will the players in a two-playerStackelberg game still reach Stackelberg equilibrium? We first show when thefollower strategy is either reward-average or transform-reward-average, the twoplayers can always get the Stackelberg Equilibrium. Then, we extend that theplayers can achieve the Stackelberg equilibrium in the two-player game underthe no regret constraint. Also, we show a strict upper bound of the follower'sutility difference between with and without no regret constraint. Moreover, inconstant-sum two-player Stackelberg games with non-regret action sequences, weensure the total optimal utility of the game remains also bounded.
在无悔追随者的约束下,双人斯塔克尔伯格博弈中的博弈者还能达到斯塔克尔伯格均衡吗?我们首先证明,当追随者的策略是奖励平均策略或变换奖励平均策略时,双人博弈者总能达到斯塔克尔伯格均衡。然后,我们进一步证明,在无悔约束条件下,玩家可以在双人博弈中实现斯塔克尔伯格均衡。同时,我们还证明了有无悔约束条件下追随者效用差的严格上限。此外,在有无悔行动序列的不恒等和双人斯塔克尔伯格博弈中,我们确保博弈的总最优效用也是有界的。
{"title":"ReLExS: Reinforcement Learning Explanations for Stackelberg No-Regret Learners","authors":"Xiangge Huang, Jingyuan Li, Jiaqing Xie","doi":"arxiv-2408.14086","DOIUrl":"https://doi.org/arxiv-2408.14086","url":null,"abstract":"With the constraint of a no regret follower, will the players in a two-player\u0000Stackelberg game still reach Stackelberg equilibrium? We first show when the\u0000follower strategy is either reward-average or transform-reward-average, the two\u0000players can always get the Stackelberg Equilibrium. Then, we extend that the\u0000players can achieve the Stackelberg equilibrium in the two-player game under\u0000the no regret constraint. Also, we show a strict upper bound of the follower's\u0000utility difference between with and without no regret constraint. Moreover, in\u0000constant-sum two-player Stackelberg games with non-regret action sequences, we\u0000ensure the total optimal utility of the game remains also bounded.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"204 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How to guide a present-biased agent through prescribed tasks? 如何引导偏重当前的代理完成规定任务?
Pub Date : 2024-08-24 DOI: arxiv-2408.13675
Tatiana Belova, Yuriy Dementiev, Fedor V. Fomin, Petr A. Golovach, Artur Ignatiev
The present bias is a well-documented behavioral trait that significantlyinfluences human decision-making, with present-biased agents often prioritizingimmediate rewards over long-term benefits, leading to suboptimal outcomes invarious real-world scenarios. Kleinberg and Oren (2014) proposed a populargraph-theoretical model of inconsistent planning to capture the behavior ofpresent-biased agents. In this model, a multi-step project is represented by aweighted directed acyclic task graph, where the agent traverses the graph basedon present-biased preferences. We use the model of Kleinberg and Oren to address the principal-agentproblem, where a principal, fully aware of the agent's present bias, aims tomodify an existing project by adding or deleting tasks. The challenge is tocreate a modified project that satisfies two somewhat contradictory conditions.On one hand, the present-biased agent should select specific tasks deemedimportant by the principal. On the other hand, if the anticipated costs in themodified project become too high for the agent, there is a risk of the agentabandoning the entire project, which is not in the principal's interest. To tackle this issue, we leverage the tools of parameterized complexity toinvestigate whether the principal's strategy can be efficiently identified. Weprovide algorithms and complexity bounds for this problem.
当下偏差是一种有据可查的行为特征,它严重影响人类的决策,具有当下偏差的代理人通常会优先考虑眼前的回报而非长远利益,从而在现实世界的各种场景中导致次优结果。Kleinberg 和 Oren(2014 年)提出了一种流行的不一致规划图论模型,以捕捉有现时偏见的代理人的行为。在该模型中,一个多步骤项目由一个加权的有向无环任务图来表示,代理根据当前偏好来遍历该图。我们使用 Kleinberg 和 Oren 的模型来解决委托人-代理人问题,在这个问题中,委托人完全了解代理人的当前偏好,目的是通过添加或删除任务来修改现有项目。一方面,有当前偏见的代理人应该选择委托人认为重要的特定任务。另一方面,如果修改后项目的预期成本对代理人来说过高,代理人就有可能放弃整个项目,这不符合委托人的利益。为了解决这个问题,我们利用参数化复杂性工具来研究委托人的策略是否能被有效识别。我们为这个问题提供了算法和复杂度边界。
{"title":"How to guide a present-biased agent through prescribed tasks?","authors":"Tatiana Belova, Yuriy Dementiev, Fedor V. Fomin, Petr A. Golovach, Artur Ignatiev","doi":"arxiv-2408.13675","DOIUrl":"https://doi.org/arxiv-2408.13675","url":null,"abstract":"The present bias is a well-documented behavioral trait that significantly\u0000influences human decision-making, with present-biased agents often prioritizing\u0000immediate rewards over long-term benefits, leading to suboptimal outcomes in\u0000various real-world scenarios. Kleinberg and Oren (2014) proposed a popular\u0000graph-theoretical model of inconsistent planning to capture the behavior of\u0000present-biased agents. In this model, a multi-step project is represented by a\u0000weighted directed acyclic task graph, where the agent traverses the graph based\u0000on present-biased preferences. We use the model of Kleinberg and Oren to address the principal-agent\u0000problem, where a principal, fully aware of the agent's present bias, aims to\u0000modify an existing project by adding or deleting tasks. The challenge is to\u0000create a modified project that satisfies two somewhat contradictory conditions.\u0000On one hand, the present-biased agent should select specific tasks deemed\u0000important by the principal. On the other hand, if the anticipated costs in the\u0000modified project become too high for the agent, there is a risk of the agent\u0000abandoning the entire project, which is not in the principal's interest. To tackle this issue, we leverage the tools of parameterized complexity to\u0000investigate whether the principal's strategy can be efficiently identified. We\u0000provide algorithms and complexity bounds for this problem.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Temporal Elections: Welfare, Strategyproofness, and Proportionality 时间选举:福利、策略防范和比例性
Pub Date : 2024-08-24 DOI: arxiv-2408.13637
Edith Elkind, Tzeh Yuan Neoh, Nicholas Teh
We investigate a model of sequential decision-making where a singlealternative is chosen at each round. We focus on two objectives-utilitarianwelfare (Util) and egalitarian welfare (Egal)-and consider the computationalcomplexity of the associated maximization problems, as well as theircompatibility with strategyproofness and proportionality. We observe thatmaximizing Util is easy, but the corresponding decision problem for Egal isNP-complete even in restricted cases. We complement this hardness result forEgal with parameterized complexity analysis and an approximation algorithm.Additionally, we show that, while a mechanism that outputs a Util outcome isstrategyproof, all deterministic mechanisms for computing Egal outcomes fail avery weak variant of strategyproofness, called non-obvious manipulability(NOM). However, we show that when agents have non-empty approval sets at eachtimestep, choosing an Egal-maximizing outcome while breaking tieslexicographically satisfies NOM. Regarding proportionality, we prove that aproportional (PROP) outcome can be computed efficiently, but finding an outcomethat maximizes Util while guaranteeing PROP is NP-hard. We also derive upperand lower bounds on the price of proportionality with respect to Util and Egal.
我们研究了一个顺序决策模型,在这个模型中,每一轮都只选择一个备选方案。我们重点研究了两个目标--功利主义福利(Util)和平等主义福利(Egal)--并考虑了相关最大化问题的计算复杂性,以及它们与策略防范性和比例性的匹配性。我们发现,最大化 Util 很容易,但 Egal 的相应决策问题即使在受限情况下也是 NP-完全的。此外,我们还证明,虽然输出 Util 结果的机制是策略无误的,但所有计算 Egal 结果的确定性机制都无法通过策略无误性的一个非常弱的变体,即非显而易见的可操作性(NOM)。然而,我们证明,当代理在每一步都有非空的批准集时,在打破联系的同时选择一个 Egal 最大化的结果在逻辑上满足 NOM。关于比例性,我们证明了比例性(PROP)结果可以高效计算,但找到一个最大化 Util 同时保证 PROP 的结果是 NP 难的。我们还推导出了关于 Util 和 Egal 的比例性价格的上限和下限。
{"title":"Temporal Elections: Welfare, Strategyproofness, and Proportionality","authors":"Edith Elkind, Tzeh Yuan Neoh, Nicholas Teh","doi":"arxiv-2408.13637","DOIUrl":"https://doi.org/arxiv-2408.13637","url":null,"abstract":"We investigate a model of sequential decision-making where a single\u0000alternative is chosen at each round. We focus on two objectives-utilitarian\u0000welfare (Util) and egalitarian welfare (Egal)-and consider the computational\u0000complexity of the associated maximization problems, as well as their\u0000compatibility with strategyproofness and proportionality. We observe that\u0000maximizing Util is easy, but the corresponding decision problem for Egal is\u0000NP-complete even in restricted cases. We complement this hardness result for\u0000Egal with parameterized complexity analysis and an approximation algorithm.\u0000Additionally, we show that, while a mechanism that outputs a Util outcome is\u0000strategyproof, all deterministic mechanisms for computing Egal outcomes fail a\u0000very weak variant of strategyproofness, called non-obvious manipulability\u0000(NOM). However, we show that when agents have non-empty approval sets at each\u0000timestep, choosing an Egal-maximizing outcome while breaking ties\u0000lexicographically satisfies NOM. Regarding proportionality, we prove that a\u0000proportional (PROP) outcome can be computed efficiently, but finding an outcome\u0000that maximizes Util while guaranteeing PROP is NP-hard. We also derive upper\u0000and lower bounds on the price of proportionality with respect to Util and Egal.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"30 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contested Logistics: A Game-Theoretic Approach 有争议的物流:博弈论方法
Pub Date : 2024-08-23 DOI: arxiv-2408.13057
Jakub Cerny, Chun Kai Ling, Darshan Chakrabarti, Jingwen Zhang, Gabriele Farina, Christian Kroer, Garud Iyengar
We introduce Contested Logistics Games, a variant of logistics problems thataccount for the presence of an adversary that can disrupt the movement of goodsin selected areas. We model this as a large two-player zero-sum one-shot gameplayed on a graph representation of the physical world, with the optimallogistics plans described by the (possibly randomized) Nash equilibria of thisgame. Our logistics model is fairly sophisticated, and is able to handlemultiple modes of transport and goods, accounting for possible storage of goodsin warehouses, as well as Leontief utilities based on demand satisfied. Weprove computational hardness results related to equilibrium finding and proposea practical double-oracle solver based on solving a series of best-responsemixed-integer linear programs. We experiment on both synthetic and real-worldmaps, demonstrating that our proposed method scales to reasonably large games.We also demonstrate the importance of explicitly modeling the capabilities ofthe adversary via ablation studies and comparisons with a naive logistics planbased on heuristics.
我们介绍了 "有争议的物流博弈",这是物流问题的一种变体,它考虑到了对手的存在,对手可能会扰乱选定区域的货物运输。我们将其建模为在物理世界的图表示上进行的大型双人零和一击博弈,最优物流计划由该博弈的纳什均衡(可能是随机的)来描述。我们的物流模型相当复杂,能够处理多种运输方式和货物,考虑到货物在仓库中的可能存储,以及基于需求满足的列昂惕夫效用。我们证明了与寻找平衡相关的计算硬度结果,并提出了一种基于求解一系列最佳响应混合整数线性程序的实用双奥义求解器。我们还通过消融研究以及与基于启发式的天真物流计划的比较,证明了明确模拟对手能力的重要性。
{"title":"Contested Logistics: A Game-Theoretic Approach","authors":"Jakub Cerny, Chun Kai Ling, Darshan Chakrabarti, Jingwen Zhang, Gabriele Farina, Christian Kroer, Garud Iyengar","doi":"arxiv-2408.13057","DOIUrl":"https://doi.org/arxiv-2408.13057","url":null,"abstract":"We introduce Contested Logistics Games, a variant of logistics problems that\u0000account for the presence of an adversary that can disrupt the movement of goods\u0000in selected areas. We model this as a large two-player zero-sum one-shot game\u0000played on a graph representation of the physical world, with the optimal\u0000logistics plans described by the (possibly randomized) Nash equilibria of this\u0000game. Our logistics model is fairly sophisticated, and is able to handle\u0000multiple modes of transport and goods, accounting for possible storage of goods\u0000in warehouses, as well as Leontief utilities based on demand satisfied. We\u0000prove computational hardness results related to equilibrium finding and propose\u0000a practical double-oracle solver based on solving a series of best-response\u0000mixed-integer linear programs. We experiment on both synthetic and real-world\u0000maps, demonstrating that our proposed method scales to reasonably large games.\u0000We also demonstrate the importance of explicitly modeling the capabilities of\u0000the adversary via ablation studies and comparisons with a naive logistics plan\u0000based on heuristics.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"29 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Social Welfare Maximization for Federated Learning with Network Effects 具有网络效应的联合学习的社会福利最大化
Pub Date : 2024-08-23 DOI: arxiv-2408.13223
Xiang Li, Yuan Luo, Bing Luo, Jianwei Huang
A proper mechanism design can help federated learning (FL) to achieve goodsocial welfare by coordinating self-interested clients through the learningprocess. However, existing mechanisms neglect the network effects of clientparticipation, leading to suboptimal incentives and social welfare. This paperaddresses this gap by exploring network effects in FL incentive mechanismdesign. We establish a theoretical model to analyze FL model performance andquantify the impact of network effects on heterogeneous client participation.Our analysis reveals the non-monotonic nature of FL network effects. Toleverage such effects, we propose a model trading and sharing (MTS) frameworkthat allows clients to obtain FL models through participation or purchase. Totackle heterogeneous clients' strategic behaviors, we further design a sociallyefficient model trading and sharing (SEMTS) mechanism. Our mechanism achievessocial welfare maximization solely through customer payments, withoutadditional incentive costs. Experimental results on an FL hardware prototypedemonstrate up to 148.86% improvement in social welfare compared to existingmechanisms.
适当的机制设计可以帮助联合学习(FL)通过协调学习过程中自利的客户实现良好的社会福利。然而,现有机制忽视了客户参与的网络效应,导致激励机制和社会福利达不到最优。本文通过探讨 FL 激励机制设计中的网络效应来弥补这一不足。我们建立了一个理论模型来分析 FL 模型的性能,并量化网络效应对异质客户参与的影响。为了利用这种效应,我们提出了模型交易和共享(MTS)框架,允许客户通过参与或购买获得 FL 模型。考虑到异质客户的战略行为,我们进一步设计了一种具有社会效率的模型交易和共享(SEMTS)机制。我们的机制仅通过客户付费来实现社会福利最大化,而不需要额外的激励成本。在 FL 硬件原型上的实验结果表明,与现有机制相比,社会福利提高了 148.86%。
{"title":"Social Welfare Maximization for Federated Learning with Network Effects","authors":"Xiang Li, Yuan Luo, Bing Luo, Jianwei Huang","doi":"arxiv-2408.13223","DOIUrl":"https://doi.org/arxiv-2408.13223","url":null,"abstract":"A proper mechanism design can help federated learning (FL) to achieve good\u0000social welfare by coordinating self-interested clients through the learning\u0000process. However, existing mechanisms neglect the network effects of client\u0000participation, leading to suboptimal incentives and social welfare. This paper\u0000addresses this gap by exploring network effects in FL incentive mechanism\u0000design. We establish a theoretical model to analyze FL model performance and\u0000quantify the impact of network effects on heterogeneous client participation.\u0000Our analysis reveals the non-monotonic nature of FL network effects. To\u0000leverage such effects, we propose a model trading and sharing (MTS) framework\u0000that allows clients to obtain FL models through participation or purchase. To\u0000tackle heterogeneous clients' strategic behaviors, we further design a socially\u0000efficient model trading and sharing (SEMTS) mechanism. Our mechanism achieves\u0000social welfare maximization solely through customer payments, without\u0000additional incentive costs. Experimental results on an FL hardware prototype\u0000demonstrate up to 148.86% improvement in social welfare compared to existing\u0000mechanisms.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"69 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weighted Envy-Freeness in House Allocation 房屋分配中的加权羡慕嫉妒恨
Pub Date : 2024-08-22 DOI: arxiv-2408.12523
Sijia Dai, Yankai Chen, Xiaowei Wu, Yicheng Xu, Yong Zhang
The classic house allocation problem involves assigning $m$ houses to $n$agents based on their utility functions, ensuring each agent receives exactlyone house. A key criterion in these problems is satisfying fairness constraintssuch as envy-freeness. We extend this problem by considering agents witharbitrary weights, focusing on the concept of weighted envy-freeness, which hasbeen extensively studied in fair division. We present a polynomial-timealgorithm to determine whether weighted envy-free allocations exist and, if so,to compute one. Since weighted envy-free allocations do not always exist, wealso investigate the potential of achieving such allocations through the use ofsubsidies. We provide several characterizations for weighted envy-freeableallocations (allocations that can be turned weighted envy-free by introducingsubsidies) and show that they do not always exist, which is different from theunweighted setting. Furthermore, we explore the existence of weightedenvy-freeable allocations in specific scenarios and outline the conditionsunder which they exist.
经典的房屋分配问题包括根据代理人的效用函数,将 $m$ 的房屋分配给 $n$ 的代理人,确保每个代理人都能得到一套房子。这些问题中的一个关键标准是满足公平约束,如不受嫉妒约束。我们通过考虑具有任意权重的代理来扩展这一问题,重点关注加权无嫉妒概念,该概念已在公平分配中得到广泛研究。我们提出了一种多项式时间算法来确定是否存在加权无嫉妒分配,如果存在,则计算出一个加权无嫉妒分配。由于加权无嫉妒分配并不总是存在,我们还研究了通过使用补贴来实现这种分配的可能性。我们为加权无嫉妒分配(通过引入补贴可以变成加权无嫉妒的分配)提供了几个特征,并证明它们并不总是存在,这与无加权设置不同。此外,我们还探讨了特定情况下加权免嫉妒分配的存在性,并概述了它们存在的条件。
{"title":"Weighted Envy-Freeness in House Allocation","authors":"Sijia Dai, Yankai Chen, Xiaowei Wu, Yicheng Xu, Yong Zhang","doi":"arxiv-2408.12523","DOIUrl":"https://doi.org/arxiv-2408.12523","url":null,"abstract":"The classic house allocation problem involves assigning $m$ houses to $n$\u0000agents based on their utility functions, ensuring each agent receives exactly\u0000one house. A key criterion in these problems is satisfying fairness constraints\u0000such as envy-freeness. We extend this problem by considering agents with\u0000arbitrary weights, focusing on the concept of weighted envy-freeness, which has\u0000been extensively studied in fair division. We present a polynomial-time\u0000algorithm to determine whether weighted envy-free allocations exist and, if so,\u0000to compute one. Since weighted envy-free allocations do not always exist, we\u0000also investigate the potential of achieving such allocations through the use of\u0000subsidies. We provide several characterizations for weighted envy-freeable\u0000allocations (allocations that can be turned weighted envy-free by introducing\u0000subsidies) and show that they do not always exist, which is different from the\u0000unweighted setting. Furthermore, we explore the existence of weighted\u0000envy-freeable allocations in specific scenarios and outline the conditions\u0000under which they exist.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"27 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Non-Borda elections under relaxed IIA conditions 放宽国际投资协定条件下的非博尔达选举
Pub Date : 2024-08-22 DOI: arxiv-2408.12661
Gabriel Gendler
Arrow's celebrated Impossibility Theorem asserts that an election rule, orSocial Welfare Function (SWF), between three or more candidates meeting a setof strict criteria cannot exist. Maskin suggests that Arrow's conditions forSWFs are too strict. In particular he weakens the "Independence of IrrelevantAlternatives" condition (IIA), which states that if in two elections, eachvoter's binary preference between candidates $c_i$ and $c_j$ is the same, thenthe two results must agree on their preference between $c_i$ and $c_j$.Instead, he proposes a modified IIA condition (MIIA). Under this condition, theresult between $c_i$ and $c_j$ can be affected not just by the order of $c_i$and $c_j$ in each voter's ranking, but also the number of candidates betweenthem. More candidates between $c_i$ and $c_j$ communicates some informationabout the strength of a voter's preference between the two candidates, andMaskin argues that it should be admissible evidence in deciding on a finalranking. We construct SWFs for three-party elections which meet the MIIA criterionalong with other sensibility criteria, but are far from being Borda elections(where each voter assigns a score to each candidate linearly according to theirranking). On the other hand, we give cases in which any SWF must be the Bordarule.
阿罗著名的 "不可能定理 "断言,在三个或三个以上符合一系列严格标准的候选人之间,不可能存在选举规则或社会福利函数(SWF)。马斯金认为,阿罗关于社会福利函数的条件过于严格。该条件规定,如果在两次选举中,每个选民对候选人 $c_i$ 和 $c_j$ 的二元偏好相同,那么两次选举结果必须在候选人 $c_i$ 和 $c_j$ 之间的偏好上达成一致。在这个条件下,$c_i$和$c_j$之间的结果不仅会受到每个投票人的排序中$c_i$和$c_j$的先后顺序的影响,还会受到它们之间候选者数量的影响。在$c_i$和$c_j$之间有更多的候选者,这传递了选民在两个候选者之间偏好强度的一些信息,马斯金认为,在决定最终排名时,这应该是可接受的证据。我们为三方选举构建了 SWFs,这些选举符合 MIIA 标准和其他感性标准,但远非博尔达选举(即每位选民根据每位候选人的排名线性地给每位候选人打分)。另一方面,我们也给出了一些情况,在这些情况下,任何 SWF 都必须是博达尔选举。
{"title":"Non-Borda elections under relaxed IIA conditions","authors":"Gabriel Gendler","doi":"arxiv-2408.12661","DOIUrl":"https://doi.org/arxiv-2408.12661","url":null,"abstract":"Arrow's celebrated Impossibility Theorem asserts that an election rule, or\u0000Social Welfare Function (SWF), between three or more candidates meeting a set\u0000of strict criteria cannot exist. Maskin suggests that Arrow's conditions for\u0000SWFs are too strict. In particular he weakens the \"Independence of Irrelevant\u0000Alternatives\" condition (IIA), which states that if in two elections, each\u0000voter's binary preference between candidates $c_i$ and $c_j$ is the same, then\u0000the two results must agree on their preference between $c_i$ and $c_j$.\u0000Instead, he proposes a modified IIA condition (MIIA). Under this condition, the\u0000result between $c_i$ and $c_j$ can be affected not just by the order of $c_i$\u0000and $c_j$ in each voter's ranking, but also the number of candidates between\u0000them. More candidates between $c_i$ and $c_j$ communicates some information\u0000about the strength of a voter's preference between the two candidates, and\u0000Maskin argues that it should be admissible evidence in deciding on a final\u0000ranking. We construct SWFs for three-party elections which meet the MIIA criterion\u0000along with other sensibility criteria, but are far from being Borda elections\u0000(where each voter assigns a score to each candidate linearly according to their\u0000ranking). On the other hand, we give cases in which any SWF must be the Borda\u0000rule.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Satisfaction and Regret in Stackelberg Games 斯塔克尔伯格博弈中的满意与遗憾
Pub Date : 2024-08-21 DOI: arxiv-2408.11340
Langford White, Duong Nguyen, Hung Nguyen
This paper introduces the new concept of (follower) satisfaction inStackelberg games and compares the standard Stackelberg game with itssatisfaction version. Simulation results are presented which suggest that thefollower adopting satisfaction generally increases leader utility. Thisimportant new result is proven for the case where leader strategies to committo are restricted to be deterministic (pure strategies). The paper thenaddresses the application of regret based algorithms to the Stackelbergproblem. Although it is known that the follower adopts a no-regret position ina Stackelberg solution, this is not generally the case for the leader. Thereport examines the convergence behaviour of unconditional and conditionalregret matching (RM) algorithms in the Stackelberg setting. The paper showsthat, in the examples considered, that these algorithms either converge to anypure Nash equilibria for the simultaneous move game, or to some mixedstrategies which do not have the "no-regret" property. In one case, convergenceof the conditional RM algorithm over both players to a solution "close" to theStackelberg case was observed. The paper argues that further research in thisarea, in particular when applied in the satisfaction setting could be fruitful.
本文在斯泰尔伯格博弈中引入了(追随者)满意度这一新概念,并将标准斯泰尔伯格博弈与其满意度版本进行了比较。本文给出的模拟结果表明,追随者采取满意策略通常会增加领导者的效用。这一重要的新结果是在领导者的承诺策略被限制为确定性策略(纯策略)的情况下证明的。然后,本文讨论了基于遗憾的算法在斯塔克尔伯格问题中的应用。虽然众所周知,追随者会采取无遗憾的立场来解决斯塔克尔伯格问题,但领导者一般不会这样做。本报告研究了无条件和有条件遗憾匹配(RM)算法在 Stackelberg 环境中的收敛行为。论文表明,在所考虑的例子中,这些算法要么收敛到同时移动博弈的任何纯纳什均衡,要么收敛到某些不具有 "无遗憾 "属性的混合战略。在一种情况下,观察到条件 RM 算法收敛于两个棋手的解,"接近 "斯塔克尔伯格情况。本文认为,在这一领域的进一步研究,特别是在满意设置中的应用,可能会取得丰硕成果。
{"title":"Satisfaction and Regret in Stackelberg Games","authors":"Langford White, Duong Nguyen, Hung Nguyen","doi":"arxiv-2408.11340","DOIUrl":"https://doi.org/arxiv-2408.11340","url":null,"abstract":"This paper introduces the new concept of (follower) satisfaction in\u0000Stackelberg games and compares the standard Stackelberg game with its\u0000satisfaction version. Simulation results are presented which suggest that the\u0000follower adopting satisfaction generally increases leader utility. This\u0000important new result is proven for the case where leader strategies to commit\u0000to are restricted to be deterministic (pure strategies). The paper then\u0000addresses the application of regret based algorithms to the Stackelberg\u0000problem. Although it is known that the follower adopts a no-regret position in\u0000a Stackelberg solution, this is not generally the case for the leader. The\u0000report examines the convergence behaviour of unconditional and conditional\u0000regret matching (RM) algorithms in the Stackelberg setting. The paper shows\u0000that, in the examples considered, that these algorithms either converge to any\u0000pure Nash equilibria for the simultaneous move game, or to some mixed\u0000strategies which do not have the \"no-regret\" property. In one case, convergence\u0000of the conditional RM algorithm over both players to a solution \"close\" to the\u0000Stackelberg case was observed. The paper argues that further research in this\u0000area, in particular when applied in the satisfaction setting could be fruitful.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"204 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Distortion of Committee Election with 1-Euclidean Preferences and Few Distance Queries 论 1 欧几里得偏好和少量距离查询下的委员会选举失真问题
Pub Date : 2024-08-21 DOI: arxiv-2408.11755
Dimitris Fotakis, Laurent Gourvès, Panagiotis Patsilinakos
We consider committee election of $k geq 3$ (out of $m geq k+1$)candidates, where the voters and the candidates are associated with locationson the real line. Each voter's cardinal preferences over candidates correspondto her distance to the candidate locations, and each voter's cardinalpreferences over committees is defined as her distance to the nearest candidateelected in the committee. We consider a setting where the true distances andthe locations are unknown. We can nevertheless have access to degradedinformation which consists of an order of candidates for each voter. Weinvestigate the best possible distortion (a worst-case performance criterion)wrt. the social cost achieved by deterministic committee election rules basedon ordinal preferences submitted by $n$ voters and few additional distancequeries. We show that for any $k geq 3$, the best possible distortion of anydeterministic algorithm that uses at most $k-3$ distance queries cannot bebounded by any function of $n$, $m$ and $k$. We present deterministicalgorithms for $k$-committee election with distortion of $O(n)$ with $O(k)$distance queries and $O(1)$ with $O(k log n)$ distance queries.
我们考虑由 $k geq 3$ (out of $m geq k+1$) 名候选人组成的委员会选举,其中选民和候选人都与实线上的位置相关联。每个选民对候选人的基本偏好对应于她到候选人位置的距离,每个选民对委员会的基本偏好定义为她到委员会中最近当选候选人的距离。我们考虑的是真实距离和位置未知的情况。然而,我们可以获取退化信息,其中包括每个选民的候选人顺序。我们研究了基于 $n$ 选民提交的顺序偏好和少量额外距离查询的确定性委员会选举规则与社会成本之间的最佳失真(最坏情况下的性能标准)。我们证明,对于任何 $k geq 3$,任何使用最多 $k-3$ 距离查询的确定性算法的最佳可能失真都不能以 $n$、$m$ 和 $k$ 的任何函数为界。我们提出了$k$委员会选举的确定性算法,在使用$O(k)$距离查询时失真度为$O(n)$,在使用$O(k log n)$距离查询时失真度为$O(1)$。
{"title":"On the Distortion of Committee Election with 1-Euclidean Preferences and Few Distance Queries","authors":"Dimitris Fotakis, Laurent Gourvès, Panagiotis Patsilinakos","doi":"arxiv-2408.11755","DOIUrl":"https://doi.org/arxiv-2408.11755","url":null,"abstract":"We consider committee election of $k geq 3$ (out of $m geq k+1$)\u0000candidates, where the voters and the candidates are associated with locations\u0000on the real line. Each voter's cardinal preferences over candidates correspond\u0000to her distance to the candidate locations, and each voter's cardinal\u0000preferences over committees is defined as her distance to the nearest candidate\u0000elected in the committee. We consider a setting where the true distances and\u0000the locations are unknown. We can nevertheless have access to degraded\u0000information which consists of an order of candidates for each voter. We\u0000investigate the best possible distortion (a worst-case performance criterion)\u0000wrt. the social cost achieved by deterministic committee election rules based\u0000on ordinal preferences submitted by $n$ voters and few additional distance\u0000queries. We show that for any $k geq 3$, the best possible distortion of any\u0000deterministic algorithm that uses at most $k-3$ distance queries cannot be\u0000bounded by any function of $n$, $m$ and $k$. We present deterministic\u0000algorithms for $k$-committee election with distortion of $O(n)$ with $O(k)$\u0000distance queries and $O(1)$ with $O(k log n)$ distance queries.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Computer Science and Game Theory
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1