首页 > 最新文献

Proceedings of the 2017 ACM Conference on Economics and Computation最新文献

英文 中文
Optimal Multi-Unit Mechanisms with Private Demands 考虑私人需求的最优多单元机制
Pub Date : 2017-04-17 DOI: 10.1145/3033274.3085122
Nikhil R. Devanur, Nima Haghpanah, Alexandros Psomas
We study a pricing problem that is motivated by the following examples. A cloud computing platform such as Amazon EC2 sells virtual machines to clients, each of who needs a different number of virtual machine hours. Similarly, cloud storage providers such as Dropbox have customers that require different amounts of storage. Software companies such as Microsoft sell software subscriptions that can have different levels of service. The levels could be the number of different documents you are allowed to create, or the number of hours you are allowed to use the software. Companies like Google and Microsoft sell API calls to artificial intelligence software such as face recognition, to other software developers. Video and mobile games are increasingly designed in such a way that one can pay for better access to certain features. Spotify and iTunes sell music subscription, and different people listen to different number of songs in a month. Cellphone service providers like AT&T and Verizon offer cellular phone call minutes and data. People have widely varying amounts of data consumption.
我们研究一个定价问题,它是由下面的例子驱动的。像Amazon EC2这样的云计算平台向客户出售虚拟机,每个客户需要不同数量的虚拟机小时。类似地,像Dropbox这样的云存储提供商也有需要不同存储容量的客户。像微软这样的软件公司销售的软件订阅可以有不同的服务级别。级别可以是允许创建不同文档的数量,或者允许使用软件的小时数。谷歌(Google)和微软(Microsoft)等公司向其他软件开发商出售人脸识别等人工智能软件的API调用。电子和手机游戏的设计越来越倾向于让玩家付费才能更好地使用某些功能。Spotify和iTunes销售音乐订阅,不同的人在一个月内听不同数量的歌曲。像AT&T和Verizon这样的手机服务提供商提供手机通话时间和数据。人们的数据消耗量差异很大。
{"title":"Optimal Multi-Unit Mechanisms with Private Demands","authors":"Nikhil R. Devanur, Nima Haghpanah, Alexandros Psomas","doi":"10.1145/3033274.3085122","DOIUrl":"https://doi.org/10.1145/3033274.3085122","url":null,"abstract":"We study a pricing problem that is motivated by the following examples. A cloud computing platform such as Amazon EC2 sells virtual machines to clients, each of who needs a different number of virtual machine hours. Similarly, cloud storage providers such as Dropbox have customers that require different amounts of storage. Software companies such as Microsoft sell software subscriptions that can have different levels of service. The levels could be the number of different documents you are allowed to create, or the number of hours you are allowed to use the software. Companies like Google and Microsoft sell API calls to artificial intelligence software such as face recognition, to other software developers. Video and mobile games are increasingly designed in such a way that one can pay for better access to certain features. Spotify and iTunes sell music subscription, and different people listen to different number of songs in a month. Cellphone service providers like AT&T and Verizon offer cellular phone call minutes and data. People have widely varying amounts of data consumption.","PeriodicalId":287551,"journal":{"name":"Proceedings of the 2017 ACM Conference on Economics and Computation","volume":"174 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132291747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Computing Equilibrium in Matching Markets 匹配市场中的均衡计算
Pub Date : 2017-03-30 DOI: 10.1145/3033274.3085150
S. Alaei, Pooya Jalaly, É. Tardos
Market equilibria of matching markets offer an intuitive and fair solution for matching problems without money with agents who have preferences over the items. Such a matching market can be viewed as a variation of Fisher market, albeit with rather peculiar preferences of agents. These preferences can be described by piece-wise linear concave (PLC) functions, which however, are not separable (due to each agent only asking for one item), are not monotone, and do not satisfy the gross substitute property-- increase in price of an item can result in increased demand for the item. Devanur and Kannan in FOCS 08 showed that market clearing prices can be found in polynomial time in markets with fixed number of items and general PLC preferences. They also consider Fischer markets with fixed number of agents (instead of fixed number of items), and give a polynomial time algorithm for this case if preferences are separable functions of the items, in addition to being PLC functions. Our main result is a polynomial time algorithm for finding market clearing prices in matching markets with fixed number of different agent preferences, despite that the utility corresponding to matching markets is not separable. We also give a simpler algorithm for the case of matching markets with fixed number of different items.
匹配市场的市场均衡为没有金钱的匹配问题和对物品有偏好的代理提供了一个直观和公平的解决方案。这种匹配市场可以看作是费雪市场的一种变体,尽管它具有相当特殊的代理人偏好。这些偏好可以用分段线性凹(PLC)函数来描述,然而,这些函数是不可分离的(由于每个代理只要求一件物品),不是单调的,并且不满足总替代属性——物品价格的增加会导致对该物品的需求增加。Devanur和Kannan在fos 2008中表明,在具有固定数量的项目和一般PLC偏好的市场中,市场出清价格可以在多项式时间内找到。他们还考虑了具有固定数量的代理(而不是固定数量的项目)的Fischer市场,并给出了一个多项式时间算法,如果除了作为PLC函数之外,偏好是项目的可分离函数。我们的主要成果是一个多项式时间算法,用于在具有固定数量不同代理偏好的匹配市场中寻找市场出清价格,尽管匹配市场对应的效用是不可分离的。对于具有固定数量的不同商品的匹配市场,我们也给出了一个更简单的算法。
{"title":"Computing Equilibrium in Matching Markets","authors":"S. Alaei, Pooya Jalaly, É. Tardos","doi":"10.1145/3033274.3085150","DOIUrl":"https://doi.org/10.1145/3033274.3085150","url":null,"abstract":"Market equilibria of matching markets offer an intuitive and fair solution for matching problems without money with agents who have preferences over the items. Such a matching market can be viewed as a variation of Fisher market, albeit with rather peculiar preferences of agents. These preferences can be described by piece-wise linear concave (PLC) functions, which however, are not separable (due to each agent only asking for one item), are not monotone, and do not satisfy the gross substitute property-- increase in price of an item can result in increased demand for the item. Devanur and Kannan in FOCS 08 showed that market clearing prices can be found in polynomial time in markets with fixed number of items and general PLC preferences. They also consider Fischer markets with fixed number of agents (instead of fixed number of items), and give a polynomial time algorithm for this case if preferences are separable functions of the items, in addition to being PLC functions. Our main result is a polynomial time algorithm for finding market clearing prices in matching markets with fixed number of different agent preferences, despite that the utility corresponding to matching markets is not separable. We also give a simpler algorithm for the case of matching markets with fixed number of different items.","PeriodicalId":287551,"journal":{"name":"Proceedings of the 2017 ACM Conference on Economics and Computation","volume":"76 1-2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132812281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Forecast Aggregation 预测聚合
Pub Date : 2017-03-16 DOI: 10.2139/ssrn.2934104
Itai Arieli, Y. Babichenko, Rann Smorodinsky
Bayesian experts with a common prior that are exposed to different evidence possibly make contradicting probabilistic forecasts. A policy maker who receives the forecasts must aggregate them in the best way possible. This is a challenge whenever the policy maker is not familiar with the prior nor the model and evidence available to the experts. We propose a model of non-Bayesian forecast aggregation and adapt the notion of regret as a means for evaluating the policy maker's performance. Whenever experts are Blackwell ordered taking a weighted average of the two forecasts, the weight of which is proportional to its precision (the reciprocal of the variance), is optimal. The resulting regret is equal 1/8(5√ 5-11) approx 0.0225425, which is 3 to 4 times better than naive approaches such as choosing one expert at random or taking the non-weighted average.
拥有共同先验的贝叶斯专家在接触不同证据的情况下,可能会做出相互矛盾的概率预测。收到预测的政策制定者必须尽可能以最佳方式汇总这些预测。每当政策制定者不熟悉先验,也不熟悉专家可用的模型和证据时,这就是一个挑战。我们提出了一个非贝叶斯预测聚合模型,并将后悔的概念作为评估决策者绩效的一种手段。当专家被布莱克威尔命令时,对两种预测进行加权平均,其权重与其精度(方差的倒数)成正比,是最优的。由此产生的遗憾等于1/8(5√5-11),约为0.0225425,这比随机选择一位专家或取非加权平均值等朴素方法好3到4倍。
{"title":"Forecast Aggregation","authors":"Itai Arieli, Y. Babichenko, Rann Smorodinsky","doi":"10.2139/ssrn.2934104","DOIUrl":"https://doi.org/10.2139/ssrn.2934104","url":null,"abstract":"Bayesian experts with a common prior that are exposed to different evidence possibly make contradicting probabilistic forecasts. A policy maker who receives the forecasts must aggregate them in the best way possible. This is a challenge whenever the policy maker is not familiar with the prior nor the model and evidence available to the experts. We propose a model of non-Bayesian forecast aggregation and adapt the notion of regret as a means for evaluating the policy maker's performance. Whenever experts are Blackwell ordered taking a weighted average of the two forecasts, the weight of which is proportional to its precision (the reciprocal of the variance), is optimal. The resulting regret is equal 1/8(5√ 5-11) approx 0.0225425, which is 3 to 4 times better than naive approaches such as choosing one expert at random or taking the non-weighted average.","PeriodicalId":287551,"journal":{"name":"Proceedings of the 2017 ACM Conference on Economics and Computation","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115374288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Approximation Algorithms for Maximin Fair Division 最大化公平除法的近似算法
Pub Date : 2017-03-06 DOI: 10.1145/3033274.3085136
Siddharth Barman, S. K. Murthy
We consider the problem of dividing indivisible goods fairly among n agents who have additive and submodular valuations for the goods. Our fairness guarantees are in terms of the maximin share, that is defined to be the maximum value that an agent can ensure for herself, if she were to partition the goods into n bundles, and then receive a minimum valued bundle. Since maximin fair allocations (i.e., allocations in which each agent gets at least her maximin share) do not always exist, prior work has focussed on approximation results that aim to find allocations in which the value of the bundle allocated to each agent is (multiplicatively) as close to her maximin share as possible. In particular, Procaccia and Wang (2014) along with Amanatidis et al. (2015) have shown that under additive valuations a 2/3-approximate maximin fair allocation always exists and can be found in polynomial time. We complement these results by developing a simple and efficient algorithm that achieves the same approximation guarantee. Furthermore, we initiate the study of approximate maximin fair division under submodular valuations. Specifically, we show that when the valuations of the agents are nonnegative, monotone, and submodular, then a 1/10-approximate maximin fair allocation is guaranteed to exist. In fact, we show that such an allocation can be efficiently found by using a simple round-robin algorithm. A technical contribution of the paper is to analyze the performance of this combinatorial algorithm by employing the concept of multilinear extensions.
我们考虑在n个对商品具有加性和次模性估价的代理之间公平分配不可分割商品的问题。我们的公平保证是用最大份额来表示的,它被定义为如果一个代理将商品分成n个包,然后得到一个价值最小的包,那么她能为自己保证的最大价值。由于最大公平分配(即每个代理至少获得其最大份额的分配)并不总是存在,因此先前的工作主要集中在近似结果上,旨在找到分配给每个代理的bundle的值(乘上)尽可能接近其最大份额的分配。特别是,Procaccia和Wang(2014)以及Amanatidis等人(2015)已经表明,在可加性估值下,2/3近似最大公平分配总是存在的,并且可以在多项式时间内找到。我们通过开发一种简单有效的算法来补充这些结果,以实现相同的近似保证。在此基础上,研究了次模估值下的近似极大值公平分割问题。具体地说,我们证明了当代理的估值是非负的、单调的和次模的,那么一个1/10近似的最大公平分配是保证存在的。事实上,我们证明了这样的分配可以通过使用简单的轮循算法有效地找到。本文的一个技术贡献是利用多线性扩展的概念分析了该组合算法的性能。
{"title":"Approximation Algorithms for Maximin Fair Division","authors":"Siddharth Barman, S. K. Murthy","doi":"10.1145/3033274.3085136","DOIUrl":"https://doi.org/10.1145/3033274.3085136","url":null,"abstract":"We consider the problem of dividing indivisible goods fairly among n agents who have additive and submodular valuations for the goods. Our fairness guarantees are in terms of the maximin share, that is defined to be the maximum value that an agent can ensure for herself, if she were to partition the goods into n bundles, and then receive a minimum valued bundle. Since maximin fair allocations (i.e., allocations in which each agent gets at least her maximin share) do not always exist, prior work has focussed on approximation results that aim to find allocations in which the value of the bundle allocated to each agent is (multiplicatively) as close to her maximin share as possible. In particular, Procaccia and Wang (2014) along with Amanatidis et al. (2015) have shown that under additive valuations a 2/3-approximate maximin fair allocation always exists and can be found in polynomial time. We complement these results by developing a simple and efficient algorithm that achieves the same approximation guarantee. Furthermore, we initiate the study of approximate maximin fair division under submodular valuations. Specifically, we show that when the valuations of the agents are nonnegative, monotone, and submodular, then a 1/10-approximate maximin fair allocation is guaranteed to exist. In fact, we show that such an allocation can be efficiently found by using a simple round-robin algorithm. A technical contribution of the paper is to analyze the performance of this combinatorial algorithm by employing the concept of multilinear extensions.","PeriodicalId":287551,"journal":{"name":"Proceedings of the 2017 ACM Conference on Economics and Computation","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122703122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 135
Truth and Regret in Online Scheduling 在线调度中的真相与遗憾
Pub Date : 2017-03-01 DOI: 10.1145/3033274.3085119
Shuchi Chawla, Nikhil R. Devanur, Janardhan Kulkarni, Rad Niazadeh
We consider a scheduling problem where a cloud service provider has multiple units of a resource available over time. Selfish clients submit jobs, each with an arrival time, deadline, length, and value. The service provider's goal is to implement a truthful online mechanism for scheduling jobs so as to maximize the social welfare of the schedule. Recent work shows that under a stochastic assumption on job arrivals, there is a single-parameter family of mechanisms that achieves near-optimal social welfare. We show that given any such family of near-optimal online mechanisms, there exists an online mechanism that in the worst case performs nearly as well as the best of the given mechanisms. Our mechanism is truthful whenever the mechanisms in the given family are truthful and prompt, and achieves optimal (within constant factors) regret. We model the problem of competing against a family of online scheduling mechanisms as one of learning from expert advice. A primary challenge is that any scheduling decisions we make affect not only the payoff at the current step, but also the resource availability and payoffs in future steps. Furthermore, switching from one algorithm (a.k.a. expert) to another in an online fashion is challenging both because it requires synchronization with the state of the latter algorithm as well as because it affects the incentive structure of the algorithms. We further show how to adapt our algorithm to a non-clairvoyant setting where job lengths are unknown until jobs are run to completion. Once again, in this setting, we obtain truthfulness along with asymptotically optimal regret (within polylogarithmic factors).
我们考虑一个调度问题,其中云服务提供商在一段时间内有多个可用的资源单元。自私的客户端提交作业,每个作业都有到达时间、截止日期、长度和价值。服务提供商的目标是实现一个真实的在线作业调度机制,从而使调度的社会福利最大化。最近的研究表明,在对工作到来的随机假设下,存在一个单参数的机制家族,可以实现接近最优的社会福利。我们证明了给定任何这样的近最优在线机制族,存在一个在线机制,在最坏的情况下,它的性能几乎与给定机制中的最佳机制一样好。我们的机制是真实的,只要在给定的家庭机制是真实和及时的,并达到最佳(在恒定的因素)后悔。我们将与一系列在线调度机制竞争的问题建模为从专家建议中学习的问题之一。一个主要的挑战是,我们做出的任何调度决策不仅会影响当前步骤的收益,还会影响未来步骤的资源可用性和收益。此外,以在线方式从一种算法(又名专家)切换到另一种算法是具有挑战性的,因为它需要与后一种算法的状态同步,也因为它影响算法的激励结构。我们进一步展示了如何使我们的算法适应非透视设置,其中作业长度在作业运行完成之前是未知的。再一次,在这种情况下,我们获得了真实性和渐近最优后悔(在多对数因素内)。
{"title":"Truth and Regret in Online Scheduling","authors":"Shuchi Chawla, Nikhil R. Devanur, Janardhan Kulkarni, Rad Niazadeh","doi":"10.1145/3033274.3085119","DOIUrl":"https://doi.org/10.1145/3033274.3085119","url":null,"abstract":"We consider a scheduling problem where a cloud service provider has multiple units of a resource available over time. Selfish clients submit jobs, each with an arrival time, deadline, length, and value. The service provider's goal is to implement a truthful online mechanism for scheduling jobs so as to maximize the social welfare of the schedule. Recent work shows that under a stochastic assumption on job arrivals, there is a single-parameter family of mechanisms that achieves near-optimal social welfare. We show that given any such family of near-optimal online mechanisms, there exists an online mechanism that in the worst case performs nearly as well as the best of the given mechanisms. Our mechanism is truthful whenever the mechanisms in the given family are truthful and prompt, and achieves optimal (within constant factors) regret. We model the problem of competing against a family of online scheduling mechanisms as one of learning from expert advice. A primary challenge is that any scheduling decisions we make affect not only the payoff at the current step, but also the resource availability and payoffs in future steps. Furthermore, switching from one algorithm (a.k.a. expert) to another in an online fashion is challenging both because it requires synchronization with the state of the latter algorithm as well as because it affects the incentive structure of the algorithms. We further show how to adapt our algorithm to a non-clairvoyant setting where job lengths are unknown until jobs are run to completion. Once again, in this setting, we obtain truthfulness along with asymptotically optimal regret (within polylogarithmic factors).","PeriodicalId":287551,"journal":{"name":"Proceedings of the 2017 ACM Conference on Economics and Computation","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130112859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
The Scope of Sequential Screening with Ex Post Participation Constraints 具有事后参与约束的顺序筛选的范围
Pub Date : 2017-02-18 DOI: 10.2139/ssrn.3569697
D. Bergemann, Francisco Castro, G. Weintraub
We study the classic sequential screening problem under ex-post participation constraints. Thus the seller is required to satisfy buyers' ex-post participation constraints. A leading example is the online display advertising market, in which publishers frequently cannot use up-front fees and instead use transaction-contingent fees. We establish when the optimal selling mechanism is static (buyers are not screened) or dynamic (buyers are screened), and obtain a full characterization of such contracts. We begin by analyzing our model within the leading case of exponential distributions with two types. We provide a necessary and sufficient condition for the optimality of the static contract. If the means of the two types are sufficiently close, then no screening is optimal. If they are sufficiently apart, then a dynamic contract becomes optimal. Importantly, the latter contract randomizes the low type buyer while giving a deterministic allocation to the high type. It also makes the low type worse-off and the high type better-off compared to the contract the seller would offer if he knew the buyer's type. Our main result establishes a necessary and sufficient condition under which the static contract is optimal for general distributions. We show that when this condition fails, a dynamic contract that randomizes the low type buyer is optimal.
研究了在事后参与约束下的经典顺序筛选问题。因此,卖方必须满足买方事后参与约束。一个典型的例子是在线展示广告市场,在这个市场中,出版商经常不能使用预付费用,而是使用交易条件费用。我们确定了最优销售机制是静态的(买家不被筛选)还是动态的(买家被筛选),并获得了这类合约的完整特征。我们首先在两种类型的指数分布的主要情况下分析我们的模型。给出了静态契约最优性的充分必要条件。如果这两种类型的平均值足够接近,那么没有筛查是最佳的。如果它们足够分开,那么动态契约就是最优的。重要的是,后者将低类型买家随机化,同时对高类型买家进行确定性分配。如果卖方知道买方的类型,那么他所提供的合同也会使低类型的情况更糟,而高类型的情况更好。我们的主要结果建立了静态契约对于一般分布是最优的充要条件。我们证明了当这个条件不满足时,随机化低类型买家的动态契约是最优的。
{"title":"The Scope of Sequential Screening with Ex Post Participation Constraints","authors":"D. Bergemann, Francisco Castro, G. Weintraub","doi":"10.2139/ssrn.3569697","DOIUrl":"https://doi.org/10.2139/ssrn.3569697","url":null,"abstract":"We study the classic sequential screening problem under ex-post participation constraints. Thus the seller is required to satisfy buyers' ex-post participation constraints. A leading example is the online display advertising market, in which publishers frequently cannot use up-front fees and instead use transaction-contingent fees. We establish when the optimal selling mechanism is static (buyers are not screened) or dynamic (buyers are screened), and obtain a full characterization of such contracts. We begin by analyzing our model within the leading case of exponential distributions with two types. We provide a necessary and sufficient condition for the optimality of the static contract. If the means of the two types are sufficiently close, then no screening is optimal. If they are sufficiently apart, then a dynamic contract becomes optimal. Importantly, the latter contract randomizes the low type buyer while giving a deterministic allocation to the high type. It also makes the low type worse-off and the high type better-off compared to the contract the seller would offer if he knew the buyer's type. Our main result establishes a necessary and sufficient condition under which the static contract is optimal for general distributions. We show that when this condition fails, a dynamic contract that randomizes the low type buyer is optimal.","PeriodicalId":287551,"journal":{"name":"Proceedings of the 2017 ACM Conference on Economics and Computation","volume":"50 17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122215809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 38
Theoretical and Practical Advances on Smoothing for Extensive-Form Games 广义博弈平滑的理论与实践进展
Pub Date : 2017-02-16 DOI: 10.1145/3033274.3085131
Christian Kroer, K. Waugh, F. Kılınç-Karzan, T. Sandholm
Sparse iterative methods, in particular first-order methods, are known to be among the most effective in solving large-scale two-player zero-sum extensive-form games. The convergence rates of these methods depend heavily on the properties of the distance-generating function that they are based on. We investigate the acceleration of first-order methods for solving extensive-form games through better design of the dilated entropy function---a class of distance-generating functions related to the domains associated with the extensive-form games. By introducing a new weighting scheme for the dilated entropy function, we develop the first distance-generating function for the strategy spaces of sequential games that only a logarithmic dependence on the branching factor of the player. This result improves the convergence rate of several first-order methods by a factor of Ω(bdd), where b is the branching factor of the player, and d is the depth of the game tree. Thus far, counterfactual regret minimization methods have been faster in practice, and more popular, than first-order methods despite their theoretically inferior convergence rates. Using our new weighting scheme and practical tuning we show that, for the first time, the excessive gap technique can be made faster than the fastest counterfactual regret minimization algorithm, CFRP, in practice.
已知稀疏迭代方法,特别是一阶方法,是解决大规模二人零和广泛形式博弈最有效的方法之一。这些方法的收敛速度在很大程度上取决于它们所基于的距离生成函数的性质。我们通过更好地设计扩展熵函数(一类与广泛形式博弈相关的域相关的距离生成函数)来研究求解广泛形式博弈的一阶方法的加速。通过引入扩展熵函数的一种新的加权格式,我们开发了序列博弈策略空间的第一个距离生成函数,该策略空间仅对玩家的分支因子有对数依赖。该结果通过Ω(bdd)因子提高了几种一阶方法的收敛速度,其中b是玩家的分支因子,d是游戏树的深度。到目前为止,反事实遗憾最小化方法在实践中比一阶方法更快,更受欢迎,尽管理论上它们的收敛速度较低。利用我们的新加权方案和实际调整,我们首次表明,在实践中,过度间隙技术可以比最快的反事实遗憾最小化算法(CFRP)更快。
{"title":"Theoretical and Practical Advances on Smoothing for Extensive-Form Games","authors":"Christian Kroer, K. Waugh, F. Kılınç-Karzan, T. Sandholm","doi":"10.1145/3033274.3085131","DOIUrl":"https://doi.org/10.1145/3033274.3085131","url":null,"abstract":"Sparse iterative methods, in particular first-order methods, are known to be among the most effective in solving large-scale two-player zero-sum extensive-form games. The convergence rates of these methods depend heavily on the properties of the distance-generating function that they are based on. We investigate the acceleration of first-order methods for solving extensive-form games through better design of the dilated entropy function---a class of distance-generating functions related to the domains associated with the extensive-form games. By introducing a new weighting scheme for the dilated entropy function, we develop the first distance-generating function for the strategy spaces of sequential games that only a logarithmic dependence on the branching factor of the player. This result improves the convergence rate of several first-order methods by a factor of Ω(bdd), where b is the branching factor of the player, and d is the depth of the game tree. Thus far, counterfactual regret minimization methods have been faster in practice, and more popular, than first-order methods despite their theoretically inferior convergence rates. Using our new weighting scheme and practical tuning we show that, for the first time, the excessive gap technique can be made faster than the fastest counterfactual regret minimization algorithm, CFRP, in practice.","PeriodicalId":287551,"journal":{"name":"Proceedings of the 2017 ACM Conference on Economics and Computation","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125089971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
A "Quantal Regret" Method for Structural Econometrics in Repeated Games 重复博弈中结构计量经济学的“定量后悔”方法
Pub Date : 2017-02-14 DOI: 10.1145/3033274.3085111
N. Nisan, Gali Noti
We suggest a general method for inferring players' values from their actions in repeated games. The method extends and improves upon the recent suggestion of (Nekipelov et al., EC 2015) and is based on the assumption that players are more likely to exhibit sequences of actions that have lower regret. We evaluate this "quantal-regret" method on two different datasets from experiments of repeated games with controlled player values: those of (Selten and Chmura, AER 2008) on a variety of two-player 2x2 games and our own experiment on ad-auctions (Noti et al., WWW 2014). We find that the quantal-regret method is consistently and significantly more precise than either "classic" econometric methods that are based on Nash equilibria, or the "min-regret" method of (Nekipelov et al., EC 2015).
我们建议一种从玩家在重复游戏中的行为推断其价值的通用方法。该方法扩展并改进了Nekipelov等人(EC 2015)的最新建议,并基于玩家更有可能表现出具有较低后悔的行动序列的假设。我们在两个不同的数据集上评估了这种“量子后悔”方法,这些数据集来自控制玩家价值的重复游戏实验:一个是(Selten和Chmura, AER 2008)关于各种双人2x2游戏的数据集,另一个是我们自己的广告拍卖实验(Noti et al., WWW 2014)。我们发现,与基于纳什均衡的“经典”计量经济学方法或(Nekipelov et al., EC 2015)的“最小后悔”方法相比,量子后悔方法始终且明显更精确。
{"title":"A \"Quantal Regret\" Method for Structural Econometrics in Repeated Games","authors":"N. Nisan, Gali Noti","doi":"10.1145/3033274.3085111","DOIUrl":"https://doi.org/10.1145/3033274.3085111","url":null,"abstract":"We suggest a general method for inferring players' values from their actions in repeated games. The method extends and improves upon the recent suggestion of (Nekipelov et al., EC 2015) and is based on the assumption that players are more likely to exhibit sequences of actions that have lower regret. We evaluate this \"quantal-regret\" method on two different datasets from experiments of repeated games with controlled player values: those of (Selten and Chmura, AER 2008) on a variety of two-player 2x2 games and our own experiment on ad-auctions (Noti et al., WWW 2014). We find that the quantal-regret method is consistently and significantly more precise than either \"classic\" econometric methods that are based on Nash equilibria, or the \"min-regret\" method of (Nekipelov et al., EC 2015).","PeriodicalId":287551,"journal":{"name":"Proceedings of the 2017 ACM Conference on Economics and Computation","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129351863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Information Sharing and Privacy in Networks 网络中的信息共享与隐私
Pub Date : 2017-02-01 DOI: 10.1145/3033274.3085095
R. Gradwohl
Users of social, economic, or medical networks share personal information in exchange for tangible benefits, but may be harmed by leakage and misuse of the shared information. I analyze the effect of enhancing privacy in the presence of two opposing forces: network effects and informational interdependencies. I show that two privacy enhancements---reducing the likelihood of leakage and decreasing the level of informational interdependence---have opposite effects on the volume of information sharing, and that although they always seem beneficial to non-strategic users, both privacy enhancements may backfire when users are strategic.
社会、经济或医疗网络的用户共享个人信息以换取有形利益,但可能因共享信息的泄露和滥用而受到损害。我分析了在两种相反的力量存在下增强隐私的效果:网络效应和信息相互依赖。我展示了两种隐私增强——减少泄漏的可能性和降低信息相互依赖的水平——对信息共享的数量有相反的影响,尽管它们似乎总是对非战略用户有益,但当用户是战略用户时,这两种隐私增强都可能适得其反。
{"title":"Information Sharing and Privacy in Networks","authors":"R. Gradwohl","doi":"10.1145/3033274.3085095","DOIUrl":"https://doi.org/10.1145/3033274.3085095","url":null,"abstract":"Users of social, economic, or medical networks share personal information in exchange for tangible benefits, but may be harmed by leakage and misuse of the shared information. I analyze the effect of enhancing privacy in the presence of two opposing forces: network effects and informational interdependencies. I show that two privacy enhancements---reducing the likelihood of leakage and decreasing the level of informational interdependence---have opposite effects on the volume of information sharing, and that although they always seem beneficial to non-strategic users, both privacy enhancements may backfire when users are strategic.","PeriodicalId":287551,"journal":{"name":"Proceedings of the 2017 ACM Conference on Economics and Computation","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128595615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Simple Approximate Equilibria in Games with Many Players 多人博弈中的简单近似均衡
Pub Date : 2017-01-27 DOI: 10.1145/3033274.3085110
Itai Arieli, Y. Babichenko
We consider ε-equilibria notions for a constant value of ε in n-player m-action games, where m is a constant. We focus on the following question: What is the largest grid size over the mixed strategies such that ε-equilibrium is guaranteed to exist over this grid. For Nash equilibrium, we prove that constant grid size (that depends on ε and m, but not on n) is sufficient to guarantee the existence of a weak approximate equilibrium. This result implies a polynomial (in the input) algorithm for a weak approximate equilibrium. For approximate Nash equilibrium we introduce a closely related question and prove its equivalence to the well-known Beck-Fiala conjecture from discrepancy theory. To the best of our knowledge, this is the first result that introduces a connection between game theory and discrepancy theory. For a correlated equilibrium, we prove a O(1 over log n) lower-bound on the grid size, which matches the known upper bound of Ω(1 over log n). Our result implies an Ω(log n) lower bound on the rate of convergence of any dynamic to approximate correlated (and coarse correlated) equilibrium. Again, this lower bound matches the O(log n) upper bound that is achieved by regret minimizing algorithms.
我们考虑在n人m行动博弈中ε的恒定值的ε均衡概念,其中m是一个常数。我们关注以下问题:在混合策略中,保证ε-均衡存在的最大网格尺寸是多少?对于纳什均衡,我们证明了恒定的网格大小(取决于ε和m,但不取决于n)足以保证弱近似均衡的存在。这个结果意味着一个多项式(在输入)算法弱近似平衡。对于近似纳什均衡,我们引入了一个密切相关的问题,并证明了它与差异理论中著名的Beck-Fiala猜想的等价性。据我们所知,这是引入博弈论和差异理论之间联系的第一个结果。对于相关平衡,我们证明了网格大小的O(1 / log n)下界,它与已知的Ω(1 / log n)上界相匹配。我们的结果暗示了任何动态到近似相关(和粗相关)平衡的收敛速度的Ω(log n)下界。同样,这个下界与0 (log n)的上界相匹配,这个上界是通过遗憾最小化算法实现的。
{"title":"Simple Approximate Equilibria in Games with Many Players","authors":"Itai Arieli, Y. Babichenko","doi":"10.1145/3033274.3085110","DOIUrl":"https://doi.org/10.1145/3033274.3085110","url":null,"abstract":"We consider ε-equilibria notions for a constant value of ε in n-player m-action games, where m is a constant. We focus on the following question: What is the largest grid size over the mixed strategies such that ε-equilibrium is guaranteed to exist over this grid. For Nash equilibrium, we prove that constant grid size (that depends on ε and m, but not on n) is sufficient to guarantee the existence of a weak approximate equilibrium. This result implies a polynomial (in the input) algorithm for a weak approximate equilibrium. For approximate Nash equilibrium we introduce a closely related question and prove its equivalence to the well-known Beck-Fiala conjecture from discrepancy theory. To the best of our knowledge, this is the first result that introduces a connection between game theory and discrepancy theory. For a correlated equilibrium, we prove a O(1 over log n) lower-bound on the grid size, which matches the known upper bound of Ω(1 over log n). Our result implies an Ω(log n) lower bound on the rate of convergence of any dynamic to approximate correlated (and coarse correlated) equilibrium. Again, this lower bound matches the O(log n) upper bound that is achieved by regret minimizing algorithms.","PeriodicalId":287551,"journal":{"name":"Proceedings of the 2017 ACM Conference on Economics and Computation","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122282180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Proceedings of the 2017 ACM Conference on Economics and Computation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1