首页 > 最新文献

Proceedings of the 2019 ACM Conference on Economics and Computation最新文献

英文 中文
Seeding with Costly Network Information 用昂贵的网络信息播种
Pub Date : 2019-05-10 DOI: 10.2139/ssrn.3386417
Dean Eckles, Hossein Esfandiari, Elchanan Mossel, M. Amin Rahimian
Seeding the most influential individuals based on the contact structure can substantially enhance the extent of a spread over the social network. Most of the influence maximization literature assumes the knowledge of the entire network graph. However, in practice, obtaining full knowledge of the network structure is very costly. We propose polynomial-time algorithms that provide almost tight approximation guarantees using a bounded number of queries to the graph structure. We also provide impossibility results to lower bound the query complexity and show tightness of our guarantees.
在联系结构的基础上播种最具影响力的个人可以大大提高在社会网络中的传播程度。大多数影响最大化的文献假设了整个网络图的知识。然而,在实践中,获得网络结构的全部知识是非常昂贵的。我们提出了多项式时间算法,该算法使用对图结构的有限数量的查询来提供几乎严格的近似保证。我们还提供了不可能结果,以降低查询复杂度,并显示我们保证的严密性。
{"title":"Seeding with Costly Network Information","authors":"Dean Eckles, Hossein Esfandiari, Elchanan Mossel, M. Amin Rahimian","doi":"10.2139/ssrn.3386417","DOIUrl":"https://doi.org/10.2139/ssrn.3386417","url":null,"abstract":"Seeding the most influential individuals based on the contact structure can substantially enhance the extent of a spread over the social network. Most of the influence maximization literature assumes the knowledge of the entire network graph. However, in practice, obtaining full knowledge of the network structure is very costly. We propose polynomial-time algorithms that provide almost tight approximation guarantees using a bounded number of queries to the graph structure. We also provide impossibility results to lower bound the query complexity and show tightness of our guarantees.","PeriodicalId":416173,"journal":{"name":"Proceedings of the 2019 ACM Conference on Economics and Computation","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133040697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management 具有凸成本函数的结构化mdp学习:改进的库存管理后悔边界
Pub Date : 2019-05-10 DOI: 10.1145/3328526.3329565
Shipra Agrawal, Randy Jia
We consider a stochastic inventory control problem under censored demands, lost sales, and positive lead times. This is a fundamental problem in inventory management, with significant literature establishing near-optimality of a simple class of policies called "base-stock policies" for the underlying Markov Decision Process (MDP), as well as convexity of long run average-cost under those policies. We consider the relatively less studied problem of designing a learning algorithm for this problem when the underlying demand distribution is unknown. The goal is to bound regret of the algorithm when compared to the best base-stock policy. We utilize the convexity properties and a newly derived bound on bias of base-stock policies to establish a connection to stochastic convex bandit optimization. Our main contribution is a learning algorithm with a regret bound of ~O (L√T+D) for the inventory control problem. Here L is the fixed and known lead time, and D is an unknown parameter of the demand distribution described roughly as the number of time steps needed to generate enough demand for depleting one unit of inventory. Notably, even though the state space of the underlying MDP is continuous and L-dimensional, our regret bounds depend linearly on L. Our results significantly improve the previously best known regret bounds for this problem where the dependence on L was exponential and many further assumptions on demand distribution were required. The techniques presented here may be of independent interest for other settings that involve large structured MDPs but with convex cost functions.
我们考虑一个随机库存控制问题在审查需求,销售损失,和积极的交货时间。这是库存管理中的一个基本问题,有大量文献为基础马尔可夫决策过程(MDP)建立了一类简单的策略(称为“基本库存策略”)的近最优性,以及这些策略下长期平均成本的凸性。我们考虑了一个相对较少研究的问题,即在潜在需求分布未知的情况下,为这个问题设计一个学习算法。目标是在与最佳基本库存策略进行比较时约束算法的遗憾。我们利用基-股策略的凸性性质和新导出的偏置界来建立与随机凸群优化的联系。我们的主要贡献是一个具有~O (L√T+D)遗憾界的学习算法来解决库存控制问题。这里L是固定且已知的交货时间,D是需求分布的未知参数,大致描述为消耗一单位库存所需的产生足够需求所需的时间步数。值得注意的是,即使底层MDP的状态空间是连续的和L维的,我们的遗憾界仍然线性地依赖于L。我们的结果显著地改进了这个问题之前最著名的遗憾界,其中对L的依赖是指数的,并且需要对需求分布进行许多进一步的假设。这里介绍的技术可能对涉及大型结构化mdp但具有凸成本函数的其他设置具有独立的兴趣。
{"title":"Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management","authors":"Shipra Agrawal, Randy Jia","doi":"10.1145/3328526.3329565","DOIUrl":"https://doi.org/10.1145/3328526.3329565","url":null,"abstract":"We consider a stochastic inventory control problem under censored demands, lost sales, and positive lead times. This is a fundamental problem in inventory management, with significant literature establishing near-optimality of a simple class of policies called \"base-stock policies\" for the underlying Markov Decision Process (MDP), as well as convexity of long run average-cost under those policies. We consider the relatively less studied problem of designing a learning algorithm for this problem when the underlying demand distribution is unknown. The goal is to bound regret of the algorithm when compared to the best base-stock policy. We utilize the convexity properties and a newly derived bound on bias of base-stock policies to establish a connection to stochastic convex bandit optimization. Our main contribution is a learning algorithm with a regret bound of ~O (L√T+D) for the inventory control problem. Here L is the fixed and known lead time, and D is an unknown parameter of the demand distribution described roughly as the number of time steps needed to generate enough demand for depleting one unit of inventory. Notably, even though the state space of the underlying MDP is continuous and L-dimensional, our regret bounds depend linearly on L. Our results significantly improve the previously best known regret bounds for this problem where the dependence on L was exponential and many further assumptions on demand distribution were required. The techniques presented here may be of independent interest for other settings that involve large structured MDPs but with convex cost functions.","PeriodicalId":416173,"journal":{"name":"Proceedings of the 2019 ACM Conference on Economics and Computation","volume":"85 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134476500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
The Vickrey Auction with a Single Duplicate Bidder Approximates the Optimal Revenue 只有一个重复投标人的维克里拍卖近似于最优收益
Pub Date : 2019-05-09 DOI: 10.1145/3328526.3329597
Hu Fu, Christopher Liaw, Sikander Randhawa
Bulow and Klemperer's well-known result states that, in a single-item auction where the n bidders' values are independently and identically drawn from a regular distribution, the Vickrey auction with one additional bidder (a duplicate) extracts at least as much revenue as the optimal auction without the duplicate. Hartline and Roughgarden, in their influential 2009 paper, removed the requirement that the distributions be identical, at the cost of allowing the Vickrey auction to recruit n duplicates, one from each distribution, and relaxing its revenue advantage to a 2-approximation. In this work we restore Bulow and Klemperer's number of duplicates in Hartline and Roughgarden's more general setting. We show that recruiting a duplicate from one of the distributions suffices for the Vickrey auction to $10$-approximate the optimal revenue. We also show that in a k-unit auction, recruiting k duplicates suffices for the VCG auction to $O(1)$-approximate the optimal revenue. We also tighten the analysis for Hartline and Roughgarden's Vickrey auction with n duplicates. We show that, for two distributions, the Vickrey auction with two duplicates obtains at least $3/4$ of the optimal revenue. This is tight by meeting a lower bound by Hartline and Roughgarden. En route, we obtain a transparent analysis of their $2$-approximation, by a natural connection to Ronen's lookahead auction.
Bulow和Klemperer的著名结果表明,在单件拍卖中,n个竞标者的价值是独立且相同地从规则分布中提取的,有一个额外竞标者(副本)的Vickrey拍卖获得的收入至少与没有副本的最优拍卖一样多。Hartline和Roughgarden在2009年发表的一篇很有影响力的论文中,取消了分配相同的要求,其代价是允许维克里拍卖从每个分配中招募n个副本,并将其收入优势放宽到2个近似值。在这项工作中,我们恢复了比洛和克伦佩雷尔在哈特兰和拉夫加登更一般的环境中的复制数。我们证明,从其中一个分布中招募一个副本足以使维克里拍卖达到10美元——接近最优收益。我们还表明,在k单位拍卖中,招募k个副本足以使VCG拍卖达到$O(1)$-近似最优收益。我们还加强了对哈特兰和拉夫加登的维克里拍卖会的分析,有n个副本。我们证明,对于两种分布,具有两个副本的Vickrey拍卖至少获得了最优收益的3/4美元。通过满足Hartline和Roughgarden的下界,这是紧的。在此过程中,我们通过与Ronen的前瞻性拍卖的自然联系,对他们的2美元近似值进行了透明的分析。
{"title":"The Vickrey Auction with a Single Duplicate Bidder Approximates the Optimal Revenue","authors":"Hu Fu, Christopher Liaw, Sikander Randhawa","doi":"10.1145/3328526.3329597","DOIUrl":"https://doi.org/10.1145/3328526.3329597","url":null,"abstract":"Bulow and Klemperer's well-known result states that, in a single-item auction where the n bidders' values are independently and identically drawn from a regular distribution, the Vickrey auction with one additional bidder (a duplicate) extracts at least as much revenue as the optimal auction without the duplicate. Hartline and Roughgarden, in their influential 2009 paper, removed the requirement that the distributions be identical, at the cost of allowing the Vickrey auction to recruit n duplicates, one from each distribution, and relaxing its revenue advantage to a 2-approximation. In this work we restore Bulow and Klemperer's number of duplicates in Hartline and Roughgarden's more general setting. We show that recruiting a duplicate from one of the distributions suffices for the Vickrey auction to $10$-approximate the optimal revenue. We also show that in a k-unit auction, recruiting k duplicates suffices for the VCG auction to $O(1)$-approximate the optimal revenue. We also tighten the analysis for Hartline and Roughgarden's Vickrey auction with n duplicates. We show that, for two distributions, the Vickrey auction with two duplicates obtains at least $3/4$ of the optimal revenue. This is tight by meeting a lower bound by Hartline and Roughgarden. En route, we obtain a transparent analysis of their $2$-approximation, by a natural connection to Ronen's lookahead auction.","PeriodicalId":416173,"journal":{"name":"Proceedings of the 2019 ACM Conference on Economics and Computation","volume":"94 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131771074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
The Implications of Pricing on Social Learning 定价对社会学习的影响
Pub Date : 2019-05-09 DOI: 10.1145/3328526.3329554
Itai Arieli, Moran Koren, Rann Smorodinsky
We study the implications of endogenous pricing for learning and welfare in the classic herding model. When prices are determined exogenously, it is known that learning occurs if and only if signals are unbounded. By contrast, we show that learning can occur when signals are bounded as long as non-conformism among consumers is scarce. More formally, learning happens if and only if signals exhibit the vanishing likelihood property introduced bellow. We discuss the implications of our results for potential market failure in the context of Schumpeterian growth with uncertainty over the value of innovations.
在经典羊群模型中,我们研究了内生定价对学习和福利的影响。当价格是外生决定时,我们知道学习发生当且仅当信号无界。相比之下,我们表明,只要消费者不墨守成规的行为很少,当信号有限时,学习就会发生。更正式地说,当且仅当信号表现出下面介绍的消失似然特性时,学习才会发生。我们讨论了我们的结果对潜在市场失灵的影响在熊彼特增长与创新价值的不确定性的背景下。
{"title":"The Implications of Pricing on Social Learning","authors":"Itai Arieli, Moran Koren, Rann Smorodinsky","doi":"10.1145/3328526.3329554","DOIUrl":"https://doi.org/10.1145/3328526.3329554","url":null,"abstract":"We study the implications of endogenous pricing for learning and welfare in the classic herding model. When prices are determined exogenously, it is known that learning occurs if and only if signals are unbounded. By contrast, we show that learning can occur when signals are bounded as long as non-conformism among consumers is scarce. More formally, learning happens if and only if signals exhibit the vanishing likelihood property introduced bellow. We discuss the implications of our results for potential market failure in the context of Schumpeterian growth with uncertainty over the value of innovations.","PeriodicalId":416173,"journal":{"name":"Proceedings of the 2019 ACM Conference on Economics and Computation","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125163679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Pandora's Problem with Nonobligatory Inspection 潘多拉的非强制性检查问题
Pub Date : 2019-05-04 DOI: 10.1145/3328526.3329626
Hedyeh Beyhaghi, Robert D. Kleinberg
Martin Weitzman's "Pandora's problem" furnishes the mathematical basis for optimal search theory in economics. Nearly 40 years later, Laura Doval introduced a version of the problem in which the searcher is not obligated to pay the cost of inspecting an alternative's value before selecting it. Unlike the original Pandora's problem, the version with nonobligatory inspection cannot be solved optimally by any simple ranking-based policy, and it is unknown whether there exists any polynomial-time algorithm to compute the optimal policy. This motivates the study of approximately optimal policies that are simple and computationally efficient. In this work we provide the first non-trivial approximation guarantees for this problem. We introduce a family of "committing policies" such that it is computationally easy to find and implement the optimal committing policy. We prove that the optimal committing policy is guaranteed to approximate the fully optimal policy within a 1-1/e = 0.63... factor, and for the special case of two boxes we improve this factor to 4/5 and show that this approximation is tight for the class of committing policies.
Martin Weitzman的“潘多拉问题”为经济学中的最优搜索理论提供了数学基础。近40年后,劳拉·多瓦尔(Laura Doval)提出了这个问题的另一个版本,即搜索者在选择备选项之前没有义务支付检查备选项价值的费用。与原始的潘多拉问题不同,非强制检查版本不能通过任何简单的基于排名的策略来最优解决,并且不知道是否存在多项式时间算法来计算最优策略。这激发了对简单且计算效率高的近似最优策略的研究。在这项工作中,我们为这个问题提供了第一个非平凡近似保证。我们引入了一系列“提交策略”,以便在计算上容易找到并实现最优提交策略。我们证明了最优提交策略在1-1/e = 0.63范围内保证近似于完全最优策略。因子,对于两个盒子的特殊情况,我们将这个因子提高到4/5,并表明这个近似对于提交策略类是紧密的。
{"title":"Pandora's Problem with Nonobligatory Inspection","authors":"Hedyeh Beyhaghi, Robert D. Kleinberg","doi":"10.1145/3328526.3329626","DOIUrl":"https://doi.org/10.1145/3328526.3329626","url":null,"abstract":"Martin Weitzman's \"Pandora's problem\" furnishes the mathematical basis for optimal search theory in economics. Nearly 40 years later, Laura Doval introduced a version of the problem in which the searcher is not obligated to pay the cost of inspecting an alternative's value before selecting it. Unlike the original Pandora's problem, the version with nonobligatory inspection cannot be solved optimally by any simple ranking-based policy, and it is unknown whether there exists any polynomial-time algorithm to compute the optimal policy. This motivates the study of approximately optimal policies that are simple and computationally efficient. In this work we provide the first non-trivial approximation guarantees for this problem. We introduce a family of \"committing policies\" such that it is computationally easy to find and implement the optimal committing policy. We prove that the optimal committing policy is guaranteed to approximate the fully optimal policy within a 1-1/e = 0.63... factor, and for the special case of two boxes we improve this factor to 4/5 and show that this approximation is tight for the class of committing policies.","PeriodicalId":416173,"journal":{"name":"Proceedings of the 2019 ACM Conference on Economics and Computation","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131330796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Improved Metric Distortion for Deterministic Social Choice Rules 确定性社会选择规则的改进度量失真
Pub Date : 2019-05-04 DOI: 10.1145/3328526.3329550
Kamesh Munagala, Kangning Wang
In this paper, we study the metric distortion of deterministic social choice rules that choose a winning candidate from a set of candidates based on voter preferences. Voters and candidates are located in an underlying metric space. A voter has cost equal to her distance to the winning candidate. Ordinal social choice rules only have access to the ordinal preferences of the voters that are assumed to be consistent with the metric distances. Our goal is to design an ordinal social choice rule with minimum distortion, which is the worst-case ratio, over all consistent metrics, between the social cost of the rule and that of the optimal omniscient rule with knowledge of the underlying metric space. The distortion of the best deterministic social choice rule was known to be between 3 and 5. It had been conjectured that any rule that only looks at the weighted tournament graph on the candidates cannot have distortion better than 5. In our paper, we disprove it by presenting a weighted tournament rule with distortion of 4.236. We design this rule by generalizing the classic notion of uncovered sets, and further show that this class of rules cannot have distortion better than 4.236. We then propose a new voting rule, via an alternative generalization of uncovered sets. We show that if a candidate satisfying the criterion of this voting rule exists, then choosing such a candidate yields a distortion bound of 3, matching the lower bound. We present a combinatorial conjecture that implies distortion of $3$, and verify it for small numbers of candidates and voters by computer experiments. Using our framework, we also show that selecting any candidate guarantees distortion of at most 3 when the weighted tournament graph is cyclically symmetric.
本文研究了基于选民偏好从一组候选人中选择获胜候选人的确定性社会选择规则的度量失真。选民和候选人位于一个潜在的度量空间中。选民的成本等于她与获胜候选人的距离。序数社会选择规则只能访问被认为与度量距离一致的选民的序数偏好。我们的目标是设计一个具有最小扭曲的有序社会选择规则,这是在所有一致度量中,规则的社会成本与具有底层度量空间知识的最优全知规则之间的最坏比率。已知最佳确定性社会选择规则的扭曲度在3到5之间。据推测,任何只看候选人加权比赛图的规则都不可能有大于5的失真。在我们的论文中,我们提出了一个失真为4.236的加权比赛规则来反驳它。我们通过推广经典的未覆盖集的概念来设计这条规则,并进一步证明了这类规则的失真不可能大于4.236。然后,我们通过对未覆盖集的另一种泛化,提出了一种新的投票规则。我们证明,如果存在满足该投票规则标准的候选人,那么选择这样的候选人会产生3的失真界,与下界匹配。我们提出了一个暗示$3$扭曲的组合猜想,并通过计算机实验对少数候选人和选民进行了验证。使用我们的框架,我们还表明,当加权比赛图是循环对称时,选择任何候选保证最多3的失真。
{"title":"Improved Metric Distortion for Deterministic Social Choice Rules","authors":"Kamesh Munagala, Kangning Wang","doi":"10.1145/3328526.3329550","DOIUrl":"https://doi.org/10.1145/3328526.3329550","url":null,"abstract":"In this paper, we study the metric distortion of deterministic social choice rules that choose a winning candidate from a set of candidates based on voter preferences. Voters and candidates are located in an underlying metric space. A voter has cost equal to her distance to the winning candidate. Ordinal social choice rules only have access to the ordinal preferences of the voters that are assumed to be consistent with the metric distances. Our goal is to design an ordinal social choice rule with minimum distortion, which is the worst-case ratio, over all consistent metrics, between the social cost of the rule and that of the optimal omniscient rule with knowledge of the underlying metric space. The distortion of the best deterministic social choice rule was known to be between 3 and 5. It had been conjectured that any rule that only looks at the weighted tournament graph on the candidates cannot have distortion better than 5. In our paper, we disprove it by presenting a weighted tournament rule with distortion of 4.236. We design this rule by generalizing the classic notion of uncovered sets, and further show that this class of rules cannot have distortion better than 4.236. We then propose a new voting rule, via an alternative generalization of uncovered sets. We show that if a candidate satisfying the criterion of this voting rule exists, then choosing such a candidate yields a distortion bound of 3, matching the lower bound. We present a combinatorial conjecture that implies distortion of $3$, and verify it for small numbers of candidates and voters by computer experiments. Using our framework, we also show that selecting any candidate guarantees distortion of at most 3 when the weighted tournament graph is cyclically symmetric.","PeriodicalId":416173,"journal":{"name":"Proceedings of the 2019 ACM Conference on Economics and Computation","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125632227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Regression Equilibrium 回归平衡
Pub Date : 2019-05-04 DOI: 10.1145/3328526.3329560
Omer Ben-Porat, Moshe Tennenholtz
Prediction is a well-studied machine learning task, and prediction algorithms are core ingredients in online products and services. Despite their centrality in the competition between online companies who offer prediction-based products, the strategic use of prediction algorithms remains unexplored. The goal of this paper is to examine strategic use of prediction algorithms. We introduce a novel game-theoretic setting that is based on the PAC learning framework, where each player (aka a prediction algorithm aimed at competition) seeks to maximize the sum of points for which it produces an accurate prediction and the others do not. We show that algorithms aiming at generalization may wittingly mispredict some points to perform better than others on expectation. We analyze the empirical game, i.e., the game induced on a given sample, prove that it always possesses a pure Nash equilibrium, and show that every better-response learning process converges. Moreover, our learning-theoretic analysis suggests that players can, with high probability, learn an approximate pure Nash equilibrium for the whole population using a small number of samples.
预测是一项经过充分研究的机器学习任务,预测算法是在线产品和服务的核心成分。尽管它们在提供基于预测的产品的在线公司之间的竞争中处于中心地位,但预测算法的战略用途仍未得到探索。本文的目的是研究预测算法的策略使用。我们引入了一种基于PAC学习框架的新颖博弈论设置,其中每个参与者(又名旨在竞争的预测算法)寻求最大化其产生准确预测的点数总和,而其他参与者则没有。我们表明,以泛化为目标的算法可能有意地错误预测某些点,以便在期望上比其他点表现得更好。我们分析了经验博弈,即在给定样本上诱导的博弈,证明了它总是具有纯纳什均衡,并证明了每一个更好响应的学习过程都是收敛的。此外,我们的学习理论分析表明,玩家可以使用少量样本,以高概率学习整个群体的近似纯纳什均衡。
{"title":"Regression Equilibrium","authors":"Omer Ben-Porat, Moshe Tennenholtz","doi":"10.1145/3328526.3329560","DOIUrl":"https://doi.org/10.1145/3328526.3329560","url":null,"abstract":"Prediction is a well-studied machine learning task, and prediction algorithms are core ingredients in online products and services. Despite their centrality in the competition between online companies who offer prediction-based products, the strategic use of prediction algorithms remains unexplored. The goal of this paper is to examine strategic use of prediction algorithms. We introduce a novel game-theoretic setting that is based on the PAC learning framework, where each player (aka a prediction algorithm aimed at competition) seeks to maximize the sum of points for which it produces an accurate prediction and the others do not. We show that algorithms aiming at generalization may wittingly mispredict some points to perform better than others on expectation. We analyze the empirical game, i.e., the game induced on a given sample, prove that it always possesses a pure Nash equilibrium, and show that every better-response learning process converges. Moreover, our learning-theoretic analysis suggests that players can, with high probability, learn an approximate pure Nash equilibrium for the whole population using a small number of samples.","PeriodicalId":416173,"journal":{"name":"Proceedings of the 2019 ACM Conference on Economics and Computation","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130792557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
LP-based Approximation for Personalized Reserve Prices 基于lp的个性化保留价格逼近
Pub Date : 2019-05-04 DOI: 10.1145/3328526.3329594
M. Derakhshan, Negin Golrezaei, R. Leme
We study the problem of computing personalized reserve prices in eager second price auctions without having any assumption on valuation distributions. Here, the input is a dataset that contains the submitted bids of n buyers in a set of auctions and the goal is to return personalized reserve prices r that maximize the revenue earned on these auctions by running eager second price auctions with reserve r. We present a novel LP formulation to this problem and a rounding procedure which achieves a (1+2(√2-1)e√2-2)-1≅0.684-approximation. This improves over the 1/2-approximation Algorithm due to Roughgarden and Wang. We show that our analysis is tight for this rounding procedure. We also bound the integrality gap of the LP, which bounds the performance of any algorithm based on this LP.
本文研究了在没有估价分布假设的情况下,急切第二价格拍卖中个性化保留价格的计算问题。在这里,输入是一个包含n个买家在一组拍卖中提交的出价的数据集,目标是返回个性化的保留价格r,通过运行具有保留r的急于第二价格拍卖来最大化这些拍卖所获得的收入。我们提出了一个新的LP公式和一个四舍五入的过程来实现(1+2(√2-1)e√2-2)-1 = 0.684近似值。这比Roughgarden和Wang提出的1/2近似算法有所改进。我们证明了我们的分析对于这个舍入过程是严格的。我们还限定了LP的完整性间隙,从而限定了基于该LP的任何算法的性能。
{"title":"LP-based Approximation for Personalized Reserve Prices","authors":"M. Derakhshan, Negin Golrezaei, R. Leme","doi":"10.1145/3328526.3329594","DOIUrl":"https://doi.org/10.1145/3328526.3329594","url":null,"abstract":"We study the problem of computing personalized reserve prices in eager second price auctions without having any assumption on valuation distributions. Here, the input is a dataset that contains the submitted bids of n buyers in a set of auctions and the goal is to return personalized reserve prices r that maximize the revenue earned on these auctions by running eager second price auctions with reserve r. We present a novel LP formulation to this problem and a rounding procedure which achieves a (1+2(√2-1)e√2-2)-1≅0.684-approximation. This improves over the 1/2-approximation Algorithm due to Roughgarden and Wang. We show that our analysis is tight for this rounding procedure. We also bound the integrality gap of the LP, which bounds the performance of any algorithm based on this LP.","PeriodicalId":416173,"journal":{"name":"Proceedings of the 2019 ACM Conference on Economics and Computation","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131692235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Robust Non-Bayesian Social Learning 鲁棒非贝叶斯社会学习
Pub Date : 2019-05-02 DOI: 10.2139/ssrn.3381563
Itai Arieli, Y. Babichenko, Segev Shlomov
We study non-Bayesian social learning in large networks and binary state space. Agents who are located in a network receive conditionally i.i.d. signals over the state. We refer to the initial distribution of signals as the information structure. In each step, all agents aggregate their belief with the beliefs of their neighbors according to some non-Bayesian rule. We refer to the aggregation rule as the learning dynamic. We say that a dynamic leads to learning if the beliefs of all agents converge to the correct state with a probability that approaches one in an increasing sequence of large networks. We say that a class of information structures p is learnable if there exists a learning dynamic that leads to learning for all information structures in p. Namely, there exists a single learning dynamic that robustly leads to learning for all possible information structures. We provide a necessary and sufficient characterization of learnable classes of information structures. Whenever learning is possible in a class p it is also possible via a virtually additive learning dynamic, where players map beliefs to virtual values and in each period they simply sum up all neighbors' virtual values to deduce their new belief. In addition, we relax the common prior assumption and provide a sufficient condition for learning in the absence of a common prior.
我们研究了大型网络和二元状态空间中的非贝叶斯社会学习。位于网络中的代理在状态上接收有条件的id信号。我们把信号的初始分布称为信息结构。在每一步中,所有智能体根据一些非贝叶斯规则将自己的信念与邻居的信念进行聚合。我们把聚合规则称为学习动态。我们说,如果在一个不断增加的大型网络序列中,所有智能体的信念以接近1的概率收敛到正确的状态,那么动态导致学习。我们说一类信息结构p是可学习的,如果存在一个学习动态导致对p中的所有信息结构进行学习。也就是说,存在一个学习动态导致对所有可能的信息结构进行学习。我们提供了信息结构的可学习类的必要和充分的表征。在p类中,无论何时学习都是可能的,这也可能是通过虚拟的附加学习动态,即玩家将信念映射到虚拟值,并且在每个阶段他们只是简单地总结所有邻居的虚拟值来推断他们的新信念。此外,我们放宽了共同先验假设,并提供了在没有共同先验的情况下学习的充分条件。
{"title":"Robust Non-Bayesian Social Learning","authors":"Itai Arieli, Y. Babichenko, Segev Shlomov","doi":"10.2139/ssrn.3381563","DOIUrl":"https://doi.org/10.2139/ssrn.3381563","url":null,"abstract":"We study non-Bayesian social learning in large networks and binary state space. Agents who are located in a network receive conditionally i.i.d. signals over the state. We refer to the initial distribution of signals as the information structure. In each step, all agents aggregate their belief with the beliefs of their neighbors according to some non-Bayesian rule. We refer to the aggregation rule as the learning dynamic. We say that a dynamic leads to learning if the beliefs of all agents converge to the correct state with a probability that approaches one in an increasing sequence of large networks. We say that a class of information structures p is learnable if there exists a learning dynamic that leads to learning for all information structures in p. Namely, there exists a single learning dynamic that robustly leads to learning for all possible information structures. We provide a necessary and sufficient characterization of learnable classes of information structures. Whenever learning is possible in a class p it is also possible via a virtually additive learning dynamic, where players map beliefs to virtual values and in each period they simply sum up all neighbors' virtual values to deduce their new belief. In addition, we relax the common prior assumption and provide a sufficient condition for learning in the absence of a common prior.","PeriodicalId":416173,"journal":{"name":"Proceedings of the 2019 ACM Conference on Economics and Computation","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122684319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Budget-Feasible Mechanism Design for Non-Monotone Submodular Objectives: Offline and Online 非单调子模目标的预算可行机制设计:离线和在线
Pub Date : 2019-05-02 DOI: 10.1145/3328526.3329622
Georgios Amanatidis, P. Kleer, G. Schäfer
The framework of budget-feasible mechanism design studies procurement auctions where the auctioneer (buyer) aims to maximize his valuation function subject to a hard budget constraint. We study the problem of designing truthful mechanisms that have good approximation guarantees and never pay the participating agents (sellers) more than the budget. We focus on the case of general (non-monotone) submodular valuation functions and derive the first truthful, budget-feasible and $O(1)$-approximation mechanisms that run in polynomial time in the value query model, for both offline and online auctions. Since the introduction of the problem by Singer citepSinger10, obtaining efficient mechanisms for objectives that go beyond the class of monotone submodular functions has been elusive. Prior to our work, the only $O(1)$-approximation mechanism known for non-monotone submodular objectives required an exponential number of value queries. At the heart of our approach lies a novel greedy algorithm for non-monotone submodular maximization under a knapsack constraint. Our algorithm builds two candidate solutions simultaneously (to achieve a good approximation), yet ensures that agents cannot jump from one solution to the other (to implicitly enforce truthfulness). Ours is the first mechanism for the problem where---crucially---the agents are not ordered according to their marginal value per cost. This allows us to appropriately adapt these ideas to the online setting as well. To further illustrate the applicability of our approach, we also consider the case where additional feasibility constraints are present, e.g., at most k agents can be selected. We obtain O(p)-approximation mechanisms for both monotone and non-monotone submodular objectives, when the feasible solutions are independent sets of a p-system. With the exception of additive valuation functions, no mechanisms were known for this setting prior to our work. Finally, we provide lower bounds suggesting that, when one cares about non-trivial approximation guarantees in polynomial time, our results are asymptotically best possible.
预算可行机制设计的框架研究采购拍卖,其中拍卖人(买方)的目标是在硬预算约束下最大化其估价功能。我们研究了设计具有良好近似保证的真实机制的问题,并且永远不会向参与的代理(卖方)支付超过预算的费用。我们专注于一般(非单调)子模估值函数的情况,并推导出第一个真实的、预算可行的和$O(1)$逼近机制,这些机制在价值查询模型中以多项式时间运行,适用于离线和在线拍卖。自从Singer citepSinger10引入这个问题以来,获得超越单调子模函数类的目标的有效机制一直是难以捉摸的。在我们的工作之前,已知的唯一用于非单调子模目标的$O(1)$逼近机制需要指数数量的值查询。该方法的核心是一种新颖的贪心算法,用于在背包约束下实现非单调次模最大化。我们的算法同时构建两个候选解决方案(以获得良好的近似值),但确保代理不能从一个解决方案跳到另一个解决方案(以隐式地强制真实性)。我们的机制是解决这个问题的第一个机制,关键是,在这个问题中,代理不是根据它们的每成本边际价值排序的。这也让我们能够将这些理念适当地应用于网络环境中。为了进一步说明我们方法的适用性,我们还考虑了存在额外可行性约束的情况,例如,最多可以选择k个代理。当可行解是p系统的独立集时,我们得到了单调和非单调子模目标的O(p)逼近机制。除了附加的估值函数,在我们的工作之前,没有机制是已知的。最后,我们提供的下界表明,当人们关心多项式时间内的非平凡近似保证时,我们的结果是渐近最佳可能的。
{"title":"Budget-Feasible Mechanism Design for Non-Monotone Submodular Objectives: Offline and Online","authors":"Georgios Amanatidis, P. Kleer, G. Schäfer","doi":"10.1145/3328526.3329622","DOIUrl":"https://doi.org/10.1145/3328526.3329622","url":null,"abstract":"The framework of budget-feasible mechanism design studies procurement auctions where the auctioneer (buyer) aims to maximize his valuation function subject to a hard budget constraint. We study the problem of designing truthful mechanisms that have good approximation guarantees and never pay the participating agents (sellers) more than the budget. We focus on the case of general (non-monotone) submodular valuation functions and derive the first truthful, budget-feasible and $O(1)$-approximation mechanisms that run in polynomial time in the value query model, for both offline and online auctions. Since the introduction of the problem by Singer citepSinger10, obtaining efficient mechanisms for objectives that go beyond the class of monotone submodular functions has been elusive. Prior to our work, the only $O(1)$-approximation mechanism known for non-monotone submodular objectives required an exponential number of value queries. At the heart of our approach lies a novel greedy algorithm for non-monotone submodular maximization under a knapsack constraint. Our algorithm builds two candidate solutions simultaneously (to achieve a good approximation), yet ensures that agents cannot jump from one solution to the other (to implicitly enforce truthfulness). Ours is the first mechanism for the problem where---crucially---the agents are not ordered according to their marginal value per cost. This allows us to appropriately adapt these ideas to the online setting as well. To further illustrate the applicability of our approach, we also consider the case where additional feasibility constraints are present, e.g., at most k agents can be selected. We obtain O(p)-approximation mechanisms for both monotone and non-monotone submodular objectives, when the feasible solutions are independent sets of a p-system. With the exception of additive valuation functions, no mechanisms were known for this setting prior to our work. Finally, we provide lower bounds suggesting that, when one cares about non-trivial approximation guarantees in polynomial time, our results are asymptotically best possible.","PeriodicalId":416173,"journal":{"name":"Proceedings of the 2019 ACM Conference on Economics and Computation","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127510240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
期刊
Proceedings of the 2019 ACM Conference on Economics and Computation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1