{"title":"On Mechanism Underlying Algorithmic Collusion","authors":"Zhang Xu, Wei Zhao","doi":"arxiv-2409.01147","DOIUrl":null,"url":null,"abstract":"Two issues of algorithmic collusion are addressed in this paper. First, we\nshow that in a general class of symmetric games, including Prisoner's Dilemma,\nBertrand competition, and any (nonlinear) mixture of first and second price\nauction, only (strict) Nash Equilibrium (NE) is stochastically stable.\nTherefore, the tacit collusion is driven by failure to learn NE due to\ninsufficient learning, instead of learning some strategies to sustain collusive\noutcomes. Second, we study how algorithms adapt to collusion in real\nsimulations with insufficient learning. Extensive explorations in early stages\nand discount factors inflates the Q-value, which interrupts the sequential and\nalternative price undercut and leads to bilateral rebound. The process is\niterated, making the price curves like Edgeworth cycles. When both exploration\nrate and Q-value decrease, algorithms may bilaterally rebound to relatively\nhigh common price level by coincidence, and then get stuck. Finally, we\naccommodate our reasoning to simulation outcomes in the literature, including\noptimistic initialization, market design and algorithm design.","PeriodicalId":501188,"journal":{"name":"arXiv - ECON - Theoretical Economics","volume":"68 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - ECON - Theoretical Economics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01147","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Two issues of algorithmic collusion are addressed in this paper. First, we
show that in a general class of symmetric games, including Prisoner's Dilemma,
Bertrand competition, and any (nonlinear) mixture of first and second price
auction, only (strict) Nash Equilibrium (NE) is stochastically stable.
Therefore, the tacit collusion is driven by failure to learn NE due to
insufficient learning, instead of learning some strategies to sustain collusive
outcomes. Second, we study how algorithms adapt to collusion in real
simulations with insufficient learning. Extensive explorations in early stages
and discount factors inflates the Q-value, which interrupts the sequential and
alternative price undercut and leads to bilateral rebound. The process is
iterated, making the price curves like Edgeworth cycles. When both exploration
rate and Q-value decrease, algorithms may bilaterally rebound to relatively
high common price level by coincidence, and then get stuck. Finally, we
accommodate our reasoning to simulation outcomes in the literature, including
optimistic initialization, market design and algorithm design.