Risk preferences of learning algorithms

IF 1 3区经济学 Q3 ECONOMICS Games and Economic Behavior Pub Date : 2024-11-01 Epub Date: 2024-10-18 DOI:10.1016/j.geb.2024.09.013

Andreas Haupt, Aroon Narayanan

引用次数: 0

Abstract

Many economic decision-makers today rely on learning algorithms for important decisions. This paper shows that a widely used learning algorithm—ε-Greedy—exhibits emergent risk aversion, favoring actions with lower payoff variance. When presented with actions of the same expectated payoff, under a wide range of conditions, ε-Greedy chooses the lower-variance action with probability approaching one. This emergent preference can have wide-ranging consequences, from inequity to homogenization, and holds transiently even when the higher-variance action has a strictly higher expected payoff. We discuss two methods to restore risk neutrality. The first method reweights data as a function of how likely an action is chosen. The second method employs optimistic payoff estimates for actions that have not been taken often.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

学习算法的风险偏好

如今，许多经济决策者都依赖学习算法来做出重要决策。本文表明，一种广泛使用的学习算法--ε-Greedy--表现出了新出现的风险厌恶，它偏爱报酬方差较低的行动。在多种条件下，当面临相同预期报酬的行动时，ε-Greedy 选择方差较低行动的概率接近于 1。这种新出现的偏好会产生广泛的后果，从不公到同质化，甚至在高方差行动的预期报酬严格高于低方差行动时也会暂时保持不变。我们讨论了两种恢复风险中性的方法。第一种方法是根据行动被选择的可能性对数据重新加权。第二种方法对不经常采取的行动采用乐观的报酬估计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Games and Economic Behavior ECONOMICS-

CiteScore

1.90

自引率

9.10%

发文量

148

期刊介绍： Games and Economic Behavior facilitates cross-fertilization between theories and applications of game theoretic reasoning. It consistently attracts the best quality and most creative papers in interdisciplinary studies within the social, biological, and mathematical sciences. Most readers recognize it as the leading journal in game theory. Research Areas Include: • Game theory • Economics • Political science • Biology • Computer science • Mathematics • Psychology

期刊最新文献

Compellingness in Nash implementation Evaluating non-clairvoyant dynamic mechanisms: Theory and experiment Welfare properties of the Taiwan mechanism Istanbul flower auction: The need for speed Algorithmic collusion and a folk theorem from learning with bounded rationality