Efficiency and fairness trade-offs in two player bargaining games

IF 1.5 1区 哲学 Q1 HISTORY & PHILOSOPHY OF SCIENCE European Journal for Philosophy of Science Pub Date : 2023-10-24 DOI:10.1007/s13194-023-00553-6
David Freeborn
{"title":"Efficiency and fairness trade-offs in two player bargaining games","authors":"David Freeborn","doi":"10.1007/s13194-023-00553-6","DOIUrl":null,"url":null,"abstract":"<p>Recent work on the evolution of social contracts and conventions has often used models of bargaining games, with reinforcement learning. A recent innovation is the requirement that every strategy must be invented either through through learning or reinforcement. However, agents frequently get stuck in highly-reinforced “traps” that prevent them from arriving at outcomes that are efficient or fair to the both players. Agents face a trade-off between exploration and exploitation, i.e. between continuing to invent new strategies and reinforcing strategies that have already become highly reinforced by yielding high rewards. In this paper I systematically study the relationship between rates of invention and the efficiency and fairness of outcomes in two-player, repeated bargaining games. I use a basic reinforcement learning model with invention, and five variations of this model, designed introduce various forms of forgetting, to prioritize more recent reinforcement, or to maintain a higher rate of invention. I use computer simulations to investigate the outcomes of each model. Each models shows qualitative similarities in the relationship between the efficiency and fairness of outcomes, and the relative amount of exploration or exploitation that takes place. Surprisingly, there are often trade-offs between the efficiency and the fairness of the outcomes.</p>","PeriodicalId":48832,"journal":{"name":"European Journal for Philosophy of Science","volume":"5 12","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2023-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal for Philosophy of Science","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1007/s13194-023-00553-6","RegionNum":1,"RegionCategory":"哲学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HISTORY & PHILOSOPHY OF SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Recent work on the evolution of social contracts and conventions has often used models of bargaining games, with reinforcement learning. A recent innovation is the requirement that every strategy must be invented either through through learning or reinforcement. However, agents frequently get stuck in highly-reinforced “traps” that prevent them from arriving at outcomes that are efficient or fair to the both players. Agents face a trade-off between exploration and exploitation, i.e. between continuing to invent new strategies and reinforcing strategies that have already become highly reinforced by yielding high rewards. In this paper I systematically study the relationship between rates of invention and the efficiency and fairness of outcomes in two-player, repeated bargaining games. I use a basic reinforcement learning model with invention, and five variations of this model, designed introduce various forms of forgetting, to prioritize more recent reinforcement, or to maintain a higher rate of invention. I use computer simulations to investigate the outcomes of each model. Each models shows qualitative similarities in the relationship between the efficiency and fairness of outcomes, and the relative amount of exploration or exploitation that takes place. Surprisingly, there are often trade-offs between the efficiency and the fairness of the outcomes.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
两人讨价还价博弈中的效率与公平权衡
最近关于社会契约和惯例演变的工作经常使用谈判游戏模型,并进行强化学习。最近的一项创新是要求每一种策略都必须通过学习或强化来发明。然而,特工们经常陷入高度强化的“陷阱”,这使他们无法达成对双方都有效或公平的结果。代理人面临着探索和利用之间的权衡,即在继续发明新策略和强化策略之间,这些策略已经通过产生高回报而得到高度强化。在本文中,我系统地研究了两人重复讨价还价游戏中发明率与结果的效率和公平性之间的关系。我使用了一个带有发明的基本强化学习模型,以及该模型的五个变体,旨在引入各种形式的遗忘,以优先考虑最近的强化,或保持更高的发明率。我使用计算机模拟来研究每个模型的结果。每一个模型都显示了结果的效率和公平性以及相对勘探或开采量之间的关系在质量上的相似性。令人惊讶的是,结果的效率和公平性之间往往存在权衡。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
European Journal for Philosophy of Science
European Journal for Philosophy of Science HISTORY & PHILOSOPHY OF SCIENCE-
CiteScore
2.60
自引率
13.30%
发文量
57
期刊介绍: The European Journal for Philosophy of Science publishes groundbreaking works that can deepen understanding of the concepts and methods of the sciences, as they explore increasingly many facets of the world we live in. It is of direct interest to philosophers of science coming from different perspectives, as well as scientists, citizens and policymakers. The journal is interested in articles from all traditions and all backgrounds, as long as they engage with the sciences in a constructive, and critical, way. The journal represents the various longstanding European philosophical traditions engaging with the sciences, but welcomes articles from every part of the world.
期刊最新文献
Questioning origins: the role of ethical and metaethical claims in the debate about the evolution of morality The extraterrestrial hypothesis: an epistemological case for removing the taboo Nagelian reduction and approximation The replication crisis is less of a “crisis” in Lakatos’ philosophy of science than it is in Popper’s Stopping rule and Bayesian confirmation theory
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1