迭代公共产品游戏中的改进学习:探索性噪音的影响

IF 1.3 4区 社会学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Journal of Mathematical Sociology Pub Date : 2018-01-02 DOI:10.1080/0022250X.2017.1396983
Johannes Zschache
{"title":"迭代公共产品游戏中的改进学习:探索性噪音的影响","authors":"Johannes Zschache","doi":"10.1080/0022250X.2017.1396983","DOIUrl":null,"url":null,"abstract":"ABSTRACT Experimental observations in iterated public goods games are explained by a simple but empirically well-grounded model of long-term reinforcement learning. In many experiments, medium levels of cooperation at the beginning decrease with further repetitions. However, in some settings, the actors only slowly learn the individual benefits of defection. In the present model, the decay in cooperation is mitigated by high individual returns, a large group size or stability in the group’s composition. Results from agent-based simulations are presented, and the underlying mechanisms are disclosed. The proposed explanation stresses the role of exploratory noise: if multiple actors explore their alternatives simultaneously, the marginal benefit of defection diminishes and cooperation can be sustained.","PeriodicalId":50139,"journal":{"name":"Journal of Mathematical Sociology","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2018-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/0022250X.2017.1396983","citationCount":"1","resultStr":"{\"title\":\"Melioration learning in iterated public goods games: The impact of exploratory noise\",\"authors\":\"Johannes Zschache\",\"doi\":\"10.1080/0022250X.2017.1396983\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT Experimental observations in iterated public goods games are explained by a simple but empirically well-grounded model of long-term reinforcement learning. In many experiments, medium levels of cooperation at the beginning decrease with further repetitions. However, in some settings, the actors only slowly learn the individual benefits of defection. In the present model, the decay in cooperation is mitigated by high individual returns, a large group size or stability in the group’s composition. Results from agent-based simulations are presented, and the underlying mechanisms are disclosed. The proposed explanation stresses the role of exploratory noise: if multiple actors explore their alternatives simultaneously, the marginal benefit of defection diminishes and cooperation can be sustained.\",\"PeriodicalId\":50139,\"journal\":{\"name\":\"Journal of Mathematical Sociology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2018-01-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1080/0022250X.2017.1396983\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Mathematical Sociology\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://doi.org/10.1080/0022250X.2017.1396983\",\"RegionNum\":4,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Mathematical Sociology","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1080/0022250X.2017.1396983","RegionNum":4,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 1

摘要

迭代公共产品博弈中的实验观察可以用一个简单但经验基础良好的长期强化学习模型来解释。在许多实验中,随着进一步的重复,开始时的中等合作水平会下降。然而,在某些情况下,演员只是慢慢地了解到背叛的个人利益。在目前的模型中,合作的衰减被高个人回报、大群体规模或群体组成的稳定性所缓解。给出了基于智能体的仿真结果,并揭示了其潜在机制。提出的解释强调探索性噪声的作用:如果多个参与者同时探索他们的替代方案,背叛的边际效益会减少,合作可以持续。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Melioration learning in iterated public goods games: The impact of exploratory noise
ABSTRACT Experimental observations in iterated public goods games are explained by a simple but empirically well-grounded model of long-term reinforcement learning. In many experiments, medium levels of cooperation at the beginning decrease with further repetitions. However, in some settings, the actors only slowly learn the individual benefits of defection. In the present model, the decay in cooperation is mitigated by high individual returns, a large group size or stability in the group’s composition. Results from agent-based simulations are presented, and the underlying mechanisms are disclosed. The proposed explanation stresses the role of exploratory noise: if multiple actors explore their alternatives simultaneously, the marginal benefit of defection diminishes and cooperation can be sustained.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Mathematical Sociology
Journal of Mathematical Sociology 数学-数学跨学科应用
CiteScore
2.90
自引率
10.00%
发文量
5
审稿时长
>12 weeks
期刊介绍: The goal of the Journal of Mathematical Sociology is to publish models and mathematical techniques that would likely be useful to professional sociologists. The Journal also welcomes papers of mutual interest to social scientists and other social and behavioral scientists, as well as papers by non-social scientists that may encourage fruitful connections between sociology and other disciplines. Reviews of new or developing areas of mathematics and mathematical modeling that may have significant applications in sociology will also be considered. The Journal of Mathematical Sociology is published in association with the International Network for Social Network Analysis, the Japanese Association for Mathematical Sociology, the Mathematical Sociology Section of the American Sociological Association, and the Methodology Section of the American Sociological Association.
期刊最新文献
Can altruism lead to a willingness to take risks? Everybody herds, sometimes: cumulative advantage as a product of rational learning Hurdle-QAP models overcome dependency and sparsity in scientific collaboration count networks Latent class analysis of multigroup heterogeneity in propensity for academic dishonesty Extremism, segregation and oscillatory states emerge through collective opinion dynamics in a novel agent-based model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1