迭代公共产品游戏中的改进学习:探索性噪音的影响

IF 1.3 4区社会学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Journal of Mathematical Sociology Pub Date : 2018-01-02 DOI:10.1080/0022250X.2017.1396983

Johannes Zschache

{"title":"迭代公共产品游戏中的改进学习:探索性噪音的影响","authors":"Johannes Zschache","doi":"10.1080/0022250X.2017.1396983","DOIUrl":null,"url":null,"abstract":"ABSTRACT Experimental observations in iterated public goods games are explained by a simple but empirically well-grounded model of long-term reinforcement learning. In many experiments, medium levels of cooperation at the beginning decrease with further repetitions. However, in some settings, the actors only slowly learn the individual benefits of defection. In the present model, the decay in cooperation is mitigated by high individual returns, a large group size or stability in the group’s composition. Results from agent-based simulations are presented, and the underlying mechanisms are disclosed. The proposed explanation stresses the role of exploratory noise: if multiple actors explore their alternatives simultaneously, the marginal benefit of defection diminishes and cooperation can be sustained.","PeriodicalId":50139,"journal":{"name":"Journal of Mathematical Sociology","volume":"42 1","pages":"1 - 16"},"PeriodicalIF":1.3000,"publicationDate":"2018-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/0022250X.2017.1396983","citationCount":"1","resultStr":"{\"title\":\"Melioration learning in iterated public goods games: The impact of exploratory noise\",\"authors\":\"Johannes Zschache\",\"doi\":\"10.1080/0022250X.2017.1396983\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT Experimental observations in iterated public goods games are explained by a simple but empirically well-grounded model of long-term reinforcement learning. In many experiments, medium levels of cooperation at the beginning decrease with further repetitions. However, in some settings, the actors only slowly learn the individual benefits of defection. In the present model, the decay in cooperation is mitigated by high individual returns, a large group size or stability in the group’s composition. Results from agent-based simulations are presented, and the underlying mechanisms are disclosed. The proposed explanation stresses the role of exploratory noise: if multiple actors explore their alternatives simultaneously, the marginal benefit of defection diminishes and cooperation can be sustained.\",\"PeriodicalId\":50139,\"journal\":{\"name\":\"Journal of Mathematical Sociology\",\"volume\":\"42 1\",\"pages\":\"1 - 16\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2018-01-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1080/0022250X.2017.1396983\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Mathematical Sociology\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://doi.org/10.1080/0022250X.2017.1396983\",\"RegionNum\":4,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Mathematical Sociology","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1080/0022250X.2017.1396983","RegionNum":4,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 1

摘要

迭代公共产品博弈中的实验观察可以用一个简单但经验基础良好的长期强化学习模型来解释。在许多实验中，随着进一步的重复，开始时的中等合作水平会下降。然而，在某些情况下，演员只是慢慢地了解到背叛的个人利益。在目前的模型中，合作的衰减被高个人回报、大群体规模或群体组成的稳定性所缓解。给出了基于智能体的仿真结果，并揭示了其潜在机制。提出的解释强调探索性噪声的作用:如果多个参与者同时探索他们的替代方案，背叛的边际效益会减少，合作可以持续。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Melioration learning in iterated public goods games: The impact of exploratory noise

ABSTRACT Experimental observations in iterated public goods games are explained by a simple but empirically well-grounded model of long-term reinforcement learning. In many experiments, medium levels of cooperation at the beginning decrease with further repetitions. However, in some settings, the actors only slowly learn the individual benefits of defection. In the present model, the decay in cooperation is mitigated by high individual returns, a large group size or stability in the group’s composition. Results from agent-based simulations are presented, and the underlying mechanisms are disclosed. The proposed explanation stresses the role of exploratory noise: if multiple actors explore their alternatives simultaneously, the marginal benefit of defection diminishes and cooperation can be sustained.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Mathematical Sociology 数学-数学跨学科应用

CiteScore

2.90

自引率

10.00%

发文量

审稿时长

>12 weeks

期刊介绍： The goal of the Journal of Mathematical Sociology is to publish models and mathematical techniques that would likely be useful to professional sociologists. The Journal also welcomes papers of mutual interest to social scientists and other social and behavioral scientists, as well as papers by non-social scientists that may encourage fruitful connections between sociology and other disciplines. Reviews of new or developing areas of mathematics and mathematical modeling that may have significant applications in sociology will also be considered. The Journal of Mathematical Sociology is published in association with the International Network for Social Network Analysis, the Japanese Association for Mathematical Sociology, the Mathematical Sociology Section of the American Sociological Association, and the Methodology Section of the American Sociological Association.