广义回归神经网络在进化马尔可夫博弈中策略选择的有效性

Shirin Kordnoori, H. Mostafaei, M. Ostadrahimi, S. Banihashemi
{"title":"广义回归神经网络在进化马尔可夫博弈中策略选择的有效性","authors":"Shirin Kordnoori, H. Mostafaei, M. Ostadrahimi, S. Banihashemi","doi":"10.12962/j20882033.v32i1.7074","DOIUrl":null,"url":null,"abstract":"Nowadays, Evolutionary Game Theory which studies the learning model of players, has attracted more attention than before. These Games can simulate the real situation and dynamic during processing time. This paper creates the Evolutionary Markov Games, which maps players’ strategy-choosing to a Markov Decision Processes (MDPs) with payoffs. Boltzmann distribution is used for transition probability and the General Regression Neural Network (GRNN) simulating the strategy-choosing in Evolutionary Markov Games. Prisoner’s dilemma is a problem that uses the method and output results showing the overlapping the human strategy-choosing line and GRNN strategy-choosing line after 48 iterations, and they choose the same strategies. Also, the error rate of the GRNN training by Tit for Tat (TFT) strategy is lower than similar work and shows a better result.","PeriodicalId":14549,"journal":{"name":"IPTEK: The Journal for Technology and Science","volume":"48 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Efficacy of Choosing Strategy with General Regression Neural Network on Evolutionary Markov Games\",\"authors\":\"Shirin Kordnoori, H. Mostafaei, M. Ostadrahimi, S. Banihashemi\",\"doi\":\"10.12962/j20882033.v32i1.7074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nowadays, Evolutionary Game Theory which studies the learning model of players, has attracted more attention than before. These Games can simulate the real situation and dynamic during processing time. This paper creates the Evolutionary Markov Games, which maps players’ strategy-choosing to a Markov Decision Processes (MDPs) with payoffs. Boltzmann distribution is used for transition probability and the General Regression Neural Network (GRNN) simulating the strategy-choosing in Evolutionary Markov Games. Prisoner’s dilemma is a problem that uses the method and output results showing the overlapping the human strategy-choosing line and GRNN strategy-choosing line after 48 iterations, and they choose the same strategies. Also, the error rate of the GRNN training by Tit for Tat (TFT) strategy is lower than similar work and shows a better result.\",\"PeriodicalId\":14549,\"journal\":{\"name\":\"IPTEK: The Journal for Technology and Science\",\"volume\":\"48 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IPTEK: The Journal for Technology and Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12962/j20882033.v32i1.7074\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IPTEK: The Journal for Technology and Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12962/j20882033.v32i1.7074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目前,研究玩家学习模式的进化博弈论受到了越来越多的关注。这些游戏可以模拟真实情况和动态过程中的处理时间。本文创建了进化马尔可夫博弈,将玩家的策略选择映射到具有收益的马尔可夫决策过程(mdp)。转移概率采用玻尔兹曼分布,通用回归神经网络(GRNN)模拟进化马尔可夫博弈中的策略选择。囚徒困境是利用人类策略选择线与GRNN策略选择线在48次迭代后重叠的方法和输出结果,并选择相同策略的问题。此外,采用以牙还牙(Tit for Tat, TFT)策略训练的GRNN的错误率低于同类方法,并取得了较好的训练效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
The Efficacy of Choosing Strategy with General Regression Neural Network on Evolutionary Markov Games
Nowadays, Evolutionary Game Theory which studies the learning model of players, has attracted more attention than before. These Games can simulate the real situation and dynamic during processing time. This paper creates the Evolutionary Markov Games, which maps players’ strategy-choosing to a Markov Decision Processes (MDPs) with payoffs. Boltzmann distribution is used for transition probability and the General Regression Neural Network (GRNN) simulating the strategy-choosing in Evolutionary Markov Games. Prisoner’s dilemma is a problem that uses the method and output results showing the overlapping the human strategy-choosing line and GRNN strategy-choosing line after 48 iterations, and they choose the same strategies. Also, the error rate of the GRNN training by Tit for Tat (TFT) strategy is lower than similar work and shows a better result.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
17
审稿时长
9 weeks
期刊最新文献
Deposition Silver Based Thin Film on Stainless Steel 316l as Antimicrobial Agent Using Electrophoretic Deposition Method Probabilistic Scheduling Based On Hybrid Bayesian Network–Program Evaluation Review Technique, Analysis of Level Team Effectiveness in The Implementation of Scrum Using Evidence-Based Management (Case Study: Company A as A Fintech Industry) Project Delay Risk Assessment User-Centered Design-Based Approach in Scheduling Management Application Design and Development
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1