Developing a Reinforcement Learning based Chess Engine

Weidong Liao, Andrew Moseman
Proceedings of the West Virginia Academy of Science. Published 2023-04-18. DOI: 10.55632/pwvas.v95i2.990 (https://doi.org/10.55632/pwvas.v95i2.990)

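Abstract

Traditionally, chess engines have used handcrafted evaluation functions based on human strategy. Recently, machine learning has been used as an alternative to direct position scoring. However, this typically involves training a model on games played by humans. Reinforcement learning has been shown to be a viable machine learning approach that, when combined with self-play, can train a neural network for chess position evaluation without the need for human domain knowledge. This paper discusses our implementation of a reinforcement-learning-based chess engine trained using self-play.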
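To make the idea of learning a position evaluation from self-play concrete, here is a minimal sketch. It is not the authors' implementation, and it rests on several assumptions made only for illustration: a small subtraction game (take 1 to 3 stones, taking the last stone wins) stands in for chess, a plain value table stands in for the neural network, and the names `train_self_play`, `best_move`, and `legal_moves` and all hyperparameters are hypothetical. What it shares with the approach described in the abstract is that no human games or handcrafted heuristics are used: the only training signal is the outcome of games the learner plays against itself.

```python
import random

MAX_TAKE = 3  # assumed toy rule: a move removes 1..3 stones; taking the last stone wins


def legal_moves(stones):
    """Numbers of stones that may be removed from the current pile."""
    return range(1, min(MAX_TAKE, stones) + 1)


def best_move(values, stones):
    """Greedy move under the current evaluation.

    values[n] scores a pile of n stones from the point of view of the side
    to move, so the position we hand to the opponent is worth -values[...]
    to us (negamax convention).
    """
    return max(legal_moves(stones), key=lambda take: -values[stones - take])


def train_self_play(start=10, games=5000, alpha=0.5, epsilon=0.3, seed=0):
    """Learn a value table purely from games the learner plays against itself."""
    rng = random.Random(seed)
    values = [0.0] * (start + 1)
    values[0] = -1.0  # empty pile: the side to move has already lost
    for _ in range(games):
        stones = start
        while stones > 0:
            # Temporal-difference backup towards the best reachable outcome.
            target = max(-values[stones - take] for take in legal_moves(stones))
            values[stones] += alpha * (target - values[stones])
            # Both sides share the same table; explore with probability epsilon.
            if rng.random() < epsilon:
                take = rng.choice(list(legal_moves(stones)))
            else:
                take = best_move(values, stones)
            stones -= take
    return values


if __name__ == "__main__":
    learned = train_self_play()
    for pile, value in enumerate(learned):
        print(pile, round(value, 2))
    print("suggested first move from 10 stones:", best_move(learned, 10))
```

In this toy game the learned table should separate piles that are multiples of four (lost for the side to move) from all others. A chess engine of the kind the abstract describes would replace the table with a neural network evaluated on board positions, which this sketch does not attempt.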