{"title":"Reinforcement Learning for Build-Order Production in StarCraft II","authors":"Zhentao Tang, Dongbin Zhao, Yuanheng Zhu, Ping Guo","doi":"10.1109/ICIST.2018.8426160","DOIUrl":null,"url":null,"abstract":"StarCraft II is one of the most popular real-time strategy games and has become an important benchmark for AI research as it provides a complex environment with numerous challenges. The build order problem is one of the key challenges which concern the order and type of buildings and units to produce based on current game situation. In contrast to existing hand-craft methods, we propose two reinforcement learning based models: Neural Network Fitted Q-Learning (NNFQ) and Convolutional Neural Network Fitted Q-Learning (CNNFQ). NNFQ and CNNFQ have been applied into a simple bot for fighting against the enemy race. Experimental results show that both these two models are capable of finding the most effective production sequence to defeat the opponent.","PeriodicalId":331555,"journal":{"name":"2018 Eighth International Conference on Information Science and Technology (ICIST)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Eighth International Conference on Information Science and Technology (ICIST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIST.2018.8426160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18
Abstract
StarCraft II is one of the most popular real-time strategy games and has become an important benchmark for AI research as it provides a complex environment with numerous challenges. The build order problem is one of the key challenges which concern the order and type of buildings and units to produce based on current game situation. In contrast to existing hand-craft methods, we propose two reinforcement learning based models: Neural Network Fitted Q-Learning (NNFQ) and Convolutional Neural Network Fitted Q-Learning (CNNFQ). NNFQ and CNNFQ have been applied into a simple bot for fighting against the enemy race. Experimental results show that both these two models are capable of finding the most effective production sequence to defeat the opponent.