Keynote speech I: Co-evolutionary learning in game-playing
X. Yao
2016 IEEE Conference on Computational Intelligence and Games (CIG), p. 16. Published 2015-08-01.
DOI: 10.1109/CIG.2015.7317657
Co-evolution has been widely used for the automatic learning of game-playing strategies, e.g., for the iterated prisoner's dilemma, backgammon, and chess. It is a particularly interesting form of learning because it learns through interactions only, without any explicit target output information. In other words, the correct choices or moves are not provided as teacher information during learning. Yet co-evolutionary learning is still able to learn game-playing strategies that perform well in comparison to average human performance. Interestingly, research on co-evolutionary learning has not focused on its generalisation ability, in sharp contrast to machine learning in general, where generalisation is at the heart of learning of any form. This talk presents one of the few generic frameworks available for measuring the generalisation of co-evolutionary learning. It enables us to discuss and study the generalisation of different co-evolutionary algorithms more objectively and quantitatively. As a result, it enables us to draw more appropriate conclusions about the ability of our learned game-playing strategies to deal with totally new and unseen environments (including opponents). The iterated prisoner's dilemma game will be used as an example in this talk to illustrate our theoretical framework and the performance improvements we could gain by following this more principled approach to co-evolutionary learning.
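The abstract does not spell out the framework itself, so the following is only an illustrative sketch, not the method presented in the talk. It assumes one plausible reading of "generalisation" for the iterated prisoner's dilemma: the average payoff a strategy earns against test opponents drawn at random from a strategy space, rather than against the opponents it met during learning. The payoff values (5, 3, 1, 0), the restriction to deterministic memory-one strategies, and all function names are assumptions introduced for this example.

```python
import itertools
import random

# Assumed standard prisoner's dilemma payoffs (the abstract gives none).
# Moves: 0 = cooperate, 1 = defect. PAYOFF[(my_move, their_move)] = my payoff.
PAYOFF = {(0, 0): 3, (0, 1): 0, (1, 0): 5, (1, 1): 1}

# Hypothetical encoding of a deterministic memory-one strategy:
# (first_move, reply_CC, reply_CD, reply_DC, reply_DD),
# where each reply is indexed by (my_last_move, their_last_move).
ALL_STRATEGIES = list(itertools.product((0, 1), repeat=5))  # 32 strategies

TIT_FOR_TAT = (0, 0, 1, 0, 1)    # cooperate first, then copy the opponent
ALWAYS_DEFECT = (1, 1, 1, 1, 1)


def next_move(strategy, my_last, their_last):
    """Look up the reply to the previous round's pair of moves."""
    return strategy[1 + 2 * my_last + their_last]


def average_payoff(strategy, opponent, rounds=50):
    """Average per-round payoff of `strategy` against `opponent`."""
    a, b = strategy[0], opponent[0]
    total = 0
    for _ in range(rounds):
        total += PAYOFF[(a, b)]
        # Both players update simultaneously from the round just played.
        a, b = next_move(strategy, a, b), next_move(opponent, b, a)
    return total / rounds


def estimate_generalisation(strategy, n_tests=200, rounds=50, seed=0):
    """Monte Carlo estimate: mean payoff against randomly sampled opponents."""
    rng = random.Random(seed)
    tests = [rng.choice(ALL_STRATEGIES) for _ in range(n_tests)]
    return sum(average_payoff(strategy, t, rounds) for t in tests) / n_tests


def exact_generalisation(strategy, rounds=50):
    """Mean payoff against every strategy in this (small) space."""
    scores = [average_payoff(strategy, t, rounds) for t in ALL_STRATEGIES]
    return sum(scores) / len(scores)
```

Calling `estimate_generalisation(TIT_FOR_TAT)` and comparing it with `exact_generalisation(TIT_FOR_TAT)` shows how closely a random sample of test opponents approximates the average over the whole space. In larger strategy spaces only the sampled estimate is feasible, which is why a sampling-based measure is used in this sketch.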