{"title":"一个解决问题的环境,帮助模型开发的强化学习算法","authors":"Taiyo Maeda, Y. Aoki, T. Murata","doi":"10.1109/ICCIT.2010.5711068","DOIUrl":null,"url":null,"abstract":"This paper reports a problem solving environments (PSE) to assist researchers who study stochastic simulations such as reinforcement learning algorithms. They have to run their programs many times to compare their algorithms and find better sets of parameters for their programs. In order to reduce the working time, this system has three sub-systems: a distributed computing system, a data management system and a graph generation system. Using this system, we conduct experiments with human subjects. They register their programs, run them on a distributed computing system, obtain results automatically, and compare them graphically. As a result, a user obtained five times speedup for the work time. We present a relationship between development of algorithms and the three sub-systems.","PeriodicalId":131337,"journal":{"name":"5th International Conference on Computer Sciences and Convergence Information Technology","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A problem solving environment that assists model development for reinforcement learning algorithms\",\"authors\":\"Taiyo Maeda, Y. Aoki, T. Murata\",\"doi\":\"10.1109/ICCIT.2010.5711068\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper reports a problem solving environments (PSE) to assist researchers who study stochastic simulations such as reinforcement learning algorithms. They have to run their programs many times to compare their algorithms and find better sets of parameters for their programs. In order to reduce the working time, this system has three sub-systems: a distributed computing system, a data management system and a graph generation system. Using this system, we conduct experiments with human subjects. They register their programs, run them on a distributed computing system, obtain results automatically, and compare them graphically. As a result, a user obtained five times speedup for the work time. We present a relationship between development of algorithms and the three sub-systems.\",\"PeriodicalId\":131337,\"journal\":{\"name\":\"5th International Conference on Computer Sciences and Convergence Information Technology\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"5th International Conference on Computer Sciences and Convergence Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIT.2010.5711068\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"5th International Conference on Computer Sciences and Convergence Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT.2010.5711068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A problem solving environment that assists model development for reinforcement learning algorithms
This paper reports a problem solving environments (PSE) to assist researchers who study stochastic simulations such as reinforcement learning algorithms. They have to run their programs many times to compare their algorithms and find better sets of parameters for their programs. In order to reduce the working time, this system has three sub-systems: a distributed computing system, a data management system and a graph generation system. Using this system, we conduct experiments with human subjects. They register their programs, run them on a distributed computing system, obtain results automatically, and compare them graphically. As a result, a user obtained five times speedup for the work time. We present a relationship between development of algorithms and the three sub-systems.