{"title":"A problem solving environment that assists model development for reinforcement learning algorithms","authors":"Taiyo Maeda, Y. Aoki, T. Murata","doi":"10.1109/ICCIT.2010.5711068","DOIUrl":null,"url":null,"abstract":"This paper reports a problem solving environments (PSE) to assist researchers who study stochastic simulations such as reinforcement learning algorithms. They have to run their programs many times to compare their algorithms and find better sets of parameters for their programs. In order to reduce the working time, this system has three sub-systems: a distributed computing system, a data management system and a graph generation system. Using this system, we conduct experiments with human subjects. They register their programs, run them on a distributed computing system, obtain results automatically, and compare them graphically. As a result, a user obtained five times speedup for the work time. We present a relationship between development of algorithms and the three sub-systems.","PeriodicalId":131337,"journal":{"name":"5th International Conference on Computer Sciences and Convergence Information Technology","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"5th International Conference on Computer Sciences and Convergence Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT.2010.5711068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper reports a problem solving environments (PSE) to assist researchers who study stochastic simulations such as reinforcement learning algorithms. They have to run their programs many times to compare their algorithms and find better sets of parameters for their programs. In order to reduce the working time, this system has three sub-systems: a distributed computing system, a data management system and a graph generation system. Using this system, we conduct experiments with human subjects. They register their programs, run them on a distributed computing system, obtain results automatically, and compare them graphically. As a result, a user obtained five times speedup for the work time. We present a relationship between development of algorithms and the three sub-systems.