{"title":"模仿学习与强化学习:关于绘画动作的习得","authors":"Tatsuya Sakato, Motoyuki Ozeki, N. Oka","doi":"10.1109/IIAI-AAI.2014.174","DOIUrl":null,"url":null,"abstract":"Learning is essential for an autonomous agent to adapt to an environment. One method of learning is through trial and error, however, this method is impractical in a complex environment because of the long learning time required by the agent. Therefore, guidelines are necessary in order to expedite the learning process in such environments, and imitation is one such guideline. Sakato, Ozeki, and Oka (2012-2013) recently proposed a computational model of imitation and autonomous behavior by which an agent can reduce its learning time through imitation. They evaluate the model in discrete and continuous spaces, and apply the model to a real robot in order to acquire painting skills. Their experimental results indicate that the model adapted to the experimental environment by imitation. In this paper, we introduce the model and discuss what are needed to improve the model.","PeriodicalId":432222,"journal":{"name":"2014 IIAI 3rd International Conference on Advanced Applied Informatics","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Learning through Imitation and Reinforcement Learning: Toward the Acquisition of Painting Motions\",\"authors\":\"Tatsuya Sakato, Motoyuki Ozeki, N. Oka\",\"doi\":\"10.1109/IIAI-AAI.2014.174\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Learning is essential for an autonomous agent to adapt to an environment. One method of learning is through trial and error, however, this method is impractical in a complex environment because of the long learning time required by the agent. Therefore, guidelines are necessary in order to expedite the learning process in such environments, and imitation is one such guideline. Sakato, Ozeki, and Oka (2012-2013) recently proposed a computational model of imitation and autonomous behavior by which an agent can reduce its learning time through imitation. They evaluate the model in discrete and continuous spaces, and apply the model to a real robot in order to acquire painting skills. Their experimental results indicate that the model adapted to the experimental environment by imitation. In this paper, we introduce the model and discuss what are needed to improve the model.\",\"PeriodicalId\":432222,\"journal\":{\"name\":\"2014 IIAI 3rd International Conference on Advanced Applied Informatics\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 IIAI 3rd International Conference on Advanced Applied Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IIAI-AAI.2014.174\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IIAI 3rd International Conference on Advanced Applied Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIAI-AAI.2014.174","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Learning through Imitation and Reinforcement Learning: Toward the Acquisition of Painting Motions
Learning is essential for an autonomous agent to adapt to an environment. One method of learning is through trial and error, however, this method is impractical in a complex environment because of the long learning time required by the agent. Therefore, guidelines are necessary in order to expedite the learning process in such environments, and imitation is one such guideline. Sakato, Ozeki, and Oka (2012-2013) recently proposed a computational model of imitation and autonomous behavior by which an agent can reduce its learning time through imitation. They evaluate the model in discrete and continuous spaces, and apply the model to a real robot in order to acquire painting skills. Their experimental results indicate that the model adapted to the experimental environment by imitation. In this paper, we introduce the model and discuss what are needed to improve the model.