{"title":"多异构模块强化学习:发展型机器人学习框架","authors":"E. Uchibe, K. Doya","doi":"10.1109/DEVLRN.2005.1490949","DOIUrl":null,"url":null,"abstract":"Developmental learning approach by changing the internal state representation from simple to complex is promising in order for a robot to learn behaviors efficiently. We have proposed a reinforcement learning (RL) method for multiple learning modules with different state representations and algorithms. One of interesting results we showed is that a complex RL system can learn faster with the help of simpler RL systems that can not obtain the best performance. However, it did not consider the difference in sampling rates of learning modules. This paper discusses how the interaction among multiple learning modules with different sampling rates affects the robot learning. Experimental results in navigation task show that developmental learning described above is not always good strategy","PeriodicalId":297121,"journal":{"name":"Proceedings. The 4nd International Conference on Development and Learning, 2005.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Reinforcement Learning with Multiple Heterogeneous Modules: A Framework for Developmental Robot Learning\",\"authors\":\"E. Uchibe, K. Doya\",\"doi\":\"10.1109/DEVLRN.2005.1490949\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Developmental learning approach by changing the internal state representation from simple to complex is promising in order for a robot to learn behaviors efficiently. We have proposed a reinforcement learning (RL) method for multiple learning modules with different state representations and algorithms. One of interesting results we showed is that a complex RL system can learn faster with the help of simpler RL systems that can not obtain the best performance. However, it did not consider the difference in sampling rates of learning modules. This paper discusses how the interaction among multiple learning modules with different sampling rates affects the robot learning. Experimental results in navigation task show that developmental learning described above is not always good strategy\",\"PeriodicalId\":297121,\"journal\":{\"name\":\"Proceedings. The 4nd International Conference on Development and Learning, 2005.\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2005-07-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings. The 4nd International Conference on Development and Learning, 2005.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DEVLRN.2005.1490949\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. The 4nd International Conference on Development and Learning, 2005.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEVLRN.2005.1490949","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement Learning with Multiple Heterogeneous Modules: A Framework for Developmental Robot Learning
Developmental learning approach by changing the internal state representation from simple to complex is promising in order for a robot to learn behaviors efficiently. We have proposed a reinforcement learning (RL) method for multiple learning modules with different state representations and algorithms. One of interesting results we showed is that a complex RL system can learn faster with the help of simpler RL systems that can not obtain the best performance. However, it did not consider the difference in sampling rates of learning modules. This paper discusses how the interaction among multiple learning modules with different sampling rates affects the robot learning. Experimental results in navigation task show that developmental learning described above is not always good strategy