Toshiaki Takano, H. Takase, H. Kawanaka, S. Tsuruoka
{"title":"同一迁移模型下强化学习的迁移方法——快速逼近与优先探索","authors":"Toshiaki Takano, H. Takase, H. Kawanaka, S. Tsuruoka","doi":"10.1109/ICMLA.2011.148","DOIUrl":null,"url":null,"abstract":"We aim to accelerate learning processes in reinforcement learning by transfer learning. Its concept is that knowledge to solve similar tasks accelerates a learning process of a target task. We have proposed that the basic transfer method based on forbidden rule set that is a set of rules which cause to immediately failure of a target task. However, the basic method works poorly for the gSame Transition Model,h which has same state transition probability and different goal. In this article, we propose an effective transfer learning method in same transition model. In detail, it consists of two strategies: (1) approaching to the goal for the selected source task quickly, and (2) exploring states around the goal preferentially.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Transfer Method for Reinforcement Learning in Same Transition Model -- Quick Approach and Preferential Exploration\",\"authors\":\"Toshiaki Takano, H. Takase, H. Kawanaka, S. Tsuruoka\",\"doi\":\"10.1109/ICMLA.2011.148\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We aim to accelerate learning processes in reinforcement learning by transfer learning. Its concept is that knowledge to solve similar tasks accelerates a learning process of a target task. We have proposed that the basic transfer method based on forbidden rule set that is a set of rules which cause to immediately failure of a target task. However, the basic method works poorly for the gSame Transition Model,h which has same state transition probability and different goal. In this article, we propose an effective transfer learning method in same transition model. In detail, it consists of two strategies: (1) approaching to the goal for the selected source task quickly, and (2) exploring states around the goal preferentially.\",\"PeriodicalId\":439926,\"journal\":{\"name\":\"2011 10th International Conference on Machine Learning and Applications and Workshops\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 10th International Conference on Machine Learning and Applications and Workshops\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLA.2011.148\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 10th International Conference on Machine Learning and Applications and Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2011.148","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Transfer Method for Reinforcement Learning in Same Transition Model -- Quick Approach and Preferential Exploration
We aim to accelerate learning processes in reinforcement learning by transfer learning. Its concept is that knowledge to solve similar tasks accelerates a learning process of a target task. We have proposed that the basic transfer method based on forbidden rule set that is a set of rules which cause to immediately failure of a target task. However, the basic method works poorly for the gSame Transition Model,h which has same state transition probability and different goal. In this article, we propose an effective transfer learning method in same transition model. In detail, it consists of two strategies: (1) approaching to the goal for the selected source task quickly, and (2) exploring states around the goal preferentially.