{"title":"Reinforcement Learning for Online Learning Recommendation System","authors":"Wacharawan Intayoad, Chayapol Kamyod, P. Temdee","doi":"10.1109/GWS.2018.8686513","DOIUrl":null,"url":null,"abstract":"In the learning environment, individual learner requires flexible and suitable learning processes. Online learning should be able to recommend appropriate learning objects (LOs) to an individual in real-time. Most of the existing approaches of online learning recommendation systems are based on collaborative filtering methods. Such methods have a limitation on realtime adaption and require the prior knowledge of students and LOs. Therefore, this study proposes a real-time recommendation method which is suitable for flexible and complex environments. The proposed method is based on Reinforcement Learning problem. The method is able to explore the environment to get information and exploit the information to make a decision. We evaluate the proposed method with the real world data. We vary e-greedy, the learning rate, and the discount rate for a tradeoff between the exploration and exploitation.","PeriodicalId":256742,"journal":{"name":"2018 Global Wireless Summit (GWS)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Global Wireless Summit (GWS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GWS.2018.8686513","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
In the learning environment, individual learner requires flexible and suitable learning processes. Online learning should be able to recommend appropriate learning objects (LOs) to an individual in real-time. Most of the existing approaches of online learning recommendation systems are based on collaborative filtering methods. Such methods have a limitation on realtime adaption and require the prior knowledge of students and LOs. Therefore, this study proposes a real-time recommendation method which is suitable for flexible and complex environments. The proposed method is based on Reinforcement Learning problem. The method is able to explore the environment to get information and exploit the information to make a decision. We evaluate the proposed method with the real world data. We vary e-greedy, the learning rate, and the discount rate for a tradeoff between the exploration and exploitation.