在线学习推荐系统的强化学习

2018 Global Wireless Summit (GWS) Pub Date : 2018-11-01 DOI:10.1109/GWS.2018.8686513

Wacharawan Intayoad, Chayapol Kamyod, P. Temdee

{"title":"在线学习推荐系统的强化学习","authors":"Wacharawan Intayoad, Chayapol Kamyod, P. Temdee","doi":"10.1109/GWS.2018.8686513","DOIUrl":null,"url":null,"abstract":"In the learning environment, individual learner requires flexible and suitable learning processes. Online learning should be able to recommend appropriate learning objects (LOs) to an individual in real-time. Most of the existing approaches of online learning recommendation systems are based on collaborative filtering methods. Such methods have a limitation on realtime adaption and require the prior knowledge of students and LOs. Therefore, this study proposes a real-time recommendation method which is suitable for flexible and complex environments. The proposed method is based on Reinforcement Learning problem. The method is able to explore the environment to get information and exploit the information to make a decision. We evaluate the proposed method with the real world data. We vary e-greedy, the learning rate, and the discount rate for a tradeoff between the exploration and exploitation.","PeriodicalId":256742,"journal":{"name":"2018 Global Wireless Summit (GWS)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Reinforcement Learning for Online Learning Recommendation System\",\"authors\":\"Wacharawan Intayoad, Chayapol Kamyod, P. Temdee\",\"doi\":\"10.1109/GWS.2018.8686513\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the learning environment, individual learner requires flexible and suitable learning processes. Online learning should be able to recommend appropriate learning objects (LOs) to an individual in real-time. Most of the existing approaches of online learning recommendation systems are based on collaborative filtering methods. Such methods have a limitation on realtime adaption and require the prior knowledge of students and LOs. Therefore, this study proposes a real-time recommendation method which is suitable for flexible and complex environments. The proposed method is based on Reinforcement Learning problem. The method is able to explore the environment to get information and exploit the information to make a decision. We evaluate the proposed method with the real world data. We vary e-greedy, the learning rate, and the discount rate for a tradeoff between the exploration and exploitation.\",\"PeriodicalId\":256742,\"journal\":{\"name\":\"2018 Global Wireless Summit (GWS)\",\"volume\":\"70 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Global Wireless Summit (GWS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/GWS.2018.8686513\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Global Wireless Summit (GWS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GWS.2018.8686513","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

摘要

在学习环境中，个体学习者需要灵活和合适的学习过程。在线学习应该能够实时向个人推荐合适的学习对象(LOs)。现有的在线学习推荐系统大多是基于协同过滤的方法。这种方法在实时适应方面存在局限性，并且需要学生和LOs的先验知识。因此，本研究提出了一种适合于灵活复杂环境的实时推荐方法。该方法基于强化学习问题。该方法能够探索环境以获取信息，并利用信息做出决策。我们用真实世界的数据来评估所提出的方法。我们改变了e-greedy，学习率，以及在探索和开发之间权衡的折现率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Reinforcement Learning for Online Learning Recommendation System

In the learning environment, individual learner requires flexible and suitable learning processes. Online learning should be able to recommend appropriate learning objects (LOs) to an individual in real-time. Most of the existing approaches of online learning recommendation systems are based on collaborative filtering methods. Such methods have a limitation on realtime adaption and require the prior knowledge of students and LOs. Therefore, this study proposes a real-time recommendation method which is suitable for flexible and complex environments. The proposed method is based on Reinforcement Learning problem. The method is able to explore the environment to get information and exploit the information to make a decision. We evaluate the proposed method with the real world data. We vary e-greedy, the learning rate, and the discount rate for a tradeoff between the exploration and exploitation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 Global Wireless Summit (GWS)

自引率

0.00%

发文量