在线学习推荐系统的强化学习

Wacharawan Intayoad, Chayapol Kamyod, P. Temdee
{"title":"在线学习推荐系统的强化学习","authors":"Wacharawan Intayoad, Chayapol Kamyod, P. Temdee","doi":"10.1109/GWS.2018.8686513","DOIUrl":null,"url":null,"abstract":"In the learning environment, individual learner requires flexible and suitable learning processes. Online learning should be able to recommend appropriate learning objects (LOs) to an individual in real-time. Most of the existing approaches of online learning recommendation systems are based on collaborative filtering methods. Such methods have a limitation on realtime adaption and require the prior knowledge of students and LOs. Therefore, this study proposes a real-time recommendation method which is suitable for flexible and complex environments. The proposed method is based on Reinforcement Learning problem. The method is able to explore the environment to get information and exploit the information to make a decision. We evaluate the proposed method with the real world data. We vary e-greedy, the learning rate, and the discount rate for a tradeoff between the exploration and exploitation.","PeriodicalId":256742,"journal":{"name":"2018 Global Wireless Summit (GWS)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Reinforcement Learning for Online Learning Recommendation System\",\"authors\":\"Wacharawan Intayoad, Chayapol Kamyod, P. Temdee\",\"doi\":\"10.1109/GWS.2018.8686513\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the learning environment, individual learner requires flexible and suitable learning processes. Online learning should be able to recommend appropriate learning objects (LOs) to an individual in real-time. Most of the existing approaches of online learning recommendation systems are based on collaborative filtering methods. Such methods have a limitation on realtime adaption and require the prior knowledge of students and LOs. Therefore, this study proposes a real-time recommendation method which is suitable for flexible and complex environments. The proposed method is based on Reinforcement Learning problem. The method is able to explore the environment to get information and exploit the information to make a decision. We evaluate the proposed method with the real world data. We vary e-greedy, the learning rate, and the discount rate for a tradeoff between the exploration and exploitation.\",\"PeriodicalId\":256742,\"journal\":{\"name\":\"2018 Global Wireless Summit (GWS)\",\"volume\":\"70 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Global Wireless Summit (GWS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/GWS.2018.8686513\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Global Wireless Summit (GWS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GWS.2018.8686513","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

摘要

在学习环境中,个体学习者需要灵活和合适的学习过程。在线学习应该能够实时向个人推荐合适的学习对象(LOs)。现有的在线学习推荐系统大多是基于协同过滤的方法。这种方法在实时适应方面存在局限性,并且需要学生和LOs的先验知识。因此,本研究提出了一种适合于灵活复杂环境的实时推荐方法。该方法基于强化学习问题。该方法能够探索环境以获取信息,并利用信息做出决策。我们用真实世界的数据来评估所提出的方法。我们改变了e-greedy,学习率,以及在探索和开发之间权衡的折现率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Reinforcement Learning for Online Learning Recommendation System
In the learning environment, individual learner requires flexible and suitable learning processes. Online learning should be able to recommend appropriate learning objects (LOs) to an individual in real-time. Most of the existing approaches of online learning recommendation systems are based on collaborative filtering methods. Such methods have a limitation on realtime adaption and require the prior knowledge of students and LOs. Therefore, this study proposes a real-time recommendation method which is suitable for flexible and complex environments. The proposed method is based on Reinforcement Learning problem. The method is able to explore the environment to get information and exploit the information to make a decision. We evaluate the proposed method with the real world data. We vary e-greedy, the learning rate, and the discount rate for a tradeoff between the exploration and exploitation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
3D Channel Spatial Characteristic Emulation in Multi-Probe Anechoic Chamber Setups Business Model Innovation Coaching in a Three-Dimensional Continuum Design of Agency Communication for Contingency Cellular Network Performance Evaluation of Artificial Megneto-Dielectric Metamaterial with Microstrip Patch Antenna for Wireless System A Conceptual Model for Promoting Positive Security Behavior in Internet of Things Era
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1