运用Q-Learning实现加法问题教学策略的个性化

2021 International Conference on Signal Processing and Machine Learning (CONF-SPML) Pub Date : 2021-11-01 DOI:10.1109/CONF-SPML54095.2021.00043

Danyating Shen, Takara E. Truong, C. Weintz

{"title":"运用Q-Learning实现加法问题教学策略的个性化","authors":"Danyating Shen, Takara E. Truong, C. Weintz","doi":"10.1109/CONF-SPML54095.2021.00043","DOIUrl":null,"url":null,"abstract":"The prevalence of COVID-19 has illuminated the need for practical digital education tools over the past year. With students studying from home, teachers have struggled to provide their students with adequately challenging coursework. Our project aims to solve this issue in the context of math. More specifically, our goal is to encourage thoughtful learning by supplying students with personalized two-number addition problems that take time to solve but expect the student to answer correctly still. Our solution is to model the process of selecting a math problem to give a student as a Markov Decision Process (MDP) and then use Q-learning to determine the best policy for arriving at the most optimally challenging two-number addition problem for that student. The project creates three student simulators based on group member data. We show that it took student one: $(162 \\pm 134)$ iterations to give appropriate level problems where the first entry is mean and the second is the standard deviation. Student two took $(230 \\pm 205)$ iterations, and student three took $(247 \\pm 236)$ iterations. Lastly, we demonstrate that pre-training our model on students two and three and testing on student one showed a significant improvement from $(162 \\pm 134)$ iterations to $(35 \\pm 44)$ iterations.","PeriodicalId":415094,"journal":{"name":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","volume":"2010 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Using Q-Learning to Personalize Pedagogical Policies for Addition Problems\",\"authors\":\"Danyating Shen, Takara E. Truong, C. Weintz\",\"doi\":\"10.1109/CONF-SPML54095.2021.00043\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The prevalence of COVID-19 has illuminated the need for practical digital education tools over the past year. With students studying from home, teachers have struggled to provide their students with adequately challenging coursework. Our project aims to solve this issue in the context of math. More specifically, our goal is to encourage thoughtful learning by supplying students with personalized two-number addition problems that take time to solve but expect the student to answer correctly still. Our solution is to model the process of selecting a math problem to give a student as a Markov Decision Process (MDP) and then use Q-learning to determine the best policy for arriving at the most optimally challenging two-number addition problem for that student. The project creates three student simulators based on group member data. We show that it took student one: $(162 \\\\pm 134)$ iterations to give appropriate level problems where the first entry is mean and the second is the standard deviation. Student two took $(230 \\\\pm 205)$ iterations, and student three took $(247 \\\\pm 236)$ iterations. Lastly, we demonstrate that pre-training our model on students two and three and testing on student one showed a significant improvement from $(162 \\\\pm 134)$ iterations to $(35 \\\\pm 44)$ iterations.\",\"PeriodicalId\":415094,\"journal\":{\"name\":\"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)\",\"volume\":\"2010 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CONF-SPML54095.2021.00043\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONF-SPML54095.2021.00043","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

过去一年，COVID-19的流行凸显了对实用数字教育工具的需求。由于学生在家学习，老师们一直在努力为他们的学生提供足够有挑战性的课程。我们的项目旨在在数学的背景下解决这个问题。更具体地说，我们的目标是通过为学生提供个性化的两数加法问题来鼓励深思熟虑的学习，这些问题需要时间来解决，但希望学生仍然能正确回答。我们的解决方案是将选择数学问题的过程建模为马尔可夫决策过程(MDP)，然后使用Q-learning来确定最佳策略，以达到对该学生最具挑战性的两数加法问题。该项目基于小组成员数据创建了三个学生模拟器。我们展示了学生1:$(162 \pm 134)$迭代来给出适当级别的问题，其中第一个条目是平均值，第二个是标准差。学生2获得$(230 \pm 205)$迭代，学生3获得$(247 \pm 236)$迭代。最后，我们证明了在学生2和3上预训练我们的模型并在学生1上进行测试显示了从$(162 \pm 134)$迭代到$(35 \pm 44)$迭代的显着改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Using Q-Learning to Personalize Pedagogical Policies for Addition Problems

The prevalence of COVID-19 has illuminated the need for practical digital education tools over the past year. With students studying from home, teachers have struggled to provide their students with adequately challenging coursework. Our project aims to solve this issue in the context of math. More specifically, our goal is to encourage thoughtful learning by supplying students with personalized two-number addition problems that take time to solve but expect the student to answer correctly still. Our solution is to model the process of selecting a math problem to give a student as a Markov Decision Process (MDP) and then use Q-learning to determine the best policy for arriving at the most optimally challenging two-number addition problem for that student. The project creates three student simulators based on group member data. We show that it took student one: $(162 \pm 134)$ iterations to give appropriate level problems where the first entry is mean and the second is the standard deviation. Student two took $(230 \pm 205)$ iterations, and student three took $(247 \pm 236)$ iterations. Lastly, we demonstrate that pre-training our model on students two and three and testing on student one showed a significant improvement from $(162 \pm 134)$ iterations to $(35 \pm 44)$ iterations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)

自引率

0.00%

发文量