A reinforcement learning approach to adaptive remediation in online training

IF 0.6 Q3 ENGINEERING, MULTIDISCIPLINARY Journal of Defense Modeling and Simulation-Applications Methodology Technology-JDMS Pub Date : 2021-07-23 DOI:10.1177/15485129211028317

Randall D. Spain, Jonathan P. Rowe, A. Smith, B. Goldberg, R. Pokorny, Bradford W. Mott, James C. Lester

{"title":"A reinforcement learning approach to adaptive remediation in online training","authors":"Randall D. Spain, Jonathan P. Rowe, A. Smith, B. Goldberg, R. Pokorny, Bradford W. Mott, James C. Lester","doi":"10.1177/15485129211028317","DOIUrl":null,"url":null,"abstract":"Advances in artificial intelligence (AI) and machine learning can be leveraged to tailor training based on the goals, learning needs, and preferences of learners. A key component of adaptive training systems is tutorial planning, which controls how scaffolding is structured and delivered to learners to create dynamically personalized learning experiences. The goal of this study was to induce data-driven policies for tutorial planning using reinforcement learning (RL) to provide adaptive scaffolding based on the Interactive, Constructive, Active, Passive framework for cognitive engagement. We describe a dataset that was collected to induce RL-based scaffolding policies, and we present the results of our policy analyses. Results showed that the best performing policies optimized learning gains by inducing an adaptive fading approach in which learners received less cognitively engaging forms of remediation as they advanced through the training course. This policy was consistent with preliminary analyses that showed constructive remediation became less effective as learners progressed through the training session. Results also showed that learners’ prior knowledge impacted the type of scaffold that was recommended, thus showing evidence of an aptitude–treatment interaction. We conclude with a discussion of how AI-based training can be leveraged to enhance training effectiveness as well as directions for future research.","PeriodicalId":44661,"journal":{"name":"Journal of Defense Modeling and Simulation-Applications Methodology Technology-JDMS","volume":"1 1","pages":"173 - 193"},"PeriodicalIF":0.6000,"publicationDate":"2021-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Defense Modeling and Simulation-Applications Methodology Technology-JDMS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/15485129211028317","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 2

Abstract

Advances in artificial intelligence (AI) and machine learning can be leveraged to tailor training based on the goals, learning needs, and preferences of learners. A key component of adaptive training systems is tutorial planning, which controls how scaffolding is structured and delivered to learners to create dynamically personalized learning experiences. The goal of this study was to induce data-driven policies for tutorial planning using reinforcement learning (RL) to provide adaptive scaffolding based on the Interactive, Constructive, Active, Passive framework for cognitive engagement. We describe a dataset that was collected to induce RL-based scaffolding policies, and we present the results of our policy analyses. Results showed that the best performing policies optimized learning gains by inducing an adaptive fading approach in which learners received less cognitively engaging forms of remediation as they advanced through the training course. This policy was consistent with preliminary analyses that showed constructive remediation became less effective as learners progressed through the training session. Results also showed that learners’ prior knowledge impacted the type of scaffold that was recommended, thus showing evidence of an aptitude–treatment interaction. We conclude with a discussion of how AI-based training can be leveraged to enhance training effectiveness as well as directions for future research.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

在线培训中自适应修复的强化学习方法

人工智能(AI)和机器学习的进步可以根据学习者的目标、学习需求和偏好来定制培训。适应性培训系统的一个关键组成部分是教程规划，它控制如何搭建和交付给学习者创建动态个性化的学习体验。本研究的目的是利用强化学习(RL)诱导数据驱动的教学计划策略，以提供基于认知参与的交互式、建设性、主动、被动框架的自适应脚手架。我们描述了一个收集的数据集，该数据集用于诱导基于rl的脚手架策略，并展示了我们的政策分析结果。结果表明，表现最好的策略通过诱导自适应消退方法优化了学习收益，在这种方法中，学习者在培训课程中获得较少的认知参与形式的补救。这一政策与初步分析相一致，该分析表明，随着学习者在培训阶段的进步，建设性的补习变得不那么有效。结果还表明，学习者的先验知识影响了推荐的支架类型，从而显示了能力-治疗相互作用的证据。最后，我们讨论了如何利用基于人工智能的训练来提高训练效率以及未来的研究方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊