首页 > 最新文献

IEEE Transactions on Learning Technologies最新文献

英文 中文
How Well Can Tutoring Audio Be Autoclassified and Machine Explained With XAI: A Comparison of Three Types of Methods 如何利用 XAI 对辅导音频进行自动分类和机器解释:三种方法的比较
IF 3.7 3区 教育学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-03-22 DOI: 10.1109/TLT.2024.3381028
Lishan Zhang;Linyu Deng;Sixv Zhang;Ling Chen
With the popularity of online one-to-one tutoring, there are emerging concerns about the quality and effectiveness of this kind of tutoring. Although there are some evaluation methods available, they are heavily relied on manual coding by experts, which is too costly. Therefore, using machine learning to predict instruction quality automatically is an effective way to reduce human costs. Three classification methods are analyzed in this article: 1) random forest algorithm with human-engineered descriptive features; 2) long and short-term memory algorithm with acoustic features generated by open speech and music interpretation by large space extraction toolkit; and 3) convolutional neural network algorithm with Mel spectrogram of the audio. The results show that the three approaches can complete the prediction task well, with the second approach exhibiting the best accuracy. The importance of the features in these classification models is analyzed according to eXplainable Artificial Intelligence techniques (i.e., XAI) and statistical feature analysis methods. In this way, key indicators of high-quality tutoring are identified. This study demonstrated the usefulness of XAI techniques in understanding why some tutoring sessions are of good quality and others are not. The results can be potentially used to guide the improvement of online one-to-one tutoring in the future.
随着在线一对一辅导的普及,人们开始关注这种辅导的质量和效果。虽然目前已有一些评估方法,但这些方法主要依赖专家的人工编码,成本过高。因此,利用机器学习自动预测教学质量是降低人力成本的有效方法。本文分析了三种分类方法:1) 使用人工设计的描述性特征的随机森林算法;2) 使用大空间提取工具包通过开放语音和音乐解释生成的声学特征的长短期记忆算法;3) 使用音频的梅尔频谱图的卷积神经网络算法。结果表明,这三种方法都能很好地完成预测任务,其中第二种方法的准确率最高。根据可解释人工智能技术(即 XAI)和统计特征分析方法,分析了这些分类模型中特征的重要性。通过这种方法,确定了高质量辅导的关键指标。这项研究表明,XAI 技术有助于理解为什么有些辅导课质量高,而有些则质量低。研究结果可用于指导改进未来的在线一对一辅导。
{"title":"How Well Can Tutoring Audio Be Autoclassified and Machine Explained With XAI: A Comparison of Three Types of Methods","authors":"Lishan Zhang;Linyu Deng;Sixv Zhang;Ling Chen","doi":"10.1109/TLT.2024.3381028","DOIUrl":"10.1109/TLT.2024.3381028","url":null,"abstract":"With the popularity of online one-to-one tutoring, there are emerging concerns about the quality and effectiveness of this kind of tutoring. Although there are some evaluation methods available, they are heavily relied on manual coding by experts, which is too costly. Therefore, using machine learning to predict instruction quality automatically is an effective way to reduce human costs. Three classification methods are analyzed in this article: 1) random forest algorithm with human-engineered descriptive features; 2) long and short-term memory algorithm with acoustic features generated by open speech and music interpretation by large space extraction toolkit; and 3) convolutional neural network algorithm with Mel spectrogram of the audio. The results show that the three approaches can complete the prediction task well, with the second approach exhibiting the best accuracy. The importance of the features in these classification models is analyzed according to eXplainable Artificial Intelligence techniques (i.e., XAI) and statistical feature analysis methods. In this way, key indicators of high-quality tutoring are identified. This study demonstrated the usefulness of XAI techniques in understanding why some tutoring sessions are of good quality and others are not. The results can be potentially used to guide the improvement of online one-to-one tutoring in the future.","PeriodicalId":49191,"journal":{"name":"IEEE Transactions on Learning Technologies","volume":"17 ","pages":"1302-1312"},"PeriodicalIF":3.7,"publicationDate":"2024-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140198862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Program Repair for Introductory Programming Assignments 程序设计入门作业的自动程序修复
IF 3.7 3区 教育学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-03-21 DOI: 10.1109/TLT.2024.3403710
Han Wan;Hongzhen Luo;Mengying Li;Xiaoyan Luo
Automatic program repair (APR) tools are valuable for students to assist them with debugging tasks since program repair captures the code modification to make a buggy program pass the given test-suite. However, the process of manually generating catalogs of code modifications is intricate and time-consuming. This article proposes contextual error model repair (CEMR), an automated program repair tool for introductory programming assignments. CEMR is designed to learn program code modifications from incorrect–correct code pairs automatically. Then, it utilizes these code modifications along with CodeBERT, a generative AI, to repair students' new incorrect programs in the same programming assignment. CEMR builds on the observation that code edits performed by students in pairs of incorrect–correct code can be used as input–output examples for learning code modifications. The key idea of CEMR is to leverage the wisdom of the crowd: it uses the existing code modifications of incorrect–correct student code pairs to repair the new incorrect student attempts. We chose three of the most related APR tools, Refazer, Refactory, and AlphaRepair, as the baselines to compare against CEMR. The experimental results demonstrate that, on public and real classroom datasets, CEMR achieves higher repair rates than the baselines. Through further analysis, CEMR has demonstrated promising effectiveness in addressing semantical and logical errors while its performance in fixing syntactical errors is limited. In terms of time for repairing buggy programs, CEMR costs approximately half as much as AlphaRepair requires. We opine that CEMR not only be seen as a program repair method that achieves good results with incorrect–correct code pairs but also be further utilized to generate hints to better assist students in learning programming.
自动程序修复(APR)工具对于协助学生完成调试任务非常有价值,因为程序修复可以捕捉代码修改,使有错误的程序通过给定的测试套件。然而,手动生成代码修改目录的过程复杂而耗时。本文提出了上下文错误模型修复(CEMR)--一种用于编程入门作业的自动程序修复工具。CEMR 的设计目的是从错误-正确代码对中自动学习程序代码修改。然后,它利用这些代码修改和生成式人工智能 CodeBERT,修复学生在同一编程作业中的新错误程序。CEMR 基于以下观察结果:学生在错误-正确代码对中执行的代码编辑可用作学习代码修改的输入-输出示例。CEMR 的关键理念是利用群众的智慧:它使用错误-正确学生代码对的现有代码修改来修复新的错误学生尝试。我们选择了三个最相关的 APR 工具 Refazer、Refactory 和 AlphaRepair 作为基准与 CEMR 进行比较。实验结果表明,在公共和真实教室数据集上,CEMR 的修复率高于基线工具。通过进一步分析,CEMR 在解决语义和逻辑错误方面表现出了良好的效果,而在修复语法错误方面的表现却很有限。就修复错误程序所需的时间而言,CEMR 的成本约为 AlphaRepair 的一半。我们认为,CEMR 不仅可以作为一种程序修复方法,在修复错误代码对时取得良好效果,还可以进一步用于生成提示,以更好地帮助学生学习编程。
{"title":"Automated Program Repair for Introductory Programming Assignments","authors":"Han Wan;Hongzhen Luo;Mengying Li;Xiaoyan Luo","doi":"10.1109/TLT.2024.3403710","DOIUrl":"10.1109/TLT.2024.3403710","url":null,"abstract":"Automatic program repair (APR) tools are valuable for students to assist them with debugging tasks since program repair captures the code modification to make a buggy program pass the given test-suite. However, the process of manually generating catalogs of code modifications is intricate and time-consuming. This article proposes contextual error model repair (CEMR), an automated program repair tool for introductory programming assignments. CEMR is designed to learn program code modifications from incorrect–correct code pairs automatically. Then, it utilizes these code modifications along with CodeBERT, a generative AI, to repair students' new incorrect programs in the same programming assignment. CEMR builds on the observation that code edits performed by students in pairs of incorrect–correct code can be used as input–output examples for learning code modifications. The key idea of CEMR is to leverage the \u0000<italic>wisdom of the crowd</i>\u0000: it uses the existing code modifications of incorrect–correct student code pairs to repair the new incorrect student attempts. We chose three of the most related APR tools, Refazer, Refactory, and AlphaRepair, as the baselines to compare against CEMR. The experimental results demonstrate that, on public and real classroom datasets, CEMR achieves higher repair rates than the baselines. Through further analysis, CEMR has demonstrated promising effectiveness in addressing semantical and logical errors while its performance in fixing syntactical errors is limited. In terms of time for repairing buggy programs, CEMR costs approximately half as much as AlphaRepair requires. We opine that CEMR not only be seen as a program repair method that achieves good results with incorrect–correct code pairs but also be further utilized to generate hints to better assist students in learning programming.","PeriodicalId":49191,"journal":{"name":"IEEE Transactions on Learning Technologies","volume":"17 ","pages":"1745-1760"},"PeriodicalIF":3.7,"publicationDate":"2024-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141146079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and Evaluation of Trustworthy Knowledge Tracing Model for Intelligent Tutoring System 智能辅导系统可信知识追踪模型的设计与评估
IF 3.7 3区 教育学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-03-20 DOI: 10.1109/TLT.2024.3403135
Yu Lu;Deliang Wang;Penghe Chen;Zhi Zhang
Amid the rapid evolution of artificial intelligence (AI), the intricate model structures and opaque decision-making processes of AI-based systems have raised the trustworthy issues in education. We, therefore, first propose a novel three-layer knowledge tracing model designed to address trustworthiness for an intelligent tutoring system. Each layer is crafted to tackle a specific challenge: transparency, explainability, and accountability. We have introduced an explainable AI (xAI) approach to offer technical interpreting information, validated by the established educational theories and principles. The validated interpreting information is subsequently transitioned from its technical context into educational insights, which are then incorporated into the newly designed user interface. Our evaluations indicate that an intelligent tutoring system, when equipped with the designed trustworthy knowledge tracing model, significantly enhances user trust and knowledge from the perspectives of both teachers and students. This study, thus, contributes a tangible solution that utilizes the xAI approach as the enabling technology to construct trustworthy systems or tools in education.
在人工智能(AI)飞速发展的过程中,基于 AI 的系统错综复杂的模型结构和不透明的决策过程引发了教育领域的可信性问题。因此,我们首先提出了一个新颖的三层知识追踪模型,旨在解决智能辅导系统的可信性问题。每一层的设计都是为了应对一个特定的挑战:透明度、可解释性和问责制。我们引入了一种可解释的人工智能(xAI)方法来提供技术解释信息,并通过既定的教育理论和原则进行验证。经过验证的解释信息随后会从其技术背景转换为教育见解,然后将其纳入新设计的用户界面。我们的评估结果表明,当智能辅导系统配备所设计的可信知识追踪模型时,从教师和学生的角度来看,都能显著增强用户的信任度和知识水平。因此,这项研究提供了一个切实可行的解决方案,利用 xAI 方法作为构建教育可信系统或工具的使能技术。
{"title":"Design and Evaluation of Trustworthy Knowledge Tracing Model for Intelligent Tutoring System","authors":"Yu Lu;Deliang Wang;Penghe Chen;Zhi Zhang","doi":"10.1109/TLT.2024.3403135","DOIUrl":"10.1109/TLT.2024.3403135","url":null,"abstract":"Amid the rapid evolution of artificial intelligence (AI), the intricate model structures and opaque decision-making processes of AI-based systems have raised the trustworthy issues in education. We, therefore, first propose a novel three-layer knowledge tracing model designed to address trustworthiness for an intelligent tutoring system. Each layer is crafted to tackle a specific challenge: transparency, explainability, and accountability. We have introduced an explainable AI (xAI) approach to offer technical interpreting information, validated by the established educational theories and principles. The validated interpreting information is subsequently transitioned from its technical context into educational insights, which are then incorporated into the newly designed user interface. Our evaluations indicate that an intelligent tutoring system, when equipped with the designed trustworthy knowledge tracing model, significantly enhances user trust and knowledge from the perspectives of both teachers and students. This study, thus, contributes a tangible solution that utilizes the xAI approach as the enabling technology to construct trustworthy systems or tools in education.","PeriodicalId":49191,"journal":{"name":"IEEE Transactions on Learning Technologies","volume":"17 ","pages":"1701-1716"},"PeriodicalIF":3.7,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141146168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Write-Curate-Verify: A Case Study of Leveraging Generative AI for Scenario Writing in Scenario-Based Learning 编写-评测-验证:在情景式学习中利用生成式人工智能进行情景写作的案例研究
IF 3.7 3区 教育学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-03-18 DOI: 10.1109/TLT.2024.3378306
Shurui Bai;Donn Emmanuel Gonda;Khe Foon Hew
This case study explored the use of generative artificial intelligence (GenAI), specifically chat generative pretraining transformer (ChatGPT), in writing scenarios for scenario-based learning (SBL). Our research addressed three key questions: 1) how do teachers leverage GenAI to write scenarios for SBL purposes? 2) what is the quality of GenAI-generated SBL scenarios and tasks? and 3) how does GenAI-supported SBL affect students’ motivation, learning performance, and learning perceptions? A three-step prompting engineering process (write the prompts, curate the output, and verify the output, WCV) was established during the teacher interaction with GenAI in the scenario writing. Findings revealed that by using the WCV approach, ChatGPT enabled the efficient creation of quality scenarios for SBL purposes in a short timeframe. Moreover, students exhibited increased intrinsic motivation, learning performance, and positive attitudes toward GenAI-supported scenarios. We also suggest guidelines for using the WCV prompt engineering process in scenario writing.
本案例研究探讨了生成式人工智能(GenAI),特别是聊天生成式预训练转换器(ChatGPT)在情景式学习(SBL)情景编写中的应用。我们的研究解决了三个关键问题:1) 教师如何利用 GenAI 为 SBL 编写情景?2)GenAI 生成的 SBL 情景和任务的质量如何? 3)GenAI 支持的 SBL 如何影响学生的学习动机、学习成绩和学习感知?在教师与 GenAI 的交互过程中,建立了三步提示工程流程(编写提示、策划输出、验证输出,WCV)。研究结果表明,通过使用 WCV 方法,ChatGPT 能够在短时间内为 SBL 目的高效创建高质量的情景。此外,学生对 GenAI 支持的情景模式表现出更高的内在动力、学习成绩和积极态度。我们还提出了在情景写作中使用 WCV 提示工程流程的指导原则。
{"title":"Write-Curate-Verify: A Case Study of Leveraging Generative AI for Scenario Writing in Scenario-Based Learning","authors":"Shurui Bai;Donn Emmanuel Gonda;Khe Foon Hew","doi":"10.1109/TLT.2024.3378306","DOIUrl":"10.1109/TLT.2024.3378306","url":null,"abstract":"This case study explored the use of generative artificial intelligence (GenAI), specifically chat generative pretraining transformer (ChatGPT), in writing scenarios for scenario-based learning (SBL). Our research addressed three key questions: 1) how do teachers leverage GenAI to write scenarios for SBL purposes? 2) what is the quality of GenAI-generated SBL scenarios and tasks? and 3) how does GenAI-supported SBL affect students’ motivation, learning performance, and learning perceptions? A three-step prompting engineering process (write the prompts, curate the output, and verify the output, WCV) was established during the teacher interaction with GenAI in the scenario writing. Findings revealed that by using the WCV approach, ChatGPT enabled the efficient creation of quality scenarios for SBL purposes in a short timeframe. Moreover, students exhibited increased intrinsic motivation, learning performance, and positive attitudes toward GenAI-supported scenarios. We also suggest guidelines for using the WCV prompt engineering process in scenario writing.","PeriodicalId":49191,"journal":{"name":"IEEE Transactions on Learning Technologies","volume":"17 ","pages":"1313-1324"},"PeriodicalIF":3.7,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140170034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic Generation of Multimedia Teaching Materials Based on Generative AI: Taking Tang Poetry as an Example 基于生成式人工智能的多媒体教学材料自动生成--以唐诗为例
IF 3.7 3区 教育学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-03-18 DOI: 10.1109/TLT.2024.3378279
Xu Chen;Di Wu
Generative artificial intelligence (AI) is widely recognized as one of the most influential technologies for the future, having sparked a paradigm shift in scientific research. The field of education has also been greatly impacted by this transformative technology, with researchers exploring the applications of generative AI, particularly ChatGPT, in education. However, existing research primarily focuses on generating text from text, and there remains a relative scarcity of studies on leveraging multimodal generation capabilities to address key challenges in multimodal data supported instruction. In this article, we present a technical framework for generating Tang poetry situational videos, emphasizing the utilization of generative AI to address the need for multimedia teaching resources. Our framework comprises three main modules: textual situational comprehension, image creation, and video generation. Moreover, we have developed a situational video generation system that incorporates various technologies, including text-to-text generation models, text-to-image generation models, image interpolation, text-to-speech synthesis, and video synthesis. To ascertain the efficacy of the modules within the Tang poetry situational video generation system, we undertook a comparative analysis utilizing the prevalent text-to-image and text-to-video generation models. The empirical findings indicate that our approach is capable of generating images that exhibit greater semantic similarity with the poems, thereby enabling a better comprehension of the poem's connotations and its key components. Concurrently, the Tang poetry videos generated can significantly contribute to the reduction of cognitive load and the enhancement of understanding during the learning process. Our research showcases the potential of generative AI in the education field, specifically in the domain of multimodal teaching resources.
生成式人工智能(AI)被公认为是未来最具影响力的技术之一,引发了科学研究的范式转变。教育领域也受到了这一变革性技术的巨大影响,研究人员正在探索生成式人工智能,特别是 ChatGPT 在教育领域的应用。然而,现有的研究主要集中于从文本生成文本,利用多模态生成能力解决多模态数据支持教学中的关键挑战的研究仍然相对匮乏。在本文中,我们提出了一个生成唐诗情境视频的技术框架,强调利用生成式人工智能来满足多媒体教学资源的需求。我们的框架包括三个主要模块:文本情景理解、图像创建和视频生成。此外,我们还开发了一个情景视频生成系统,该系统融合了多种技术,包括文本到文本生成模型、文本到图像生成模型、图像插值、文本到语音合成和视频合成。为了确定唐诗情境视频生成系统中各模块的功效,我们利用流行的文本到图像生成模型和文本到视频生成模型进行了比较分析。实证结果表明,我们的方法能够生成与诗歌语义相似度更高的图像,从而能够更好地理解诗歌的内涵及其关键组成部分。同时,生成的唐诗视频还能在学习过程中大大减轻认知负担,增强理解能力。我们的研究展示了生成式人工智能在教育领域,特别是多模态教学资源领域的潜力。
{"title":"Automatic Generation of Multimedia Teaching Materials Based on Generative AI: Taking Tang Poetry as an Example","authors":"Xu Chen;Di Wu","doi":"10.1109/TLT.2024.3378279","DOIUrl":"10.1109/TLT.2024.3378279","url":null,"abstract":"Generative artificial intelligence (AI) is widely recognized as one of the most influential technologies for the future, having sparked a paradigm shift in scientific research. The field of education has also been greatly impacted by this transformative technology, with researchers exploring the applications of generative AI, particularly ChatGPT, in education. However, existing research primarily focuses on generating text from text, and there remains a relative scarcity of studies on leveraging multimodal generation capabilities to address key challenges in multimodal data supported instruction. In this article, we present a technical framework for generating Tang poetry situational videos, emphasizing the utilization of generative AI to address the need for multimedia teaching resources. Our framework comprises three main modules: textual situational comprehension, image creation, and video generation. Moreover, we have developed a situational video generation system that incorporates various technologies, including text-to-text generation models, text-to-image generation models, image interpolation, text-to-speech synthesis, and video synthesis. To ascertain the efficacy of the modules within the Tang poetry situational video generation system, we undertook a comparative analysis utilizing the prevalent text-to-image and text-to-video generation models. The empirical findings indicate that our approach is capable of generating images that exhibit greater semantic similarity with the poems, thereby enabling a better comprehension of the poem's connotations and its key components. Concurrently, the Tang poetry videos generated can significantly contribute to the reduction of cognitive load and the enhancement of understanding during the learning process. Our research showcases the potential of generative AI in the education field, specifically in the domain of multimodal teaching resources.","PeriodicalId":49191,"journal":{"name":"IEEE Transactions on Learning Technologies","volume":"17 ","pages":"1353-1366"},"PeriodicalIF":3.7,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140170039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging Multicriteria Integer Programming Optimization for Effective Team Formation 利用多标准整数编程优化有效组建团队
IF 2.9 3区 教育学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-03-16 DOI: 10.1109/TLT.2024.3401734
Pallavi Singh;Phat K. Huynh;Dang Nguyen;Trung Q. Le;Wilfrido Moreno
In organizational and academic settings, the strategic formation of teams is paramount, necessitating an approach that transcends conventional methodologies. This study introduces a novel application of multicriteria integer programming (MCIP), which simultaneously accommodates multiple criteria, thereby innovatively addressing the complex task of team formation. Unlike traditional single-objective optimization methods, our research designs a comprehensive framework capable of modeling a wide array of factors, including skill levels, backgrounds, and personality traits. The objective function of this framework is optimized to maximize within-team diversity while minimizing both conflict levels and variance in diversity between teams. Central to our approach is a two-stage optimization process. Initially, it segments the population into subgroups using a weighted heterogeneous multivariate K-means algorithm, allowing for a targeted and nuanced team assembly. This is followed by the application of a surrogate optimization technique within these subgroups, efficiently navigating the complexities of MCIP for large-scale applications. Our approach is further enhanced by the inclusion of explicit constraints such as potential interpersonal conflicts, a factor often overlooked in previous studies. The results from our study demonstrate the optimality and robustness of our model across simulation scenarios with different data heterogeneity levels. The contributions of this study are manifold, addressing critical gaps in the existing literature with a theory-backed, empirically validated framework for advanced team formation. Beyond theoretical implications, our work provides a practical guide for implementing conflict-aware, sophisticated team formation strategies in real-world scenarios. This advancement paves the way for future research to explore and enhance this model, providing more sophisticated and efficient team formation strategies.
{"title":"Leveraging Multicriteria Integer Programming Optimization for Effective Team Formation","authors":"Pallavi Singh;Phat K. Huynh;Dang Nguyen;Trung Q. Le;Wilfrido Moreno","doi":"10.1109/TLT.2024.3401734","DOIUrl":"10.1109/TLT.2024.3401734","url":null,"abstract":"In organizational and academic settings, the strategic formation of teams is paramount, necessitating an approach that transcends conventional methodologies. This study introduces a novel application of multicriteria integer programming (MCIP), which simultaneously accommodates multiple criteria, thereby innovatively addressing the complex task of team formation. Unlike traditional single-objective optimization methods, our research designs a comprehensive framework capable of modeling a wide array of factors, including skill levels, backgrounds, and personality traits. The objective function of this framework is optimized to maximize within-team diversity while minimizing both conflict levels and variance in diversity between teams. Central to our approach is a two-stage optimization process. Initially, it segments the population into subgroups using a weighted heterogeneous multivariate <italic>K</i>-means algorithm, allowing for a targeted and nuanced team assembly. This is followed by the application of a surrogate optimization technique within these subgroups, efficiently navigating the complexities of MCIP for large-scale applications. Our approach is further enhanced by the inclusion of explicit constraints such as potential interpersonal conflicts, a factor often overlooked in previous studies. The results from our study demonstrate the optimality and robustness of our model across simulation scenarios with different data heterogeneity levels. The contributions of this study are manifold, addressing critical gaps in the existing literature with a theory-backed, empirically validated framework for advanced team formation. Beyond theoretical implications, our work provides a practical guide for implementing conflict-aware, sophisticated team formation strategies in real-world scenarios. This advancement paves the way for future research to explore and enhance this model, providing more sophisticated and efficient team formation strategies.","PeriodicalId":49191,"journal":{"name":"IEEE Transactions on Learning Technologies","volume":"18 ","pages":"72-84"},"PeriodicalIF":2.9,"publicationDate":"2024-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141063023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using ChatGPT for Science Learning: A Study on Pre-service Teachers' Lesson Planning 使用 ChatGPT 学习科学:关于职前教师备课的研究
IF 3.7 3区 教育学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-03-15 DOI: 10.1109/TLT.2024.3401457
Gyeong-Geon Lee;Xiaoming Zhai
While ongoing efforts have continuously emphasized the integration of ChatGPT with science teaching and learning, there are limited empirical studies exploring its actual utility in the classroom. This study aims to fill this gap by analyzing the lesson plans developed by 29 pre-service elementary teachers and assessing how they integrated ChatGPT into science learning activities. We first examined how ChatGPT was integrated with the subject domains, teaching methods/strategies, and then evaluated the lesson plans using a generative artificial intelligence (AI)-technological pedagogical and content knowledge (TPACK)-based rubric. We further examined pre-service teachers' perceptions and concerns about integrating ChatGPT into science learning. Results show a diverse number of ChatGPT applications in different science domains—e.g., Biology (9/29), Chemistry (7/29), and Earth Science (7/29). A total of 14 types of teaching methods/strategies were identified in the lesson plans. On average, the pre-service teachers' lesson plans scored high on the modified TPACK-based rubric (M = 3.29; SD = 0.91; on a 1–4 scale), indicating a reasonable envisage of integrating ChatGPT into science learning, particularly in “instructional strategies and ChatGPT” (M = 3.48; SD = 0.99). However, they scored relatively lower on exploiting ChatGPT's functions toward its full potential (M = 3.00; SD = 0.93), compared to other aspects. We also identified several inappropriate use cases of ChatGPT in lesson planning (e.g., as a source of hallucinated Internet material and technically unsupported visual guidance). Pre-service teachers anticipated ChatGPT to afford high-quality questioning, self-directed learning, individualized learning support, and formative assessment. Meanwhile, they also expressed concerns about its accuracy and the risks that students may be overly dependent on ChatGPT. They further suggested solutions to systemizing classroom dynamics between teachers and students. The study underscores the need for more research on the roles of generative AI in actual classroom settings and provides insights for future AI-integrated science learning.
尽管人们一直在努力强调将 ChatGPT 与科学教学相结合,但探索其在课堂中实际效用的实证研究却十分有限。本研究旨在通过分析 29 位职前小学教师编写的教案,评估他们如何将 ChatGPT 整合到科学学习活动中,从而填补这一空白。我们首先考察了 ChatGPT 如何与学科领域、教学方法/策略相结合,然后使用基于生成人工智能(AI)-技术教学和内容知识(TPACK)的评分标准对教案进行了评估。我们进一步研究了职前教师对将 ChatGPT 整合到科学学习中的看法和担忧。结果显示,ChatGPT 在不同科学领域的应用数量各不相同,如生物(9/29)、化学(7/29)和地球科学(7/29)。教案中共确定了 14 种教学方法/策略。平均而言,职前教师的教案在基于 TPACK 的修改评分标准中得分较高(M = 3.29;SD = 0.91;1-4 分),表明他们对将 ChatGPT 整合到科学学习中有合理的设想,尤其是在 "教学策略和 ChatGPT "方面(M = 3.48;SD = 0.99)。然而,与其他方面相比,他们在充分发挥 ChatGPT 功能方面的得分相对较低(M = 3.00;SD = 0.93)。我们还发现了 ChatGPT 在备课中的一些不恰当使用情况(例如,作为幻觉网络材料的来源和无技术支持的视觉引导)。职前教师期待 ChatGPT 能够提供高质量的提问、自主学习、个性化学习支持和形成性评估。同时,他们也对 ChatGPT 的准确性以及学生可能过度依赖 ChatGPT 的风险表示担忧。他们进一步提出了将教师和学生之间的课堂动态系统化的解决方案。这项研究强调了对生成式人工智能在实际课堂环境中的作用进行更多研究的必要性,并为未来的人工智能整合科学学习提供了启示。
{"title":"Using ChatGPT for Science Learning: A Study on Pre-service Teachers' Lesson Planning","authors":"Gyeong-Geon Lee;Xiaoming Zhai","doi":"10.1109/TLT.2024.3401457","DOIUrl":"10.1109/TLT.2024.3401457","url":null,"abstract":"While ongoing efforts have continuously emphasized the integration of ChatGPT with science teaching and learning, there are limited empirical studies exploring its actual utility in the classroom. This study aims to fill this gap by analyzing the lesson plans developed by 29 pre-service elementary teachers and assessing how they integrated ChatGPT into science learning activities. We first examined how ChatGPT was integrated with the subject domains, teaching methods/strategies, and then evaluated the lesson plans using a generative artificial intelligence (AI)-technological pedagogical and content knowledge (TPACK)-based rubric. We further examined pre-service teachers' perceptions and concerns about integrating ChatGPT into science learning. Results show a diverse number of ChatGPT applications in different science domains—e.g., Biology (9/29), Chemistry (7/29), and Earth Science (7/29). A total of 14 types of teaching methods/strategies were identified in the lesson plans. On average, the pre-service teachers' lesson plans scored high on the modified TPACK-based rubric (M = 3.29; SD = 0.91; on a 1–4 scale), indicating a reasonable envisage of integrating ChatGPT into science learning, particularly in “instructional strategies and ChatGPT” (M = 3.48; SD = 0.99). However, they scored relatively lower on exploiting ChatGPT's functions toward its full potential (M = 3.00; SD = 0.93), compared to other aspects. We also identified several inappropriate use cases of ChatGPT in lesson planning (e.g., as a source of hallucinated Internet material and technically unsupported visual guidance). Pre-service teachers anticipated ChatGPT to afford high-quality questioning, self-directed learning, individualized learning support, and formative assessment. Meanwhile, they also expressed concerns about its accuracy and the risks that students may be overly dependent on ChatGPT. They further suggested solutions to systemizing classroom dynamics between teachers and students. The study underscores the need for more research on the roles of generative AI in actual classroom settings and provides insights for future AI-integrated science learning.","PeriodicalId":49191,"journal":{"name":"IEEE Transactions on Learning Technologies","volume":"17 ","pages":"1683-1700"},"PeriodicalIF":3.7,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141063211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Essay Scoring and Revising Based on Open-Source Large Language Models 基于开源大语言模型的论文自动评分和修改
IF 2.9 3区 教育学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-03-06 DOI: 10.1109/TLT.2024.3396873
Yishen Song;Qianta Zhu;Huaibo Wang;Qinhua Zheng
Manually scoring and revising student essays has long been a time-consuming task for educators. With the rise of natural language processing techniques, automated essay scoring (AES) and automated essay revising (AER) have emerged to alleviate this burden. However, current AES and AER models require large amounts of training data and lack generalizability, which makes them hard to implement in daily teaching activities. Moreover, online sites offering AES and AER services charge high fees and have security issues uploading student content. In light of these challenges and recognizing the advancements in large language models (LLMs), we aim to fill these research gaps by analyzing the performance of open-source LLMs when accomplishing AES and AER tasks. Using a human-scored essay dataset (n = 600) collected in an online assessment, we implemented zero-shot, few-shot, and p-tuning AES methods based on the LLMs and conducted a human–machine consistency check. We conducted a similarity test and a score difference test for the results of AER with LLMs support. The human–machine consistency check result shows that the performance of open-source LLMs with a 10 B parameter size in the AES task is close to that of some deep-learning baseline models, and it can be improved by integrating the comment with the score into the shot or training continuous prompts. The similarity test and score difference test results show that open-source LLMs can effectively accomplish the AER task, improving the quality of the essays while ensuring that the revision results are similar to the original essays. This study reveals a practical path to cost-effectively, time-efficiently, and content-safely assisting teachers with student essay scoring and revising using open-source LLMs.
长期以来,人工评分和修改学生作文一直是教育工作者的一项耗时任务。随着自然语言处理技术的兴起,自动作文评分(AES)和自动作文修改(AER)的出现减轻了这一负担。然而,目前的自动作文评分(AES)和自动作文修改(AER)模型需要大量的训练数据,而且缺乏通用性,因此很难在日常教学活动中实施。此外,提供 AES 和 AER 服务的在线网站收费高昂,而且上传学生内容存在安全问题。考虑到这些挑战以及大型语言模型(LLM)的进步,我们旨在通过分析开源 LLM 在完成 AES 和 AER 任务时的性能来填补这些研究空白。利用在线评估中收集的人工评分作文数据集(n = 600),我们在 LLMs 的基础上实施了零次、少量和 p 调整 AES 方法,并进行了人机一致性检查。我们对支持 LLMs 的 AER 结果进行了相似性测试和分数差异测试。人机一致性检验结果表明,参数大小为 10 B 的开源 LLMs 在 AES 任务中的表现接近于一些深度学习基线模型,并且可以通过将带有分数的注释集成到拍摄或训练连续提示中来提高性能。相似性测试和分数差异测试结果表明,开源 LLM 可以有效完成 AER 任务,在提高作文质量的同时确保修改结果与原始作文相似。这项研究揭示了一条切实可行的道路,即利用开源 LLM,以低成本、高效率、低时间成本和内容安全的方式协助教师进行学生作文评分和修改。
{"title":"Automated Essay Scoring and Revising Based on Open-Source Large Language Models","authors":"Yishen Song;Qianta Zhu;Huaibo Wang;Qinhua Zheng","doi":"10.1109/TLT.2024.3396873","DOIUrl":"10.1109/TLT.2024.3396873","url":null,"abstract":"Manually scoring and revising student essays has long been a time-consuming task for educators. With the rise of natural language processing techniques, automated essay scoring (AES) and automated essay revising (AER) have emerged to alleviate this burden. However, current AES and AER models require large amounts of training data and lack generalizability, which makes them hard to implement in daily teaching activities. Moreover, online sites offering AES and AER services charge high fees and have security issues uploading student content. In light of these challenges and recognizing the advancements in large language models (LLMs), we aim to fill these research gaps by analyzing the performance of open-source LLMs when accomplishing AES and AER tasks. Using a human-scored essay dataset (\u0000<italic>n</i>\u0000 = 600) collected in an online assessment, we implemented zero-shot, few-shot, and p-tuning AES methods based on the LLMs and conducted a human–machine consistency check. We conducted a similarity test and a score difference test for the results of AER with LLMs support. The human–machine consistency check result shows that the performance of open-source LLMs with a 10 B parameter size in the AES task is close to that of some deep-learning baseline models, and it can be improved by integrating the comment with the score into the shot or training continuous prompts. The similarity test and score difference test results show that open-source LLMs can effectively accomplish the AER task, improving the quality of the essays while ensuring that the revision results are similar to the original essays. This study reveals a practical path to cost-effectively, time-efficiently, and content-safely assisting teachers with student essay scoring and revising using open-source LLMs.","PeriodicalId":49191,"journal":{"name":"IEEE Transactions on Learning Technologies","volume":"17 ","pages":"1920-1930"},"PeriodicalIF":2.9,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140885774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Experimenting With Soft Robotics in Education: A Systematic Literature Review From 2006 to 2022 教育领域的软机器人实验:2006 年至 2022 年系统文献综述
IF 3.7 3区 教育学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-03-05 DOI: 10.1109/TLT.2024.3372894
Israel Ulises Cayetano-Jiménez;Erick Axel Martinez-Ríos;Rogelio Bustamante-Bello;Ricardo A. Ramírez-Mendoza;María Soledad Ramírez-Montoya
Educational robotics (ER) is a discipline of applied robotics focused on teaching robot design, analysis, application, and operation. Traditionally, ER has favored rigid robots, overlooking the potential of soft robots (SRs). While rigid robots offer insights into dynamics, kinematics, and control, they have limitations in exploring the depths of mechanical design and material properties. In this regard, SRs present an opportunity to expand educational topics and activities in robotics through their unique bioinspired properties and accessibility. Despite their promise, there is a notable lack of research on SRs as educational tools, limiting the identification of research avenues that could promote their adoption in educational settings. This study conducts a systematic literature review to elucidate the impact of SRs across academic levels, pedagogical strategies, prevalent artificial muscles, educational activities, and assessment methods. The findings indicate a significant focus on K-12 workshops utilizing soft pneumatic actuators. Furthermore, SRs have fostered the development of fabrication and mechanical design skills beyond mere programming tasks. However, there is a shortage of studies analyzing their use in higher education or their impact on learning outcomes, suggesting a critical need for comprehensive evaluations to determine their effectiveness, rather than solely relying on surveys for student feedback. Thus, there is an opportunity to explore and evaluate the use of SRs in more advanced settings and multidisciplinary activities, urging for rigorous assessments of their influence on learning outcomes. By undertaking this, we aim to provide a foundation for integrating SRs into the ER curriculum, potentially transforming teaching methodologies and enriching students' learning experiences.
教育机器人学(ER)是应用机器人学的一门学科,侧重于机器人设计、分析、应用和操作的教学。传统上,教育机器人学偏爱刚性机器人,忽视了软体机器人(SR)的潜力。虽然刚性机器人能提供动力学、运动学和控制方面的见解,但在探索机械设计和材料特性的深度方面却有局限性。在这方面,软体机器人通过其独特的生物启发特性和易用性,为拓展机器人学的教育主题和活动提供了机会。尽管SRs大有可为,但有关SRs作为教育工具的研究却明显不足,这就限制了研究途径的确定,从而无法促进SRs在教育环境中的应用。本研究进行了系统的文献综述,以阐明人造肌肉对不同学术水平、教学策略、流行的人造肌肉、教育活动和评估方法的影响。研究结果表明,利用软气动致动器的 K-12 研讨班受到了极大关注。此外,除了单纯的编程任务外,SR 还促进了制造和机械设计技能的发展。然而,缺乏对其在高等教育中的使用或对学习成果的影响进行分析的研究,这表明迫切需要进行全面评估,以确定其有效性,而不是仅仅依靠调查来获得学生的反馈。因此,我们有机会探索和评估在更高级的环境和多学科活动中使用员工代表的情况,敦促严格评估其对学习成果的影响。通过开展这项工作,我们旨在为将员工代表纳入企业资源规划课程奠定基础,从而有可能改变教学方法,丰富学生的学习体验。
{"title":"Experimenting With Soft Robotics in Education: A Systematic Literature Review From 2006 to 2022","authors":"Israel Ulises Cayetano-Jiménez;Erick Axel Martinez-Ríos;Rogelio Bustamante-Bello;Ricardo A. Ramírez-Mendoza;María Soledad Ramírez-Montoya","doi":"10.1109/TLT.2024.3372894","DOIUrl":"10.1109/TLT.2024.3372894","url":null,"abstract":"Educational robotics (ER) is a discipline of applied robotics focused on teaching robot design, analysis, application, and operation. Traditionally, ER has favored rigid robots, overlooking the potential of soft robots (SRs). While rigid robots offer insights into dynamics, kinematics, and control, they have limitations in exploring the depths of mechanical design and material properties. In this regard, SRs present an opportunity to expand educational topics and activities in robotics through their unique bioinspired properties and accessibility. Despite their promise, there is a notable lack of research on SRs as educational tools, limiting the identification of research avenues that could promote their adoption in educational settings. This study conducts a systematic literature review to elucidate the impact of SRs across academic levels, pedagogical strategies, prevalent artificial muscles, educational activities, and assessment methods. The findings indicate a significant focus on K-12 workshops utilizing soft pneumatic actuators. Furthermore, SRs have fostered the development of fabrication and mechanical design skills beyond mere programming tasks. However, there is a shortage of studies analyzing their use in higher education or their impact on learning outcomes, suggesting a critical need for comprehensive evaluations to determine their effectiveness, rather than solely relying on surveys for student feedback. Thus, there is an opportunity to explore and evaluate the use of SRs in more advanced settings and multidisciplinary activities, urging for rigorous assessments of their influence on learning outcomes. By undertaking this, we aim to provide a foundation for integrating SRs into the ER curriculum, potentially transforming teaching methodologies and enriching students' learning experiences.","PeriodicalId":49191,"journal":{"name":"IEEE Transactions on Learning Technologies","volume":"17 ","pages":"1261-1278"},"PeriodicalIF":3.7,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10460415","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140045730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Medical Training Through Learning From Mistakes by Interacting With an Ill-Trained Reinforcement Learning Agent 通过与未经训练的强化学习代理互动,从错误中学习,从而加强医学培训
IF 3.7 3区 教育学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-03-04 DOI: 10.1109/TLT.2024.3372508
Yasar C. Kakdas;Sinan Kockara;Tansel Halic;Doga Demirel
This article presents a 3-D medical simulation that employs reinforcement learning (RL) and interactive RL (IRL) to teach and assess the procedure of donning and doffing personal protective equipment (PPE). The simulation is motivated by the need for effective, safe, and remote training techniques in medicine, particularly in light of the COVID-19 pandemic. The simulation has two modes: a tutorial mode and an assessment mode. In the tutorial mode, a computer-based, ill-trained RL agent utilizes RL to learn the correct sequence of donning the PPE by trial and error. This allows students to experience many outlier cases they might not encounter in an in-class educational model. In the assessment mode, an IRL-based method is used to evaluate how effective the participant is at correcting the mistakes performed by the RL agent. Each time the RL agent interacts with the environment and performs an action, the participants provide positive or negative feedback regarding the action taken. Following the assessment, participants receive a score based on the accuracy of their feedback and the time taken for the RL agent to learn the correct sequence. An experiment was conducted using two groups, each consisting of ten participants. The first group received RL-assisted training for donning PPE, followed by an IRL-based assessment. Meanwhile, the second group observed a video featuring the RL agent demonstrating only the correct donning order without outlier cases, replicating traditional training, before undergoing the same assessment as the first group. Results showed that RL-assisted training with many outlier cases was more effective than traditional training with only regular cases. Moreover, combining RL with IRL significantly enhanced the participants' performance. Notably, 90% of the participants finished the assessment with perfect scores within three iterations. In contrast, only 10% of those who did not engage in RL-assisted training finished the assessment with a perfect score, highlighting the substantial impact of RL and IRL integration on participants’ overall achievement.
本文介绍了一种三维医学模拟,它采用强化学习(RL)和交互式 RL(IRL)来教授和评估穿脱个人防护设备(PPE)的程序。该模拟的动机是医学领域对有效、安全和远程培训技术的需求,尤其是在 COVID-19 大流行的情况下。模拟有两种模式:辅导模式和评估模式。在辅导模式中,一个基于计算机、训练有素的 RL 代理利用 RL,通过不断尝试和出错来学习穿戴个人防护设备的正确顺序。这样,学生就能体验到许多在课堂教育模式中可能不会遇到的异常情况。在评估模式中,使用基于 IRL 的方法来评估学员纠正 RL 代理所犯错误的效率。每当 RL 代理与环境交互并执行一项操作时,参与者都会就所执行的操作提供积极或消极的反馈。评估结束后,参与者会根据其反馈的准确性和 RL 代理学习正确序列所需的时间得到一个分数。实验分两组进行,每组有十名参与者。第一组接受穿戴个人防护设备的 RL 辅助培训,然后进行基于 IRL 的评估。与此同时,第二组在接受与第一组相同的评估之前,观看了一段视频,视频中的 RL 代理只演示了正确的穿戴顺序,而没有离群情况,这与传统的培训相同。结果表明,与仅使用常规案例的传统训练相比,使用大量离群案例的 RL 辅助训练更为有效。此外,将 RL 与 IRL 相结合还能显著提高学员的成绩。值得注意的是,90% 的学员在三次迭代中以满分完成了评估。相比之下,只有 10% 没有参加 RL 辅助训练的学员能以满分完成评估,这凸显了 RL 与 IRL 的结合对学员整体成绩的重大影响。
{"title":"Enhancing Medical Training Through Learning From Mistakes by Interacting With an Ill-Trained Reinforcement Learning Agent","authors":"Yasar C. Kakdas;Sinan Kockara;Tansel Halic;Doga Demirel","doi":"10.1109/TLT.2024.3372508","DOIUrl":"10.1109/TLT.2024.3372508","url":null,"abstract":"This article presents a 3-D medical simulation that employs reinforcement learning (RL) and interactive RL (IRL) to teach and assess the procedure of donning and doffing personal protective equipment (PPE). The simulation is motivated by the need for effective, safe, and remote training techniques in medicine, particularly in light of the COVID-19 pandemic. The simulation has two modes: a tutorial mode and an assessment mode. In the tutorial mode, a computer-based, ill-trained RL agent utilizes RL to learn the correct sequence of donning the PPE by trial and error. This allows students to experience many outlier cases they might not encounter in an in-class educational model. In the assessment mode, an IRL-based method is used to evaluate how effective the participant is at correcting the mistakes performed by the RL agent. Each time the RL agent interacts with the environment and performs an action, the participants provide positive or negative feedback regarding the action taken. Following the assessment, participants receive a score based on the accuracy of their feedback and the time taken for the RL agent to learn the correct sequence. An experiment was conducted using two groups, each consisting of ten participants. The first group received RL-assisted training for donning PPE, followed by an IRL-based assessment. Meanwhile, the second group observed a video featuring the RL agent demonstrating only the correct donning order without outlier cases, replicating traditional training, before undergoing the same assessment as the first group. Results showed that RL-assisted training with many outlier cases was more effective than traditional training with only regular cases. Moreover, combining RL with IRL significantly enhanced the participants' performance. Notably, 90% of the participants finished the assessment with perfect scores within three iterations. In contrast, only 10% of those who did not engage in RL-assisted training finished the assessment with a perfect score, highlighting the substantial impact of RL and IRL integration on participants’ overall achievement.","PeriodicalId":49191,"journal":{"name":"IEEE Transactions on Learning Technologies","volume":"17 ","pages":"1248-1260"},"PeriodicalIF":3.7,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140037300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Learning Technologies
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1