Solving the imbalanced data issue: automatic urgency detection for instructor assistance in MOOC discussion forums

IF 3.5 3区计算机科学 Q2 COMPUTER SCIENCE, CYBERNETICS User Modeling and User-Adapted Interaction Pub Date : 2023-12-01 DOI:10.1007/s11257-023-09381-y

Laila Alrajhi, Ahmed Alamri, Filipe Dwan Pereira, Alexandra I. Cristea, Elaine H. T. Oliveira

{"title":"Solving the imbalanced data issue: automatic urgency detection for instructor assistance in MOOC discussion forums","authors":"Laila Alrajhi, Ahmed Alamri, Filipe Dwan Pereira, Alexandra I. Cristea, Elaine H. T. Oliveira","doi":"10.1007/s11257-023-09381-y","DOIUrl":null,"url":null,"abstract":"In MOOCs, identifying urgent comments on discussion forums is an ongoing challenge. Whilst urgent comments require immediate reactions from instructors, to improve interaction with their learners, and potentially reducing drop-out rates—the task is difficult, as truly urgent comments are rare. From a data analytics perspective, this represents a highly unbalanced (sparse) dataset. Here, we aim to automate the urgent comments identification process, based on fine-grained learner modelling—to be used for automatic recommendations to instructors. To showcase and compare these models, we apply them to the first gold standard dataset for Urgent iNstructor InTErvention (UNITE), which we created by labelling FutureLearn MOOC data. We implement both benchmark shallow classifiers and deep learning. Importantly, we not only compare, for the first time for the unbalanced problem, several data balancing techniques, comprising text augmentation, text augmentation with undersampling, and undersampling, but also propose several new pipelines for combining different augmenters for text augmentation. Results show that models with undersampling can predict most urgent cases; and 3X augmentation + undersampling usually attains the best performance. We additionally validate the best models via a generic benchmark dataset (Stanford). As a case study, we showcase how the naïve Bayes with count vector can adaptively support instructors in answering learner questions/comments, potentially saving time or increasing efficiency in supporting learners. Finally, we show that the errors from the classifier mirrors the disagreements between annotators. Thus, our proposed algorithms perform at least as well as a ‘super-diligent’ human instructor (with the time to consider all comments).","PeriodicalId":49388,"journal":{"name":"User Modeling and User-Adapted Interaction","volume":"202 1","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"User Modeling and User-Adapted Interaction","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11257-023-09381-y","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, CYBERNETICS","Score":null,"Total":0}

引用次数: 0

Abstract

In MOOCs, identifying urgent comments on discussion forums is an ongoing challenge. Whilst urgent comments require immediate reactions from instructors, to improve interaction with their learners, and potentially reducing drop-out rates—the task is difficult, as truly urgent comments are rare. From a data analytics perspective, this represents a highly unbalanced (sparse) dataset. Here, we aim to automate the urgent comments identification process, based on fine-grained learner modelling—to be used for automatic recommendations to instructors. To showcase and compare these models, we apply them to the first gold standard dataset for Urgent iNstructor InTErvention (UNITE), which we created by labelling FutureLearn MOOC data. We implement both benchmark shallow classifiers and deep learning. Importantly, we not only compare, for the first time for the unbalanced problem, several data balancing techniques, comprising text augmentation, text augmentation with undersampling, and undersampling, but also propose several new pipelines for combining different augmenters for text augmentation. Results show that models with undersampling can predict most urgent cases; and 3X augmentation + undersampling usually attains the best performance. We additionally validate the best models via a generic benchmark dataset (Stanford). As a case study, we showcase how the naïve Bayes with count vector can adaptively support instructors in answering learner questions/comments, potentially saving time or increasing efficiency in supporting learners. Finally, we show that the errors from the classifier mirrors the disagreements between annotators. Thus, our proposed algorithms perform at least as well as a ‘super-diligent’ human instructor (with the time to consider all comments).

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

解决数据不平衡问题:MOOC论坛讲师协助的自动紧急检测

在mooc中，识别论坛上的紧急评论是一项持续的挑战。虽然紧急评论需要教师立即做出反应，以改善与学习者的互动，并潜在地减少退学率，但这项任务很困难，因为真正紧急的评论很少。从数据分析的角度来看，这代表了一个高度不平衡(稀疏)的数据集。在这里，我们的目标是基于细粒度学习者建模，自动化紧急评论识别过程，用于向教师自动推荐。为了展示和比较这些模型，我们将它们应用于紧急讲师干预(UNITE)的第一个金标准数据集，该数据集是我们通过标记FutureLearn MOOC数据创建的。我们实现了基准浅分类器和深度学习。重要的是，对于不平衡问题，我们不仅首次比较了几种数据平衡技术，包括文本增强、欠采样文本增强和欠采样文本增强，而且还提出了几种新的管道，用于组合不同的增强器进行文本增强。结果表明，欠采样模型可以预测大多数紧急情况;3倍增强+欠采样通常可以获得最佳性能。我们还通过通用基准数据集(Stanford)验证了最佳模型。作为一个案例研究，我们展示了naïve带计数向量的贝叶斯如何自适应地支持教师回答学习者的问题/评论，潜在地节省时间或提高支持学习者的效率。最后，我们证明了来自分类器的错误反映了注释器之间的分歧。因此，我们提出的算法的表现至少与“超级勤奋”的人类讲师一样好(有时间考虑所有评论)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

User Modeling and User-Adapted Interaction 工程技术-计算机：控制论

CiteScore

8.90

自引率

8.30%

发文量

审稿时长

>12 weeks

期刊介绍： User Modeling and User-Adapted Interaction provides an interdisciplinary forum for the dissemination of novel and significant original research results about interactive computer systems that can adapt themselves to their users, and on the design, use, and evaluation of user models for adaptation. The journal publishes high-quality original papers from, e.g., the following areas: acquisition and formal representation of user models; conceptual models and user stereotypes for personalization; student modeling and adaptive learning; models of groups of users; user model driven personalised information discovery and retrieval; recommender systems; adaptive user interfaces and agents; adaptation for accessibility and inclusion; generic user modeling systems and tools; interoperability of user models; personalization in areas such as; affective computing; ubiquitous and mobile computing; language based interactions; multi-modal interactions; virtual and augmented reality; social media and the Web; human-robot interaction; behaviour change interventions; personalized applications in specific domains; privacy, accountability, and security of information for personalization; responsible adaptation: fairness, accountability, explainability, transparency and control; methods for the design and evaluation of user models and adaptive systems