Jin Li;Shu Li;Yuan Zhao;Longjiang Guo;Fei Hao;Meirui Ren;Keqin Li
{"title":"预测 MOOC 注册前的辍学情况:可解释的自监督模型","authors":"Jin Li;Shu Li;Yuan Zhao;Longjiang Guo;Fei Hao;Meirui Ren;Keqin Li","doi":"10.1109/TSC.2023.3311627","DOIUrl":null,"url":null,"abstract":"Massive Open Online Courses (MOOCs) belong to a new cloud-based service in education that suffers from low completion rates. Effective pre-learning intervention services, such as recommending courses with a high probability of completion or filtering courses with a very low probability of completion, will encourage students to spend more time and energy on proper courses, thus can reduce the dropout ratio. In practice, intervention services are introduced when students are predicted to drop out. However, existing methods concentrate on analyzing students’ learning actions and predicting final dropout after a period of enrollment, which are insufficient in preventing students from enrolling in unsuitable courses and withdrawing mid-way. This paper presents a neural network-based Explainable Self-supervised Model (ESM) to predict MOOC dropout before enrollment. Specifically, the student's learning actions on an unenrolled course are estimated using previous logs by the neural network. And then, the action's contribution to the completion of a course is calculated in a similar way. Therefore, the probability of completion for an unenrolled course is predicted by aggregating the learning actions and their contribution to the completion. To train the neural network, a self-supervised training strategy is proposed, where enrolled courses in the training data are randomly selected as validation in each epoch. The ESM outperforms existing methods in terms of prediction accuracy and efficiency. The average increment of Area Under the ROC Curve (AUC) and F-score (F1) in the two MOOCs datasets, XuetangX and KDDCUP, are 8.3% and 0.6%, respectively. Furthermore, the two pre-learning intervention services named courses recommendation and courses filtration are proposed. When courses are recommended, the completion rate increased from 22% to 60% in XuetangX, and from 27% to 45% in KDDCUP. By filtering courses predicted with low completion probability, 40% wasted time in uncompleted courses will be saved in XuetangX.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"16 6","pages":"4154-4167"},"PeriodicalIF":5.5000,"publicationDate":"2023-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting Dropouts Before Enrollments in MOOCs: An Explainable and Self-Supervised Model\",\"authors\":\"Jin Li;Shu Li;Yuan Zhao;Longjiang Guo;Fei Hao;Meirui Ren;Keqin Li\",\"doi\":\"10.1109/TSC.2023.3311627\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Massive Open Online Courses (MOOCs) belong to a new cloud-based service in education that suffers from low completion rates. Effective pre-learning intervention services, such as recommending courses with a high probability of completion or filtering courses with a very low probability of completion, will encourage students to spend more time and energy on proper courses, thus can reduce the dropout ratio. In practice, intervention services are introduced when students are predicted to drop out. However, existing methods concentrate on analyzing students’ learning actions and predicting final dropout after a period of enrollment, which are insufficient in preventing students from enrolling in unsuitable courses and withdrawing mid-way. This paper presents a neural network-based Explainable Self-supervised Model (ESM) to predict MOOC dropout before enrollment. Specifically, the student's learning actions on an unenrolled course are estimated using previous logs by the neural network. And then, the action's contribution to the completion of a course is calculated in a similar way. Therefore, the probability of completion for an unenrolled course is predicted by aggregating the learning actions and their contribution to the completion. To train the neural network, a self-supervised training strategy is proposed, where enrolled courses in the training data are randomly selected as validation in each epoch. The ESM outperforms existing methods in terms of prediction accuracy and efficiency. The average increment of Area Under the ROC Curve (AUC) and F-score (F1) in the two MOOCs datasets, XuetangX and KDDCUP, are 8.3% and 0.6%, respectively. Furthermore, the two pre-learning intervention services named courses recommendation and courses filtration are proposed. When courses are recommended, the completion rate increased from 22% to 60% in XuetangX, and from 27% to 45% in KDDCUP. By filtering courses predicted with low completion probability, 40% wasted time in uncompleted courses will be saved in XuetangX.\",\"PeriodicalId\":13255,\"journal\":{\"name\":\"IEEE Transactions on Services Computing\",\"volume\":\"16 6\",\"pages\":\"4154-4167\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2023-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Services Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10238815/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10238815/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
大规模在线开放课程(MOOCs)属于一种新的基于云的教育服务,其完成率很低。有效的学习前干预服务,如推荐完成概率高的课程或过滤完成概率极低的课程,可以鼓励学生将更多的时间和精力花在合适的课程上,从而降低辍学率。在实践中,当学生被预测要退学时,就会引入干预服务。然而,现有的方法侧重于分析学生的学习行为,预测学生在入学一段时间后的最终退学情况,不足以防止学生选修不合适的课程,中途退学。本文提出了一种基于神经网络的可解释自监督模型(ESM)来预测MOOC学生在入学前辍学。具体来说,神经网络使用之前的日志来估计学生在未注册课程上的学习行为。然后,动作对完成课程的贡献以类似的方式计算。因此,通过汇总学习行为及其对完成的贡献来预测未注册课程的完成概率。为了训练神经网络,提出了一种自监督训练策略,随机选择训练数据中的注册课程作为每个历元的验证。该方法在预测精度和效率方面都优于现有方法。在XuetangX和KDDCUP两个mooc数据集上,ROC曲线下面积(Area Under The ROC Curve, AUC)和F-score (F-score, F1)的平均增量分别为8.3%和0.6%。在此基础上,提出了课程推荐和课程过滤两种学习前干预服务。当课程被推荐时,学堂x的完成率从22%提高到60%,KDDCUP的完成率从27%提高到45%。通过对预测完成概率较低的课程进行筛选,学堂x可以节省40%的未完成课程浪费时间。
Predicting Dropouts Before Enrollments in MOOCs: An Explainable and Self-Supervised Model
Massive Open Online Courses (MOOCs) belong to a new cloud-based service in education that suffers from low completion rates. Effective pre-learning intervention services, such as recommending courses with a high probability of completion or filtering courses with a very low probability of completion, will encourage students to spend more time and energy on proper courses, thus can reduce the dropout ratio. In practice, intervention services are introduced when students are predicted to drop out. However, existing methods concentrate on analyzing students’ learning actions and predicting final dropout after a period of enrollment, which are insufficient in preventing students from enrolling in unsuitable courses and withdrawing mid-way. This paper presents a neural network-based Explainable Self-supervised Model (ESM) to predict MOOC dropout before enrollment. Specifically, the student's learning actions on an unenrolled course are estimated using previous logs by the neural network. And then, the action's contribution to the completion of a course is calculated in a similar way. Therefore, the probability of completion for an unenrolled course is predicted by aggregating the learning actions and their contribution to the completion. To train the neural network, a self-supervised training strategy is proposed, where enrolled courses in the training data are randomly selected as validation in each epoch. The ESM outperforms existing methods in terms of prediction accuracy and efficiency. The average increment of Area Under the ROC Curve (AUC) and F-score (F1) in the two MOOCs datasets, XuetangX and KDDCUP, are 8.3% and 0.6%, respectively. Furthermore, the two pre-learning intervention services named courses recommendation and courses filtration are proposed. When courses are recommended, the completion rate increased from 22% to 60% in XuetangX, and from 27% to 45% in KDDCUP. By filtering courses predicted with low completion probability, 40% wasted time in uncompleted courses will be saved in XuetangX.
期刊介绍:
IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.