A data mining approach using machine learning algorithms for early detection of low-performing students

IF 2.4 Q3 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS International Journal of Information and Learning Technology Pub Date : 2022-02-24 DOI:10.1108/ijilt-09-2021-0144
E. Khor
{"title":"A data mining approach using machine learning algorithms for early detection of low-performing students","authors":"E. Khor","doi":"10.1108/ijilt-09-2021-0144","DOIUrl":null,"url":null,"abstract":"PurposeThe purpose of the study is to build predictive models for early detection of low-performing students and examine the factors that influence massive open online courses students' performance.Design/methodology/approachFor the first step, the author performed exploratory data analysis to analyze the dataset. The process was then followed by data pre-processing and feature engineering (Step 2). Next, the author conducted data modelling and prediction (Step 3). Finally, the performance of the developed models was evaluated (Step 4).FindingsThe paper found that the decision trees algorithm outperformed other machine earning algorithms. The study also confirms the significant effect of the academic background and virtual learning environment (VLE) interactions feature categories to academic performance. The accuracy enhancement is 17.66% for decision trees classifier, 3.49% for logistic regression classifier and 4.89% for neural networks classifier. Based on the results of CorrelationAttributeEval technique with the use of a ranker search method, the author found that the assessment_score and sum_click features are more important among academic background and VLE interactions feature categories for the classification analysis in predicting students' academic performance.Originality/valueThe work meets the originality requirement.","PeriodicalId":51872,"journal":{"name":"International Journal of Information and Learning Technology","volume":null,"pages":null},"PeriodicalIF":2.4000,"publicationDate":"2022-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Information and Learning Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/ijilt-09-2021-0144","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 3

Abstract

PurposeThe purpose of the study is to build predictive models for early detection of low-performing students and examine the factors that influence massive open online courses students' performance.Design/methodology/approachFor the first step, the author performed exploratory data analysis to analyze the dataset. The process was then followed by data pre-processing and feature engineering (Step 2). Next, the author conducted data modelling and prediction (Step 3). Finally, the performance of the developed models was evaluated (Step 4).FindingsThe paper found that the decision trees algorithm outperformed other machine earning algorithms. The study also confirms the significant effect of the academic background and virtual learning environment (VLE) interactions feature categories to academic performance. The accuracy enhancement is 17.66% for decision trees classifier, 3.49% for logistic regression classifier and 4.89% for neural networks classifier. Based on the results of CorrelationAttributeEval technique with the use of a ranker search method, the author found that the assessment_score and sum_click features are more important among academic background and VLE interactions feature categories for the classification analysis in predicting students' academic performance.Originality/valueThe work meets the originality requirement.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种使用机器学习算法的数据挖掘方法,用于早期检测表现不佳的学生
目的本研究旨在建立早期发现成绩不佳学生的预测模型,并考察影响大规模开放在线课程学生成绩的因素。设计/方法论/方法第一步,作者进行了探索性数据分析来分析数据集。该过程之后是数据预处理和特征工程(步骤2)。接下来,作者进行了数据建模和预测(步骤3)。最后,对所开发的模型的性能进行了评估(步骤4)。发现决策树算法优于其他机器学习算法。研究还证实了学术背景和虚拟学习环境(VLE)互动特征类别对学习成绩的显著影响。决策树分类器、逻辑回归分类器和神经网络分类器的准确率分别提高了17.66%、3.49%和4.89%。基于CorrelationAttributeEval技术和ranker搜索方法的结果,作者发现在预测学生学习成绩的分类分析中,评估_核心和sum_点击特征在学术背景和VLE交互特征类别中更为重要。独创性/价值这件作品符合独创性的要求。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International Journal of Information and Learning Technology
International Journal of Information and Learning Technology COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-
CiteScore
6.10
自引率
3.30%
发文量
33
期刊介绍: International Journal of Information and Learning Technology (IJILT) provides a forum for the sharing of the latest theories, applications, and services related to planning, developing, managing, using, and evaluating information technologies in administrative, academic, and library computing, as well as other educational technologies. Submissions can include research: -Illustrating and critiquing educational technologies -New uses of technology in education -Issue-or results-focused case studies detailing examples of technology applications in higher education -In-depth analyses of the latest theories, applications and services in the field The journal provides wide-ranging and independent coverage of the management, use and integration of information resources and learning technologies.
期刊最新文献
Development of an Automated Hall Effect Experimentation Method for the Electrical Characterization of Thin Films Deteksi Tingkat Kematangan Buah Pinang Menggunakan Metode Support Vector Machine Berdasarkan Warna Dan Tekstur Analisis Kinerja Mikrokomputer Raspberry Pi Pada Smart Greenhouse Berbasis Internet Of Things (IoT) Menggunakan Algoritma Naive Baye SISTEM PENDUKUNG KEPUTUSAN PENENTUAN GURU BERPRESTASI MENGGUNAKAN METODE TOPSIS (STUDI KASUS: DINAS PPO KAB. TTU) Analisis Kepuasan Pengguna Terhadap Penerapan Sistem Informasi Terpadu Layanan Prodi (SIPLO) Menggunakan End User Computing Satisfaction (EUCS)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1