Implementation of Data Mining to Predict Student Study Period with Decision Tree Algorithm (C4.5)

Kirana Alyssa Putri, Dimas Febriawan, Firman Noor Hasan
{"title":"Implementation of Data Mining to Predict Student Study Period with Decision Tree Algorithm (C4.5)","authors":"Kirana Alyssa Putri, Dimas Febriawan, Firman Noor Hasan","doi":"10.32736/sisfokom.v13i1.1943","DOIUrl":null,"url":null,"abstract":"Graduating on time is what every student wants to accomplish in college. Students of Prof. Dr. Hamka Muhammadiyah University are one of those who have this dream. Based on 2020 graduates data from the Tracer Study, 60% said the university had a high enough impact  on improving competence.  This data indicates that university needs to evaluate improvement of academic quality. Often, students have difficulty finding information about important factors that support achieving timely graduation. A prediction analysis is needed to provide information about the student's graduation study period. For this analysis, data mining is implemented using the classification function of the decision tree (C4.5) algorithm with RapidMiner tools. The methodology for implementing data mining follows the stages of Knowledge Discovery In Database (KDD), beginning with data collection, preprocessing, transformation, data mining, and evaluation. The research findings consist of visualization and decision tree rules that reveal GPA as the most influential factor in determining a student's study period.There is other information, namely, students graduated on time (less than equal to 4 years) amounted to 170 or 54.5% and students did not graduate on time (more than 4 years) amounted to 142 or 45.6%. Testing the performance of decision tree (C4.5) utilizing confusion matrix through RapidMiner tools, resulted in accuracy reaching 83.87%, with precision of 87.50% and recall of 91.18%. Provides evidence that the decision tree algorithm (C4.5) has optimal performance to provide valuable information about predicting student graduation in order to increase student enrollment with the right study period.","PeriodicalId":517030,"journal":{"name":"Jurnal Sisfokom (Sistem Informasi dan Komputer)","volume":"68 35","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Sisfokom (Sistem Informasi dan Komputer)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32736/sisfokom.v13i1.1943","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Graduating on time is what every student wants to accomplish in college. Students of Prof. Dr. Hamka Muhammadiyah University are one of those who have this dream. Based on 2020 graduates data from the Tracer Study, 60% said the university had a high enough impact  on improving competence.  This data indicates that university needs to evaluate improvement of academic quality. Often, students have difficulty finding information about important factors that support achieving timely graduation. A prediction analysis is needed to provide information about the student's graduation study period. For this analysis, data mining is implemented using the classification function of the decision tree (C4.5) algorithm with RapidMiner tools. The methodology for implementing data mining follows the stages of Knowledge Discovery In Database (KDD), beginning with data collection, preprocessing, transformation, data mining, and evaluation. The research findings consist of visualization and decision tree rules that reveal GPA as the most influential factor in determining a student's study period.There is other information, namely, students graduated on time (less than equal to 4 years) amounted to 170 or 54.5% and students did not graduate on time (more than 4 years) amounted to 142 or 45.6%. Testing the performance of decision tree (C4.5) utilizing confusion matrix through RapidMiner tools, resulted in accuracy reaching 83.87%, with precision of 87.50% and recall of 91.18%. Provides evidence that the decision tree algorithm (C4.5) has optimal performance to provide valuable information about predicting student graduation in order to increase student enrollment with the right study period.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用决策树算法(C4.5)实施数据挖掘以预测学生的学习时间
按时毕业是每个学生在大学期间都希望实现的目标。哈姆卡博士教授穆罕默迪亚大学的学生就是怀揣这一梦想的学生之一。根据追踪研究的 2020 届毕业生数据,60% 的人表示大学对提高能力的影响足够大。 这一数据表明,大学需要对学术质量的提高进行评估。通常情况下,学生很难找到支持按时毕业的重要因素的信息。需要进行预测分析,以提供有关学生毕业学习期的信息。为了进行这项分析,使用决策树(C4.5)算法的分类功能和 RapidMiner 工具实施了数据挖掘。数据挖掘的实施方法遵循数据库知识发现(KDD)的各个阶段,包括数据收集、预处理、转换、数据挖掘和评估。研究结果由可视化和决策树规则组成,显示 GPA 是决定学生学习时间的最有影响力的因素,还有其他信息,即按时毕业(少于等于 4 年)的学生有 170 人,占 54.5%,未按时毕业(超过 4 年)的学生有 142 人,占 45.6%。通过 RapidMiner 工具利用混淆矩阵测试决策树(C4.5)的性能,结果准确率达到 83.87%,精确率为 87.50%,召回率为 91.18%。这证明了决策树算法(C4.5)具有最佳性能,可为预测学生毕业提供有价值的信息,从而在正确的学习阶段提高学生入学率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Factors Influencing Acceptance of ILMU E-Learning Among Lecturers: An Empirical Study Based on UTAUT Model Detection of Rice Leaf Pests Based on Images with Convolution Neural Network in Yollo v8 Enterprise Architecture Planning Pada Industri Otomotif Pitcar Service Menggunakan Odoo Information Technology Security Audit at the YDSF National Zakat Institution Using the ISO 27001 Framework Data-Driven Strategies for Fuel Distribution in Indonesia: A Case Study of PT Pertamina Patra Niaga
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1