{"title":"Predicting degree-completion time with data mining","authors":"M. Wati, Haeruddin, Wahyu Indrawan","doi":"10.1109/ICSITECH.2017.8257209","DOIUrl":null,"url":null,"abstract":"Data mining in academic databases nowadays used for analyzing patterns and gaining new useful knowledge. This paper tries to predict the degree-completion time of bachelor's degree students using data mining technique and algorithms especially C4.5 and naive Bayes classifier algorithm, and measure the algorithms accuracy, precision, and recall percentages for both algorithms also exploring some factors that assume in theory have some impact on the model. The result from given dataset to build the models shows that C4.5 algorithm better than naive Bayes classifier algorithm with 78% accuracy, 85% weighted mean class precision, and 65% weighted mean class recall. This research can be expanded with different data mining algorithms or other related attributes that have some effects to the degree-completion time.","PeriodicalId":165045,"journal":{"name":"2017 3rd International Conference on Science in Information Technology (ICSITech)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 3rd International Conference on Science in Information Technology (ICSITech)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSITECH.2017.8257209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Data mining in academic databases nowadays used for analyzing patterns and gaining new useful knowledge. This paper tries to predict the degree-completion time of bachelor's degree students using data mining technique and algorithms especially C4.5 and naive Bayes classifier algorithm, and measure the algorithms accuracy, precision, and recall percentages for both algorithms also exploring some factors that assume in theory have some impact on the model. The result from given dataset to build the models shows that C4.5 algorithm better than naive Bayes classifier algorithm with 78% accuracy, 85% weighted mean class precision, and 65% weighted mean class recall. This research can be expanded with different data mining algorithms or other related attributes that have some effects to the degree-completion time.