Ricardo Timarán-Pereira, Javier Caicedo-Zambrano, Andrea Timarán-Buchely
{"title":"Applying Predictive Data Mining to Discover Factors Associated to the Language Skill Performance from Elementary School Students","authors":"Ricardo Timarán-Pereira, Javier Caicedo-Zambrano, Andrea Timarán-Buchely","doi":"10.19053/01211129.v31.n62.2022.14814","DOIUrl":null,"url":null,"abstract":"In this paper, predictive data mining techniques are applied to determine the academic performance from fifth grade students in the Saber 5° tests Language skill at Colombian elementary schools in 2017. We employed the CRISP-DM methodology. Socioeconomic, academic, and institutional information was available at the ICFES databases. A minable dataset was obtained using data cleaning and transformation techniques. A decision tree was built with the Weka tool J48 algorithm. Some of the predictors of the discovered patterns are the nature and location of the school, whether or not students failed a school year, the age group, the mother's educational attainment, and the rates of ICTs and household appliances. The findings of this research serve as quality information for the decision-making at the Ministry of National Education (MEN) and the secretaries of education, and for the directors of elementary educational institutions to define improvement plans that result in the quality of elementary school education in Colombia.","PeriodicalId":42846,"journal":{"name":"Revista Facultad de Ingenieria, Universidad Pedagogica y Tecnologica de Colombia","volume":"1 1","pages":""},"PeriodicalIF":0.2000,"publicationDate":"2022-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Revista Facultad de Ingenieria, Universidad Pedagogica y Tecnologica de Colombia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.19053/01211129.v31.n62.2022.14814","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, predictive data mining techniques are applied to determine the academic performance from fifth grade students in the Saber 5° tests Language skill at Colombian elementary schools in 2017. We employed the CRISP-DM methodology. Socioeconomic, academic, and institutional information was available at the ICFES databases. A minable dataset was obtained using data cleaning and transformation techniques. A decision tree was built with the Weka tool J48 algorithm. Some of the predictors of the discovered patterns are the nature and location of the school, whether or not students failed a school year, the age group, the mother's educational attainment, and the rates of ICTs and household appliances. The findings of this research serve as quality information for the decision-making at the Ministry of National Education (MEN) and the secretaries of education, and for the directors of elementary educational institutions to define improvement plans that result in the quality of elementary school education in Colombia.