{"title":"Big Data in Education: Students at Risk as a Case Study","authors":"Ahmed B. Altamimi","doi":"10.48084/etasr.6190","DOIUrl":null,"url":null,"abstract":"This paper analyzes various machine learning algorithms to predict student failure in a specific educational dataset and a specific environment. The paper handles the prediction of student failure given the students' grades, course difficulty level, and GPA, differing from most of the provided studies in the literature, where focus is given to the surrounding environment. The main aim is to early detect students at risk of academic underperformance and implement specific interventions to enhance their academic outcomes. A diverse set of eleven Machine Learning (ML) algorithms was used to analyze the dataset. The data went through preprocessing, and features were engineered to effectively capture essential information that may impact students' academic performance. A meticulous process for model selection and evaluation was utilized to compare the algorithms' performance with regard to metrics such as accuracy, precision, recall, F-score, specificity, and balanced accuracy. Our results demonstrate significant variability in the performance of the different algorithms, with Artificial Neural Networks (ANNs) and Convolutional Neural Networks (CNNs) showing the highest overall performance, followed closely by Gradient Boosting Classifier (GBC), Neuro-Fuzzy, and Random Forest (RF). The other algorithms exhibit varying performance levels, with the Recurrent Neural Networks (RNNs) showing the weakest results in recall and F-score. Educational institutions can use the insight gained from this study to make data-driven decisions and design targeted interventions to help students at risk succeed academically. Furthermore, the methodology presented in this paper can be generalized and applied to other educational datasets for similar predictive purposes.","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48084/etasr.6190","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
This paper analyzes various machine learning algorithms to predict student failure in a specific educational dataset and a specific environment. The paper handles the prediction of student failure given the students' grades, course difficulty level, and GPA, differing from most of the provided studies in the literature, where focus is given to the surrounding environment. The main aim is to early detect students at risk of academic underperformance and implement specific interventions to enhance their academic outcomes. A diverse set of eleven Machine Learning (ML) algorithms was used to analyze the dataset. The data went through preprocessing, and features were engineered to effectively capture essential information that may impact students' academic performance. A meticulous process for model selection and evaluation was utilized to compare the algorithms' performance with regard to metrics such as accuracy, precision, recall, F-score, specificity, and balanced accuracy. Our results demonstrate significant variability in the performance of the different algorithms, with Artificial Neural Networks (ANNs) and Convolutional Neural Networks (CNNs) showing the highest overall performance, followed closely by Gradient Boosting Classifier (GBC), Neuro-Fuzzy, and Random Forest (RF). The other algorithms exhibit varying performance levels, with the Recurrent Neural Networks (RNNs) showing the weakest results in recall and F-score. Educational institutions can use the insight gained from this study to make data-driven decisions and design targeted interventions to help students at risk succeed academically. Furthermore, the methodology presented in this paper can be generalized and applied to other educational datasets for similar predictive purposes.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.