Leslie J. Wardley, Enayat Rajabi, Saman Hassanzadeh Amin, Monisha Ramesh
A machine learning approach feature to forecast the future performance of the universities in Canada
Machine Learning with Applications, vol. 16, Article 100548. Published 2024-03-19.
DOI: 10.1016/j.mlwa.2024.100548
URL: https://www.sciencedirect.com/science/article/pii/S2666827024000240
Citations: 0
Abstract
University ranking measures the performance of Higher Education Institutions (HEIs) by evaluating them on criteria such as student satisfaction, expenditure, research and teaching quality, citation count, grants, and enrolment. Ranking has been identified as a vital factor in students’ decisions about which institution to attend. Hence, universities seek to increase their overall rank, use these measures of success in their marketing communications, and prominently display their ranked status on their institutional websites. Despite decades of research on ranking methods, only a limited number of studies have leveraged predictive analytics and machine learning to rank universities. In this article, we collected data on 49 Canadian universities for 2017–2021 and divided them, following Maclean's categories, into Primarily Undergraduate, Comprehensive, and Medical/Doctoral universities. After identifying the input and output components, we applied various feature engineering and machine learning techniques to predict the universities’ ranks. We used Pearson Correlation, Feature Importance, and Chi-Square as the feature engineering methods, and the results show that “student to faculty ratio,” “total number of citations,” and “total number of grants” are the most important factors in ranking Canadian universities. The best-performing models were Random Forest for the Primarily Undergraduate category, the Voting classifier for the Comprehensive category, and Gradient Boosting for the Medical/Doctoral category. The selected machine learning models were evaluated on accuracy, precision, recall, and F1 score.
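As a rough illustration of two steps named in the abstract, the sketch below scores candidate features by their Pearson correlation with rank and computes accuracy, precision, recall, and F1 for a set of predicted rank tiers. It is not the authors' code. The feature values, the tier labels ("top", "mid", "low"), the function names, and the choice of macro-averaging across classes are all assumptions made for the example; the abstract does not say how the metrics were averaged or how ranks were encoded.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / sqrt(var_x * var_y)


def macro_scores(y_true, y_pred):
    """Accuracy plus macro-averaged precision, recall, and F1 (assumed averaging)."""
    labels = sorted(set(y_true) | set(y_pred))
    precisions, recalls, f1s = [], [], []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)
    k = len(labels)
    return {
        "accuracy": sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true),
        "precision": sum(precisions) / k,
        "recall": sum(recalls) / k,
        "f1": sum(f1s) / k,
    }


# Hypothetical toy values, NOT data from the study.
features = {
    "student_to_faculty_ratio": [14.0, 18.0, 22.0, 25.0, 30.0],
    "total_citations": [9000, 7500, 4000, 3500, 1200],
}
rank = [1, 2, 3, 4, 5]  # 1 = best

# Order features by the absolute strength of their correlation with rank.
ranked = sorted(features,
                key=lambda name: abs(pearson(features[name], rank)),
                reverse=True)
for name in ranked:
    print(name, round(pearson(features[name], rank), 3))

# Hypothetical rank tiers and predictions, used only to exercise the metrics.
y_true = ["top", "mid", "low", "mid"]
y_pred = ["top", "mid", "mid", "mid"]
print(macro_scores(y_true, y_pred))
```

Macro-averaging weights every rank tier equally regardless of how many universities fall into it, which seemed a reasonable default for a small, possibly imbalanced sample. A weighted or micro average would give different numbers.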