{"title":"Random forest models reveal academic and financial factors outweigh demographics in predicting completion of a year-round veterinary program.","authors":"Sarah E Hooper, Natalie Ragland, Elpida Artemiou","doi":"10.2460/javma.24.08.0501","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>The purpose of this study was to develop random forest classifier models (a type of supervised machine learning algorithm) that could (1) predict students who will or will not complete the DVM degree requirements and (2) identify the top predictors for academic success and completion of the DVM degree.</p><p><strong>Methods: </strong>The study utilized Ross University School of Veterinary Medicine student records from 2013 to 2022. Twenty-four variables encompassing demographic (eg, age, race), academic (eg, grade point average), and financial aid (eg, outstanding balances) data were assessed in 11 cross-validated random forest machine learning models. One model was built assessing all years of data and 10 individual models were developed for each enrollment year to compare how the top predictors of success varied among the years.</p><p><strong>Results: </strong>Consistently, only academic and financial factors were identified as being features of importance (predictors) in all models. Demographic factors such as race were not important for predicting student success. All models performed very well to excellently based on multiple performance metrics including accuracy, ranging from 96.1% to 99%, and the areas under the receiver operating characteristic curves, ranging from 98.1% to 99.9%.</p><p><strong>Conclusions: </strong>The random forest algorithm is a powerful machine learning prediction model that performs well with veterinary student academic records and is customizable such that variables important to each veterinary school's student population can be assessed.</p><p><strong>Clinical relevance: </strong>Identifying predictors of success as well as at-risk students is essential for providing targeted curricular interventions to increase retention and achieve timely completion of a DVM degree.</p>","PeriodicalId":14658,"journal":{"name":"Javma-journal of The American Veterinary Medical Association","volume":" ","pages":"1-9"},"PeriodicalIF":1.6000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Javma-journal of The American Veterinary Medical Association","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.2460/javma.24.08.0501","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"VETERINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: The purpose of this study was to develop random forest classifier models (a type of supervised machine learning algorithm) that could (1) predict students who will or will not complete the DVM degree requirements and (2) identify the top predictors for academic success and completion of the DVM degree.
Methods: The study utilized Ross University School of Veterinary Medicine student records from 2013 to 2022. Twenty-four variables encompassing demographic (eg, age, race), academic (eg, grade point average), and financial aid (eg, outstanding balances) data were assessed in 11 cross-validated random forest machine learning models. One model was built assessing all years of data and 10 individual models were developed for each enrollment year to compare how the top predictors of success varied among the years.
Results: Consistently, only academic and financial factors were identified as being features of importance (predictors) in all models. Demographic factors such as race were not important for predicting student success. All models performed very well to excellently based on multiple performance metrics including accuracy, ranging from 96.1% to 99%, and the areas under the receiver operating characteristic curves, ranging from 98.1% to 99.9%.
Conclusions: The random forest algorithm is a powerful machine learning prediction model that performs well with veterinary student academic records and is customizable such that variables important to each veterinary school's student population can be assessed.
Clinical relevance: Identifying predictors of success as well as at-risk students is essential for providing targeted curricular interventions to increase retention and achieve timely completion of a DVM degree.
期刊介绍:
Published twice monthly, this peer-reviewed, general scientific journal provides reports of clinical research, feature articles and regular columns of interest to veterinarians in private and public practice. The News and Classified Ad sections are posted online 10 days to two weeks before they are delivered in print.