{"title":"稀疏Logistic回归分析大学辍学率影响因素","authors":"G. Hori","doi":"10.1109/IIAI-AAI.2018.00091","DOIUrl":null,"url":null,"abstract":"Prediction and prevention of dropout are main challenges of institutional research (IR) in universities nowa- days. If we can identify factors significantly contributing to dropout by statistical inference, then it is helpful for devel- oping strategies for dropout prevention. The main problem in identifying such significant factors is that available data contain much more candidate factors than the students. Most of conventional statistical methods do not work well unless the number of data is much more than the number of parameters to be estimated, which means that we cannot apply such methods in identifying factors contributing to dropout. To circumvent the situation, we propose to use sparse logistic regression for identifying factors contributing to dropout based on data with a large number of candidate factors. Sparse logis- tic regression is a method that can analyze such data reliably by pruning factors that do not contribute to the analysis. To demonstrate how sparse logistic regression identifies factors contributing to dropout, we applied the method to actual university credit data of 410 students for 302 courses and identified 18 courses that significantly contribute to dropout. The contributions of the identified courses to dropout are interpreted.","PeriodicalId":309975,"journal":{"name":"2018 7th International Congress on Advanced Applied Informatics (IIAI-AAI)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Identifying Factors Contributing to University Dropout with Sparse Logistic Regression\",\"authors\":\"G. Hori\",\"doi\":\"10.1109/IIAI-AAI.2018.00091\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Prediction and prevention of dropout are main challenges of institutional research (IR) in universities nowa- days. If we can identify factors significantly contributing to dropout by statistical inference, then it is helpful for devel- oping strategies for dropout prevention. The main problem in identifying such significant factors is that available data contain much more candidate factors than the students. Most of conventional statistical methods do not work well unless the number of data is much more than the number of parameters to be estimated, which means that we cannot apply such methods in identifying factors contributing to dropout. To circumvent the situation, we propose to use sparse logistic regression for identifying factors contributing to dropout based on data with a large number of candidate factors. Sparse logis- tic regression is a method that can analyze such data reliably by pruning factors that do not contribute to the analysis. To demonstrate how sparse logistic regression identifies factors contributing to dropout, we applied the method to actual university credit data of 410 students for 302 courses and identified 18 courses that significantly contribute to dropout. The contributions of the identified courses to dropout are interpreted.\",\"PeriodicalId\":309975,\"journal\":{\"name\":\"2018 7th International Congress on Advanced Applied Informatics (IIAI-AAI)\",\"volume\":\"70 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 7th International Congress on Advanced Applied Informatics (IIAI-AAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IIAI-AAI.2018.00091\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 7th International Congress on Advanced Applied Informatics (IIAI-AAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIAI-AAI.2018.00091","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Identifying Factors Contributing to University Dropout with Sparse Logistic Regression
Prediction and prevention of dropout are main challenges of institutional research (IR) in universities nowa- days. If we can identify factors significantly contributing to dropout by statistical inference, then it is helpful for devel- oping strategies for dropout prevention. The main problem in identifying such significant factors is that available data contain much more candidate factors than the students. Most of conventional statistical methods do not work well unless the number of data is much more than the number of parameters to be estimated, which means that we cannot apply such methods in identifying factors contributing to dropout. To circumvent the situation, we propose to use sparse logistic regression for identifying factors contributing to dropout based on data with a large number of candidate factors. Sparse logis- tic regression is a method that can analyze such data reliably by pruning factors that do not contribute to the analysis. To demonstrate how sparse logistic regression identifies factors contributing to dropout, we applied the method to actual university credit data of 410 students for 302 courses and identified 18 courses that significantly contribute to dropout. The contributions of the identified courses to dropout are interpreted.