Omar Chamorro-Atalaya, J. Arévalo-Tuesta, Denisse Balarezo-Mares, Anthony Gonzáles-Pacheco, Olga Mendoza-León, Manuel Quipuscoa-Silvestre, Gregorio Tomás-Quispe, Raul Suarez-Bazalar
{"title":"大学生满意度的意见分类算法的K-Fold交叉验证","authors":"Omar Chamorro-Atalaya, J. Arévalo-Tuesta, Denisse Balarezo-Mares, Anthony Gonzáles-Pacheco, Olga Mendoza-León, Manuel Quipuscoa-Silvestre, Gregorio Tomás-Quispe, Raul Suarez-Bazalar","doi":"10.3991/ijoe.v19i11.39887","DOIUrl":null,"url":null,"abstract":"When using machine-learning techniques to determine algorithms or ranking models that identify student satisfaction, algorithms are often trained and tested on a single data set, leading to bias in their performance metrics. This article aims to identify the best algorithm to classify the satisfaction of university students applying the K-fold cross-validation technique, comparing the error rates of the performance metrics before and after its application. The method used began with the collection of student opinions on the teaching performance of the social network Twitter during an academic semester. Then, sentiment analysis was used for data processing, through which it was possible to categorize the opinions of the students into “satisfied” or “dissatisfied.” The results showed that the algorithm with the lowest error rate in its performance metric was the support vector machine (SVM). In addition, it was identified that its classification probability reached an accuracy of 91.76%. It is concluded that SVM classification using K-fold cross-validation will contribute to determining which factors associated with the teacher’s didactic strategies should be improved in each class session, since traditional surveying techniques have shortcomings.","PeriodicalId":36900,"journal":{"name":"International Journal of Online and Biomedical Engineering","volume":" ","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2023-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"K-Fold Cross-Validation through Identification of the Opinion Classification Algorithm for the Satisfaction of University Students\",\"authors\":\"Omar Chamorro-Atalaya, J. Arévalo-Tuesta, Denisse Balarezo-Mares, Anthony Gonzáles-Pacheco, Olga Mendoza-León, Manuel Quipuscoa-Silvestre, Gregorio Tomás-Quispe, Raul Suarez-Bazalar\",\"doi\":\"10.3991/ijoe.v19i11.39887\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"When using machine-learning techniques to determine algorithms or ranking models that identify student satisfaction, algorithms are often trained and tested on a single data set, leading to bias in their performance metrics. This article aims to identify the best algorithm to classify the satisfaction of university students applying the K-fold cross-validation technique, comparing the error rates of the performance metrics before and after its application. The method used began with the collection of student opinions on the teaching performance of the social network Twitter during an academic semester. Then, sentiment analysis was used for data processing, through which it was possible to categorize the opinions of the students into “satisfied” or “dissatisfied.” The results showed that the algorithm with the lowest error rate in its performance metric was the support vector machine (SVM). In addition, it was identified that its classification probability reached an accuracy of 91.76%. It is concluded that SVM classification using K-fold cross-validation will contribute to determining which factors associated with the teacher’s didactic strategies should be improved in each class session, since traditional surveying techniques have shortcomings.\",\"PeriodicalId\":36900,\"journal\":{\"name\":\"International Journal of Online and Biomedical Engineering\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2023-08-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Online and Biomedical Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3991/ijoe.v19i11.39887\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Online and Biomedical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3991/ijoe.v19i11.39887","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
K-Fold Cross-Validation through Identification of the Opinion Classification Algorithm for the Satisfaction of University Students
When using machine-learning techniques to determine algorithms or ranking models that identify student satisfaction, algorithms are often trained and tested on a single data set, leading to bias in their performance metrics. This article aims to identify the best algorithm to classify the satisfaction of university students applying the K-fold cross-validation technique, comparing the error rates of the performance metrics before and after its application. The method used began with the collection of student opinions on the teaching performance of the social network Twitter during an academic semester. Then, sentiment analysis was used for data processing, through which it was possible to categorize the opinions of the students into “satisfied” or “dissatisfied.” The results showed that the algorithm with the lowest error rate in its performance metric was the support vector machine (SVM). In addition, it was identified that its classification probability reached an accuracy of 91.76%. It is concluded that SVM classification using K-fold cross-validation will contribute to determining which factors associated with the teacher’s didactic strategies should be improved in each class session, since traditional surveying techniques have shortcomings.