{"title":"Modelling a machine learning based multivariate content grading system for YouTube Tamil-post analysis","authors":"G. Srivatsun, S. Thivaharan","doi":"10.3233/jifs-222504","DOIUrl":null,"url":null,"abstract":"Writing is a crucial component of the language requirement and is an effective method for correctly reflecting language proficiency. Manually evaluating Tamil language exams becomes time-consuming and costly for standardized language administrators as they grow in popularity. Numerous studies on computerized English assessment systems have been conducted in recent years. Due to Tamil text’s complicated grammatical structures, less research has been done on computerized evaluation methods. In this research, we present a Tamil review comment analysis system using a novel multivariate naïve Bayes classifier (mv - NB) where the comments are acquired from an online social network and performed training using the database for further analysis. Experiments show that the graded Kappa of 0.4239, error rate of 2.55 and precision of 85% was achieved on the online dataset by our contents grading system, which is superior in grading compared to the other widely used machine learning algorithms training on big datasets. Our findings are promising. Additionally, our contents analysis may provide beneficial criticism on Tamil writing on YouTube posts including comments, spelling errors and morphological issues that help to analyze thelanguage correlation.","PeriodicalId":54795,"journal":{"name":"Journal of Intelligent & Fuzzy Systems","volume":"132 11","pages":"0"},"PeriodicalIF":1.7000,"publicationDate":"2023-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent & Fuzzy Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/jifs-222504","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Writing is a crucial component of the language requirement and is an effective method for correctly reflecting language proficiency. Manually evaluating Tamil language exams becomes time-consuming and costly for standardized language administrators as they grow in popularity. Numerous studies on computerized English assessment systems have been conducted in recent years. Due to Tamil text’s complicated grammatical structures, less research has been done on computerized evaluation methods. In this research, we present a Tamil review comment analysis system using a novel multivariate naïve Bayes classifier (mv - NB) where the comments are acquired from an online social network and performed training using the database for further analysis. Experiments show that the graded Kappa of 0.4239, error rate of 2.55 and precision of 85% was achieved on the online dataset by our contents grading system, which is superior in grading compared to the other widely used machine learning algorithms training on big datasets. Our findings are promising. Additionally, our contents analysis may provide beneficial criticism on Tamil writing on YouTube posts including comments, spelling errors and morphological issues that help to analyze thelanguage correlation.
期刊介绍:
The purpose of the Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology is to foster advancements of knowledge and help disseminate results concerning recent applications and case studies in the areas of fuzzy logic, intelligent systems, and web-based applications among working professionals and professionals in education and research, covering a broad cross-section of technical disciplines.