Wahyudi, Ricky Akbar, Teguh Nurhadi Suharsono, A. S. Indrapriyatna
{"title":"基于余弦相似度向量空间模型的论文电子测试","authors":"Wahyudi, Ricky Akbar, Teguh Nurhadi Suharsono, A. S. Indrapriyatna","doi":"10.1109/ISITDI55734.2022.9944506","DOIUrl":null,"url":null,"abstract":"The covid-19 pandemic has been pushing the development of online learning systems in Indonesia. In online learning, computer-based essay tests and assessments have an essential role. Essay test systems are designed to mimic the concept of essay tests without being computer-based. The answer from the lecturer is compared to the response from the student. The TF-IDF (Term Frequency -Inverse Document Frequency) cosine similarity is used. It is one of the methods of information re-gathering systems. The process in this model consists of two types: 1) creating a corpus/ inverted file, and the second is cosine similarity (CS) for calculating the similarity of the user's answers with the lecturer's. Creating a corpus/inverted file involves several stages like data collection, parsing sentences into terms, stoplist, weighting with IDF, and term weighting using TF-IDF. The cosine similarity process consists of parsing users' answers, weighting users' answers using TF-IDF, and finding cosine similarity values of users' answers with lecturers' answers using the vector space model. The highest cosine similarity value is taken to give the user's answer points. Testing the Essay Test system produces excellent grades. The tests were done Mean Squared Error (MSE) values resulted in an average MSE value of 3.28 from three students.","PeriodicalId":312644,"journal":{"name":"2022 International Symposium on Information Technology and Digital Innovation (ISITDI)","volume":"76 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Essay Test Based E-Testing Using Cosine Similarity Vector Space Model\",\"authors\":\"Wahyudi, Ricky Akbar, Teguh Nurhadi Suharsono, A. S. Indrapriyatna\",\"doi\":\"10.1109/ISITDI55734.2022.9944506\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The covid-19 pandemic has been pushing the development of online learning systems in Indonesia. In online learning, computer-based essay tests and assessments have an essential role. Essay test systems are designed to mimic the concept of essay tests without being computer-based. The answer from the lecturer is compared to the response from the student. The TF-IDF (Term Frequency -Inverse Document Frequency) cosine similarity is used. It is one of the methods of information re-gathering systems. The process in this model consists of two types: 1) creating a corpus/ inverted file, and the second is cosine similarity (CS) for calculating the similarity of the user's answers with the lecturer's. Creating a corpus/inverted file involves several stages like data collection, parsing sentences into terms, stoplist, weighting with IDF, and term weighting using TF-IDF. The cosine similarity process consists of parsing users' answers, weighting users' answers using TF-IDF, and finding cosine similarity values of users' answers with lecturers' answers using the vector space model. The highest cosine similarity value is taken to give the user's answer points. Testing the Essay Test system produces excellent grades. The tests were done Mean Squared Error (MSE) values resulted in an average MSE value of 3.28 from three students.\",\"PeriodicalId\":312644,\"journal\":{\"name\":\"2022 International Symposium on Information Technology and Digital Innovation (ISITDI)\",\"volume\":\"76 2\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Symposium on Information Technology and Digital Innovation (ISITDI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISITDI55734.2022.9944506\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Symposium on Information Technology and Digital Innovation (ISITDI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISITDI55734.2022.9944506","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
2019冠状病毒病大流行推动了印度尼西亚在线学习系统的发展。在在线学习中,基于计算机的论文测试和评估起着至关重要的作用。论文测试系统旨在模仿论文测试的概念,而不是基于计算机的。讲师的回答与学生的回答相比较。使用TF-IDF (Term Frequency -Inverse Document Frequency)余弦相似度。它是信息再收集系统的方法之一。该模型中的过程包括两种类型:1)创建语料库/倒排文件,第二种是余弦相似度(CS),用于计算用户的答案与讲师的答案的相似度。创建语料库/反向文件涉及几个阶段,如数据收集、将句子解析为术语、停止列表、使用IDF加权和使用TF-IDF加权术语。余弦相似度过程包括解析用户答案,使用TF-IDF对用户答案进行加权,使用向量空间模型找到用户答案与讲师答案的余弦相似度值。取最大的余弦相似度值来给出用户的答案点。测试论文测试系统产生优秀的成绩。均方误差(MSE)值导致三个学生的平均MSE值为3.28。
Essay Test Based E-Testing Using Cosine Similarity Vector Space Model
The covid-19 pandemic has been pushing the development of online learning systems in Indonesia. In online learning, computer-based essay tests and assessments have an essential role. Essay test systems are designed to mimic the concept of essay tests without being computer-based. The answer from the lecturer is compared to the response from the student. The TF-IDF (Term Frequency -Inverse Document Frequency) cosine similarity is used. It is one of the methods of information re-gathering systems. The process in this model consists of two types: 1) creating a corpus/ inverted file, and the second is cosine similarity (CS) for calculating the similarity of the user's answers with the lecturer's. Creating a corpus/inverted file involves several stages like data collection, parsing sentences into terms, stoplist, weighting with IDF, and term weighting using TF-IDF. The cosine similarity process consists of parsing users' answers, weighting users' answers using TF-IDF, and finding cosine similarity values of users' answers with lecturers' answers using the vector space model. The highest cosine similarity value is taken to give the user's answer points. Testing the Essay Test system produces excellent grades. The tests were done Mean Squared Error (MSE) values resulted in an average MSE value of 3.28 from three students.