R. A. Sukamto, M. Rischa, E. Piantari, Yudi Wibisono, R. Megasari
Proceedings of the 2022 International Conference on Computer, Control, Informatics and Its Applications. Published 2022-11-22. DOI: https://doi.org/10.1145/3575882.3575949
Auto Code Comment Assessment for Online Judge using Word Embedding and Word Mover's Distance
Comments in source code are a form of inline documentation that programmers write to help others understand what a program does. Students in introductory programming courses need to learn to write better code comments, but assessing those comments is difficult for lecturers. The authors therefore propose an automatic source code comment assessment method for an online judge system, based on a corpus-based text similarity approach. Word2vec, GloVe, and fastText models are used to train word vectors on the Indonesian Wikipedia dump, and similarity is measured with Word Mover's Distance (WMD). Experiments vary the number of training epochs. The models are compared using Spearman's rho correlation coefficient, mean absolute error (MAE), and performance measurements. The proposed word-embedding approaches do not yet give good results.
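The two agreement measures named in the abstract can be illustrated with a minimal sketch. It assumes that MAE denotes mean absolute error and that the lecturer's scores and the automatic scores are on the same scale. The function names and the toy scores are hypothetical and are not taken from the paper.

```python
from math import sqrt
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1 to n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the value at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; assumes neither input is constant."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def spearman_rho(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman's rho is the Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


def mean_absolute_error(pred: Sequence[float], target: Sequence[float]) -> float:
    """Average absolute difference between predicted and reference scores."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


if __name__ == "__main__":
    # Hypothetical scores for five student comments (illustration only).
    lecturer_scores = [80.0, 60.0, 90.0, 40.0, 70.0]
    automatic_scores = [75.0, 65.0, 85.0, 50.0, 60.0]
    print("rho:", spearman_rho(lecturer_scores, automatic_scores))
    print("MAE:", mean_absolute_error(automatic_scores, lecturer_scores))
```

Spearman's rho rewards a model for ordering the comments the way the lecturer does, while MAE measures how far the scores themselves are from the lecturer's, so a model can do well on one and poorly on the other.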