{"title":"Code Visualization for Plagiarism Detection","authors":"D. Bernhauer","doi":"10.1109/SNAMS58071.2022.10062664","DOIUrl":null,"url":null,"abstract":"The use of deep convolutional neural networks is very common in computer graphics. With this, methods for exploiting knowledge in other fields are also developing. Finding plagiarism among student source codes is challenging, especially when students have the same assignment. In this case, we try to find differences between two semantically identical codes at the level of syntax, approach, or just style. This paper aims to visualize binary codes and verify if it is possible to detect plagiarism using deep convolution neural networks. Using the siamese network, we trained a neural network to evaluate the similarity between the two programs. The training data for our network are the ICPC competition submissions for which we can be confident of their authorship. The overall success rate of our model consistently reaches 75 to 80 % accuracy, which mainly shows that the visualization of inherently non-graphical entities (like source code) can be useful in the application of neural networks designed primarily for graphical purposes.","PeriodicalId":371668,"journal":{"name":"2022 Ninth International Conference on Social Networks Analysis, Management and Security (SNAMS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Ninth International Conference on Social Networks Analysis, Management and Security (SNAMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SNAMS58071.2022.10062664","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The use of deep convolutional neural networks is very common in computer graphics. With this, methods for exploiting knowledge in other fields are also developing. Finding plagiarism among student source codes is challenging, especially when students have the same assignment. In this case, we try to find differences between two semantically identical codes at the level of syntax, approach, or just style. This paper aims to visualize binary codes and verify if it is possible to detect plagiarism using deep convolution neural networks. Using the siamese network, we trained a neural network to evaluate the similarity between the two programs. The training data for our network are the ICPC competition submissions for which we can be confident of their authorship. The overall success rate of our model consistently reaches 75 to 80 % accuracy, which mainly shows that the visualization of inherently non-graphical entities (like source code) can be useful in the application of neural networks designed primarily for graphical purposes.