Detecting Source Code Plagiarism on .NET Programming Languages using Low-level Representation and Adaptive Local Alignment

IF 0.3 Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Journal of Information and Organizational Sciences Pub Date : 2017-06-16 DOI:10.31341/JIOS.41.1.7
Faqih Salban Rabbani, Oscar Karnalim
{"title":"Detecting Source Code Plagiarism on .NET Programming Languages using Low-level Representation and Adaptive Local Alignment","authors":"Faqih Salban Rabbani, Oscar Karnalim","doi":"10.31341/JIOS.41.1.7","DOIUrl":null,"url":null,"abstract":"Even though there are various source code plagiarism detection approaches, only a few works which are focused on low-level representation for deducting similarity. Most of them are only focused on lexical token sequence extracted from source code. In our point of view, low-level representation is more beneficial than lexical token since its form is more compact than the source code itself. It only considers semantic-preserving instructions and ignores many source code delimiter tokens. This paper proposes a source code plagiarism detection which rely on low-level representation. For a case study, we focus our work on .NET programming languages with Common Intermediate Language as its low-level representation. In addition, we also incorporate Adaptive Local Alignment for detecting similarity. According to Lim et al, this algorithm outperforms code similarity state-of-the-art algorithm (i.e. Greedy String Tiling) in term of effectiveness. According to our evaluation which involves various plagiarism attacks, our approach is more effective and efficient when compared with standard lexical-token approach.","PeriodicalId":43428,"journal":{"name":"Journal of Information and Organizational Sciences","volume":"41 1","pages":"105-123"},"PeriodicalIF":0.3000,"publicationDate":"2017-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.31341/JIOS.41.1.7","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information and Organizational Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31341/JIOS.41.1.7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 19

Abstract

Even though there are various source code plagiarism detection approaches, only a few works which are focused on low-level representation for deducting similarity. Most of them are only focused on lexical token sequence extracted from source code. In our point of view, low-level representation is more beneficial than lexical token since its form is more compact than the source code itself. It only considers semantic-preserving instructions and ignores many source code delimiter tokens. This paper proposes a source code plagiarism detection which rely on low-level representation. For a case study, we focus our work on .NET programming languages with Common Intermediate Language as its low-level representation. In addition, we also incorporate Adaptive Local Alignment for detecting similarity. According to Lim et al, this algorithm outperforms code similarity state-of-the-art algorithm (i.e. Greedy String Tiling) in term of effectiveness. According to our evaluation which involves various plagiarism attacks, our approach is more effective and efficient when compared with standard lexical-token approach.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用低级表示和自适应局部对齐检测。net编程语言的源代码抄袭
尽管有各种各样的源代码抄袭检测方法,但只有少数作品专注于低级表示来扣除相似度。它们大多只关注从源代码中提取的词法标记序列。在我们看来,低级表示比词法标记更有益,因为它的形式比源代码本身更紧凑。它只考虑保持语义的指令,而忽略许多源代码分隔符令牌。提出了一种基于底层表示的源代码抄袭检测方法。对于一个案例研究,我们将工作重点放在。net编程语言上,并将公共中间语言作为其低级表示形式。此外,我们还结合了自适应局部对齐来检测相似度。根据Lim等人的研究,该算法在有效性上优于代码相似最先进的算法(即贪心字符串拼接)。根据我们对各种抄袭攻击的评估,我们的方法比标准的词汇-令牌方法更有效。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Information and Organizational Sciences
Journal of Information and Organizational Sciences COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-
CiteScore
1.10
自引率
0.00%
发文量
14
审稿时长
12 weeks
期刊最新文献
Employing a Time Series Forecasting Model for Tourism Demand Using ANFIS A Mobile Based Pharmacy Store Location-aware System The Contribution of Women on Corporate Boards Croatian Journals Covered by SCIE/SSCI Towards an Improved Framework for E-Risk Management for Digital Financial Services (DFS) in Ugandan Banks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1