Stemmer Porter和Nazief-Adriani对衡量抄袭的筛选算法性能的影响比较

A. Rahmatulloh, Neng Ika Kurniati, I. Darmawan, Adi Zaenal Asyikin, Deden Witarsyah
{"title":"Stemmer Porter和Nazief-Adriani对衡量抄袭的筛选算法性能的影响比较","authors":"A. Rahmatulloh, Neng Ika Kurniati, I. Darmawan, Adi Zaenal Asyikin, Deden Witarsyah","doi":"10.6025/jdim/2020/18/2/49-56","DOIUrl":null,"url":null,"abstract":"Current technological developments change physical paper patterns into digital, which has a very high impact. Positive impact because paper waste is reduced, on the other hand, the rampant copying of digital data raises the amount of plagiarism that is increasing. At present, there are many efforts made by experts to overcome the problem of plagiarism, one of which is by utilizing the winnowing algorithm as a tool to detect plagiarism data. In its development, many optimizing winnowing algorithms used stemming techniques. The most widely used stemmer algorithms include stemmer porter and nazief-adriani. However, there has not been a discussion on the comparison of the effect of performance using stemmer on the winnowing algorithm in measuring the value of plagiarism. So it is necessary to do research on the effect of stemmer algorithms on winnowing algorithms so that the results of plagiarism detection are more optimal. The results of this study indicate that the effect of nazief-adriani stemmer on the winnowing algorithm is superior to the stemmer porter, only decreasing the detection performance of the 0.28% similarity value while the porter stemmer is superior in increasing the processing time to 69% faster. Subject Categories and Descriptors [I.1.2 Algorithms]; [H.3.3 Information Search and Retrieval] General Terms: Plagiarism Detection, Winnowing algorithms, Stemmers","PeriodicalId":303976,"journal":{"name":"J. Digit. Inf. Manag.","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparison of the Effects Stemmer Porter and Nazief-Adriani on the Performance of Winnowing Algorithms for Measuring Plagiarism\",\"authors\":\"A. Rahmatulloh, Neng Ika Kurniati, I. Darmawan, Adi Zaenal Asyikin, Deden Witarsyah\",\"doi\":\"10.6025/jdim/2020/18/2/49-56\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Current technological developments change physical paper patterns into digital, which has a very high impact. Positive impact because paper waste is reduced, on the other hand, the rampant copying of digital data raises the amount of plagiarism that is increasing. At present, there are many efforts made by experts to overcome the problem of plagiarism, one of which is by utilizing the winnowing algorithm as a tool to detect plagiarism data. In its development, many optimizing winnowing algorithms used stemming techniques. The most widely used stemmer algorithms include stemmer porter and nazief-adriani. However, there has not been a discussion on the comparison of the effect of performance using stemmer on the winnowing algorithm in measuring the value of plagiarism. So it is necessary to do research on the effect of stemmer algorithms on winnowing algorithms so that the results of plagiarism detection are more optimal. The results of this study indicate that the effect of nazief-adriani stemmer on the winnowing algorithm is superior to the stemmer porter, only decreasing the detection performance of the 0.28% similarity value while the porter stemmer is superior in increasing the processing time to 69% faster. Subject Categories and Descriptors [I.1.2 Algorithms]; [H.3.3 Information Search and Retrieval] General Terms: Plagiarism Detection, Winnowing algorithms, Stemmers\",\"PeriodicalId\":303976,\"journal\":{\"name\":\"J. Digit. Inf. Manag.\",\"volume\":\"66 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"J. Digit. Inf. Manag.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.6025/jdim/2020/18/2/49-56\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Digit. Inf. Manag.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.6025/jdim/2020/18/2/49-56","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

当前的技术发展将物理纸张模式转变为数字模式,这具有非常高的影响。积极的影响一方面是因为纸张的浪费减少了,另一方面,数字数据的猖獗复制增加了剽窃的数量。目前,为了克服抄袭问题,专家们做了很多努力,其中之一就是利用筛选算法作为检测抄袭数据的工具。在其发展过程中,许多优化筛选算法都使用了词干提取技术。使用最广泛的stemmer算法包括stemmer porter和nazief-adriani。然而,在衡量抄袭价值时,使用stemmer的性能与筛选算法的效果比较,尚未有讨论。因此,有必要研究stemmer算法对筛选算法的影响,使抄袭检测结果更加优化。本研究结果表明,nazief-adriani茎秆对筛选算法的影响优于茎秆搬运工,仅降低了0.28%相似值的检测性能,而茎秆搬运工则将处理时间提高了69%。主题类别和描述符[I.1.2算法];[H.3.3信息检索]一般术语:抄袭检测、筛选算法、Stemmers
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Comparison of the Effects Stemmer Porter and Nazief-Adriani on the Performance of Winnowing Algorithms for Measuring Plagiarism
Current technological developments change physical paper patterns into digital, which has a very high impact. Positive impact because paper waste is reduced, on the other hand, the rampant copying of digital data raises the amount of plagiarism that is increasing. At present, there are many efforts made by experts to overcome the problem of plagiarism, one of which is by utilizing the winnowing algorithm as a tool to detect plagiarism data. In its development, many optimizing winnowing algorithms used stemming techniques. The most widely used stemmer algorithms include stemmer porter and nazief-adriani. However, there has not been a discussion on the comparison of the effect of performance using stemmer on the winnowing algorithm in measuring the value of plagiarism. So it is necessary to do research on the effect of stemmer algorithms on winnowing algorithms so that the results of plagiarism detection are more optimal. The results of this study indicate that the effect of nazief-adriani stemmer on the winnowing algorithm is superior to the stemmer porter, only decreasing the detection performance of the 0.28% similarity value while the porter stemmer is superior in increasing the processing time to 69% faster. Subject Categories and Descriptors [I.1.2 Algorithms]; [H.3.3 Information Search and Retrieval] General Terms: Plagiarism Detection, Winnowing algorithms, Stemmers
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Graph-Based Approach for Aspect Extraction from Online Customer Reviews A Study of Data Requirements for Data Mining Applications in Banking Knowledge-Intensive Decision Support System for Manufacturing Equipment Maintenance Real Estate Loan Knowledge-Based Recommender System Comparison of the Effects Stemmer Porter and Nazief-Adriani on the Performance of Winnowing Algorithms for Measuring Plagiarism
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1