Eunji Lee, Htet Myet Lynn, Chang Choi, Hanil Kim, Pankoo Kim
{"title":"基于扩展索引项的文本重用度量方法研究","authors":"Eunji Lee, Htet Myet Lynn, Chang Choi, Hanil Kim, Pankoo Kim","doi":"10.1145/3129676.3129686","DOIUrl":null,"url":null,"abstract":"Text reuse has become prominent in the process of information content digitalization owing to the popularization of the Internet and smartphones. Problems related to text reuse are various and complex, and these include problems related to text insertion, deletion, and replacement, and changing of word order. Moreover, in order to inspect reuse in texts with many sources, there must be an efficient method to inspect within a reasonable amount of time and using a reasonable amount of resources. This work is an attempt to improve accuracy of text reuse measurement by using expanded index terms, expanding the range of reused inspection sentences, and circularizing words in order to resolve the issue of undetected reused sentences that arise from the replacement of similar terms. The efficiency of the proposed method was proven through a comparative evaluation with the existing reuse inspection methods.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Study on a Text Reuse Measurement Method Using Expanded Index Term\",\"authors\":\"Eunji Lee, Htet Myet Lynn, Chang Choi, Hanil Kim, Pankoo Kim\",\"doi\":\"10.1145/3129676.3129686\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text reuse has become prominent in the process of information content digitalization owing to the popularization of the Internet and smartphones. Problems related to text reuse are various and complex, and these include problems related to text insertion, deletion, and replacement, and changing of word order. Moreover, in order to inspect reuse in texts with many sources, there must be an efficient method to inspect within a reasonable amount of time and using a reasonable amount of resources. This work is an attempt to improve accuracy of text reuse measurement by using expanded index terms, expanding the range of reused inspection sentences, and circularizing words in order to resolve the issue of undetected reused sentences that arise from the replacement of similar terms. The efficiency of the proposed method was proven through a comparative evaluation with the existing reuse inspection methods.\",\"PeriodicalId\":326100,\"journal\":{\"name\":\"Proceedings of the International Conference on Research in Adaptive and Convergent Systems\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the International Conference on Research in Adaptive and Convergent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3129676.3129686\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3129676.3129686","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Study on a Text Reuse Measurement Method Using Expanded Index Term
Text reuse has become prominent in the process of information content digitalization owing to the popularization of the Internet and smartphones. Problems related to text reuse are various and complex, and these include problems related to text insertion, deletion, and replacement, and changing of word order. Moreover, in order to inspect reuse in texts with many sources, there must be an efficient method to inspect within a reasonable amount of time and using a reasonable amount of resources. This work is an attempt to improve accuracy of text reuse measurement by using expanded index terms, expanding the range of reused inspection sentences, and circularizing words in order to resolve the issue of undetected reused sentences that arise from the replacement of similar terms. The efficiency of the proposed method was proven through a comparative evaluation with the existing reuse inspection methods.