{"title":"HyAlg: A Multi-algorithm Cooperation for Balancing Performance and Accuracy","authors":"Hongwei Zhou, Xiaojie Huang, Zhipeng Ke, Yuchen Zhang, Jinhui Yuan","doi":"10.1109/ICISCAE55891.2022.9927556","DOIUrl":null,"url":null,"abstract":"Most of the existing text similarity algorithms aim to improve the accuracy, but this introduces a high overhead because of scanning repeatedly the text to collect the necessary features. In order to achieve a good balance between accuracy and performance overhead, this paper proposes a novel method based on multi-algorithm collaboration, which we call HyAlg, is the abbreviation of Hybrid Algorithm. Hy Alg consists of two stages which are general similarity determination and structure similarity determination. The former uses the general features of the text to quickly complete the similarity determination, and initially eliminates a large number of dissimilar texts, thus reducing the time cost. On the basis of the former filtering results, the latter further determines the similarity from the perspective of text structure to ensure the accuracy of similarity determination. Our experimental and analysis show that Hy Alg is able to effectively reduce the performance cost of the algorithm while ensuring the accuracy.","PeriodicalId":115061,"journal":{"name":"International Conference on Information Systems and Computer Aided Education","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Information Systems and Computer Aided Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISCAE55891.2022.9927556","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Most of the existing text similarity algorithms aim to improve the accuracy, but this introduces a high overhead because of scanning repeatedly the text to collect the necessary features. In order to achieve a good balance between accuracy and performance overhead, this paper proposes a novel method based on multi-algorithm collaboration, which we call HyAlg, is the abbreviation of Hybrid Algorithm. Hy Alg consists of two stages which are general similarity determination and structure similarity determination. The former uses the general features of the text to quickly complete the similarity determination, and initially eliminates a large number of dissimilar texts, thus reducing the time cost. On the basis of the former filtering results, the latter further determines the similarity from the perspective of text structure to ensure the accuracy of similarity determination. Our experimental and analysis show that Hy Alg is able to effectively reduce the performance cost of the algorithm while ensuring the accuracy.