GPU Acceleration of Longest Common Substrings Algorithm
Ádám Pintér, S. Szénási
2023 IEEE 17th International Symposium on Applied Computational Intelligence and Informatics (SACI), 2023-05-23
DOI: 10.1109/SACI58269.2023.10158638 (https://doi.org/10.1109/SACI58269.2023.10158638)
Citations: 0
Abstract
The longest common substring of two strings is the longest contiguous character sequence that appears in both. It is widely used in text similarity measures, where it is often computed repeatedly over the same textual data. Several methods for solving the problem are already known, but most of them rely on procedures that are very intensive in both time and memory. This paper presents a novel data-parallel model for the same problem, suitable for GPU implementation. As our experimental results show, the data-parallel implementation is significantly faster for long textual data.
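To make the problem definition concrete, here is a minimal Python sketch of the textbook dynamic-programming solution. It is not the data-parallel GPU model presented in the paper, whose details the abstract does not give; the function name `longest_common_substring` and its variables are illustrative assumptions.

```python
def longest_common_substring(a: str, b: str) -> str:
    """Return one longest common substring of a and b.

    Textbook dynamic programming, O(len(a) * len(b)) time.
    Illustrative baseline only, not the paper's GPU method.
    """
    best_len = 0
    best_end = 0  # exclusive end index in `a` of the best match so far
    # prev[j] = length of the common suffix of a[:i-1] and b[:j]
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                # Extend the match that ended at the upper-left neighbor.
                curr[j] = prev[j - 1] + 1
                if curr[j] > best_len:
                    best_len = curr[j]
                    best_end = i
        prev = curr
    return a[best_end - best_len:best_end]


if __name__ == "__main__":
    print(longest_common_substring("xabcdy", "zzabcdw"))
```

In this formulation each cell depends only on its upper-left neighbor, so the cells of one row can be computed independently once the previous row is known. That independence is what makes the problem a plausible fit for data-parallel hardware, although the model in the paper may be organized differently.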