P. Mirabal, José Ignacio Abreu Salas, Diego Seco, Oscar Pedreira, Edgar Chávez
{"title":"New initialization for algorithms to solve Median String Problem","authors":"P. Mirabal, José Ignacio Abreu Salas, Diego Seco, Oscar Pedreira, Edgar Chávez","doi":"10.1109/SCCC51225.2020.9281215","DOIUrl":null,"url":null,"abstract":"The median string problem is NP-hard under several formulations, being the most competitive heuristics those using perturbation-based iterative algorithms. The initial string and the policy to order possible edit operations are key to the efficiency of such approaches. In this work, we tackle both sub-problems. We hypothesized that a better starting point for the algorithm can reduce the number of edit distances computed to obtain the median string, improving time performance without degrading the quality of the solution. Regarding the starting point, we use the median of a few strings of the input, that is selected as the Half Space Proximal (HSP) neighbors of the median of the set. The HSP neighbors are simultaneously close to the center but also diverse among them. To validate these results, we present comparative experiments, attending mainly to the quality of the median obtained, the time to compute such median, and the number of edit distances computed.","PeriodicalId":117157,"journal":{"name":"2020 39th International Conference of the Chilean Computer Science Society (SCCC)","volume":"124 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 39th International Conference of the Chilean Computer Science Society (SCCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCCC51225.2020.9281215","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The median string problem is NP-hard under several formulations, being the most competitive heuristics those using perturbation-based iterative algorithms. The initial string and the policy to order possible edit operations are key to the efficiency of such approaches. In this work, we tackle both sub-problems. We hypothesized that a better starting point for the algorithm can reduce the number of edit distances computed to obtain the median string, improving time performance without degrading the quality of the solution. Regarding the starting point, we use the median of a few strings of the input, that is selected as the Half Space Proximal (HSP) neighbors of the median of the set. The HSP neighbors are simultaneously close to the center but also diverse among them. To validate these results, we present comparative experiments, attending mainly to the quality of the median obtained, the time to compute such median, and the number of edit distances computed.