{"title":"最长公共子序列的线性时间n0.4逼近","authors":"Karl Bringmann, Vincent Cohen-Addad, Debarati Das","doi":"https://dl.acm.org/doi/10.1145/3568398","DOIUrl":null,"url":null,"abstract":"<p>We consider the classic problem of computing the <b>Longest Common Subsequence (LCS)</b> of two strings of length <i>n</i>. The 40-year-old quadratic-time dynamic programming algorithm has recently been shown to be near-optimal by Abboud, Backurs, and Vassilevska Williams [FOCS’15] and Bringmann and Künnemann [FOCS’15] assuming the Strong Exponential Time Hypothesis. This has led the community to look for subquadratic <i>approximation</i> algorithms for the problem.</p><p>Yet, unlike the edit distance problem for which a constant-factor approximation in almost-linear time is known, very little progress has been made on LCS, making it a notoriously difficult problem also in the realm of approximation. For the general setting, only a naive <i>O</i>(<i>n</i><sup>ɛ</sup>/2-approximation algorithm with running time <i>OŠ</i>(<i>n<sup>2-ɛ</sup></i> has been known, for any constant 0 < ɛ ≤ 1. Recently, a breakthrough result by Hajiaghayi, Seddighin, Seddighin, and Sun [SODA’19] provided a linear-time algorithm that yields a <i>O</i>(<i>n</i><sup>0.497956</sup>-approximation in expectation; improving upon the naive \\(O(\\sqrt {n})\\)-approximation for the first time.</p><p>In this paper, we provide an algorithm that in time <i>O</i>(<i>n</i><sub>2-ɛ</sub>) computes an <i>OŠ</i>(<i>n<sup>2ɛ/5</sup></i>-approximation with high probability, for any 0 < ɛ ≤ 1. Our result (1) gives an <i>OŠ</i>(<i>n</i><sup>0.4</sup>-approximation in linear time, improving upon the bound of Hajiaghayi, Seddighin, Seddighin, and Sun, (2) provides an algorithm whose approximation scales with any subquadratic running time <i>O</i>(<i>n</i><sup>2-ɛ</sup>), improving upon the naive bound of <i>O</i>(<i>n</i><sup>ɛ/2</sup>) for any ɛ, and (3) instead of only in expectation, succeeds with high probability.</p>","PeriodicalId":50922,"journal":{"name":"ACM Transactions on Algorithms","volume":"46 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Linear-Time n0.4-Approximation for Longest Common Subsequence\",\"authors\":\"Karl Bringmann, Vincent Cohen-Addad, Debarati Das\",\"doi\":\"https://dl.acm.org/doi/10.1145/3568398\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>We consider the classic problem of computing the <b>Longest Common Subsequence (LCS)</b> of two strings of length <i>n</i>. The 40-year-old quadratic-time dynamic programming algorithm has recently been shown to be near-optimal by Abboud, Backurs, and Vassilevska Williams [FOCS’15] and Bringmann and Künnemann [FOCS’15] assuming the Strong Exponential Time Hypothesis. This has led the community to look for subquadratic <i>approximation</i> algorithms for the problem.</p><p>Yet, unlike the edit distance problem for which a constant-factor approximation in almost-linear time is known, very little progress has been made on LCS, making it a notoriously difficult problem also in the realm of approximation. For the general setting, only a naive <i>O</i>(<i>n</i><sup>ɛ</sup>/2-approximation algorithm with running time <i>OŠ</i>(<i>n<sup>2-ɛ</sup></i> has been known, for any constant 0 < ɛ ≤ 1. Recently, a breakthrough result by Hajiaghayi, Seddighin, Seddighin, and Sun [SODA’19] provided a linear-time algorithm that yields a <i>O</i>(<i>n</i><sup>0.497956</sup>-approximation in expectation; improving upon the naive \\\\(O(\\\\sqrt {n})\\\\)-approximation for the first time.</p><p>In this paper, we provide an algorithm that in time <i>O</i>(<i>n</i><sub>2-ɛ</sub>) computes an <i>OŠ</i>(<i>n<sup>2ɛ/5</sup></i>-approximation with high probability, for any 0 < ɛ ≤ 1. Our result (1) gives an <i>OŠ</i>(<i>n</i><sup>0.4</sup>-approximation in linear time, improving upon the bound of Hajiaghayi, Seddighin, Seddighin, and Sun, (2) provides an algorithm whose approximation scales with any subquadratic running time <i>O</i>(<i>n</i><sup>2-ɛ</sup>), improving upon the naive bound of <i>O</i>(<i>n</i><sup>ɛ/2</sup>) for any ɛ, and (3) instead of only in expectation, succeeds with high probability.</p>\",\"PeriodicalId\":50922,\"journal\":{\"name\":\"ACM Transactions on Algorithms\",\"volume\":\"46 1\",\"pages\":\"\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2023-02-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Transactions on Algorithms\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/https://dl.acm.org/doi/10.1145/3568398\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Algorithms","FirstCategoryId":"94","ListUrlMain":"https://doi.org/https://dl.acm.org/doi/10.1145/3568398","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
摘要
我们考虑计算两个长度为n的字符串的最长公共子序列(LCS)的经典问题。Abboud, Backurs, and Vassilevska Williams [FOCS ' 15]和Bringmann and k nnemann [FOCS ' 15]最近证明了40年前的二次时间动态规划算法在强指数时间假设下是接近最优的。这使得社区开始寻找针对该问题的次二次逼近算法。然而,与编辑距离问题不同的是,在几乎线性时间内已知的常因子近似值,LCS几乎没有取得进展,这使得它在近似值领域也是一个众所周知的难题。对于一般设置,对于任意常数0 &lt,只有已知的运行时间为OŠ(n2- æ)的朴素O(n æ /2)近似算法;(1)最近,Hajiaghayi, Seddighin, Seddighin和Sun [SODA ' 19]的突破性成果提供了一种线性时间算法,该算法在期望上产生O(n0.497956)近似值;第一次改进了朴素的\(O(\sqrt {n})\) -近似。在本文中,我们提供了一种算法,该算法在时间O(n2- æ)内对任意0 &lt,以高概率计算出OŠ(n2 æ /5)近似;(1)我们的结果(1)给出了线性时间内的OŠ(n0.4)近似,改进了Hajiaghayi, Seddighin, Seddighin和Sun的界;(2)提供了一个算法,其近似尺度为任意次二次运行时间O(n2- æ),改进了任意æ的O(n æ /2)的naive界;(3)而不是仅在期望中,以高概率成功。
A Linear-Time n0.4-Approximation for Longest Common Subsequence
We consider the classic problem of computing the Longest Common Subsequence (LCS) of two strings of length n. The 40-year-old quadratic-time dynamic programming algorithm has recently been shown to be near-optimal by Abboud, Backurs, and Vassilevska Williams [FOCS’15] and Bringmann and Künnemann [FOCS’15] assuming the Strong Exponential Time Hypothesis. This has led the community to look for subquadratic approximation algorithms for the problem.
Yet, unlike the edit distance problem for which a constant-factor approximation in almost-linear time is known, very little progress has been made on LCS, making it a notoriously difficult problem also in the realm of approximation. For the general setting, only a naive O(nɛ/2-approximation algorithm with running time OŠ(n2-ɛ has been known, for any constant 0 < ɛ ≤ 1. Recently, a breakthrough result by Hajiaghayi, Seddighin, Seddighin, and Sun [SODA’19] provided a linear-time algorithm that yields a O(n0.497956-approximation in expectation; improving upon the naive \(O(\sqrt {n})\)-approximation for the first time.
In this paper, we provide an algorithm that in time O(n2-ɛ) computes an OŠ(n2ɛ/5-approximation with high probability, for any 0 < ɛ ≤ 1. Our result (1) gives an OŠ(n0.4-approximation in linear time, improving upon the bound of Hajiaghayi, Seddighin, Seddighin, and Sun, (2) provides an algorithm whose approximation scales with any subquadratic running time O(n2-ɛ), improving upon the naive bound of O(nɛ/2) for any ɛ, and (3) instead of only in expectation, succeeds with high probability.
期刊介绍:
ACM Transactions on Algorithms welcomes submissions of original research of the highest quality dealing with algorithms that are inherently discrete and finite, and having mathematical content in a natural way, either in the objective or in the analysis. Most welcome are new algorithms and data structures, new and improved analyses, and complexity results. Specific areas of computation covered by the journal include
combinatorial searches and objects;
counting;
discrete optimization and approximation;
randomization and quantum computation;
parallel and distributed computation;
algorithms for
graphs,
geometry,
arithmetic,
number theory,
strings;
on-line analysis;
cryptography;
coding;
data compression;
learning algorithms;
methods of algorithmic analysis;
discrete algorithms for application areas such as
biology,
economics,
game theory,
communication,
computer systems and architecture,
hardware design,
scientific computing