{"title":"Algorithms for Compressed Inputs","authors":"Nathan Brunelle, G. Robins, Abhi Shelat","doi":"10.1109/DCC.2013.60","DOIUrl":null,"url":null,"abstract":"We study compression-aware algorithms, i.e. algorithms that can exploit regularity in their input data by directly operating on compressed data. While popular with string algorithms, we consider this idea for algorithms operating on numeric sequences and graphs that have been compressed using a variety of schemes including LZ77, grammar-based compression, a graph interpretation of Re-Pair, and a method presented by Boldi and Vigna in The Web Graph Framework. In all cases, we discover algorithms outperforming a trivial approach: to decompress the input and run a standard algorithm. We aim to develop an algorithmic toolkit for basic tasks to operate on a variety of compression inputs.","PeriodicalId":388717,"journal":{"name":"2013 Data Compression Conference","volume":"22 6S 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 Data Compression Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DCC.2013.60","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
We study compression-aware algorithms, i.e. algorithms that can exploit regularity in their input data by directly operating on compressed data. While popular with string algorithms, we consider this idea for algorithms operating on numeric sequences and graphs that have been compressed using a variety of schemes including LZ77, grammar-based compression, a graph interpretation of Re-Pair, and a method presented by Boldi and Vigna in The Web Graph Framework. In all cases, we discover algorithms outperforming a trivial approach: to decompress the input and run a standard algorithm. We aim to develop an algorithmic toolkit for basic tasks to operate on a variety of compression inputs.
我们研究压缩感知算法,即通过直接操作压缩数据来利用输入数据中的规律性的算法。虽然在字符串算法中很流行,但我们认为这个想法适用于使用各种方案压缩的数字序列和图的算法,包括LZ77、基于语法的压缩、Re-Pair的图解释以及Boldi和Vigna在The Web graph Framework中提出的方法。在所有情况下,我们都发现算法优于一种简单的方法:解压缩输入并运行标准算法。我们的目标是为基本任务开发一个算法工具包,以在各种压缩输入上操作。