{"title":"Modifications of the Burrows and Wheeler data compression algorithm","authors":"B. Balkenhol, S. Kurtz, Y. Shtarkov","doi":"10.1109/DCC.1999.755668","DOIUrl":null,"url":null,"abstract":"We improve upon previous results on the Burrows and Wheeler (BW)-algorithm. Based on the context tree model, we consider the specific statistical properties of the data at the output of the BWT. We describe six important properties, three of which have not been described elsewhere. These considerations lead to modifications of the coding method, which in turn improve the coding efficiency. We briefly describe how to compute the BWT with low complexity in time and space, using suffix trees in two different representations. Finally, we present experimental results about the compression rate and running time of our method, and compare these results to previous achievements.","PeriodicalId":103598,"journal":{"name":"Proceedings DCC'99 Data Compression Conference (Cat. No. PR00096)","volume":"1048 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"72","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings DCC'99 Data Compression Conference (Cat. No. PR00096)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DCC.1999.755668","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 72
Abstract
We improve upon previous results on the Burrows and Wheeler (BW)-algorithm. Based on the context tree model, we consider the specific statistical properties of the data at the output of the BWT. We describe six important properties, three of which have not been described elsewhere. These considerations lead to modifications of the coding method, which in turn improve the coding efficiency. We briefly describe how to compute the BWT with low complexity in time and space, using suffix trees in two different representations. Finally, we present experimental results about the compression rate and running time of our method, and compare these results to previous achievements.
我们改进了Burrows and Wheeler (BW)算法的先前结果。基于上下文树模型,我们考虑了BWT输出时数据的特定统计属性。我们描述了六个重要的性质,其中三个没有在其他地方描述。这些考虑导致了对编码方法的修改,从而提高了编码效率。我们简要描述了如何使用后缀树在两种不同的表示中计算低时间和空间复杂度的BWT。最后,给出了该方法的压缩率和运行时间的实验结果,并与前人的成果进行了比较。