{"title":"概率多项式时间的通用几乎最优压缩和睡狼编码","authors":"Bruno Bauwens*, Marius Zimand","doi":"https://dl.acm.org/doi/10.1145/3575807","DOIUrl":null,"url":null,"abstract":"<p>In a lossless compression system with target lengths, a compressor 𝒞 maps an integer <i>m</i> and a binary string <i>x</i> to an <i>m</i>-bit code <i>p</i>, and if <i>m</i> is sufficiently large, a decompressor 𝒟 reconstructs <i>x</i> from <i>p</i>. We call a pair (<i>m,x</i>) <i>achievable</i> for (𝒞,𝒟) if this reconstruction is successful. We introduce the notion of an optimal compressor 𝒞<sub>opt</sub> by the following universality property: For any compressor-decompressor pair (𝒞,𝒟), there exists a decompressor 𝒟<sup>′</sup> such that if <i>(m,x)</i> is achievable for (𝒞,𝒟), then (<i>m</i> + Δ , <i>x</i>) is achievable for (𝒞<sub>opt</sub>, 𝒟<sup>′</sup>), where Δ is some small value called the overhead. We show that there exists an optimal compressor that has only polylogarithmic overhead and works in probabilistic polynomial time. Differently said, for any pair (𝒞,𝒟), no matter how slow 𝒞 is, or even if 𝒞 is non-computable, 𝒞<sub><i>opt</i></sub> is a fixed compressor that in polynomial time produces codes almost as short as those of 𝒞. The cost is that the corresponding decompressor is slower.</p><p>We also show that each such optimal compressor can be used for distributed compression, in which case it can achieve optimal compression rates as given in the Slepian–Wolf theorem and even for the Kolmogorov complexity variant of this theorem.</p>","PeriodicalId":50022,"journal":{"name":"Journal of the ACM","volume":"22 6","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2023-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Universal almost Optimal Compression and Slepian-wolf Coding in Probabilistic Polynomial Time\",\"authors\":\"Bruno Bauwens*, Marius Zimand\",\"doi\":\"https://dl.acm.org/doi/10.1145/3575807\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In a lossless compression system with target lengths, a compressor 𝒞 maps an integer <i>m</i> and a binary string <i>x</i> to an <i>m</i>-bit code <i>p</i>, and if <i>m</i> is sufficiently large, a decompressor 𝒟 reconstructs <i>x</i> from <i>p</i>. We call a pair (<i>m,x</i>) <i>achievable</i> for (𝒞,𝒟) if this reconstruction is successful. We introduce the notion of an optimal compressor 𝒞<sub>opt</sub> by the following universality property: For any compressor-decompressor pair (𝒞,𝒟), there exists a decompressor 𝒟<sup>′</sup> such that if <i>(m,x)</i> is achievable for (𝒞,𝒟), then (<i>m</i> + Δ , <i>x</i>) is achievable for (𝒞<sub>opt</sub>, 𝒟<sup>′</sup>), where Δ is some small value called the overhead. We show that there exists an optimal compressor that has only polylogarithmic overhead and works in probabilistic polynomial time. Differently said, for any pair (𝒞,𝒟), no matter how slow 𝒞 is, or even if 𝒞 is non-computable, 𝒞<sub><i>opt</i></sub> is a fixed compressor that in polynomial time produces codes almost as short as those of 𝒞. The cost is that the corresponding decompressor is slower.</p><p>We also show that each such optimal compressor can be used for distributed compression, in which case it can achieve optimal compression rates as given in the Slepian–Wolf theorem and even for the Kolmogorov complexity variant of this theorem.</p>\",\"PeriodicalId\":50022,\"journal\":{\"name\":\"Journal of the ACM\",\"volume\":\"22 6\",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2023-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the ACM\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/https://dl.acm.org/doi/10.1145/3575807\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the ACM","FirstCategoryId":"94","ListUrlMain":"https://doi.org/https://dl.acm.org/doi/10.1145/3575807","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
Universal almost Optimal Compression and Slepian-wolf Coding in Probabilistic Polynomial Time
In a lossless compression system with target lengths, a compressor 𝒞 maps an integer m and a binary string x to an m-bit code p, and if m is sufficiently large, a decompressor 𝒟 reconstructs x from p. We call a pair (m,x) achievable for (𝒞,𝒟) if this reconstruction is successful. We introduce the notion of an optimal compressor 𝒞opt by the following universality property: For any compressor-decompressor pair (𝒞,𝒟), there exists a decompressor 𝒟′ such that if (m,x) is achievable for (𝒞,𝒟), then (m + Δ , x) is achievable for (𝒞opt, 𝒟′), where Δ is some small value called the overhead. We show that there exists an optimal compressor that has only polylogarithmic overhead and works in probabilistic polynomial time. Differently said, for any pair (𝒞,𝒟), no matter how slow 𝒞 is, or even if 𝒞 is non-computable, 𝒞opt is a fixed compressor that in polynomial time produces codes almost as short as those of 𝒞. The cost is that the corresponding decompressor is slower.
We also show that each such optimal compressor can be used for distributed compression, in which case it can achieve optimal compression rates as given in the Slepian–Wolf theorem and even for the Kolmogorov complexity variant of this theorem.
期刊介绍:
The best indicator of the scope of the journal is provided by the areas covered by its Editorial Board. These areas change from time to time, as the field evolves. The following areas are currently covered by a member of the Editorial Board: Algorithms and Combinatorial Optimization; Algorithms and Data Structures; Algorithms, Combinatorial Optimization, and Games; Artificial Intelligence; Complexity Theory; Computational Biology; Computational Geometry; Computer Graphics and Computer Vision; Computer-Aided Verification; Cryptography and Security; Cyber-Physical, Embedded, and Real-Time Systems; Database Systems and Theory; Distributed Computing; Economics and Computation; Information Theory; Logic and Computation; Logic, Algorithms, and Complexity; Machine Learning and Computational Learning Theory; Networking; Parallel Computing and Architecture; Programming Languages; Quantum Computing; Randomized Algorithms and Probabilistic Analysis of Algorithms; Scientific Computing and High Performance Computing; Software Engineering; Web Algorithms and Data Mining