The Markov-Chain Polytope with Applications

arXiv - CS - Information Theory | Pub Date: 2024-01-21 | DOI: arxiv-2401.11622

Mordecai J. Golin, Albert John Lalim Patupat

Abstract

This paper addresses the problem of finding a minimum-cost $m$-state Markov chain $(S_0,\ldots,S_{m-1})$ in a large set of chains. The chains studied have a reward associated with each state. The cost of a chain is its "gain", i.e., its average reward under its stationary distribution. Specifically, for each $k=0,\ldots,m-1$ there is a known set ${\mathbb S}_k$ of type-$k$ states. A permissible Markov chain contains exactly one state of each type; the problem is to find a minimum-cost permissible chain. The original motivation was to find a cheapest binary AIFV-$m$ lossless code on a source alphabet of size $n$. Such a code is an $m$-tuple of trees, in which each tree can be viewed as a Markov Chain state. This formulation was then used to address other problems in lossless compression. The known solution techniques for finding minimum-cost Markov chains were iterative and ran in exponential time. This paper shows how to map every possible type-$k$ state into a type-$k$ hyperplane and then define a "Markov Chain Polytope" as the lower envelope of all such hyperplanes. Finding a minimum-cost Markov chain can then be shown to be equivalent to finding a "highest" point on this polytope. The local optimization procedures used in the previous iterative algorithms are shown to be separation oracles for this polytope. Since these were often polynomial time, an application of the Ellipsoid method immediately leads to polynomial time algorithms for these problems.
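To make the cost definition concrete, the following Python sketch computes the gain of a small chain (its average reward under the stationary distribution) and picks the cheapest permissible chain by brute-force enumeration. It is not the paper's method: enumeration takes time exponential in $m$, and the polytope and Ellipsoid machinery is not shown. The candidate states, their rewards and transition rows, and all function names are hypothetical, and each chain is assumed to have a unique stationary distribution.

```python
from fractions import Fraction
from itertools import product


def stationary_distribution(P):
    """Solve pi P = pi with sum(pi) = 1 by exact Gauss-Jordan elimination.

    Assumes the chain has a unique stationary distribution.
    """
    m = len(P)
    # Row i encodes sum_j pi_j * P[j][i] - pi_i = 0.
    A = [[Fraction(P[j][i]) - (1 if i == j else 0) for j in range(m)]
         for i in range(m)]
    b = [Fraction(0)] * m
    # Replace the last (redundant) equation with the normalisation sum(pi) = 1.
    A[-1] = [Fraction(1)] * m
    b[-1] = Fraction(1)
    for col in range(m):
        pivot = next(r for r in range(col, m) if A[r][col] != 0)
        A[col], A[pivot] = A[pivot], A[col]
        b[col], b[pivot] = b[pivot], b[col]
        inv = 1 / A[col][col]
        A[col] = [a * inv for a in A[col]]
        b[col] *= inv
        for r in range(m):
            if r != col and A[r][col] != 0:
                f = A[r][col]
                A[r] = [ar - f * ac for ar, ac in zip(A[r], A[col])]
                b[r] -= f * b[col]
    return b


def gain(chain):
    """Average reward of a chain given as a tuple of (reward, transition_row)."""
    P = [row for _, row in chain]
    pi = stationary_distribution(P)
    return sum(p * reward for p, (reward, _) in zip(pi, chain))


def min_cost_chain(candidates):
    """Brute force: try one state of each type and keep the smallest gain."""
    return min(product(*candidates), key=gain)


# Hypothetical instance with m = 2 and two candidate states per type.
half, quarter = Fraction(1, 2), Fraction(1, 4)
candidates = [
    [(2, [half, half]), (3, [3 * quarter, quarter])],       # type-0 states
    [(1, [half, half]), (4, [Fraction(1), Fraction(0)])],   # type-1 states
]
best = min_cost_chain(candidates)
print(gain(best))  # 3/2
```

With $|{\mathbb S}_k|$ candidates per type, this enumeration examines $\prod_k |{\mathbb S}_k|$ chains, which is the blow-up the polytope formulation is meant to avoid.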
