{"title":"The Markov-Chain Polytope with Applications","authors":"Mordecai J. Golin, Albert John Lalim Patupat","doi":"arxiv-2401.11622","DOIUrl":null,"url":null,"abstract":"This paper addresses the problem of finding a minimum-cost $m$-state Markov\nchain $(S_0,\\ldots,S_{m-1})$ in a large set of chains. The chains studied have\na reward associated with each state. The cost of a chain is its \"gain\", i.e.,\nits average reward under its stationary distribution. Specifically, for each $k=0,\\ldots,m-1$ there is a known set ${\\mathbb S}_k$\nof type-$k$ states. A permissible Markov chain contains exactly one state of\neach type; the problem is to find a minimum-cost permissible chain. The original motivation was to find a cheapest binary AIFV-$m$ lossless code\non a source alphabet of size $n$. Such a code is an $m$-tuple of trees, in\nwhich each tree can be viewed as a Markov Chain state. This formulation was\nthen used to address other problems in lossless compression. The known solution\ntechniques for finding minimum-cost Markov chains were iterative and ran in\nexponential time. This paper shows how to map every possible type-$k$ state into a type-$k$\nhyperplane and then define a \"Markov Chain Polytope\" as the lower envelope of\nall such hyperplanes. Finding a minimum-cost Markov chain can then be shown to\nbe equivalent to finding a \"highest\" point on this polytope. The local optimization procedures used in the previous iterative algorithms\nare shown to be separation oracles for this polytope. Since these were often\npolynomial time, an application of the Ellipsoid method immediately leads to\npolynomial time algorithms for these problems.","PeriodicalId":501433,"journal":{"name":"arXiv - CS - Information Theory","volume":"23 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Information Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2401.11622","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper addresses the problem of finding a minimum-cost $m$-state Markov
chain $(S_0,\ldots,S_{m-1})$ in a large set of chains. The chains studied have
a reward associated with each state. The cost of a chain is its "gain", i.e.,
its average reward under its stationary distribution. Specifically, for each $k=0,\ldots,m-1$ there is a known set ${\mathbb S}_k$
of type-$k$ states. A permissible Markov chain contains exactly one state of
each type; the problem is to find a minimum-cost permissible chain. The original motivation was to find a cheapest binary AIFV-$m$ lossless code
on a source alphabet of size $n$. Such a code is an $m$-tuple of trees, in
which each tree can be viewed as a Markov Chain state. This formulation was
then used to address other problems in lossless compression. The known solution
techniques for finding minimum-cost Markov chains were iterative and ran in
exponential time. This paper shows how to map every possible type-$k$ state into a type-$k$
hyperplane and then define a "Markov Chain Polytope" as the lower envelope of
all such hyperplanes. Finding a minimum-cost Markov chain can then be shown to
be equivalent to finding a "highest" point on this polytope. The local optimization procedures used in the previous iterative algorithms
are shown to be separation oracles for this polytope. Since these were often
polynomial time, an application of the Ellipsoid method immediately leads to
polynomial time algorithms for these problems.