Ekaterina Shemetova, Alexander Okhotin, Semyon Grigorev
{"title":"Rational Index of Languages Defined by Grammars with Bounded Dimension of Parse Trees","authors":"Ekaterina Shemetova, Alexander Okhotin, Semyon Grigorev","doi":"10.1007/s00224-023-10159-3","DOIUrl":null,"url":null,"abstract":"<p>The rational index <span>\\(\\rho _L\\)</span> of a language <i>L</i> is an integer function, where <span>\\(\\rho _L(n)\\)</span> is the maximum length of the shortest string in <span>\\(L \\cap R\\)</span>, over all regular languages <i>R</i> recognized by <i>n</i>-state nondeterministic finite automata (NFA). This paper investigates the rational index of languages defined by grammars with bounded parse tree dimension: this is a numerical measure of the amount of branching in a tree (with trees in a linear grammar having dimension 1). For context-free grammars, a grammar with tree dimension bounded by <i>d</i> has rational index at most <span>\\(O(n^{2d})\\)</span>, and it is known from the literature that there exists a grammar with rational index <span>\\(\\Theta (n^{2d})\\)</span>. In this paper, it is shown that for multi-component grammars with at most <i>k</i> components (<i>k</i>-MCFG) and with a tree dimension bounded by <i>d</i>, the rational index is at most <span>\\(O(n^{2kd})\\)</span>, where the constant depends on the grammar, and there exists such a grammar with rational index <span>\\(\\frac{k}{2^{kd^2 - kd -2k -1} \\cdot (8k+1)^{2kd}} n^{2kd}\\)</span>. Also, for the case of ordinary context-free grammars, a more precise lower bound <span>\\(\\frac{1}{2^{d^2 + d - 3} 3^{2d}} n^{2d}\\)</span> is established.</p>","PeriodicalId":22832,"journal":{"name":"Theory of Computing Systems","volume":"10 1","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theory of Computing Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00224-023-10159-3","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
The rational index \(\rho _L\) of a language L is an integer function, where \(\rho _L(n)\) is the maximum length of the shortest string in \(L \cap R\), over all regular languages R recognized by n-state nondeterministic finite automata (NFA). This paper investigates the rational index of languages defined by grammars with bounded parse tree dimension: this is a numerical measure of the amount of branching in a tree (with trees in a linear grammar having dimension 1). For context-free grammars, a grammar with tree dimension bounded by d has rational index at most \(O(n^{2d})\), and it is known from the literature that there exists a grammar with rational index \(\Theta (n^{2d})\). In this paper, it is shown that for multi-component grammars with at most k components (k-MCFG) and with a tree dimension bounded by d, the rational index is at most \(O(n^{2kd})\), where the constant depends on the grammar, and there exists such a grammar with rational index \(\frac{k}{2^{kd^2 - kd -2k -1} \cdot (8k+1)^{2kd}} n^{2kd}\). Also, for the case of ordinary context-free grammars, a more precise lower bound \(\frac{1}{2^{d^2 + d - 3} 3^{2d}} n^{2d}\) is established.
期刊介绍:
TOCS is devoted to publishing original research from all areas of theoretical computer science, ranging from foundational areas such as computational complexity, to fundamental areas such as algorithms and data structures, to focused areas such as parallel and distributed algorithms and architectures.