Ekaterina Shemetova, Alexander Okhotin, Semyon Grigorev
{"title":"解析树维度有界的语法所定义语言的有理索引","authors":"Ekaterina Shemetova, Alexander Okhotin, Semyon Grigorev","doi":"10.1007/s00224-023-10159-3","DOIUrl":null,"url":null,"abstract":"<p>The rational index <span>\\(\\rho _L\\)</span> of a language <i>L</i> is an integer function, where <span>\\(\\rho _L(n)\\)</span> is the maximum length of the shortest string in <span>\\(L \\cap R\\)</span>, over all regular languages <i>R</i> recognized by <i>n</i>-state nondeterministic finite automata (NFA). This paper investigates the rational index of languages defined by grammars with bounded parse tree dimension: this is a numerical measure of the amount of branching in a tree (with trees in a linear grammar having dimension 1). For context-free grammars, a grammar with tree dimension bounded by <i>d</i> has rational index at most <span>\\(O(n^{2d})\\)</span>, and it is known from the literature that there exists a grammar with rational index <span>\\(\\Theta (n^{2d})\\)</span>. In this paper, it is shown that for multi-component grammars with at most <i>k</i> components (<i>k</i>-MCFG) and with a tree dimension bounded by <i>d</i>, the rational index is at most <span>\\(O(n^{2kd})\\)</span>, where the constant depends on the grammar, and there exists such a grammar with rational index <span>\\(\\frac{k}{2^{kd^2 - kd -2k -1} \\cdot (8k+1)^{2kd}} n^{2kd}\\)</span>. Also, for the case of ordinary context-free grammars, a more precise lower bound <span>\\(\\frac{1}{2^{d^2 + d - 3} 3^{2d}} n^{2d}\\)</span> is established.</p>","PeriodicalId":22832,"journal":{"name":"Theory of Computing Systems","volume":null,"pages":null},"PeriodicalIF":0.6000,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Rational Index of Languages Defined by Grammars with Bounded Dimension of Parse Trees\",\"authors\":\"Ekaterina Shemetova, Alexander Okhotin, Semyon Grigorev\",\"doi\":\"10.1007/s00224-023-10159-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The rational index <span>\\\\(\\\\rho _L\\\\)</span> of a language <i>L</i> is an integer function, where <span>\\\\(\\\\rho _L(n)\\\\)</span> is the maximum length of the shortest string in <span>\\\\(L \\\\cap R\\\\)</span>, over all regular languages <i>R</i> recognized by <i>n</i>-state nondeterministic finite automata (NFA). This paper investigates the rational index of languages defined by grammars with bounded parse tree dimension: this is a numerical measure of the amount of branching in a tree (with trees in a linear grammar having dimension 1). For context-free grammars, a grammar with tree dimension bounded by <i>d</i> has rational index at most <span>\\\\(O(n^{2d})\\\\)</span>, and it is known from the literature that there exists a grammar with rational index <span>\\\\(\\\\Theta (n^{2d})\\\\)</span>. In this paper, it is shown that for multi-component grammars with at most <i>k</i> components (<i>k</i>-MCFG) and with a tree dimension bounded by <i>d</i>, the rational index is at most <span>\\\\(O(n^{2kd})\\\\)</span>, where the constant depends on the grammar, and there exists such a grammar with rational index <span>\\\\(\\\\frac{k}{2^{kd^2 - kd -2k -1} \\\\cdot (8k+1)^{2kd}} n^{2kd}\\\\)</span>. Also, for the case of ordinary context-free grammars, a more precise lower bound <span>\\\\(\\\\frac{1}{2^{d^2 + d - 3} 3^{2d}} n^{2d}\\\\)</span> is established.</p>\",\"PeriodicalId\":22832,\"journal\":{\"name\":\"Theory of Computing Systems\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.6000,\"publicationDate\":\"2023-12-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Theory of Computing Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s00224-023-10159-3\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theory of Computing Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00224-023-10159-3","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
摘要
语言 L 的有理指数 (\(\rho _L\))是一个整数函数,其中 \(\rho _L(n)\) 是在 n 状态非确定有限自动机(NFA)识别的所有规则语言 R 中,\(L \cap R\) 中最短字符串的最大长度。本文研究了由解析树维度有界的语法定义的语言的理性指数:这是树中分支量的数字度量(线性语法中的树维度为 1)。对于无上下文语法来说,树维度以 d 为界的语法的有理指数最多为 \(O(n^{2d})\),文献中已知存在一种有理指数为 \(\Theta (n^{2d})\)的语法。本文证明,对于最多有 k 个成分(k-MCFG)且树维度以 d 为界的多成分语法,合理指数最多为 (O(n^{2kd})\)、存在这样一种语法,其合理指数为 \(\frac{k}{2^{kd^2 - kd -2k -1} \cdot (8k+1)^{2kd}} n^{2kd}\).此外,对于普通无上下文语法,还建立了一个更精确的下界 \(\frac{1}{2^{d^2 + d - 3} 3^{2d}} n^{2d}\).
Rational Index of Languages Defined by Grammars with Bounded Dimension of Parse Trees
The rational index \(\rho _L\) of a language L is an integer function, where \(\rho _L(n)\) is the maximum length of the shortest string in \(L \cap R\), over all regular languages R recognized by n-state nondeterministic finite automata (NFA). This paper investigates the rational index of languages defined by grammars with bounded parse tree dimension: this is a numerical measure of the amount of branching in a tree (with trees in a linear grammar having dimension 1). For context-free grammars, a grammar with tree dimension bounded by d has rational index at most \(O(n^{2d})\), and it is known from the literature that there exists a grammar with rational index \(\Theta (n^{2d})\). In this paper, it is shown that for multi-component grammars with at most k components (k-MCFG) and with a tree dimension bounded by d, the rational index is at most \(O(n^{2kd})\), where the constant depends on the grammar, and there exists such a grammar with rational index \(\frac{k}{2^{kd^2 - kd -2k -1} \cdot (8k+1)^{2kd}} n^{2kd}\). Also, for the case of ordinary context-free grammars, a more precise lower bound \(\frac{1}{2^{d^2 + d - 3} 3^{2d}} n^{2d}\) is established.
期刊介绍:
TOCS is devoted to publishing original research from all areas of theoretical computer science, ranging from foundational areas such as computational complexity, to fundamental areas such as algorithms and data structures, to focused areas such as parallel and distributed algorithms and architectures.