
arXiv - CS - Formal Languages and Automata Theory: Latest Papers

Automata Extraction from Transformers
Pub Date : 2024-06-08 DOI: arxiv-2406.05564
Yihao Zhang, Zeming Wei, Meng Sun
In modern machine learning (ML) systems, Transformer-based architectures have achieved milestone success across a broad spectrum of tasks, yet understanding their operational mechanisms remains an open problem. To improve the transparency of ML systems, automata extraction methods, which interpret stateful ML models as automata typically through formal languages, have proven effective for explaining the mechanism of recurrent neural networks (RNNs). However, few works have applied this paradigm to Transformer models. In particular, understanding their processing of formal languages and identifying their limitations in this area remains unexplored. In this paper, we propose an automata extraction algorithm specifically designed for Transformer models. Treating the Transformer model as a black-box system, we track the model through the transformation process of its internal latent representations during its operation, and then use classical pedagogical approaches such as the L* algorithm to interpret them as deterministic finite-state automata (DFA). Overall, our study reveals how the Transformer model comprehends the structure of formal languages, which not only enhances the interpretability of Transformer-based ML systems but also marks a crucial step toward a deeper understanding of how ML systems process formal languages. Code and data are available at https://github.com/Zhang-Yihao/Transfomer2DFA.
Citations: 0
Indexing Finite-State Automata Using Forward-Stable Partitions
Pub Date : 2024-06-04 DOI: arxiv-2406.02763
Ruben Becker, Sung-Hwan Kim, Nicola Prezza, Carlo Tosoni
An index on a finite-state automaton is a data structure able to locate specific patterns on the automaton's paths and consequently on the regular language accepted by the automaton itself. Cotumaccio and Prezza [SODA '21] introduced a data structure able to solve pattern matching queries on automata, generalizing the famous FM-index for strings of Ferragina and Manzini [FOCS '00]. The efficiency of their index depends on the width of a particular partial order of the automaton's states: the smaller the width of the partial order, the faster the index. However, computing the partial order of minimal width is NP-hard. This problem was mitigated by Cotumaccio [DCC '22], who relaxed the conditions on the partial order, allowing it to be a partial preorder. This relaxation yields the existence of a unique partial preorder of minimal width that can be computed in polynomial time. In the paper at hand, we present a new class of partial preorders and show that they have the following useful properties: (i) they can be computed in polynomial time, (ii) their width is never larger than the width of Cotumaccio's preorders, and (iii) there exist infinite classes of automata on which the width of Cotumaccio's preorder is linearly larger than the width of our preorder.
Citations: 0
Separability in Büchi VASS and Singly Non-Linear Systems of Inequalities
Pub Date : 2024-06-03 DOI: arxiv-2406.01008
Pascal Baumann, Eren Keskin, Roland Meyer, Georg Zetzsche
The omega-regular separability problem for Büchi VASS coverability languages has recently been shown to be decidable, but with an EXPSPACE lower bound and a non-primitive recursive upper bound; the exact complexity remained open. We close this gap and show that the problem is EXPSPACE-complete. A careful analysis of our complexity bounds additionally yields a PSPACE procedure in the case of fixed dimension >= 1, which matches a pre-established lower bound of PSPACE for one-dimensional Büchi VASS. Our algorithm is a non-deterministic search for a witness whose size, as we show, can be suitably bounded. Part of the procedure is to decide the existence of runs in VASS that satisfy certain non-linear properties. Therefore, a key technical ingredient is to analyze a class of systems of inequalities where one variable may occur in non-linear (polynomial) expressions. These so-called singly non-linear systems (SNLS) take the form A(x).y >= b(x), where A(x) and b(x) are a matrix and a vector, respectively, whose entries are polynomials in x, and y ranges over vectors in the rationals. Our main contribution on SNLS is an exponential upper bound on the size of rational solutions to singly non-linear systems. The proof consists of three steps. First, we give a tailor-made quantifier elimination to characterize all real solutions to x. Second, using the root separation theorem about the distance of real roots of polynomials, we show that if a rational solution exists, then there is one with at most polynomially many bits. Third, we insert the solution for x into the SNLS, making it linear and allowing us to invoke standard solution bounds from convex geometry. Finally, we combine the results about SNLS with several techniques from the area of VASS to devise an EXPSPACE decision procedure for omega-regular separability of Büchi VASS.
Citations: 0
Linear equations and recursively enumerable sets
Pub Date : 2024-06-02 DOI: arxiv-2406.00688
Juha Honkala
We study connections between linear equations over various semigroups and recursively enumerable sets of positive integers. We give variants of the universal Diophantine representation of recursively enumerable sets of positive integers established by Matiyasevich. These variants use linear equations with one unknown instead of polynomial equations with several unknowns. As a corollary we get undecidability results for linear equations over morphism semigroups and over matrix semigroups.
Citations: 0
The generating power of weighted tree automata with initial algebra semantics
Pub Date : 2024-05-31 DOI: arxiv-2405.20753
Manfred Droste, Zoltán Fülöp, Andreja Tepavčević, Heiko Vogler
We consider the images of the initial algebra semantics of weighted tree automata over strong bimonoids (hence also over semirings). These images are subsets of the carrier set of the underlying strong bimonoid. We consider locally finite, weakly locally finite, and bi-locally finite strong bimonoids. We show that there exists a strong bimonoid which is weakly locally finite and not locally finite. We also show that if the ranked alphabet contains a binary symbol, then for any finitely generated strong bimonoid, weighted tree automata can generate, via their initial algebra semantics, all elements of the strong bimonoid. As a consequence of these results, for weakly locally finite strong bimonoids which are not locally finite, weighted tree automata can generate infinite images provided that the input ranked alphabet contains at least one binary symbol. This is in sharp contrast to the setting of weighted string automata, where each such image is known to be finite. As a further consequence, for any finitely generated semiring, there exists a weighted tree automaton which generates, via its run semantics, all elements of the semiring.
Citations: 0
Reward Machines for Deep RL in Noisy and Uncertain Environments
Pub Date : 2024-05-31 DOI: arxiv-2406.00120
Andrew C. Li, Zizhao Chen, Toryn Q. Klassen, Pashootan Vaezipoor, Rodrigo Toro Icarte, Sheila A. McIlraith
Reward Machines provide an automata-inspired structure for specifying instructions, safety constraints, and other temporally extended reward-worthy behaviour. By exposing complex reward function structure, they enable counterfactual learning updates that have resulted in impressive sample efficiency gains. While Reward Machines have been employed in both tabular and deep RL settings, they have typically relied on a ground-truth interpretation of the domain-specific vocabulary that forms the building blocks of the reward function. Such ground-truth interpretations can be elusive in many real-world settings, due in part to partial observability or noisy sensing. In this paper, we explore the use of Reward Machines for Deep RL in noisy and uncertain environments. We characterize this problem as a POMDP and propose a suite of RL algorithms that leverage task structure under uncertain interpretation of domain-specific vocabulary. Theoretical analysis exposes pitfalls in naive approaches to this problem, while experimental results show that our algorithms successfully leverage task structure to improve performance under noisy interpretations of the vocabulary. Our results provide a general framework for exploiting Reward Machines in partially observable environments.
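For readers unfamiliar with the structure the abstract refers to, a reward machine can be sketched as a finite automaton whose transitions fire on propositions and emit rewards. The sketch below is illustrative only and is not taken from the paper: the class name `RewardMachine`, the propositions `coffee` and `office`, and the state names are all hypothetical, and the labels fed to `run` are assumed to be ground truth, which is exactly the assumption the paper relaxes.

```python
class RewardMachine:
    """Minimal reward machine: (state, proposition) -> (next_state, reward)."""

    def __init__(self, initial, transitions):
        self.initial = initial
        self.transitions = transitions

    def step(self, state, true_props):
        # Fire the first matching transition (propositions checked in sorted
        # order for determinism); otherwise stay in place with zero reward.
        for prop in sorted(true_props):
            if (state, prop) in self.transitions:
                return self.transitions[(state, prop)]
        return state, 0.0

    def run(self, trace):
        # trace is a sequence of sets of propositions assumed to be true
        # at each time step (hypothetical ground-truth labels).
        state, total = self.initial, 0.0
        for props in trace:
            state, reward = self.step(state, props)
            total += reward
        return state, total


# Hypothetical task: fetch coffee first, then reach the office.
rm = RewardMachine(
    "u0",
    {("u0", "coffee"): ("u1", 0.0), ("u1", "office"): ("done", 1.0)},
)
print(rm.run([set(), {"coffee"}, set(), {"office"}]))
```

Reaching the office before fetching coffee leaves the machine in `u0` and yields no reward, which is how the machine encodes the temporal ordering of the task.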
Citations: 0
The CFG Complexity of Singleton Sets
Pub Date : 2024-05-30 DOI: arxiv-2405.20026
Lance Fortnow, William Gasarch
Let G be a context-free grammar (CFG) in Chomsky normal form. We take the number of rules in G to be the size of G. We also assume all CFGs are in Chomsky normal form. We consider the question of, given a string w of length n, what is the smallest CFG such that L(G)={w}? We show the following: 1) For all w, |w|=n, there is a CFG with O(n/log n) rules such that L(G)={w}. 2) There exists a string w, |w|=n, such that every CFG G with L(G)={w} is of size Omega(n/log n). We give two proofs of: one nonconstructive, the other constructive.
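To make the size measure concrete, the following sketch builds a Chomsky-normal-form grammar for a single string by splitting it in half recursively and reusing one nonterminal per distinct substring. This is a simple illustration, not the paper's construction, and it is not claimed to reach the O(n/log n) bound in general; the function names `singleton_cnf` and `expand` are hypothetical.

```python
def singleton_cnf(w):
    """Return (start, rules) for a CNF grammar whose language is {w}.

    rules maps each nonterminal to its single right-hand side: either a
    terminal symbol (str) or a pair of nonterminals (tuple).
    """
    if not w:
        raise ValueError("a CNF grammar cannot derive the empty string")
    names = {}  # substring -> nonterminal already created for it
    rules = {}

    def build(s):
        if s in names:              # reuse: repeated substrings cost nothing
            return names[s]
        name = f"N{len(names)}"
        names[s] = name
        if len(s) == 1:
            rules[name] = s         # rule of the form A -> a
        else:
            mid = len(s) // 2
            rules[name] = (build(s[:mid]), build(s[mid:]))  # A -> B C
        return name

    return build(w), rules


def expand(symbol, rules):
    """Derive the unique string generated from a nonterminal."""
    rhs = rules[symbol]
    if isinstance(rhs, str):
        return rhs
    return expand(rhs[0], rules) + expand(rhs[1], rules)


start, rules = singleton_cnf("a" * 16)
print(len(rules), expand(start, rules))
```

Because every nonterminal has exactly one rule, the grammar derives exactly one string. For the highly repetitive input above the grammar has 5 rules for a string of length 16, whereas strings with little repetition need far more, which is the gap the lower bound in the abstract quantifies.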
Citations: 0
DFAMiner: Mining minimal separating DFAs from labelled samples
Pub Date : 2024-05-29 DOI: arxiv-2405.18871
Daniele Dell'Erba, Yong Li, Sven Schewe
We propose DFAMiner, a passive learning tool for learning minimal separating deterministic finite automata (DFA) from a set of labelled samples. Separating automata are an interesting class of automata that occurs generally in regular model checking and has raised interest in foundational questions of parity game solving. We first propose a simple and linear-time algorithm that incrementally constructs a three-valued DFA (3DFA) from a set of labelled samples given in the usual lexicographical order. This 3DFA has accepting and rejecting states as well as don't-care states, so that it can exactly recognise the labelled examples. We then apply our tool to mining a minimal separating DFA for the labelled samples by minimising the constructed automata via a reduction to solving SAT problems. Empirical evaluation shows that our tool outperforms current state-of-the-art tools significantly on standard benchmarks for learning minimal separating DFAs from samples. Progress in the efficient construction of separating DFAs can also lead to finding the lower bound of parity game solving, where we show that DFAMiner can create optimal separating automata for simple languages with up to 7 colours. Future improvements might offer inroads to better data structures.
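The idea of a three-valued automaton with accepting, rejecting, and don't-care states can be illustrated with a plain prefix tree over the samples. This is only a sketch under that simplification: it is not DFAMiner's incremental algorithm, it does not minimise anything, and the names `build_3dfa` and `classify` are hypothetical.

```python
def build_3dfa(positives, negatives):
    """Build a prefix-tree automaton with labels '+', '-', or '?' per state.

    '+' marks accepting states, '-' rejecting states, '?' don't-care states.
    """
    trans = [{}]     # trans[state][symbol] -> successor state; state 0 is initial
    label = ["?"]

    def insert(word, mark):
        state = 0
        for ch in word:
            if ch not in trans[state]:
                trans.append({})
                label.append("?")
                trans[state][ch] = len(trans) - 1
            state = trans[state][ch]
        if label[state] not in ("?", mark):
            raise ValueError(f"conflicting labels for {word!r}")
        label[state] = mark

    for w in positives:
        insert(w, "+")
    for w in negatives:
        insert(w, "-")
    return trans, label


def classify(trans, label, word):
    """Return '+', '-', or '?' for a word; unseen continuations are '?'."""
    state = 0
    for ch in word:
        if ch not in trans[state]:
            return "?"
        state = trans[state][ch]
    return label[state]


trans, label = build_3dfa(positives=["ab", "b"], negatives=["a", ""])
print([classify(trans, label, w) for w in ["ab", "a", "", "ba"]])
```

Any DFA that accepts every '+' word and rejects every '-' word is a separating DFA for the sample; the don't-care states are where a minimiser is free to choose either answer.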
Citations: 0
Oblivious Monitoring for Discrete-Time STL via Fully Homomorphic Encryption
Pub Date : 2024-05-27 DOI: arxiv-2405.16767
Masaki Waga, Kotaro Matsuoka, Takashi Suwa, Naoki Matsumoto, Ryotaro Banno, Song Bian, Kohei Suenaga
When monitoring a cyber-physical system (CPS) from a remote server, keeping the monitored data secret is crucial, particularly when they contain sensitive information, e.g., biological or location data. Recently, Banno et al. (CAV '22) proposed a protocol for online LTL monitoring that keeps data concealed from the server using Fully Homomorphic Encryption (FHE). We build on this protocol to allow arithmetic operations over encrypted values, e.g., to compute a safety measurement combining distance, velocity, and so forth. Overall, our protocol enables oblivious online monitoring of discrete-time real-valued signals against signal temporal logic (STL) formulas. Our protocol combines two FHE schemes, CKKS and TFHE, leveraging their respective strengths. We employ CKKS to evaluate arithmetic predicates in STL formulas while utilizing TFHE to process them using a DFA derived from the STL formula. We conducted case studies on monitoring blood glucose levels and vehicles' behavior against the Responsibility-Sensitive Safety (RSS) rules. Our results suggest the practical relevance of our protocol.
Citations: 0
The Expressive Capacity of State Space Models: A Formal Language Perspective
Pub Date : 2024-05-27 DOI: arxiv-2405.17394
Yash Sarrof, Yana Veitsman, Michael Hahn
Recently, recurrent models based on linear state space models (SSMs) have shown promising performance in language modeling (LM), competitive with transformers. However, there is little understanding of the in-principle abilities of such models, which could provide useful guidance to the search for better LM architectures. We present a comprehensive theoretical study of the capacity of such SSMs as it compares to that of transformers and traditional RNNs. We find that SSMs and transformers have overlapping but distinct strengths. In star-free state tracking, SSMs implement straightforward and exact solutions to problems that transformers struggle to represent exactly. They can also model bounded hierarchical structure with optimal memory even without simulating a stack. On the other hand, we identify a design choice in current SSMs that limits their expressive power. We discuss implications for SSM and LM research, and verify results empirically on a recent SSM, Mamba.
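As a small illustration of what exact state tracking with a linear recurrence can look like, the sketch below runs a one-dimensional, input-gated update h_t = a(x_t) * h_{t-1} + b(x_t) on a flip-flop task (set, reset, keep), a standard example of star-free state tracking. It is not taken from the paper and is not Mamba; the gate table, symbol names, and function name are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical per-symbol gates (a, b) for the update h_t = a * h_{t-1} + b.
GATES: Dict[str, Tuple[float, float]] = {
    "set": (0.0, 1.0),    # forget the old state, write 1
    "reset": (0.0, 0.0),  # forget the old state, write 0
    "keep": (1.0, 0.0),   # carry the old state forward unchanged
}


def run_ssm(symbols: List[str], h0: float = 0.0) -> List[float]:
    """Apply the gated linear recurrence and return the state after each symbol."""
    h = h0
    trace = []
    for symbol in symbols:
        a, b = GATES[symbol]
        h = a * h + b  # linear in h; the gates depend only on the input symbol
        trace.append(h)
    return trace


if __name__ == "__main__":
    print(run_ssm(["set", "keep", "keep", "reset", "keep", "set"]))
```

The state here is always exactly 0.0 or 1.0, so reading the stored bit needs no approximation, which is the sense in which this toy solution is exact.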
Citations: 0