使用前向稳定分区索引有限状态自动机

Ruben Becker, Sung-Hwan Kim, Nicola Prezza, Carlo Tosoni
{"title":"使用前向稳定分区索引有限状态自动机","authors":"Ruben Becker, Sung-Hwan Kim, Nicola Prezza, Carlo Tosoni","doi":"arxiv-2406.02763","DOIUrl":null,"url":null,"abstract":"An index on a finite-state automaton is a data structure able to locate\nspecific patterns on the automaton's paths and consequently on the regular\nlanguage accepted by the automaton itself. Cotumaccio and Prezza [SODA '21],\nintroduced a data structure able to solve pattern matching queries on automata,\ngeneralizing the famous FM-index for strings of Ferragina and Manzini [FOCS\n'00]. The efficiency of their index depends on the width of a particular\npartial order of the automaton's states, the smaller the width of the partial\norder, the faster is the index. However, computing the partial order of minimal\nwidth is NP-hard. This problem was mitigated by Cotumaccio [DCC '22], who\nrelaxed the conditions on the partial order, allowing it to be a partial\npreorder. This relaxation yields the existence of a unique partial preorder of\nminimal width that can be computed in polynomial time. In the paper at hand, we\npresent a new class of partial preorders and show that they have the following\nuseful properties: (i) they can be computed in polynomial time, (ii) their\nwidth is never larger than the width of Cotumaccio's preorders, and (iii) there\nexist infinite classes of automata on which the width of Cotumaccio's pre-order\nis linearly larger than the width of our preorder.","PeriodicalId":501124,"journal":{"name":"arXiv - CS - Formal Languages and Automata Theory","volume":"22 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Indexing Finite-State Automata Using Forward-Stable Partitions\",\"authors\":\"Ruben Becker, Sung-Hwan Kim, Nicola Prezza, Carlo Tosoni\",\"doi\":\"arxiv-2406.02763\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An index on a finite-state automaton is a data structure able to locate\\nspecific patterns on the automaton's paths and consequently on the regular\\nlanguage accepted by the automaton itself. Cotumaccio and Prezza [SODA '21],\\nintroduced a data structure able to solve pattern matching queries on automata,\\ngeneralizing the famous FM-index for strings of Ferragina and Manzini [FOCS\\n'00]. The efficiency of their index depends on the width of a particular\\npartial order of the automaton's states, the smaller the width of the partial\\norder, the faster is the index. However, computing the partial order of minimal\\nwidth is NP-hard. This problem was mitigated by Cotumaccio [DCC '22], who\\nrelaxed the conditions on the partial order, allowing it to be a partial\\npreorder. This relaxation yields the existence of a unique partial preorder of\\nminimal width that can be computed in polynomial time. In the paper at hand, we\\npresent a new class of partial preorders and show that they have the following\\nuseful properties: (i) they can be computed in polynomial time, (ii) their\\nwidth is never larger than the width of Cotumaccio's preorders, and (iii) there\\nexist infinite classes of automata on which the width of Cotumaccio's pre-order\\nis linearly larger than the width of our preorder.\",\"PeriodicalId\":501124,\"journal\":{\"name\":\"arXiv - CS - Formal Languages and Automata Theory\",\"volume\":\"22 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Formal Languages and Automata Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2406.02763\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Formal Languages and Automata Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.02763","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

有限状态自动机索引是一种数据结构,能够定位自动机路径上的特定模式,从而定位自动机本身所接受的正则表达式。Cotumaccio 和 Prezza [SODA '21] 提出了一种能解决自动机上模式匹配查询的数据结构,概括了 Ferragina 和 Manzini [FOCS'00] 著名的字符串 FM 索引。他们的索引的效率取决于自动机状态的特定部分阶的宽度,部分阶的宽度越小,索引的速度越快。然而,计算最小宽度的部分阶是 NP 难的。科图马乔(Cotumaccio)[DCC'22]缓解了这一问题,他放宽了部分阶的条件,使其成为部分前阶。这一松弛带来了一个宽度最小的唯一部分前序的存在,它可以在多项式时间内计算。在本文中,我们提出了一类新的部分预序,并证明它们具有以下有用的性质:(i) 它们可以在多项式时间内计算;(ii) 它们的宽度永远不会大于科图马乔预序的宽度;(iii) 存在无限类的自动机,在这些自动机上,科图马乔预序的宽度线性地大于我们预序的宽度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Indexing Finite-State Automata Using Forward-Stable Partitions
An index on a finite-state automaton is a data structure able to locate specific patterns on the automaton's paths and consequently on the regular language accepted by the automaton itself. Cotumaccio and Prezza [SODA '21], introduced a data structure able to solve pattern matching queries on automata, generalizing the famous FM-index for strings of Ferragina and Manzini [FOCS '00]. The efficiency of their index depends on the width of a particular partial order of the automaton's states, the smaller the width of the partial order, the faster is the index. However, computing the partial order of minimal width is NP-hard. This problem was mitigated by Cotumaccio [DCC '22], who relaxed the conditions on the partial order, allowing it to be a partial preorder. This relaxation yields the existence of a unique partial preorder of minimal width that can be computed in polynomial time. In the paper at hand, we present a new class of partial preorders and show that they have the following useful properties: (i) they can be computed in polynomial time, (ii) their width is never larger than the width of Cotumaccio's preorders, and (iii) there exist infinite classes of automata on which the width of Cotumaccio's pre-order is linearly larger than the width of our preorder.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Query Learning of Advice and Nominal Automata Well-Behaved (Co)algebraic Semantics of Regular Expressions in Dafny Run supports and initial algebra supports of weighted automata Alternating hierarchy of sushifts defined by nondeterministic plane-walking automata $\mathbb{N}$-polyregular functions arise from well-quasi-orderings
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1