The Average Profile of Suffix Trees

Mark Daniel Ward
{"title":"The Average Profile of Suffix Trees","authors":"Mark Daniel Ward","doi":"10.1137/1.9781611972979.3","DOIUrl":null,"url":null,"abstract":"The internal profile of a tree structure denotes the number of internal nodes found at a specific level of the tree. Similarly, the external profile denotes the number of leaves on a level. The profile is of great interest because of its intimate connection to many other parameters of trees. For instance, the depth, fill-up level, height, path length, shortest path, and size of trees can each be interpreted in terms of the profile. \n \nThe current study is motivated by the work of Park et al. [22], which was a comprehensive study of the profile of tries constructed from independent strings (also, each string generated by a memoryless source). In the present paper, however, we consider suffix trees, which are constructed from suffixes of a common string. The dependency between suffixes demands a careful, intricate treatment of overlaps in words. \n \nWe precisely analyze the average internal and external profiles of suffix trees generated by a memoryless source. We utilize combinatorics on words (in particular, autocorrelation, i.e., the degree to which a word overlaps with itself) generating functions, singularity analysis, and the Mellin transform. We make comparisons of the average profile of suffix trees to the average profile of tries constructed from independent strings. We emphasize that our methods are extensible to higher moments. The present report describes the first moment of both the internal and external profiles of suffix trees.","PeriodicalId":340112,"journal":{"name":"Workshop on Analytic Algorithmics and Combinatorics","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Workshop on Analytic Algorithmics and Combinatorics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1137/1.9781611972979.3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

The internal profile of a tree structure denotes the number of internal nodes found at a specific level of the tree. Similarly, the external profile denotes the number of leaves on a level. The profile is of great interest because of its intimate connection to many other parameters of trees. For instance, the depth, fill-up level, height, path length, shortest path, and size of trees can each be interpreted in terms of the profile. The current study is motivated by the work of Park et al. [22], which was a comprehensive study of the profile of tries constructed from independent strings (also, each string generated by a memoryless source). In the present paper, however, we consider suffix trees, which are constructed from suffixes of a common string. The dependency between suffixes demands a careful, intricate treatment of overlaps in words. We precisely analyze the average internal and external profiles of suffix trees generated by a memoryless source. We utilize combinatorics on words (in particular, autocorrelation, i.e., the degree to which a word overlaps with itself) generating functions, singularity analysis, and the Mellin transform. We make comparisons of the average profile of suffix trees to the average profile of tries constructed from independent strings. We emphasize that our methods are extensible to higher moments. The present report describes the first moment of both the internal and external profiles of suffix trees.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
后缀树的平均轮廓
树结构的内部轮廓表示在树的特定级别上发现的内部节点的数量。类似地,外部配置文件表示一个级别上的叶子数量。由于它与树木的许多其他参数密切相关,因此剖面非常有趣。例如,树的深度、填充水平、高度、路径长度、最短路径和大小都可以根据剖面进行解释。目前的研究受到Park等人[22]工作的启发,该工作是对由独立字符串(也由无记忆源生成的每个字符串)构建的尝试概况的全面研究。然而,在本文中,我们考虑后缀树,它是由一个公共字符串的后缀构成的。后缀之间的依赖关系要求对单词中的重叠部分进行仔细、复杂的处理。我们精确地分析了由无内存源生成的后缀树的平均内部和外部轮廓。我们在单词上使用组合学(特别是自相关,即单词与自身重叠的程度)生成函数,奇点分析和Mellin变换。我们将后缀树的平均轮廓与由独立字符串构造的尝试的平均轮廓进行了比较。我们强调我们的方法可以扩展到更高的矩。本报告描述了后缀树的内部和外部轮廓的第一个时刻。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Protection Number of Recursive Trees Ranked Schröder Trees QuickSort: Improved right-tail asymptotics for the limiting distribution, and large deviations (Extended Abstract) Subcritical random hypergraphs, high-order components, and hypertrees Esthetic Numbers and Lifting Restrictions on the Analysis of Summatory Functions of Regular Sequences
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1